For Task 1:
A
Key Concept
Explanation
Least squares regression is used to fit a linear model by minimizing the sum of squares of the residuals.
For Task 2:
B
Key Concept
Explanation
Model selection using AIC and BIC helps in choosing a model that balances goodness of fit and model complexity.
For Task 3:
C
Key Concept
Predictive Model Selection
Explanation
The best predictive model is often the one with a good balance between accuracy and simplicity, avoiding overfitting.
For Task 4:
D
Key Concept
Explanation
Lasso regression is a type of linear regression that includes a penalty term to shrink coefficients towards zero to prevent overfitting.
For Task 5:
E
Key Concept
Explanation
Cross-validation is used to estimate the performance of the model on unseen data and to tune hyperparameters such as lambda in lasso regression.
For Task 6:
F
Key Concept
Variable Selection Comparison
Explanation
Comparing variable selection methods can reveal differences in model complexity and variable importance.
For Task 7:
G
Key Concept
Model Performance Evaluation
Explanation
The sum of squared errors (SSE) is a measure of model accuracy; the model with the lower SSE on test data is generally preferred.