Least Squares Regression - AP Statistics
Card 1 of 30
What is the formula for the least squares regression line?
What is the formula for the least squares regression line?
Tap to reveal answer
$y = a + bx$. Standard form where $a$ is intercept and $b$ is slope.
$y = a + bx$. Standard form where $a$ is intercept and $b$ is slope.
← Didn't Know|Knew It →
What does it mean if $R^2 = 1$ in regression analysis?
What does it mean if $R^2 = 1$ in regression analysis?
Tap to reveal answer
Perfect explanatory power in the model. Model explains all variation in $y$ perfectly.
Perfect explanatory power in the model. Model explains all variation in $y$ perfectly.
← Didn't Know|Knew It →
Identify the formula for calculating the correlation coefficient $r$.
Identify the formula for calculating the correlation coefficient $r$.
Tap to reveal answer
$r = \frac{\sum (x_i - \bar{x})(y_i - \bar{y})}{\sqrt{\sum (x_i - \bar{x})^2 \sum (y_i - \bar{y})^2}}$. Standardized covariance measuring linear association strength.
$r = \frac{\sum (x_i - \bar{x})(y_i - \bar{y})}{\sqrt{\sum (x_i - \bar{x})^2 \sum (y_i - \bar{y})^2}}$. Standardized covariance measuring linear association strength.
← Didn't Know|Knew It →
What is the effect of an outlier on a regression line?
What is the effect of an outlier on a regression line?
Tap to reveal answer
Can significantly alter slope and intercept. Outliers pull the line toward themselves disproportionately.
Can significantly alter slope and intercept. Outliers pull the line toward themselves disproportionately.
← Didn't Know|Knew It →
Which type of plot is used to assess residuals in regression?
Which type of plot is used to assess residuals in regression?
Tap to reveal answer
Residual plot. Plots residuals vs. fitted values or explanatory variable.
Residual plot. Plots residuals vs. fitted values or explanatory variable.
← Didn't Know|Knew It →
Find the intercept $a$ given $\bar{y} = 10$, $b = 2$, and $\bar{x} = 3$.
Find the intercept $a$ given $\bar{y} = 10$, $b = 2$, and $\bar{x} = 3$.
Tap to reveal answer
$a = 4$. Using formula $a = 10 - (2)(3) = 4$.
$a = 4$. Using formula $a = 10 - (2)(3) = 4$.
← Didn't Know|Knew It →
What does the term 'least squares' refer to in regression?
What does the term 'least squares' refer to in regression?
Tap to reveal answer
Minimizing the sum of squared residuals. Method minimizes sum of squared prediction errors.
Minimizing the sum of squared residuals. Method minimizes sum of squared prediction errors.
← Didn't Know|Knew It →
What is the effect of a high leverage point in regression?
What is the effect of a high leverage point in regression?
Tap to reveal answer
Can disproportionately influence the regression line. Points far from $\bar{x}$ have greater impact.
Can disproportionately influence the regression line. Points far from $\bar{x}$ have greater impact.
← Didn't Know|Knew It →
What condition does the normal probability plot of residuals check?
What condition does the normal probability plot of residuals check?
Tap to reveal answer
Normality of residuals. Tests if residuals follow normal distribution.
Normality of residuals. Tests if residuals follow normal distribution.
← Didn't Know|Knew It →
Identify the formula for the total sum of squares $SS_{total}$.
Identify the formula for the total sum of squares $SS_{total}$.
Tap to reveal answer
$SS_{total} = \sum (y_i - \bar{y})^2$. Total variation in $y$ around its mean.
$SS_{total} = \sum (y_i - \bar{y})^2$. Total variation in $y$ around its mean.
← Didn't Know|Knew It →
Identify the formula for the regression sum of squares $SS_{reg}$.
Identify the formula for the regression sum of squares $SS_{reg}$.
Tap to reveal answer
$SS_{reg} = \sum (\hat{y}_i - \bar{y})^2$. Variation in predicted values around mean of $y$.
$SS_{reg} = \sum (\hat{y}_i - \bar{y})^2$. Variation in predicted values around mean of $y$.
← Didn't Know|Knew It →
Identify the formula for the residual sum of squares $SS_{res}$.
Identify the formula for the residual sum of squares $SS_{res}$.
Tap to reveal answer
$SS_{res} = \sum (y_i - \hat{y}_i)^2$. Variation not explained by the regression model.
$SS_{res} = \sum (y_i - \hat{y}_i)^2$. Variation not explained by the regression model.
← Didn't Know|Knew It →
What is the condition of homoscedasticity in regression?
What is the condition of homoscedasticity in regression?
Tap to reveal answer
Constant variance of residuals. Residual spread remains constant across all fitted values.
Constant variance of residuals. Residual spread remains constant across all fitted values.
← Didn't Know|Knew It →
What is the primary assumption about errors in least squares regression?
What is the primary assumption about errors in least squares regression?
Tap to reveal answer
Errors are normally distributed. Required for valid inference and prediction intervals.
Errors are normally distributed. Required for valid inference and prediction intervals.
← Didn't Know|Knew It →
What is the effect of heteroscedasticity in regression?
What is the effect of heteroscedasticity in regression?
Tap to reveal answer
Non-constant variance of residuals. Residual spread changes systematically with fitted values.
Non-constant variance of residuals. Residual spread changes systematically with fitted values.
← Didn't Know|Knew It →
What does the term 'extrapolation' refer to in regression?
What does the term 'extrapolation' refer to in regression?
Tap to reveal answer
Predicting outside the range of the data. Risky because relationships may change outside observed range.
Predicting outside the range of the data. Risky because relationships may change outside observed range.
← Didn't Know|Knew It →
How do you interpret an $R^2$ value of 0.85?
How do you interpret an $R^2$ value of 0.85?
Tap to reveal answer
85% of variance is explained by the model. Strong model fit with good predictive power.
85% of variance is explained by the model. Strong model fit with good predictive power.
← Didn't Know|Knew It →
In least squares regression, what does $b$ represent?
In least squares regression, what does $b$ represent?
Tap to reveal answer
Slope of the regression line. Rate of change in $y$ per unit change in $x$.
Slope of the regression line. Rate of change in $y$ per unit change in $x$.
← Didn't Know|Knew It →
In least squares regression, what does $a$ represent?
In least squares regression, what does $a$ represent?
Tap to reveal answer
Y-intercept of the regression line. Value of $y$ when $x = 0$.
Y-intercept of the regression line. Value of $y$ when $x = 0$.
← Didn't Know|Knew It →
What does it mean if $R^2 = 0$ in regression analysis?
What does it mean if $R^2 = 0$ in regression analysis?
Tap to reveal answer
No explanatory power in the model. Model explains none of the variation in $y$.
No explanatory power in the model. Model explains none of the variation in $y$.
← Didn't Know|Knew It →
Which term describes the strength and direction of a linear relationship?
Which term describes the strength and direction of a linear relationship?
Tap to reveal answer
Correlation coefficient. Ranges from -1 to +1, denoted by $r$.
Correlation coefficient. Ranges from -1 to +1, denoted by $r$.
← Didn't Know|Knew It →
Find the slope of the regression line given $\sum (x_i - \bar{x})(y_i - \bar{y}) = 10$ and $\sum (x_i - \bar{x})^2 = 5$.
Find the slope of the regression line given $\sum (x_i - \bar{x})(y_i - \bar{y}) = 10$ and $\sum (x_i - \bar{x})^2 = 5$.
Tap to reveal answer
$b = 2$. Using formula $b = \frac{10}{5} = 2$.
$b = 2$. Using formula $b = \frac{10}{5} = 2$.
← Didn't Know|Knew It →
Identify the formula for calculating the intercept $a$ in regression.
Identify the formula for calculating the intercept $a$ in regression.
Tap to reveal answer
$a = \bar{y} - b\bar{x}$. Ensures line passes through point $(\bar{x}, \bar{y})$.
$a = \bar{y} - b\bar{x}$. Ensures line passes through point $(\bar{x}, \bar{y})$.
← Didn't Know|Knew It →
How do you interpret a residual plot that shows a pattern?
How do you interpret a residual plot that shows a pattern?
Tap to reveal answer
Model may not be appropriate. Patterns suggest violations of linearity assumption.
Model may not be appropriate. Patterns suggest violations of linearity assumption.
← Didn't Know|Knew It →
What does $R^2$ represent in regression analysis?
What does $R^2$ represent in regression analysis?
Tap to reveal answer
Proportion of variance explained by the model. Coefficient of determination; ranges from 0 to 1.
Proportion of variance explained by the model. Coefficient of determination; ranges from 0 to 1.
← Didn't Know|Knew It →
What does a positive slope indicate in a regression line?
What does a positive slope indicate in a regression line?
Tap to reveal answer
Positive relationship between variables. As $x$ increases, $y$ tends to increase.
Positive relationship between variables. As $x$ increases, $y$ tends to increase.
← Didn't Know|Knew It →
What does a negative slope indicate in a regression line?
What does a negative slope indicate in a regression line?
Tap to reveal answer
Negative relationship between variables. As $x$ increases, $y$ tends to decrease.
Negative relationship between variables. As $x$ increases, $y$ tends to decrease.
← Didn't Know|Knew It →
What is the objective of least squares regression?
What is the objective of least squares regression?
Tap to reveal answer
Minimize the sum of squared residuals. Finds line that best fits data by minimizing prediction errors.
Minimize the sum of squared residuals. Finds line that best fits data by minimizing prediction errors.
← Didn't Know|Knew It →
State the condition for using least squares regression.
State the condition for using least squares regression.
Tap to reveal answer
Linear relationship between variables. Assumes straight-line relationship exists.
Linear relationship between variables. Assumes straight-line relationship exists.
← Didn't Know|Knew It →
Identify the formula for calculating the slope $b$ in regression.
Identify the formula for calculating the slope $b$ in regression.
Tap to reveal answer
$b = \frac{\sum (x_i - \bar{x})(y_i - \bar{y})}{\sum (x_i - \bar{x})^2}$. Covariance divided by variance of $x$.
$b = \frac{\sum (x_i - \bar{x})(y_i - \bar{y})}{\sum (x_i - \bar{x})^2}$. Covariance divided by variance of $x$.
← Didn't Know|Knew It →