AP Statistics : AP Statistics

Study concepts, example questions & explanations for AP Statistics

varsity tutors app store varsity tutors android store

Example Questions

Example Question #1 : How To Find Correlation

Which of the following shows the least correlation between two variables? 

Possible Answers:

\displaystyle -0.99

\displaystyle 0.25

\displaystyle 0.99

\displaystyle -0.25

\displaystyle 0

Correct answer:

\displaystyle 0

Explanation:

The strength of correlation is measured on an absolute value scale of \displaystyle 0 to \displaystyle 1.0 with \displaystyle 0 being the least correlated and \displaystyle 1.0 being the most correlated. The positive or negative in front of the correlation integer simply determines whether or not there is a positive or negative correlation between the variables. 

A correlation of \displaystyle 0 means that there is no correlation at all between two variables.

Example Question #2 : Bivariate Data

Which of the following correlation coefficients implies the strongest relationship between variables:

\displaystyle 0.1

 

\displaystyle -0.2

 

\displaystyle 0.8

 

\displaystyle 0.4

 

\displaystyle -0.9

Possible Answers:

\displaystyle -0.2

\displaystyle -0.9

\displaystyle 0.8

\displaystyle 0.4

\displaystyle 0.1

Correct answer:

\displaystyle -0.9

Explanation:

A high correlation coefficient regardless of sign implies a stronger relationship. In this case \displaystyle -0.9 has a stronger negative relationship than the positive relationship described by a value of \displaystyle 0.8

Example Question #1 : Bivariate Data

A national study on cell phone use found the following correlations:

-The correlation between the number of texts sent each day and a person's average credit card debt is \displaystyle .35.

-The correlation between the number of texts sent each day and the number of books read each month is \displaystyle -.20.

Which of the following statements are true?

i. As the number of texts sent each day increases, average credit card debt increases.

ii. Sending more texts causes people to read less.

iii. A person's average credit card debt is related more strongly to the number of texts sent each day than the number of books read each month is related to the number of texts sent each day.

 

Possible Answers:

iii

i and iii

i and ii

ii and iii

ii

Correct answer:

i and iii

Explanation:

i is correct because there is a positive correlation between the number of texts sent each day and average credit card debt. 

ii is incorrect because the word "cause" was used in the statement. Correlation does not mean causation. There is a relationship between the number of texts sent each day and the number of books that a person reads each month. However, the number of texts sent each day does not cause a person to read a certain number of books each month. 

iii is correct because the absolute values of the correlations indicate which correlation is stronger. \displaystyle .35 is a stronger correlation than \displaystyle -.2.

Example Question #261 : Ap Statistics

A basketball coach wants to determine if a player's height can predict the number of points the player scores in a season. Which statistical test should the coach conduct?

Possible Answers:

Linear regression

T-test

P-score

ANOVA

Correlation

Correct answer:

Linear regression

Explanation:

Linear regression is the best option for determining whether the value of one variable predicts the value of a second variable.  Since that is exactly what the coach is trying to do, he should use linear regression.

Example Question #1 : How To Find The Least Squares Regression Line

In a regression analysis, the y-variable should be the ___________ variable, and the x-variable should be the ___________ variable.

Possible Answers:

Independent, Dependent

Dependent, Independent

Greater, Lesser

First, Second

Qualified, Unqualified

Correct answer:

Dependent, Independent

Explanation:

Regression tests seek to determine one variable's ability to predict another variable.  In this analysis, one variable is dependent (the one predicted), and the other is independent (the variable that predicts).  Therefore, the dependent variable is the y-variable and the independent variable is the x-variable.

Example Question #2 : Bivariate Data

If a data set has a perfect negative linear correlation, has a slope of \displaystyle -7 and an explanatory variable standard deviation of \displaystyle 2, what is the standard deviation of the response variable?

Possible Answers:

\displaystyle 0

\displaystyle 2

\displaystyle -14

\displaystyle 7

\displaystyle 14

Correct answer:

\displaystyle 14

Explanation:

The key here is to utilize

\displaystyle b_{1} = r\frac{s_{y}}{s_{x}}.

"Perfect negative linear correlation" means \displaystyle r = -1, while the rest of the problem indicates \displaystyle b_{1} = -7 and \displaystyle s_{x} = 2. This enables us to solve for \displaystyle s_{y}.

\displaystyle b_{1} = r\frac{s_{y}}{s_{x}}
\displaystyle \frac{b_{1}}{r} = \frac{s_{y}}{s_{x}}
\displaystyle s_{y} = \frac{b_{1}}{r}{s_{x}} = \frac{-7}{-1}2 = 7\cdot 2 = 14

Example Question #3 : How To Find The Least Squares Regression Line

A least-squares regression line has equation \displaystyle y = 0.64x + 8 and a correlation of \displaystyle r = 0.8. It is also known that \displaystyle s_x = 5. What is \displaystyle s_y?

Possible Answers:

\displaystyle 4

\displaystyle 15

\displaystyle 16

\displaystyle 14

\displaystyle 5

Correct answer:

\displaystyle 4

Explanation:

Use the formula \displaystyle \beta _1 = r\frac{s_y}{s_x}.

Plug in the given values for \displaystyle r and \displaystyle s_x and this becomes an algebra problem.

\displaystyle 0.64 = 0.8\frac{s_y}{5}

\displaystyle 0.8 = \frac{s_y}{5}

\displaystyle s_y = 0.8\cdot 5 = 4

 

Example Question #261 : Ap Statistics

Use the following five number summary to determine if there are any outliers in the data set:

Minimum: \displaystyle 3

Q1: \displaystyle 6

Median: \displaystyle 9

Q3: \displaystyle 12

Maximum: \displaystyle 20

Possible Answers:

There is at least one outlier on the high end of the distribution and at least one outlier on the low end of the distribution.

There is at least one outlier on the high end of the distribution and no outliers on the low end of the distribution.

There are no outliers.

There is at least one outlier on the low end of the distribution and no outliers on the high end of the distribution.

It is not possible to determine if there are outliers based on the information given.

Correct answer:

There are no outliers.

Explanation:

An observation is an outlier if it falls more than \displaystyle 1.5(IQR) above the upper quartile or more than \displaystyle 1.5(IQR) below the lower quartile.

\displaystyle IQR= Q3-Q1= 12-6=6

\displaystyle 1.5(IQR)= 11..5(6)= 9

\displaystyle Q1-9= 6-9= -3. The minimum value is \displaystyle 3 so there are no outliers in the low end of the distribution.

\displaystyle Q3+9= 12+9= 21. The maximum value is \displaystyle 20 so there are no outliers in the high end of the distribution.

Example Question #261 : Ap Statistics

For a data set, the first quartile is \displaystyle 40, the third quartile is \displaystyle 70 and the median is \displaystyle 55.

Based on this information, a new observation can be considered an outlier if it is greater than what?

Possible Answers:

\displaystyle 110

\displaystyle 100

\displaystyle 120

\displaystyle 95

\displaystyle 115

Correct answer:

\displaystyle 115

Explanation:

Use the \displaystyle 1.5 \cdot IQR criteria:

This states that anything less than \displaystyle Q1-1.5IQR or greater than \displaystyle Q3+1.5IQR will be an outlier.

Thus, we want to find

\displaystyle Q3+1.5IQR  where \displaystyle IQR =Q3-Q1.

\displaystyle 1.5 \cdot (70 - 40) =1.5\cdot 30=45

\displaystyle Q_3 + (1.5\cdot IQR) = 70 + 45 = 115

Therefore, any new observation greater than 115 can be considered an outlier.

Example Question #61 : Data

You are given the following information regarding a particular data set:

Q1: \displaystyle 45

Q3: \displaystyle 86

Assume that the numbers \displaystyle 23, 56, 73, 150, 2, 56, 145, and \displaystyle 200 are in the data set. How many of these numbers are outliers? 

Possible Answers:

One

Four

Three

None of the numbers are outliers

Two

Correct answer:

Two

Explanation:

In order to find the outliers, we can use the \displaystyle Q1-1.5*IQR and \displaystyle Q3+1.5IQR formulas.

\displaystyle IQR = Q3-Q1 = 86-45 = 41

\displaystyle 1.5*IQR = 61.5

\displaystyle Q1-1.5*IQR = 45-61.5 = -16.5

\displaystyle Q3+1.5*IQR = 86+61.5 = 147.5

Only two numbers are outside of the calculated range and therefore are outliers: \displaystyle 150 and \displaystyle 200.

Learning Tools by Varsity Tutors