Home

Tutoring

Subjects

Live Classes

Study Coach

Essay Review

On-Demand Courses

Colleges

Games

Opening subject page...

Loading your content

  1. AP Statistics
  2. Introduction to Planning a Study

AP STATISTICS • COLLECTING DATA

Introduction to Planning a Study

Understanding how study design determines the validity and scope of statistical conclusions.

SECTION 1

Historical Context & Motivation

The need for rigorous study design grew out of centuries of scientific inquiry in which poorly collected data led to misleading or outright incorrect conclusions. In the early history of medicine, for example, physicians routinely prescribed treatments based on anecdotal observations and personal conviction rather than systematic evidence. Agriculture faced similar challenges: farmers debated the merits of different fertilizers without any controlled method for determining which truly improved crop yields. The formalization of statistical study planning emerged when researchers recognized that how data are collected is just as important as how data are analyzed. This section traces the key milestones that shaped the modern approach to planning studies, from early census efforts to the randomized controlled trial.

1747
Lind's Scurvy Trial
James Lind conducted one of the first recorded controlled experiments by comparing six treatments for scurvy among sailors aboard HMS Salisbury. Although his sample was small, the deliberate assignment of treatments to groups foreshadowed modern experimental design.
1920s
Fisher's Agricultural Experiments
Sir Ronald A. Fisher, working at Rothamsted Experimental Station, developed the principles of randomization, replication, and blocking. His 1935 book, The Design of Experiments, established the theoretical foundation for statistical study planning.
1948
Streptomycin RCT
The British Medical Research Council conducted the first major randomized controlled trial (RCT) to evaluate streptomycin for treating tuberculosis. Random assignment of patients to treatment and control groups set the gold standard for clinical research.
1970s–Present
Expansion Across Disciplines
Statistical study design expanded beyond medicine and agriculture into psychology, education, economics, and public policy. The AP Statistics curriculum, introduced in 1997, placed data collection alongside inference as a core pillar, reflecting the consensus that valid conclusions require thoughtful study planning from the outset.

The central question that study planning addresses is deceptively simple: What kind of evidence do we need, and how should we collect it, to answer a specific question reliably? Without a clear answer to this question before data collection begins, even the most sophisticated analysis techniques cannot rescue a flawed study. The remainder of this lesson explores the principles, structures, and decisions that underpin effective study planning in statistics.

SECTION 2

Core Principles & Definitions

Planning a study requires a clear understanding of several foundational concepts that determine the type of study you conduct, the conclusions you can draw, and the population to which those conclusions apply. At the broadest level, every statistical investigation begins with a research question—a precisely stated inquiry that the study is designed to answer. The nature of that question dictates whether you need an observational study or an experiment. Understanding the distinction between these two broad categories—and the further subtypes within each—is arguably the single most important idea in the AP Statistics data-collection unit.

1

Observational Study vs. Experiment

In an observational study, the researcher measures variables without imposing any treatment. In an experiment, the researcher deliberately imposes a treatment on subjects to observe the effect. Only experiments can establish cause-and-effect relationships.
2

Population & Sample

The population is the entire group about which we want to draw conclusions. A sample is the subset actually studied. Proper sampling techniques ensure the sample is representative of the population.
3

Confounding Variables

A confounding variable is associated with both the explanatory variable and the response variable, making it impossible to determine which factor actually causes an observed effect. Good study design aims to control or eliminate confounding.
4

Scope of Inference

The scope of inference describes the extent to which conclusions can be generalized. Random selection supports generalization to the population; random assignment supports causal conclusions. A well-planned study clarifies both dimensions.
✦ KEY TAKEAWAY
Think of planning a study like planning a road trip. Your research question is the destination. Whether you choose an observational study or an experiment is your choice of route—highways (experiments) get you to causal conclusions faster, while scenic roads (observational studies) reveal interesting associations along the way. Your sampling method is your GPS—it determines whether you actually arrive at a destination that represents the broader landscape, or just one peculiar corner of it.
SECTION 3

Visual Overview: The Study Planning Framework

The diagram below provides a comprehensive visual map of the decision-making process that underlies planning a statistical study. Starting from a research question, the flowchart distinguishes the two main study types and highlights the key design elements associated with each. Understanding this branching structure is essential for the AP Statistics exam, where you will frequently be asked to identify the type of study, evaluate its design, and determine the scope of valid conclusions.

Research QuestionDoes the researcher impose a treatment?NoYesObservational StudyExperimentSample SurveyProspective / Retro.RandomizedBlocked / MatchedScope of InferenceAssociation only (no causation)Scope of InferenceCausation possibleRandom selection → Generalization to population
The flowchart shows the decision tree for study planning. Starting from a research question (top), the critical branching point asks whether the researcher imposes a treatment. The left branch leads to observational studies (association only), while the right branch leads to experiments (causation possible). The dashed lines to the amber box emphasize that random selection is what enables generalization to the broader population, regardless of study type.

Notice that the diagram distinguishes between two types of randomness that often confuse students. Random selection (choosing who is in the study from the population) supports generalizability—the ability to extend conclusions to the broader population. Random assignment (deciding who receives which treatment within the study) supports causal inference—the ability to attribute differences in the response to the treatment itself. A study can have one, both, or neither of these features, and the scope of inference changes accordingly.

SECTION 4

How Study Design Determines Valid Conclusions

While the "Collecting Data" unit of AP Statistics is less equation-heavy than inference or probability, there is a rigorous logical framework that governs how study design maps onto the conclusions you may legitimately draw. The relationship between design features and inferential scope can be organized into a 2 × 2 scope-of-inference matrix, which is one of the most tested conceptual structures on the AP exam.

The 2 × 2 Scope-of-Inference Matrix
Random Assignment (Yes)No Random Assignment
Random Selection (Yes)Causal conclusions generalizable to the population. This is the ideal scenario—an experiment with a randomly selected sample.Association conclusions generalizable to the population. Observational study with a representative sample.
No Random SelectionCausal conclusions limited to subjects in the study. A common experimental scenario where volunteers are used.No causal or generalizable conclusions. A convenience sample with no random assignment offers the weakest evidence.

The matrix above is not a formula to memorize; it is a logical consequence of what randomness accomplishes. Random selection ensures that the sample reflects the population, so findings can be generalized outward. Random assignment balances both known and unknown confounding variables across treatment groups, so any observed difference in the response variable can be attributed to the treatment rather than to lurking variables. When a study lacks one or both forms of randomness, the corresponding inferential claim—generalizability or causation—is no longer supported.

💡 AP Exam Tip
Free-response questions frequently ask you to identify the type of study and then state what conclusions can and cannot be drawn. Always address both dimensions: (1) Can we establish causation? (Only if random assignment was used.) (2) Can we generalize to a larger population? (Only if random selection was used.) Providing both parts earns full credit; addressing only one is a common reason for partial-credit scores.

Confounding: The Key Threat to Causal Inference

A confounding variable is a variable that is associated with both the explanatory variable and the response variable, creating an alternative explanation for the observed relationship. In an observational study comparing coffee drinkers to non-coffee drinkers on anxiety levels, for instance, stress could be a confounder: high-stress individuals may drink more coffee and experience more anxiety, making it unclear whether coffee itself drives the association. Random assignment in an experiment disrupts confounding by distributing all variables—measured or unmeasured—roughly equally across groups. This is why only well-designed experiments can support causal claims.

SECTION 5

Classifying Studies: A Detailed Breakdown

Within the two broad categories of observational studies and experiments, several subtypes appear regularly on the AP Statistics exam. Understanding the characteristics of each subtype helps you identify study designs in context and anticipate the strengths and limitations that follow.

Study Type ClassificationObservational StudiesSample SurveyCollects data at one point in time viaquestionnaire.Prospective StudyFollows subjects forward in time to track outcomes.Retrospective StudyLooks backward at existing data or records.Key LimitationCannot establish causation due topotential confounding variables.ExperimentsCompletely Randomized DesignSubjects randomly assigned to treatments.Randomized Block DesignGroups formed by a blocking variable, thenrandomized within blocks.Matched Pairs DesignEach subject paired with another (or servesas own control); treatments randomly assigned.Key StrengthRandom assignment controls confounding,enabling causal conclusions.
This side-by-side classification diagram contrasts the subtypes of observational studies (left panel) with experimental designs (right panel). Each subtype is defined by how data are collected over time and how subjects are assigned to conditions. The bottom boxes summarize the key limitation (for observational studies) and key strength (for experiments).

A sample survey collects information from respondents at a single point in time, typically through questionnaires or interviews. It is perhaps the most common form of observational study and is subject to biases such as nonresponse and wording effects. A prospective study follows a group of subjects forward in time, recording exposures and outcomes as they develop; this approach is stronger than a retrospective design because it can track the temporal sequence of events. A retrospective study looks backward, mining existing records or asking subjects to recall past behavior—useful when the outcome has already occurred but susceptible to recall bias.

On the experimental side, a completely randomized design (CRD) is the simplest structure: all subjects are randomly allocated to treatment groups without any preliminary grouping. A randomized block design first organizes subjects into blocks based on a variable that is expected to affect the response (such as age group or gender), and then randomly assigns treatments within each block. This reduces variability and increases the precision of comparisons. A matched pairs design is a special case of blocking in which each block contains exactly two subjects who are matched on relevant characteristics, or a single subject who receives both treatments in random order.

SECTION 6

Worked Example: Identifying and Evaluating a Study Design

Consider the following scenario, which mirrors the kind of prompt you would encounter on the AP Statistics free-response section.

📋 Scenario
A university health center wants to determine whether a new meditation app reduces self-reported stress levels among first-year students. Researchers recruit 120 volunteers from the first-year class and randomly assign 60 to use the app daily for four weeks and 60 to continue their normal routines. At the end of four weeks, all 120 students complete a validated stress questionnaire, and the mean scores are compared.

Analyzing the Study Design

Step 1 — Identify the Study Type

The researchers imposed a treatment (using the meditation app) on the treatment group. Because a treatment was deliberately applied, this is an experiment, not an observational study.
Study type: Experiment

Step 2 — Identify the Experimental Design

All 120 volunteers were randomly assigned to one of two groups without any preliminary grouping by a blocking variable. This means the design is a completely randomized design (CRD) with two treatment groups.
Design: Completely Randomized Design

Step 3 — Identify Key Components

The explanatory variable (factor) is whether or not the student uses the meditation app. The response variable is the self-reported stress score on the questionnaire. The experimental units are the 120 first-year student volunteers. The treatments are (1) daily use of the meditation app and (2) continuing normal routines (the control).
Explanatory: app usage; Response: stress score; Units: 120 students; Treatments: app vs. control

Step 4 — Assess Random Selection vs. Random Assignment

The students were volunteers, not a random sample from all first-year students. This means random selection was not used. However, random assignment was used to place students into treatment groups.
Random selection: No. Random assignment: Yes.

Step 5 — Determine the Scope of Inference

Because random assignment was used, we can draw a causal conclusion: if a statistically significant difference is found, we can say the meditation app caused a reduction in stress. However, because random selection was not used (participants were volunteers), we cannot generalize the results beyond the 120 volunteers to all first-year students at the university.
Causation: Yes (for these volunteers). Generalization: No (volunteers only).
SECTION 7

Strengths & Limitations of Each Study Type

No single study type is universally superior. The choice between observational and experimental designs depends on the research question, ethical constraints, available resources, and the population of interest. Recognizing the relative strengths and limitations of each approach is critical both for the AP exam and for evaluating real-world research.

Comparison of Observational Studies and Experiments
FeatureObservational StudyExperiment
Causal InferenceCannot establish causation; confounders remain uncontrolled.Can establish causation when random assignment is used.
Ethical FeasibilityOften the only option when imposing a treatment would be unethical (e.g., smoking, exposure to toxins).Limited to situations where treatment can be ethically imposed.
GeneralizabilityCan use random sampling from a large population, supporting strong generalizability.Often uses volunteers, limiting generalizability.
Cost & TimeRetrospective studies are inexpensive; prospective studies can be lengthy.Controlled conditions can be costly and time-intensive to maintain.
Control of ConfoundersRelies on statistical techniques (e.g., stratification) to address confounders.Random assignment balances both known and unknown confounders.
✦ KEY TAKEAWAY
In clinical research, an experiment is like testing a bridge with controlled loads before it opens: you actively apply stress and measure the response. An observational study is like monitoring traffic on existing bridges and noticing which designs develop cracks—you learn valuable patterns, but you cannot definitively attribute the cracks to one specific design choice because many factors differ simultaneously. Both approaches contribute to knowledge; the key is matching the study type to the research question and honestly communicating the limitations of the chosen design.
SECTION 8

Connection to Inference and Advanced Design

The principles introduced in this lesson lay the groundwork for every inference procedure you will encounter later in AP Statistics. When you perform a two-sample t-test or construct a confidence interval for a difference in proportions, the validity of those procedures depends on how the data were collected. A well-designed experiment with random assignment allows you to test the null hypothesis that the treatment has no effect, while a well-designed sample survey allows you to construct confidence intervals that apply to the population. Conversely, when study design is flawed, no amount of inferential machinery can fix the resulting bias.

How Study Planning Concepts Connect to Later Topics
Concept in This LessonAdvanced Connection
Random assignment → causal conclusionsPermutation tests and randomization distributions simulate what would happen under the null hypothesis if the treatment had no effect, directly leveraging the logic of random assignment.
Random selection → generalizationConfidence intervals for population parameters (μ, p) assume the sample is representative of the population, which random selection ensures.
Blocking in experimentsStratified random sampling in surveys uses the same logic: group similar units together to reduce variability and increase precision.
Confounding variablesIn regression analysis, lurking and confounding variables motivate the inclusion of additional predictors in multiple regression models to isolate the effect of each variable.

Looking beyond AP Statistics, advanced courses in biostatistics and econometrics formalize these ideas further. Quasi-experimental designs—such as regression discontinuity and difference-in-differences—attempt to approximate the benefits of randomization when true experiments are infeasible. Meta-analyses synthesize findings across many studies, weighting each study partly by the rigor of its design. The fundamental question remains the same one you are learning to ask now: Given how these data were collected, what conclusions are justified?

SECTION 9

Practice Problems

PROBLEM 1 — CONCEPTUAL
A researcher records the daily fruit and vegetable intake and body mass index (BMI) of 500 randomly selected adults. She finds that adults who eat more servings of fruits and vegetables tend to have lower BMIs. Which of the following is the most appropriate conclusion?
PROBLEM 2 — BASIC CALCULATION
A teacher wants to test whether listening to classical music during a test improves scores. She flips a coin for each of her 30 students: heads means the student listens to music during the test, tails means silence. Which of the following best describes this study?
PROBLEM 3 — INTERMEDIATE
Researchers want to study whether a new fertilizer increases tomato yield. They have 40 plots of land that differ in sun exposure: 20 plots receive full sun and 20 receive partial shade. The researchers form two blocks based on sun exposure and then randomly assign half the plots in each block to the new fertilizer and half to the standard fertilizer. Which of the following correctly identifies the design, the blocking variable, and the scope of inference?
PROBLEM 4 — APPLIED
A public health agency wants to determine whether a community-wide advertising campaign about the dangers of sugary drinks reduces soda consumption among teenagers. The agency selects a simple random sample of 200 teenagers from the county. Over the next 6 months, the campaign runs in local media. After 6 months, the agency surveys the same 200 teenagers about their current soda consumption. (a) Is this an observational study or an experiment? Explain. (b) Identify one potential confounding variable and explain how it could affect the results. (c) Describe the scope of inference for this study. (d) Propose a modification that would strengthen the study's ability to establish causation.
PROBLEM 5 — CRITICAL THINKING
A medical researcher is studying whether a new drug lowers blood pressure. She has 80 volunteers: 40 are male and 40 are female. (a) Explain why a randomized block design, blocking on sex, might be preferable to a completely randomized design in this context. (b) Describe in detail how the researcher should implement the randomized block design. Include how subjects are organized, how treatments are assigned, and what comparisons are made. (c) Suppose the researcher finds a statistically significant reduction in blood pressure for the drug group. State the appropriate conclusion, being precise about causation and generalization. (d) A colleague suggests the study would be improved by also randomly selecting participants from the broader population of adults. Explain what inferential advantage this would provide and why it is often difficult to achieve in practice.
SUMMARY

Summary

Planning a study is the foundational step of any statistical investigation. Every study begins with a clearly stated research question that determines whether an observational study or an experiment is appropriate. In observational studies—including sample surveys, prospective studies, and retrospective studies—the researcher records data without imposing a treatment, and conclusions are limited to association because confounding variables cannot be ruled out.

In experiments—such as completely randomized designs, randomized block designs, and matched pairs designs—random assignment of treatments enables causal conclusions by balancing confounders across groups. The scope of inference depends on two independent features: random selection supports generalization to the population, while random assignment supports causal inference. Mastering the 2 × 2 scope-of-inference matrix and being able to identify, evaluate, and critique study designs are essential skills for success on the AP Statistics exam.

Varsity Tutors • AP Statistics • Introduction to Planning a Study