Two-variable data & scatterplots

A scatterplot shows how two quantities move together. The questions ask you to read its direction, use a line (or curve) of best fit to predict, interpret what the line's slope means, and measure how far a real point sits from the prediction.

Tested on SAT Problem-Solving & Data Analysis

What College Board tests

Reading association (positive, negative, none) and its strength, using a line of best fit to estimate a value, interpreting the slope and intercept of that line in context, and computing a residual — the gap between an actual data point and the line's prediction.

The key ideas

Association
Points trending up → positive; trending down → negative; no pattern → none.
Best-fit line
The line that passes as close as possible to all points. Use it to predict and to read a rate.
Slope
The predicted change in for each one-unit increase in — a rate, in context.
Residual
. Positive means the point sits above the line; negative, below.

Four worked examples in SAT format. Read the approach, try it yourself, then tap Show the full solution.

1 · Predict with the line of best fit

A scatterplot (not shown) relates study hours to test score , and its line of best fit has equation . Based on the line of best fit, what is the predicted test score for a student who studies 10 hours?

Approach A prediction from the best-fit line is just substitution. Put the given -value into the equation and compute .

Show the full solution

Answer: C

substitute 10 for

Why the other choices are wrong

A  Uses a slope of 1 instead of 2.

B  Computes but drops the intercept.

D  Adds the intercept twice.

2 · Interpret the slope in context

A line of best fit for a city's data is , where is years since 2010 and is the number of charging stations. Which is the best interpretation of the slope, 3?

  1. There were 3 charging stations in 2010.
  2. The number of charging stations increased by about 3 each year.
  3. There were 3 charging stations total.
  4. It took 3 years to build the first station.

Approach The slope is the predicted change in per one-unit increase in . Here is years and is stations, so the slope is stations per year.

Show the full solution

Answer: B

the slope is 3 — the coefficient of
the slope is a rate; "per year" means "for each one year," so its denominator is 1 year

So each additional year predicts about 3 more charging stations. To see the rate in action, multiply by a span of years and watch "year" cancel: over 4 years. (The 12 in the equation is the intercept — the predicted count in 2010, when .)

Why the other choices are wrong

A  Describes the intercept (12), and even then uses the wrong number.

C  Treats 3 as a total, not a yearly rate.

D  Reads 3 as a duration rather than a rate of change.

3 · Identify the association

In a scatterplot, as the values on the horizontal axis increase, the values on the vertical axis tend to decrease, with the points falling roughly along a straight line. Which best describes the association?

  1. Strong positive linear association
  2. Strong negative linear association
  3. No association
  4. Strong positive exponential association

Approach Two questions: direction and shape. "As goes up, goes down" sets the direction; "roughly along a straight line" sets the shape.

Show the full solution

Answer: B

opposite directions → negative
straight-line pattern → linear, and tightly so → strong

Down-and-to-the-right along a line is a strong negative linear association.

Why the other choices are wrong

A  Positive would mean rises as rises — the opposite here.

C  A clear downward trend is an association, not the absence of one.

D  Wrong direction and wrong shape — it's linear and decreasing.

4 · Compute a residual

The scatterplot shows the relationship between hours studied and test score for several students, along with a line of best fit. For the highlighted data point at , what is the difference between the actual score and the score predicted by the line of best fit? (The equation of the line is .)

y x (9, 25)

Approach This difference is the residual: actual minus predicted. Read the actual score off the highlighted point, use the line to predict at , then subtract.

Show the full solution

Answer: B

predicted score at
residual = actual − predicted
the point sits 2 above the line
y x actual 25 predicted 23

The highlighted point lies above the best-fit line; the short vertical gap between the actual score (25) and the line's prediction (23) is the residual of +2.

Why the other choices are wrong

A  Reverses the subtraction (predicted − actual).

C  Reports the intercept rather than the residual.

D  Uses the predicted value's pieces; is not the residual.


Quick reference

Predict
Substitute into the best-fit equation.
Slope
Change in per one-unit — a rate in context.
Association
Up → positive; down → negative; no pattern → none.
Residual
; + above the line, − below.
Shu's Tutoring SAT · Problem-Solving & Data Analysis · Two-variable data