SAT Math · Statistics

Two-Variable Data & Scatterplots

SAT-M-2VD-01
Teaching Notes
v2 · 2026.05

A scatterplot shows how two variables relate to each other. Each dot represents one observation — a single data point with both an x-value and a y-value. The SAT tests scatterplots in two main ways: identifying which equation models the data (linear, quadratic, or exponential) and interpreting what the slope or y-intercept means in the context of the problem. The hardest questions don't test math computation — they test whether you can read a graph and connect it to algebra.

Reading a scatterplot

Every scatterplot has the same basic structure: an x-axis (independent variable) and a y-axis (dependent variable), with dots showing how the two variables co-vary. The first thing to do with any scatterplot is identify the shape of the relationship.

Positive linear
As x increases, y increases at a constant rate. Best-fit line slopes upward.
Negative linear
As x increases, y decreases at a constant rate. Best-fit line slopes downward.
Quadratic (parabola)
U-shaped or inverted-U. Variable rate of change. Has a single peak or trough (vertex).

The SAT also tests exponential relationships — these look like a curve that gets steeper and steeper (or shallower and shallower) without a peak.

Linear models

FORMULA
Slope-intercept form: y = mx + b

When the data falls roughly along a straight line, the best-fit equation is in slope-intercept form. m is the slope (the rate at which y changes per unit increase in x) and b is the y-intercept (the value of y when x = 0).

The SAT loves to ask what the slope or y-intercept means in the context of the real-world scenario.

Interpreting the slope in context

A scatterplot shows the relationship between the size of a house (in thousands of square feet, x-axis) and its sale price (in thousands of dollars, y-axis). The best-fit line is:

y = 100x + 100

What does the slope of 100 represent in this context?

  1. Slope = change in y per unit increase in x.
  2. x is in thousands of square feet. y is in thousands of dollars.
  3. So a slope of 100 means: for every additional 1,000 square feet, the predicted sale price increases by $100,000.
Each additional 1,000 sq ft adds $100,000 to the predicted price.
Interpreting the y-intercept in context

Using the same equation y = 100x + 100, what does the y-intercept of 100 represent?

  1. Y-intercept = value of y when x = 0. So when x = 0 thousand square feet, the predicted price is y = 100 thousand dollars.
  2. In context, this is the predicted price of a house with zero square feet — clearly not a realistic value, but mathematically it represents the model's "baseline" before adding any size.
$100,000 is the model's baseline price for size = 0.

Y-intercepts often represent something physically nonsensical (like a house with zero square feet). That's normal — the model is only meaningful in the range of the actual data.

Matching an equation to a scatterplot

METHOD
Y-intercept first, then slope

The SAT often gives you a scatterplot and four equations as answer choices. To pick the right one:

  1. Check the y-intercept. Find where the trend line would cross the y-axis (x = 0). Eliminate equations whose b-value doesn't match.
  2. Check the slope sign. If the trend goes up as x increases, slope is positive. If it goes down, slope is negative. Eliminate any equation with the wrong sign.
  3. Check the slope size. Estimate the rise/run from two visible points on the trend. Eliminate equations whose slope is wildly off.
Picking the right equation

A scatterplot shows the relationship between time t (in years) and distance d (in some unit). The trend line passes through approximately (0, 84) and (10, 414). Which equation is the best fit?

  • (A) d = 0.03t + 402
  • (B) d = 10t + 402
  • (C) d = 33t + 300
  • (D) d = 33t + 84
  1. The y-intercept (where x = 0) is about 84. Eliminate (A), (B), (C) — none have 84 as the constant.
  2. Verify the slope on (D). From (0, 84) to (10, 414):
    rise = 414 − 84 = 330 run = 10 slope = 330 / 10 = 33 ✓
Answer: (D) d = 33t + 84

The y-intercept is usually the fastest way to eliminate three of four answer choices. Look there first.

Quadratic models

FORMULA
Quadratic form: y = ax² + bx + c

When data forms a parabolic (U-shaped or inverted-U) curve, the best-fit equation is quadratic. The most-tested feature is the sign of the leading coefficient a:

  • If a is positive, the parabola opens UP (like a U). The vertex is a minimum.
  • If a is negative, the parabola opens DOWN (like an inverted U). The vertex is a maximum.
a > 0 (opens up) a < 0 (opens down)
When matching a quadratic equation to a scatterplot, the FIRST check is the sign of the leading coefficient a — does the parabola open up or down?
⚠ The SAT's favorite quadratic trap

The SAT often gives four answer choices that differ ONLY in the signs of the coefficients. Example:

(A) y = 1.674x² + 19.76x − 745.73 (B) y = −1.674x² − 19.76x − 745.73 (C) y = 1.674x² + 19.76x + 745.73 (D) y = −1.674x² + 19.76x + 745.73

The trick: the actual quadratic computation isn't needed. Just check (a) the parabola direction (sign of x² coefficient) and (b) the y-intercept (constant term). Two checks usually eliminate three options.

EXAMPLE
Picking a quadratic from the answer choices

A scatterplot shows electricity generated by nuclear sources over a 12-year period. The data forms an inverted-U shape (peaks in the middle). The y-intercept (where t = 0) is approximately 745. Which is the best-fit equation?

  • (A) y = 1.674x² + 19.76x − 745.73
  • (B) y = −1.674x² − 19.76x − 745.73
  • (C) y = 1.674x² + 19.76x + 745.73
  • (D) y = −1.674x² + 19.76x + 745.73
Solution
  1. The parabola opens DOWN (inverted-U), so the leading coefficient must be negative. Eliminate (A) and (C).
  2. The y-intercept is +745, so the constant term must be POSITIVE. Eliminate (B) (which has −745.73).
Answer: (D)

No quadratic computation needed. Two visual checks — parabola direction and y-intercept sign — fully determined the answer.

Recognizing the model type

Before matching to an equation, you need to know what TYPE of equation to look for. Here's how to tell at a glance:

Pattern in the scatterModel typeEquation form
Roughly straight line (up or down)Lineary = mx + b
U-shaped or inverted-UQuadraticy = ax² + bx + c
Curve that grows faster and faster (or decays slower and slower)Exponentialy = a · b^x
⚡ Linear vs. exponential — the trap

A scatter that curves upward could be either quadratic OR exponential. The difference:

Quadratic goes through a minimum (or maximum) somewhere — even if the visible portion is just one side of the parabola.

Exponential never has a peak or trough — it just keeps curving in the same direction.

On the SAT, look for the answer-choice form: if you see x² in the choices, it's quadratic. If you see b^x (like 1.05^t), it's exponential. Often the problem tells you the model type explicitly ("the data is best modeled by a quadratic function").

Sample SAT-style problems

Sample 1 — interpreting slope

A scatterplot shows the relationship between the years since 1940 (x) and the federal minimum wage in dollars (y). The best-fit line is:

y = 0.096x − 0.488

What does the slope of 0.096 mean in this context?

  1. Slope = change in y per unit change in x. The y is dollars, x is years.
  2. So a slope of 0.096 means: per year, the predicted minimum wage increases by $0.096 (about 10 cents).
Minimum wage increases by about $0.10 per year.
Sample 2 — picking a linear equation

A scatterplot of beach visitors (y) vs. average temperature in °C (x) suggests a linear relationship. The trend line passes through (25, 80) and (35, 560). Which equation best models the data?

  • (A) y = 48x − 1120
  • (B) y = 48x + 1120
  • (C) y = −48x + 1120
  • (D) y = 30x + 80
  1. Compute the slope from the two points:
    m = (560 − 80) / (35 − 25) = 480 / 10 = 48
    Eliminate (C) and (D).
  2. Plug (25, 80) into y = 48x + b to solve for b:
    80 = 48(25) + b b = 80 − 1200 = −1120
Answer: (A) y = 48x − 1120
Sample 3 — quadratic model selection

A scatterplot shows the relationship between the depth of a swimming pool (x) and the water pressure (y). The data forms a U-shape with the lowest point near x = 5, and the y-intercept appears to be around 25. Which is the best fit?

  • (A) y = 2x² − 20x + 25
  • (B) y = −2x² + 20x − 25
  • (C) y = 2x² − 20x − 25
  • (D) y = −2x² − 20x + 25
  1. Parabola opens UP (U-shape) → leading coefficient is POSITIVE. Eliminate (B) and (D).
  2. Y-intercept is +25 → constant term must be +25. Eliminate (C).
Answer: (A) y = 2x² − 20x + 25
Sample 4 — predicting from the model

The scatterplot of ice cream sales (y, in dollars) vs. temperature (x, in °C) is modeled by:

y = 45x − 200

Use the model to predict ice cream sales when the temperature is 22°C.

  1. Substitute x = 22 into the equation:
    y = 45(22) − 200 y = 990 − 200 y = 790
Answer: about $790
⚠ Common scatterplot mistakes

1. Misreading the axis units. If the y-axis is "thousands of dollars" and you read 100, the actual value is $100,000, not $100. Always check the axis labels for units like "thousands," "millions," "percent."

2. Swapping slope and y-intercept. "What does the y-intercept represent?" asks about the constant term (b in y = mx + b), not the slope. Read the question carefully — they often appear together to test whether you know which is which.

3. Ignoring the parabola direction on quadratics. A U-shape and an inverted-U are the same shape mathematically, but they have OPPOSITE signs on the leading coefficient. Always check which way the parabola opens FIRST when matching a quadratic equation.

4. Using the line of best fit as if it were exact. The line of best fit is a model — individual data points scatter around it. The SAT sometimes asks "according to the line of best fit" or "according to the model" — that means use the equation, not the actual scattered points.

SAT-M-2VD-01 · Two-Variable Data & Scatterplots Shu's Tutoring · Notes Library