When you have two sets of numbers (like training hours and goals scored), a scatterplot lets you see whether one is related to the other. The explanatory variable goes on the x-axis (the one doing the predicting), the response on the y-axis.
Your calculator gives r, a number from β1 to +1. The sign tells you the direction, the size tells you the strength.
| r near⦠| Means |
|---|---|
| +1 | strong positive |
| 0 | no linear correlation |
| β1 | strong negative |
8 strikers: training sessions/week (x) vs goals/season (y). The calculator gives r = 0.99.
β’ Correlation is not causation. A strong r means the two move together, not that one causes the other.
β’ r only measures linear association, a perfect curve can still have a low r.
β’ Always describe in context (name the variables), "strong positive" alone won't get full marks.
β’ An outlier can pull r up or down, flag it.
Up to the right is positive, down to the right is negative, no pattern means r near zero.
The closer r is to either end, the stronger the linear relationship. The middle means no linear pattern.
Don't read yet, just have a go in your head:
Points go up to the right (positive), straight-line pattern (linear), tight cluster (strong). r = 0.99 β very strong positive linear association.
r = 0.99 is close to ?, so the association is ? and ?.
A study finds r = β0.15 between hours of TV and exam marks. Describe the association in words. Check below.
Ice cream sales and drownings both rise together (high r). Does buying ice cream cause drownings? Explain.
In one sentence, out loud: what do the sign and size of r each tell you? If you can say it, you've got it.