Understanding Your Score — How Culture-Fair IQ Scores Work

A score on this test is a single number accompanied by a reasoning-type breakdown. The number tells you where your performance falls relative to the general population. The breakdown tells you where your reasoning runs deepest. Both are worth reading carefully, because a score taken out of context can mislead in either direction.

The scale

Scores are reported on the standard deviation IQ scale: mean 100, standard deviation 15. This is the scale used by modern editions of the Wechsler scales, the Stanford-Binet, and most other major intelligence instruments. It replaced the older ratio-based IQ formula (mental age over chronological age, multiplied by 100) decades ago and is now the near-universal convention.

On this scale, approximately two-thirds of scores fall between 85 and 115. Approximately 95% fall between 70 and 130. Scores above 130 or below 70 are statistically uncommon — each region contains roughly 2% of the population.

A quick reference:

145 — 99.9th percentile, top 0.1% of the population
140 — 99.6th percentile, top 0.4%
135 — 99th percentile, top 1%
130 — 98th percentile, top 2%
124 — 95th percentile, top 5%
120 — 91st percentile, top 9%
115 — 84th percentile, top 16%
110 — 75th percentile, top quartile
100 — 50th percentile, median
90 — 25th percentile, bottom quartile
85 — 16th percentile, bottom 16%
70 — 2nd percentile, bottom 2%

A score is, at its root, a statement about rank. A score of 124 does not mean "your brain is 24% better than average." It means your performance exceeded approximately 95% of the calibration sample.

Why the historical Cattell scale used a different standard deviation

Older editions of Cattell's Culture Fair Intelligence Test used a standard deviation of 24 rather than 15. This produces scores that look higher but are not comparable to modern scores on the standard scale without conversion. A Cattell-scale score of 148 corresponds roughly to a standard-scale score of 130.

This test reports scores on the modern standard scale only. When comparing results across instruments, always confirm the scale being used.

What the precision range on your score means

A single test score is an estimate, not a measurement with infinite precision. Give the same person the same test twice and the scores will typically differ slightly — not because the person's reasoning changed, but because test-taking involves variability. A lucky guess here, an unlucky misreading there, a moment of lapsed attention in the middle.

Statisticians quantify this variability as a standard error of measurement (SEM). For a well-designed intelligence test, the SEM is typically 3–5 points. This means if your obtained score is 124, the 68% confidence interval — the range where the true score is most likely to fall — is approximately 119 to 129. A wider 95% confidence interval extends roughly from 115 to 133.

This test reports precision ranges with each score:

Short Assessment (12 items) — approximate precision ±6 IQ points (roughly the 90% confidence interval)
Standard Test (25 items) — approximate precision ±3 IQ points (roughly the 95% confidence interval)

The Standard Test is more precise not because it asks harder questions, but because it samples more items, which reduces the influence of any single item's outcome on the total.

What to do with this information: treat your score as a neighborhood, not a pinpoint. A score of 127 and a score of 124 are, statistically, indistinguishable. A score of 130 and a score of 100 are not. When comparing your result to thresholds or to prior scores, the confidence range is part of the answer.

The reasoning-type breakdown

Alongside the composite score, this test reports four sub-scores corresponding to the reasoning types it measures:

Matrix Reasoning (MR) — rule extraction and application in 2D grid patterns
Series Completion (SC) — pattern recognition across sequences
Classification (CL) — categorical abstraction and property detection
Spatial Rotation (SR) — mental transformation and spatial visualization

Most test-takers score fairly evenly across all four types. Substantial unevenness (more than about 10 points between your strongest and weakest type) is uncommon and informative. It typically reflects a genuine preference in how your reasoning operates rather than any deficit.

A profile with unusually strong MR and SC combined with weaker SR, for example, suggests reasoning that runs more easily with rule-based abstract patterns than with visual-spatial transformation. This is a normal variation, not a limitation.

Because each reasoning-type sub-score is based on fewer items than the composite, sub-scores carry more measurement noise. Read them as general patterns, not as precise comparisons.

Comparison to traditional IQ instruments

Culture-fair tests and traditional full-scale IQ batteries measure overlapping but not identical constructs.

A traditional battery like the Wechsler Adult Intelligence Scale combines fluid reasoning with crystallized knowledge (vocabulary, general information), working memory, and processing speed. The resulting Full Scale IQ is a broader measure of overall cognitive functioning than a culture-fair test provides.

Research comparing scores across the two kinds of instrument typically finds correlations between .70 and .85. This means:

Most people score similarly on both kinds of test (around two-thirds are within 7–10 points)
Some people score meaningfully higher on one or the other (about one-third differ by more)
The direction of any discrepancy is informative: higher culture-fair than traditional often indicates strong fluid reasoning combined with less exposure to verbally-loaded content; higher traditional than culture-fair often indicates strong crystallized knowledge compensating for weaker nonverbal reasoning

Neither direction is "more correct." The two instruments measure different things and both are legitimate views of cognitive ability.

A note on high-range scoring

Culture-fair tests are often used in high-range cognitive assessment — measuring scores substantially above the normal IQ range where traditional instruments run out of ceiling. Raven's Advanced Progressive Matrices, for example, is specifically calibrated to discriminate among scores above 130.

The Standard Test on this site produces reliable scores through approximately the 99th percentile (around 135). Scores above that threshold should be interpreted with greater caution — a single test, even a well-designed one, can only resolve so finely at the extremes of the distribution. For careful high-range assessment, multiple instruments and qualified supervision remain the standard.

What your score is not

A score on this or any intelligence test measures a specific cognitive capacity under specific testing conditions. It is not a measure of personality, character, creativity, wisdom, practical competence, mental health, social skill, or the value of a person. The psychometric literature is clear on these limits and the test-taker should be too.

Within its actual domain — nonverbal reasoning and fluid intelligence — a well-administered culture-fair test produces a score that is stable across time, relatively independent of cultural background, and modestly informative about a narrow but real human capacity. That is what the instrument does. It is not more than that and does not need to be.

Ready to measure? Begin the assessment, or read about the culture-fair methodology first.

Understanding your score.