What Are Graded Readers
A graded reader is a piece of prose written under a strict vocabulary ceiling. Every content word in the text is one the learner has already been taught, which means the learner can read the whole thing without stopping for a dictionary. The format has existed in language pedagogy for at least 150 years; the digital version of it is what powers comprehensible-input methods like Mandarin Companion and Olly Richards' Short Stories series. This article covers what they are, why they work, the cognitive research behind them, how to pick a level, and what comes after.
For the Kilo Lingo curriculum, graded readers sit between the rote-vocabulary drill (lessons 1, 3-51) and the real-world content (podcasts, films, full books). They are the bridge most language-learning systems leave out.
The format: vocabulary ceiling as a hard constraint
A graded reader is defined by one variable: its vocabulary ceiling. The ceiling is usually expressed as a frequency rank (top 100, top 1,000, top 3,000) or a CEFR level (A1, A2, B1, B2). Either way, the constraint is the same: the writer can use any word at or below the ceiling, and no word above it.
The ceiling matters because it makes the text readable at speed. A learner who has just drilled the top 100 most-frequent Spanish lemmas can open a top-100 story and read it the way a native reads their own language - eyes moving forward, no lookups, the meaning landing as it goes. Take the same learner and hand them a B1 short story, and they stop every three or four words to check vocabulary, lose the narrative thread, and emerge with less retained than they would have from a single paragraph of in-tier prose.
The ceiling is also what makes graded readers feel patronisingly simple when you first see them. The top-100 vocabulary across most languages contains very few content words and almost no concrete nouns. A top-100 Spanish story has the three conjugatable verbs ser, querer, and hacer and the six available content nouns (casa, vez, tiempo, verdad, favor, señor). The result reads as childlike. That is the cost. The benefit is that the learner who has drilled exactly those words finally has a text that puts them to work.
Why they work: comprehensible input research
The pedagogy behind graded readers is Stephen Krashen's comprehensible input hypothesis, refined as the "i+1" principle in the 1980s. The claim is that language is acquired (as opposed to learned) when the learner is exposed to input that is slightly above their current level but still mostly comprehensible - the "i+1" being the existing competence (i) plus one increment of new material.
The empirical record since then is unusually clean for a humanities field. Studies on extensive reading at controlled difficulty levels (Day and Bamford 1998, Nation 2014, Webb and Nation 2017) consistently find that:
- Vocabulary acquisition rate is highest when 95-98% of the words in a text are already known. Below 95% known-word coverage, the learner spends too much cognitive load on decoding to retain new vocabulary. Above 98%, the new-vocabulary intake per hour drops because there is too little new material.
- Grammar acquisition through reading at the right level outpaces explicit grammar drilling by a wide margin for adult learners over 12 months. This is the contested claim of the field but the replication record now favours it.
- Reading speed in the target language increases roughly linearly with hours of in-tier reading, and plateaus only when the learner exhausts the in-tier corpus and refuses to move up.
- Retention of vocabulary from graded reading is dramatically higher than retention from rote drill alone. The number cited in Nation (2014) is around 5x retention from a single contextual exposure compared to a single flashcard pass.
The 95-98% known-word coverage figure is the load-bearing one. A graded reader at the right tier delivers that coverage by construction. A "real" text rarely does until you are well past B2.
Why most learners under-use them
There are three predictable reasons learners skip graded readers:
- The prose reads as patronising. A top-100 Spanish story sounds like a children's book to anyone who already speaks Spanish, which means the learner reading it for the first time feels self-conscious. The fix is to internalise that the prose is shaped that way by constraint, not by author choice. The simplicity is the feature.
- The format gets confused with "graded textbook exercises". Textbook reading exercises (the kind that come at the end of a chapter with comprehension questions) are not graded readers. They are tests. Graded readers are stories. The distinction matters because tests trigger anxiety; stories trigger flow.
- Adult learners over-rate flashcards. Anki and Quizlet are easy to commit to: they have measurable progress, gamified streaks, and a clear endpoint per session. Reading does not. The result is a generation of adult learners with enormous flashcard decks and no reading habit, who plateau at upper-intermediate because their vocabulary is theoretical rather than active.
The fix for the third one is to schedule reading the same way you schedule flashcards: a daily 15-minute block, at the same tier you are currently drilling, with no skipping.
How to pick the right level
The rule is: read at the tier you have just finished drilling, not the tier you are about to drill. Specifically:
- If you have just completed your top-100 vocabulary, read top-100 stories. The Spanish, French and Mandarin top-100 stories on this site are roughly 60-100 words each, designed to be read in a single minute, and they use the exact lemmas you have just spent a week with.
- After top-500: move to top-500 stories. These are longer (150-250 words), include past tense in Spanish and French, and start using the actual conjugational machinery rather than just present-tense action.
- After core-1000: move to publisher graded readers at A2-B1 level (Mandarin Companion, Penguin Readers, Hachette's Lire en français facile).
- After core-5000: move to abridged classic novels at B2 level (Olly Richards' Short Stories series, the original-language editions with footnotes).
- At C1 and above: stop reading graded material entirely and switch to unmodified native texts. Graded readers stop adding value once your in-context known-word coverage on native material climbs above 95%.
The single most common error is reading one tier too high, in the hope of pulling oneself up. This does not work. The brain processes texts above the comprehensibility threshold as decoding exercises rather than reading; vocabulary intake drops, narrative flow collapses, and the learner ends the session feeling discouraged rather than reinforced. Lower-tier reading at speed is the higher-yield strategy every time.
What comes after
Graded readers run out of usefulness somewhere around mid-B2. By that point, the learner has the vocabulary and grammar to handle most non-technical native text with a manageable number of lookups, and the diminishing returns of staying inside a constrained corpus catch up. The transition target is native media at a familiar topic - a newspaper opinion column on a topic you already understand in English, a YouTube video on a subject you already know, a podcast on a hobby you already follow.
The transition is uncomfortable and that is the point. Native material at B2 is doing the same job graded readers were doing at A2, with the difference that the unfamiliar 5-15% of words are now the surface of a much larger iceberg. The shape of the work changes from "read for pleasure" to "read with a lookup tool open"; this is normal and not a regression. By the time the lookup rate drops below one per page, you are reading as a competent adult in the language, and graded readers will look in retrospect like the scaffold they always were.
Cross-links
- Spanish curriculum and the top-100 Spanish story.
- French curriculum and the top-100 French story.
- Mandarin curriculum and the top-100 Mandarin story.
- What is a polyglot covers the broader methodology question of how adult learners actually reach fluency.
- How polyglots learn languages is the companion piece on input-driven acquisition.
- CEFR levels explained covers the level framework that maps graded-reader tiers to skill ranges.