Mandarin Vocabulary by HSK Level: What You Actually Need at HSK 1 to HSK 6

Two metrics matter in Mandarin and most learner blogs only count one of them. The first is words: the HSK official syllabus is calibrated in cumulative word counts, and the headline number for each level (150 at HSK 1, up to 5,000+ at HSK 6) is what the exam tests. The second is characters: Mandarin orthography is logographic, and the Core 1,000 characters cover around 90% of running text while the Core 3,000 cover around 99% (per Da Xiang Frequency Dictionary and Nation 2006 on the character coverage curve). The HSK does not test character count directly, but no adult learner gets to HSK 4 without committing to a character-frequency-first reading curriculum in parallel.

This page is the per-level breakdown using HSK 2.0 numbers (the version most adult learners still study against), with real example lemmas drawn from the Kilo Lingo Mandarin word index. For the institutional side of the exam (CEFR mapping, scholarships, where the test is required) see the HSK explainer. For the underlying frequency argument that drives the Kilo Lingo curriculum see why the first 1,000 words matter and the Core 1,000 character list.

The word and character count at each HSK level

The HSK 2.0 official cumulative word counts, drawn from the Hanban / Chinese Ministry of Education specification, with the approximate character count and the rough CEFR equivalent each level claims. The CEFR mappings are the official Hanban claims; most independent assessors believe HSK 5 is closer to a strong B2 and HSK 6 to a strong C1, not C1 and C2 as Hanban lists.

HSK levelCumulative wordsApprox. charactersCEFR (official Hanban claim)What it actually does
HSK 1150~170A1Survival; introduce yourself, order a coffee, count to a hundred.
HSK 2300~350A2Simple transactions; aspect marker 了, basic past, common time expressions.
HSK 3600~620B1Everyday adult conversation; full aspect system, opinion verbs, comparisons.
HSK 41,200~1,070B2Abstract topics; newspaper headlines with effort; formal-register entry.
HSK 52,500~1,710C1Newspaper reading, formal correspondence, films with Chinese subtitles.
HSK 65,000+~2,640C2Academic reading, classical-derived idioms, professional written register.

The character-to-word ratio shifts as you go up. HSK 1 is close to one character per word because most HSK 1 lemmas are single-syllable function words and pronouns; by HSK 6 the average word is closer to two characters, and a meaningful share are chéngyǔ (four-character idioms) drawn from classical sources. This is structurally why HSK 5 to HSK 6 takes longer than HSK 1 to HSK 4 combined: the vocabulary becomes more compound, more abstract, and more etymologically opaque.

HSK 1 (beginner)

150 cumulative words. Roughly 170 characters.

HSK 1 is pronouns, the three core copular and existential verbs (是 (shì), 有 (yǒu), 在 (zài)), the basic negation 不 (bù), numbers, days, common nouns of person and place, and the question particles 吗 (ma) and 呢 (ne). It is the closed-class function vocabulary the language runs on, plus enough open-class nouns and verbs to form a complete (if short) sentence.

A representative HSK 1 set:

The structural reason HSK 1 looks the way it does: Mandarin is a topic-prominent SVO language with no inflection, so the function words (的 (de), 了 (le), 吗 (ma), 呢 (ne)) carry the grammatical work that English handles through morphology. Get those into long-term memory at HSK 1 and the next three levels are about widening lexis rather than learning new structure.

Time at 30 minutes a day for an English speaker: around 4 to 6 months.

HSK 2 (elementary)

300 cumulative words. Roughly 350 characters.

HSK 2 doubles the headline count and introduces the perfective aspect marker 了 (le), the experiential aspect marker 过 (guò), more verbs of movement and perception, time expressions (今天 (jīn tiān), 明天 (míng tiān), 昨天 (zuó tiān)), and the everyday school-and-work vocabulary that lets you talk about your day.

Representative HSK 2 additions:

The structural reason: HSK 2 is the first level at which a real adult conversation about past events becomes possible. The 了 (le) particle is the workhorse, and it is also the single most over-explained piece of Mandarin grammar in beginner textbooks. The intermediate grammar page covers the four distinct uses of 了 that emerge over HSK 2 to HSK 4.

Time: another 4 to 6 months, cumulative 8 to 12 months from zero.

HSK 3 (intermediate)

600 cumulative words. Roughly 620 characters.

HSK 3 is where the aspect system arrives in full: 了 (le), 着 (zhe), 过 (guò) all in active use; comparison structures with 比较 (bǐ jiào); the opinion verbs 觉得 (jué de) and 认为 (rèn wéi); causal connectors 因为 (yīn wèi) and 所以 (suǒ yǐ). Conversation widens from transactional to opinion-bearing.

Representative HSK 3 additions:

This is the character-recognition cliff. The 620 characters at HSK 3 are roughly the same as the Core 1,000 most frequent characters across the whole language: 90% coverage of running text sits in this band. Adult learners who have been reading via pinyin gloss up to here now have to commit to spaced repetition on the character side or quietly drop out. The honest budget is around 200 hours of focused character review spread over 6 to 9 months, on top of the regular speaking and listening practice. Skritter, Anki, and Pleco with its built-in flashcard module are the dominant tools.

Time: another 12 to 18 months, cumulative 2 to 3 years from zero.

HSK 4 (upper intermediate)

1,200 cumulative words. Roughly 1,070 characters.

HSK 4 is the level the conventional Mandarin advice underestimates. The cumulative count doubles from 600 to 1,200, the character count almost doubles from 620 to 1,070, and the new vocabulary is mostly abstract: relationships, situations, plans, decisions, capabilities. This is the abstract-noun explosion that breaks the concrete-noun visualisation strategies that carried most learners from HSK 1 to HSK 3.

Representative HSK 4 additions:

Chéngyǔ (四字成语, the four-character classical idioms) also start to appear at HSK 4, though they are still rare. By HSK 5 they are unavoidable, and by HSK 6 they are the marker that separates strong intermediate writing from advanced.

Time: another 18 to 24 months, cumulative 4 to 5 years from zero. This is the level at which a weekly tutor becomes structurally necessary; see the intermediate grammar page for the patterns that stop being optional here, especially the 把 (bǎ) and 被 (bèi) constructions.

HSK 5 (advanced)

2,500 cumulative words. Roughly 1,710 characters.

HSK 5 is newspaper Mandarin and formal written register. The new vocabulary is heavily nominal, frequently disyllabic compound (two-character) nouns and verbs of Chinese government, economy, society, and culture. Chéngyǔ are now expected. The 1,710 characters at this level cover roughly 97% of running text in non-specialist writing.

Representative HSK 5 additions:

HSK 5 is the level Chinese university programmes and most white-collar employers in mainland China actually want. The hours number is significant: cumulative 2,200 to 2,500 hours from zero, which lines up with the US Foreign Service Institute's Category V rating for Mandarin (around 2,200 hours to professional working proficiency). For an adult doing 45 minutes a day, that is 6 to 8 years. The schedule is not negotiable; the only variable is whether you front-load it with immersion or stretch it across a decade.

Time: another 2 to 3 years on top of HSK 4, cumulative 6 to 8 years from zero.

HSK 6 (mastery for the exam)

5,000+ cumulative words. Roughly 2,640 characters.

HSK 6 is the top of the legacy ladder. The vocabulary includes literary register, classical-derived idioms, abstract philosophical and political nouns, and the bulk of the chéngyǔ a culturally educated speaker is expected to recognise. The 2,640 characters cover around 99% of running text in standard non-specialist writing.

What HSK 6 is not is native-level. The Hanban descriptors map it to CEFR C2 ("indistinguishable from an educated native") and most independent assessors disagree. Educated native Mandarin speakers operate with roughly 8,000 to 10,000 active words and recognise 20,000 to 40,000 in reading. HSK 6 certifies academic-reading proficiency at a defined level. It does not certify that you can hold a Beijing taxi-driver conversation, follow a Shanghai office banter session, or read a 19th-century novel in the original. Those are different and harder things. The 2021 HSK 3.0 reform (see below) is the formal admission that the old ceiling was lower than the marketing claimed.

Time: cumulative 10+ years from zero at 45 minutes a day, or roughly 3,000+ hours total. Most adult learners who clear HSK 6 do so after at least one immersion stretch (a year living in China or Taiwan) or after switching to Mandarin as their primary working language.

HSK 2.0 vs HSK 3.0

The Chinese Ministry of Education introduced HSK 3.0 in 2021, extending the system from six levels to nine. The reasoning was the long-standing critique that the legacy HSK 6 at 5,000 words did not actually reach C2 and that the gap between HSK 6 and educated native fluency needed its own scaffolding. The new structure:

  • HSK 1 to 3 (elementary band): roughly the same as HSK 2.0 levels 1 to 3.
  • HSK 4 to 6 (intermediate band): broadly aligned to HSK 2.0 levels 4 to 6 but with revised vocabulary.
  • HSK 7 to 9 (advanced band): a single test certifying around 11,000 cumulative words and 1,200 grammar points, including a speaking component within the written exam.

As of 2026 most adult learners outside mainland China are still working to HSK 2.0 numbers, most textbooks (the official HSK Standard Course series, the Pleco card decks, the Anki packs) are still calibrated to HSK 2.0, and most international test centres still administer HSK 1 to 6. HSK 7 to 9 is available at select centres in mainland China and a small number of international Confucius Institutes. The practical advice for an adult learner today is to study against HSK 2.0 numbers and treat HSK 7 to 9 as a long-term option rather than a current target. The HSK explainer article covers the institutional and exam-format side of the 3.0 transition in full.

How to actually build this

The Kilo Lingo prescription, in order:

  1. Pinyin and the four tones, drilled as perception not as labels. Start at the Mandarin pinyin page and the IPA chart. The minimum drill is around six months of daily minimal-pair listening (mā / má / mǎ / mà; mǎi / mài; shū / shú / shǔ / shù) before tones become reliable in conversation. Without this step every later level builds on a mispronounced foundation.
  2. The Core 1,000 characters, via spaced repetition. Anki, Skritter, or Pleco's built-in flashcard system. This is the 200-hour commitment that carries you through HSK 1 to HSK 3 and unlocks readable native input. Do not skip ahead to the Core 5,000 until the Core 1,000 are stable.
  3. HSK-tracking word lists from HSK 3 onward. The HSK 1, HSK 2, HSK 3, HSK 4, HSK 5 and HSK 6 lists on Kilo Lingo are calibrated to the HSK 2.0 official vocabulary spec. Use them as the spine of your active vocabulary, not as a complete curriculum.
  4. Reading at volume from HSK 3. The Mandarin reading list by CEFR covers graded readers from HSK 1 to native-level fiction. Real input has to start before you feel ready or it never starts at all.
  5. The Core 5,000 and the intermediate / advanced grammar pages once you clear HSK 4. The intermediate Mandarin grammar page covers the patterns that go from optional to required at HSK 4 to HSK 5; the conjugation page covers how Mandarin handles tense and aspect without inflection.
  6. A weekly tutor session from HSK 1, not from HSK 3. Mandarin is the language where the gamified apps fail adult learners hardest. Real-time pronunciation feedback in a tonal language requires a human; italki and Preply both have HSK-specialist teachers from around $20 to $40 an hour.

The wider context for all of this lives on the Mandarin pillar, which covers the FSI hours number, the Simplified vs Traditional fork, and the HSK 4 plateau wall in more depth. For why this site obsesses about the first 1,000 of anything, see why the first 1,000 words matter and the CEFR explainer for the European framework the HSK imperfectly maps onto.

Sources

  • Hanban / Center for Language Education and Cooperation: HSK 2.0 vocabulary lists (chinesetest.cn).
  • Da Xiang Frequency Dictionary (Mandarin character and word frequency data).
  • Nation, I. S. P. (2006). How large a vocabulary is needed for reading and listening? Canadian Modern Language Review, 63(1), 59-82.
  • US Foreign Service Institute Language Difficulty Rankings (Category V for Mandarin, around 2,200 hours to professional working proficiency).