Chinese Vocabulary by Frequency

Mandarin Chinese words ranked by corpus frequency, in top-100, top-500, and top-3,000 lists. The rankings are based on the SUBTLEX-CH corpus (Cai & Brysbaert 2010), which is derived from ~33 million words of Chinese subtitle text.

Coverage Statistics

| List | Items | Text coverage |
|---|---|---|
| Top 100 words | 100 | ~65% of everyday text |
| Top 500 words | 500 | ~75% of conversation |
| Top 1,000 characters | 1,000 | ~90% of common written text |
| Top 3,000 words | 3,000 | ~90% of standard modern text |
| Top 10,000 words | 10,000 | ~99%+ of modern text |

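Key insight: the top 1,000 characters account for roughly 90% of the characters in common written text. Recognizing a character is not the same as knowing every word built from it, so treat this as a reading milestone rather than full comprehension. It is achievable in 6–12 months of focused study.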
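Coverage figures like those in the table can be computed as the share of all corpus tokens accounted for by the N most frequent items. The following is a minimal Python sketch of that arithmetic. The `coverage` helper and the toy counts are illustrative assumptions; they are not SUBTLEX-CH data and not part of any existing tool.

```python
from collections import Counter
from typing import Mapping


def coverage(freqs: Mapping[str, int], top_n: int) -> float:
    """Fraction of all tokens accounted for by the top_n most frequent items."""
    total = sum(freqs.values())
    if total == 0:
        return 0.0
    # Sort counts from highest to lowest and keep the first top_n.
    top_counts = sorted(freqs.values(), reverse=True)[:top_n]
    return sum(top_counts) / total


# Toy counts invented for illustration only (not real corpus figures).
toy_counts = Counter({"的": 50, "我": 30, "是": 10, "不": 6, "了": 4})

print(f"Top 2 items cover {coverage(toy_counts, 2):.0%} of tokens")
```

With real data, `freqs` would hold the per-word (or per-character) counts from the corpus, and calling `coverage` with 100, 500, or 3,000 would give figures comparable to the table.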

Top 100 Most Frequent Words

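| Rank | Chinese | Pinyin | English | Notes |
|---|---|---|---|---|
| 1 | 的 | de | possessive/attributive particle | Most common character in Chinese |
| 2 | 一 | yī | one; a | Also used in phrases |
| 3 | 是 | shì | to be | Copula (before nouns only) |
| 4 | 不 | bù | not, no | Changes to bú before 4th tone |
| 5 | 了 | le | completion/change aspect | NOT a past tense marker |
| 6 | 人 | rén | person, people | |
| 7 | 我 | wǒ | I, me | |
| 8 | 在 | zài | at, in; to be at | |
| 9 | 有 | yǒu | to have; there is/are | |
| 10 | 他 | tā | he, him | Also 她 (she), 它 (it) |
| 11 | 这 | zhè | this | 这个, 这里 |
| 12 | 中 | zhōng | middle; China | 中国, 中文 |
| 13 | 大 | dà | big, large | |
| 14 | 来 | lái | to come | |
| 15 | 上 | shàng | up, above; on | Also: to go to |
| 16 | 国 | guó | country, nation | 中国, 美国 |
| 17 | 为 | wéi/wèi | for; to be (formal) | |
| 18 | 以 | yǐ | with; by means of | Formal |
| 19 | 到 | dào | to arrive; to | Direction/resultative |
| 20 | 说 | shuō | to say, speak | |
| 21 | 和 | hé | and; with | |
| 22 | 时 | shí | time; when | |
| 23 | 地 | dì/de | earth; adverb particle | |
| 24 | 出 | chū | to exit, come out | |
| 25 | 就 | jiù | then; precisely; as soon as | Very versatile adverb |
| 26 | 你 | nǐ | you | |
| 27 | 年 | nián | year | |
| 28 | 着 | zhe | ongoing state aspect | |
| 29 | 那 | nà | that | |
| 30 | 要 | yào | want; will; need | Context-dependent |
| 31 | 会 | huì | can; will; meeting | |
| 32 | 去 | qù | to go | |
| 33 | 都 | dōu | all; both | |
| 34 | 没 | méi | not (for 有/past) | 没有 = don't have |
| 35 | 也 | yě | also, too | |
| 36 | 对 | duì | correct; facing; toward | |
| 37 | 里 | lǐ | inside; unit of distance | |
| 38 | 可 | kě | can; may | 可以, 可能 |
| 39 | 后 | hòu | after, behind | |
| 40 | 很 | hěn | very | Also a neutral link before adjectives |
| 41 | 什么 | shénme | what | |
| 42 | 我们 | wǒmen | we, us | |
| 43 | 生 | shēng | to be born; life | |
| 44 | 自 | zì | self; from | 自己, 自然 |
| 45 | 行 | xíng/háng | to walk; OK; row | |
| 46 | 做 | zuò | to do, make | |
| 47 | 这个 | zhège | this one | |
| 48 | 看 | kàn | to look, see | |
| 49 | 只 | zhǐ | only | Also: measure word (zhī) |
| 50 | 知道 | zhīdào | to know (a fact) | |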
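The note for rank 4 (bù changing to bú before a 4th tone) is a simple rule that can be sketched in code. The Python below is a simplified illustration: the `bu_pinyin` helper is hypothetical, and it assumes the tone of the following syllable is already known as a number (1 to 4, or 5 for neutral).

```python
def bu_pinyin(next_tone: int) -> str:
    """Return the pinyin of 不 given the tone number of the syllable that follows it."""
    # Default reading is 4th tone (bù); before another 4th tone it becomes 2nd tone (bú).
    return "bú" if next_tone == 4 else "bù"


print(bu_pinyin(4), "shì")  # 不是: 是 is 4th tone
print(bu_pinyin(2), "lái")  # 不来: 来 is 2nd tone
```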

Full top-500 and top-3000 lists will be generated by the chinese vocab frequency CLI command.

Data Sources