Hanzi (Characters)
Guide to Chinese characters: simplified vs traditional, stroke types and order rules, semantic-phonetic compound structure, the top 15 semantic radicals, and learning resources.
Hanzi (汉字, hànzì) — Chinese characters — are the writing system of Chinese. Unlike alphabets, which map symbols to sounds, Chinese characters are logographic: each character represents a morpheme (a meaningful unit), with pronunciation encoded separately (sometimes within the character itself, sometimes not at all).
Chinese is the only major living writing system that is logographic. Learning it requires explicit study in a way that learning, say, Spanish script does not. But the structure is not arbitrary — once you understand the system, characters become learnable and even predictable.
Scale: How Many Characters Do You Need?
| Level | Characters | Coverage |
|---|---|---|
| Most frequent 100 | ~100 | ~42% of all text by token frequency |
| Most frequent 500 | ~500 | ~75% of everyday text |
| Functional literacy | 2,000–3,000 | ~97–99% of newspapers/modern text |
| HSK 9 (highest level) | 3,088 (recognition) | Required for the top exam level |
| Chinese high school graduate | ~3,500–4,000 | Practical upper bound for active use |
| Total characters in existence | ~50,000–85,000 | Most extremely rare; not needed |
Bottom line: 2,000–3,000 characters gets you to functional literacy. The HSK 9 standard of 3,088 is a reasonable goal for serious learners.
Simplified vs. Traditional
| Simplified (简体字) | Traditional (繁體字) | |
|---|---|---|
| Where used | Mainland China, Singapore | Taiwan, Hong Kong, Macau, overseas communities |
| Origin | PRC government reforms, 1950s–1960s | Historical forms; evolved over millennia |
| Stroke count | Generally fewer | Generally more |
| Example: love | 爱 (ài) | 愛 (ài) — the heart 心 is retained inside |
| Example: dragon | 龙 (lóng) | 龍 (lóng) |
| Example: hear | 听 (tīng) | 聽 (tīng) |
Which to learn?
- Learning Mandarin for mainland China, business, or HSK: learn Simplified
- Learning for Taiwan, Hong Kong, classical texts, or aesthetics: learn Traditional
- Many learners start with Simplified and add Traditional recognition later — the two systems share ~70% of characters unchanged
The 7 Basic Stroke Types
Every character is built from a small set of strokes. There are 7 basic types:
| # | Name | Chinese | Pronunciation | Shape | Description |
|---|---|---|---|---|---|
| 1 | Horizontal | 横 | héng | ─ | Left to right, slightly rising |
| 2 | Vertical | 竖 | shù | │ | Top to bottom, straight |
| 3 | Left-falling | 撇 | piě | ノ | Starts at top, falls to lower-left |
| 4 | Right-falling | 捺 | nà | 乀 | Starts at upper-left, falls to lower-right, thickens at end |
| 5 | Dot | 点 | diǎn | 丶 | Short stroke, often angled |
| 6 | Hook | 钩 | gōu | 亅 | A stroke ending in a sharp hook |
| 7 | Turn | 折 | zhé | 乙 / 乚 | A stroke that changes direction |
In practice, many strokes are combinations of these (e.g., a horizontal + hook = 横钩 héng gōu). Character dictionaries typically list 31 or more combined stroke types, but the 7 above are the primitives.
The 7 Stroke Order Rules
Writing characters with correct stroke order is important for:
- Legibility (especially at speed — correct order produces natural-looking characters)
- Recognition (stroke order animations help memory)
- Digital input on mobile (handwriting recognition engines expect correct order)
| Rule | Description | Example |
|---|---|---|
| 1. Top before bottom | Upper strokes come first | 三: first ─, second ─, third ─ (top to bottom) |
| 2. Left before right | Left-side strokes first | 川: left vertical, middle, right |
| 3. Horizontal before vertical (crossing) | The horizontal goes first when strokes cross | 十: ─ then │ |
| 4. Left-falling (撇) before right-falling (捺) | 八: left piě first, then nà | 八, 人 |
| 5. Enter then close (enclosures) | Draw the frame, add interior, then close bottom | 国: outer frame with gap → interior → close bottom |
| 6. Center before sides (vertically symmetrical) | Middle vertical first | 小: │ then left dot, right dot |
| 7. Left-vertical before top-horizontal in some enclosures | For certain shapes | 口: left │, then ─, then right │ + bottom |
The rules interact and have exceptions. The authoritative source for any character is the official stroke order database — Pleco's stroke order add-on or Arch Chinese show animated stroke order for every character.
Character Structure: 80% Are Semantic-Phonetic Compounds
Chinese characters fall into several structural categories:
| Type | % of characters | Description |
|---|---|---|
| Semantic-phonetic compounds (形声字) | ~80–85% | One component suggests meaning; one suggests pronunciation |
| Simple pictographs (象形字) | ~4% | Originated as pictures of objects |
| Simple ideographs (指事字) | ~1% | Abstract symbols (上 above, 下 below, 一 one) |
| Compound ideographs (会意字) | ~13% | Two meaning components combined |
| Phonetic loans (假借字) | small % | Borrowed for sound, not meaning |
How Semantic-Phonetic Compounds Work
Each compound has two components:
- Semantic radical (意符): indicates the general meaning category
- Phonetic component (声旁): gives a (sometimes approximate) clue to pronunciation
Example: 妈 mā (mother)
- 女 (nǚ) — the "woman" radical → semantic: this relates to a female person
- 马 (mǎ, horse) — phonetic: pronounced mā (similar to mǎ)
The phonetic hint is not always exact — the pronunciation may differ in tone, initial, or final. But it is a real clue:
| Character | Pinyin | Meaning | Semantic radical | Phonetic (sound clue) |
|---|---|---|---|---|
| 妈 | mā | mother | 女 woman | 马 mǎ |
| 吗 | ma | question particle | 口 mouth | 马 mǎ |
| 骂 | mà | to scold | 马 horse (×2) | — (pure semantic) |
| 码 | mǎ | code, number | 石 stone | 马 mǎ |
| 蚂 | mǎ | ant (in 蚂蚁) | 虫 insect | 马 mǎ |
Notice how 马 shows up in the pronunciation of related characters. Learning phonetic components multiplies your character recognition speed.
Top 15 Semantic Radicals
Learning the most common semantic radicals is high-ROI. Recognizing them in unfamiliar characters immediately narrows down meaning.
| Radical | Pronunciation | Meaning | Example characters |
|---|---|---|---|
| 人 (亻) | rén | person | 你 (you), 他 (he), 做 (do), 休 (rest) |
| 口 | kǒu | mouth; speak | 吃 (eat), 喝 (drink), 说 (speak), 叫 (call) |
| 女 | nǚ | woman | 妈 (mom), 她 (she), 姐 (older sister), 嫁 (marry) |
| 手 (扌) | shǒu | hand | 打 (hit), 拿 (take), 把 (hold), 推 (push) |
| 水 (氵) | shuǐ | water | 海 (sea), 河 (river), 游 (swim), 洗 (wash) |
| 木 | mù | wood; tree | 树 (tree), 桌 (table), 椅 (chair), 森 (forest) |
| 火 (灬) | huǒ | fire | 烧 (burn), 热 (hot), 炒 (stir-fry), 灯 (lamp) |
| 心 (忄) | xīn | heart; mind | 想 (think/miss), 感 (feel), 快 (fast/happy), 忘 (forget) |
| 言 (讠) | yán | speech; language | 说 (speak), 请 (invite/please), 语 (language) |
| 目 | mù | eye | 看 (look), 眼 (eye), 睡 (sleep), 盲 (blind) |
| 土 | tǔ | earth; ground | 地 (ground/earth), 坐 (sit), 场 (place) |
| 金 (钅) | jīn | metal | 钱 (money), 银 (silver), 铁 (iron), 针 (needle) |
| 虫 | chóng | insect; creature | 蛇 (snake), 蚂蚁 (ant), 蜂 (bee), 虾 (shrimp) |
| 草 (艹) | cǎo | grass; plant | 花 (flower), 草 (grass), 茶 (tea), 药 (medicine) |
| 日 | rì | sun; day | 时 (time), 早 (early), 明 (bright), 晚 (evening) |
Simplified Radical Forms
Many radicals change form when used as a component inside a character:
| Full form | Component form | Context |
|---|---|---|
| 人 rén | 亻 | left side of character |
| 水 shuǐ | 氵 | left side (three drops) |
| 火 huǒ | 灬 | bottom of character |
| 手 shǒu | 扌 | left side |
| 心 xīn | 忄 | left side |
| 言 yán | 讠 | left side (simplified form) |
| 金 jīn | 钅 | left side (simplified form) |
| 食 shí | 饣 | left side (simplified form) |
Learning Strategy
Phase 1 — First 150 characters (Month 1)
Focus on high-frequency, structurally simple characters. Use a spaced repetition system (Anki, Pleco flashcards). Learn the most common radicals (listed above).
Phase 2 — Up to 500 characters (Months 2–4)
Start recognizing semantic-phonetic structure. When you see a new character, try to identify the semantic radical and the phonetic component. Use Pleco's stroke order add-on to practice writing.
Phase 3 — 500 to 2,000 (Months 4–18)
Sentence mining: learn characters in context from reading you actually do. Use Anki with cloze-deletion or production cards. At this stage, reading graded readers (Mandarin Companion, Du Chinese) accelerates character acquisition naturally.
Phase 4 — 2,000+ (Year 2+)
Reading volume. The more you read, the more characters you consolidate. Native content, news, novels.
Resources
| Resource | Type | Notes |
|---|---|---|
| Pleco (stroke order add-on) | App | Animated stroke order for every character; dictionary + flashcard integration |
| Skritter | App | SRS for writing practice; stroke-by-stroke correction; $~15/mo |
| Arch Chinese | Web | Free stroke order animations and printable worksheets; archchinese.com |
| Remembering the Hanzi (Heisig & Richardson) | Book | Mnemonic stories for 1,500+ characters; effective for writing production |
| Dong Chinese | Web/App | Character frequency data; shows which characters appear in graded texts |
| Chinese Grammar Wiki | Web | Also covers character usage; allsetlearning.com |
| Outlier Linguistics Dictionary | Pleco add-on | Deep etymological and component analysis; expensive but definitive |