Hanzi (Characters)

Guide to Chinese characters: simplified vs traditional, stroke types and order rules, semantic-phonetic compound structure, the top 15 semantic radicals, and learning resources.

Hanzi (汉字, hànzì) — Chinese characters — are the writing system of Chinese. Unlike alphabets, which map symbols to sounds, Chinese characters are logographic: each character represents a morpheme (a meaningful unit), with pronunciation encoded separately (sometimes within the character itself, sometimes not at all).

Chinese is the only major living writing system that is logographic. Learning it requires explicit study in a way that learning, say, Spanish script does not. But the structure is not arbitrary — once you understand the system, characters become learnable and even predictable.


Scale: How Many Characters Do You Need?

Level Characters Coverage
Most frequent 100 ~100 ~42% of all text by token frequency
Most frequent 500 ~500 ~75% of everyday text
Functional literacy 2,000–3,000 ~97–99% of newspapers/modern text
HSK 9 (highest level) 3,088 (recognition) Required for the top exam level
Chinese high school graduate ~3,500–4,000 Practical upper bound for active use
Total characters in existence ~50,000–85,000 Most extremely rare; not needed

Bottom line: 2,000–3,000 characters gets you to functional literacy. The HSK 9 standard of 3,088 is a reasonable goal for serious learners.


Simplified vs. Traditional

Simplified (简体字) Traditional (繁體字)
Where used Mainland China, Singapore Taiwan, Hong Kong, Macau, overseas communities
Origin PRC government reforms, 1950s–1960s Historical forms; evolved over millennia
Stroke count Generally fewer Generally more
Example: love 爱 (ài) 愛 (ài) — the heart 心 is retained inside
Example: dragon 龙 (lóng) 龍 (lóng)
Example: hear 听 (tīng) 聽 (tīng)

Which to learn?

  • Learning Mandarin for mainland China, business, or HSK: learn Simplified
  • Learning for Taiwan, Hong Kong, classical texts, or aesthetics: learn Traditional
  • Many learners start with Simplified and add Traditional recognition later — the two systems share ~70% of characters unchanged

The 7 Basic Stroke Types

Every character is built from a small set of strokes. There are 7 basic types:

# Name Chinese Pronunciation Shape Description
1 Horizontal héng Left to right, slightly rising
2 Vertical shù Top to bottom, straight
3 Left-falling piě Starts at top, falls to lower-left
4 Right-falling Starts at upper-left, falls to lower-right, thickens at end
5 Dot diǎn Short stroke, often angled
6 Hook gōu A stroke ending in a sharp hook
7 Turn zhé 乙 / 乚 A stroke that changes direction

In practice, many strokes are combinations of these (e.g., a horizontal + hook = 横钩 héng gōu). Character dictionaries typically list 31 or more combined stroke types, but the 7 above are the primitives.


The 7 Stroke Order Rules

Writing characters with correct stroke order is important for:

  1. Legibility (especially at speed — correct order produces natural-looking characters)
  2. Recognition (stroke order animations help memory)
  3. Digital input on mobile (handwriting recognition engines expect correct order)
Rule Description Example
1. Top before bottom Upper strokes come first 三: first ─, second ─, third ─ (top to bottom)
2. Left before right Left-side strokes first 川: left vertical, middle, right
3. Horizontal before vertical (crossing) The horizontal goes first when strokes cross 十: ─ then │
4. Left-falling (撇) before right-falling (捺) 八: left piě first, then nà 八, 人
5. Enter then close (enclosures) Draw the frame, add interior, then close bottom 国: outer frame with gap → interior → close bottom
6. Center before sides (vertically symmetrical) Middle vertical first 小: │ then left dot, right dot
7. Left-vertical before top-horizontal in some enclosures For certain shapes 口: left │, then ─, then right │ + bottom

The rules interact and have exceptions. The authoritative source for any character is the official stroke order database — Pleco's stroke order add-on or Arch Chinese show animated stroke order for every character.


Character Structure: 80% Are Semantic-Phonetic Compounds

Chinese characters fall into several structural categories:

Type % of characters Description
Semantic-phonetic compounds (形声字) ~80–85% One component suggests meaning; one suggests pronunciation
Simple pictographs (象形字) ~4% Originated as pictures of objects
Simple ideographs (指事字) ~1% Abstract symbols (上 above, 下 below, 一 one)
Compound ideographs (会意字) ~13% Two meaning components combined
Phonetic loans (假借字) small % Borrowed for sound, not meaning

How Semantic-Phonetic Compounds Work

Each compound has two components:

  • Semantic radical (意符): indicates the general meaning category
  • Phonetic component (声旁): gives a (sometimes approximate) clue to pronunciation

Example: 妈 mā (mother)

  • 女 (nǚ) — the "woman" radical → semantic: this relates to a female person
  • 马 (mǎ, horse) — phonetic: pronounced (similar to )

The phonetic hint is not always exact — the pronunciation may differ in tone, initial, or final. But it is a real clue:

Character Pinyin Meaning Semantic radical Phonetic (sound clue)
mother 女 woman 马 mǎ
ma question particle 口 mouth 马 mǎ
to scold 马 horse (×2) — (pure semantic)
code, number 石 stone 马 mǎ
ant (in 蚂蚁) 虫 insect 马 mǎ

Notice how 马 shows up in the pronunciation of related characters. Learning phonetic components multiplies your character recognition speed.


Top 15 Semantic Radicals

Learning the most common semantic radicals is high-ROI. Recognizing them in unfamiliar characters immediately narrows down meaning.

Radical Pronunciation Meaning Example characters
人 (亻) rén person 你 (you), 他 (he), 做 (do), 休 (rest)
kǒu mouth; speak 吃 (eat), 喝 (drink), 说 (speak), 叫 (call)
woman 妈 (mom), 她 (she), 姐 (older sister), 嫁 (marry)
手 (扌) shǒu hand 打 (hit), 拿 (take), 把 (hold), 推 (push)
水 (氵) shuǐ water 海 (sea), 河 (river), 游 (swim), 洗 (wash)
wood; tree 树 (tree), 桌 (table), 椅 (chair), 森 (forest)
火 (灬) huǒ fire 烧 (burn), 热 (hot), 炒 (stir-fry), 灯 (lamp)
心 (忄) xīn heart; mind 想 (think/miss), 感 (feel), 快 (fast/happy), 忘 (forget)
言 (讠) yán speech; language 说 (speak), 请 (invite/please), 语 (language)
eye 看 (look), 眼 (eye), 睡 (sleep), 盲 (blind)
earth; ground 地 (ground/earth), 坐 (sit), 场 (place)
金 (钅) jīn metal 钱 (money), 银 (silver), 铁 (iron), 针 (needle)
chóng insect; creature 蛇 (snake), 蚂蚁 (ant), 蜂 (bee), 虾 (shrimp)
草 (艹) cǎo grass; plant 花 (flower), 草 (grass), 茶 (tea), 药 (medicine)
sun; day 时 (time), 早 (early), 明 (bright), 晚 (evening)

Simplified Radical Forms

Many radicals change form when used as a component inside a character:

Full form Component form Context
人 rén left side of character
水 shuǐ left side (three drops)
火 huǒ bottom of character
手 shǒu left side
心 xīn left side
言 yán left side (simplified form)
金 jīn left side (simplified form)
食 shí left side (simplified form)

Learning Strategy

Phase 1 — First 150 characters (Month 1)

Focus on high-frequency, structurally simple characters. Use a spaced repetition system (Anki, Pleco flashcards). Learn the most common radicals (listed above).

Phase 2 — Up to 500 characters (Months 2–4)

Start recognizing semantic-phonetic structure. When you see a new character, try to identify the semantic radical and the phonetic component. Use Pleco's stroke order add-on to practice writing.

Phase 3 — 500 to 2,000 (Months 4–18)

Sentence mining: learn characters in context from reading you actually do. Use Anki with cloze-deletion or production cards. At this stage, reading graded readers (Mandarin Companion, Du Chinese) accelerates character acquisition naturally.

Phase 4 — 2,000+ (Year 2+)

Reading volume. The more you read, the more characters you consolidate. Native content, news, novels.


Resources

Resource Type Notes
Pleco (stroke order add-on) App Animated stroke order for every character; dictionary + flashcard integration
Skritter App SRS for writing practice; stroke-by-stroke correction; $~15/mo
Arch Chinese Web Free stroke order animations and printable worksheets; archchinese.com
Remembering the Hanzi (Heisig & Richardson) Book Mnemonic stories for 1,500+ characters; effective for writing production
Dong Chinese Web/App Character frequency data; shows which characters appear in graded texts
Chinese Grammar Wiki Web Also covers character usage; allsetlearning.com
Outlier Linguistics Dictionary Pleco add-on Deep etymological and component analysis; expensive but definitive