Pinyin
Complete Mandarin pinyin guide: all 21 initials and 36 finals, pronunciation notes for English speakers, special spelling rules, tone mark placement, and common pitfalls.
Pinyin (拼音, pīnyīn, literally "spell sound") is the official romanization system for Standard Chinese (Putonghua). It was developed in the 1950s and adopted by the People's Republic of China in 1958. Pinyin uses the Latin alphabet plus four diacritic marks for tones, and one special letter (ü).
Pinyin is how Chinese is typed on computers and phones, how words appear in dictionaries, and how children in China learn to read before mastering characters. For learners, it is the essential bridge to pronunciation and tone notation.
Critical warning: Pinyin letters do not always map to their English phonetic values. The letter x is not like English "x"; q is not like English "q"; zh/ch/sh/r are retroflexes with no English equivalent; b/d/g are not the voiced sounds English speakers expect. Study pinyin pronunciation explicitly — do not guess from English spelling.
The 21 Initials
Initials are the consonant sounds that begin a syllable. A syllable may have no initial (zero initial), in which case the vowel begins the syllable directly.
Bilabials (lips together)
| Initial | IPA | English approximation | Notes |
|---|---|---|---|
| b | /p/ | like "p" in "spin" | Unaspirated — no puff of air. NOT the voiced "b" in "bat" |
| p | /pʰ/ | like "p" in "pot" | Aspirated — strong puff of air |
| m | /m/ | like "m" in "mother" | Standard |
Labiodental (lip + teeth)
| Initial | IPA | English approximation | Notes |
|---|---|---|---|
| f | /f/ | like "f" in "fan" | Standard |
Alveolars (tongue tip to ridge behind upper teeth)
| Initial | IPA | English approximation | Notes |
|---|---|---|---|
| d | /t/ | like "t" in "stop" | Unaspirated — no puff of air. NOT the voiced "d" in "dog" |
| t | /tʰ/ | like "t" in "top" | Aspirated |
| n | /n/ | like "n" in "no" | Standard |
| l | /l/ | like "l" in "look" | Standard |
Velars (back of tongue to soft palate)
| Initial | IPA | English approximation | Notes |
|---|---|---|---|
| g | /k/ | like "k" in "ski" | Unaspirated — NOT the voiced "g" in "go" |
| k | /kʰ/ | like "k" in "kite" | Aspirated |
| h | /x/ | like "ch" in Scottish "loch" | Slightly throaty; NOT English "h" but close |
Palatals (j q x — only appear before i and ü)
| Initial | IPA | English approximation | Notes |
|---|---|---|---|
| j | /tɕ/ | like "j" in "jeep" but unaspirated and further forward | Never before a, o, e, u. Always before i or ü (written u after j) |
| q | /tɕʰ/ | like "ch" in "cheese" but further forward, aspirated | Aspirated counterpart to j |
| x | /ɕ/ | like "sh" in "she" but further forward, more hiss | The hiss of air through a narrow gap; very different from English "x" |
j/q/x vs. zh/ch/sh: j/q/x are palatal — tongue body near hard palate, no retroflexion. zh/ch/sh are retroflex — tongue tip curled back. These are four distinct series that English has no equivalent for. They never overlap in distribution: j/q/x only before i/ü; zh/ch/sh before everything else.
Retroflexes (tongue tip curled back toward hard palate)
| Initial | IPA | English approximation | Notes |
|---|---|---|---|
| zh | /ʈʂ/ | like "j" in "jar" but with tongue curled back | Unaspirated; tongue tip curled back |
| ch | /ʈʂʰ/ | like "ch" in "church" but with tongue curled back | Aspirated; tongue tip curled back |
| sh | /ʂ/ | like "sh" in "shoe" but with tongue curled back | Tongue tip curled back; more hiss |
| r | /ɻ/ | like "r" in "run" with tongue curled back — but voiced | Unique sound; some describe it as between English "r" and "zh"; NOT a trill |
Sibilants / Dental affricates (tongue tip at upper teeth)
| Initial | IPA | English approximation | Notes |
|---|---|---|---|
| z | /ts/ | like "ds" in "suds" | Unaspirated; tongue touches teeth |
| c | /tsʰ/ | like "ts" in "cats" | Aspirated counterpart to z |
| s | /s/ | like "s" in "sun" | Standard |
Aspiration: The Key to b/p, d/t, g/k, z/c, zh/ch, j/q
The Chinese distinction between pairs like b/p and d/t is aspiration (a puff of air), not voicing. English uses voicing (voiced "b" vs. voiceless "p"), but Mandarin uses aspiration (unaspirated vs. aspirated, both voiceless).
Test: Hold your hand in front of your mouth. "p" in Mandarin (aspirated) should blow your hand; "b" should not. Say "spin" — the p in spin is like Mandarin b.
This means:
- Mandarin b = English "p" in "spin" (unaspirated)
- Mandarin p = English "p" in "pot" (aspirated)
- Mandarin d = English "t" in "stop"
- Mandarin g = English "k" in "ski"
The 6 Simple Finals (Vowels)
| Final | IPA | Sound | Notes |
|---|---|---|---|
| a | /a/ | "ah" as in "father" | Open, back |
| o | /o/ | "aw" as in "law" | Round |
| e | /ɤ/ | no English equivalent | Unrounded back mid vowel; like "uh" with lips spread back |
| i | /i/ | "ee" as in "see" | But after zh/ch/sh/r/z/c/s, it becomes a different sound (apical vowel, like a buzz) |
| u | /u/ | "oo" as in "food" | Round |
| ü | /y/ | French "u" or German "ü" | Round lips as if saying "oo", but spread tongue as if saying "ee" |
The ü sound: no equivalent in English. Round your lips for "oo" and keep them round while you say "ee". After j, q, x, and y, the ü is written as plain u (since those initials never appear before the regular u — there is no ambiguity). So: 鱼 yú (fish) has ü, not u.
Compound Finals Table
| -a | -o | -e | -i | -u | -ü | -an | -en | -ang | -eng | -ong | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| (zero) | a | o | e | yi | wu | yu | an | en | ang | eng | — |
| b- | ba | bo | — | bi | bu | — | ban | ben | bang | beng | — |
| p- | pa | po | — | pi | pu | — | pan | pen | pang | peng | — |
| m- | ma | mo | me | mi | mu | — | man | men | mang | meng | — |
| f- | fa | fo | — | — | fu | — | fan | fen | fang | feng | — |
| d- | da | — | de | di | du | — | dan | den | dang | deng | dong |
| t- | ta | — | te | ti | tu | — | tan | — | tang | teng | tong |
| n- | na | — | ne | ni | nu | nü | nan | nen | nang | neng | nong |
| l- | la | — | le | li | lu | lü | lan | — | lang | leng | long |
| g- | ga | — | ge | — | gu | — | gan | gen | gang | geng | gong |
| k- | ka | — | ke | — | ku | — | kan | ken | kang | keng | kong |
| h- | ha | — | he | — | hu | — | han | hen | hang | heng | hong |
| j- | — | — | — | ji | — | ju | jian | jin | jiang | jing | jiong |
| q- | — | — | — | qi | — | qu | qian | qin | qiang | qing | qiong |
| x- | — | — | — | xi | — | xu | xian | xin | xiang | xiong | |
| zh- | zha | — | zhe | zhi | zhu | — | zhan | zhen | zhang | zheng | zhong |
| ch- | cha | — | che | chi | chu | — | chan | chen | chang | cheng | chong |
| sh- | sha | — | she | shi | shu | — | shan | shen | shang | sheng | — |
| r- | — | — | re | ri | ru | — | ran | ren | rang | reng | rong |
| z- | za | — | ze | zi | zu | — | zan | zen | zang | zeng | zong |
| c- | ca | — | ce | ci | cu | — | can | cen | cang | ceng | cong |
| s- | sa | — | se | si | su | — | san | sen | sang | seng | song |
Medial glides
Finals can begin with medial glides i-, u-, ü- (forming compound finals):
| Medial i- (y-) | ia, ie, iao, iou, ian, in, iang, ing, iong | | Medial u- (w-) | ua, uo, uai, uei, uan, uen, uang, ueng | | Medial ü- (yu-) | üe, üan, ün |
When a medial glide begins a syllable without an initial consonant, spelling changes apply:
- i → y + finals (or just yi for bare i)
- u → w + finals (or just wu for bare u)
- ü → yu + finals
Nasal Finals
| Final | Sound | Example |
|---|---|---|
| -n | front nasal; tongue touches ridge | 安 ān, 人 rén |
| -ng | back nasal (like "ng" in "sing"); no closure | 方 fāng, 生 shēng |
Note: -n and -ng are systematically contrasted. 安 ān vs 昂 áng, 心 xīn vs 星 xīng.
Special Spelling Rules
1. ü written as u after j, q, x, y
Since j, q, x, and y are palatals that never precede the regular u vowel, the ü diacritic is dropped when following these initials. Context makes it unambiguous:
- 居 = jū (not jǖ) — pronounced with ü
- 区 = qū — pronounced with ü
- 需 = xū — pronounced with ü
- 鱼 = yú — pronounced with ü
But after n and l, the distinction matters, so ü is always written:
- 女 nǚ (woman) vs 怒 nù (anger)
- 旅 lǚ (travel) vs 路 lù (road)
2. Apostrophe for syllable boundaries
When a syllable beginning with a vowel (a, o, e) follows another syllable, an apostrophe separates them to avoid ambiguity:
- 西安 Xī'ān (Xi'an city) — not "xian" which would be read as one syllable
- 天鹅 tiān'é (swan)
- 皮袄 pí'ǎo (fur coat)
3. iou, uei, uen shortening
When these finals follow an initial, they drop the middle vowel in spelling:
- iou → iu: 六 liù, 丢 diū
- uei → ui: 对 duì, 水 shuǐ
- uen → un: 论 lùn, 春 chūn
The full vowels (o, e) are always present in pronunciation but omitted in spelling.
4. e vs ê
The final e in syllables like ge, ke, he is the unrounded /ɤ/ vowel. In finals like ie and üe, the e is actually /ɛ/ (like "eh"). Pinyin doesn't distinguish these — context makes it clear.
Tone Mark Placement Rules
In any pinyin syllable, the tone mark goes on exactly one vowel letter. The rule is:
Priority order: a > o > e > i/u (last in sequence) > ü
- If the final contains a or e, mark it: tā, zhè
- If the final contains ou, mark the o: gōu, chóu
- Otherwise, mark the last vowel in the final: guī → ī, liú → ú, xiōng → ō
Examples:
- 妈 mā — only a, mark it
- 贵 guì — ui, mark the last: i → guì
- 走 zǒu — ou, mark o → zǒu
- 吃 chī — only i, mark it
For ü with a tone mark: ǖ ǘ ǚ ǜ (rare; u-umlaut with tone diacritics stacked).
Common Mistakes for English Speakers
| Mistake | Explanation | Fix |
|---|---|---|
| Pronouncing b/d/g as English voiced sounds | Mandarin b/d/g are voiceless unaspirated | Say "spy", "sty", "sky" — the b/d/g sound in those |
| Confusing j/q/x with zh/ch/sh | Completely different articulation points | j/q/x: tongue forward; zh/ch/sh: tongue curled back |
| Saying x like English "sh" | Mandarin x is fronter and thinner | Spread lips wide and make an "s"-like hiss further forward |
| Pronouncing r like English "r" | Mandarin r is retroflex and voiced | Curl tongue back; some say it's between "r" and "zh" |
| Mispronouncing final -e | It's /ɤ/, not English "eh" | Unround lips, pull tongue back — "uh" with spread lips |
| Confusing -n and -ng | English speakers often merge these | For -ng, do NOT close — leave mouth slightly open |
| Ignoring ü vs. u after n/l | 女 nǚ ≠ 怒 nù | Round lips for ü — it sounds different from u |