Pinyin

Complete Mandarin pinyin guide: all 21 initials and 36 finals, pronunciation notes for English speakers, special spelling rules, tone mark placement, and common pitfalls.

Pinyin (拼音, pīnyīn, literally "spell sound") is the official romanization system for Standard Chinese (Putonghua). It was developed in the 1950s and adopted by the People's Republic of China in 1958. Pinyin uses the Latin alphabet plus four diacritic marks for tones, and one special letter (ü).

Pinyin is how Chinese is typed on computers and phones, how words appear in dictionaries, and how children in China learn to read before mastering characters. For learners, it is the essential bridge to pronunciation and tone notation.

Critical warning: Pinyin letters do not always map to their English phonetic values. The letter x is not like English "x"; q is not like English "q"; zh/ch/sh/r are retroflexes with no English equivalent; b/d/g are not the voiced sounds English speakers expect. Study pinyin pronunciation explicitly — do not guess from English spelling.


The 21 Initials

Initials are the consonant sounds that begin a syllable. A syllable may have no initial (zero initial), in which case the vowel begins the syllable directly.

Bilabials (lips together)

Initial IPA English approximation Notes
b /p/ like "p" in "spin" Unaspirated — no puff of air. NOT the voiced "b" in "bat"
p /pʰ/ like "p" in "pot" Aspirated — strong puff of air
m /m/ like "m" in "mother" Standard

Labiodental (lip + teeth)

Initial IPA English approximation Notes
f /f/ like "f" in "fan" Standard

Alveolars (tongue tip to ridge behind upper teeth)

Initial IPA English approximation Notes
d /t/ like "t" in "stop" Unaspirated — no puff of air. NOT the voiced "d" in "dog"
t /tʰ/ like "t" in "top" Aspirated
n /n/ like "n" in "no" Standard
l /l/ like "l" in "look" Standard

Velars (back of tongue to soft palate)

Initial IPA English approximation Notes
g /k/ like "k" in "ski" Unaspirated — NOT the voiced "g" in "go"
k /kʰ/ like "k" in "kite" Aspirated
h /x/ like "ch" in Scottish "loch" Slightly throaty; NOT English "h" but close

Palatals (j q x — only appear before i and ü)

Initial IPA English approximation Notes
j /tɕ/ like "j" in "jeep" but unaspirated and further forward Never before a, o, e, u. Always before i or ü (written u after j)
q /tɕʰ/ like "ch" in "cheese" but further forward, aspirated Aspirated counterpart to j
x /ɕ/ like "sh" in "she" but further forward, more hiss The hiss of air through a narrow gap; very different from English "x"

j/q/x vs. zh/ch/sh: j/q/x are palatal — tongue body near hard palate, no retroflexion. zh/ch/sh are retroflex — tongue tip curled back. These are four distinct series that English has no equivalent for. They never overlap in distribution: j/q/x only before i/ü; zh/ch/sh before everything else.

Retroflexes (tongue tip curled back toward hard palate)

Initial IPA English approximation Notes
zh /ʈʂ/ like "j" in "jar" but with tongue curled back Unaspirated; tongue tip curled back
ch /ʈʂʰ/ like "ch" in "church" but with tongue curled back Aspirated; tongue tip curled back
sh /ʂ/ like "sh" in "shoe" but with tongue curled back Tongue tip curled back; more hiss
r /ɻ/ like "r" in "run" with tongue curled back — but voiced Unique sound; some describe it as between English "r" and "zh"; NOT a trill

Sibilants / Dental affricates (tongue tip at upper teeth)

Initial IPA English approximation Notes
z /ts/ like "ds" in "suds" Unaspirated; tongue touches teeth
c /tsʰ/ like "ts" in "cats" Aspirated counterpart to z
s /s/ like "s" in "sun" Standard

Aspiration: The Key to b/p, d/t, g/k, z/c, zh/ch, j/q

The Chinese distinction between pairs like b/p and d/t is aspiration (a puff of air), not voicing. English uses voicing (voiced "b" vs. voiceless "p"), but Mandarin uses aspiration (unaspirated vs. aspirated, both voiceless).

Test: Hold your hand in front of your mouth. "p" in Mandarin (aspirated) should blow your hand; "b" should not. Say "spin" — the p in spin is like Mandarin b.

This means:

  • Mandarin b = English "p" in "spin" (unaspirated)
  • Mandarin p = English "p" in "pot" (aspirated)
  • Mandarin d = English "t" in "stop"
  • Mandarin g = English "k" in "ski"

The 6 Simple Finals (Vowels)

Final IPA Sound Notes
a /a/ "ah" as in "father" Open, back
o /o/ "aw" as in "law" Round
e /ɤ/ no English equivalent Unrounded back mid vowel; like "uh" with lips spread back
i /i/ "ee" as in "see" But after zh/ch/sh/r/z/c/s, it becomes a different sound (apical vowel, like a buzz)
u /u/ "oo" as in "food" Round
ü /y/ French "u" or German "ü" Round lips as if saying "oo", but spread tongue as if saying "ee"

The ü sound: no equivalent in English. Round your lips for "oo" and keep them round while you say "ee". After j, q, x, and y, the ü is written as plain u (since those initials never appear before the regular u — there is no ambiguity). So: 鱼 yú (fish) has ü, not u.


Compound Finals Table

-a -o -e -i -u -an -en -ang -eng -ong
(zero) a o e yi wu yu an en ang eng
b- ba bo bi bu ban ben bang beng
p- pa po pi pu pan pen pang peng
m- ma mo me mi mu man men mang meng
f- fa fo fu fan fen fang feng
d- da de di du dan den dang deng dong
t- ta te ti tu tan tang teng tong
n- na ne ni nu nan nen nang neng nong
l- la le li lu lan lang leng long
g- ga ge gu gan gen gang geng gong
k- ka ke ku kan ken kang keng kong
h- ha he hu han hen hang heng hong
j- ji ju jian jin jiang jing jiong
q- qi qu qian qin qiang qing qiong
x- xi xu xian xin xiang xing xiong
zh- zha zhe zhi zhu zhan zhen zhang zheng zhong
ch- cha che chi chu chan chen chang cheng chong
sh- sha she shi shu shan shen shang sheng
r- re ri ru ran ren rang reng rong
z- za ze zi zu zan zen zang zeng zong
c- ca ce ci cu can cen cang ceng cong
s- sa se si su san sen sang seng song

Medial glides

Finals can begin with medial glides i-, u-, ü- (forming compound finals):

| Medial i- (y-) | ia, ie, iao, iou, ian, in, iang, ing, iong | | Medial u- (w-) | ua, uo, uai, uei, uan, uen, uang, ueng | | Medial ü- (yu-) | üe, üan, ün |

When a medial glide begins a syllable without an initial consonant, spelling changes apply:

  • i → y + finals (or just yi for bare i)
  • u → w + finals (or just wu for bare u)
  • ü → yu + finals

Nasal Finals

Final Sound Example
-n front nasal; tongue touches ridge 安 ān, 人 rén
-ng back nasal (like "ng" in "sing"); no closure 方 fāng, 生 shēng

Note: -n and -ng are systematically contrasted. 安 ān vs 昂 áng, 心 xīn vs 星 xīng.


Special Spelling Rules

1. ü written as u after j, q, x, y

Since j, q, x, and y are palatals that never precede the regular u vowel, the ü diacritic is dropped when following these initials. Context makes it unambiguous:

  • 居 = jū (not jǖ) — pronounced with ü
  • 区 = qū — pronounced with ü
  • 需 = xū — pronounced with ü
  • 鱼 = yú — pronounced with ü

But after n and l, the distinction matters, so ü is always written:

  • 女 nǚ (woman) vs 怒 nù (anger)
  • 旅 lǚ (travel) vs 路 lù (road)

2. Apostrophe for syllable boundaries

When a syllable beginning with a vowel (a, o, e) follows another syllable, an apostrophe separates them to avoid ambiguity:

  • 西安 Xī'ān (Xi'an city) — not "xian" which would be read as one syllable
  • 天鹅 tiān'é (swan)
  • 皮袄 pí'ǎo (fur coat)

3. iou, uei, uen shortening

When these finals follow an initial, they drop the middle vowel in spelling:

  • iou → iu: 六 liù, 丢 diū
  • uei → ui: 对 duì, 水 shuǐ
  • uen → un: 论 lùn, 春 chūn

The full vowels (o, e) are always present in pronunciation but omitted in spelling.

4. e vs ê

The final e in syllables like ge, ke, he is the unrounded /ɤ/ vowel. In finals like ie and üe, the e is actually /ɛ/ (like "eh"). Pinyin doesn't distinguish these — context makes it clear.


Tone Mark Placement Rules

In any pinyin syllable, the tone mark goes on exactly one vowel letter. The rule is:

Priority order: a > o > e > i/u (last in sequence) > ü

  1. If the final contains a or e, mark it: tā, zhè
  2. If the final contains ou, mark the o: gōu, chóu
  3. Otherwise, mark the last vowel in the final: guī → ī, liú → ú, xiōng → ō

Examples:

  • 妈 mā — only a, mark it
  • 贵 guì — ui, mark the last: i → guì
  • 走 zǒu — ou, mark o → zǒu
  • 吃 chī — only i, mark it

For ü with a tone mark: ǖ ǘ ǚ ǜ (rare; u-umlaut with tone diacritics stacked).


Common Mistakes for English Speakers

Mistake Explanation Fix
Pronouncing b/d/g as English voiced sounds Mandarin b/d/g are voiceless unaspirated Say "spy", "sty", "sky" — the b/d/g sound in those
Confusing j/q/x with zh/ch/sh Completely different articulation points j/q/x: tongue forward; zh/ch/sh: tongue curled back
Saying x like English "sh" Mandarin x is fronter and thinner Spread lips wide and make an "s"-like hiss further forward
Pronouncing r like English "r" Mandarin r is retroflex and voiced Curl tongue back; some say it's between "r" and "zh"
Mispronouncing final -e It's /ɤ/, not English "eh" Unround lips, pull tongue back — "uh" with spread lips
Confusing -n and -ng English speakers often merge these For -ng, do NOT close — leave mouth slightly open
Ignoring ü vs. u after n/l 女 nǚ ≠ 怒 nù Round lips for ü — it sounds different from u