Yale romanization of Cantonese

Traditional Chinese 耶魯
Simplified Chinese 耶鲁
Cantonese Yale Yèh-lóuh

The Yale romanization of Cantonese was developed by Gerard P. Kok for his and Parker Po-fei Huang's textbook Speak Cantonese, which was initially circulated in looseleaf form in 1952 [1] and later published in 1958. [2] Unlike the Yale romanization of Mandarin, it is still widely used in books and dictionaries, especially those aimed at foreign learners of Cantonese. Like Hanyu Pinyin, it represents unvoiced, unaspirated consonants with letters that English and most other European languages traditionally use for voiced sounds. For example, [p] is written b in Yale, whereas its aspirated counterpart, [pʰ], is written p. [3] Students attending The Chinese University of Hong Kong's New-Asia Yale-in-China Chinese Language Center are taught using Yale romanization. [4]


Although Yale was originally designed as a romanization scheme for indicating pronunciation, some enthusiasts use it to explore writing Cantonese alphabetically, in effect elevating it from an assistive notation to a written language.


Initial (Yale)   IPA
b                [p]
p                [pʰ]
m                [m]
f                [f]
d                [t]
t                [tʰ]
n                [n]
l                [l]
g                [k]
k                [kʰ]
ng               [ŋ]
h                [h]
w                [w]
j                [ts]
ch               [tsʰ]
s                [s]
y                [j]
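The convention described in the introduction, in which unaspirated stops and affricates take the "voiced" letters and their aspirated counterparts take the "voiceless" ones, can be sketched as a small lookup table. This is only an illustration: the names `YALE_INITIALS` and `yale_initial` are hypothetical, and the table covers only the initials listed above.

```python
# Illustrative mapping from IPA initials to Yale letters.
# YALE_INITIALS and yale_initial are hypothetical names used for this sketch.
YALE_INITIALS = {
    # unaspirated -> "voiced" letter, aspirated -> "voiceless" letter
    "p": "b", "pʰ": "p",
    "t": "d", "tʰ": "t",
    "k": "g", "kʰ": "k",
    "ts": "j", "tsʰ": "ch",
    # remaining initials
    "m": "m", "f": "f", "n": "n", "l": "l",
    "ŋ": "ng", "h": "h", "w": "w", "s": "s", "j": "y",
}


def yale_initial(ipa):
    """Return the Yale spelling of an IPA initial such as '[pʰ]' or 'ts'."""
    key = ipa.strip("[] ")  # tolerate bracketed input like '[ p ]'
    return YALE_INITIALS[key]


print(yale_initial("[p]"), yale_initial("[pʰ]"))
```

Keeping the aspirated and unaspirated members of each pair side by side makes the pattern visible: the letter never signals voicing, only aspiration.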


Final (Yale)   IPA
aa             [aː]
e              [ɛː]
i              [iː]
o              [ɔː]
u              [uː]
eu             [œː]
yu             [yː]
m              [m̩]
ng             [ŋ̩]


Graphical representation of the tones of six-tone Cantonese.

Modern Cantonese has up to seven phonemic tones. Cantonese Yale represents these tones using a combination of diacritics and the letter h. [5] [6] Traditional Chinese linguistics treats the tones in syllables ending with a stop consonant as separate "entering tones". Cantonese Yale follows modern linguistic conventions in treating these the same as the high-flat, mid-flat and low-flat tones, respectively.

No.   Description    IPA   Chao tone numbers   Yale representation
1     high-flat      ˥     55                  sīn, sīk
1     high-falling   ˥˨    52                  sìn
2     mid-rising     ˨˥    25                  sín
3     mid-flat       ˧     33                  si, sin, sik
4     low-falling    ˨˩    21                  sìh, sìhn
5     low-rising     ˨˧    23                  síh, síhn
6     low-flat       ˨     22                  sih, sihn, sihk
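The rule behind the tone table (a diacritic for the contour, plus an h after the vowel for the three low tones) can be sketched in code. The following is a minimal illustration, not a reference implementation: the function name `yale_tone` and the regular expression used to find the low-register h are assumptions made for this example, and the sketch handles single lowercase or capitalized syllables without punctuation.

```python
import re
import unicodedata

MACRON, GRAVE, ACUTE = "\u0304", "\u0300", "\u0301"

# (diacritic, has low-register h) -> (tone number, description)
TONES = {
    (MACRON, False): (1, "high-flat"),
    (GRAVE, False): (1, "high-falling"),
    (ACUTE, False): (2, "mid-rising"),
    ("", False): (3, "mid-flat"),
    (GRAVE, True): (4, "low-falling"),
    (ACUTE, True): (5, "low-rising"),
    ("", True): (6, "low-flat"),
}

# Assumption for this sketch: the low-register h follows the vowel letters
# (optionally before a final m, n, ng, p, t or k) or a syllabic m / ng.
_LOW_H = re.compile(r"(?:[^aeiou]*[aeiou]+h(?:ng|[mnptk])?|mh|ngh)")


def yale_tone(syllable):
    """Return (tone number, description) for one Yale syllable."""
    decomposed = unicodedata.normalize("NFD", syllable.lower())
    # Separate combining diacritics from the plain letters.
    marks = "".join(c for c in decomposed if unicodedata.combining(c))
    base = "".join(c for c in decomposed if not unicodedata.combining(c))
    low = _LOW_H.fullmatch(base) is not None
    try:
        return TONES[(marks, low)]
    except KeyError:
        raise ValueError(f"not a valid Yale tone marking: {syllable!r}") from None


print([yale_tone(s)[0] for s in "Chēun mìhn bāt gok híu".split()])
# [1, 4, 1, 3, 2]
```

An initial h, as in hóu, is not mistaken for the low-register marker because the pattern only accepts an h that comes after the vowel letters.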


Traditional   Simplified   Romanization
你好          你好         Néih hóu

Sample transcription of one of the 300 Tang Poems by Meng Haoran:

Chēun híu
Maahng Houh-yìhn
春眠不覺曉,  Chēun mìhn bāt gok híu,
處處聞啼鳥。  chyu chyu màhn tàih níuh.
夜來風雨聲,  yeh lòih fūng yúh sīng,
花落知多少?  fā lohk jī dō síu?


  1. Huang, Parker Po-fei (1965). Cantonese Sounds and Tones. New Haven, CT: Far Eastern Publications, Yale University. Foreword.
  2. The Routledge Encyclopedia of the Chinese Language, p. 40.
  3. "Cantonese". Omniglot. Retrieved 2016-10-27.
  4. "CUHK Teaching Materials". Retrieved 2016-10-27.
  5. Ng Lam & Chik 2000: 515. "Appendix 3: Tones. The student of Cantonese will be well aware of the importance of tones in conveying meaning. Basically, there are seven tones which, in the Yale system, are represented by the use of diacritics and by the insertion of h for ..."
  6. Gwaan 2000: 7. "Basically, there are seven tones which, in the Yale system, are represented by the use of diacritics and by the insertion of h for the three low tones. The following chart will illustrate the seven tones: 3 Mid Level, 1 High Level, 5 Low Falling, 6 Low Level..."

Further reading