Khmer
Khmer (Cambodian) is the official language of Cambodia. It forms its own Khmeric branch in the Austroasiatic language family, making it (somewhat distantly) related to Vietnamese, Mon, Khmu, Wa, Palaung, Khasi, Santali, Mundari, and other languages.
Within Cambodia, Central Khmer is the basis for the standard language. Northern Khmer is spoken north of the border in the southern part of Thailand's Isan region. Southern Khmer is spoken south of the border in Vietnam's Mekong Delta. Western Khmer is spoken in the Cardamom Mountains in western Cambodia. The dialect of Phnom Penh, the capital of Cambodia, also has a number of unique colloquial features that distinguish it from the standard language.
Khmer is written in the Khmer script, which is ultimately derived from the Indic Brahmi script through the southern branch. As such, it is useful for learners to learn some fundamental principles underlying the typical Indic script system.
Consonants
Consonants in the Khmer script are organized into two series, the Ą Series and the Ǫ Series. The name refers to the vowel that the consonant takes as its inherent vowel, with Ą Series consonants taking /ɑː/ ( ~ /ɒː/) by default Ǫ Series consonants taking /ɔː/ by default.
Consonant | Subscript | Ancestral Indic | Modern IPA | Vowel Series | Lexx Rom |
---|---|---|---|---|---|
ក | ្ក | k | k | kɑː | kąą |
ខ | ្ខ | kʰ | kʰ | kʰɑː | khąą |
គ | ្គ | ɡ | k | kɔː | kǫǫ |
ឃ | ្ឃ | ɡʱ | kʰ | kʰɔː | khǫǫ |
ង | ្ង | ŋ | ŋ | ŋɔː | ngǫǫ |
ច | ្ច | tʃ | c | cɑː | cąą |
ឆ | ្ឆ | tʃʰ | cʰ | cʰɑː | chąą |
ជ | ្ជ | dʒ | c | cɔː | cǫǫ |
ឈ | ្ឈ | dʒʱ | cʰ | cʰɔː | chǫǫ |
ញ | ្ញ | ɲ | ɲ | ɲɔː | ñǫǫ |
ដ | ្ដ | ʈ | ɗ | ɗɑː | dąą |
ឋ | ្ឋ | ʈʰ | tʰ | tʰɑː | thąą |
ឌ | ្ឌ | ɖ | ɗ | ɗɔː | dǫǫ |
ឍ | ្ឍ | ɖʱ | tʰ | tʰɔː | thǫǫ |
ណ | ្ណ | ɳ | n | nɑː | nąą |
ត | ្ត | t̪ | t | tɑː | tąą |
ថ | ្ថ | t̪ʰ | tʰ | tʰɑː | thąą |
ទ | ្ទ | d̪ | t | tɔː | tǫǫ |
ធ | ្ធ | d̪ʱ | tʰ | tʰɔː | thǫǫ |
ន | ្ន | n | n | nɔː | nǫǫ |
ប | ្ប | p | ɓ | ɓɑː | bąą |
ផ | ្ផ | pʰ | pʰ | pʰɑː | phąą |
ព | ្ព | b | p | pɔː | pǫǫ |
ភ | ្ភ | bʱ | pʰ | pʰɔː | phǫǫ |
ម | ្ម | m | m | mɔː | mǫǫ |
យ | ្យ | j | j | jɔː | yǫǫ |
រ | ្រ | r | r | rɔː | rǫǫ |
ល | ្ល | l | l | lɔː | lǫǫ |
វ | ្វ | ʋ | ʋ | ʋɔː | vǫǫ |
ឝ | ្ឝ | ʃ | s | sɑː | sąą |
ឞ* | ្ឞ | ʂ | s | sɑː | sąą |
ស* | ្ស | s | s | sɑː | sąą |
ហ | ្ហ | h | h | hɑː | hąą |
ឡ | ្ឡ | ɭ | l | lɑː | ląą |
អ | ្អ | - | ʔ | ʔɑː | (')ąą |
Unlike Sanskrit and Pali, Khmer does not make a phonetic distinction between retroflex and dental consonants.
The two glottalized implosives /ɗ/ and /ɓ/ can vary with normal voice plosives /d/ and /b/.
វ can vary between /ʋ/ and /w/.
ឞ* and ស* are rare, used for transcribing Indic words.
The apostrophe used in Lexx Rom for glottal initial អ can be omitted at the beginning of the word, but should be used if within a word.
From the above, one can see that generally speaking, ancestrally voiceless initials induced Ą Series vowels, and ancestrally voiceless initials induced Ǫ Series vowels. However, there are exceptions, as Khmer reutilized some duplicate letters to serve as the alternate vowel series:
The unaspirated retroflex plosives ដ and ឌ are repurposed for /ɗ/ Ą Series and Ǫ Series.
The retroflex nasal ណ is repurposed for /n/ Ą Series to complement ន Ǫ Series.
The retroflex lateral ឡ is repurposed for /l/ Ą Series to complement ល Ǫ Series.
ប is used as a base letter for /ɓ/ in the Ą Series.
Even with the repurposing of these letters, there still remains some gaps where a consonant sound is missing one of the two complementary series. To meet this need, there are two superscript letters in the Khmer script ៉ and ៊ that can go on top of a base consonant letter to switch its vowel series to the other series. ៉ switches the vowel series to Ą Series, and ៊ switches the vowel series to Ǫ Series.
Series-Shifting Consonant Letters
Consonant | IPA | Vowel Series | Lexx Rom |
---|---|---|---|
ង៉ | ŋ | ŋɑː | ngąą |
ញ៉ | ɳ | ɳɑː | ñąą |
ប៉ | p | pɑː | pąą |
ប៊ | b | bɔː | bǫǫ |
ម៉ | m | mɑː | mąą |
យ៉ | j | jɑː | yąą |
រ៉ | r | rɑː | rąą |
វ៉ | ʋ | ʋɑː | vąą |
ស៊ | s | sɔː | sǫǫ |
ហ៊ | h | hɔː | hǫǫ |
អ៊ | ʔ | ʔɔː | (')ǫǫ |
Note that the letter ប can take both ៉ and ៊, due to it originally being an ancestral p in Indic as well as being repurposed in its base form to cover glottalized /ɓ/.
Some sonorant letters can also take an h in front of them to switch their vowel series from Ǫ Series to Ą Series, as an etymological spelling.
Consonant | Glyphs | IPA | Vowel Series | Lexx Rom |
---|---|---|---|---|
ហ្ន | h+n | n | nɑː | nąą |
ហ្ម | h+m | m | mɑː | mąą |
ហ្ល | h+l | l | lɑː | ląą |
ហ្វ | h+v | ʋ | ʋɑː | vąą |
There are also some other supplemental consonant letters that utilize an <h> letter in front of another letter to transcribe sounds that entered Khmer through modern loanwords.
Supplemental Consonant Letters
Consonant | Glyphs | IPA | Vowel Series | Lexx Rom |
---|---|---|---|---|
ហ្គ | h+g | ɡ | ɡɑː | gąą |
ហ្គ៊ | h+g+៊ | ɡ | ɡɔː | gǫǫ |
ហ្វ | h+v | f | fɑː | fąą |
ហ្វ៊ | h+v+៊ | f | fɔː | fǫǫ |
ហ្ស | h+s | z, ʒ | zɑː, ʒɑː | ząą, zhąą |
ហ្ស៊ | h+s+៊ | z, ʒ | zɔː, ʒɔː | zǫǫ, zhǫǫ |
Note that the usage of ហ្វ for transcribing /f/ means that this glyph does double duty, being used for /ʋɑː/ originally, but in modern loanwords (from French, English, etc.) indicating /fɑː/ instead.
Rimes
For every rime diacritic, learners must remember the vowel separately for the Ą Series consonants and the Ǫ Series consonants (or when there is no difference). As mentioned above, with no rime diacritic, the default rime for the Ą Series consonants is /ɑː/, and the default rime for the Ǫ Series consonants is /ɔː/.
Basic Rimes
Khmer Script | Ą Series IPA | Ą Series Lexx Rom | Ǫ Series IPA | Ǫ Series Lexx Rom |
---|---|---|---|---|
◌ | ɑː | ąą | ɔː | ǫǫ |
ា | aː | aa | iːə | iia̛ |
ិ | e | ei | i | i |
ី | əj | ơi | iː | ii |
ឹ | ə | ơ | ɨ | ư |
ឺ | əɨ | ơư | ɨː | ưư |
ុ | o | o | u | u |
ូ | ou | ou | uː | uu |
ួ | uə | ua̛ | = | = |
ើ | aə | aa̛ | əː | ơơ |
ឿ | ɨə | ưa̛ | = | = |
ៀ | iə | ia̛ | = | = |
េ | ei | ei | eː | ee |
ែ | ae | ae | ɛː | ęę |
ៃ | aj | ai | ɨj | ưi |
ោ | ao | ao | oː | oo |
ៅ | aw | au | ɨw | ưu |
៏ | ɑː | ąą | = | = |
Khmer is very particular with its vowel distinctions, so learners should take the time to practice them with native speakers.
Rimes with Coda
Khmer Script | Ą Series IPA | Ą Series Lexx Rom | Ǫ Series IPA | Ǫ Series Lexx Rom |
---|---|---|---|---|
ុំ | om | om | um | um |
ំ | ɑm | ąm | um | um |
ាំ | am | am | ŏəm | oa̛m |
ាំង | aŋ | ang | eə̆ŋ | ea̛ng |
ះ | ah | ah | ĕəh | ea̛h |
ិះ | eh | eh | ih | ih |
ឹះ | əh | ơh | - | - |
ុះ | oh | oh | uh | uh |
េះ | eh | eh | ih | ih |
ើះ | əh | ơh - | - | |
ែះ | eh | eh - | - | |
ោះ | ɑh | ąh | ŭəh | ua̛h |
ៈ | aʔ | a' | eə̆ʔ | ea' |
ូវ | əw | ơu | ɨw | ưu |
័រ | ɔə | ǫa̛ | - | |
ិយ | iː | ii | - | - |
័យ | aj | ai | ɨj | ưi |
ា់ | eə̯, oə̯ | ea̛◌, oa̛◌ | - | - |
Typically, vowels with glottal stop/null initial consonant are attached to អ, but there are also independent vowel letters that are used mainly in spelling Indic words. The exact rime used for these letters can vary by word, so learners must learn them with each vocabulary item.
Independent Vowel Letters
Khmer Script | IPA | Lexx Rom |
---|---|---|
ឥ | ʔə, ʔɨ, ʔəj | ơ, ư, ơi |
ឦ | ʔəj | ơi |
ឧ | ʔo, ʔu, ʔao | o, u, ao |
ឨ* | ʔok | ok |
ឩ | ʔou, ʔuː | ou, uu |
ឪ | ʔəw | ơu |
ឫ | rɨ | rư |
ឬ | rɨː | rưư |
ឭ | lɨ | lư |
ឮ | lɨː | lưư |
ឯ | ʔae, ʔɛː, ʔeː | ae, ęę, ee |
ឰ | ʔaj | ai |
ឱ, ឲ | ʔao | ao |
ឳ | ʔaw | au |
ឨ is no longer used in modern Khmer.