Sundanese language

Basa Sunda
ᮘᮞ ᮞᮥᮔ᮪ᮓ
بَاسَا سُوْندَا
Word Sunda in Sundanese script
Word "Sunda" written in Standard Sundanese script
Pronunciation [basa sʊnda]
Native to Java, Indonesia
Region West Java, Banten, Jakarta, parts of western Central Java, southern Lampung, also spoken by the Sundanese diaspora all over Indonesia, some were recorded in Australia[1] and throughout the world.
Native speakers
42 million (2016)[2]
Early form
Standard forms
Latin script (present)
Sundanese script (present; optional)
Old Sundanese script (14-18th centuries AD, present; optional)
Sundanese Cacarakan script (17-19th centuries AD, present; certain areas)
Sundanese Pégon script (17-20th centuries AD, present; religious use only)
Buda Script (13-15th centuries AD, present; optional)
Kawi script (historical)
Pallava (historical)
Pranagari (historical)
Vatteluttu (historical)
Official status
Official language in
Banten Banten (regional)
West Java West Java (regional)
Recognised minority
language in
Indonesia (Jakarta,[3] Lampung,[4] Central Java,[5] and another provinces in Indonesia.[6]
Regulated by Lembaga Basa Jeung Sastra Sunda
Language codes
ISO 639-1 su
ISO 639-2 sun
ISO 639-3 Variously:
sun – Sundanese
bac – Baduy Sundanese
osn – Old Sundanese
Glottolog sund1251
Linguasphere 31-MFN-a
Sundanese language distribution map.svg
  Areas where Sundanese is a majority native language
  Areas where Sundanese is a minority language
This article contains IPA phonetic symbols. Without proper rendering support, you may see question marks, boxes, or other symbols instead of Unicode characters. For an introductory guide on IPA symbols, see Help:IPA.
A Sundanese lontar manuscript written in Sundanese script.
The first page from the manuscript of Carita Waruga Guru which uses the Old Sundanese script and Old Sundanese language.
Aksara Sunda (Sundanese script)
A Sundanese speaker, recorded in Indonesia.

Sundanese (/sʌndəˈnz/:[7] Basa Sunda, Sundanese pronunciation: [basa sʊnda]; Sundanese script: ᮘᮞ ᮞᮥᮔ᮪ᮓ; Pegon script: بَاسَا سُوْندَا) is a Malayo-Polynesian language spoken by the Sundanese. It has approximately 40 million native speakers in the western third of Java; they represent about 15% of Indonesia's total population.


According to American linguist Robert Blust, Sundanese is closely related to the Malayic languages, as well as to language groups spoken in Borneo such as the Land Dayak languages or the Kayan–Murik languages, based on high lexical similarities between these languages.[8][9] It is more distantly related to Madurese and Javanese.

History and Distribution

Sundanese is mainly spoken on the west side of the island of Java, in an area known as Tatar Sunda (Pasundan). However, Sundanese is also spoken in the western part of Central Java, especially in Brebes and Cilacap Regency, because these areas were previously under the control of the Galuh Kingdom. Many place names in Cilacap are still Sundanese names and not Javanese names such as Dayeuhluhur, Cimanggu, Cipari and so on.

Until 1600 AD, Sundanese was the state language in the kingdoms of Salakanagara, Tarumanagara, Sunda, Galuh, and Pajajaran. During this period, Sundanese was heavily influenced by the Sanskrit language as seen in the Batu Tapak Kaki Kiri Nyoreang inscription at the time of King Purnawarman, using the Pallava script. Sundanese at that time was used in the fields of state, art, and daily life, many religious books were written in Sundanese and used Old Sundanese script such as the Sanghyang Siksa Kandang Karesian Manuscript, Carita Parahyangan, Amanat Galunggung, and Guru Talapakan.

In addition, according to some Sundanese language experts until around the 6th century, the area of speech reached around the Dieng Plateau in Central Java, based on the name "Dieng" which is considered the name Sundanese (from the origin of the word dihyang which is an Old Sundanese word). Along with transmigration and immigration carried out by the Sundanese ethnics, speakers of this language have spread beyond the island of Java. For example, in Lampung, South Sumatra, Jambi, Riau, West Kalimantan, Southeast Sulawesi and even outside the country of Indonesia, such as Taiwan, Japan, Australia and other countries, a significant number of ethnic Sundanese live in areas outside the Pasundan.


Sundanese has several dialects, conventionally described according to the locations of the people:

The Priangan dialect, which covers the largest area where Sundanese people lives (Parahyangan in Sundanese), is the most widely spoken type of Sundanese language, taught in elementary till senior-high schools (equivalent to twelfth-year school grade) in West Java and Banten Province.


The language has been written in different writing systems throughout history. The earliest attested documents of the Sundanese language were written in the Old Sundanese script (Aksara Sunda Kuno). After the arrival of Islam, the Pegon script is also used, usually for religious purposes. The Latin script then began to be used after the arrival of Europeans. In modern times, most of Sundanese literature is written in Latin. The regional government of West Java and Banten are currently promoting the use of Standard Sundanese script (Aksara Sunda Baku) in public places and road signs. The Pegon script is still used mostly by pesantrens (Islamic boarding school) in West Java and Banten or in Sundanese Islamic literature.[10]


Sundanese orthography is highly phonemic (see also Sundanese script).


There are seven vowels: a /a/, é /ɛ/, i /i/, o /ɔ/, u /u/, e /ə/, and eu /ɨ/.[11]


According to Müller-Gotama (2001) there are 18 consonants in the Sundanese phonology: /b/, /tʃ/, /d/, /ɡ/, /h/, /dʒ/, /k/, /l/, /m/, /n/, /p/, /r/, /s/, /ŋ/, /t/, /ɲ/, /w/, /j/; however, influences from foreign languages have introduced several additional consonants such as /f/, /v/, /z/ (as in fonem, qur'an, xerox, zakat). The consonantal phonemes are transcribed with the letters p, b, t, d, k, g, c (pronounced /tʃ/), j /d͡ʒ/, h, ng (/ŋ/), ny /ɲ/, m, n, s /s/, w, l, r /r~ɾ/, and y /j/. Other consonants that originally appear in Indonesian loanwords are mostly transferred into native consonants: f → p, v → p, sy → s, sh → s, z → j, and kh /x/ → h.

Ephentetic semivowels /w/ and /j/ are inserted after a high vowel immediately followed by another vowel, as in the words:

  • kuéh - /kuwɛh/
  • muih - /muwih/
  • béar - /bejar/
  • miang - /mijaŋ/


Sundanese has an elaborate system of register distinguishing levels of formality.[12] At the beginning of speech level development, it was known 6 levels of Sundanese language: basa kasar (rough), sedeng (medium), lemes (polite), lemes pisan (very polite), kasar pisan (very rough), and basa panengah (intermediate). But since the 1988 Congress of Sundanese Language in Bogor, the speech level has been narrowed to only two parts: basa hormat (respectful) and basa loma (fair). Besides that, the term was changed to "tatakrama basa" (language manners), although the substance remained the same. The hormat variant is a subtle language to respect, while the loma variant is fair, neutral and familiar use. This variety of loma language is then used as a kind of "standard" variety of written languages in Sundanese society. Sundanese magazines, newspapers, literary books and theses, mostly using the loma variant. Apart from the two previous levels, there is actually one more lowest level, namely cohag (rough). This level is only used when angry or just to show intimacy between speakers.

This register can only be found in the Sundanese Priangan dialect, while other dialects such as Bantenese Language, generally do not recognize this register.

For many words, there are distinct loma and lemes forms, e.g. arék (loma) vs. badé (lemes) "want", maca (loma) vs. maos (lemes) "read". In the lemes level, some words further distinguish humble and respectful forms, the former being used to refer to oneself, and the latter for the addressee and third persons, e.g. rorompok "(my own) house" vs. bumi "(your or someone else's) house" (the loma form is imah).

Similar systems of speech levels are found in Javanese, Madurese, Balinese and Sasak.

Basic vocabulary

English Hormat / Lemes Loma Cohag
i abdi (informal)

simkuring (formal)

urang (informal)

kuring (formal)

kami (non-formal,

expressing speaker's superiority)

you anjeun

hidep (for younger)



he/she mantenna (to be respected)


manéhna inyana
we abdi sadayana (informal)

sim kuring sadayana (formal)

kuring saréréa aing kabéhan
you aranjeun

hidep saréréa (for younger)

maranéh saria, sararia
English Hormat / Lemes Loma Cohag
for other for myself
eat tuang neda dahar nyatu, hakan, lebok
stomach patuangan, lambut padaharan beuteung gegembung
face pameunteu raray beungeut bebenguk
healthy damang pangéstu, pangésto cageur waras
can iasa tiasa bisa becus, belul, begug
go angkat, jengkar mios miang, indit jangkor
sleep kulem mondok saré héés, molor
see tingali tingal tempo deuleu, bintrang
speak nyarios, nyaur nyanggem nyarita, ngomong ngabangus, ngacapluk
know uninga terang apal nyaho
quiet linggih matuh, mindel cicing jedog
come sumping dongkap datang pucunghul
go home mulih wangsul balik, mulang mantog
home or house bumi rorompok imah gogobrog
family kulawargi kulawarga baraya bondoroyot
father tuang rama pun bapa bapa babéh
mother tuang ibu pun biang biang, ema indung
bring candak bantun bawa banjut
have or has kagungan gaduh boga gableg, gadur
dead palastra, pupus ngantunkeun, nilar maot, paéh modar, kojor, jeger
no! or I don't want it! teu kersa alim, narah embung teu sudi
want palay hoyong hayang
yes sumuhun, muhun enya heueuh
wake up gugah hudang
play ameng, anjang ulin
here (palih) dieu (beulah) dieu
there (palih) ditu (beulah) ditu
there is (there are) nyondong aya
there is no (there are no) teu aya euweuh


Numeral Sundanese script Sundanese
1 || hiji
2 || dua
3 || tilu
4 || opat
5 || lima
6 || genep
7 || tujuh
8 || dalapan
9 || salapan
10 |᮱᮰| sapuluh
11 |᮱᮱| sabelas
12 |᮱᮲| dua belas
20 |᮲᮰| dua puluh
21 |᮲᮱| dua puluh hiji
30 |᮳᮰| tilu puluh
31 |᮳᮱| tilu puluh hiji
40 |᮴᮰| opat puluh
50 |᮵᮰| lima puluh
60 |᮶᮰| genep puluh
70 |᮷᮰| tujuh puluh
80 |᮸᮰| dalapan puluh
90 |᮹᮰| salapan puluh
100 |᮱᮰᮰| saratus
hundreds ratusan
1000 |᮱᮰᮰᮰| sarébu
thousands rébu


Root word

English Sundanese
eat dahar tuang (for other)
neda (for myself)
drink inum leueut
write tulis serat
read maca maos
forget poho lali (for other)

hilap (for myself)

remember inget émut
sit diuk linggih (for other)

calik (for myself)

standing nangtung ngadeg
walk leumpang nyacat

Other Austronesian languages commonly use reduplication to create plural forms. However, Sundanese inserts the ar infix into the stem word. If the stem word starts with l, or contains r following the infix, the infix ar becomes al. Also, as with other Sundanese infixes (such as um), if the word starts with vowel, the infix becomes a prefix. Examples:

  1. Mangga A, tarahuna haneut kénéh. "Please sir, the bean curds are still warm/hot." The plural form of tahu 'bean curd, tofu' is formed by infixing ar after the initial consonant.
  2. Barudak leutik lalumpatan. "Small children running around." Barudak "children" is formed from budak (child) with the ar infix; in lumpat (run) the ar infix becomes al because lumpat starts with l.
  3. Ieu kaén batik aralus sadayana. "All of these batik clothes are beautiful." Formed from alus (nice, beautiful, good) with the infix ar that becomes a prefix because alus starts with a vowel. It denotes the adjective "beautiful" for the plural subject/noun (batik clothes).
  4. Siswa sakola éta mah balageur. "The students of that school are well-behaved." Formed from bageur ("good-behaving, nice, polite, helpful") with the infix ar, which becomes al because of r in the root, to denote the adjective "well-behaved" for plural students.

However, it is reported that this use of al instead of ar (as illustrated in (4) above) does not to occur if the 'r' is in onset of a neighbouring syllable. For example, the plural form of the adjective curiga (suspicious) is caruriga and not *caluriga, because the 'r' in the root occurs at the start of the following syllable.[13]

The prefix can be reduplicated to denote very-, or the plural of groups. For example, "bararudak" denotes many, many children or many groups of children (budak is child in Sundanese). Another example, "balalageur" denotes plural adjective of "very well-behaved".

Active form

Most active forms of Sundanese verbs are identical to the root, as with diuk "sit" or dahar "eat". Some others depend on the initial phoneme in the root:

  1. Initial /d/, /b/, /f/, /ɡ/, /h/, /j/, /l/, /r/, /w/, /z/ can be put after prefix nga like in ngadahar.
  2. Initial /i/, /e/, /u/, /a/, /o/ can be put after prefix ng like in nginum "drink".



  • Abdi teu acan neda. "I have not eaten yet."
  • Buku abdi mah sanés nu ieu. "My book is not this one."


  • Urang acan dahar. "I have not eaten yet."
  • Buku urang mah lain nu ieu. "My book is not this one."


Dupi (for polite situation)/Ari (for formal situation)-(question)



  • Dupi Tuang Rama nyondong di bumi? "Is your father at home?"
  • Dupi bumi di palih mana? "Where do you live?"


  • Ari Bapa aya di imah? "Is your father at home?"
  • Ari imah di beulah mana? "Where do you live?"


English Sundanese
what naon apa
who saha siapa
whose/whom nu saha kagungan saha punya siapa
where (di) mana (di) manten (di) mana
when iraha kapan
why naha, kunaon kenapa
how kumaha bagaimana
how many sabaraha berapa

Passive form


  • Buku dibantun ku abdi. "The book is brought by me." Dibantun is the passive form ngabantun "bring".
  • Pulpén ditambut ku abdi. "The pen is borrowed by me."
  • Soal ieu dipidamel ku abdi. "This problem is done by me."
  • Kacasoca dianggo ku abdi. "Glasses worn by me."


  • Buku dibawa ku urang. "The book is brought by me." Dibawa is the passive form mawa "bring".
  • Pulpén diinjeum ku urang. "The pen is borrowed by me."
  • Soal ieu digawékeun ku urang. "This problem is done by me."
  • Tasma dipaké ku urang. "Glasses worn by me."



teuas (hard), tiis (cool for water and solid objects), tiris (cool for air), hipu (soft), lada (hot/spicy, usually for foods), haneut (warm), etc.


Sundanese has three generic prepositions for spatial expressions:[14]

  • di: 'in', 'at' etc., indicating position
  • dina/na: 'on', 'at' etc., indicating specific position
  • ka: 'to', indicating direction
  • kana: 'to', indicating specific direction
  • ti: 'from', indicating origin
  • tina: 'from', indicating specific origin

To express more specific spatial relations (like 'inside', 'under' etc.), these prepositions have been combined with locative nouns:[15]

Formal Polite Gloss
di jero di lebet inside
di luar di luar outside
di gigir di gédéng beside
di luhur di luhur above
di handap di handap below
di tukang di pengker behind
di hareup di payun in front

Di gigir/luhur/handap/tukang/hareup (also ka gigir, ti gigir etc.) are absolute adverial expressions without a following noun. To express relative position, they have to add the suffix -eun, e.g.:


  • di luhur bumi – 'on top of the house'
  • dina luhur lomari – 'on top of the cupboard'
  • ti pengker bumi – 'from behind the house'
  • tina pengker lomari – 'from behind the cupboard'


  • di luhureun imah – 'on top of the house'
  • dina luhureun lomari – 'on top of the cupboard'
  • ti tukangeun imah – 'from behind the house'
  • tina tukangeun lomari – 'from behind the cupboard'

Di jero, di luar and the polite forms luhur & pengker can be used both with and without a following noun.

English Sundanese
before saacan/saméméh sateuacan
after sanggeus saparantos
during basa nalika
past baheula kapungkur
English Sundanese
from tina/ti tina
for jang, paragi kanggo/kanggé

See also


  1. ^ "Sundanese language survival among Indonesian diaspora families in Melbourne, Australia". 2015.
  2. ^ Muamar, Aam (2016-08-08). "Mempertahankan Eksistensi Bahasa Sunda" [Maintaining the existence of Sundanese Language]. Pikiran Rakyat (in Indonesian). Archived from the original on 22 June 2019. Retrieved 2018-09-27.
  3. ^ "Bahasa di Provinsi DKI Jakarta" [Language in DKI Jakarta Province]. Kemdikbud.
  4. ^ "Bahasa di Provinsi Lampung" [Language in Lampung Province]. Kemdikbud.
  5. ^ "Bahasa di Provinsi Jawa Tengah" [Language in Central Java Province]. Kemdikbud.
  7. ^ Bauer, Laurie (2007). The Linguistics Student's Handbook. Edinburgh: Edinburgh University Press.
  8. ^ Blust 2010.
  9. ^ Blust 2013.
  10. ^ Rosidi, Ajip (2010). Mengenang hidup orang lain: sejumlah obituari (in Indonesian). Kepustakaan Populer Gramedia. ISBN 9789799102225.
  11. ^ Müller-Gotama, Franz (2001). Sundanese. Languages of the World. Materials. 369. Munich: LINCOM Europa.
  12. ^ Anderson, E. A. (1997). "The use of speech levels in Sundanese". In Clark, M. (ed.). Papers in Southeast Asian Linguistics No. 16. Canberra: Pacific Linguistics. pp. 1–45. doi:10.15144/PL-A90.1.
  13. ^ Bennett, Wm G. (2015). The Phonology of Consonants: Harmony, Dissimilation, and Correspondence. Cambridge: Cambridge University Press. p. 132.
  14. ^ Hardjadibrata (1985), p. 30.
  15. ^ Hardjadibrata (1985), p. 72–74.


Hardjadibrata, R.R. (1985). Sundanese: A Syntactical Analysis. Pacific Linguistics. Canberra: Australian National University. doi:10.15144/PL-D65.

Further reading

  • Rigg, Jonathan (1862). A Dictionary of the Sunda Language of Java. Batavia: Lange & Co.
  • S. Coolsma (1985). Tata Bahasa Sunda. Jakarta: Djambatan.
  • Blust, Robert (2010). "The Greater North Borneo Hypothesis". Oceanic Linguistics. University of Hawai'i Press. 49 (1): 44–118. doi:10.1353/ol.0.0060. JSTOR 40783586. S2CID 145459318.
  • Blust, Robert (2013). The Austronesian languages. Asia-Pacific Linguistics 8 (revised ed.). Canberra: Asia-Pacific Linguistics, Research School of Pacific and Asian Studies, The Australian National University. hdl:1885/10191. ISBN 9781922185075.

External links