| This article contains Indic text. Without proper rendering support, you may see question marks or boxes, misplaced vowels or missing conjuncts instead of Indic text. |
| Bengali abugida | |
| Type | Abugida |
|---|---|
| Spoken languages | Bengali |
| Time period | 11th Century to the present[1] |
| Parent systems |
Proto-Sinaitic
|
| ISO 15924 | Beng |
| Note: This page may contain IPA phonetic symbols in Unicode. | |
|
||||||||||||||
| This article does not cite any references or sources. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed. (June 2009) |
The Bengali script (Bengali: বাংলা লিপি Bangla lipi) is a variant of the Eastern Nagari script also used for Assamese and Bishnupriya Manipuri, and also for Maithili (in a variation called Mithilakshar). The Eastern Nagari script belongs to the Brahmic family of scripts, along with the Devanagari script and other written systems of the Indian subcontinent. It is an abugida system of writing, although it is less blocky and presents a more sinuous shaping than Devanagari. Both Eastern Nagari and Devanagari were derived from the ancient Nagari script. In addition to differences in how the letters are pronounced in the different languages, there are some minor typographical differences between the version of the script used for Assamese and Bishnupriya Manipuri, and that used for Bengali and other languages. For example, the letter rô (Bengali র; Assamese ৰ; Bishnupriya Manipuri র/ৰ) and wô (Bengali second ব; Assamese/Bishnupriya Manipuri ৱ) have distinct variations depending on the language being written.
The Bengali script was originally not associated with any particular language, but was prevalent as the main script in the eastern regions of Medieval India. The script was originally used to write Sanskrit, which for centuries was the only written language of the Indian subcontinent in addition to Tamil. Epics of Hindu scripture, including the Mahabharata or Ramayana, were written in older versions of the Bengali script in this region. After the medieval period, the use of Sanskrit as the sole written language gave way to Pali, and eventually the vernacular languages we know now as Bengali and Assamese. Srimanta Sankardeva used it in the 15th and 16th centuries to compose his oeuvre in Assamese and Brajavali the language of the Bhakti poets. It was also used by the later Ahom kings to write the Buranjis, the Ahom chronicles, in the Assamese language. There is a rich legacy of Indian literature written in this script, which is still occasionally used to write Sanskrit today.
Clusters of consonants are represented by different and sometimes quite irregular characters; thus, learning to read the script is complicated by the sheer size of the full set of characters and character combinations, numbering about 500. While efforts at standardizing the script for the Bengali language continue in such notable centers as the Bangla Academies (unaffiliated) at Dhaka (Bangladesh) and Kolkata (West Bengal, India), it is still not quite uniform as yet, as many people continue to use various archaic forms of letters, resulting in concurrent forms for the same sounds. Among the various regional variations within this script, only the Assamese and Bengali variations exist today in the formalized system.
It seems likely that the standardization of the script will be greatly influenced by the need to typeset it on computers. The large alphabet can be represented, with a great deal of ingenuity, within the ASCII character set, omitting certain irregular conjuncts. Work has been underway since around 2001 to develop Unicode fonts, and it seems likely that it will split into two variants, traditional and modern.
In this and other articles on Wikipedia dealing with the Bengali language, a Romanization scheme used by linguists specializing in Bengali phonology is included along with IPA transcription.
A recent effort by the government of West Bengal focused on simplifying bengali spellings in primary school texts.
Contents |
The script presently has a total of 11 vowel letters, used to represent the seven main vowel sounds of Bengali, along with a number of vowel diphthongs. All of these are used in both Bengali and Assamese, the two main languages using the script. Some of the vowel letters have different sounds depending on the word, and a number of vowel distinctions preserved in the writing system are not pronounced as such in modern spoken Bengali or Assamese. For example, the Bengali script has two symbols for the vowel sound [i] and two symbols for the vowel sound [u]. This redundancy stems from the time when this script was used to write Sanskrit, a language that had a short [i] and a long [iː], and a short [u] and a long [uː]. These letters are preserved in the Bengali script with their traditional names of hrôshsho i (lit. 'short i') and dirgho i (lit. 'long i'), etc., despite the fact that they are no longer pronounced differently in ordinary speech.
Vowel signs can be used in conjunction with consonants to modify the pronunciation of the consonant (here exemplified by ক, kô). When no vowel is written, the vowel 'অ' (ô or o) is often assumed. To specifically denote the absence of a vowel, a hôshonto (্) may be written underneath the consonant.
This table reflects the modernized Bengali script system, which eliminates three of the traditional vowels, rii, li, and lii, traditionally placed between ri and e.
| Letter | Name of letter | Vowel sign with [kɔ] (ক) | Name of vowel sign | Transliteration | IPA |
|---|---|---|---|---|---|
| অ | shôro ô (shôre ô) |
(none) | kô and ko | /kɔ/ and /ko/ | |
| আ | shôro a (shôre a) |
কা | akar | ka | /ka/ |
| ই | hrôshsho i (hrôshsho i) |
কি | hrôshsho ikar (roshshikar) |
ki | /ki/ |
| ঈ | dirgho i | কী | dirgho ikar (dirghikar) |
kī | /kiː/ |
| hrôshsho u (rôshsho u) |
কু | hrôshsho ukar (roshshukar) |
ku | /ku/ | |
| ঊ | dirgho u | কূ | dirgho ukar (dirghukar) |
kū | /kuː/ |
| ঋ | ri | কৃ | rikar/rifôla | kri | /kri/ |
| এ | e | কে | ekar | kê and ke | /kæ/ and /ke/ |
| ঐ | oi | কৈ | oikar | koi | /koj/ |
| ও | o | কো | okar | ko | /ko/ |
| ঔ | ou | কৌ | oukar | kou | /kow/ |
| Symbol with [kɔ] (ক) | Name | Function | Transliteration | IPA |
|---|---|---|---|---|
| ক্ | hôshonto | Suppresses the inherent vowel | k | /k/ |
| কৎ | khônđo tô | Final unaspirated dental [t̪] (ত) | kôt | /kɔt̪/ |
| কং | ônushshôr | Final velar nasal | kôņ | /kɔŋ/ |
| কঃ | bishôrgo | Final voiceless breath | kôh | /kɔh/ |
| কঁ | chôndrobindu | Vowel nasalization | kôñ | /kɔ̃/ |
The names of the consonant letters in Bengali are typically just the consonant's main pronunciation plus the inherent vowel ô. Since the inherent vowel is assumed and not written, most letters' names look identical to the letter itself (e.g. the name of the letter ঘ is itself ঘ ghô). Some letters that have lost their distinctive pronunciation in Modern Bengali are called by a more elaborate name. For example, since the consonant phoneme /n/ can be written ন, ণ, or ঞ (depending on the spelling of the particular word), these letters are not simply called nô; instead, they are called দন্ত্য ন donto nô ("dental n"), মূর্ধন্য ণ murdhonno nô ("cerebral n"), and ঞীয়/ইঙ niô/ingô. Similarly, the phoneme /ʃ/ can be written as শ talobbo shô ("palatal s"), ষ murdhonno shô ("cerebral s"), or স donto shô ("dental s"), depending on the word. Since the consonant ঙ /ŋ/ cannot occur at the beginning of a word in Bengali, its name is not ঙ ngô but উঙ ungô (pronounced by some as উম umô or উঁঅ ũô). Similarly, since semivowels ([j], [w], [e̯], [o̯]) cannot occur at the beginning of a Bengali word, the name for "semi-vowel e̯" য় is not অন্তঃস্থ য় ôntostho e̯ô but অন্তঃস্থ অ ôntostho ô.
The "modern" Bengali alphabet eliminates some letters traditionally included in the table, particularly a second bo (ôntostho bô) that follows lo. This bo originally represented a v or w sound which merged with b (borgio bô) in Bengali and were represented with identical symbols occurring in two different places in the alphabet.
| Letter | Name of Letter | Transliteration | IPA |
|---|---|---|---|
| kô | k | /k/ | |
| খ | khô | kh | /kʰ/ |
| গ | gô | g | /ɡ/ |
| ঘ | ghô | gh | /ɡʱ/ |
| ঙ | ungô, umô | ņ | /ŋ/ |
| চ | chô | ch | /tʃ/ |
| ছ | chhô | chh | /tʃʰ/ |
| জ | borgio jô (burgijjô) |
j | /dʒ/ |
| ঝ | jhô | jh | /dʒʱ/ |
| ঞ | ingô, niô | n | /n/ |
| ট | ţô | ţ | /ʈ/ |
| ঠ | ţhô | ţh | /ʈʰ/ |
| ড | đô | đ | /ɖ/ |
| ঢ | đhô | đh | /ɖʱ/ |
| ণ | murdhonno nô (moddhennô) |
n | /n/ |
| ত | tô | t | /t̪/ |
| থ | thô | th | /t̪ʰ/ |
| দ | dô | d | /d̪/ |
| ধ | dhô | dh | /d̪ʱ/ |
| ন | donto nô (dontennô) |
n | /n/ |
| প | pô | p | /p/ |
| ফ | fô | f | /f/ |
| ব | bô | b | /b/ |
| ভ | bhô | bh | /bʱ/ |
| ম | mô | m | /m/ |
| য | ôntostho jô (ontostejô) |
j | /dʒ/ |
| র | (bôe shunno) rô | r | /ɾ/ |
| ল | lô | l | /l/ |
| শ | talobbo shô (taleboshshô) |
sh and s | /ʃ/ / /s/ |
| ষ | murdhonno shô (muddhennoshshô) (peţ kaţa shô) |
sh | /ʃ/ |
| স | donto shô (donteshshô) |
sh and s | /ʃ/ / /s/ |
| হ | hô | h | /h/ |
| য় | ôntostho ô (ontosteô) |
e and - | /e̯/ /- |
| ড় | đôe shunno/bindu ŗô | ŗ | /ɽ/ |
| ঢ় | đhôe shunno/bindu ŗô | ŗh | /ɽ/ |
Up to four consecutive consonants not separated by vowels can be orthographically represented as a "consonant conjunct" (Bengali: যুক্তাক্ষর juktakkhor or যুক্তবর্ণ juktobôrno). Typically, the first consonant in the conjunct is shown above and/or to the left of the following consonants. Many consonants appear in an abbreviated or compressed form when serving as part of a conjunct. Others simply take exceptional forms in conjuncts, bearing little or no resemblance to the base character.
Often, consonant conjuncts are not actually pronounced as would be implied by the pronunciation of the individual components. For example, adding ল lô underneath শ shô in Bengali creates the conjunct শ্ল, which is not pronounced shlô but slô in Bengali. Many conjuncts represent Sanskrit sounds that were lost thousands of years before modern Bengali was ever spoken, as in জ্ঞ, which is a combination of জ jô and ঞ niô, but is not pronounced jnô. Instead, it is pronounced ggõ in Bengali. Thus, as conjuncts often represent (combinations of) sounds that cannot be easily understood from the components, the following descriptions are concerned only with the construction of the conjunct, and not the resulting pronunciation. Thus, a variant of the IAST romanization scheme is used instead of the phonemic romanization used in other articles:
Some consonants fuse in such a way that one stroke of the first consonant serves as the a stroke of the next.
Some consonants are simply written closer to one another to indicate that they are in a conjunct together.
Some consonants are compressed (and often simplified) when appearing as the first member of a conjunct.
Some consonants are abbreviated when appearing in conjuncts, losing part of their basic shape.
Some consonants have forms that are used regularly, but only within conjuncts.
When serving as a vowel sign, উ u, ঊ ū, and ঋ ṛ take on many exceptional forms.
Conjuncts of three consonants also exist, and follow the same rules as above. Examples include স sô + ত tô +র rô = স্ত্র strô, ম mô + প pô + র rô = ম্প্র mprô, ঙ ŋô + ক kô + ষ ṣô = ঙ্ক্ষ ŋkṣô, জ jô + জ jô + ৱ wô = জ্জ্ব jjwô, ক kô + ষ ṣô + ম mô = ক্ষ্ম kṣmô. Theoretically, four-consonant conjuncts can also be created, as in র rô + স sô + ট ṭô + র rô = র্স্ট্র rsṭrô, but these are not found in real words.
The Bengali script has ten digits corresponding to what are called the Arabic numerals, which originally came from India. Bengali digits have no horizontal headstroke or "matra". When writing large numbers, commas are used to separate the thousand (হাজার hajar), the hundred thousand or lakh (লাখ lakh or লক্ষ lokkho), and the ten million or crore (কোটি koṭi or ক্রোড় kroṛ) units. For example, the English number 17,557,345 will be written in traditional Bengali as ১,৭৫,৫৭,৩৪৫ (এক কোটি, পাঁচাত্তর লাখ, সাতান্ন হাজার, তিন শ পঁয়তাল্লিশ êk koṭi pãchattor lakh, shatanno hajar, tin sho pôetallish, "one crore, seventy-five lakhs, fifty-seven thousand, three hundred forty-five").
| Arabic numerals | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 |
|---|---|---|---|---|---|---|---|---|---|---|
| Bengali numerals | ০ | ১ | ২ | ৩ | ৪ | ৫ | ৬ | ৭ | ৮ | ৯ |
| Bengali names | shunno | êk | dui | tin | char | pãch | chhôe | shat | aţ | nôe |
| শূন্য | এক | দুই | তিন | চার | পাঁচ | ছয় | সাত | আট | নয় |
The romanization of Bengali, or the representation of the Bengali language in the Latin script. While different standards for romanization have been proposed for Bengali, these have not been adopted with the degree of uniformity seen in languages such as Japanese or Sanskrit[1]. Most standardized Bengali romanizations are adapted from standards proposed for Indic languages, and these models are compared below.
The Romanization of a language written in a non-Roman script can be based on transliteration (orthographically accurate, i.e. the original spelling can be recovered) or transcription (phonetically accurate, i.e. the pronunciation can be reproduced). This distinction is important in Bengali as its orthography was adopted from Sanskrit, and ignores sound change processes of several millennia. To some degree, all writing systems differ from the way the language is pronounced, but this may be more extreme for languages like Bengali. For example, the three letters শ, ষ, and স had distinct pronunciations in Sanskrit, but over several centuries, the standard pronunciation of Bengali (usually modeled on the Nadia dialect), has lost these phonetic distinctions (all three are usually pronounced as IPA [ʃ]) while the spelling distinction nevertheless persists in orthography.
In written texts, it is easy to distinguish between homophones such as শাপ shap "curse" from সাপ shap "snake". Such a distinction could be particularly relevant in searching for the term in an encyclopedia, for example. However, the fact that the words sound identical means that they would be transcribed identically; thus, some important meaning distinctions cannot be rendered in a transcription model. Another issue with transcription systems is that cross-dialectal and cross-register differences are widespread, and thus the same word or lexeme may have many different transcriptions. Even simple words like মন "mind" may be pronounced "mon", "môn", or (in poetry) "mônô" (e.g. the Indian national anthem, Jana Gana Mana).
Often, different phonemes (meaningful sound differences) are represented by the same symbol or grapheme. Thus, the vowel এ can represent both [e] (এল elo [elo] "came"), or [ɛ] (এক êk [ɛk] "one"). Occasionally, words written in the same way (homographs) may have different pronunciations for differing meanings: মত can mean "opinion" (pronounced môt), or "similar to" (môto). Thus, some important phonemic distinctions cannot be rendered in a transliteration model. In addition, when representing a Bengali word to allow speakers of other languages to pronounce it easily, it may be better to use a transcription, which does not include the silent letters and other idiosyncrasies (e.g. স্বাস্থ্য shastho, spelled <swāsthya>, or অজ্ঞান ôggên, spelled <ajñāna>) that make Bengali orthography so complicated.
Comparisons of standard romanization schemes for Bengali are given in the table below. Two standards are commonly used for transliteration of Indic languages including Bengali. Many standards (e.g. NLK / ISO), use diacritic marks and permit case markings for proper nouns. Newer forms (e.g. Harvard-Kyoto) are more suited for ASCII-derivative keyboards, and use upper- and lower-case letters contrastively and forgo normal standards for English capitalization.
The following table includes examples of Bengali words Romanized using the various systems mentioned above.
| In orthography | Meaning | NLK | XHK | ITRANS | HK | Wiki | IPA |
|---|---|---|---|---|---|---|---|
| মন | mind | mana | mana | mana | mana | mon | [mon] |
| সাপ | snake | sāpa | sApa | saapa | sApa | shap | [ʃap] |
| শাপ | curse | śāpa | zApa | shaapa | zApa | shap | [ʃap] |
| মত | opinion | mata | mata | mata | mata | môt | [mɔt̪] |
| মত | like | mata | mata | mata | mata | môto | [mɔt̪o] |
| তেল | oil | tēla | tela | tela | tela | tel | [t̪el] |
| গেল | went | gēla | gela | gela | gela | gêlo | [ɡɛlo] |
| জ্বর | fever | jvara | jvara | jvara | jvara | jôr | [dʒɔr] |
| স্বাস্থ্য | health | svāsthya | svAsthya | svaasthya | svAsthya | shastho | [ʃast̪ʰo] |
| বাংলাদেশ | Bangladesh | bāṃlādēśa | bAMlAdeza | baa.mlaadesha | bAMlAdeza | bangladesh | [baŋlad̪eʃ] |
| ব্যঞ্জনধ্বনি | consonant | byañjanadhvani | byaJjanadhvani | bya~njanadhvani | byaJjanadhvani | bênjondhoni | [bɛndʒond̪ʱoni] |
| আত্মহত্যা | suicide | ātmahatyā | AtmahatyA | aatmahatyaa | AtmahatyA | attõhotta | [at̪ːõhot̪ːa] |
The IPA (International Phonetic Alphabet) transcription is provided in the rightmost column, representing the most common pronunciation of the glyph in Standard Colloquial Bengali, alongside the various romanizations described above.
|
|
|
The Unicode range for Bengali is U+0980 ... U+09FF.
| Bengali Unicode.org chart (PDF) |
||||||||||||||||
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
| U+098x | ঁ | ং | ঃ | অ | আ | ই | ঈ | উ | ঊ | ঋ | ঌ | এ | ||||
| U+099x | ঐ | ও | ঔ | ক | খ | গ | ঘ | ঙ | চ | ছ | জ | ঝ | ঞ | ট | ||
| U+09Ax | ঠ | ড | ঢ | ণ | ত | থ | দ | ধ | ন | প | ফ | ব | ভ | ম | য | |
| U+09Bx | র | ল | শ | ষ | স | হ | ় | ঽ | া | ি | ||||||
| U+09Cx | ী | ু | ূ | ৃ | ৄ | ে | ৈ | ো | ৌ | ্ | ৎ | |||||
| U+09Dx | ৗ | ড় | ঢ় | য় | ||||||||||||
| U+09Ex | ৠ | ৡ | ৢ | ৣ | ০ | ১ | ২ | ৩ | ৪ | ৫ | ৬ | ৭ | ৮ | ৯ | ||
| U+09Fx | ৰ | ৱ | ৲ | ৳ | ৴ | ৵ | ৶ | ৷ | ৸ | ৹ | ৺ | |||||
Bengali in Eastern Nagari script
Bengali in Romanization
Bengali in IPA
Gloss
Translation
The following is a sample text of script, from the song Jana Gana Mana (জন গণ মন Jôno Gôno Mono). The selection is a Bengali song, written in Shadhubhasha (সাধুভাষা) style. The song was later adopted as the national anthem of India. It was written by Rabindranath Tagore (রবীন্দ্রনাথ ঠাকুর Robindronath Ṭhakur) who is acknowledged as the single most important and defining figure of Bengali literature.
জনগণমন-অধিনায়ক জয় হে ভারতভাগ্যবিধাতা!
পঞ্জাব সিন্ধু গুজরাট মরাঠা দ্রাবিড় উত্কল বঙ্গ
বিন্ধ্য হিমাচল যমুনা গঙ্গা উচ্ছলজলধিতরঙ্গ
তব শুভ নামে জাগে, তব শুভ আশিস মাগে,
গাহে তব জয়গাথা।
জনগণমঙ্গলদায়ক জয় হে ভারতভাগ্যবিধাতা!
জয় হে, জয় হে, জয় হে, জয় জয় জয়, জয় হে ॥
In Romanization:
Jônogônomono-odhinaeoko jôeô he Bharotobhaggobidhata!
Pônjabo Shindhu Gujoraţo Môraţha Drabiŗo Utkôlo Bônggo,
Bindho Himachôlo Jomuna Gôngga Uchchhôlojôlodhitoronggo,
Tôbo shubho name jage, tôbo shubho ashish mage,
Gahe tôbo jôeogatha.
Jônogônomonggolodaeoko jôeô he Bharotobhaggobidhata!
Jôeo he, jôeo he, jôeo he, jôeo jôeo jôeo, jôeo he!
|
|||||||||||
|
|||||||||||
|
|||||||||||
stock | retire | vm
Why are we here?
All text is available under the terms of the GNU Free Documentation License
This page is cache of Wikipedia. History