Gaj's Latin Alphabet


Gaj's Latin alphabet is a variant of the Latin alphabet devised by Croat Ljudevit Gaj, in his 1830 book, Kratka osnova horvatsko-slavenskogo pravopisanja (A short primer of Croatian-Slavic orthography). It is the only script of the Croatian standard language in current use, and one of the two scripts of the standard Bosnian and Serbian languages (along with Cyrillic). A slightly modified version is used as the script for the Slovenian language.

The script was also one of two official scripts used for the Serbo-Croatian language prior to the demise of Yugoslavia. The alphabet is used for the Banat version of the Bulgarian language.

Contents

The alphabet

It consists of thirty upper and lowercase letters:

Letter IPA Letter IPA Letter IPA
A, a /a/ G, g /ɡ/ O, o /o/
B, b /b/ H, h /x/ P, p /p/
C, c /ts/ I, i /i/ R, r /r/
Č, č /tʃ/ J, j /j/ S, s /s/
Ć, ć /tɕ/ K, k /k/ Š, š /ʃ/
D, d /d/ L, l /l/ T, t /t/
, dž /dʒ/ Lj, lj /ʎ/ U, u /u/
Đ, đ /dʑ/ M, m /m/ V, v /ʋ/
E, e /e/ N, n /n/ Z, z /z/
F, f /f/ Nj, nj /ɲ/ Ž, ž /ʒ/

The original Gaj's alphabet contained a digraph ‹dj›, which was later replaced by the letter ‹đ›.

The letters do not have names, and consonants are normally pronounced as such when spelling is necessary (or followed by a short schwa, e.g. /fə/). In science and mathematics, only 26 letters of the basic modern Latin alphabet are used, and in those contexts they're named as in German alphabet, with the exception of ‹v› (ve) and ‹w› (dublve).

Digraphs

Note that the digraphs›, ‹lj›, and ‹nj› are considered to be single letters. This means that:

  • In dictionaries, njegov comes after novine, in a separate ‹nj› section after the end of the ‹n› section, and bolje comes after bolnica, and so forth.
  • In vertical writing (such as on signs), ‹dž›, ‹lj›, ‹nj› are nevertheless written horizontally, as a unit. For instance, if mjenjačnica ('Bureau de Change') is written vertically, ‹nj› appears on the fourth line (but note ‹m› and ‹j› appear separately on the first and second lines, respectively, because ‹mj› contains two letters, not one). In crossword puzzles, ‹dž›, ‹lj›, ‹nj› each occupy a single square.
M
J
E
NJ
A
Č
N
I
C
A


  • In cases where words are written with a space between each letter (such as on signs), each of these letters is written together. For instance: M J E NJ A Č N I C A.
  • In cases where only the initial letter of a word is capitalized, only the first of the two component letters is capitalized: Njemačka and not NJemačka. In Unicode, the form ‹Nj› is referred to as titlecase, as opposed to the uppercase form ‹NJ›, representing one of the few cases where titlecase and uppercase differ. Uppercase would be used if the entire word was capitalized: NJEMAČKA.

Origins

The Croatian Latin was mostly designed by Ljudevit Gaj, who modelled it after Czech and Polish, and invented ‹lj›, ‹nj› and ‹dž›. In 1830 in Buda he printed the book Kratka osnova horvatsko-slavenskog pravopisanja ("Brief basics of the Croatian-Slavonic orthography"), which was the first common Croatian orthography book. It was not the first ever Croatian orthography work, as it was preceded by works of Rajmund Đamanjić (1639), Ignjat Đurđević and Pavao Ritter Vitezović. The Croats had previously used the Latin alphabet, but some of the specific sounds were not uniformly represented.

Gaj followed the example of Pavao Ritter Vitezović and the Czech orthography, making one letter of the Latin script for each sound in the language. His alphabet mapped completely on Serbian Cyrillic which was standardized by Vuk Karadžić a few years before. Đuro Daničić added the letter ‹đ›.

Computing

In the 1990s, there was a general confusion about the proper character encoding to use to write text in Latin Croatian on computers.

  • An attempt was made to apply the 7-bit "YUSCII" (later adapted to CROSCII, sometimes jokingly called žabeceda (žaba=frog, abeceda=alphabet, because ASCII @, sorting before A, encodes Ž)), which included the five letters with diacritics at the expense of five non-letter characters ([, ], {, }, @), but it was ultimately unsuccessful. Other short-lived vendor-specific efforts were also undertaken.
  • The 8-bit ISO 8859-2 (Latin-2) standard was developed by ISO, but
  • MS-DOS introduced 8-bit encoding CP852 for Central European languages, and
  • Microsoft Windows spread yet another 8-bit encoding called CP1250, which had a few letters mapped one-to-one with ISO 8859-2, but also had some mapped elsewhere.
  • Apple uses yet another encoding.
  • EBCDIC also has Latin-2 encoding, Code Page 01153.

The preferred character encoding for Croatian today is either the ISO 8859-2, or the Unicode encoding UTF-8 (with two bytes or 16 bits necessary to use the letters with diacritics). However, one can still find programs and, more importantly, databases that use CP1250, CP852 or even CROSCII, the former still sometimes being considered de-facto standard.

The Gaj alphabet for Slovenian

Since the early 1840s, Gaj's alphabet was increasingly used for the Slovene language. In the beginning, Slovene authors who treated Slovene as a variant of Serbo-Croatian (such as Stanko Vraz) most commonly used it, but it was later accepted by a large spectrum of Slovene-writing authors. The breakthrough came when the Slovene conservative leader Janez Bleiweis started using Gaj's script in his journal Kmetijske in rokodelske novice ("Peasant's and Artisan's News"), which was read by a wide public in the countryside. By 1850, Gaj's alphabet (known as gajica in Slovene) became the only official Slovene alphabet, replacing three other writing systems which circulated in the Slovenian Lands since the 1830s: the traditional one, called bohoričica (after its inventor, Adam Bohorič), and the two innovative proposals by the Peter Dajnko (the dajnčica) and Franc Serafin Metelko (the metelčica).

The Slovene version of Gaj's alphabet differs from the Croatian one in the following traits:

  • the Slovene alphabet does not have the characters ‹ć› and ‹đ›; the sounds these letters represent are not present in the Slovene language;
  • in the Slovenian variant, the digraphs ‹lj› and ‹nj› are treated as two separate letters and represent separate sounds (e.g. the word polje is pronounced /polje/ in Slovenian, as opposed to /poʎe/ in Croatian).
  • while the phoneme /dʒ/ exists in modern Slovenian and is written ‹dž›, it is only used in borrowed words, and ‹d› and ‹ž› are considered separate letters, not a digraph.

While the Serbo-Croatian alphabet is almost completely phonetic[citation needed] (that is, each sound is represented by a single letter or digraph, and each letter or digraph stands for only one sound), the Slovene alphabet, although following the same principle, has numerous exceptions.

Differences with other Slavic Latin alphabets

  • Lacks the accented letters ‹á›, ‹é›, ‹í›, ‹ĺ›, ‹ó›, ‹ŕ›, ‹ú›, ‹ý›, palatised consonants ‹ď›, ‹ľ›, ‹ň›, ‹ř›, ‹ť› and other special characters ‹ě›, ‹ô›, ‹ů›, etc. found in Czech and Slovak.
  • The Polish ‹ą›, ‹ę›, ‹ń›, ‹ó›, ‹ś›, ‹ł›, ‹ź›, ‹ż› are not used.
  • the letters ‹q›, ‹w›, ‹x› and ‹y› and digraph ‹ch› are used only in foreign words.
  • The digraph ‹lj› corresponds to Slovak ‹ľ›, and ‹nj› is the equivalent of Polish ‹ń› and Czech-Slovak ‹ň›. The unique ‹đ› matches the Polish ‹dź›, while ‹dž› matches ‹dż›.

See also

References

External links







stock | retire | vm
Why are we here?
All text is available under the terms of the GNU Free Documentation License
This page is cache of Wikipedia. History