Latin or Roman script is a set of graphic signs (script) based on the letters of the classical Latin alphabet, which is derived from a form of the Cumae alphabet version of the Greek alphabet, used by the Etruscans.
Several Latin-script alphabets exist which differ in graphemes, collation and phonetic values from the classical Latin alphabet.
The Latin script is the basis of the International Phonetic Alphabet and the 26 most widespread letters are the letters contained in the ISO basic Latin alphabet.
Latin script is the basis for the largest number of alphabets of any writing system
and is the
most widely adopted writing system in the world (commonly used by about 70% of the world's population). Latin script is used as the standard method of writing in most Western and Central European languages, as well as in many languages in other parts of the world.
The script is either called Roman script or Latin script, in reference to its origin in ancient Rome
. In the context of transliteration
, the term "romanization
" or "romanisation" is often found.
uses the term "Latin"
as does the International Organization for Standardization (ISO).
The numeral system is called the Roman numeral system; and the collection of the elements, Roman numerals. The numbers 1, 2, 3 ... are Latin/Roman script numbers for the Hindu–Arabic numeral system.
Old Italic alphabet
Archaic Latin alphabet
The letter was the western form of the Greek gamma
, but it was used for the sounds and alike, possibly under the influence of Etruscan, which might have lacked any voiced plosives
. Later, probably during the 3rd century BC, the letter – unneeded to write Latin properly – was replaced with the new letter , a modified with a small vertical stroke, which took its place in the alphabet. From then on, represented the voiced plosive , while was generally reserved for the voiceless plosive . The letter was used only rarely, in a small number of words such as Kalends
, often interchangeably with .
Classical Latin alphabet
After the Roman conquest of Greece in the 1st century BC, Latin adopted the Greek letters and (or readopted, in the latter case) to write Greek language
loanwords, placing them at the end of the alphabet. An attempt by the emperor Claudius
to introduce three Claudian letters
did not last. Thus it was during the classical Latin
period that the Latin alphabet contained 23 letters:
|+ Classical Latin alphabet|
ISO basic Latin alphabet
|+ ISO basic Latin alphabet|
The use of the letters I and V for both consonants and vowels proved inconvenient as the Latin alphabet was adapted to Germanic and Romance languages. W originated as a doubled V (VV) used to represent the sound found in Old English as early as the 7th century. It came into common use in the later 11th century, replacing the runic Wynn letter which had been used for the same sound. In the Romance languages, the minuscule form of V was a rounded u; from this was derived a rounded capital U for the vowel in the 16th century, while a new, pointed minuscule v was derived from V for the consonant. In the case of I, a word-final swash form, j, came to be used for the consonant, with the un-swashed form restricted to vowel use. Such conventions were erratic for centuries. J was introduced into English for the consonant in the 17th century (it had been rare as a vowel), but it was not universally considered a distinct letter in the alphabetic order until the 19th century.
By the 1960s, it became apparent to the computer and telecommunications industries in the First World that a non-proprietary method of encoding characters was needed. The International Organization for Standardization (ISO) encapsulated the Latin alphabet in their (ISO/IEC 646) standard. To achieve widespread acceptance, this encapsulation was based on popular usage. As the United States held a preeminent position in both industries during the 1960s, the standard was based on the already published American Standard Code for Information Interchange, better known as ASCII, which included in the character set the 26 × 2 (uppercase and lowercase) letters of the English alphabet. Later standards issued by the ISO, for example ISO/IEC 10646 (Unicode Latin), have continued to define the 26 × 2 letters of the English alphabet as the basic Latin alphabet with extensions to handle other letters in other languages.
The Latin alphabet spread, along with Latin
, from the Italian Peninsula to the lands surrounding the Mediterranean Sea with the expansion of the Roman Empire
. The eastern half of the Empire, including Greece
, the Levant
, and Egypt
, continued to use Greek language
as a lingua franca
, but Latin was widely spoken in the western half, and as the western Romance languages evolved out of Latin, they continued to use and adapt the Latin alphabet.
With the spread of Western Christianity during the Middle Ages
, the Latin alphabet was gradually adopted by the peoples of Northern Europe
who spoke Celtic languages (displacing the Ogham
alphabet) or Germanic languages (displacing earlier
) or Baltic languages
, as well as by the speakers of several Uralic languages
, most notably Hungarian, Finnish language
The Latin script also came into use for writing the West Slavic languages and several South Slavic languages, as the people who spoke them adopted Roman Catholicism. The speakers of East Slavic languages generally adopted Cyrillic script along with Orthodox Christianity. The Serbian language uses both scripts, with Cyrillic predominating in official communication and Latin elsewhere, as determined by the Law on Official Use of the Language and Alphabet.
Since the 16th century
As late as 1500, the Latin script was limited primarily to the languages spoken in Western Europe
, Northern Europe
, and Central Europe
. The Orthodox Christian Slavs of Eastern Europe
and Southeastern Europe mostly used Cyrillic script
, and the Greek alphabet was in use by Greek-speakers around the eastern Mediterranean. The Arabic script
was widespread within Islam, both among
and non-Arab nations like the Iranian peoples
, Indonesians, Malays, and Turkic peoples
. Most of the rest of Asia used a variety of Brahmic family
or the Chinese script
Through European colonization the Latin script has spread to the Americas, Oceania, parts of Asia, Africa, and the Pacific, in forms based on the Spanish alphabet, Portuguese, English alphabet, French alphabet, German alphabet and Dutch alphabet alphabets.
It is used for many Austronesian languages, including the languages of the Philippines and the Malaysian and Indonesian languages, replacing earlier Arabic and indigenous Brahmic alphabets. Latin letters served as the basis for the forms of the Cherokee syllabary developed by Sequoyah; however, the sound values are completely different.
Since 19th century
In the late 19th century, the Romanians
returned to the Latin alphabet, which they had used until the Council of Florence in 1439,
primarily because Romanian is a Romance language
. The Romanians were predominantly Orthodox Christians, and their Church, increasingly influenced by Russia after the fall of Byzantine Greek Constantinople in 1453 and capture of the Greek Orthodox Patriarch, had begun promoting the Slavic Cyrillic script
Under French rule and Portuguese missionary influence, a Latin alphabet was devised for the Vietnamese language, which had previously used Chu nom.
Since 20th century
In 1928, as part of Mustafa Kemal Atatürk's reforms, the new Turkey
adopted a Latin alphabet for the Turkish language
, replacing a modified Arabic alphabet. Most of the Turkic languages
-speaking peoples of the former Soviet Union
, including Tatars
, Kyrgyz people
and others, used the Latin-based Uniform Turkic alphabet in the 1930s; but, in the 1940s, all were replaced by Cyrillic. After the collapse of the Soviet Union
in 1991, three of the newly independent Turkic-speaking republics, Azerbaijan
, and Turkmenistan
, as well as Romanian-speaking Moldova
, officially adopted Latin alphabets for their languages.
Kazakhstan, Kyrgyzstan, Iranian-speaking Tajikistan, and the breakaway region of Transnistria kept the Cyrillic alphabet, chiefly due to their close ties with Russia. In the 1930s and 1940s, the majority of Kurds replaced the Arabic script with two Latin alphabets. Although the only official Kurdish government uses an Arabic alphabet for public documents, the Latin Kurdish alphabet remains widely used throughout the region by the majority of Kurdish language-speakers.
In 2015, the Kazakhstan government announced that the Latin alphabet would replace Cyrillic as the writing system for the Kazakh language by 2025.
[ Kazakh language to be converted to Latin alphabet – MCS RK. Inform.kz (30 January 2015). Retrieved on 2015-09-28.]
By the 1960s, it became apparent to the computer and telecommunications industries in the First World
that a non-proprietary method of encoding characters was needed. The International Organization for Standardization (ISO) encapsulated the Latin alphabet in their (ISO/IEC 646) standard. To achieve widespread acceptance, this encapsulation was based on popular usage.
As the United States held a preeminent position in both industries during the 1960s, the standard was based on the already published American Standard Code for Information Interchange, better known as ASCII, which included in the character set the 26 × 2 (uppercase and lowercase) letters of the English alphabet. Later standards issued by the ISO, for example ISO/IEC 10646 (Unicode Latin), have continued to define the 26 × 2 letters of the English alphabet as the basic Latin alphabet with extensions to handle other letters in other languages.
As used by various languages
In the course of its use, the Latin alphabet was adapted for use in new languages, sometimes representing
not found in languages that were already written with the Roman characters. To represent these new sounds, extensions were therefore created, be it by adding
to existing letters, by joining multiple letters together to make ligatures, by creating completely new forms, or by assigning a special function to pairs or triplets of letters. These new forms are given a place in the alphabet by defining an collation
or collation sequence, which can vary with the particular language.
Some examples of new letters to the standard Latin alphabet are the Runic alphabet
and thorn , and the letter eth
, which were added to the alphabet of Old English. Another Irish letter, the Insular G
, developed into yogh
, used in Middle English
. Wynn was later replaced with the new letter , eth and thorn with , and yogh with . Although the four are no longer part of the English or Irish alphabets, eth and thorn are still used in the modern Icelandic and
Some West, Central and languages use a few additional letters that have a similar sound value to their equivalents in the IPA. For example, Adangme language uses the letters and , and Ga language uses , and . Hausa language uses and for implosives, and for an ejective. African studies have standardized these into the African reference alphabet.
The Azerbaijani language also has the letter written as "Ə", which represents the near-open front unrounded vowel.
A digraph is a pair of letters used to write one sound or a combination of sounds that does not correspond to the written letters in sequence. Examples are , , , in English, and in Dutch. In Dutch the is capitalized as or the ligature , but never as , and it often takes the appearance of a ligature very similar to the letter in handwriting
A trigraph is made up of three letters, like the German language , the Breton language or the Milanese . In the orthography of some languages, digraphs and trigraphs are regarded as independent letters of the alphabet in their own right. The capitalization of digraphs and trigraphs is language-dependent, as only the first letter may be capitalized, or all component letters simultaneously (even for words written in titlecase, where letters after the digraph or trigraph are left in lowercase).
A ligature is a fusion of two or more ordinary letters into a new glyph
or character. Examples are (from , called "ash"), (from , sometimes called "oethel"), the abbreviation
(from Latin language et
"and"), and the German language
symbol ("sharp S" or "eszet", from or , the Long s
, followed by a or ).
A diacritic, in some cases also called an accent, is a small symbol that can appear above or below a letter, or in some other position, such as the umlaut sign used in the German characters , , or the Romanian characters ă, â, î, S-comma
. Its main function is to change the phonetic value of the letter to which it is added, but it may also modify the pronunciation of a whole syllable or word, or distinguish between homonym
. As with letters, the value of diacritics is language-dependent.
Some modified letters, such as the symbols , , and , may be regarded as new individual letters in themselves, and assigned a specific place in the alphabet for collation
purposes, separate from that of the letter on which they are based, as is done in Swedish alphabet
. In other cases, such as with , , in German, this is not done; letter-diacritic combinations being identified with their base letter. The same applies to digraphs and trigraphs. Different diacritics may be treated differently in collation within a single language. For example, in Spanish language
, the character is considered a letter, and sorted between and in dictionaries, but the accented vowels , , , , are not separated from the unaccented vowels , , , , .
The languages that use the Latin script today generally use capital letters
to begin paragraphs and sentences and proper nouns. The rules for capitalization
have changed over time, and different languages have varied in their rules for capitalization. Old English, for example, was rarely written with even proper nouns capitalized; whereas Modern English
of the 18th century had frequently all nouns capitalized, in the same way that Modern German language
is written today, e.g. Alle Schwestern der alten Stadt hatten die Vögel gesehen
("All of the sisters of the old city had seen the birds").
Words from languages natively written with other Writing system
, such as Arabic or Chinese, are usually transliteration
or transcribed when embedded in Latin-script text or in Multilingualism
international communication, a process termed Romanization.
Whilst the Romanization of such languages is used mostly at unofficial levels, it has been especially prominent in computer messaging where only the limited 7-bit ASCII code is available on older systems. However, with the introduction of Unicode, Romanization is now becoming less necessary. Note that keyboards used to enter such text may still restrict users to Romanized text, as only ASCII or Latin-alphabet characters may be available.
List of languages by writing system#Latin script
Western Latin character sets (computing)
Latin letters used in mathematics
Boyle, Leonard E. 1976. "Optimist and recensionist: 'Common errors' or 'common variations.'" In Latin script and letters A.D. 400–900: Festschrift presented to Ludwig Bieler on the occasion of his 70th birthday. Edited by John J. O’Meara and Bernd Naumann, 264–74. Leiden, The Netherlands: Brill.
Morison, Stanley. 1972. Politics and script: Aspects of authority and freedom in the development of Graeco-Latin script from the sixth century B.C. to the twentieth century A.D. Oxford: Clarendon.