Tangut (Tangut: ; ) is an extinct Sino‑Tibetan language, now argued to belong within the Horpa subgroup of West Gyalrongic.
Tangut was one of the official languages of the Western Xia, founded by the Tangut people in northwestern China. The Western Xia was annihilated by the Mongol Empire in 1227. The Tangut language has its own script, the Tangut script. The latest known text written in the Tangut language, the Tangut dharani pillars, dates to 1502, suggesting that the language was still in use nearly three hundred years after the collapse of Western Xia.
Beaudouin (2023a,b) showed that Tangut was a Horpa language, a subgroup of West Gyalrongic. He hypothesizes a position between the Northern (Stodsde) and Central (Stau, Geshiza) lects, proposing a tentative Urheimat around the place where the Erkai variety is spoken today within Ngawa, Sichuan.
The majority of extant Tangut texts were excavated at Khara-Khoto in 1909 by Pyotr Kozlov, and the script was identified as that of the Tangut state of Xixia. Such scholars as Aleksei Ivanovich Ivanov, Ishihama Juntaro (石濱純太郎), Berthold Laufer, Luo Fuchang (羅福萇), Luo Fucheng (羅福成), and Wang Jingru (王靜如) have contributed to research on the Tangut language. The most significant contribution was made by the Russian scholar Nikolai Aleksandrovich Nevsky (1892–1937), who compiled the first Tangut dictionary and reconstructed the meaning of a number of Tangut grammatical particles, thus making it possible to actually read and understand Tangut texts. His scholarly achievements were published posthumously in 1960 under the title Tangutskaya Filologiya (Tangut Philology), and the scholar was eventually (and posthumously) awarded the Soviet Lenin Prize for his work. The understanding of the Tangut language is far from perfect: although certain aspects of the morphology (Ksenia Kepping, The Morphology of the Tangut Language, Moscow: Nauka, 1985) and grammar (Tatsuo Nishida, Seika go no kenkyū, etc.) are understood, the syntactic structure of Tangut remains largely unexplored.
The Khara-Khoto documents are at present preserved in the Institute of Oriental Manuscripts of the Russian Academy of Sciences in Saint Petersburg. These survived the Siege of Leningrad, but a number of manuscripts in the possession of Nevsky at the time of his arrest by the NKVD (NKVD) in 1937 went missing, and were returned, under mysterious circumstances, to the Institute of Oriental Manuscripts only in October 1991. The collections amount to about 10,000 volumes, of mostly Buddhist texts, law codes, and legal documents dating from mid-11th up to early 13th centuries. Among the Buddhist texts, a number of unique compilations, not known either in Chinese or in Tibetan versions, were recently discovered. Furthermore, the Buddhist canon, the Chinese classics, and a great number of indigenous texts written in Tangut have been preserved. These other major Tangut collections, though much smaller, belong to the British Library, the French National Library ('italic=no'), the National Library in Beijing, the Library of Beijing University, and other libraries.
The discovery of the Pearl in the Palm, a Tangut–Chinese bilingual glossary, permitted Ivanov (1909) and Laufer (1916) to propose initial reconstructions and to undertake the comparative study of Tangut. This glossary in effect indicates the pronunciation of each Tangut character with one or several Chinese characters, and inversely each Chinese character with one or more Tangut characters. The second source is the corpus of Tibetan transcriptions of Tangut. These data were studied for the first time by Nevsky (Nevskij) (1925). Though these transcriptions were not written with the intention of representing with precision the pronunciation of Tangut, but instead simply to help foreigners to pronounce and memorize the words of one language with the words of another which they could understand.
The third source, which constitutes the basis of the modern reconstructions, consists of monolingual Tangut dictionaries: the Wenhai (文海), two editions of the Tongyin (同音), the Wenhai zalei (文海雜類), and an untitled dictionary. The record of the pronunciation in these dictionaries is made using the principle of fanqie, borrowed from the Chinese lexicographic tradition. Although these dictionaries may differ on small details (e.g. the Tongyin categorizes the characters according to syllable initial and rime without taking any account of tone), they all adopt the same system of 105 rimes. A certain number of rimes are in complementary distribution with respect to the place of articulation of the initials, e.g. rimes 10 and 11 or rimes 36 and 37. Fǎnqiè makes distinctions among the rhymes in a systematic and precise manner. Nonetheless, it is still necessary to compare the phonological system of the dictionaries with the other sources in order to "fill in" the categories with a phonetic value.
N. A. Nevsky reconstructed Tangut grammar and provided the first Tangut–Chinese–English–Russian dictionary, which together with the collection of his papers was published posthumously in 1960 under the title Tangut Philology (Moscow: 1960). Later, substantial contribution to the research of Tangut language was done by 西田龍雄, Ksenia Kepping, Gong Hwang-cherng (龔煌城), M.V. Sofronov, and Li Fanwen (李範文). Marc Miyake has published on Tangut phonology and diachronics. There are four Tangut dictionaries available: the one composed by N.A. Nevsky, one composed by Nishida (1966), one composed by Li Fanwen (1997, revised edition 2008), and one composed by Yevgeny Kychanov (2006).
Modern refinements to Tangut reconstruction leverage a new type of data that was unavailable to previous scholars: the phonology of modern Gyalrongic languages.
| 重唇音類 | heavy lip | bilabials | p, ph, b, m | p, ph, b, m | p, ph, b, m |
| 輕唇音類 | light lip | labio-dentals | f, v, w | v | |
| 舌頭音類 | tongue tip | apicals (dentals) | t, th, d, n | t, th, d, n | t, th, d, n |
| 舌上音類 | tongue surface | laminals (alveolars) | ty', thy', dy', ny' | tʂ tʂh dʐ ɳ | |
| 牙音類 | molar | velars | k, kh, g, ng | k, kh, g, ŋ | k, kh, g, ŋ |
| 齒頭音類 | incisor tip | dental affricates and fricatives | ts, tsh, dz, s | ts, tsh, dz, s | ts, tsh, dz, s |
| 正齒音類 | incisor proper | palatal affricates and fricatives | c, ch, j, sh | tɕ, tɕh, dʑ, ɕ | |
| 喉音類 | throat | laryngeals | ', h | ., x, ɣ | ʔ, x, ɣ |
| 流風音類 | flowing air | resonants | l, lh, ld, z, r, zz | l, lh, z, r, ʑ | ɫ, ɬ, z, ɽr, r |
The rhyme books distinguish 105 rhyme classes, which are, in turn, classified in several ways: division/grade (等), type (環), and class (攝).
Tangut rhymes occur in three types (環). They are seen in the tradition of Nishida, followed by both Arakawa and Gong as 'normal' (普通母音), 'tense' (緊喉母音), and 'retroflex' (捲舌母音). Gong leaves normal vowels unmarked and places a dot under tense vowels and an -r after retroflex vowels. Arakawa differs only by indicating tense vowels with a final -q.
The rhyme books distinguish four vowel grades (等). In early phonetic reconstructions, all four were separately accounted for, but it has since been realized that grades three and four are in complementary distribution, depending on the initial. Consequently, the reconstructions of Arakawa and Gong do not account for this distinction. Gong represents these three grades as V, iV, and jV. Arakawa accounts for them as V, iV, and V.
In general, rhyme class (攝) corresponds to the set of all rhymes under the same rhyme type which have the same main vowel.
Gong further posits phonemic vowel length and points to evidence that indicates that Tangut had a distinction that Chinese lacked. There is no certainty that the distinction was vowel length and so other researchers have remained skeptical.
| close | i I u | iq eq uq | ir Ir ur |
| mid | e o | eq2 oq | er or |
| open | a | aq | ar |
Miyake reconstructs the vowels differently. In his reconstruction, the 95 vowels of Tangut formed from a six-vowel system in Pre-Tangut because of preinitial loss. (The two vowels in parentheses appeared only in loanwords from Chinese, and many of the vowels in class III were in complementary distribution with their equivalents in class IV.)
| əəu | oo | ɨuu | iuu |
| (əũ) | |||
| əụ | ɨụ | iụ | |
| əuʳ | iuʳ | ||
| əəi | ɪɪ | ɨii | ii |
| əĩ | ɨĩ | ĩ | |
| əị | ɨị | ị | |
| əiʳ | ɪʳ | ɨiʳ | iʳ |
| əəiʳ | ɪɪʳ | ɨiiʳ | iiʳ |
| aa | ææ | ɨaa | iaa |
| ã | æ̃ | ɨã | iã |
| ạ | ɨạ | iạ | |
| aʳ | æʳ | ɨaʳ | iaʳ |
| aaʳ | ɨaaʳ | iaaʳ | |
| (ya) | |||
| əə | ɨəə | iəə | |
| ə̣ | ɨə̣ | iə̣ | |
| əʳ | ʌʳ | ɨəʳ | iəʳ |
| ɨəəʳ | iəəʳ | ||
| ee | ɛ | ɨee | iee |
| ẽ | ɛ̃ | ɨẽ | iẽ |
| ɛ̣̃ | ɨẹ̃ | iẹ̃ | |
| ɛ̣ | ɨẹ | iẹ | |
| eʳ | ɛʳ | ɨeʳ | ieʳ |
| ɨiw | iw | ||
| eʳw | i(e)ʳw | ||
| wɨo | |||
| oo | ɔɔ | ɨoo | ioo |
| õ | ɔ̃ | ɨõ | iõ |
| ɔ̃ɔ̃ | ɨõõ | iõõ | |
| ọ | ɔ̣ | ɨọ | iọ |
| oʳ | ɔʳ | ɨoʳ | ioʳ |
| ooʳ | iooʳ | ||
| õʳ | iõʳ |
The classes here are related to those of Chinese .
| + Tangut verb template ! +6 | +1 ! colspan="2" | -4 |
| valency | -djij² |
In Tangut texts, only few instances of syntactic noun incorporation are attested: the head is final, since it doesn’t move, the directional marker serves as adverb; can absorb the object, but not the subject. In other Qiangic languages that possess high levels of pronominalization such as Japhug and Khroskyabs, NI is still a more syntactically productive process with widespread uses.Guillaume, Jacques. (2011) The Structure of the Tangut verb. Journal of Chinese Linguistics, 2011, 39 (2), pp. 419–443.
In Tangut, two parts of the verb are sensitive to agreement, the person suffix (slot -1) and the verb stem itself (verbal core).Beaudouin, Mathieu. (2022) Tangut verb agreement: Optional or not?. Linguistics of the Tibeto-Burman area, 2022, 45 (1), pp. 93–109. For intransitive verbs, only the person suffix is relevant where it agrees with the subject of the verb. As for transitive verbs, verbs generally agree with the absolutive argument except if the absolutive argument is 3rd person and the ergative is 1st or 2nd person. In these situations, the suffix instead agrees with the ergative argument.
| + !! rowspan="2" | Subject or Agent !! colspan="7" | Patient !! rowspan="2" | Intransitive |
Cells coloured in green not only involve the person suffix but also involve alternations of the stems from the basic stem A to stem B. This stem alternation pattern originates from a 3rd person object suffix of the form *-w as is also found in other Sino-Tibetan languages.Jacques, Guillaume. (2009). The origin of vowel alternations in the Tangut verb Language and Linguistics. 10.1:17–27. In general, stem alternation involves changing the vowel of the stem in a pattern shown as below.Gong, Xun. (2017). 'Verb stems in Tangut and their orthography.' SCRIPTA, 9. pp. 29–48.
| + !Stem A !Stem B !Example | ||
| -i/e | -o | dzji1 → dzjo1 "eat" |
| -u | -o | lju2 → ljo2 "to throw" |
| -ej/ij | -o | dźjij2 → dźjo2 "to possess" |
| -ej/ij | -i/e | ljij2 → lji2 "to see" |
| -a | -ɨ/ə | kjạ1 → kjɨ̣1 "to fear" |
| 𗶷 śjɨ1 | 𗶹 śji2 | "to ɡo" |
| 𗄼 lja1 | 𗆐 ljịj2 | "to come" |
| 𗈶 sjɨ1 | 𗢏/𗏋 sji2 | "to die" |
| 𘐩 phjɨ1 | 𘜉 phji2 | "to abandon, to lose" |
|
|