1I18N::LangTags::List(3pmP)erl Programmers Reference GuidIe18N::LangTags::List(3pm)
2
3
4

NAME

6       I18N::LangTags::List -- tags and names for human languages
7

SYNOPSIS

9         use I18N::LangTags::List;
10         print "Parlez-vous... ", join(', ',
11             I18N::LangTags::List::name('elx') || 'unknown_language',
12             I18N::LangTags::List::name('ar-Kw') || 'unknown_language',
13             I18N::LangTags::List::name('en') || 'unknown_language',
14             I18N::LangTags::List::name('en-CA') || 'unknown_language',
15           ), "?\n";
16
17       prints:
18
19         Parlez-vous... Elamite, Kuwait Arabic, English, Canadian English?
20

DESCRIPTION

22       This module provides a function "I18N::LangTags::List::name( langtag )
23       " that takes a language tag (see I18N::LangTags) and returns the best
24       attempt at an English name for it, or undef if it can't make sense of
25       the tag.
26
27       The function I18N::LangTags::List::name(...) is not exported.
28
29       This module also provides a function "I18N::LangTags::List::is_decent(
30       langtag )" that returns true iff the language tag is syntactically
31       valid and is for general use (like "fr" or "fr-ca", below).  That is,
32       it returns false for tags that are syntactically invalid and for tags,
33       like "aus", that are listed in brackets below.  This function is not
34       exported.
35
36       The map of tags-to-names that it uses is accessible as
37       %I18N::LangTags::List::Name, and it's the same as the list that follows
38       in this documentation, which should be useful to you even if you don't
39       use this module.
40

ABOUT LANGUAGE TAGS

42       Internet language tags, as defined in RFC 3066, are a formalism for
43       denoting human languages.  The two-letter ISO 639-1 language codes are
44       well known (as "en" for English), as are their forms when qualified by
45       a country code ("en-US").  Less well-known are the arbitrary-length
46       non-ISO codes (like "i-mingo"), and the recently (in 2001) introduced
47       three-letter ISO-639-2 codes.
48
49       Remember these important facts:
50
51       ·   Language tags are not locale IDs.  A locale ID is written with a
52           "_" instead of a "-", (almost?) always matches "m/^\w\w_\w\w\b/",
53           and means something different than a language tag.  A language tag
54           denotes a language.  A locale ID denotes a language as used in a
55           particular place, in combination with non-linguistic location-
56           specific information such as what currency is used there.  Locales
57           also often denote character set information, as in
58           "en_US.ISO8859-1".
59
60       ·   Language tags are not for computer languages.
61
62       ·   "Dialect" is not a useful term, since there is no objective
63           criterion for establishing when two language-forms are dialects of
64           eachother, or are separate languages.
65
66       ·   Language tags are not case-sensitive.  en-US, en-us, En-Us, etc.,
67           are all the same tag, and denote the same language.
68
69       ·   Not every language tag really refers to a single language.  Some
70           language tags refer to conditions: i-default (system-message text
71           in English plus maybe other languages), und (undetermined
72           language).  Others (notably lots of the three-letter codes) are
73           bibliographic tags that classify whole groups of languages, as with
74           cus "Cushitic (Other)" (i.e., a language that has been classed as
75           Cushtic, but which has no more specific code) or the even less
76           linguistically coherent sai for "South American Indian (Other)".
77           Though useful in bibliography, SUCH TAGS ARE NOT FOR GENERAL USE.
78           For further guidance, email me.
79
80       ·   Language tags are not country codes.  In fact, they are often
81           distinct codes, as with language tag ja for Japanese, and ISO 3166
82           country code ".jp" for Japan.
83

LIST OF LANGUAGES

85       The first part of each item is the language tag, between {...}.  It is
86       followed by an English name for the language or language-group.
87       Language tags that I judge to be not for general use, are bracketed.
88
89       This list is in alphabetical order by English name of the language.
90
91       {ab} : Abkhazian
92           eq Abkhaz
93
94       {ace} : Achinese
95       {ach} : Acoli
96       {ada} : Adangme
97       {ady} : Adyghe
98           eq Adygei
99
100       {aa} : Afar
101       {afh} : Afrihili
102           (Artificial)
103
104       {af} : Afrikaans
105       [{afa} : Afro-Asiatic (Other)]
106       {ak} : Akan
107           (Formerly "aka".)
108
109       {akk} : Akkadian
110           (Historical)
111
112       {sq} : Albanian
113       {ale} : Aleut
114       [{alg} : Algonquian languages]
115           NOT Algonquin!
116
117       [{tut} : Altaic (Other)]
118       {am} : Amharic
119           NOT Aramaic!
120
121       {i-ami} : Ami
122           eq Amis.  eq 'Amis.  eq Pangca.
123
124       [{apa} : Apache languages]
125       {ar} : Arabic
126           Many forms are mutually un-intelligible in spoken media.  Notable
127           forms: {ar-ae} UAE Arabic; {ar-bh} Bahrain Arabic; {ar-dz} Algerian
128           Arabic; {ar-eg} Egyptian Arabic; {ar-iq} Iraqi Arabic; {ar-jo}
129           Jordanian Arabic; {ar-kw} Kuwait Arabic; {ar-lb} Lebanese Arabic;
130           {ar-ly} Libyan Arabic; {ar-ma} Moroccan Arabic; {ar-om} Omani
131           Arabic; {ar-qa} Qatari Arabic; {ar-sa} Sauda Arabic; {ar-sy} Syrian
132           Arabic; {ar-tn} Tunisian Arabic; {ar-ye} Yemen Arabic.
133
134       {arc} : Aramaic
135           NOT Amharic!  NOT Samaritan Aramaic!
136
137       {arp} : Arapaho
138       {arn} : Araucanian
139       {arw} : Arawak
140       {hy} : Armenian
141       {an} : Aragonese
142       [{art} : Artificial (Other)]
143       {ast} : Asturian
144           eq Bable.
145
146       {as} : Assamese
147       [{ath} : Athapascan languages]
148           eq Athabaskan.  eq Athapaskan.  eq Athabascan.
149
150       [{aus} : Australian languages]
151       [{map} : Austronesian (Other)]
152       {av} : Avaric
153           (Formerly "ava".)
154
155       {ae} : Avestan
156           eq Zend
157
158       {awa} : Awadhi
159       {ay} : Aymara
160       {az} : Azerbaijani
161           eq Azeri
162
163           Notable forms: {az-Arab} Azerbaijani in Arabic script; {az-Cyrl}
164           Azerbaijani in Cyrillic script; {az-Latn} Azerbaijani in Latin
165           script.
166
167       {ban} : Balinese
168       [{bat} : Baltic (Other)]
169       {bal} : Baluchi
170       {bm} : Bambara
171           (Formerly "bam".)
172
173       [{bai} : Bamileke languages]
174       {bad} : Banda
175       [{bnt} : Bantu (Other)]
176       {bas} : Basa
177       {ba} : Bashkir
178       {eu} : Basque
179       {btk} : Batak (Indonesia)
180       {bej} : Beja
181       {be} : Belarusian
182           eq Belarussian.  eq Byelarussian.  eq Belorussian.  eq
183           Byelorussian.  eq White Russian.  eq White Ruthenian.  NOT
184           Ruthenian!
185
186       {bem} : Bemba
187       {bn} : Bengali
188           eq Bangla.
189
190       [{ber} : Berber (Other)]
191       {bho} : Bhojpuri
192       {bh} : Bihari
193       {bik} : Bikol
194       {bin} : Bini
195       {bi} : Bislama
196           eq Bichelamar.
197
198       {bs} : Bosnian
199       {bra} : Braj
200       {br} : Breton
201       {bug} : Buginese
202       {bg} : Bulgarian
203       {i-bnn} : Bunun
204       {bua} : Buriat
205       {my} : Burmese
206       {cad} : Caddo
207       {car} : Carib
208       {ca} : Catalan
209           eq Catalan.  eq Catalonian.
210
211       [{cau} : Caucasian (Other)]
212       {ceb} : Cebuano
213       [{cel} : Celtic (Other)]
214           Notable forms: {cel-gaulish} Gaulish (Historical)
215
216       [{cai} : Central American Indian (Other)]
217       {chg} : Chagatai
218           (Historical?)
219
220       [{cmc} : Chamic languages]
221       {ch} : Chamorro
222       {ce} : Chechen
223       {chr} : Cherokee
224           eq Tsalagi
225
226       {chy} : Cheyenne
227       {chb} : Chibcha
228           (Historical)  NOT Chibchan (which is a language family).
229
230       {ny} : Chichewa
231           eq Nyanja.  eq Chinyanja.
232
233       {zh} : Chinese
234           Many forms are mutually un-intelligible in spoken media.  Notable
235           forms: {zh-Hans} Chinese, in simplified script; {zh-Hant} Chinese,
236           in traditional script; {zh-tw} Taiwan Chinese; {zh-cn} PRC Chinese;
237           {zh-sg} Singapore Chinese; {zh-mo} Macau Chinese; {zh-hk} Hong Kong
238           Chinese; {zh-guoyu} Mandarin [Putonghua/Guoyu]; {zh-hakka} Hakka
239           [formerly "i-hakka"]; {zh-min} Hokkien; {zh-min-nan} Southern
240           Hokkien; {zh-wuu} Shanghaiese; {zh-xiang} Hunanese; {zh-gan} Gan;
241           {zh-yue} Cantonese.
242
243       {chn} : Chinook Jargon
244           eq Chinook Wawa.
245
246       {chp} : Chipewyan
247       {cho} : Choctaw
248       {cu} : Church Slavic
249           eq Old Church Slavonic.
250
251       {chk} : Chuukese
252           eq Trukese.  eq Chuuk.  eq Truk.  eq Ruk.
253
254       {cv} : Chuvash
255       {cop} : Coptic
256       {kw} : Cornish
257       {co} : Corsican
258           eq Corse.
259
260       {cr} : Cree
261           NOT Creek!  (Formerly "cre".)
262
263       {mus} : Creek
264           NOT Cree!
265
266       [{cpe} : English-based Creoles and pidgins (Other)]
267       [{cpf} : French-based Creoles and pidgins (Other)]
268       [{cpp} : Portuguese-based Creoles and pidgins (Other)]
269       [{crp} : Creoles and pidgins (Other)]
270       {hr} : Croatian
271           eq Croat.
272
273       [{cus} : Cushitic (Other)]
274       {cs} : Czech
275       {dak} : Dakota
276           eq Nakota.  eq Latoka.
277
278       {da} : Danish
279       {dar} : Dargwa
280       {day} : Dayak
281       {i-default} : Default (Fallthru) Language
282           Defined in RFC 2277, this is for tagging text (which must include
283           English text, and might/should include text in other appropriate
284           languages) that is emitted in a context where language-negotiation
285           wasn't possible -- in SMTP mail failure messages, for example.
286
287       {del} : Delaware
288       {din} : Dinka
289       {dv} : Divehi
290           eq Maldivian.  (Formerly "div".)
291
292       {doi} : Dogri
293           NOT Dogrib!
294
295       {dgr} : Dogrib
296           NOT Dogri!
297
298       [{dra} : Dravidian (Other)]
299       {dua} : Duala
300       {nl} : Dutch
301           eq Netherlander.  Notable forms: {nl-nl} Netherlands Dutch; {nl-be}
302           Belgian Dutch.
303
304       {dum} : Middle Dutch (ca.1050-1350)
305           (Historical)
306
307       {dyu} : Dyula
308       {dz} : Dzongkha
309       {efi} : Efik
310       {egy} : Ancient Egyptian
311           (Historical)
312
313       {eka} : Ekajuk
314       {elx} : Elamite
315           (Historical)
316
317       {en} : English
318           Notable forms: {en-au} Australian English; {en-bz} Belize English;
319           {en-ca} Canadian English; {en-gb} UK English; {en-ie} Irish
320           English; {en-jm} Jamaican English; {en-nz} New Zealand English;
321           {en-ph} Philippine English; {en-tt} Trinidad English; {en-us} US
322           English; {en-za} South African English; {en-zw} Zimbabwe English.
323
324       {enm} : Old English (1100-1500)
325           (Historical)
326
327       {ang} : Old English (ca.450-1100)
328           eq Anglo-Saxon.  (Historical)
329
330       {i-enochian} : Enochian (Artificial)
331       {myv} : Erzya
332       {eo} : Esperanto
333           (Artificial)
334
335       {et} : Estonian
336       {ee} : Ewe
337           (Formerly "ewe".)
338
339       {ewo} : Ewondo
340       {fan} : Fang
341       {fat} : Fanti
342       {fo} : Faroese
343       {fj} : Fijian
344       {fi} : Finnish
345       [{fiu} : Finno-Ugrian (Other)]
346           eq Finno-Ugric.  NOT Ugaritic!
347
348       {fon} : Fon
349       {fr} : French
350           Notable forms: {fr-fr} France French; {fr-be} Belgian French;
351           {fr-ca} Canadian French; {fr-ch} Swiss French; {fr-lu} Luxembourg
352           French; {fr-mc} Monaco French.
353
354       {frm} : Middle French (ca.1400-1600)
355           (Historical)
356
357       {fro} : Old French (842-ca.1400)
358           (Historical)
359
360       {fy} : Frisian
361       {fur} : Friulian
362       {ff} : Fulah
363           (Formerly "ful".)
364
365       {gaa} : Ga
366       {gd} : Scots Gaelic
367           NOT Scots!
368
369       {gl} : Gallegan
370           eq Galician
371
372       {lg} : Ganda
373           (Formerly "lug".)
374
375       {gay} : Gayo
376       {gba} : Gbaya
377       {gez} : Geez
378           eq Ge'ez
379
380       {ka} : Georgian
381       {de} : German
382           Notable forms: {de-at} Austrian German; {de-be} Belgian German;
383           {de-ch} Swiss German; {de-de} Germany German; {de-li} Liechtenstein
384           German; {de-lu} Luxembourg German.
385
386       {gmh} : Middle High German (ca.1050-1500)
387           (Historical)
388
389       {goh} : Old High German (ca.750-1050)
390           (Historical)
391
392       [{gem} : Germanic (Other)]
393       {gil} : Gilbertese
394       {gon} : Gondi
395       {gor} : Gorontalo
396       {got} : Gothic
397           (Historical)
398
399       {grb} : Grebo
400       {grc} : Ancient Greek
401           (Historical)  (Until 15th century or so.)
402
403       {el} : Modern Greek
404           (Since 15th century or so.)
405
406       {gn} : Guarani
407           Guarani
408
409       {gu} : Gujarati
410       {gwi} : Gwich'in
411           eq Gwichin
412
413       {hai} : Haida
414       {ht} : Haitian
415           eq Haitian Creole
416
417       {ha} : Hausa
418       {haw} : Hawaiian
419           Hawai'ian
420
421       {he} : Hebrew
422           (Formerly "iw".)
423
424       {hz} : Herero
425       {hil} : Hiligaynon
426       {him} : Himachali
427       {hi} : Hindi
428       {ho} : Hiri Motu
429       {hit} : Hittite
430           (Historical)
431
432       {hmn} : Hmong
433       {hu} : Hungarian
434       {hup} : Hupa
435       {iba} : Iban
436       {is} : Icelandic
437       {io} : Ido
438           (Artificial)
439
440       {ig} : Igbo
441           (Formerly "ibo".)
442
443       {ijo} : Ijo
444       {ilo} : Iloko
445       [{inc} : Indic (Other)]
446       [{ine} : Indo-European (Other)]
447       {id} : Indonesian
448           (Formerly "in".)
449
450       {inh} : Ingush
451       {ia} : Interlingua (International Auxiliary Language Association)
452           (Artificial)  NOT Interlingue!
453
454       {ie} : Interlingue
455           (Artificial)  NOT Interlingua!
456
457       {iu} : Inuktitut
458           A subform of "Eskimo".
459
460       {ik} : Inupiaq
461           A subform of "Eskimo".
462
463       [{ira} : Iranian (Other)]
464       {ga} : Irish
465       {mga} : Middle Irish (900-1200)
466           (Historical)
467
468       {sga} : Old Irish (to 900)
469           (Historical)
470
471       [{iro} : Iroquoian languages]
472       {it} : Italian
473           Notable forms: {it-it} Italy Italian; {it-ch} Swiss Italian.
474
475       {ja} : Japanese
476           (NOT "jp"!)
477
478       {jv} : Javanese
479           (Formerly "jw" because of a typo.)
480
481       {jrb} : Judeo-Arabic
482       {jpr} : Judeo-Persian
483       {kbd} : Kabardian
484       {kab} : Kabyle
485       {kac} : Kachin
486       {kl} : Kalaallisut
487           eq Greenlandic "Eskimo"
488
489       {xal} : Kalmyk
490       {kam} : Kamba
491       {kn} : Kannada
492           eq Kanarese.  NOT Canadian!
493
494       {kr} : Kanuri
495           (Formerly "kau".)
496
497       {krc} : Karachay-Balkar
498       {kaa} : Kara-Kalpak
499       {kar} : Karen
500       {ks} : Kashmiri
501       {csb} : Kashubian
502           eq Kashub
503
504       {kaw} : Kawi
505       {kk} : Kazakh
506       {kha} : Khasi
507       {km} : Khmer
508           eq Cambodian.  eq Kampuchean.
509
510       [{khi} : Khoisan (Other)]
511       {kho} : Khotanese
512       {ki} : Kikuyu
513           eq Gikuyu.
514
515       {kmb} : Kimbundu
516       {rw} : Kinyarwanda
517       {ky} : Kirghiz
518       {i-klingon} : Klingon
519       {kv} : Komi
520       {kg} : Kongo
521           (Formerly "kon".)
522
523       {kok} : Konkani
524       {ko} : Korean
525       {kos} : Kosraean
526       {kpe} : Kpelle
527       {kro} : Kru
528       {kj} : Kuanyama
529       {kum} : Kumyk
530       {ku} : Kurdish
531       {kru} : Kurukh
532       {kut} : Kutenai
533       {lad} : Ladino
534           eq Judeo-Spanish.  NOT Ladin (a minority language in Italy).
535
536       {lah} : Lahnda
537           NOT Lamba!
538
539       {lam} : Lamba
540           NOT Lahnda!
541
542       {lo} : Lao
543           eq Laotian.
544
545       {la} : Latin
546           (Historical)  NOT Ladin!  NOT Ladino!
547
548       {lv} : Latvian
549           eq Lettish.
550
551       {lb} : Letzeburgesch
552           eq Luxemburgian, eq Luxemburger.  (Formerly "i-lux".)
553
554       {lez} : Lezghian
555       {li} : Limburgish
556           eq Limburger, eq Limburgan.  NOT Letzeburgesch!
557
558       {ln} : Lingala
559       {lt} : Lithuanian
560       {nds} : Low German
561           eq Low Saxon.  eq Low German.  eq Low Saxon.
562
563       {art-lojban} : Lojban (Artificial)
564       {loz} : Lozi
565       {lu} : Luba-Katanga
566           (Formerly "lub".)
567
568       {lua} : Luba-Lulua
569       {lui} : Luiseno
570           eq Luisen~o.
571
572       {lun} : Lunda
573       {luo} : Luo (Kenya and Tanzania)
574       {lus} : Lushai
575       {mk} : Macedonian
576           eq the modern Slavic language spoken in what was Yugoslavia.  NOT
577           the form of Greek spoken in Greek Macedonia!
578
579       {mad} : Madurese
580       {mag} : Magahi
581       {mai} : Maithili
582       {mak} : Makasar
583       {mg} : Malagasy
584       {ms} : Malay
585           NOT Malayalam!
586
587       {ml} : Malayalam
588           NOT Malay!
589
590       {mt} : Maltese
591       {mnc} : Manchu
592       {mdr} : Mandar
593           NOT Mandarin!
594
595       {man} : Mandingo
596       {mni} : Manipuri
597           eq Meithei.
598
599       [{mno} : Manobo languages]
600       {gv} : Manx
601       {mi} : Maori
602           NOT Mari!
603
604       {mr} : Marathi
605       {chm} : Mari
606           NOT Maori!
607
608       {mh} : Marshall
609           eq Marshallese.
610
611       {mwr} : Marwari
612       {mas} : Masai
613       [{myn} : Mayan languages]
614       {men} : Mende
615       {mic} : Micmac
616       {min} : Minangkabau
617       {i-mingo} : Mingo
618           eq the Irquoian language West Virginia Seneca.  NOT New York
619           Seneca!
620
621       [{mis} : Miscellaneous languages]
622           Don't use this.
623
624       {moh} : Mohawk
625       {mdf} : Moksha
626       {mo} : Moldavian
627           eq Moldovan.
628
629       [{mkh} : Mon-Khmer (Other)]
630       {lol} : Mongo
631       {mn} : Mongolian
632           eq Mongol.
633
634       {mos} : Mossi
635       [{mul} : Multiple languages]
636           Not for normal use.
637
638       [{mun} : Munda languages]
639       {nah} : Nahuatl
640       {nap} : Neapolitan
641       {na} : Nauru
642       {nv} : Navajo
643           eq Navaho.  (Formerly "i-navajo".)
644
645       {nd} : North Ndebele
646       {nr} : South Ndebele
647       {ng} : Ndonga
648       {ne} : Nepali
649           eq Nepalese.  Notable forms: {ne-np} Nepal Nepali; {ne-in} India
650           Nepali.
651
652       {new} : Newari
653       {nia} : Nias
654       [{nic} : Niger-Kordofanian (Other)]
655       [{ssa} : Nilo-Saharan (Other)]
656       {niu} : Niuean
657       {nog} : Nogai
658       {non} : Old Norse
659           (Historical)
660
661       [{nai} : North American Indian]
662           Do not use this.
663
664       {no} : Norwegian
665           Note the two following forms:
666
667       {nb} : Norwegian Bokmal
668           eq Bokmaal, (A form of Norwegian.)  (Formerly "no-bok".)
669
670       {nn} : Norwegian Nynorsk
671           (A form of Norwegian.)  (Formerly "no-nyn".)
672
673       [{nub} : Nubian languages]
674       {nym} : Nyamwezi
675       {nyn} : Nyankole
676       {nyo} : Nyoro
677       {nzi} : Nzima
678       {oc} : Occitan (post 1500)
679           eq Provencal, eq Provencal
680
681       {oj} : Ojibwa
682           eq Ojibwe.  (Formerly "oji".)
683
684       {or} : Oriya
685       {om} : Oromo
686       {osa} : Osage
687       {os} : Ossetian; Ossetic
688       [{oto} : Otomian languages]
689           Group of languages collectively called "Otomi".
690
691       {pal} : Pahlavi
692           eq Pahlevi
693
694       {i-pwn} : Paiwan
695           eq Pariwan
696
697       {pau} : Palauan
698       {pi} : Pali
699           (Historical?)
700
701       {pam} : Pampanga
702       {pag} : Pangasinan
703       {pa} : Panjabi
704           eq Punjabi
705
706       {pap} : Papiamento
707           eq Papiamentu.
708
709       [{paa} : Papuan (Other)]
710       {fa} : Persian
711           eq Farsi.  eq Iranian.
712
713       {peo} : Old Persian (ca.600-400 B.C.)
714       [{phi} : Philippine (Other)]
715       {phn} : Phoenician
716           (Historical)
717
718       {pon} : Pohnpeian
719           NOT Pompeiian!
720
721       {pl} : Polish
722       {pt} : Portuguese
723           eq Portugese.  Notable forms: {pt-pt} Portugal Portuguese; {pt-br}
724           Brazilian Portuguese.
725
726       [{pra} : Prakrit languages]
727       {pro} : Old Provencal (to 1500)
728           eq Old Provencal.  (Historical.)
729
730       {ps} : Pushto
731           eq Pashto.  eq Pushtu.
732
733       {qu} : Quechua
734           eq Quecha.
735
736       {rm} : Raeto-Romance
737           eq Romansh.
738
739       {raj} : Rajasthani
740       {rap} : Rapanui
741       {rar} : Rarotongan
742       [{qaa - qtz} : Reserved for local use.]
743       [{roa} : Romance (Other)]
744           NOT Romanian!  NOT Romany!  NOT Romansh!
745
746       {ro} : Romanian
747           eq Rumanian.  NOT Romany!
748
749       {rom} : Romany
750           eq Rom.  NOT Romanian!
751
752       {rn} : Rundi
753       {ru} : Russian
754           NOT White Russian!  NOT Rusyn!
755
756       [{sal} : Salishan languages]
757           Large language group.
758
759       {sam} : Samaritan Aramaic
760           NOT Aramaic!
761
762       {se} : Northern Sami
763           eq Lappish.  eq Lapp.  eq (Northern) Saami.
764
765       {sma} : Southern Sami
766       {smn} : Inari Sami
767       {smj} : Lule Sami
768       {sms} : Skolt Sami
769       [{smi} : Sami languages (Other)]
770       {sm} : Samoan
771       {sad} : Sandawe
772       {sg} : Sango
773       {sa} : Sanskrit
774           (Historical)
775
776       {sat} : Santali
777       {sc} : Sardinian
778           eq Sard.
779
780       {sas} : Sasak
781       {sco} : Scots
782           NOT Scots Gaelic!
783
784       {sel} : Selkup
785       [{sem} : Semitic (Other)]
786       {sr} : Serbian
787           eq Serb.  NOT Sorbian.
788
789           Notable forms: {sr-Cyrl} : Serbian in Cyrillic script; {sr-Latn} :
790           Serbian in Latin script.
791
792       {srr} : Serer
793       {shn} : Shan
794       {sn} : Shona
795       {sid} : Sidamo
796       {sgn-...} : Sign Languages
797           Always use with a subtag.  Notable forms: {sgn-gb} British Sign
798           Language (BSL); {sgn-ie} Irish Sign Language (ESL); {sgn-ni}
799           Nicaraguan Sign Language (ISN); {sgn-us} American Sign Language
800           (ASL).
801
802           (And so on with other country codes as the subtag.)
803
804       {bla} : Siksika
805           eq Blackfoot.  eq Pikanii.
806
807       {sd} : Sindhi
808       {si} : Sinhalese
809           eq Sinhala.
810
811       [{sit} : Sino-Tibetan (Other)]
812       [{sio} : Siouan languages]
813       {den} : Slave (Athapascan)
814           ("Slavey" is a subform.)
815
816       [{sla} : Slavic (Other)]
817       {sk} : Slovak
818           eq Slovakian.
819
820       {sl} : Slovenian
821           eq Slovene.
822
823       {sog} : Sogdian
824       {so} : Somali
825       {son} : Songhai
826       {snk} : Soninke
827       {wen} : Sorbian languages
828           eq Wendish.  eq Sorb.  eq Lusatian.  eq Wend.  NOT Venda!  NOT
829           Serbian!
830
831       {nso} : Northern Sotho
832       {st} : Southern Sotho
833           eq Sutu.  eq Sesotho.
834
835       [{sai} : South American Indian (Other)]
836       {es} : Spanish
837           Notable forms: {es-ar} Argentine Spanish; {es-bo} Bolivian Spanish;
838           {es-cl} Chilean Spanish; {es-co} Colombian Spanish; {es-do}
839           Dominican Spanish; {es-ec} Ecuadorian Spanish; {es-es} Spain
840           Spanish; {es-gt} Guatemalan Spanish; {es-hn} Honduran Spanish;
841           {es-mx} Mexican Spanish; {es-pa} Panamanian Spanish; {es-pe}
842           Peruvian Spanish; {es-pr} Puerto Rican Spanish; {es-py} Paraguay
843           Spanish; {es-sv} Salvadoran Spanish; {es-us} US Spanish; {es-uy}
844           Uruguayan Spanish; {es-ve} Venezuelan Spanish.
845
846       {suk} : Sukuma
847       {sux} : Sumerian
848           (Historical)
849
850       {su} : Sundanese
851       {sus} : Susu
852       {sw} : Swahili
853           eq Kiswahili
854
855       {ss} : Swati
856       {sv} : Swedish
857           Notable forms: {sv-se} Sweden Swedish; {sv-fi} Finland Swedish.
858
859       {syr} : Syriac
860       {tl} : Tagalog
861       {ty} : Tahitian
862       [{tai} : Tai (Other)]
863           NOT Thai!
864
865       {tg} : Tajik
866       {tmh} : Tamashek
867       {ta} : Tamil
868       {i-tao} : Tao
869           eq Yami.
870
871       {tt} : Tatar
872       {i-tay} : Tayal
873           eq Atayal.  eq Atayan.
874
875       {te} : Telugu
876       {ter} : Tereno
877       {tet} : Tetum
878       {th} : Thai
879           NOT Tai!
880
881       {bo} : Tibetan
882       {tig} : Tigre
883       {ti} : Tigrinya
884       {tem} : Timne
885           eq Themne.  eq Timene.
886
887       {tiv} : Tiv
888       {tli} : Tlingit
889       {tpi} : Tok Pisin
890       {tkl} : Tokelau
891       {tog} : Tonga (Nyasa)
892           NOT Tsonga!
893
894       {to} : Tonga (Tonga Islands)
895           (Pronounced "Tong-a", not "Tong-ga")
896
897           NOT Tsonga!
898
899       {tsi} : Tsimshian
900           eq Sm'algyax
901
902       {ts} : Tsonga
903           NOT Tonga!
904
905       {i-tsu} : Tsou
906       {tn} : Tswana
907           Same as Setswana.
908
909       {tum} : Tumbuka
910       [{tup} : Tupi languages]
911       {tr} : Turkish
912           (Typically in Roman script)
913
914       {ota} : Ottoman Turkish (1500-1928)
915           (Typically in Arabic script)  (Historical)
916
917       {crh} : Crimean Turkish
918           eq Crimean Tatar
919
920       {tk} : Turkmen
921           eq Turkmeni.
922
923       {tvl} : Tuvalu
924       {tyv} : Tuvinian
925           eq Tuvan.  eq Tuvin.
926
927       {tw} : Twi
928       {udm} : Udmurt
929       {uga} : Ugaritic
930           NOT Ugric!
931
932       {ug} : Uighur
933       {uk} : Ukrainian
934       {umb} : Umbundu
935       {und} : Undetermined
936           Not a tag for normal use.
937
938       {ur} : Urdu
939       {uz} : Uzbek
940           eq Oezbek
941
942           Notable forms: {uz-Cyrl} Uzbek in Cyrillic script; {uz-Latn} Uzbek
943           in Latin script.
944
945       {vai} : Vai
946       {ve} : Venda
947           NOT Wendish!  NOT Wend!  NOT Avestan!  (Formerly "ven".)
948
949       {vi} : Vietnamese
950           eq Viet.
951
952       {vo} : Volapuk
953           eq Volapuek.  (Artificial)
954
955       {vot} : Votic
956           eq Votian.  eq Vod.
957
958       [{wak} : Wakashan languages]
959       {wa} : Walloon
960       {wal} : Walamo
961           eq Wolaytta.
962
963       {war} : Waray
964           Presumably the Philippine language Waray-Waray (Samaren~o), not the
965           smaller Philippine language Waray Sorsogon, nor the extinct
966           Australian language Waray.
967
968       {was} : Washo
969           eq Washoe
970
971       {cy} : Welsh
972       {wo} : Wolof
973       {x-...} : Unregistered (Semi-Private Use)
974           "x-" is a prefix for language tags that are not registered with ISO
975           or IANA.  Example, x-double-dutch
976
977       {xh} : Xhosa
978       {sah} : Yakut
979       {yao} : Yao
980           (The Yao in Malawi?)
981
982       {yap} : Yapese
983           eq Yap
984
985       {ii} : Sichuan Yi
986       {yi} : Yiddish
987           Formerly "ji".  Usually in Hebrew script.
988
989           Notable forms: {yi-latn} Yiddish in Latin script
990
991       {yo} : Yoruba
992       [{ypk} : Yupik languages]
993           Several "Eskimo" languages.
994
995       {znd} : Zande
996       [{zap} : Zapotec]
997           (A group of languages.)
998
999       {zen} : Zenaga
1000           NOT Zend.
1001
1002       {za} : Zhuang
1003       {zu} : Zulu
1004       {zun} : Zuni
1005           eq Zun~i
1006

SEE ALSO

1008       I18N::LangTags and its "See Also" section.
1009
1011       Copyright (c) 2001+ Sean M. Burke. All rights reserved.
1012
1013       You can redistribute and/or modify this document under the same terms
1014       as Perl itself.
1015
1016       This document is provided in the hope that it will be useful, but
1017       without any warranty; without even the implied warranty of accuracy,
1018       authoritativeness, completeness, merchantability, or fitness for a
1019       particular purpose.
1020
1021       Email any corrections or questions to me.
1022

AUTHOR

1024       Sean M. Burke, sburke@cpan.org
1025
1026
1027
1028perl v5.32.1                      2021-03-31         I18N::LangTags::List(3pm)
Impressum