1I18N::LangTags::List(3pmP)erl Programmers Reference GuidIe18N::LangTags::List(3pm)
2
3
4

NAME

6       I18N::LangTags::List -- tags and names for human languages
7

SYNOPSIS

9         use I18N::LangTags::List;
10         print "Parlez-vous... ", join(', ',
11             I18N::LangTags::List::name('elx') ⎪⎪ 'unknown_language',
12             I18N::LangTags::List::name('ar-Kw') ⎪⎪ 'unknown_language',
13             I18N::LangTags::List::name('en') ⎪⎪ 'unknown_language',
14             I18N::LangTags::List::name('en-CA') ⎪⎪ 'unknown_language',
15           ), "?\n";
16
17       prints:
18
19         Parlez-vous... Elamite, Kuwait Arabic, English, Canadian English?
20

DESCRIPTION

22       This module provides a function "I18N::LangTags::List::name( langtag )
23       " that takes a language tag (see I18N::LangTags) and returns the best
24       attempt at an English name for it, or undef if it can't make sense of
25       the tag.
26
27       The function I18N::LangTags::List::name(...) is not exported.
28
29       This module also provides a function "I18N::LangTags::List::is_decent(
30       langtag )" that returns true iff the language tag is syntactically
31       valid and is for general use (like "fr" or "fr-ca", below).  That is,
32       it returns false for tags that are syntactically invalid and for tags,
33       like "aus", that are listed in brackets below.  This function is not
34       exported.
35
36       The map of tags-to-names that it uses is accessable as %I18N::Lang‐
37       Tags::List::Name, and it's the same as the list that follows in this
38       documentation, which should be useful to you even if you don't use this
39       module.
40

ABOUT LANGUAGE TAGS

42       Internet language tags, as defined in RFC 3066, are a formalism for
43       denoting human languages.  The two-letter ISO 639-1 language codes are
44       well known (as "en" for English), as are their forms when qualified by
45       a country code ("en-US").  Less well-known are the arbitrary-length
46       non-ISO codes (like "i-mingo"), and the recently (in 2001) introduced
47       three-letter ISO-639-2 codes.
48
49       Remember these important facts:
50
51       ·   Language tags are not locale IDs.  A locale ID is written with a
52           "_" instead of a "-", (almost?) always matches "m/^\w\w_\w\w\b/",
53           and means something different than a language tag.  A language tag
54           denotes a language.  A locale ID denotes a language as used in a
55           particular place, in combination with non-linguistic location-spe‐
56           cific information such as what currency is used there.  Locales
57           also often denote character set information, as in
58           "en_US.ISO8859-1".
59
60       ·   Language tags are not for computer languages.
61
62       ·   "Dialect" is not a useful term, since there is no objective crite‐
63           rion for establishing when two language-forms are dialects of
64           eachother, or are separate languages.
65
66       ·   Language tags are not case-sensitive.  en-US, en-us, En-Us, etc.,
67           are all the same tag, and denote the same language.
68
69       ·   Not every language tag really refers to a single language.  Some
70           language tags refer to conditions: i-default (system-message text
71           in English plus maybe other languages), und (undetermined lan‐
72           guage).  Others (notably lots of the three-letter codes) are bibli‐
73           ographic tags that classify whole groups of languages, as with cus
74           "Cushitic (Other)" (i.e., a language that has been classed as
75           Cushtic, but which has no more specific code) or the even less lin‐
76           guistically coherent sai for "South American Indian (Other)".
77           Though useful in bibliography, SUCH TAGS ARE NOT FOR GENERAL USE.
78           For further guidance, email me.
79
80       ·   Language tags are not country codes.  In fact, they are often dis‐
81           tinct codes, as with language tag ja for Japanese, and ISO 3166
82           country code ".jp" for Japan.
83

LIST OF LANGUAGES

85       The first part of each item is the language tag, between {...}.  It is
86       followed by an English name for the language or language-group.  Lan‐
87       guage tags that I judge to be not for general use, are bracketed.
88
89       This list is in alphabetical order by English name of the language.
90
91       {ab} : Abkhazian
92           eq Abkhaz
93
94       {ace} : Achinese
95       {ach} : Acoli
96       {ada} : Adangme
97       {ady} : Adyghe
98           eq Adygei
99
100       {aa} : Afar
101       {afh} : Afrihili
102           (Artificial)
103
104       {af} : Afrikaans
105       [{afa} : Afro-Asiatic (Other)]
106       {ak} : Akan
107           (Formerly "aka".)
108
109       {akk} : Akkadian
110           (Historical)
111
112       {sq} : Albanian
113       {ale} : Aleut
114       [{alg} : Algonquian languages]
115           NOT Algonquin!
116
117       [{tut} : Altaic (Other)]
118       {am} : Amharic
119           NOT Aramaic!
120
121       {i-ami} : Ami
122           eq Amis.  eq 'Amis.  eq Pangca.
123
124       [{apa} : Apache languages]
125       {ar} : Arabic
126           Many forms are mutually un-intelligible in spoken media.  Notable
127           forms: {ar-ae} UAE Arabic; {ar-bh} Bahrain Arabic; {ar-dz} Algerian
128           Arabic; {ar-eg} Egyptian Arabic; {ar-iq} Iraqi Arabic; {ar-jo} Jor‐
129           danian Arabic; {ar-kw} Kuwait Arabic; {ar-lb} Lebanese Arabic;
130           {ar-ly} Libyan Arabic; {ar-ma} Moroccan Arabic; {ar-om} Omani Ara‐
131           bic; {ar-qa} Qatari Arabic; {ar-sa} Sauda Arabic; {ar-sy} Syrian
132           Arabic; {ar-tn} Tunisian Arabic; {ar-ye} Yemen Arabic.
133
134       {arc} : Aramaic
135           NOT Amharic!  NOT Samaritan Aramaic!
136
137       {arp} : Arapaho
138       {arn} : Araucanian
139       {arw} : Arawak
140       {hy} : Armenian
141       {an} : Aragonese
142       [{art} : Artificial (Other)]
143       {ast} : Asturian
144           eq Bable.
145
146       {as} : Assamese
147       [{ath} : Athapascan languages]
148           eq Athabaskan.  eq Athapaskan.  eq Athabascan.
149
150       [{aus} : Australian languages]
151       [{map} : Austronesian (Other)]
152       {av} : Avaric
153           (Formerly "ava".)
154
155       {ae} : Avestan
156           eq Zend
157
158       {awa} : Awadhi
159       {ay} : Aymara
160       {az} : Azerbaijani
161           eq Azeri
162
163           Notable forms: {az-Arab} Azerbaijani in Arabic script; {az-Cyrl}
164           Azerbaijani in Cyrillic script; {az-Latn} Azerbaijani in Latin
165           script.
166
167       {ban} : Balinese
168       [{bat} : Baltic (Other)]
169       {bal} : Baluchi
170       {bm} : Bambara
171           (Formerly "bam".)
172
173       [{bai} : Bamileke languages]
174       {bad} : Banda
175       [{bnt} : Bantu (Other)]
176       {bas} : Basa
177       {ba} : Bashkir
178       {eu} : Basque
179       {btk} : Batak (Indonesia)
180       {bej} : Beja
181       {be} : Belarusian
182           eq Belarussian.  eq Byelarussian.  eq Belorussian.  eq Byelorus‐
183           sian.  eq White Russian.  eq White Ruthenian.  NOT Ruthenian!
184
185       {bem} : Bemba
186       {bn} : Bengali
187           eq Bangla.
188
189       [{ber} : Berber (Other)]
190       {bho} : Bhojpuri
191       {bh} : Bihari
192       {bik} : Bikol
193       {bin} : Bini
194       {bi} : Bislama
195           eq Bichelamar.
196
197       {bs} : Bosnian
198       {bra} : Braj
199       {br} : Breton
200       {bug} : Buginese
201       {bg} : Bulgarian
202       {i-bnn} : Bunun
203       {bua} : Buriat
204       {my} : Burmese
205       {cad} : Caddo
206       {car} : Carib
207       {ca} : Catalan
208           eq Catalan.  eq Catalonian.
209
210       [{cau} : Caucasian (Other)]
211       {ceb} : Cebuano
212       [{cel} : Celtic (Other)]
213           Notable forms: {cel-gaulish} Gaulish (Historical)
214
215       [{cai} : Central American Indian (Other)]
216       {chg} : Chagatai
217           (Historical?)
218
219       [{cmc} : Chamic languages]
220       {ch} : Chamorro
221       {ce} : Chechen
222       {chr} : Cherokee
223           eq Tsalagi
224
225       {chy} : Cheyenne
226       {chb} : Chibcha
227           (Historical)  NOT Chibchan (which is a language family).
228
229       {ny} : Chichewa
230           eq Nyanja.  eq Chinyanja.
231
232       {zh} : Chinese
233           Many forms are mutually un-intelligible in spoken media.  Notable
234           forms: {zh-Hans} Chinese, in simplified script; {zh-Hant} Chinese,
235           in traditional script; {zh-tw} Taiwan Chinese; {zh-cn} PRC Chinese;
236           {zh-sg} Singapore Chinese; {zh-mo} Macau Chinese; {zh-hk} Hong Kong
237           Chinese; {zh-guoyu} Mandarin [Putonghua/Guoyu]; {zh-hakka} Hakka
238           [formerly "i-hakka"]; {zh-min} Hokkien; {zh-min-nan} Southern
239           Hokkien; {zh-wuu} Shanghaiese; {zh-xiang} Hunanese; {zh-gan} Gan;
240           {zh-yue} Cantonese.
241
242       {chn} : Chinook Jargon
243           eq Chinook Wawa.
244
245       {chp} : Chipewyan
246       {cho} : Choctaw
247       {cu} : Church Slavic
248           eq Old Church Slavonic.
249
250       {chk} : Chuukese
251           eq Trukese.  eq Chuuk.  eq Truk.  eq Ruk.
252
253       {cv} : Chuvash
254       {cop} : Coptic
255       {kw} : Cornish
256       {co} : Corsican
257           eq Corse.
258
259       {cr} : Cree
260           NOT Creek!  (Formerly "cre".)
261
262       {mus} : Creek
263           NOT Cree!
264
265       [{cpe} : English-based Creoles and pidgins (Other)]
266       [{cpf} : French-based Creoles and pidgins (Other)]
267       [{cpp} : Portuguese-based Creoles and pidgins (Other)]
268       [{crp} : Creoles and pidgins (Other)]
269       {hr} : Croatian
270           eq Croat.
271
272       [{cus} : Cushitic (Other)]
273       {cs} : Czech
274       {dak} : Dakota
275           eq Nakota.  eq Latoka.
276
277       {da} : Danish
278       {dar} : Dargwa
279       {day} : Dayak
280       {i-default} : Default (Fallthru) Language
281           Defined in RFC 2277, this is for tagging text (which must include
282           English text, and might/should include text in other appropriate
283           languages) that is emitted in a context where language-negotiation
284           wasn't possible -- in SMTP mail failure messages, for example.
285
286       {del} : Delaware
287       {din} : Dinka
288       {dv} : Divehi
289           eq Maldivian.  (Formerly "div".)
290
291       {doi} : Dogri
292           NOT Dogrib!
293
294       {dgr} : Dogrib
295           NOT Dogri!
296
297       [{dra} : Dravidian (Other)]
298       {dua} : Duala
299       {nl} : Dutch
300           eq Netherlander.  Notable forms: {nl-nl} Netherlands Dutch; {nl-be}
301           Belgian Dutch.
302
303       {dum} : Middle Dutch (ca.1050-1350)
304           (Historical)
305
306       {dyu} : Dyula
307       {dz} : Dzongkha
308       {efi} : Efik
309       {egy} : Ancient Egyptian
310           (Historical)
311
312       {eka} : Ekajuk
313       {elx} : Elamite
314           (Historical)
315
316       {en} : English
317           Notable forms: {en-au} Australian English; {en-bz} Belize English;
318           {en-ca} Canadian English; {en-gb} UK English; {en-ie} Irish Eng‐
319           lish; {en-jm} Jamaican English; {en-nz} New Zealand English;
320           {en-ph} Philippine English; {en-tt} Trinidad English; {en-us} US
321           English; {en-za} South African English; {en-zw} Zimbabwe English.
322
323       {enm} : Old English (1100-1500)
324           (Historical)
325
326       {ang} : Old English (ca.450-1100)
327           eq Anglo-Saxon.  (Historical)
328
329       {i-enochian} : Enochian (Artificial)
330       {myv} : Erzya
331       {eo} : Esperanto
332           (Artificial)
333
334       {et} : Estonian
335       {ee} : Ewe
336           (Formerly "ewe".)
337
338       {ewo} : Ewondo
339       {fan} : Fang
340       {fat} : Fanti
341       {fo} : Faroese
342       {fj} : Fijian
343       {fi} : Finnish
344       [{fiu} : Finno-Ugrian (Other)]
345           eq Finno-Ugric.  NOT Ugaritic!
346
347       {fon} : Fon
348       {fr} : French
349           Notable forms: {fr-fr} France French; {fr-be} Belgian French;
350           {fr-ca} Canadian French; {fr-ch} Swiss French; {fr-lu} Luxembourg
351           French; {fr-mc} Monaco French.
352
353       {frm} : Middle French (ca.1400-1600)
354           (Historical)
355
356       {fro} : Old French (842-ca.1400)
357           (Historical)
358
359       {fy} : Frisian
360       {fur} : Friulian
361       {ff} : Fulah
362           (Formerly "ful".)
363
364       {gaa} : Ga
365       {gd} : Scots Gaelic
366           NOT Scots!
367
368       {gl} : Gallegan
369           eq Galician
370
371       {lg} : Ganda
372           (Formerly "lug".)
373
374       {gay} : Gayo
375       {gba} : Gbaya
376       {gez} : Geez
377           eq Ge'ez
378
379       {ka} : Georgian
380       {de} : German
381           Notable forms: {de-at} Austrian German; {de-be} Belgian German;
382           {de-ch} Swiss German; {de-de} Germany German; {de-li} Liechtenstein
383           German; {de-lu} Luxembourg German.
384
385       {gmh} : Middle High German (ca.1050-1500)
386           (Historical)
387
388       {goh} : Old High German (ca.750-1050)
389           (Historical)
390
391       [{gem} : Germanic (Other)]
392       {gil} : Gilbertese
393       {gon} : Gondi
394       {gor} : Gorontalo
395       {got} : Gothic
396           (Historical)
397
398       {grb} : Grebo
399       {grc} : Ancient Greek
400           (Historical)  (Until 15th century or so.)
401
402       {el} : Modern Greek
403           (Since 15th century or so.)
404
405       {gn} : Guarani
406           Guarani
407
408       {gu} : Gujarati
409       {gwi} : Gwich'in
410           eq Gwichin
411
412       {hai} : Haida
413       {ht} : Haitian
414           eq Haitian Creole
415
416       {ha} : Hausa
417       {haw} : Hawaiian
418           Hawai'ian
419
420       {he} : Hebrew
421           (Formerly "iw".)
422
423       {hz} : Herero
424       {hil} : Hiligaynon
425       {him} : Himachali
426       {hi} : Hindi
427       {ho} : Hiri Motu
428       {hit} : Hittite
429           (Historical)
430
431       {hmn} : Hmong
432       {hu} : Hungarian
433       {hup} : Hupa
434       {iba} : Iban
435       {is} : Icelandic
436       {io} : Ido
437           (Artificial)
438
439       {ig} : Igbo
440           (Formerly "ibo".)
441
442       {ijo} : Ijo
443       {ilo} : Iloko
444       [{inc} : Indic (Other)]
445       [{ine} : Indo-European (Other)]
446       {id} : Indonesian
447           (Formerly "in".)
448
449       {inh} : Ingush
450       {ia} : Interlingua (International Auxiliary Language Association)
451           (Artificial)  NOT Interlingue!
452
453       {ie} : Interlingue
454           (Artificial)  NOT Interlingua!
455
456       {iu} : Inuktitut
457           A subform of "Eskimo".
458
459       {ik} : Inupiaq
460           A subform of "Eskimo".
461
462       [{ira} : Iranian (Other)]
463       {ga} : Irish
464       {mga} : Middle Irish (900-1200)
465           (Historical)
466
467       {sga} : Old Irish (to 900)
468           (Historical)
469
470       [{iro} : Iroquoian languages]
471       {it} : Italian
472           Notable forms: {it-it} Italy Italian; {it-ch} Swiss Italian.
473
474       {ja} : Japanese
475           (NOT "jp"!)
476
477       {jv} : Javanese
478           (Formerly "jw" because of a typo.)
479
480       {jrb} : Judeo-Arabic
481       {jpr} : Judeo-Persian
482       {kbd} : Kabardian
483       {kab} : Kabyle
484       {kac} : Kachin
485       {kl} : Kalaallisut
486           eq Greenlandic "Eskimo"
487
488       {xal} : Kalmyk
489       {kam} : Kamba
490       {kn} : Kannada
491           eq Kanarese.  NOT Canadian!
492
493       {kr} : Kanuri
494           (Formerly "kau".)
495
496       {krc} : Karachay-Balkar
497       {kaa} : Kara-Kalpak
498       {kar} : Karen
499       {ks} : Kashmiri
500       {csb} : Kashubian
501           eq Kashub
502
503       {kaw} : Kawi
504       {kk} : Kazakh
505       {kha} : Khasi
506       {km} : Khmer
507           eq Cambodian.  eq Kampuchean.
508
509       [{khi} : Khoisan (Other)]
510       {kho} : Khotanese
511       {ki} : Kikuyu
512           eq Gikuyu.
513
514       {kmb} : Kimbundu
515       {rw} : Kinyarwanda
516       {ky} : Kirghiz
517       {i-klingon} : Klingon
518       {kv} : Komi
519       {kg} : Kongo
520           (Formerly "kon".)
521
522       {kok} : Konkani
523       {ko} : Korean
524       {kos} : Kosraean
525       {kpe} : Kpelle
526       {kro} : Kru
527       {kj} : Kuanyama
528       {kum} : Kumyk
529       {ku} : Kurdish
530       {kru} : Kurukh
531       {kut} : Kutenai
532       {lad} : Ladino
533           eq Judeo-Spanish.  NOT Ladin (a minority language in Italy).
534
535       {lah} : Lahnda
536           NOT Lamba!
537
538       {lam} : Lamba
539           NOT Lahnda!
540
541       {lo} : Lao
542           eq Laotian.
543
544       {la} : Latin
545           (Historical)  NOT Ladin!  NOT Ladino!
546
547       {lv} : Latvian
548           eq Lettish.
549
550       {lb} : Letzeburgesch
551           eq Luxemburgian, eq Luxemburger.  (Formerly "i-lux".)
552
553       {lez} : Lezghian
554       {li} : Limburgish
555           eq Limburger, eq Limburgan.  NOT Letzeburgesch!
556
557       {ln} : Lingala
558       {lt} : Lithuanian
559       {nds} : Low German
560           eq Low Saxon.  eq Low German.  eq Low Saxon.
561
562       {art-lojban} : Lojban (Artificial)
563       {loz} : Lozi
564       {lu} : Luba-Katanga
565           (Formerly "lub".)
566
567       {lua} : Luba-Lulua
568       {lui} : Luiseno
569           eq Luisen~o.
570
571       {lun} : Lunda
572       {luo} : Luo (Kenya and Tanzania)
573       {lus} : Lushai
574       {mk} : Macedonian
575           eq the modern Slavic language spoken in what was Yugoslavia.  NOT
576           the form of Greek spoken in Greek Macedonia!
577
578       {mad} : Madurese
579       {mag} : Magahi
580       {mai} : Maithili
581       {mak} : Makasar
582       {mg} : Malagasy
583       {ms} : Malay
584           NOT Malayalam!
585
586       {ml} : Malayalam
587           NOT Malay!
588
589       {mt} : Maltese
590       {mnc} : Manchu
591       {mdr} : Mandar
592           NOT Mandarin!
593
594       {man} : Mandingo
595       {mni} : Manipuri
596           eq Meithei.
597
598       [{mno} : Manobo languages]
599       {gv} : Manx
600       {mi} : Maori
601           NOT Mari!
602
603       {mr} : Marathi
604       {chm} : Mari
605           NOT Maori!
606
607       {mh} : Marshall
608           eq Marshallese.
609
610       {mwr} : Marwari
611       {mas} : Masai
612       [{myn} : Mayan languages]
613       {men} : Mende
614       {mic} : Micmac
615       {min} : Minangkabau
616       {i-mingo} : Mingo
617           eq the Irquoian language West Virginia Seneca.  NOT New York
618           Seneca!
619
620       [{mis} : Miscellaneous languages]
621           Don't use this.
622
623       {moh} : Mohawk
624       {mdf} : Moksha
625       {mo} : Moldavian
626           eq Moldovan.
627
628       [{mkh} : Mon-Khmer (Other)]
629       {lol} : Mongo
630       {mn} : Mongolian
631           eq Mongol.
632
633       {mos} : Mossi
634       [{mul} : Multiple languages]
635           Not for normal use.
636
637       [{mun} : Munda languages]
638       {nah} : Nahuatl
639       {nap} : Neapolitan
640       {na} : Nauru
641       {nv} : Navajo
642           eq Navaho.  (Formerly "i-navajo".)
643
644       {nd} : North Ndebele
645       {nr} : South Ndebele
646       {ng} : Ndonga
647       {ne} : Nepali
648           eq Nepalese.  Notable forms: {ne-np} Nepal Nepali; {ne-in} India
649           Nepali.
650
651       {new} : Newari
652       {nia} : Nias
653       [{nic} : Niger-Kordofanian (Other)]
654       [{ssa} : Nilo-Saharan (Other)]
655       {niu} : Niuean
656       {nog} : Nogai
657       {non} : Old Norse
658           (Historical)
659
660       [{nai} : North American Indian]
661           Do not use this.
662
663       {no} : Norwegian
664           Note the two following forms:
665
666       {nb} : Norwegian Bokmal
667           eq Bokmaal, (A form of Norwegian.)  (Formerly "no-bok".)
668
669       {nn} : Norwegian Nynorsk
670           (A form of Norwegian.)  (Formerly "no-nyn".)
671
672       [{nub} : Nubian languages]
673       {nym} : Nyamwezi
674       {nyn} : Nyankole
675       {nyo} : Nyoro
676       {nzi} : Nzima
677       {oc} : Occitan (post 1500)
678           eq Provencal, eq Provencal
679
680       {oj} : Ojibwa
681           eq Ojibwe.  (Formerly "oji".)
682
683       {or} : Oriya
684       {om} : Oromo
685       {osa} : Osage
686       {os} : Ossetian; Ossetic
687       [{oto} : Otomian languages]
688           Group of languages collectively called "Otomi".
689
690       {pal} : Pahlavi
691           eq Pahlevi
692
693       {i-pwn} : Paiwan
694           eq Pariwan
695
696       {pau} : Palauan
697       {pi} : Pali
698           (Historical?)
699
700       {pam} : Pampanga
701       {pag} : Pangasinan
702       {pa} : Panjabi
703           eq Punjabi
704
705       {pap} : Papiamento
706           eq Papiamentu.
707
708       [{paa} : Papuan (Other)]
709       {fa} : Persian
710           eq Farsi.  eq Iranian.
711
712       {peo} : Old Persian (ca.600-400 B.C.)
713       [{phi} : Philippine (Other)]
714       {phn} : Phoenician
715           (Historical)
716
717       {pon} : Pohnpeian
718           NOT Pompeiian!
719
720       {pl} : Polish
721       {pt} : Portuguese
722           eq Portugese.  Notable forms: {pt-pt} Portugal Portuguese; {pt-br}
723           Brazilian Portuguese.
724
725       [{pra} : Prakrit languages]
726       {pro} : Old Provencal (to 1500)
727           eq Old Provencal.  (Historical.)
728
729       {ps} : Pushto
730           eq Pashto.  eq Pushtu.
731
732       {qu} : Quechua
733           eq Quecha.
734
735       {rm} : Raeto-Romance
736           eq Romansh.
737
738       {raj} : Rajasthani
739       {rap} : Rapanui
740       {rar} : Rarotongan
741       [{qaa - qtz} : Reserved for local use.]
742       [{roa} : Romance (Other)]
743           NOT Romanian!  NOT Romany!  NOT Romansh!
744
745       {ro} : Romanian
746           eq Rumanian.  NOT Romany!
747
748       {rom} : Romany
749           eq Rom.  NOT Romanian!
750
751       {rn} : Rundi
752       {ru} : Russian
753           NOT White Russian!  NOT Rusyn!
754
755       [{sal} : Salishan languages]
756           Large language group.
757
758       {sam} : Samaritan Aramaic
759           NOT Aramaic!
760
761       {se} : Northern Sami
762           eq Lappish.  eq Lapp.  eq (Northern) Saami.
763
764       {sma} : Southern Sami
765       {smn} : Inari Sami
766       {smj} : Lule Sami
767       {sms} : Skolt Sami
768       [{smi} : Sami languages (Other)]
769       {sm} : Samoan
770       {sad} : Sandawe
771       {sg} : Sango
772       {sa} : Sanskrit
773           (Historical)
774
775       {sat} : Santali
776       {sc} : Sardinian
777           eq Sard.
778
779       {sas} : Sasak
780       {sco} : Scots
781           NOT Scots Gaelic!
782
783       {sel} : Selkup
784       [{sem} : Semitic (Other)]
785       {sr} : Serbian
786           eq Serb.  NOT Sorbian.
787
788           Notable forms: {sr-Cyrl} : Serbian in Cyrillic script; {sr-Latn} :
789           Serbian in Latin script.
790
791       {srr} : Serer
792       {shn} : Shan
793       {sn} : Shona
794       {sid} : Sidamo
795       {sgn-...} : Sign Languages
796           Always use with a subtag.  Notable forms: {sgn-gb} British Sign
797           Language (BSL); {sgn-ie} Irish Sign Language (ESL); {sgn-ni}
798           Nicaraguan Sign Language (ISN); {sgn-us} American Sign Language
799           (ASL).
800
801           (And so on with other country codes as the subtag.)
802
803       {bla} : Siksika
804           eq Blackfoot.  eq Pikanii.
805
806       {sd} : Sindhi
807       {si} : Sinhalese
808           eq Sinhala.
809
810       [{sit} : Sino-Tibetan (Other)]
811       [{sio} : Siouan languages]
812       {den} : Slave (Athapascan)
813           ("Slavey" is a subform.)
814
815       [{sla} : Slavic (Other)]
816       {sk} : Slovak
817           eq Slovakian.
818
819       {sl} : Slovenian
820           eq Slovene.
821
822       {sog} : Sogdian
823       {so} : Somali
824       {son} : Songhai
825       {snk} : Soninke
826       {wen} : Sorbian languages
827           eq Wendish.  eq Sorb.  eq Lusatian.  eq Wend.  NOT Venda!  NOT Ser‐
828           bian!
829
830       {nso} : Northern Sotho
831       {st} : Southern Sotho
832           eq Sutu.  eq Sesotho.
833
834       [{sai} : South American Indian (Other)]
835       {es} : Spanish
836           Notable forms: {es-ar} Argentine Spanish; {es-bo} Bolivian Spanish;
837           {es-cl} Chilean Spanish; {es-co} Colombian Spanish; {es-do} Domini‐
838           can Spanish; {es-ec} Ecuadorian Spanish; {es-es} Spain Spanish;
839           {es-gt} Guatemalan Spanish; {es-hn} Honduran Spanish; {es-mx} Mexi‐
840           can Spanish; {es-pa} Panamanian Spanish; {es-pe} Peruvian Spanish;
841           {es-pr} Puerto Rican Spanish; {es-py} Paraguay Spanish; {es-sv}
842           Salvadoran Spanish; {es-us} US Spanish; {es-uy} Uruguayan Spanish;
843           {es-ve} Venezuelan Spanish.
844
845       {suk} : Sukuma
846       {sux} : Sumerian
847           (Historical)
848
849       {su} : Sundanese
850       {sus} : Susu
851       {sw} : Swahili
852           eq Kiswahili
853
854       {ss} : Swati
855       {sv} : Swedish
856           Notable forms: {sv-se} Sweden Swedish; {sv-fi} Finland Swedish.
857
858       {syr} : Syriac
859       {tl} : Tagalog
860       {ty} : Tahitian
861       [{tai} : Tai (Other)]
862           NOT Thai!
863
864       {tg} : Tajik
865       {tmh} : Tamashek
866       {ta} : Tamil
867       {i-tao} : Tao
868           eq Yami.
869
870       {tt} : Tatar
871       {i-tay} : Tayal
872           eq Atayal.  eq Atayan.
873
874       {te} : Telugu
875       {ter} : Tereno
876       {tet} : Tetum
877       {th} : Thai
878           NOT Tai!
879
880       {bo} : Tibetan
881       {tig} : Tigre
882       {ti} : Tigrinya
883       {tem} : Timne
884           eq Themne.  eq Timene.
885
886       {tiv} : Tiv
887       {tli} : Tlingit
888       {tpi} : Tok Pisin
889       {tkl} : Tokelau
890       {tog} : Tonga (Nyasa)
891           NOT Tsonga!
892
893       {to} : Tonga (Tonga Islands)
894           (Pronounced "Tong-a", not "Tong-ga")
895
896           NOT Tsonga!
897
898       {tsi} : Tsimshian
899           eq Sm'algyax
900
901       {ts} : Tsonga
902           NOT Tonga!
903
904       {i-tsu} : Tsou
905       {tn} : Tswana
906           Same as Setswana.
907
908       {tum} : Tumbuka
909       [{tup} : Tupi languages]
910       {tr} : Turkish
911           (Typically in Roman script)
912
913       {ota} : Ottoman Turkish (1500-1928)
914           (Typically in Arabic script)  (Historical)
915
916       {crh} : Crimean Turkish
917           eq Crimean Tatar
918
919       {tk} : Turkmen
920           eq Turkmeni.
921
922       {tvl} : Tuvalu
923       {tyv} : Tuvinian
924           eq Tuvan.  eq Tuvin.
925
926       {tw} : Twi
927       {udm} : Udmurt
928       {uga} : Ugaritic
929           NOT Ugric!
930
931       {ug} : Uighur
932       {uk} : Ukrainian
933       {umb} : Umbundu
934       {und} : Undetermined
935           Not a tag for normal use.
936
937       {ur} : Urdu
938       {uz} : Uzbek
939           eq Oezbek
940
941           Notable forms: {uz-Cyrl} Uzbek in Cyrillic script; {uz-Latn} Uzbek
942           in Latin script.
943
944       {vai} : Vai
945       {ve} : Venda
946           NOT Wendish!  NOT Wend!  NOT Avestan!  (Formerly "ven".)
947
948       {vi} : Vietnamese
949           eq Viet.
950
951       {vo} : Volapuk
952           eq Volapuek.  (Artificial)
953
954       {vot} : Votic
955           eq Votian.  eq Vod.
956
957       [{wak} : Wakashan languages]
958       {wa} : Walloon
959       {wal} : Walamo
960           eq Wolaytta.
961
962       {war} : Waray
963           Presumably the Philippine language Waray-Waray (Samaren~o), not the
964           smaller Philippine language Waray Sorsogon, nor the extinct Aus‐
965           tralian language Waray.
966
967       {was} : Washo
968           eq Washoe
969
970       {cy} : Welsh
971       {wo} : Wolof
972       {x-...} : Unregistered (Semi-Private Use)
973           "x-" is a prefix for language tags that are not registered with ISO
974           or IANA.  Example, x-double-dutch
975
976       {xh} : Xhosa
977       {sah} : Yakut
978       {yao} : Yao
979           (The Yao in Malawi?)
980
981       {yap} : Yapese
982           eq Yap
983
984       {ii} : Sichuan Yi
985       {yi} : Yiddish
986           Formerly "ji".  Usually in Hebrew script.
987
988           Notable forms: {yi-latn} Yiddish in Latin script
989
990       {yo} : Yoruba
991       [{ypk} : Yupik languages]
992           Several "Eskimo" languages.
993
994       {znd} : Zande
995       [{zap} : Zapotec]
996           (A group of languages.)
997
998       {zen} : Zenaga
999           NOT Zend.
1000
1001       {za} : Zhuang
1002       {zu} : Zulu
1003       {zun} : Zuni
1004           eq Zun~i
1005

SEE ALSO

1007       I18N::LangTags and its "See Also" section.
1008
1010       Copyright (c) 2001+ Sean M. Burke. All rights reserved.
1011
1012       You can redistribute and/or modify this document under the same terms
1013       as Perl itself.
1014
1015       This document is provided in the hope that it will be useful, but with‐
1016       out any warranty; without even the implied warranty of accuracy,
1017       authoritativeness, completeness, merchantability, or fitness for a par‐
1018       ticular purpose.
1019
1020       Email any corrections or questions to me.
1021

AUTHOR

1023       Sean M. Burke, sburke@cpan.org
1024
1025
1026
1027perl v5.8.8                       2001-09-21         I18N::LangTags::List(3pm)
Impressum