1I18N::LangTags::List(3pmP)erl Programmers Reference GuidIe18N::LangTags::List(3pm)
2
3
4
6 I18N::LangTags::List -- tags and names for human languages
7
9 use I18N::LangTags::List;
10 print "Parlez-vous... ", join(', ',
11 I18N::LangTags::List::name('elx') ⎪⎪ 'unknown_language',
12 I18N::LangTags::List::name('ar-Kw') ⎪⎪ 'unknown_language',
13 I18N::LangTags::List::name('en') ⎪⎪ 'unknown_language',
14 I18N::LangTags::List::name('en-CA') ⎪⎪ 'unknown_language',
15 ), "?\n";
16
17 prints:
18
19 Parlez-vous... Elamite, Kuwait Arabic, English, Canadian English?
20
22 This module provides a function "I18N::LangTags::List::name( langtag )
23 " that takes a language tag (see I18N::LangTags) and returns the best
24 attempt at an English name for it, or undef if it can't make sense of
25 the tag.
26
27 The function I18N::LangTags::List::name(...) is not exported.
28
29 This module also provides a function "I18N::LangTags::List::is_decent(
30 langtag )" that returns true iff the language tag is syntactically
31 valid and is for general use (like "fr" or "fr-ca", below). That is,
32 it returns false for tags that are syntactically invalid and for tags,
33 like "aus", that are listed in brackets below. This function is not
34 exported.
35
36 The map of tags-to-names that it uses is accessable as %I18N::Lang‐
37 Tags::List::Name, and it's the same as the list that follows in this
38 documentation, which should be useful to you even if you don't use this
39 module.
40
42 Internet language tags, as defined in RFC 3066, are a formalism for
43 denoting human languages. The two-letter ISO 639-1 language codes are
44 well known (as "en" for English), as are their forms when qualified by
45 a country code ("en-US"). Less well-known are the arbitrary-length
46 non-ISO codes (like "i-mingo"), and the recently (in 2001) introduced
47 three-letter ISO-639-2 codes.
48
49 Remember these important facts:
50
51 · Language tags are not locale IDs. A locale ID is written with a
52 "_" instead of a "-", (almost?) always matches "m/^\w\w_\w\w\b/",
53 and means something different than a language tag. A language tag
54 denotes a language. A locale ID denotes a language as used in a
55 particular place, in combination with non-linguistic location-spe‐
56 cific information such as what currency is used there. Locales
57 also often denote character set information, as in
58 "en_US.ISO8859-1".
59
60 · Language tags are not for computer languages.
61
62 · "Dialect" is not a useful term, since there is no objective crite‐
63 rion for establishing when two language-forms are dialects of
64 eachother, or are separate languages.
65
66 · Language tags are not case-sensitive. en-US, en-us, En-Us, etc.,
67 are all the same tag, and denote the same language.
68
69 · Not every language tag really refers to a single language. Some
70 language tags refer to conditions: i-default (system-message text
71 in English plus maybe other languages), und (undetermined lan‐
72 guage). Others (notably lots of the three-letter codes) are bibli‐
73 ographic tags that classify whole groups of languages, as with cus
74 "Cushitic (Other)" (i.e., a language that has been classed as
75 Cushtic, but which has no more specific code) or the even less lin‐
76 guistically coherent sai for "South American Indian (Other)".
77 Though useful in bibliography, SUCH TAGS ARE NOT FOR GENERAL USE.
78 For further guidance, email me.
79
80 · Language tags are not country codes. In fact, they are often dis‐
81 tinct codes, as with language tag ja for Japanese, and ISO 3166
82 country code ".jp" for Japan.
83
85 The first part of each item is the language tag, between {...}. It is
86 followed by an English name for the language or language-group. Lan‐
87 guage tags that I judge to be not for general use, are bracketed.
88
89 This list is in alphabetical order by English name of the language.
90
91 {ab} : Abkhazian
92 eq Abkhaz
93
94 {ace} : Achinese
95 {ach} : Acoli
96 {ada} : Adangme
97 {ady} : Adyghe
98 eq Adygei
99
100 {aa} : Afar
101 {afh} : Afrihili
102 (Artificial)
103
104 {af} : Afrikaans
105 [{afa} : Afro-Asiatic (Other)]
106 {ak} : Akan
107 (Formerly "aka".)
108
109 {akk} : Akkadian
110 (Historical)
111
112 {sq} : Albanian
113 {ale} : Aleut
114 [{alg} : Algonquian languages]
115 NOT Algonquin!
116
117 [{tut} : Altaic (Other)]
118 {am} : Amharic
119 NOT Aramaic!
120
121 {i-ami} : Ami
122 eq Amis. eq 'Amis. eq Pangca.
123
124 [{apa} : Apache languages]
125 {ar} : Arabic
126 Many forms are mutually un-intelligible in spoken media. Notable
127 forms: {ar-ae} UAE Arabic; {ar-bh} Bahrain Arabic; {ar-dz} Algerian
128 Arabic; {ar-eg} Egyptian Arabic; {ar-iq} Iraqi Arabic; {ar-jo} Jor‐
129 danian Arabic; {ar-kw} Kuwait Arabic; {ar-lb} Lebanese Arabic;
130 {ar-ly} Libyan Arabic; {ar-ma} Moroccan Arabic; {ar-om} Omani Ara‐
131 bic; {ar-qa} Qatari Arabic; {ar-sa} Sauda Arabic; {ar-sy} Syrian
132 Arabic; {ar-tn} Tunisian Arabic; {ar-ye} Yemen Arabic.
133
134 {arc} : Aramaic
135 NOT Amharic! NOT Samaritan Aramaic!
136
137 {arp} : Arapaho
138 {arn} : Araucanian
139 {arw} : Arawak
140 {hy} : Armenian
141 {an} : Aragonese
142 [{art} : Artificial (Other)]
143 {ast} : Asturian
144 eq Bable.
145
146 {as} : Assamese
147 [{ath} : Athapascan languages]
148 eq Athabaskan. eq Athapaskan. eq Athabascan.
149
150 [{aus} : Australian languages]
151 [{map} : Austronesian (Other)]
152 {av} : Avaric
153 (Formerly "ava".)
154
155 {ae} : Avestan
156 eq Zend
157
158 {awa} : Awadhi
159 {ay} : Aymara
160 {az} : Azerbaijani
161 eq Azeri
162
163 Notable forms: {az-Arab} Azerbaijani in Arabic script; {az-Cyrl}
164 Azerbaijani in Cyrillic script; {az-Latn} Azerbaijani in Latin
165 script.
166
167 {ban} : Balinese
168 [{bat} : Baltic (Other)]
169 {bal} : Baluchi
170 {bm} : Bambara
171 (Formerly "bam".)
172
173 [{bai} : Bamileke languages]
174 {bad} : Banda
175 [{bnt} : Bantu (Other)]
176 {bas} : Basa
177 {ba} : Bashkir
178 {eu} : Basque
179 {btk} : Batak (Indonesia)
180 {bej} : Beja
181 {be} : Belarusian
182 eq Belarussian. eq Byelarussian. eq Belorussian. eq Byelorus‐
183 sian. eq White Russian. eq White Ruthenian. NOT Ruthenian!
184
185 {bem} : Bemba
186 {bn} : Bengali
187 eq Bangla.
188
189 [{ber} : Berber (Other)]
190 {bho} : Bhojpuri
191 {bh} : Bihari
192 {bik} : Bikol
193 {bin} : Bini
194 {bi} : Bislama
195 eq Bichelamar.
196
197 {bs} : Bosnian
198 {bra} : Braj
199 {br} : Breton
200 {bug} : Buginese
201 {bg} : Bulgarian
202 {i-bnn} : Bunun
203 {bua} : Buriat
204 {my} : Burmese
205 {cad} : Caddo
206 {car} : Carib
207 {ca} : Catalan
208 eq Catalan. eq Catalonian.
209
210 [{cau} : Caucasian (Other)]
211 {ceb} : Cebuano
212 [{cel} : Celtic (Other)]
213 Notable forms: {cel-gaulish} Gaulish (Historical)
214
215 [{cai} : Central American Indian (Other)]
216 {chg} : Chagatai
217 (Historical?)
218
219 [{cmc} : Chamic languages]
220 {ch} : Chamorro
221 {ce} : Chechen
222 {chr} : Cherokee
223 eq Tsalagi
224
225 {chy} : Cheyenne
226 {chb} : Chibcha
227 (Historical) NOT Chibchan (which is a language family).
228
229 {ny} : Chichewa
230 eq Nyanja. eq Chinyanja.
231
232 {zh} : Chinese
233 Many forms are mutually un-intelligible in spoken media. Notable
234 forms: {zh-Hans} Chinese, in simplified script; {zh-Hant} Chinese,
235 in traditional script; {zh-tw} Taiwan Chinese; {zh-cn} PRC Chinese;
236 {zh-sg} Singapore Chinese; {zh-mo} Macau Chinese; {zh-hk} Hong Kong
237 Chinese; {zh-guoyu} Mandarin [Putonghua/Guoyu]; {zh-hakka} Hakka
238 [formerly "i-hakka"]; {zh-min} Hokkien; {zh-min-nan} Southern
239 Hokkien; {zh-wuu} Shanghaiese; {zh-xiang} Hunanese; {zh-gan} Gan;
240 {zh-yue} Cantonese.
241
242 {chn} : Chinook Jargon
243 eq Chinook Wawa.
244
245 {chp} : Chipewyan
246 {cho} : Choctaw
247 {cu} : Church Slavic
248 eq Old Church Slavonic.
249
250 {chk} : Chuukese
251 eq Trukese. eq Chuuk. eq Truk. eq Ruk.
252
253 {cv} : Chuvash
254 {cop} : Coptic
255 {kw} : Cornish
256 {co} : Corsican
257 eq Corse.
258
259 {cr} : Cree
260 NOT Creek! (Formerly "cre".)
261
262 {mus} : Creek
263 NOT Cree!
264
265 [{cpe} : English-based Creoles and pidgins (Other)]
266 [{cpf} : French-based Creoles and pidgins (Other)]
267 [{cpp} : Portuguese-based Creoles and pidgins (Other)]
268 [{crp} : Creoles and pidgins (Other)]
269 {hr} : Croatian
270 eq Croat.
271
272 [{cus} : Cushitic (Other)]
273 {cs} : Czech
274 {dak} : Dakota
275 eq Nakota. eq Latoka.
276
277 {da} : Danish
278 {dar} : Dargwa
279 {day} : Dayak
280 {i-default} : Default (Fallthru) Language
281 Defined in RFC 2277, this is for tagging text (which must include
282 English text, and might/should include text in other appropriate
283 languages) that is emitted in a context where language-negotiation
284 wasn't possible -- in SMTP mail failure messages, for example.
285
286 {del} : Delaware
287 {din} : Dinka
288 {dv} : Divehi
289 eq Maldivian. (Formerly "div".)
290
291 {doi} : Dogri
292 NOT Dogrib!
293
294 {dgr} : Dogrib
295 NOT Dogri!
296
297 [{dra} : Dravidian (Other)]
298 {dua} : Duala
299 {nl} : Dutch
300 eq Netherlander. Notable forms: {nl-nl} Netherlands Dutch; {nl-be}
301 Belgian Dutch.
302
303 {dum} : Middle Dutch (ca.1050-1350)
304 (Historical)
305
306 {dyu} : Dyula
307 {dz} : Dzongkha
308 {efi} : Efik
309 {egy} : Ancient Egyptian
310 (Historical)
311
312 {eka} : Ekajuk
313 {elx} : Elamite
314 (Historical)
315
316 {en} : English
317 Notable forms: {en-au} Australian English; {en-bz} Belize English;
318 {en-ca} Canadian English; {en-gb} UK English; {en-ie} Irish Eng‐
319 lish; {en-jm} Jamaican English; {en-nz} New Zealand English;
320 {en-ph} Philippine English; {en-tt} Trinidad English; {en-us} US
321 English; {en-za} South African English; {en-zw} Zimbabwe English.
322
323 {enm} : Old English (1100-1500)
324 (Historical)
325
326 {ang} : Old English (ca.450-1100)
327 eq Anglo-Saxon. (Historical)
328
329 {i-enochian} : Enochian (Artificial)
330 {myv} : Erzya
331 {eo} : Esperanto
332 (Artificial)
333
334 {et} : Estonian
335 {ee} : Ewe
336 (Formerly "ewe".)
337
338 {ewo} : Ewondo
339 {fan} : Fang
340 {fat} : Fanti
341 {fo} : Faroese
342 {fj} : Fijian
343 {fi} : Finnish
344 [{fiu} : Finno-Ugrian (Other)]
345 eq Finno-Ugric. NOT Ugaritic!
346
347 {fon} : Fon
348 {fr} : French
349 Notable forms: {fr-fr} France French; {fr-be} Belgian French;
350 {fr-ca} Canadian French; {fr-ch} Swiss French; {fr-lu} Luxembourg
351 French; {fr-mc} Monaco French.
352
353 {frm} : Middle French (ca.1400-1600)
354 (Historical)
355
356 {fro} : Old French (842-ca.1400)
357 (Historical)
358
359 {fy} : Frisian
360 {fur} : Friulian
361 {ff} : Fulah
362 (Formerly "ful".)
363
364 {gaa} : Ga
365 {gd} : Scots Gaelic
366 NOT Scots!
367
368 {gl} : Gallegan
369 eq Galician
370
371 {lg} : Ganda
372 (Formerly "lug".)
373
374 {gay} : Gayo
375 {gba} : Gbaya
376 {gez} : Geez
377 eq Ge'ez
378
379 {ka} : Georgian
380 {de} : German
381 Notable forms: {de-at} Austrian German; {de-be} Belgian German;
382 {de-ch} Swiss German; {de-de} Germany German; {de-li} Liechtenstein
383 German; {de-lu} Luxembourg German.
384
385 {gmh} : Middle High German (ca.1050-1500)
386 (Historical)
387
388 {goh} : Old High German (ca.750-1050)
389 (Historical)
390
391 [{gem} : Germanic (Other)]
392 {gil} : Gilbertese
393 {gon} : Gondi
394 {gor} : Gorontalo
395 {got} : Gothic
396 (Historical)
397
398 {grb} : Grebo
399 {grc} : Ancient Greek
400 (Historical) (Until 15th century or so.)
401
402 {el} : Modern Greek
403 (Since 15th century or so.)
404
405 {gn} : Guarani
406 Guarani
407
408 {gu} : Gujarati
409 {gwi} : Gwich'in
410 eq Gwichin
411
412 {hai} : Haida
413 {ht} : Haitian
414 eq Haitian Creole
415
416 {ha} : Hausa
417 {haw} : Hawaiian
418 Hawai'ian
419
420 {he} : Hebrew
421 (Formerly "iw".)
422
423 {hz} : Herero
424 {hil} : Hiligaynon
425 {him} : Himachali
426 {hi} : Hindi
427 {ho} : Hiri Motu
428 {hit} : Hittite
429 (Historical)
430
431 {hmn} : Hmong
432 {hu} : Hungarian
433 {hup} : Hupa
434 {iba} : Iban
435 {is} : Icelandic
436 {io} : Ido
437 (Artificial)
438
439 {ig} : Igbo
440 (Formerly "ibo".)
441
442 {ijo} : Ijo
443 {ilo} : Iloko
444 [{inc} : Indic (Other)]
445 [{ine} : Indo-European (Other)]
446 {id} : Indonesian
447 (Formerly "in".)
448
449 {inh} : Ingush
450 {ia} : Interlingua (International Auxiliary Language Association)
451 (Artificial) NOT Interlingue!
452
453 {ie} : Interlingue
454 (Artificial) NOT Interlingua!
455
456 {iu} : Inuktitut
457 A subform of "Eskimo".
458
459 {ik} : Inupiaq
460 A subform of "Eskimo".
461
462 [{ira} : Iranian (Other)]
463 {ga} : Irish
464 {mga} : Middle Irish (900-1200)
465 (Historical)
466
467 {sga} : Old Irish (to 900)
468 (Historical)
469
470 [{iro} : Iroquoian languages]
471 {it} : Italian
472 Notable forms: {it-it} Italy Italian; {it-ch} Swiss Italian.
473
474 {ja} : Japanese
475 (NOT "jp"!)
476
477 {jv} : Javanese
478 (Formerly "jw" because of a typo.)
479
480 {jrb} : Judeo-Arabic
481 {jpr} : Judeo-Persian
482 {kbd} : Kabardian
483 {kab} : Kabyle
484 {kac} : Kachin
485 {kl} : Kalaallisut
486 eq Greenlandic "Eskimo"
487
488 {xal} : Kalmyk
489 {kam} : Kamba
490 {kn} : Kannada
491 eq Kanarese. NOT Canadian!
492
493 {kr} : Kanuri
494 (Formerly "kau".)
495
496 {krc} : Karachay-Balkar
497 {kaa} : Kara-Kalpak
498 {kar} : Karen
499 {ks} : Kashmiri
500 {csb} : Kashubian
501 eq Kashub
502
503 {kaw} : Kawi
504 {kk} : Kazakh
505 {kha} : Khasi
506 {km} : Khmer
507 eq Cambodian. eq Kampuchean.
508
509 [{khi} : Khoisan (Other)]
510 {kho} : Khotanese
511 {ki} : Kikuyu
512 eq Gikuyu.
513
514 {kmb} : Kimbundu
515 {rw} : Kinyarwanda
516 {ky} : Kirghiz
517 {i-klingon} : Klingon
518 {kv} : Komi
519 {kg} : Kongo
520 (Formerly "kon".)
521
522 {kok} : Konkani
523 {ko} : Korean
524 {kos} : Kosraean
525 {kpe} : Kpelle
526 {kro} : Kru
527 {kj} : Kuanyama
528 {kum} : Kumyk
529 {ku} : Kurdish
530 {kru} : Kurukh
531 {kut} : Kutenai
532 {lad} : Ladino
533 eq Judeo-Spanish. NOT Ladin (a minority language in Italy).
534
535 {lah} : Lahnda
536 NOT Lamba!
537
538 {lam} : Lamba
539 NOT Lahnda!
540
541 {lo} : Lao
542 eq Laotian.
543
544 {la} : Latin
545 (Historical) NOT Ladin! NOT Ladino!
546
547 {lv} : Latvian
548 eq Lettish.
549
550 {lb} : Letzeburgesch
551 eq Luxemburgian, eq Luxemburger. (Formerly "i-lux".)
552
553 {lez} : Lezghian
554 {li} : Limburgish
555 eq Limburger, eq Limburgan. NOT Letzeburgesch!
556
557 {ln} : Lingala
558 {lt} : Lithuanian
559 {nds} : Low German
560 eq Low Saxon. eq Low German. eq Low Saxon.
561
562 {art-lojban} : Lojban (Artificial)
563 {loz} : Lozi
564 {lu} : Luba-Katanga
565 (Formerly "lub".)
566
567 {lua} : Luba-Lulua
568 {lui} : Luiseno
569 eq Luisen~o.
570
571 {lun} : Lunda
572 {luo} : Luo (Kenya and Tanzania)
573 {lus} : Lushai
574 {mk} : Macedonian
575 eq the modern Slavic language spoken in what was Yugoslavia. NOT
576 the form of Greek spoken in Greek Macedonia!
577
578 {mad} : Madurese
579 {mag} : Magahi
580 {mai} : Maithili
581 {mak} : Makasar
582 {mg} : Malagasy
583 {ms} : Malay
584 NOT Malayalam!
585
586 {ml} : Malayalam
587 NOT Malay!
588
589 {mt} : Maltese
590 {mnc} : Manchu
591 {mdr} : Mandar
592 NOT Mandarin!
593
594 {man} : Mandingo
595 {mni} : Manipuri
596 eq Meithei.
597
598 [{mno} : Manobo languages]
599 {gv} : Manx
600 {mi} : Maori
601 NOT Mari!
602
603 {mr} : Marathi
604 {chm} : Mari
605 NOT Maori!
606
607 {mh} : Marshall
608 eq Marshallese.
609
610 {mwr} : Marwari
611 {mas} : Masai
612 [{myn} : Mayan languages]
613 {men} : Mende
614 {mic} : Micmac
615 {min} : Minangkabau
616 {i-mingo} : Mingo
617 eq the Irquoian language West Virginia Seneca. NOT New York
618 Seneca!
619
620 [{mis} : Miscellaneous languages]
621 Don't use this.
622
623 {moh} : Mohawk
624 {mdf} : Moksha
625 {mo} : Moldavian
626 eq Moldovan.
627
628 [{mkh} : Mon-Khmer (Other)]
629 {lol} : Mongo
630 {mn} : Mongolian
631 eq Mongol.
632
633 {mos} : Mossi
634 [{mul} : Multiple languages]
635 Not for normal use.
636
637 [{mun} : Munda languages]
638 {nah} : Nahuatl
639 {nap} : Neapolitan
640 {na} : Nauru
641 {nv} : Navajo
642 eq Navaho. (Formerly "i-navajo".)
643
644 {nd} : North Ndebele
645 {nr} : South Ndebele
646 {ng} : Ndonga
647 {ne} : Nepali
648 eq Nepalese. Notable forms: {ne-np} Nepal Nepali; {ne-in} India
649 Nepali.
650
651 {new} : Newari
652 {nia} : Nias
653 [{nic} : Niger-Kordofanian (Other)]
654 [{ssa} : Nilo-Saharan (Other)]
655 {niu} : Niuean
656 {nog} : Nogai
657 {non} : Old Norse
658 (Historical)
659
660 [{nai} : North American Indian]
661 Do not use this.
662
663 {no} : Norwegian
664 Note the two following forms:
665
666 {nb} : Norwegian Bokmal
667 eq Bokmaal, (A form of Norwegian.) (Formerly "no-bok".)
668
669 {nn} : Norwegian Nynorsk
670 (A form of Norwegian.) (Formerly "no-nyn".)
671
672 [{nub} : Nubian languages]
673 {nym} : Nyamwezi
674 {nyn} : Nyankole
675 {nyo} : Nyoro
676 {nzi} : Nzima
677 {oc} : Occitan (post 1500)
678 eq Provencal, eq Provencal
679
680 {oj} : Ojibwa
681 eq Ojibwe. (Formerly "oji".)
682
683 {or} : Oriya
684 {om} : Oromo
685 {osa} : Osage
686 {os} : Ossetian; Ossetic
687 [{oto} : Otomian languages]
688 Group of languages collectively called "Otomi".
689
690 {pal} : Pahlavi
691 eq Pahlevi
692
693 {i-pwn} : Paiwan
694 eq Pariwan
695
696 {pau} : Palauan
697 {pi} : Pali
698 (Historical?)
699
700 {pam} : Pampanga
701 {pag} : Pangasinan
702 {pa} : Panjabi
703 eq Punjabi
704
705 {pap} : Papiamento
706 eq Papiamentu.
707
708 [{paa} : Papuan (Other)]
709 {fa} : Persian
710 eq Farsi. eq Iranian.
711
712 {peo} : Old Persian (ca.600-400 B.C.)
713 [{phi} : Philippine (Other)]
714 {phn} : Phoenician
715 (Historical)
716
717 {pon} : Pohnpeian
718 NOT Pompeiian!
719
720 {pl} : Polish
721 {pt} : Portuguese
722 eq Portugese. Notable forms: {pt-pt} Portugal Portuguese; {pt-br}
723 Brazilian Portuguese.
724
725 [{pra} : Prakrit languages]
726 {pro} : Old Provencal (to 1500)
727 eq Old Provencal. (Historical.)
728
729 {ps} : Pushto
730 eq Pashto. eq Pushtu.
731
732 {qu} : Quechua
733 eq Quecha.
734
735 {rm} : Raeto-Romance
736 eq Romansh.
737
738 {raj} : Rajasthani
739 {rap} : Rapanui
740 {rar} : Rarotongan
741 [{qaa - qtz} : Reserved for local use.]
742 [{roa} : Romance (Other)]
743 NOT Romanian! NOT Romany! NOT Romansh!
744
745 {ro} : Romanian
746 eq Rumanian. NOT Romany!
747
748 {rom} : Romany
749 eq Rom. NOT Romanian!
750
751 {rn} : Rundi
752 {ru} : Russian
753 NOT White Russian! NOT Rusyn!
754
755 [{sal} : Salishan languages]
756 Large language group.
757
758 {sam} : Samaritan Aramaic
759 NOT Aramaic!
760
761 {se} : Northern Sami
762 eq Lappish. eq Lapp. eq (Northern) Saami.
763
764 {sma} : Southern Sami
765 {smn} : Inari Sami
766 {smj} : Lule Sami
767 {sms} : Skolt Sami
768 [{smi} : Sami languages (Other)]
769 {sm} : Samoan
770 {sad} : Sandawe
771 {sg} : Sango
772 {sa} : Sanskrit
773 (Historical)
774
775 {sat} : Santali
776 {sc} : Sardinian
777 eq Sard.
778
779 {sas} : Sasak
780 {sco} : Scots
781 NOT Scots Gaelic!
782
783 {sel} : Selkup
784 [{sem} : Semitic (Other)]
785 {sr} : Serbian
786 eq Serb. NOT Sorbian.
787
788 Notable forms: {sr-Cyrl} : Serbian in Cyrillic script; {sr-Latn} :
789 Serbian in Latin script.
790
791 {srr} : Serer
792 {shn} : Shan
793 {sn} : Shona
794 {sid} : Sidamo
795 {sgn-...} : Sign Languages
796 Always use with a subtag. Notable forms: {sgn-gb} British Sign
797 Language (BSL); {sgn-ie} Irish Sign Language (ESL); {sgn-ni}
798 Nicaraguan Sign Language (ISN); {sgn-us} American Sign Language
799 (ASL).
800
801 (And so on with other country codes as the subtag.)
802
803 {bla} : Siksika
804 eq Blackfoot. eq Pikanii.
805
806 {sd} : Sindhi
807 {si} : Sinhalese
808 eq Sinhala.
809
810 [{sit} : Sino-Tibetan (Other)]
811 [{sio} : Siouan languages]
812 {den} : Slave (Athapascan)
813 ("Slavey" is a subform.)
814
815 [{sla} : Slavic (Other)]
816 {sk} : Slovak
817 eq Slovakian.
818
819 {sl} : Slovenian
820 eq Slovene.
821
822 {sog} : Sogdian
823 {so} : Somali
824 {son} : Songhai
825 {snk} : Soninke
826 {wen} : Sorbian languages
827 eq Wendish. eq Sorb. eq Lusatian. eq Wend. NOT Venda! NOT Ser‐
828 bian!
829
830 {nso} : Northern Sotho
831 {st} : Southern Sotho
832 eq Sutu. eq Sesotho.
833
834 [{sai} : South American Indian (Other)]
835 {es} : Spanish
836 Notable forms: {es-ar} Argentine Spanish; {es-bo} Bolivian Spanish;
837 {es-cl} Chilean Spanish; {es-co} Colombian Spanish; {es-do} Domini‐
838 can Spanish; {es-ec} Ecuadorian Spanish; {es-es} Spain Spanish;
839 {es-gt} Guatemalan Spanish; {es-hn} Honduran Spanish; {es-mx} Mexi‐
840 can Spanish; {es-pa} Panamanian Spanish; {es-pe} Peruvian Spanish;
841 {es-pr} Puerto Rican Spanish; {es-py} Paraguay Spanish; {es-sv}
842 Salvadoran Spanish; {es-us} US Spanish; {es-uy} Uruguayan Spanish;
843 {es-ve} Venezuelan Spanish.
844
845 {suk} : Sukuma
846 {sux} : Sumerian
847 (Historical)
848
849 {su} : Sundanese
850 {sus} : Susu
851 {sw} : Swahili
852 eq Kiswahili
853
854 {ss} : Swati
855 {sv} : Swedish
856 Notable forms: {sv-se} Sweden Swedish; {sv-fi} Finland Swedish.
857
858 {syr} : Syriac
859 {tl} : Tagalog
860 {ty} : Tahitian
861 [{tai} : Tai (Other)]
862 NOT Thai!
863
864 {tg} : Tajik
865 {tmh} : Tamashek
866 {ta} : Tamil
867 {i-tao} : Tao
868 eq Yami.
869
870 {tt} : Tatar
871 {i-tay} : Tayal
872 eq Atayal. eq Atayan.
873
874 {te} : Telugu
875 {ter} : Tereno
876 {tet} : Tetum
877 {th} : Thai
878 NOT Tai!
879
880 {bo} : Tibetan
881 {tig} : Tigre
882 {ti} : Tigrinya
883 {tem} : Timne
884 eq Themne. eq Timene.
885
886 {tiv} : Tiv
887 {tli} : Tlingit
888 {tpi} : Tok Pisin
889 {tkl} : Tokelau
890 {tog} : Tonga (Nyasa)
891 NOT Tsonga!
892
893 {to} : Tonga (Tonga Islands)
894 (Pronounced "Tong-a", not "Tong-ga")
895
896 NOT Tsonga!
897
898 {tsi} : Tsimshian
899 eq Sm'algyax
900
901 {ts} : Tsonga
902 NOT Tonga!
903
904 {i-tsu} : Tsou
905 {tn} : Tswana
906 Same as Setswana.
907
908 {tum} : Tumbuka
909 [{tup} : Tupi languages]
910 {tr} : Turkish
911 (Typically in Roman script)
912
913 {ota} : Ottoman Turkish (1500-1928)
914 (Typically in Arabic script) (Historical)
915
916 {crh} : Crimean Turkish
917 eq Crimean Tatar
918
919 {tk} : Turkmen
920 eq Turkmeni.
921
922 {tvl} : Tuvalu
923 {tyv} : Tuvinian
924 eq Tuvan. eq Tuvin.
925
926 {tw} : Twi
927 {udm} : Udmurt
928 {uga} : Ugaritic
929 NOT Ugric!
930
931 {ug} : Uighur
932 {uk} : Ukrainian
933 {umb} : Umbundu
934 {und} : Undetermined
935 Not a tag for normal use.
936
937 {ur} : Urdu
938 {uz} : Uzbek
939 eq Oezbek
940
941 Notable forms: {uz-Cyrl} Uzbek in Cyrillic script; {uz-Latn} Uzbek
942 in Latin script.
943
944 {vai} : Vai
945 {ve} : Venda
946 NOT Wendish! NOT Wend! NOT Avestan! (Formerly "ven".)
947
948 {vi} : Vietnamese
949 eq Viet.
950
951 {vo} : Volapuk
952 eq Volapuek. (Artificial)
953
954 {vot} : Votic
955 eq Votian. eq Vod.
956
957 [{wak} : Wakashan languages]
958 {wa} : Walloon
959 {wal} : Walamo
960 eq Wolaytta.
961
962 {war} : Waray
963 Presumably the Philippine language Waray-Waray (Samaren~o), not the
964 smaller Philippine language Waray Sorsogon, nor the extinct Aus‐
965 tralian language Waray.
966
967 {was} : Washo
968 eq Washoe
969
970 {cy} : Welsh
971 {wo} : Wolof
972 {x-...} : Unregistered (Semi-Private Use)
973 "x-" is a prefix for language tags that are not registered with ISO
974 or IANA. Example, x-double-dutch
975
976 {xh} : Xhosa
977 {sah} : Yakut
978 {yao} : Yao
979 (The Yao in Malawi?)
980
981 {yap} : Yapese
982 eq Yap
983
984 {ii} : Sichuan Yi
985 {yi} : Yiddish
986 Formerly "ji". Usually in Hebrew script.
987
988 Notable forms: {yi-latn} Yiddish in Latin script
989
990 {yo} : Yoruba
991 [{ypk} : Yupik languages]
992 Several "Eskimo" languages.
993
994 {znd} : Zande
995 [{zap} : Zapotec]
996 (A group of languages.)
997
998 {zen} : Zenaga
999 NOT Zend.
1000
1001 {za} : Zhuang
1002 {zu} : Zulu
1003 {zun} : Zuni
1004 eq Zun~i
1005
1007 I18N::LangTags and its "See Also" section.
1008
1010 Copyright (c) 2001+ Sean M. Burke. All rights reserved.
1011
1012 You can redistribute and/or modify this document under the same terms
1013 as Perl itself.
1014
1015 This document is provided in the hope that it will be useful, but with‐
1016 out any warranty; without even the implied warranty of accuracy,
1017 authoritativeness, completeness, merchantability, or fitness for a par‐
1018 ticular purpose.
1019
1020 Email any corrections or questions to me.
1021
1023 Sean M. Burke, sburke@cpan.org
1024
1025
1026
1027perl v5.8.8 2001-09-21 I18N::LangTags::List(3pm)