I18N::LangTags::List(3pm)  Perl Programmers Reference Guide  I18N::LangTags::List(3pm)

NAME
I18N::LangTags::List -- tags and names for human languages

SYNOPSIS
    use I18N::LangTags::List;
    print "Parlez-vous... ", join(', ',
        I18N::LangTags::List::name('elx') || 'unknown_language',
        I18N::LangTags::List::name('ar-Kw') || 'unknown_language',
        I18N::LangTags::List::name('en') || 'unknown_language',
        I18N::LangTags::List::name('en-CA') || 'unknown_language',
      ), "?\n";

prints:

    Parlez-vous... Elamite, Kuwait Arabic, English, Canadian English?

DESCRIPTION
This module provides a function "I18N::LangTags::List::name( langtag )"
that takes a language tag (see I18N::LangTags) and returns the best
attempt at an English name for it, or undef if it can't make sense of
the tag.

The function I18N::LangTags::List::name(...) is not exported.

This module also provides a function "I18N::LangTags::List::is_decent(
langtag )" that returns true if and only if the language tag is
syntactically valid and is for general use (like "fr" or "fr-ca", below).
That is, it returns false for tags that are syntactically invalid and
for tags, like "aus", that are listed in brackets below. This function
is not exported.

The map of tags-to-names that it uses is accessible as
%I18N::LangTags::List::Name, and it's the same as the list that follows
in this documentation, which should be useful to you even if you don't
use this module.

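As a minimal sketch of the two functions and the hash described above (assuming the module is installed; the loop and the choice of sample tags are illustrative only):

```perl
use strict;
use warnings;
use I18N::LangTags::List;

# Neither function is exported, so both are called by their full names.
for my $tag (qw(fr fr-ca aus elx)) {
    # "||" forces scalar context and supplies a fallback when name()
    # cannot make sense of the tag, as in the SYNOPSIS.
    my $name   = I18N::LangTags::List::name($tag) || 'unknown_language';
    my $decent = I18N::LangTags::List::is_decent($tag) ? 'yes' : 'no';
    print "$tag: $name (general use: $decent)\n";
}

# The underlying tag-to-name map can also be read directly.
my $english = $I18N::LangTags::List::Name{'en'};
print "en maps to $english\n";
```

Calling name() with "||" rather than assigning it inside a list keeps a failed lookup from silently dropping an element.
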
ABOUT LANGUAGE TAGS
Internet language tags, as defined in RFC 3066, are a formalism for
denoting human languages. The two-letter ISO 639-1 language codes are
well known (such as "en" for English), as are their forms when qualified
by a country code ("en-US"). Less well known are the arbitrary-length
non-ISO codes (like "i-mingo"), and the recently (in 2001) introduced
three-letter ISO 639-2 codes.

Remember these important facts:

• Language tags are not locale IDs. A locale ID is written with a
  "_" instead of a "-", (almost?) always matches "m/^\w\w_\w\w\b/",
  and means something different from a language tag. A language tag
  denotes a language. A locale ID denotes a language as used in a
  particular place, in combination with non-linguistic
  location-specific information such as what currency is used there.
  Locales also often denote character set information, as in
  "en_US.ISO8859-1".

• Language tags are not for computer languages.

• "Dialect" is not a useful term, since there is no objective
  criterion for establishing when two language-forms are dialects of
  each other, or are separate languages.

• Language tags are not case-sensitive. en-US, en-us, En-Us, etc.,
  are all the same tag, and denote the same language.

• Not every language tag really refers to a single language. Some
  language tags refer to conditions: i-default (system-message text
  in English plus maybe other languages), und (undetermined
  language). Others (notably lots of the three-letter codes) are
  bibliographic tags that classify whole groups of languages, as with
  cus "Cushitic (Other)" (i.e., a language that has been classed as
  Cushitic, but which has no more specific code) or the even less
  linguistically coherent sai for "South American Indian (Other)".
  Though useful in bibliography, SUCH TAGS ARE NOT FOR GENERAL USE.
  For further guidance, email me.

• Language tags are not country codes. In fact, they are often
  distinct codes, as with language tag ja for Japanese, and ISO 3166
  country code "jp" for Japan.

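The first and fourth facts above can be checked with the sibling module I18N::LangTags. The functions used here (is_language_tag, same_language_tag, locale2language_tag) belong to that module and are not described on this page, so treat this as a sketch and consult that module's documentation for their exact behavior:

```perl
use strict;
use warnings;
use I18N::LangTags qw(is_language_tag same_language_tag locale2language_tag);

# A locale ID uses "_" and so is not itself a language tag ...
print "en_US is ", (is_language_tag('en_US') ? '' : 'not '),
      "a language tag\n";

# ... but a language tag can be derived from it; the character set
# suffix is dropped along the way.
my $tag = locale2language_tag('en_US.ISO8859-1');
print "locale en_US.ISO8859-1 corresponds to tag $tag\n";

# Tags that differ only in case denote the same language.
print "en-US and En-us are the same tag\n"
    if same_language_tag('en-US', 'En-us');
```
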
LIST OF LANGUAGES
The first part of each item is the language tag, between {...}. It is
followed by an English name for the language or language-group.
Language tags that I judge to be not for general use are bracketed.

This list is in alphabetical order by English name of the language.

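If you want to use the list without loading the module, the item format just described can be picked apart with a regular expression. The helper parse_entry below is hypothetical (it is not part of this module) and handles only simple items, not ranges such as {qaa - qtz} or patterns such as {sgn-...}:

```perl
use strict;
use warnings;

# Hypothetical helper: split one item line into its tag, its English
# name, and a flag saying whether the item is for general use
# (that is, whether it was NOT bracketed).
sub parse_entry {
    my ($line) = @_;
    # Strip the surrounding [...] if present, and remember that we did.
    my $bracketed = ($line =~ s/^\s*\[(.*)\]\s*$/$1/) ? 1 : 0;
    return unless $line =~ /^\s*\{([-0-9A-Za-z]+)\}\s*:\s*(.+?)\s*$/;
    return { tag => lc($1), name => $2, general_use => !$bracketed };
}

for my $line ('{ab} : Abkhazian', '[{afa} : Afro-Asiatic (Other)]') {
    my $entry = parse_entry($line) or next;
    printf "%-4s %-24s %s\n", $entry->{tag}, $entry->{name},
        $entry->{general_use} ? 'general use' : 'not for general use';
}
```
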
{ab} : Abkhazian
    eq Abkhaz

{ace} : Achinese
{ach} : Acoli
{ada} : Adangme
{ady} : Adyghe
    eq Adygei

{aa} : Afar
{afh} : Afrihili
    (Artificial)

{af} : Afrikaans
[{afa} : Afro-Asiatic (Other)]
{ak} : Akan
    (Formerly "aka".)

{akk} : Akkadian
    (Historical)

{sq} : Albanian
{ale} : Aleut
[{alg} : Algonquian languages]
    NOT Algonquin!

[{tut} : Altaic (Other)]
{am} : Amharic
    NOT Aramaic!

{i-ami} : Ami
    eq Amis. eq 'Amis. eq Pangca.

[{apa} : Apache languages]
{ar} : Arabic
    Many forms are mutually unintelligible in spoken media. Notable
    forms: {ar-ae} UAE Arabic; {ar-bh} Bahrain Arabic; {ar-dz} Algerian
    Arabic; {ar-eg} Egyptian Arabic; {ar-iq} Iraqi Arabic; {ar-jo}
    Jordanian Arabic; {ar-kw} Kuwait Arabic; {ar-lb} Lebanese Arabic;
    {ar-ly} Libyan Arabic; {ar-ma} Moroccan Arabic; {ar-om} Omani
    Arabic; {ar-qa} Qatari Arabic; {ar-sa} Saudi Arabic; {ar-sy} Syrian
    Arabic; {ar-tn} Tunisian Arabic; {ar-ye} Yemen Arabic.

{arc} : Aramaic
    NOT Amharic! NOT Samaritan Aramaic!

{arp} : Arapaho
{arn} : Araucanian
{arw} : Arawak
{hy} : Armenian
{an} : Aragonese
[{art} : Artificial (Other)]
{ast} : Asturian
    eq Bable.

{as} : Assamese
[{ath} : Athapascan languages]
    eq Athabaskan. eq Athapaskan. eq Athabascan.

[{aus} : Australian languages]
[{map} : Austronesian (Other)]
{av} : Avaric
    (Formerly "ava".)

{ae} : Avestan
    eq Zend

{awa} : Awadhi
{ay} : Aymara
{az} : Azerbaijani
    eq Azeri

    Notable forms: {az-Arab} Azerbaijani in Arabic script; {az-Cyrl}
    Azerbaijani in Cyrillic script; {az-Latn} Azerbaijani in Latin
    script.

{ban} : Balinese
[{bat} : Baltic (Other)]
{bal} : Baluchi
{bm} : Bambara
    (Formerly "bam".)

[{bai} : Bamileke languages]
{bad} : Banda
[{bnt} : Bantu (Other)]
{bas} : Basa
{ba} : Bashkir
{eu} : Basque
{btk} : Batak (Indonesia)
{bej} : Beja
{be} : Belarusian
    eq Belarussian. eq Byelarussian. eq Belorussian. eq
    Byelorussian. eq White Russian. eq White Ruthenian. NOT
    Ruthenian!

{bem} : Bemba
{bn} : Bengali
    eq Bangla.

[{ber} : Berber (Other)]
{bho} : Bhojpuri
{bh} : Bihari
{bik} : Bikol
{bin} : Bini
{bi} : Bislama
    eq Bichelamar.

{bs} : Bosnian
{bra} : Braj
{br} : Breton
{bug} : Buginese
{bg} : Bulgarian
{i-bnn} : Bunun
{bua} : Buriat
{my} : Burmese
{cad} : Caddo
{car} : Carib
{ca} : Catalan
    eq Catalán. eq Catalonian.

[{cau} : Caucasian (Other)]
{ceb} : Cebuano
[{cel} : Celtic (Other)]
    Notable forms: {cel-gaulish} Gaulish (Historical)

[{cai} : Central American Indian (Other)]
{chg} : Chagatai
    (Historical?)

[{cmc} : Chamic languages]
{ch} : Chamorro
{ce} : Chechen
{chr} : Cherokee
    eq Tsalagi

{chy} : Cheyenne
{chb} : Chibcha
    (Historical) NOT Chibchan (which is a language family).

{ny} : Chichewa
    eq Nyanja. eq Chinyanja.

{zh} : Chinese
    Many forms are mutually unintelligible in spoken media. Notable
    forms: {zh-Hans} Chinese, in simplified script; {zh-Hant} Chinese,
    in traditional script; {zh-tw} Taiwan Chinese; {zh-cn} PRC Chinese;
    {zh-sg} Singapore Chinese; {zh-mo} Macau Chinese; {zh-hk} Hong Kong
    Chinese; {zh-guoyu} Mandarin [Putonghua/Guoyu]; {zh-hakka} Hakka
    [formerly "i-hakka"]; {zh-min} Hokkien; {zh-min-nan} Southern
    Hokkien; {zh-wuu} Shanghainese; {zh-xiang} Hunanese; {zh-gan} Gan;
    {zh-yue} Cantonese.

{chn} : Chinook Jargon
    eq Chinook Wawa.

{chp} : Chipewyan
{cho} : Choctaw
{cu} : Church Slavic
    eq Old Church Slavonic.

{chk} : Chuukese
    eq Trukese. eq Chuuk. eq Truk. eq Ruk.

{cv} : Chuvash
{cop} : Coptic
{kw} : Cornish
{co} : Corsican
    eq Corse.

{cr} : Cree
    NOT Creek! (Formerly "cre".)

{mus} : Creek
    NOT Cree!

[{cpe} : English-based Creoles and pidgins (Other)]
[{cpf} : French-based Creoles and pidgins (Other)]
[{cpp} : Portuguese-based Creoles and pidgins (Other)]
[{crp} : Creoles and pidgins (Other)]
{hr} : Croatian
    eq Croat.

[{cus} : Cushitic (Other)]
{cs} : Czech
{dak} : Dakota
    eq Nakota. eq Lakota.

{da} : Danish
{dar} : Dargwa
{day} : Dayak
{i-default} : Default (Fallthru) Language
    Defined in RFC 2277, this is for tagging text (which must include
    English text, and might/should include text in other appropriate
    languages) that is emitted in a context where language-negotiation
    wasn't possible -- in SMTP mail failure messages, for example.

{del} : Delaware
{din} : Dinka
{dv} : Divehi
    eq Maldivian. (Formerly "div".)

{doi} : Dogri
    NOT Dogrib!

{dgr} : Dogrib
    NOT Dogri!

[{dra} : Dravidian (Other)]
{dua} : Duala
{nl} : Dutch
    eq Netherlander. Notable forms: {nl-nl} Netherlands Dutch; {nl-be}
    Belgian Dutch.

{dum} : Middle Dutch (ca.1050-1350)
    (Historical)

{dyu} : Dyula
{dz} : Dzongkha
{efi} : Efik
{egy} : Ancient Egyptian
    (Historical)

{eka} : Ekajuk
{elx} : Elamite
    (Historical)

{en} : English
    Notable forms: {en-au} Australian English; {en-bz} Belize English;
    {en-ca} Canadian English; {en-gb} UK English; {en-ie} Irish
    English; {en-jm} Jamaican English; {en-nz} New Zealand English;
    {en-ph} Philippine English; {en-tt} Trinidad English; {en-us} US
    English; {en-za} South African English; {en-zw} Zimbabwe English.

{enm} : Middle English (1100-1500)
    (Historical)

{ang} : Old English (ca.450-1100)
    eq Anglo-Saxon. (Historical)

{i-enochian} : Enochian (Artificial)
{myv} : Erzya
{eo} : Esperanto
    (Artificial)

{et} : Estonian
{ee} : Ewe
    (Formerly "ewe".)

{ewo} : Ewondo
{fan} : Fang
{fat} : Fanti
{fo} : Faroese
{fj} : Fijian
{fi} : Finnish
[{fiu} : Finno-Ugrian (Other)]
    eq Finno-Ugric. NOT Ugaritic!

{fon} : Fon
{fr} : French
    Notable forms: {fr-fr} France French; {fr-be} Belgian French;
    {fr-ca} Canadian French; {fr-ch} Swiss French; {fr-lu} Luxembourg
    French; {fr-mc} Monaco French.

{frm} : Middle French (ca.1400-1600)
    (Historical)

{fro} : Old French (842-ca.1400)
    (Historical)

{fy} : Frisian
{fur} : Friulian
{ff} : Fulah
    (Formerly "ful".)

{gaa} : Ga
{gd} : Scots Gaelic
    NOT Scots!

{gl} : Gallegan
    eq Galician

{lg} : Ganda
    (Formerly "lug".)

{gay} : Gayo
{gba} : Gbaya
{gez} : Geez
    eq Ge'ez

{ka} : Georgian
{de} : German
    Notable forms: {de-at} Austrian German; {de-be} Belgian German;
    {de-ch} Swiss German; {de-de} Germany German; {de-li} Liechtenstein
    German; {de-lu} Luxembourg German.

{gmh} : Middle High German (ca.1050-1500)
    (Historical)

{goh} : Old High German (ca.750-1050)
    (Historical)

[{gem} : Germanic (Other)]
{gil} : Gilbertese
{gon} : Gondi
{gor} : Gorontalo
{got} : Gothic
    (Historical)

{grb} : Grebo
{grc} : Ancient Greek
    (Historical) (Until 15th century or so.)

{el} : Modern Greek
    (Since 15th century or so.)

{gn} : Guarani
    eq Guaraní

{gu} : Gujarati
{gwi} : Gwich'in
    eq Gwichin

{hai} : Haida
{ht} : Haitian
    eq Haitian Creole

{ha} : Hausa
{haw} : Hawaiian
    eq Hawai'ian

{he} : Hebrew
    (Formerly "iw".)

{hz} : Herero
{hil} : Hiligaynon
{him} : Himachali
{hi} : Hindi
{ho} : Hiri Motu
{hit} : Hittite
    (Historical)

{hmn} : Hmong
{hu} : Hungarian
{hup} : Hupa
{iba} : Iban
{is} : Icelandic
{io} : Ido
    (Artificial)

{ig} : Igbo
    (Formerly "ibo".)

{ijo} : Ijo
{ilo} : Iloko
[{inc} : Indic (Other)]
[{ine} : Indo-European (Other)]
{id} : Indonesian
    (Formerly "in".)

{inh} : Ingush
{ia} : Interlingua (International Auxiliary Language Association)
    (Artificial) NOT Interlingue!

{ie} : Interlingue
    (Artificial) NOT Interlingua!

{iu} : Inuktitut
    A subform of "Eskimo".

{ik} : Inupiaq
    A subform of "Eskimo".

[{ira} : Iranian (Other)]
{ga} : Irish
{mga} : Middle Irish (900-1200)
    (Historical)

{sga} : Old Irish (to 900)
    (Historical)

[{iro} : Iroquoian languages]
{it} : Italian
    Notable forms: {it-it} Italy Italian; {it-ch} Swiss Italian.

{ja} : Japanese
    (NOT "jp"!)

{jv} : Javanese
    (Formerly "jw" because of a typo.)

{jrb} : Judeo-Arabic
{jpr} : Judeo-Persian
{kbd} : Kabardian
{kab} : Kabyle
{kac} : Kachin
{kl} : Kalaallisut
    eq Greenlandic "Eskimo"

{xal} : Kalmyk
{kam} : Kamba
{kn} : Kannada
    eq Kanarese. NOT Canadian!

{kr} : Kanuri
    (Formerly "kau".)

{krc} : Karachay-Balkar
{kaa} : Kara-Kalpak
{kar} : Karen
{ks} : Kashmiri
{csb} : Kashubian
    eq Kashub

{kaw} : Kawi
{kk} : Kazakh
{kha} : Khasi
{km} : Khmer
    eq Cambodian. eq Kampuchean.

[{khi} : Khoisan (Other)]
{kho} : Khotanese
{ki} : Kikuyu
    eq Gikuyu.

{kmb} : Kimbundu
{rw} : Kinyarwanda
{ky} : Kirghiz
{i-klingon} : Klingon
{kv} : Komi
{kg} : Kongo
    (Formerly "kon".)

{kok} : Konkani
{ko} : Korean
{kos} : Kosraean
{kpe} : Kpelle
{kro} : Kru
{kj} : Kuanyama
{kum} : Kumyk
{ku} : Kurdish
{kru} : Kurukh
{kut} : Kutenai
{lad} : Ladino
    eq Judeo-Spanish. NOT Ladin (a minority language in Italy).

{lah} : Lahnda
    NOT Lamba!

{lam} : Lamba
    NOT Lahnda!

{lo} : Lao
    eq Laotian.

{la} : Latin
    (Historical) NOT Ladin! NOT Ladino!

{lv} : Latvian
    eq Lettish.

{lb} : Letzeburgesch
    eq Luxemburgian, eq Luxemburger. (Formerly "i-lux".)

{lez} : Lezghian
{li} : Limburgish
    eq Limburger, eq Limburgan. NOT Letzeburgesch!

{ln} : Lingala
{lt} : Lithuanian
{nds} : Low German
    eq Low Saxon.

{art-lojban} : Lojban (Artificial)
{loz} : Lozi
{lu} : Luba-Katanga
    (Formerly "lub".)

{lua} : Luba-Lulua
{lui} : Luiseno
    eq Luiseño.

{lun} : Lunda
{luo} : Luo (Kenya and Tanzania)
{lus} : Lushai
{mk} : Macedonian
    eq the modern Slavic language spoken in what was Yugoslavia. NOT
    the form of Greek spoken in Greek Macedonia!

{mad} : Madurese
{mag} : Magahi
{mai} : Maithili
{mak} : Makasar
{mg} : Malagasy
{ms} : Malay
    NOT Malayalam!

{ml} : Malayalam
    NOT Malay!

{mt} : Maltese
{mnc} : Manchu
{mdr} : Mandar
    NOT Mandarin!

{man} : Mandingo
{mni} : Manipuri
    eq Meithei.

[{mno} : Manobo languages]
{gv} : Manx
{mi} : Maori
    NOT Mari!

{mr} : Marathi
{chm} : Mari
    NOT Maori!

{mh} : Marshall
    eq Marshallese.

{mwr} : Marwari
{mas} : Masai
[{myn} : Mayan languages]
{men} : Mende
{mic} : Micmac
{min} : Minangkabau
{i-mingo} : Mingo
    eq the Iroquoian language West Virginia Seneca. NOT New York
    Seneca!

[{mis} : Miscellaneous languages]
    Don't use this.

{moh} : Mohawk
{mdf} : Moksha
{mo} : Moldavian
    eq Moldovan.

[{mkh} : Mon-Khmer (Other)]
{lol} : Mongo
{mn} : Mongolian
    eq Mongol.

{mos} : Mossi
[{mul} : Multiple languages]
    Not for normal use.

[{mun} : Munda languages]
{nah} : Nahuatl
{nap} : Neapolitan
{na} : Nauru
{nv} : Navajo
    eq Navaho. (Formerly "i-navajo".)

{nd} : North Ndebele
{nr} : South Ndebele
{ng} : Ndonga
{ne} : Nepali
    eq Nepalese. Notable forms: {ne-np} Nepal Nepali; {ne-in} India
    Nepali.

{new} : Newari
{nia} : Nias
[{nic} : Niger-Kordofanian (Other)]
[{ssa} : Nilo-Saharan (Other)]
{niu} : Niuean
{nog} : Nogai
{non} : Old Norse
    (Historical)

[{nai} : North American Indian]
    Do not use this.

{no} : Norwegian
    Note the two following forms:

{nb} : Norwegian Bokmal
    eq Bokmål. (A form of Norwegian.) (Formerly "no-bok".)

{nn} : Norwegian Nynorsk
    (A form of Norwegian.) (Formerly "no-nyn".)

[{nub} : Nubian languages]
{nym} : Nyamwezi
{nyn} : Nyankole
{nyo} : Nyoro
{nzi} : Nzima
{oc} : Occitan (post 1500)
    eq Provençal, eq Provencal

{oj} : Ojibwa
    eq Ojibwe. (Formerly "oji".)

{or} : Oriya
{om} : Oromo
{osa} : Osage
{os} : Ossetian; Ossetic
[{oto} : Otomian languages]
    Group of languages collectively called "Otomi".

{pal} : Pahlavi
    eq Pahlevi

{i-pwn} : Paiwan
    eq Pariwan

{pau} : Palauan
{pi} : Pali
    (Historical?)

{pam} : Pampanga
{pag} : Pangasinan
{pa} : Panjabi
    eq Punjabi

{pap} : Papiamento
    eq Papiamentu.

[{paa} : Papuan (Other)]
{fa} : Persian
    eq Farsi. eq Iranian.

{peo} : Old Persian (ca.600-400 B.C.)
[{phi} : Philippine (Other)]
{phn} : Phoenician
    (Historical)

{pon} : Pohnpeian
    NOT Pompeiian!

{pl} : Polish
{pt} : Portuguese
    eq Portugese. Notable forms: {pt-pt} Portugal Portuguese; {pt-br}
    Brazilian Portuguese.

[{pra} : Prakrit languages]
{pro} : Old Provencal (to 1500)
    eq Old Provençal. (Historical.)

{ps} : Pushto
    eq Pashto. eq Pushtu.

{qu} : Quechua
    eq Quecha.

{rm} : Raeto-Romance
    eq Romansh.

{raj} : Rajasthani
{rap} : Rapanui
{rar} : Rarotongan
[{qaa - qtz} : Reserved for local use.]
[{roa} : Romance (Other)]
    NOT Romanian! NOT Romany! NOT Romansh!

{ro} : Romanian
    eq Rumanian. NOT Romany!

{rom} : Romany
    eq Rom. NOT Romanian!

{rn} : Rundi
{ru} : Russian
    NOT White Russian! NOT Rusyn!

[{sal} : Salishan languages]
    Large language group.

{sam} : Samaritan Aramaic
    NOT Aramaic!

{se} : Northern Sami
    eq Lappish. eq Lapp. eq (Northern) Saami.

{sma} : Southern Sami
{smn} : Inari Sami
{smj} : Lule Sami
{sms} : Skolt Sami
[{smi} : Sami languages (Other)]
{sm} : Samoan
{sad} : Sandawe
{sg} : Sango
{sa} : Sanskrit
    (Historical)

{sat} : Santali
{sc} : Sardinian
    eq Sard.

{sas} : Sasak
{sco} : Scots
    NOT Scots Gaelic!

{sel} : Selkup
[{sem} : Semitic (Other)]
{sr} : Serbian
    eq Serb. NOT Sorbian.

    Notable forms: {sr-Cyrl} : Serbian in Cyrillic script; {sr-Latn} :
    Serbian in Latin script.

{srr} : Serer
{shn} : Shan
{sn} : Shona
{sid} : Sidamo
{sgn-...} : Sign Languages
    Always use with a subtag. Notable forms: {sgn-gb} British Sign
    Language (BSL); {sgn-ie} Irish Sign Language (ISL); {sgn-ni}
    Nicaraguan Sign Language (ISN); {sgn-us} American Sign Language
    (ASL).

    (And so on with other country codes as the subtag.)

{bla} : Siksika
    eq Blackfoot. eq Pikanii.

{sd} : Sindhi
{si} : Sinhalese
    eq Sinhala.

[{sit} : Sino-Tibetan (Other)]
[{sio} : Siouan languages]
{den} : Slave (Athapascan)
    ("Slavey" is a subform.)

[{sla} : Slavic (Other)]
{sk} : Slovak
    eq Slovakian.

{sl} : Slovenian
    eq Slovene.

{sog} : Sogdian
{so} : Somali
{son} : Songhai
{snk} : Soninke
{wen} : Sorbian languages
    eq Wendish. eq Sorb. eq Lusatian. eq Wend. NOT Venda! NOT
    Serbian!

{nso} : Northern Sotho
{st} : Southern Sotho
    eq Sutu. eq Sesotho.

[{sai} : South American Indian (Other)]
{es} : Spanish
    Notable forms: {es-ar} Argentine Spanish; {es-bo} Bolivian Spanish;
    {es-cl} Chilean Spanish; {es-co} Colombian Spanish; {es-do}
    Dominican Spanish; {es-ec} Ecuadorian Spanish; {es-es} Spain
    Spanish; {es-gt} Guatemalan Spanish; {es-hn} Honduran Spanish;
    {es-mx} Mexican Spanish; {es-pa} Panamanian Spanish; {es-pe}
    Peruvian Spanish; {es-pr} Puerto Rican Spanish; {es-py} Paraguay
    Spanish; {es-sv} Salvadoran Spanish; {es-us} US Spanish; {es-uy}
    Uruguayan Spanish; {es-ve} Venezuelan Spanish.

{suk} : Sukuma
{sux} : Sumerian
    (Historical)

{su} : Sundanese
{sus} : Susu
{sw} : Swahili
    eq Kiswahili

{ss} : Swati
{sv} : Swedish
    Notable forms: {sv-se} Sweden Swedish; {sv-fi} Finland Swedish.

{syr} : Syriac
{tl} : Tagalog
{ty} : Tahitian
[{tai} : Tai (Other)]
    NOT Thai!

{tg} : Tajik
{tmh} : Tamashek
{ta} : Tamil
{i-tao} : Tao
    eq Yami.

{tt} : Tatar
{i-tay} : Tayal
    eq Atayal. eq Atayan.

{te} : Telugu
{ter} : Tereno
{tet} : Tetum
{th} : Thai
    NOT Tai!

{bo} : Tibetan
{tig} : Tigre
{ti} : Tigrinya
{tem} : Timne
    eq Themne. eq Timene.

{tiv} : Tiv
{tli} : Tlingit
{tpi} : Tok Pisin
{tkl} : Tokelau
{tog} : Tonga (Nyasa)
    NOT Tsonga!

{to} : Tonga (Tonga Islands)
    (Pronounced "Tong-a", not "Tong-ga")

    NOT Tsonga!

{tsi} : Tsimshian
    eq Sm'algyax

{ts} : Tsonga
    NOT Tonga!

{i-tsu} : Tsou
{tn} : Tswana
    Same as Setswana.

{tum} : Tumbuka
[{tup} : Tupi languages]
{tr} : Turkish
    (Typically in Roman script)

{ota} : Ottoman Turkish (1500-1928)
    (Typically in Arabic script) (Historical)

{crh} : Crimean Turkish
    eq Crimean Tatar

{tk} : Turkmen
    eq Turkmeni.

{tvl} : Tuvalu
{tyv} : Tuvinian
    eq Tuvan. eq Tuvin.

{tw} : Twi
{udm} : Udmurt
{uga} : Ugaritic
    NOT Ugric!

{ug} : Uighur
{uk} : Ukrainian
{umb} : Umbundu
{und} : Undetermined
    Not a tag for normal use.

{ur} : Urdu
{uz} : Uzbek
    eq Özbek

    Notable forms: {uz-Cyrl} Uzbek in Cyrillic script; {uz-Latn} Uzbek
    in Latin script.

{vai} : Vai
{ve} : Venda
    NOT Wendish! NOT Wend! NOT Avestan! (Formerly "ven".)

{vi} : Vietnamese
    eq Viet.

{vo} : Volapuk
    eq Volapük. (Artificial)

{vot} : Votic
    eq Votian. eq Vod.

[{wak} : Wakashan languages]
{wa} : Walloon
{wal} : Walamo
    eq Wolaytta.

{war} : Waray
    Presumably the Philippine language Waray-Waray (Samareño), not the
    smaller Philippine language Waray Sorsogon, nor the extinct
    Australian language Waray.

{was} : Washo
    eq Washoe

{cy} : Welsh
{wo} : Wolof
{x-...} : Unregistered (Semi-Private Use)
    "x-" is a prefix for language tags that are not registered with ISO
    or IANA. Example: x-double-dutch

{xh} : Xhosa
{sah} : Yakut
{yao} : Yao
    (The Yao in Malawi?)

{yap} : Yapese
    eq Yap

{ii} : Sichuan Yi
{yi} : Yiddish
    Formerly "ji". Usually in Hebrew script.

    Notable forms: {yi-latn} Yiddish in Latin script

{yo} : Yoruba
[{ypk} : Yupik languages]
    Several "Eskimo" languages.

{znd} : Zande
[{zap} : Zapotec]
    (A group of languages.)

{zen} : Zenaga
    NOT Zend.

{za} : Zhuang
{zu} : Zulu
{zun} : Zuni
    eq Zuñi

SEE ALSO
I18N::LangTags and its "See Also" section.

COPYRIGHT AND DISCLAIMER
Copyright (c) 2001+ Sean M. Burke. All rights reserved.

You can redistribute and/or modify this document under the same terms
as Perl itself.

This document is provided in the hope that it will be useful, but
without any warranty; without even the implied warranty of accuracy,
authoritativeness, completeness, merchantability, or fitness for a
particular purpose.

Email any corrections or questions to me.

AUTHOR
Sean M. Burke, sburke@cpan.org

perl v5.32.1                  2021-05-31         I18N::LangTags::List(3pm)