User:Mzajac/Language attributes/IANA subtags

Suppress-scripts

Data culled from the IANA subtags registry (2008-11-25) appears in italics.

A “suppress-script” is a language's default script, which should not be indicated explicitly. For example, English should be tagged simply as lang="en", not lang="en-Latn".

An “explicit script” applies when a language is written in more than one script, so the script should be indicated (IANA registers these as “redundant” tags). For example, Serbian is written in both the Roman and Cyrillic alphabets, so the script should always be indicated explicitly to avoid ambiguity, as sr-Latn or sr-Cyrl. (Only script subtags are listed in the table; for IANA's registered regions and variants, see below.)
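The rule above can be sketched in a few lines of Python. This is only an illustration: the SUPPRESS_SCRIPT mapping is a hypothetical excerpt of the table on this page, not the full registry, and the function name lang_attribute is an assumption made for the example.

```python
# Hypothetical excerpt of the suppress-script column; not the full IANA registry.
SUPPRESS_SCRIPT = {
    "en": "Latn",
    "ru": "Cyrl",
    "ar": "Arab",
    "el": "Grek",
}


def lang_attribute(language, script=None):
    """Build a lang value, omitting the script when it is the language's default."""
    if script is None or SUPPRESS_SCRIPT.get(language) == script:
        # The script is unknown or is the suppress-script: use the bare language code.
        return language
    # Any other script is stated explicitly, as for Serbian.
    return f"{language}-{script}"


print(lang_attribute("en", "Latn"))
print(lang_attribute("sr", "Cyrl"))
```

Serbian has no entry in the mapping, so either of its scripts is always written out, while a request for English in Latin script collapses to the bare language code.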

Table of language scripts

Please make any necessary additions for Wiktionary to this table.

(Yes, bs-Latn appears in both columns.)

Script | Suppress-script | Explicit script
Arab | ar, fa, ps, ur | az-Arab, tg-Arab
Armn | hy |
Avst | |
Beng | as, bn |
Blis | zbl |
Brai | |
Bugi | |
Cans | | iu-Cans
Cari | |
Cher | |
Cyrl | ab, be, bg, kk, mk, ru, uk | az-Cyrl, bs-Cyrl, mn-Cyrl, sr-Cyrl, tg-Cyrl, uz-Cyrl
Cyrs | cu, orv |
Deva | mai, hi, mr, ne, kok |
Egyp | |
Ethi | am, ti |
Geor | ka |
Glag | |
Goth | |
Grek | el |
polytonic | | el-polyton
Gujr | gu |
Guru | pa |
Hang | |
Hani | |
Hans | | zh-Hans
Hant | | zh-Hant
Hebr | he, iw, yi |
Ital | |
Jpan | ja |
Khmr | km |
Knda | kn |
Kore | ko |
Laoo | lo |
Latn | af, ay, bs, ca, ch, cs, cy, da, de, nds, tem, en, men, eo, es, et, eu, fi, fj, fo, fr, fy, ga, gl, gn, gv, hr, ht, hu, id, in, is, it, niu, kl, tkl, la, lb, ln, lt, lv, mg, mh, tmh, mo, ms, mt, na, nb, nd, nl, nn, no, nr, ny, om, son, cpe, tpi, pl, pt, qu, rn, ro, frr, frs, rw, dsb, hsb, sg, sk, sl, sm, so, nso, sq, ss, st, sv, sw, gsw, tl, tn, to, tr, ts, ve, vi, tvl, wo, xh, zu | az-Latn, be-Latn, bs-Latn, iu-Latn, sr-Latn, uz-Latn, yi-Latn
Latinx | |
Linb | |
Lyci | |
Lydi | |
Mlym | ml |
Mong | | mn-Mong
Mymr | my |
Nkoo | nqo |
Ogam | |
Olck | |
Orya | or |
Phnx | |
Runr | |
Sinh | si |
Syrc | |
Taml | ta |
Telu | te |
Tfng | |
Thaa | dv |
Thai | th |
Tibt | dz |
Ugar | |
Xpeo | |
Xsux | |

Redundant scripts

For convenience, all of IANA's “redundant” tags are listed by language, including those with script, region, and variant subtags.

  • Azerbaijani: az-Arab, az-Cyrl, az-Latn
  • Belarusian (Łacinka): be-Latn
  • Bosnian: bs-Cyrl, bs-Latn
  • German: de-1901, de-1996, de-AT-1901, de-AT-1996, de-CH-1901, de-CH-1996, de-DE-1901, de-DE-1996
  • English (Boontling and Scouse): en-boont, en-scouse
  • Spanish (Latin America): es-419
  • Inuktitut: iu-Cans, iu-Latn
  • Mongolian: mn-Cyrl, mn-Mong
  • Sign language: sgn-BR, sgn-CO, sgn-DE, sgn-DK, sgn-ES, sgn-FR, sgn-GB, sgn-GR, sgn-IE, sgn-IT, sgn-JP, sgn-MX, sgn-NI, sgn-NL, sgn-NO, sgn-PT, sgn-SE, sgn-US, sgn-ZA
  • Slovene: sl-nedis, sl-rozaj
  • Serbian: sr-Cyrl, sr-Latn
  • Tajik: tg-Arab, tg-Cyrl
  • Uzbek: uz-Cyrl, uz-Latn
  • Yiddish: yi-Latn
  • Chinese: zh-Hans, zh-Hans-CN, zh-Hans-HK, zh-Hans-MO, zh-Hans-SG, zh-Hans-TW, zh-Hant, zh-Hant-CN, zh-Hant-HK, zh-Hant-MO, zh-Hant-SG, zh-Hant-TW
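The tags in this list combine a language subtag with script, region, and variant subtags, which can be told apart by shape: four letters for a script, two letters or three digits for a region, and five to eight alphanumerics (or four starting with a digit) for a variant. The following is a minimal sketch under that assumption; split_tag is a hypothetical helper, and it deliberately ignores extended-language, extension, private-use, and grandfathered tags.

```python
import re


def split_tag(tag):
    """Classify the subtags of a simple language tag by their shape."""
    subtags = tag.split("-")
    parts = {"language": subtags[0].lower(), "script": None,
             "region": None, "variants": []}
    for sub in subtags[1:]:
        if re.fullmatch(r"[A-Za-z]{4}", sub):
            parts["script"] = sub.title()          # e.g. Hans, Cyrl
        elif re.fullmatch(r"[A-Za-z]{2}|[0-9]{3}", sub):
            parts["region"] = sub.upper()          # e.g. CH, 419
        elif re.fullmatch(r"[A-Za-z0-9]{5,8}|[0-9][A-Za-z0-9]{3}", sub):
            parts["variants"].append(sub.lower())  # e.g. 1901, rozaj
        else:
            # Anything else is outside the scope of this sketch.
            raise ValueError(f"unsupported subtag: {sub!r}")
    return parts


print(split_tag("de-CH-1901"))
print(split_tag("zh-Hans-TW"))
```

The sketch does not check subtag order or look anything up in the registry, so it accepts well-shaped tags that IANA has never registered.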