Module:families/data/documentation

Documentation for Module:families/data. [edit]
This page contains usage information, categories, interwiki links and other content describing the module.

This module contains definitions for all language family codes on Wiktionary. The entries are listed by their family code, which is usually an ISO-639 code, but there are also some exceptional codes. These are listed on Wiktionary:Families; that page also explains how to create new exceptional codes.

This module must not be imported using require. Instead, it is imported like this:

local families = mw.loadData("Module:families")

This ensures that the data is only loaded once per page, rather than once for every module invocation like normal.

Errors

edit

The following errors were detected by Module:data consistency check:

  • Proto-Central Togo language (alv-gtm-pro) does not have the expected name "Proto-Ghana-Togo Mountain", even though it is the proto-language of the Ghana-Togo Mountain languages (alv-gtm).
  • Proto-Arawa language (auf-pro) does not have the expected name "Proto-Arauan", even though it is the proto-language of the Arauan languages (auf).
  • Proto-Amuesha-Chamicuro language (awd-amc-pro) has a proto-language code associated with the invalid code awd-amc.
  • Proto-Kampa language (awd-kmp-pro) has a proto-language code associated with the invalid code awd-kmp.
  • Proto-Arawak language (awd-pro) does not have the expected name "Proto-Arawakan", even though it is the proto-language of the Arawakan languages (awd).
  • Proto-Paresi-Waura language (awd-prw-pro) has a proto-language code associated with the invalid code awd-prw.
  • Proto-Ta-Arawak language (awd-taa-pro) does not have the expected name "Proto-Ta-Arawakan", even though it is the proto-language of the Ta-Arawakan languages (awd-taa).
  • Proto-Rukai language (dru-pro) has a proto-language code associated with Rukai (dru), which is not a family.
  • Proto-Basque language (euq-pro) does not have the expected name "Proto-Vasconic", even though it is the proto-language of the Vasconic languages (euq).
  • Proto-Norse language (gmq-pro) does not have the expected name "Proto-North Germanic", even though it is the proto-language of the North Germanic languages (gmq).
  • Proto-Kamta language (inc-krn-pro) does not have the expected name "Proto-KRNB lects", even though it is the proto-language of the KRNB lects (inc-krn).
  • Proto-Chumash language (nai-chu-pro) does not have the expected name "Proto-Chumashan", even though it is the proto-language of the Chumashan languages (nai-chu).
  • Proto-Maidun language (nai-mdu-pro) does not have the expected name "Proto-Maiduan", even though it is the proto-language of the Maiduan languages (nai-mdu).
  • Proto-Mixe-Zoque language (nai-miz-pro) does not have the expected name "Proto-Mixe-Zoquean", even though it is the proto-language of the Mixe-Zoquean languages (nai-miz).
  • Proto-Pomo language (nai-pom-pro) does not have the expected name "Proto-Pomoan", even though it is the proto-language of the Pomoan languages (nai-pom).
  • Proto-Mazatec language (omq-maz-pro) does not have the expected name "Proto-Mazatecan", even though it is the proto-language of the Mazatecan languages (omq-maz).
  • Proto-Ossetic language (os-pro) has a proto-language code associated with Ossetian (os), which is not a family.
  • Proto-North Sarawak language (poz-swa-pro) does not have the expected name "Proto-North Sarawakan", even though it is the proto-language of the North Sarawakan languages (poz-swa).
  • Proto-Salish language (sal-pro) does not have the expected name "Proto-Salishan", even though it is the proto-language of the Salishan languages (sal).
  • Proto-Samic language (smi-pro) does not have the expected name "Proto-Sami", even though it is the proto-language of the Sami languages (smi).
  • Proto-Kuki-Chin language (tbq-kuk-pro) does not have the expected name "Proto-Kukish", even though it is the proto-language of the Kukish languages (tbq-kuk).
  • Proto-Saka language (xsc-sak-pro) does not have the expected name "Proto-Sakan", even though it is the proto-language of the Sakan languages (xsc-sak).
  • Literary Chinese language (lzh-lit) has a canonical name that is not unique; it is also used by the code lzh.
  • The data key preprocess_links for ??? (th-new) is invalid.
  • Literary Chinese, the canonical name for the code lzh-lit, is wrong; it should be Literary Chinese.
  • Literary Chinese, the canonical name for the code lzh-lit, is wrong; it should be Literary Chinese.
  • The code ira-mid and the canonical name Middle Iranian should be removed; they are not found in Module:families/data.
  • The code ira-old and the canonical name Old Iranian should be removed; they are not found in Module:families/data.
  • The code ira-mid and the canonical name Middle Iranian should be removed; they are not found in Module:families/data.
  • The code ira-old and the canonical name Old Iranian should be removed; they are not found in Module:families/data.

Required values

edit

Every entry in the table must contain the following properties:

canonicalName
The "canonical" name of the family. This is the name that is used in Wiktionary entries and category names.

Optional values

edit
aliases
A list of aliases/synonyms for the family, other than the canonical name.
varieties
A table of family varieties that are subsumed under the family. This should not include those varieties for which separate family codes have been assigned. If a given variety has several names, they can all be listed by including a sublist in the overall list, where the first element is the canonical name that you want the variety to be known by, and the remainder are aliases. This facility is more commonly used for individual languages than families, but the facility is also provided for families. See Template:language extradata documentation for an example.
otherNames (deprecated)
A table of all non-canonical names that this family is known by, including both synonyms and varieties. This should not be used in new families, and existing families should have the entries in this list moved into either aliases or varieties.
protoLanguage
The language code for the proto-language of the family; that is, the common ancestor of all languages in the family. If this is not specified, Module:families will attempt to create the proto-language code by adding -pro to the family code — for instance, ine (Indo-European languages) → ine-pro (Proto-Indo-European language) — and then use Module:languages to check if the code is valid.
family
The superfamily that the language family belongs to, if any.
wikidata_item
The number in the Wikidata item id for the language family.