User:AKA MBG/Statistics:Parameters of the database created by the Wiktionary parser

Parameters of the created (parsed) Wiktionary database edit

The parsed database name: enwikt20140908_parsed[1]

Table is a name of the table in the database.

Size is a number of records in the table.

The table filled automatically by wikt.stat.ParsedDB of the wiwordik project.

Table Size Table description
page 2087773 Number of words / entries
relation 383514 Number of semantic relations, e.g. synonyms, antonyms, etc.
lang_pos 1915646 Number of pairs: language & part of speech, one Wiktionary page can contain several such pairs.
wiki_text 2824252 Number of meanings / definitions + number of semantic relations phrases (divided by comma, semicolon) + number of wikified translations.
wiki_text_words 3331475 Number of wikified words (in meanings / definitions + in semantic relations + in translations).
meaning 2572634 Number of meanings, one word can have several meanings / definitions.
inflection 123274 It is extracted from wikified word definitions, e.g. [[normal form|inflection]]
label 1100 Number of unique labels.
label_category 17 Number of categories of context labels.
label_meaning 0 Number of labels used in meanings / definitions.
label_relation 0 Number of labels used in semantic relations (only in ruwikt).
quote 0 Number of quotations and examples, one meaning can have several quotes.
quot_translation 0 Number of translations of quotes (quote in foreign languages can have translation).
quot_transcription 0 Number of transcriptions of quotes.
quot_ref 0 Number of unique quote references (author, title, year,...).
quot_author 0 Number of authors of quotes.
quot_year 0 Number of unique years (and range of years) of quotes.
quot_publisher 0 Number of publishers of quotes.
quot_source 0 Number of sources of quotes.
translation 114662 Number of translation section boxes (at best: one translation box corresponds to one meaning).
translation_entry 1439750 Number of different translations (pairs of translations).

References edit

  1. ^ This (or more recent) database would be available at the project site wikokit, see Download section at page whinger.krc.karelia.ru.

See also edit