Wiktionary:Frequency lists/Esperanto/Wikipedia 2023

Based on the words found in the Esperanto Wikipedia dump of 2023-03-01. All words are reduced to their base form (plural -j and accusative -n are stripped, the verb endings -as/-is/-os/-us/-u are changed to the infinitive -i). Each word is listed in the most typical case form (lower-case, capitalized, or all-caps). Non-Esperanto-ified proper names are mostly omitted (unless listed in common dictionaries). The total size of the corpus is more than 43 million words.


First hundred by frequency

edit

Together these 100 words cover 51.93% percent of the whole corpus.

Second hundred

edit

Together these 200 words cover 58.22% percent of the whole corpus.

Third hundred

edit

Together these 300 words cover 62.10% percent of the whole corpus.

Fourth hundred

edit

Together these 400 words cover 64.99% percent of the whole corpus.

Fifth hundred

edit

Together these 500 words cover 67.32% percent of the whole corpus.

Frequency rank of 501–1000

edit

Together these 1000 words cover 74.73% percent of the whole corpus.

Frequency rank of 1001–2000

edit

Together these 2000 words cover 81.81% percent of the whole corpus.

Frequency rank of 2001–3000

edit

Together these 3000 words cover 85.60% percent of the whole corpus.

Frequency rank of 3001–4000

edit

Together these 4000 words cover 88.02% percent of the whole corpus.

Frequency rank of 4001–5000

edit

Together these 5000 words cover 89.73% percent of the whole corpus.

Frequency rank of 5001–6000

edit

Together these 6000 words cover 91.03% percent of the whole corpus.

Frequency rank of 6001–7000

edit

Together these 7000 words cover 92.06% percent of the whole corpus.

Frequency rank of 7001–8000

edit

Together these 8000 words cover 92.90% percent of the whole corpus.

Frequency rank of 8001–9000

edit

Together these 9000 words cover 93.59% percent of the whole corpus.

Frequency rank of 9001–10000

edit

Together these 10000 words cover 94.18% percent of the whole corpus.

Frequency rank of 10001–11000

edit

Together these 11000 words cover 94.69% percent of the whole corpus.

Frequency rank of 11001–12000

edit

Together these 12000 words cover 95.14% percent of the whole corpus.

Frequency rank of 12001–13000

edit

Together these 13000 words cover 95.53% percent of the whole corpus.

Frequency rank of 13001–14000

edit

Together these 14000 words cover 95.88% percent of the whole corpus.

Frequency rank of 14001–15000

edit

Together these 15000 words cover 96.19% percent of the whole corpus.