User:AKA MBG/Statistics:POS
The parsed database name: enwikt20140908_parsed[1] This page outlines:
- Number of meanings.
- Number of empty definitions for each language.
- Number of entries for each part of speech (POS).
See about Part of Speech (POS) headers:
Meanings
[edit]Number of words (with meanings) with unknown POS: 40828
The total of all unique noun, verb, etc. (+ with empty definitions): 1915645
Number of empty definitions: 109418
Number of words (unique noun, verb, etc.) with nonempty definitions: 1806227
Number of records in the table lang_pos: 1915646
Number of words having different number of meanings / definitions
[edit]Table description:
- column 0 - number of words with empty definitions (total and for each language)
- column 1 - number of monosemous words (total and for each language)
- column 2 - number of words with two meanings, etc.
- last column ("Total") - total number of words for this language.
Only the first 9 meanings (columns) are presented in the table.
| Number of meanings: | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | Total | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| code | Total (all languages) : | 109418 | 1477775 | 213489 | 61135 | 22147 | 13074 | 7201 | 2323 | 4621 | 886 | 1912069 |
| pms | Piedmontese | 0 | 31 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 33 |
| mwl | Mirandese | 1 | 253 | 15 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 270 |
| mg | Malagasy | 35 | 3301 | 23 | 2 | 2 | 0 | 0 | 0 | 0 | 0 | 3363 |
| wrh | Wiradjuri | 6 | 143 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 159 |
| chc | Catawba | 1 | 16 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 18 |
| rej | Rejang | 0 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
| tvl | Tuvaluan | 0 | 27 | 2 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 30 |
| ktn | Karitiâna | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| see | Seneca | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| rtm | Rotuman | 0 | 14 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 15 |
| chn | Chinook Jargon | 0 | 65 | 12 | 1 | 2 | 0 | 0 | 0 | 0 | 0 | 80 |
| sl | Slovene | 21 | 3235 | 298 | 65 | 22 | 12 | 5 | 0 | 0 | 5 | 3663 |
| osa | Osage | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ems | Alutiiq | 0 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
| btk | Batak | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| cuk | Kuna | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| pdc | Pennsylvania German | 1 | 21 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 23 |
| crp-rsn | Russenorsk | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| pa | Punjabi | 0 | 174 | 32 | 8 | 1 | 5 | 0 | 0 | 0 | 0 | 220 |
| mnc | Manchu | 0 | 40 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 42 |
| xdc | Dacian | 0 | 40 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 40 |
| rap | Rapa Nui | 1 | 291 | 16 | 4 | 0 | 1 | 0 | 0 | 0 | 0 | 313 |
| axm | Middle Armenian | 6 | 129 | 17 | 7 | 1 | 0 | 0 | 0 | 0 | 0 | 160 |
| gul | Gullah | 0 | 5 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| ext | Extremaduran | 0 | 86 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 90 |
| luo | Dholuo | 0 | 36 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 37 |
| nys | Nyunga | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| dak | Dakota | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| kayah | Kayah | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| frk | Frankish | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| syc | Syriac | 181 | 1391 | 619 | 381 | 200 | 123 | 88 | 39 | 26 | 25 | 3073 |
| pag | Pangasinan | 0 | 11 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 13 |
| agj | Argobba | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| gsw | Swiss German | 4 | 140 | 14 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 160 |
| oj | Ojibwe | 0 | 202 | 41 | 16 | 7 | 4 | 1 | 0 | 0 | 0 | 271 |
| uz | Uzbek | 1 | 301 | 43 | 12 | 1 | 1 | 0 | 0 | 0 | 0 | 359 |
| for | Fore | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| hak | Hakka | 0 | 52 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 54 |
| xtg | Gaulish | 0 | 25 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 28 |
| kjb | Q'anjob'al | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ave | Avestan | 0 | 19 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 21 |
| nij | Ngaju | 0 | 8 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 10 |
| wbw | Woi | 0 | 23 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 23 |
| ntj | Ngaanyatjarra | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| si | Sinhala | 1 | 148 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 150 |
| ase | American Sign Language | 0 | 299 | 47 | 11 | 3 | 1 | 0 | 0 | 0 | 0 | 361 |
| are | Arrernte | 0 | 8 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
| mrv | Mangareva | 0 | 7 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 10 |
| lij | Ligurian | 0 | 147 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 150 |
| cop | Coptic | 0 | 15 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 18 |
| ada | Adangme | 0 | 21 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 23 |
| pit | Pitta-Pitta | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| cui | Cuiba | 0 | 13 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 13 |
| wo | Wolof | 0 | 27 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 27 |
| akk | Akkadian | 0 | 144 | 23 | 8 | 5 | 2 | 1 | 0 | 1 | 0 | 184 |
| kn | Kannada | 0 | 393 | 35 | 3 | 0 | 1 | 0 | 1 | 0 | 0 | 433 |
| xvn | Vandalic | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| gn | Guaraní | 0 | 63 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 66 |
| tsn | Tswana | 1 | 76 | 15 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 95 |
| cjs | Shor | 0 | 38 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 38 |
| yux | Southern Yukaghir | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| alt | Altai | 0 | 64 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 64 |
| pjt | Pitjantjatjara | 0 | 153 | 33 | 11 | 2 | 0 | 0 | 0 | 0 | 0 | 199 |
| xdm | Edomite | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| ccc | Chamicuro | 0 | 441 | 9 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 452 |
| bej | Beja | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| ary | Moroccan Arabic | 0 | 35 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 35 |
| nog | Nogai | 0 | 22 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 23 |
| nzi | Nzema | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| gmh | Middle High German | 1 | 69 | 6 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 79 |
| asm | Assamese | 0 | 37 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 38 |
| gil | Gilbertese | 0 | 19 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 22 |
| mic | Mi'kmaq | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| sc | Sardinian | 0 | 135 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 140 |
| cow | Cowlitz | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| rup | Aromanian | 22 | 976 | 180 | 61 | 11 | 3 | 1 | 0 | 0 | 0 | 1254 |
| cdo | Min Dong | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| crp-gep | Greenlandic Eskimo Pidgin | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| jam | Jamaican Creole | 0 | 31 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 33 |
| pox | Polabian | 0 | 157 | 6 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 164 |
| pam | Kapampangan | 0 | 11 | 2 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 14 |
| khw | Khowar | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| eu | Basque | 22 | 1214 | 68 | 13 | 2 | 2 | 0 | 0 | 0 | 0 | 1321 |
| sv | Swedish | 4994 | 15283 | 1993 | 549 | 231 | 84 | 33 | 19 | 16 | 5 | 23207 |
| tkd | Tukudede | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| sh | Serbo-Croatian | 118 | 39967 | 7189 | 2203 | 654 | 255 | 110 | 42 | 21 | 28 | 50587 |
| se | Northern Sami | 1 | 773 | 37 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 817 |
| kaa | Karakalpak | 0 | 13 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 13 |
| ryu | Okinawan | 13 | 90 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 106 |
| sco | Scots | 140 | 1770 | 232 | 65 | 27 | 12 | 5 | 3 | 0 | 0 | 2254 |
| nah | Nahuatl | 36 | 1439 | 133 | 23 | 11 | 2 | 1 | 0 | 0 | 0 | 1645 |
| nb | Bokmål | 2116 | 4208 | 523 | 133 | 45 | 13 | 3 | 1 | 1 | 0 | 7043 |
| zen | Zenaga | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| aib | Äynu | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| sma | Southern Sami | 0 | 33 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 35 |
| ang | Old English | 149 | 2943 | 732 | 276 | 100 | 42 | 4 | 2 | 0 | 1 | 4249 |
| mia | Miami-Illinois | 0 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
| am | Amharic | 0 | 176 | 6 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 184 |
| dif | Dieri | 0 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
| ln | Lingala | 0 | 63 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 66 |
| ecr | Eteocretan | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| eo | Esperanto | 1173 | 12029 | 807 | 99 | 13 | 0 | 0 | 0 | 0 | 0 | 14121 |
| mzn | Mazandarani | 0 | 54 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 57 |
| cpi | Chinese Pidgin English | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| roa-ptg | Galician-Portuguese | 23 | 334 | 29 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 395 |
| nyn | Nyankole | 0 | 8 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
| li | Limburgish | 0 | 366 | 136 | 12 | 4 | 2 | 0 | 0 | 0 | 0 | 520 |
| fi | Finnish | 4157 | 57816 | 5833 | 1357 | 423 | 174 | 85 | 41 | 22 | 15 | 69923 |
| nxn | Ngawun | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| tpn | Tupinambá | 0 | 9 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 12 |
| ara | Arabic | 101 | 3185 | 704 | 322 | 175 | 105 | 63 | 34 | 43 | 17 | 4749 |
| is | Icelandic | 81 | 9544 | 1475 | 559 | 168 | 80 | 24 | 13 | 1 | 0 | 11945 |
| av | Avar | 0 | 141 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 149 |
| chy | Cheyenne | 0 | 21 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 23 |
| ho | Hiri Motu | 0 | 42 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 42 |
| mdf | Moksha | 0 | 19 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 19 |
| run | Rundi | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| yij | Yindjibarndi | 0 | 21 | 1 | 1 | 2 | 0 | 0 | 0 | 0 | 0 | 25 |
| tati | Tati | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| pap | Papiamento | 0 | 120 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 125 |
| yua | Yucatec Maya | 2 | 150 | 19 | 11 | 6 | 1 | 0 | 0 | 0 | 0 | 189 |
| knb | Lubuagan Kalinga | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| tir | Tigrinya | 0 | 243 | 13 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 259 |
| ff | Fula | 3 | 57 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 65 |
| arl | Arabela | 0 | 36 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 36 |
| apy | Apalaí | 0 | 20 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 22 |
| awa | Awadhi | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| tgt | Tagbanwa | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| gml | Middle Low German | 0 | 52 | 18 | 9 | 0 | 2 | 2 | 0 | 0 | 0 | 83 |
| ess | Central Siberian Yupik | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| cab | Garifuna | 0 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
| iu | Inuktitut | 3 | 187 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 195 |
| vep | Veps | 0 | 139 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 145 |
| yan | Mayangna | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ina | Interlingua | 173 | 1017 | 69 | 16 | 4 | 0 | 0 | 1 | 0 | 0 | 1280 |
| nl | Dutch | 3764 | 19511 | 3504 | 907 | 263 | 102 | 36 | 20 | 9 | 3 | 28119 |
| ood | O'odham | 0 | 22 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 24 |
| jao | Yanyuwa | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| lkt | Lakota | 1 | 42 | 6 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 52 |
| xpr | Parthian | 0 | 24 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 26 |
| non | Old Norse | 14 | 1155 | 169 | 41 | 11 | 7 | 2 | 1 | 0 | 1 | 1401 |
| smn | Inari Sami | 0 | 200 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 210 |
| lo | Lao | 1 | 667 | 91 | 41 | 15 | 5 | 1 | 1 | 1 | 0 | 823 |
| kxv | Kuvi | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| te | Telugu | 19 | 6968 | 815 | 193 | 60 | 45 | 23 | 8 | 4 | 2 | 8137 |
| mnw | Mon | 0 | 44 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 46 |
| ja | Japanese | 1594 | 61244 | 7297 | 2089 | 829 | 346 | 189 | 122 | 71 | 48 | 73829 |
| frm | Middle French | 120 | 2315 | 148 | 7 | 3 | 0 | 0 | 0 | 0 | 0 | 2593 |
| ban | Balinese | 0 | 59 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 68 |
| spx | South Picene | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| tk | Turkmen | 0 | 404 | 15 | 4 | 0 | 1 | 0 | 0 | 0 | 0 | 424 |
| mvi | Miyako | 4 | 33 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 37 |
| jv | Javanese | 2 | 80 | 4 | 1 | 0 | 4 | 0 | 0 | 0 | 0 | 91 |
| ki | Gikuyu | 0 | 5 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| crp-tpr | Taimyr Pidgin Russian | 0 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| doz | Dorze | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| sgz | Sursurunga | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| or | Oriya | 0 | 57 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 57 |
| tar | Tarahumara | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| css | Southern Ohlone | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| agx | Aghul | 0 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
| es | Spanish | 7795 | 31243 | 4806 | 1449 | 469 | 185 | 90 | 38 | 23 | 8 | 46106 |
| sga | Old Irish | 268 | 1009 | 328 | 128 | 46 | 27 | 13 | 3 | 2 | 3 | 1827 |
| vma | Martuthunira | 0 | 101 | 7 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 109 |
| crh | Crimean Tatar | 0 | 1963 | 211 | 32 | 6 | 0 | 0 | 0 | 0 | 0 | 2212 |
| kzg | Kikai | 4 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
| ple | Palu'e | 0 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
| nap | Neapolitan | 8 | 511 | 106 | 23 | 7 | 1 | 0 | 1 | 0 | 0 | 657 |
| nia | Nias | 0 | 28 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 28 |
| mhk | Mungaka | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| kky | Guugu Yimithirr | 1 | 88 | 7 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 99 |
| th | Thai | 8 | 2498 | 301 | 101 | 30 | 5 | 4 | 2 | 1 | 1 | 2951 |
| ba | Bashkir | 7 | 749 | 205 | 74 | 26 | 14 | 9 | 4 | 1 | 0 | 1089 |
| dsb | Lower Sorbian | 54 | 1209 | 242 | 149 | 53 | 1 | 1 | 3 | 1 | 0 | 1713 |
| qu | Quechua | 6 | 546 | 90 | 30 | 11 | 2 | 0 | 0 | 0 | 0 | 685 |
| xss | Assan | 0 | 17 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 22 |
| kbc | Kadiwéu | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| kum | Kumyk | 0 | 117 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 122 |
| bal | Balochi | 0 | 302 | 48 | 18 | 10 | 4 | 0 | 0 | 1 | 0 | 383 |
| bjz | Baruga | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| aoz | Uab Meto | 0 | 18 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 19 |
| umb | Umbundu | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| lb | Luxembourgish | 88 | 3688 | 541 | 137 | 33 | 15 | 2 | 0 | 1 | 0 | 4505 |
| ny | Chewa | 0 | 12 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 13 |
| lzh | Classical Chinese | 2 | 5 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 11 |
| mwr | Marwari | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| ckt | Chukchi | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| bug | Buginese | 0 | 36 | 7 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 45 |
| fa | Persian | 65 | 5075 | 1191 | 476 | 207 | 80 | 37 | 18 | 12 | 6 | 7167 |
| srq | Sirionó | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| kda | Worimi | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| szl | Silesian | 0 | 44 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 48 |
| yuf | Yavapai | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 2 |
| dar | Dargwa | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| krl | Karelian | 0 | 418 | 22 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 442 |
| ml | Malayalam | 3 | 143 | 6 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 153 |
| rhg | Rohingya | 0 | 187 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 188 |
| nbm | Ngbaka Ma'bo | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ymm | Maay | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| gdm | Laal | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| stp | Tepehuán | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| cs | Czech | 143 | 18976 | 1901 | 349 | 110 | 38 | 10 | 5 | 0 | 4 | 21536 |
| tkl | Tokelauan | 0 | 9 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 10 |
| pcm | Nigerian Pidgin | 4 | 61 | 3 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 69 |
| nv | Navajo | 2 | 3299 | 402 | 106 | 25 | 4 | 1 | 0 | 1 | 0 | 3840 |
| waq | Wagiman | 0 | 4 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 6 |
| pal | Middle Persian | 1 | 109 | 30 | 9 | 1 | 0 | 0 | 0 | 0 | 0 | 150 |
| mrc | Maricopa | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| gwc | Kalami | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| mgm | Mambae | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| gv | Manx | 223 | 5103 | 718 | 243 | 123 | 45 | 43 | 13 | 5 | 7 | 6523 |
| co | Corsican | 4 | 155 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 164 |
| bzd | Bribri | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| kr | Kanuri | 0 | 11 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 12 |
| zun | Zuni | 0 | 13 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 20 |
| dlm | Dalmatian | 0 | 694 | 77 | 18 | 1 | 0 | 0 | 0 | 0 | 0 | 790 |
| tix | Southern Tiwa | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| sah | Sakha | 0 | 226 | 26 | 8 | 1 | 0 | 0 | 0 | 0 | 0 | 261 |
| yai | Yaghnobi | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| brg | Baure | 0 | 19 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 19 |
| ks | Kashmiri | 0 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 10 |
| xlu | Luwian | 0 | 46 | 4 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 54 |
| zko | Kott | 1 | 126 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 134 |
| wym | Vilamovian | 16 | 941 | 87 | 17 | 3 | 3 | 0 | 1 | 0 | 0 | 1068 |
| juc | Jurchen | 0 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| rof | Rombo | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| bar | Bavarian | 0 | 30 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 30 |
| blt | Tai Dam | 0 | 23 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 23 |
| apm | Chiricahua | 1 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| ky | Kyrgyz | 0 | 287 | 15 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 304 |
| bn | Bengali | 2 | 1703 | 120 | 55 | 11 | 5 | 0 | 2 | 0 | 0 | 1898 |
| nan | Min Nan | 17 | 1064 | 63 | 9 | 1 | 0 | 0 | 0 | 0 | 0 | 1154 |
| khb | Tai Lü | 0 | 32 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 35 |
| wbb | Wabo | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| prg | Old Prussian | 1 | 223 | 20 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 246 |
| peo | Old Persian | 8 | 95 | 13 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 121 |
| aeb | Tunisian Arabic | 0 | 20 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 22 |
| mk | Macedonian | 13 | 5287 | 616 | 85 | 25 | 5 | 2 | 1 | 0 | 0 | 6034 |
| haw | Hawaiian | 6 | 924 | 335 | 101 | 16 | 5 | 2 | 1 | 1 | 0 | 1391 |
| amk | Ambai | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| gut | Maléku | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| wrp | Waropen | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| esu | Central Alaskan Yup'ik | 0 | 55 | 5 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 61 |
| fur | Friulian | 14 | 930 | 141 | 61 | 12 | 1 | 0 | 0 | 0 | 0 | 1159 |
| aak | Ankave | 0 | 9 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 10 |
| tcy | Tulu | 0 | 110 | 5 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 116 |
| xho | Xhosa | 0 | 110 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 112 |
| osx | Old Saxon | 84 | 1213 | 259 | 42 | 9 | 1 | 0 | 0 | 0 | 0 | 1608 |
| mdr | Mandar | 0 | 6 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
| mga | Middle Irish | 1 | 50 | 15 | 3 | 1 | 2 | 0 | 0 | 0 | 0 | 72 |
| bsb | Brunei Bisaya | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| pad | Paumarí | 0 | 12 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 13 |
| oc | Occitan | 26 | 1142 | 122 | 7 | 3 | 0 | 0 | 0 | 0 | 0 | 1300 |
| roa-nor | Norman | 180 | 6494 | 288 | 35 | 4 | 0 | 0 | 0 | 0 | 0 | 7001 |
| vls | Flemish | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| xvs | Vestinian | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| kea | Kabuverdianu | 0 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
| uln | Unserdeutsch | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| itl | Itelmen | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| ajp | South Levantine Arabic | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| acv | Achumawi | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| rw | Kinyarwanda | 0 | 26 | 6 | 0 | 2 | 1 | 0 | 0 | 0 | 0 | 35 |
| jbo | Lojban | 361 | 2753 | 75 | 4 | 1 | 0 | 0 | 1 | 0 | 0 | 3195 |
| kri | Krio | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| nds-nl | Dutch Low Saxon | 6 | 90 | 8 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 105 |
| tum | Tumbuka | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| mas | Maasai | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| bcl | Bikol Central | 0 | 10 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 15 |
| aty | Aneityum | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| wya | Wyandot | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| nn | Norwegian Nynorsk | 1192 | 5826 | 453 | 94 | 17 | 6 | 3 | 1 | 0 | 0 | 7592 |
| ski | Sika | 0 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| abq | Abaza | 0 | 30 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 31 |
| xcr | Carian | 0 | 49 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 50 |
| ali | Amaimon | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ch | Chamorro | 0 | 88 | 7 | 2 | 0 | 1 | 0 | 0 | 0 | 0 | 98 |
| ani | Andi | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| tzj | Tz'utujil | 0 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
| xvo | Volscian | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| xil | Illyrian | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ist | Istriot | 9 | 360 | 35 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 413 |
| sms | Skolt Sami | 0 | 227 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 228 |
| xav | Xavante | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| sw | Swahili | 51 | 2131 | 118 | 28 | 7 | 0 | 1 | 1 | 0 | 0 | 2337 |
| awk | Awabakal | 0 | 6 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
| nrn | Norn | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| ady | Adyghe | 2 | 2487 | 298 | 77 | 34 | 9 | 0 | 3 | 0 | 0 | 2910 |
| sk | Slovak | 14 | 2025 | 245 | 49 | 14 | 3 | 3 | 1 | 0 | 0 | 2354 |
| min | Minangkabau | 2 | 60 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 62 |
| orv | Old East Slavic | 1 | 60 | 19 | 5 | 2 | 0 | 0 | 0 | 1 | 0 | 88 |
| pwn | Paiwan | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| ale | Aleut | 0 | 68 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 68 |
| kal | Greenlandic | 5 | 761 | 80 | 5 | 2 | 0 | 0 | 0 | 0 | 0 | 853 |
| niv | Nivkh | 0 | 13 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 15 |
| roo | Rotokas | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| kaz | Kazakh | 0 | 224 | 8 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 234 |
| bg | Bulgarian | 344 | 5757 | 568 | 244 | 87 | 49 | 30 | 25 | 2 | 6 | 7112 |
| Chumashan | Chumashan | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| lou | Louisiana Creole French | 0 | 14 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 14 |
| cy | Welsh | 149 | 2552 | 278 | 70 | 16 | 5 | 4 | 0 | 1 | 1 | 3076 |
| bou | Bondei | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| fy | West Frisian | 15 | 835 | 115 | 25 | 3 | 2 | 0 | 0 | 0 | 0 | 995 |
| aiw | Aari | 0 | 20 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 20 |
| abe | Abenaki | 1 | 92 | 2 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 97 |
| tgl | Tagalog | 39 | 1306 | 129 | 32 | 8 | 1 | 1 | 0 | 0 | 0 | 1516 |
| pis | Pijin | 3 | 54 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 63 |
| akz | Alabama | 0 | 58 | 11 | 3 | 1 | 1 | 0 | 0 | 0 | 0 | 74 |
| bi | Bislama | 3 | 36 | 2 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 42 |
| kw | Cornish | 6 | 668 | 44 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 724 |
| bku | Buhid | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| obm | Moabite | 0 | 18 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 20 |
| kha | Khasi | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ain | Ainu | 1 | 104 | 19 | 9 | 3 | 1 | 0 | 0 | 0 | 0 | 137 |
| fon | Fon | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| egy | Egyptian | 23 | 715 | 91 | 25 | 9 | 1 | 3 | 2 | 0 | 1 | 870 |
| aln | Gheg | 0 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
| duj | Dhuwal | 0 | 14 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 15 |
| ksi | I'saka | 0 | 20 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 27 |
| tsg | Tausug | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| cjm | Eastern Cham | 0 | 10 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 12 |
| lbe | Lak | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| sas | Sasak | 2 | 23 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 26 |
| ace | Acehnese | 0 | 62 | 6 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 70 |
| km | Khmer | 6 | 737 | 122 | 60 | 38 | 21 | 13 | 7 | 1 | 1 | 1006 |
| den | Slavey | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| xrn | Arin | 0 | 32 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 34 |
| arn | Mapudungun | 0 | 513 | 252 | 59 | 23 | 3 | 2 | 0 | 0 | 0 | 852 |
| kj | Ovambo | 0 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 10 |
| adj | Adioukrou | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ka | Georgian | 109 | 12600 | 860 | 53 | 40 | 2 | 5 | 0 | 1 | 0 | 13670 |
| gan | Gan | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| bo | Tibetan | 3 | 421 | 70 | 18 | 5 | 1 | 2 | 1 | 0 | 0 | 521 |
| ltg | Latgalian | 0 | 91 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 94 |
| om | Oromo | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| mul | Translingual | 469 | 34951 | 7080 | 2755 | 975 | 366 | 176 | 82 | 31 | 12 | 46897 |
| meu | Motu | 0 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| mh | Marshallese | 1 | 221 | 14 | 8 | 5 | 1 | 0 | 0 | 0 | 1 | 251 |
| ne | Nepali | 0 | 76 | 10 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 87 |
| bdy | Bandjalang | 0 | 36 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 40 |
| hai | Haida | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| rcf | Réunion Creole | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| mns | Mansi | 0 | 15 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 16 |
| zai | Isthmus Zapotec | 1 | 21 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 22 |
| xcl | Classical Armenian | 528 | 2966 | 975 | 448 | 203 | 81 | 41 | 25 | 12 | 8 | 5287 |
| kac | Jingpho | 0 | 37 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 37 |
| bh | Bihari | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| evn | Evenki | 0 | 42 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 45 |
| aus-bun | Bunurong | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ssb | Southern Sama | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| wbp | Warlpiri | 0 | 43 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 45 |
| rom | Romani | 13 | 359 | 32 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 407 |
| aka | Akan | 0 | 28 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 28 |
| akm | Bo | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| udi | Udi | 0 | 39 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 44 |
| cpe-spp | Samoan Plantation Pidgin | 0 | 22 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 24 |
| ayl | Libyan Arabic | 0 | 125 | 36 | 2 | 1 | 1 | 0 | 0 | 0 | 0 | 165 |
| cia | Cia-Cia | 0 | 57 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 57 |
| ofs | Old Frisian | 12 | 221 | 4 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 239 |
| lzz | Laz | 0 | 34 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 36 |
| lad | Judaeo-Spanish | 23 | 1041 | 57 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 1125 |
| sjk | Kemi Sami | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| xhu | Hurrian | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| owl | Old Welsh | 0 | 10 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 12 |
| enm | Middle English | 47 | 1289 | 86 | 24 | 10 | 6 | 4 | 1 | 0 | 0 | 1467 |
| alq | Algonquin | 0 | 21 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 22 |
| xal | Kalmyk | 0 | 16 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 20 |
| sbf | Shabo | 0 | 68 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 69 |
| xeb | Eblaite | 0 | 8 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
| zku | Kaurna | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| chk | Chuukese | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| arg | Aragonese | 1 | 224 | 3 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 231 |
| tet | Tetum | 0 | 96 | 7 | 3 | 0 | 1 | 0 | 0 | 0 | 0 | 107 |
| uby | Ubykh | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| mvr | Marau | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| kgg | Kusunda | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ast | Asturian | 784 | 4148 | 368 | 81 | 34 | 14 | 5 | 3 | 1 | 2 | 5440 |
| mwf | Murrinh-Patha | 0 | 12 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 12 |
| ur | Urdu | 2 | 1041 | 452 | 287 | 188 | 121 | 53 | 45 | 31 | 18 | 2238 |
| pot | Potawatomi | 0 | 14 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 15 |
| yii | Yidiny | 0 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
| cu | Old Church Slavonic | 2 | 1871 | 327 | 76 | 13 | 0 | 0 | 1 | 0 | 0 | 2290 |
| scn | Sicilian | 53 | 834 | 145 | 41 | 11 | 2 | 0 | 0 | 0 | 1 | 1087 |
| nxe | Nage | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| pro | Old Occitan | 16 | 232 | 25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 273 |
| aus-dar | Darkinjung | 0 | 88 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 89 |
| vin | Vinza | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| hrx | Hunsrik | 0 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 26 |
| mus | Creek | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| csb | Cassubian | 0 | 419 | 35 | 17 | 3 | 1 | 0 | 0 | 0 | 0 | 475 |
| cr | Cree | 0 | 38 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 40 |
| xta | Alcozauca Mixtec | 0 | 24 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 26 |
| ay | Aymara | 0 | 39 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 41 |
| dtp | Dusun | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| wew | Weyewa | 0 | 7 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
| miq | Miskito | 0 | 14 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 16 |
| xpo | Pochutec | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| eml | Emiliano-Romagnolo | 0 | 66 | 4 | 5 | 1 | 0 | 0 | 0 | 0 | 0 | 76 |
| dbj | Ida'an | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| mnk | Mandinka | 0 | 57 | 7 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 68 |
| del | Delaware | 0 | 18 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 19 |
| kut | Kutenai | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| mr | Marathi | 0 | 202 | 25 | 6 | 0 | 1 | 0 | 0 | 0 | 0 | 234 |
| ndh | Ndali | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| tr | Turkish | 148 | 11400 | 1989 | 291 | 108 | 67 | 29 | 17 | 5 | 4 | 14058 |
| ewe | Ewe | 0 | 466 | 58 | 16 | 9 | 1 | 1 | 0 | 0 | 0 | 551 |
| huq | Tsat | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| mai | Maithili | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| et | Estonian | 119 | 3304 | 238 | 41 | 16 | 4 | 1 | 1 | 0 | 0 | 3724 |
| dbl | Dyirbal | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| sd | Sindhi | 1 | 53 | 8 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 66 |
| coo | Comox | 0 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
| gaa | Ga | 1 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 17 |
| kem | Kemak | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| smj | Lule Sami | 0 | 36 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 38 |
| lg | Luganda | 1 | 23 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 24 |
| xas | Kamassian | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| bua | Buryat | 0 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
| iii | Nuosu | 0 | 51 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 53 |
| yo | Yoruba | 0 | 123 | 4 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 129 |
| xsr | Sherpa | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| kg | Kongo | 0 | 12 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 12 |
| hu | Hungarian | 16 | 34309 | 1552 | 344 | 90 | 34 | 13 | 2 | 2 | 0 | 36362 |
| gag | Gagauz | 0 | 59 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 64 |
| gha | Ghadamès | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| aji | Ajië | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| he | Hebrew | 116 | 5385 | 889 | 259 | 89 | 43 | 13 | 9 | 4 | 0 | 6807 |
| alr | Alyutor | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| gld | Nanai | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| wgy | Warrgamay | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| jiv | Shuar | 0 | 28 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 31 |
| cmn | Mandarin | 2231 | 32065 | 749 | 281 | 129 | 87 | 76 | 48 | 44 | 44 | 35754 |
| uk | Ukrainian | 9 | 2003 | 269 | 92 | 38 | 12 | 8 | 0 | 1 | 1 | 2433 |
| jct | Krymchak | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| chg | Chagatai | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| izh | Ingrian | 0 | 50 | 3 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 55 |
| az | Azerbaijani | 24 | 995 | 65 | 11 | 8 | 2 | 0 | 0 | 1 | 0 | 1106 |
| cst | Northern Ohlone | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ve | Venda | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| sa | Sanskrit | 14 | 981 | 454 | 309 | 186 | 133 | 119 | 75 | 65 | 29 | 2365 |
| ctu | Chol | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| cv | Chuvash | 0 | 53 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 58 |
| osp | Old Spanish | 1 | 40 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 45 |
| brc | Berbice Creole Dutch | 0 | 8 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
| my | Burmese | 1 | 505 | 38 | 7 | 1 | 0 | 0 | 0 | 0 | 0 | 552 |
| kpg | Kapingamarangi | 0 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
| ru | Russian | 505 | 17033 | 3713 | 1414 | 576 | 246 | 125 | 59 | 35 | 13 | 23719 |
| wyb | Ngiyambaa | 0 | 37 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 38 |
| agg | Angor | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| men | Mende | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| pua | Purepecha | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| gwe | Gweno | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| it | Italian | 13567 | 93559 | 13733 | 3485 | 932 | 264 | 101 | 40 | 16 | 9 | 125706 |
| iba | Iban | 0 | 29 | 3 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 34 |
| pon | Pohnpeian | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| la | Latin | 15878 | 21330 | 6778 | 2459 | 1010 | 376 | 168 | 84 | 31 | 13 | 48127 |
| zh | Chinese | 1061 | 39922 | 4637 | 785 | 222 | 52 | 16 | 10 | 8 | 1 | 46714 |
| na | Nauruan | 0 | 109 | 1 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 112 |
| wba | Warao | 0 | 20 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 20 |
| shh | Shoshone | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| ro | Romanian | 417 | 8430 | 2401 | 690 | 242 | 33 | 16 | 7 | 0 | 0 | 12236 |
| kpy | Koryak | 0 | 42 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 42 |
| gnd | Zulgo-Gemzek | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| sov | Sonsorolese | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ulk | Meriam | 0 | 41 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 46 |
| gcf | Antillean Creole | 0 | 9 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 10 |
| pi | Pali | 3 | 40 | 3 | 5 | 2 | 0 | 0 | 0 | 0 | 0 | 53 |
| so | Somali | 0 | 91 | 6 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 98 |
| xpg | Phrygian | 0 | 17 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 19 |
| mn | Mongolian | 3 | 873 | 78 | 25 | 12 | 8 | 8 | 2 | 2 | 1 | 1012 |
| bm | Bambara | 1 | 36 | 5 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 43 |
| sog | Sogdian | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| akl | Aklanon | 0 | 14 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 14 |
| dgi | Northern Dagara | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| bhw | Biak | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| kdd | Yankunytjatjara | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| amu | Amuzgo | 0 | 27 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 28 |
| id_ | Indonesian | 14 | 1660 | 191 | 58 | 12 | 9 | 7 | 1 | 1 | 1 | 1954 |
| mt | Maltese | 33 | 1461 | 96 | 26 | 0 | 2 | 0 | 0 | 0 | 0 | 1618 |
| vec | Venetian | 230 | 1713 | 506 | 82 | 7 | 4 | 0 | 0 | 0 | 0 | 2542 |
| bft | Balti | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| omn | Minoan | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| lld | Ladin | 219 | 1159 | 192 | 30 | 1 | 0 | 1 | 0 | 0 | 0 | 1602 |
| zu | Zulu | 15 | 468 | 80 | 25 | 10 | 0 | 1 | 2 | 0 | 0 | 601 |
| hsb | Upper Sorbian | 0 | 226 | 12 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 242 |
| new | Newari | 0 | 15 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 15 |
| hi | Hindi | 11 | 2233 | 628 | 330 | 207 | 117 | 67 | 38 | 24 | 18 | 3673 |
| abm | Abanyom | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| wlm | Middle Welsh | 0 | 11 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 12 |
| zkt | Khitan | 0 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
| kld | Gamilaraay | 2 | 124 | 15 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 141 |
| rar | Rarotongan | 0 | 8 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
| tiw | Tiwi | 0 | 5 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| en | English | 15067 | 349547 | 41557 | 11912 | 5058 | 2543 | 1488 | 911 | 615 | 420 | 429118 |
| zmg | Marti Ke | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| yue | Cantonese | 824 | 16943 | 83 | 25 | 7 | 0 | 2 | 1 | 0 | 1 | 17886 |
| hnd | Hindko | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| tfn | Dena'ina | 0 | 21 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 24 |
| sat | Santali | 0 | 78 | 7 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 90 |
| hit | Hittite | 0 | 71 | 13 | 6 | 3 | 1 | 2 | 0 | 1 | 0 | 97 |
| chm | Mari | 0 | 56 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 56 |
| abk | Abkhaz | 0 | 134 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 138 |
| sei | Seri | 0 | 55 | 5 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 61 |
| dgr | Dogrib | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| bvb | Bube | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ps | Pashto | 2 | 561 | 68 | 19 | 4 | 0 | 0 | 0 | 0 | 0 | 654 |
| bew | Betawi | 0 | 29 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 32 |
| cho | Choctaw | 0 | 82 | 22 | 7 | 6 | 2 | 0 | 0 | 0 | 0 | 119 |
| fud | Futunan | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| chl | Cahuilla | 1 | 434 | 26 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 464 |
| kju | Kashaya | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| tgk | Tajik | 1 | 623 | 135 | 60 | 30 | 12 | 2 | 3 | 3 | 0 | 869 |
| wiv | Vitu | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| liv | Livonian | 4 | 2426 | 215 | 59 | 11 | 0 | 1 | 0 | 1 | 0 | 2717 |
| sem-amm | Ammonite | 0 | 8 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
| kos | Kosraean | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| inh | Ingush | 0 | 16 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 17 |
| st | Sotho | 0 | 32 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 35 |
| pmt | Tuamotuan | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| ket | Ket | 0 | 8 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 10 |
| ota | Ottoman Turkish | 0 | 401 | 172 | 84 | 35 | 25 | 7 | 6 | 5 | 0 | 735 |
| zkz | Khazar | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| lmo | Lombard | 1 | 46 | 5 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 54 |
| smo | Samoan | 4 | 250 | 31 | 1 | 0 | 2 | 0 | 0 | 0 | 0 | 288 |
| bho | Bhojpuri | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| yrk | Nenets | 0 | 29 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 29 |
| xlc | Lycian | 0 | 34 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 34 |
| brh | Brahui | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| ltc | Middle Chinese | 166 | 453 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 624 |
| got | Gothic | 63 | 9298 | 71 | 14 | 0 | 0 | 0 | 0 | 0 | 0 | 9446 |
| mth | Munggui | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| abs | Ambonese | 0 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 10 |
| ade | Adele | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ga | Irish | 776 | 10508 | 1677 | 600 | 247 | 109 | 56 | 25 | 16 | 5 | 14019 |
| sux | Sumerian | 0 | 101 | 17 | 7 | 2 | 2 | 0 | 0 | 0 | 0 | 129 |
| auj | Awjila | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| hif | Fiji Hindi | 0 | 114 | 3 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 119 |
| alp | Alune | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| mni | Meitei | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| mps | Dadibi | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| tyv | Tuvan | 0 | 152 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 159 |
| jrb | Judeo-Arabic | 0 | 50 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 53 |
| ruo | Istro-Romanian | 1 | 75 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 78 |
| aus-wem | Wemba-Wemba | 0 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| zza | Zazaki | 1 | 242 | 32 | 8 | 1 | 0 | 0 | 0 | 0 | 0 | 284 |
| ty | Tahitian | 0 | 219 | 15 | 3 | 0 | 1 | 0 | 0 | 0 | 0 | 238 |
| tzm | Central Morocco Tamazight | 0 | 113 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 119 |
| crk | Plains Cree | 0 | 5 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| mjg | Monguor | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ksd | Tolai | 0 | 30 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 31 |
| frr | North Frisian | 0 | 187 | 9 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 197 |
| mer | Meru | 0 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
| vol | Volapük | 140 | 2477 | 135 | 36 | 16 | 4 | 2 | 1 | 0 | 0 | 2811 |
| hy | Armenian | 110 | 8025 | 1796 | 433 | 147 | 93 | 38 | 12 | 8 | 3 | 10665 |
| vro | Võro | 0 | 158 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 162 |
| aus-gun | Gunai | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| arp | Arapaho | 0 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
| sqi | Albanian | 38 | 3879 | 666 | 179 | 63 | 23 | 8 | 2 | 0 | 0 | 4858 |
| sjd | Kildin Sami | 0 | 32 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 33 |
| hmn | Hmong | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| krc | Karachay-Balkar | 0 | 71 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 73 |
| fit | Meänkieli | 0 | 13 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 15 |
| nds | Low Saxon | 32 | 182 | 62 | 25 | 19 | 6 | 2 | 1 | 2 | 1 | 332 |
| nha | Nhanda | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| ms | Malay | 90 | 2916 | 205 | 65 | 18 | 17 | 8 | 0 | 3 | 0 | 3322 |
| ku | Kurdish | 11 | 1784 | 70 | 26 | 0 | 8 | 0 | 0 | 0 | 0 | 1899 |
| xum | Umbrian | 0 | 23 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 23 |
| de | German | 2365 | 40759 | 4445 | 1146 | 389 | 143 | 59 | 26 | 11 | 4 | 49347 |
| wam | Massachusett | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ug | Uyghur | 0 | 351 | 32 | 11 | 2 | 1 | 0 | 0 | 0 | 0 | 397 |
| guz | Gusii | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| wuu | Wu | 5 | 13 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 19 |
| kam | Kamba | 0 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
| ie | Occidental | 0 | 92 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 93 |
| ce | Chechen | 0 | 185 | 11 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 201 |
| ta | Tamil | 0 | 466 | 40 | 20 | 4 | 1 | 4 | 0 | 2 | 0 | 537 |
| kio | Kiowa | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| vai | Vai | 5 | 308 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 315 |
| osc | Oscan | 0 | 13 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 14 |
| xmf | Mingrelian | 0 | 36 | 3 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 41 |
| pih | Pitcairn-Norfolk | 0 | 22 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 22 |
| khv | Khwarshi | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| za | Zhuang | 0 | 42 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 46 |
| tpi | Tok Pisin | 11 | 719 | 73 | 25 | 7 | 0 | 3 | 0 | 0 | 0 | 838 |
| su | Sundanese | 0 | 53 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 56 |
| pgl | Primitive Irish | 0 | 25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 25 |
| rme | Angloromani | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| gez | Ge'ez | 0 | 19 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 21 |
| aqc | Archi | 0 | 14 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 16 |
| yi | Yiddish | 72 | 1338 | 172 | 33 | 10 | 2 | 0 | 1 | 0 | 0 | 1628 |
| oma | Omaha-Ponca | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| zmb | Zimba | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| cic | Chickasaw | 0 | 342 | 46 | 9 | 1 | 0 | 0 | 1 | 0 | 0 | 399 |
| amn | Amanab | 0 | 54 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 54 |
| gni | Gooniyandi | 0 | 56 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 57 |
| xfa | Faliscan | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| twf | Taos | 1 | 489 | 23 | 6 | 2 | 2 | 0 | 0 | 0 | 0 | 523 |
| aus-syd | Sydney | 2 | 23 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 25 |
| str | Saanich | 0 | 200 | 9 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 210 |
| kln | Kalenjin | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| din | Dinka | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| tpw | Old Tupi | 1 | 74 | 11 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 89 |
| tay | Atayal | 0 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
| pga | Juba Arabic | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ha | Hausa | 1 | 327 | 58 | 15 | 5 | 1 | 0 | 0 | 0 | 0 | 407 |
| gmy | Mycenaean Greek | 0 | 101 | 16 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 120 |
| be | Belarusian | 0 | 1176 | 67 | 6 | 3 | 0 | 0 | 0 | 0 | 0 | 1252 |
| kjg | Khmu | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| afb | Gulf Arabic | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| udm | Udmurt | 0 | 21 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 22 |
| tt | Tatar | 0 | 534 | 26 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 564 |
| sva | Svan | 0 | 7 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
| yur | Yurok | 0 | 280 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 285 |
| gbb | Kaytetye | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| ksh | Kölsch | 0 | 58 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 62 |
| num | Niuafo'ou | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| dv | Dhivehi | 0 | 19 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 23 |
| rif | Tarifit | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| os | Ossetian | 0 | 233 | 31 | 11 | 1 | 0 | 1 | 0 | 0 | 0 | 277 |
| tts | Isan | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| bpl | Broome Pearling Lugger Pidgin | 0 | 15 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 16 |
| pnw | Panyjima | 0 | 14 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 15 |
| shn | Shan | 0 | 44 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 46 |
| wad | Wandamen | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| raj | Rajasthani | 0 | 122 | 25 | 9 | 7 | 1 | 0 | 0 | 0 | 0 | 164 |
| mfe | Mauritian Creole | 8 | 117 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 134 |
| mod | Mobilian | 1 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| crg | Michif | 0 | 25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 25 |
| xmk | Ancient Macedonian | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| gu | Gujarati | 0 | 219 | 25 | 5 | 3 | 2 | 0 | 0 | 0 | 1 | 255 |
| arc | Aramaic | 3 | 1423 | 374 | 91 | 28 | 8 | 1 | 0 | 0 | 0 | 1928 |
| aus-wwg | Woiwurrung | 0 | 47 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 52 |
| kjr | Kurudu | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| tna | Tacana | 0 | 12 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 12 |
| xld | Lydian | 0 | 34 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 36 |
| ca | Catalan | 2096 | 8988 | 919 | 263 | 79 | 32 | 7 | 1 | 2 | 3 | 12390 |
| srn | Sranan Tongo | 0 | 279 | 29 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 310 |
| moe | Innu-aimun | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| yut | Yopno | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| phn | Phoenician | 0 | 108 | 7 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 117 |
| kab | Kabyle | 0 | 52 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 54 |
| solresol | Solresol | 0 | 28 | 7 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 37 |
| sn | Shona | 0 | 6 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
| cax | Chiquitano | 0 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
| frp | Franco-Provençal | 1 | 46 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 47 |
| dng | Dungan | 0 | 19 | 1 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 22 |
| gvf | Golin | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| bpy | Bishnupriya Manipuri | 0 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
| pcd | Picard | 0 | 32 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 34 |
| gd | Scottish Gaelic | 541 | 6415 | 1230 | 442 | 208 | 78 | 32 | 23 | 13 | 5 | 8987 |
| myp | Pirahã | 0 | 10 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 14 |
| wim | Wik-Mungknh | 0 | 24 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 24 |
| stq | Saterland Frisian | 1 | 115 | 15 | 1 | 3 | 0 | 0 | 0 | 0 | 0 | 135 |
| mha | Manda | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| sg | Sango | 0 | 25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 25 |
| har | Harari | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| myx | Masaba | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| otk | Old Turkic | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| tli | Tlingit | 0 | 34 | 6 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 41 |
| grc | Ancient Greek | 107 | 3728 | 812 | 462 | 269 | 167 | 108 | 70 | 43 | 20 | 5786 |
| kyi | Kiput | 0 | 14 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 14 |
| vi | Vietnamese | 523 | 8884 | 408 | 73 | 25 | 10 | 2 | 0 | 0 | 1 | 9926 |
| mi | Maori | 0 | 403 | 50 | 10 | 5 | 0 | 1 | 0 | 0 | 0 | 469 |
| efi | Efik | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ceb | Cebuano | 12 | 80 | 11 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 104 |
| naq | Nama | 0 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| and | Ansus | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| ilo | Ilokano | 0 | 40 | 2 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 43 |
| sqt | Soqotri | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| zpq | Zoogocho Zapotec | 0 | 10 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 12 |
| tso | Tsonga | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| to | Tongan | 1 | 107 | 12 | 2 | 0 | 1 | 0 | 0 | 0 | 0 | 123 |
| niu | Niuean | 0 | 40 | 2 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 44 |
| luy | Luhya | 0 | 36 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 36 |
| aie | Amara | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| adt | Adnyamathanha | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| lv | Latvian | 802 | 55986 | 31508 | 5355 | 2139 | 4976 | 2974 | 74 | 3226 | 22 | 107062 |
| fr | French | 7856 | 39919 | 6459 | 1550 | 512 | 251 | 110 | 45 | 27 | 13 | 56742 |
| wa | Walloon | 9 | 315 | 24 | 4 | 4 | 2 | 1 | 1 | 0 | 0 | 360 |
| tcs | Torres Strait Creole | 0 | 106 | 7 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 116 |
| ake | Akawaio | 0 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| gl | Galician | 900 | 4178 | 477 | 73 | 33 | 7 | 5 | 5 | 0 | 0 | 5678 |
| loz | Lozi | 0 | 2 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| myh | Makah | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| lt | Lithuanian | 84 | 17575 | 2821 | 490 | 69 | 7 | 1 | 1 | 1 | 1 | 21050 |
| fo | Faroese | 72 | 3684 | 617 | 172 | 68 | 28 | 13 | 2 | 2 | 1 | 4659 |
| yap | Yapese | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| sjt | Ter Sami | 0 | 107 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 112 |
| ett | Etruscan | 2 | 4 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
| ik | Inupiaq | 0 | 19 | 5 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 25 |
| chh | Chinook | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| da | Danish | 380 | 24779 | 1674 | 482 | 189 | 91 | 53 | 33 | 9 | 4 | 27694 |
| caa | Ch'orti' | 0 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 10 |
| lre | Laurentian | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| bjn | Banjarese | 0 | 42 | 2 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 45 |
| xve | Venetic | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| lun | Lunda | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| tmh | Tamashaq | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| ht | Haitian Creole | 2 | 742 | 31 | 6 | 2 | 1 | 1 | 0 | 0 | 0 | 785 |
| fro | Old French | 540 | 3923 | 698 | 120 | 34 | 3 | 2 | 0 | 0 | 0 | 5320 |
| lez | Lezgian | 0 | 49 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 50 |
| win | Winnebago | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| alu | 'Are'are | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| com | Comanche | 0 | 60 | 5 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 67 |
| kbd | Kabardian | 0 | 436 | 54 | 6 | 6 | 0 | 0 | 1 | 1 | 0 | 504 |
| xto | Tocharian | 1 | 187 | 24 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 214 |
| moh | Mohawk | 0 | 4 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| xpm | Pumpokol | 0 | 46 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 46 |
| kv | Komi | 0 | 37 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 39 |
| www | Wawa | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| tkr | Tsakhur | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| mad | Madurese | 0 | 15 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 15 |
| ko | Korean | 1344 | 16106 | 1023 | 241 | 78 | 40 | 15 | 5 | 12 | 10 | 18874 |
| rue | Rusyn | 0 | 36 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 38 |
| lvk | Lavukaleve | 0 | 12 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 13 |
| nov | Novial | 9 | 596 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 612 |
| kyh | Karok | 0 | 2 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| uga | Ugaritic | 0 | 325 | 72 | 16 | 3 | 1 | 1 | 0 | 0 | 0 | 418 |
| quc | K'iche' | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| aar | Afar | 0 | 21 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 21 |
| ach | Acholi | 0 | 18 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 19 |
| vmb | Mbabaram | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| mpm | Yosondúa Mixtec | 0 | 11 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 11 |
| saw | Sawi | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| aaa | Ghotuo | 1 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 11 |
| lua | Luba-Kasai | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| pim | Powhatan | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| xls | Lusitanian | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| mak | Makassarese | 0 | 11 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 11 |
| adz | Adzera | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| nay | Ngarrindjeri | 0 | 142 | 7 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 152 |
| xpu | Punic | 0 | 36 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 39 |
| nod | Northern Thai | 0 | 37 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 38 |
| rop | Kriol | 0 | 27 | 4 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 33 |
| ish | Esan | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| gay | Gayo | 0 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| io | Ido | 680 | 5005 | 218 | 42 | 12 | 2 | 0 | 0 | 0 | 0 | 5959 |
| ppm | Papuma | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| war | Waray-Waray | 0 | 4 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 6 |
| afr | Afrikaans | 120 | 1704 | 176 | 34 | 12 | 5 | 2 | 1 | 1 | 0 | 2055 |
| myv | Erzya | 0 | 66 | 6 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 73 |
| chp | Chipewyan | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| roa-tara | Tarantino | 13 | 490 | 29 | 3 | 0 | 0 | 1 | 0 | 0 | 0 | 536 |
| apw | Western Apache | 0 | 22 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 22 |
| chr | Cherokee | 0 | 331 | 21 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 353 |
| dz | Dzongkha | 0 | 10 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 12 |
| apc | North Levantine Arabic | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| aii | Assyrian Neo-Aramaic | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| ibo | Igbo | 0 | 31 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 35 |
| mixe | Mixe | 0 | 5 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| pl | Polish | 636 | 15881 | 1442 | 640 | 685 | 23 | 10 | 2 | 8 | 2 | 19329 |
| fkv | Kven | 0 | 20 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 20 |
| bla | Blackfoot | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| el | Greek | 83 | 19167 | 4301 | 5088 | 895 | 104 | 102 | 10 | 4 | 1 | 29755 |
| seu | Serui-Laut | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| br | Breton | 9 | 952 | 70 | 16 | 6 | 2 | 1 | 0 | 0 | 0 | 1056 |
| tig | Tigre | 0 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
| kca | Khanty | 0 | 41 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 41 |
| fj | Fijian | 2 | 146 | 12 | 0 | 0 | 3 | 0 | 0 | 0 | 0 | 163 |
| pt | Portuguese | 5596 | 23275 | 4550 | 1309 | 408 | 132 | 63 | 23 | 19 | 4 | 35379 |
| hop | Hopi | 0 | 11 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 12 |
| rut | Rutul | 0 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
| vot | Votic | 0 | 220 | 5 | 2 | 0 | 0 | 1 | 0 | 0 | 0 | 228 |
| roa-gal | Gallo | 0 | 152 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 152 |
| pau | Palauan | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| zav | Yatzachi Zapotec | 0 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 16 |
| mrh | Mara Chin | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| xbc | Bactrian | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| no | Norwegian | 48 | 4437 | 314 | 58 | 20 | 6 | 2 | 2 | 0 | 0 | 4887 |
| kim | Tofa | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| arw | Arawak | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| sth | Shelta | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| hil | Hiligaynon | 3 | 1357 | 137 | 25 | 3 | 0 | 0 | 0 | 0 | 0 | 1525 |
| aus-gab | Gabi | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| goh | Old High German | 9 | 882 | 83 | 10 | 2 | 0 | 0 | 0 | 0 | 0 | 986 |
| rm | Romansch | 24 | 2021 | 92 | 9 | 1 | 0 | 0 | 0 | 0 | 0 | 2147 |
| vmw | Makhuwa | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| arz | Egyptian Arabic | 1 | 288 | 25 | 8 | 3 | 1 | 0 | 0 | 0 | 0 | 326 |
| lif | Limbu | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| nio | Nganasan | 0 | 10 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 18 |
| dlg | Dolgan | 0 | 25 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 30 |
| kjh | Khakas | 0 | 201 | 31 | 7 | 1 | 2 | 0 | 0 | 0 | 0 | 242 |
Part of speech
[edit]Total (all entries)
[edit]Number of words and senses
[edit]Rows in the table: 52
| Unique Strings | Total Word-Sense Pairs | POS | Short name | Templates | Max Senses | Entry |
|---|---|---|---|---|---|---|
| 4211 | 6983 | initialism | initialism | 71 | AAA 71, CCA 40, AC 26 | |
| 41212 | 41542 | hanzi | hanzi | 10 | 即 10, 场 7, 托 7 | |
| 5048 | 6807 | suffix | suffix | 34 | -at 34, -ат 34, -inho 15 | |
| 9306 | 9580 | hanja | hanja | 17 | 体 17, 仚 8, 望 8 | |
| 2854 | 4245 | conjunction | conjunction | 212 | πρίν 212, ἐπεί 144, ὥστε 53 | |
| 31 | 31 | interfix | interfix | 1 | -a- 1, -e- 1, -e- 1 | |
| 4224 | 5259 | letter | letter | 26 | ⠱ 26, ⠣ 24, ⠪ 24 | |
| 13 | 14 | expression | expression | 2 | 実は 2, aŭ 1, どういたしまして 1 | |
| 867585 | 1116909 | noun | noun | 130 | heaven 130, head 94, mark 78 | |
| 161 | 187 | affix | affix | 6 | たく 6, 牙 3, 梨 3 | |
| 5 | 5 | measure word | measure word | 1 | 遍 1, 杯 1, 盆 1 | |
| 60 | 63 | infix | infix | 2 | -h- 2, -ma- 2, -n- 2 | |
| 8 | 8 | lujvo | lujvo | 1 | banskepre 1, bavlamdei 1, prulamdei 1 | |
| 31 | 165 | kanji reading | kanji reading | 29 | てい 29, ち 20, ちゅう 9 | |
| 16 | 16 | circumfix | circumfix | 1 | უ- -ო 1, a- -ing 1, em- -en 1 | |
| 38 | 47 | adnominal | adnominal | 3 | その 3, とんだ 3, この 2 | |
| 1103 | 1243 | contraction | contraction | 6 | gotcha 6, i'w 4, thou'dst 4 | |
| 1335 | 2349 | syllable | syllable | 85 | 질 85, 가 44, 차 35 | |
| 249813 | 355376 | verb | verb, verb prefix, verb form | 251 | ἔχω 251, χράω 189, ἵστημι 152 | |
| 48532 | 95212 | participle | participle | 14 | iacens 14, iaciturus 14, productus 9 | |
| 1095 | 1389 | determiner | determiner | 30 | πλεῖστος 30, ἅπας 15, which 5 | |
| 1 | 1 | pinyin | pinyin | 1 | Bìxiù 1 | |
| 293 | 432 | root | root | 30 | स्था 30, बुध् 13, वच् 8 | |
| 108665 | 127533 | proper noun | proper noun | 34 | Neustadt 34, こうじ 33, たかし 32 | |
| 44 | 82 | predicative | predicative | 5 | громко 5, ясно 5, должен 4 | |
| 3978 | 4760 | symbol | symbol | 30 | A 30, C 9, z 9 | |
| 12535 | 13291 | kanji | kanji | 11 | 随 11, 精 11, 獲 8 | |
| 53330 | 63982 | adverb | adverb | 157 | ὅπως 157, πλήν 65, ἔπειτα 63 | |
| 11 | 31 | preverb | preverb | 6 | ჩა- 6, მი- 4, წა- 4 | |
| 466 | 572 | article | article | 15 | të 15, a 7, des 5 | |
| 2476 | 2684 | idiom | idiom | 4 | 不上不下 4, 生拉硬拽 4, 三更燈火五更雞 3 | |
| 6136 | 7295 | interjection | interjection | 10 | oh 10, а 7, no worries 7 | |
| 138 | 195 | counter | counter | 8 | か 8, 本 4, だい 4 | |
| 4140 | 4988 | abbreviation | abbreviation | 11 | D 11, T 11, lv 10 | |
| 125 | 144 | classifier | classifier | 5 | 道 5, ដៃ 3, 组 3 | |
| 1974 | 2094 | proverb | proverb | 4 | 새옹지마 4, 三十年河东,三十年河西 4, 三十年河東,三十年河西 4 | |
| 5310 | 5780 | phrase | phrase | 7 | rejse sig 7, aaniish naa 5, the fuck 4 | |
| 9296 | 9619 | numeral | ordinal numeral, numeral, ordinal number, cardinal numeral, number, cardinal number | 19 | शत 19, viens 13, vienu 6 | |
| 1343 | 1393 | gismu | gismu | 4 | bongu 4, fanva 3, fanza 2 | |
| 8341 | 11575 | pronoun | pronoun | 185 | ὅσος 185, ὅδε 126, ὅσπερ 37 | |
| 711 | 1005 | acronym | acronym | 24 | CED 24, CET 22, KOS 17 | |
| 980 | 1441 | particle | particle | 52 | ἕως 52, אין 18, lai 7 | |
| 275915 | 392355 | adjective | adjective, quasi-adjective, adjectival noun | 101 | ὕστερος 101, περισσός 63, ἱκανός 47 | |
| 437 | 598 | prepositional phrase | prepositional phrase | 7 | in hand 7, in front 5, at the high port 5 | |
| 7 | 15 | gerund | gerund | 4 | laborandum 4, definiendum 3, sufflaminandum 2 | |
| 50 | 51 | katakana character | katakana character | 2 | ッ 2, ヲ 1, ィ 1 | |
| 8 | 8 | correlative | correlative | 1 | nenial 1, neniel 1, nenies 1 | |
| 32986 | 50711 | han character | han character | 17 | 正 17, 㴴 13, 䌛 13 | |
| 4175 | 7387 | preposition | preposition | 196 | of 196, ὑπό 160, ל־ 38 | |
| 1 | 1 | hanja reading | hanja reading | 1 | 겸 1 | |
| 3943 | 5101 | prefix | prefix | 61 | πρό 61, μετά 9, meta- 9 | |
| 545 | 709 | postposition | postposition | 7 | vasten 7, için 6, को 6 |
Polysemy information
[edit]Rows in the table: 52
| POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
|---|---|---|---|---|---|
| initialism | 3081 | 1130 | 3902 | 1,66 | 3,45 |
| hanzi | 40978 | 234 | 564 | 1,01 | 2,41 |
| suffix | 4109 | 939 | 2698 | 1,35 | 2,87 |
| hanja | 9127 | 179 | 453 | 1,03 | 2,53 |
| conjunction | 2310 | 544 | 1935 | 1,49 | 3,56 |
| interfix | 31 | 0 | 0 | 1,0 | -1,0 |
| letter | 3975 | 249 | 1284 | 1,25 | 5,16 |
| expression | 12 | 1 | 2 | 1,08 | 2,0 |
| noun | 720849 | 146736 | 396060 | 1,29 | 2,7 |
| affix | 144 | 17 | 43 | 1,16 | 2,53 |
| measure word | 5 | 0 | 0 | 1,0 | -1,0 |
| infix | 57 | 3 | 6 | 1,05 | 2,0 |
| lujvo | 8 | 0 | 0 | 1,0 | -1,0 |
| kanji reading | 8 | 23 | 157 | 5,32 | 6,83 |
| circumfix | 16 | 0 | 0 | 1,0 | -1,0 |
| adnominal | 31 | 7 | 16 | 1,24 | 2,29 |
| contraction | 995 | 108 | 248 | 1,13 | 2,3 |
| syllable | 1256 | 79 | 1093 | 1,76 | 13,84 |
| verb | 195210 | 54603 | 160166 | 1,42 | 2,93 |
| participle | 26987 | 21545 | 68225 | 1,96 | 3,17 |
| determiner | 922 | 173 | 467 | 1,27 | 2,7 |
| pinyin | 1 | 0 | 0 | 1,0 | -1,0 |
| root | 236 | 57 | 196 | 1,47 | 3,44 |
| proper noun | 96913 | 11752 | 30620 | 1,17 | 2,61 |
| predicative | 22 | 22 | 60 | 1,86 | 2,73 |
| symbol | 3535 | 443 | 1225 | 1,2 | 2,77 |
| kanji | 12137 | 398 | 1154 | 1,06 | 2,9 |
| adverb | 46269 | 7061 | 17713 | 1,2 | 2,51 |
| preverb | 3 | 8 | 28 | 2,82 | 3,5 |
| article | 408 | 58 | 164 | 1,23 | 2,83 |
| idiom | 2295 | 181 | 389 | 1,08 | 2,15 |
| interjection | 5309 | 827 | 1986 | 1,19 | 2,4 |
| counter | 104 | 34 | 91 | 1,41 | 2,68 |
| abbreviation | 3685 | 455 | 1303 | 1,2 | 2,86 |
| classifier | 113 | 12 | 31 | 1,15 | 2,58 |
| proverb | 1869 | 105 | 225 | 1,06 | 2,14 |
| phrase | 4935 | 375 | 845 | 1,09 | 2,25 |
| numeral | 9072 | 224 | 547 | 1,03 | 2,44 |
| gismu | 1297 | 46 | 96 | 1,04 | 2,09 |
| pronoun | 6710 | 1631 | 4865 | 1,39 | 2,98 |
| acronym | 593 | 118 | 412 | 1,41 | 3,49 |
| particle | 763 | 217 | 678 | 1,47 | 3,12 |
| adjective | 213327 | 62588 | 179028 | 1,42 | 2,86 |
| prepositional phrase | 322 | 115 | 276 | 1,37 | 2,4 |
| gerund | 3 | 4 | 12 | 2,14 | 3,0 |
| katakana character | 49 | 1 | 2 | 1,02 | 2,0 |
| correlative | 8 | 0 | 0 | 1,0 | -1,0 |
| han character | 22155 | 10831 | 28556 | 1,54 | 2,64 |
| preposition | 3033 | 1142 | 4354 | 1,77 | 3,81 |
| hanja reading | 1 | 0 | 0 | 1,0 | -1,0 |
| prefix | 3256 | 687 | 1845 | 1,29 | 2,69 |
| postposition | 439 | 106 | 270 | 1,3 | 2,55 |
English entries
[edit]Number of words with unknown POS: 4006
Number of words and senses
[edit]Rows in the table: 31
| Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
|---|---|---|---|---|
| 3483 | 6199 | initialism | 71 | AAA 71, CCA 40, AC 26 |
| 22700 | 31590 | proper noun | 33 | Franklin 33, Mid-Atlantic 32, Greenville 25 |
| 665 | 978 | suffix | 18 | -able 18, -ist 18, -es 8 |
| 90 | 139 | symbol | 30 | A 30, b 5, d 3 |
| 14490 | 16763 | adverb | 57 | quite 57, peculiarly 30, up 14 |
| 195 | 371 | conjunction | 78 | and 78, as 11, but 7 |
| 4 | 4 | interfix | 1 | -i- 1, -k- 1, -n- 1 |
| 56 | 57 | letter | 2 | k 2, I 1, a 1 |
| 209509 | 279399 | noun | 130 | heaven 130, head 94, mark 78 |
| 16 | 29 | article | 8 | the 8, a 7, an 1 |
| 24 | 25 | idiom | 2 | old enough to vote 2, be absorbed by 1, bring sand to the beach 1 |
| 1704 | 2027 | interjection | 10 | oh 10, no worries 7, huh 6 |
| 1561 | 2230 | abbreviation | 11 | D 11, T 11, lv 10 |
| 5 | 5 | affix | 1 | -i- 1, -kin- 1, -o- 1 |
| 46 | 48 | infix | 2 | -h- 2, -ma- 2, -a- 1 |
| 470 | 512 | proverb | 3 | it's a long road that has no turning 3, there but for the grace of God go I 3, six of one, half a dozen of the other 2 |
| 1316 | 1482 | phrase | 5 | excuse me 5, the fuck 4, what's up 4 |
| 3 | 3 | circumfix | 1 | a- -ing 1, em- -en 1, en- -en 1 |
| 359 | 413 | numeral | 3 | billion 3, eighty-eight 3, novemdecillion 3 |
| 490 | 599 | contraction | 6 | gotcha 6, thou'dst 4, whatcha 4 |
| 363 | 476 | pronoun | 12 | him 12, me 9, myself 5 |
| 610 | 898 | acronym | 24 | CED 24, CET 22, KOS 17 |
| 69693 | 95334 | verb | 165 | go 165, run 121, take 116 |
| 84680 | 104199 | adjective | 66 | gay 66, free 48, deep 44 |
| 18 | 25 | particle | 4 | like 4, 's 2, no 2 |
| 395 | 551 | prepositional phrase | 7 | in hand 7, in front 5, at the high port 5 |
| 1 | 1 | participle | 1 | bawn 1 |
| 105 | 165 | determiner | 6 | some 6, all 6, which 5 |
| 426 | 1069 | preposition | 196 | of 196, in 40, over 30 |
| 1021 | 1348 | prefix | 9 | be- 9, meta- 9, paleo- 5 |
| 1 | 1 | postposition | 1 | aside 1 |
Polysemy information
[edit]Rows in the table: 31
| POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
|---|---|---|---|---|---|
| initialism | 2401 | 1082 | 3798 | 1,78 | 3,51 |
| proper noun | 17998 | 4702 | 13592 | 1,39 | 2,89 |
| suffix | 486 | 179 | 492 | 1,47 | 2,75 |
| symbol | 78 | 12 | 61 | 1,54 | 5,08 |
| adverb | 12902 | 1588 | 3861 | 1,16 | 2,43 |
| conjunction | 141 | 54 | 230 | 1,9 | 4,26 |
| interfix | 4 | 0 | 0 | 1,0 | -1,0 |
| letter | 55 | 1 | 2 | 1,02 | 2,0 |
| noun | 174902 | 34607 | 104497 | 1,33 | 3,02 |
| article | 14 | 2 | 15 | 1,81 | 7,5 |
| idiom | 23 | 1 | 2 | 1,04 | 2,0 |
| interjection | 1484 | 220 | 543 | 1,19 | 2,47 |
| abbreviation | 1233 | 328 | 997 | 1,43 | 3,04 |
| affix | 5 | 0 | 0 | 1,0 | -1,0 |
| infix | 44 | 2 | 4 | 1,04 | 2,0 |
| proverb | 430 | 40 | 82 | 1,09 | 2,05 |
| phrase | 1182 | 134 | 300 | 1,13 | 2,24 |
| circumfix | 3 | 0 | 0 | 1,0 | -1,0 |
| numeral | 318 | 41 | 95 | 1,15 | 2,32 |
| contraction | 411 | 79 | 188 | 1,22 | 2,38 |
| pronoun | 303 | 60 | 173 | 1,31 | 2,88 |
| acronym | 498 | 112 | 400 | 1,47 | 3,57 |
| verb | 59562 | 10131 | 35772 | 1,37 | 3,53 |
| adjective | 72841 | 11839 | 31358 | 1,23 | 2,65 |
| particle | 13 | 5 | 12 | 1,39 | 2,4 |
| prepositional phrase | 285 | 110 | 266 | 1,39 | 2,42 |
| participle | 1 | 0 | 0 | 1,0 | -1,0 |
| determiner | 73 | 32 | 92 | 1,57 | 2,88 |
| preposition | 317 | 109 | 752 | 2,51 | 6,9 |
| prefix | 818 | 203 | 530 | 1,32 | 2,61 |
| postposition | 1 | 0 | 0 | 1,0 | -1,0 |
Russian entries
[edit]Number of words with unknown POS: 84
Number of words and senses
[edit]Rows in the table: 27
| Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
|---|---|---|---|---|
| 13 | 14 | initialism | 2 | ВДНХ 2, ЛГБТ 1, НПО 1 |
| 1808 | 1987 | proper noun | 6 | Русь 6, Тура 6, Виктория 4 |
| 42 | 52 | suffix | 4 | -ок 4, -ев 3, -ик 2 |
| 41 | 77 | predicative | 5 | громко 5, ясно 5, должен 4 |
| 2 | 2 | symbol | 1 | × 1, @ 1 |
| 1094 | 1469 | adverb | 9 | в целом 9, неясно 7, так 7 |
| 63 | 82 | conjunction | 3 | и 3, как 3, или 3 |
| 46 | 53 | letter | 3 | ь 3, у 2, а 2 |
| 12890 | 18243 | noun | 13 | путаница 13, ход 12, полнота 11 |
| 36 | 39 | idiom | 2 | лёгок на помине 2, на славу 2, не в своей тарелке 2 |
| 246 | 307 | interjection | 5 | ведь 5, есть 4, на здоровье 4 |
| 1 | 1 | affix | 1 | -ун- 1 |
| 205 | 249 | abbreviation | 10 | Л 10, РКК 5, ср. 4 |
| 179 | 191 | phrase | 3 | без ума 3, что это такое 3, а то 2 |
| 81 | 83 | proverb | 2 | с глаз долой — из сердца вон 2, куй железо, пока горячо 2, неча на зеркало пенять, коли рожа крива 1 |
| 73 | 80 | numeral | 7 | один 7, сорока 2, биллион 1 |
| 1 | 1 | syllable | 1 | 대 1 |
| 183 | 261 | pronoun | 4 | этот 4, своей 4, той 3 |
| 2839 | 4250 | adjective | 13 | успокаивающий 13, раздражающий 12, предшествующий 11 |
| 33 | 49 | particle | 5 | же 5, не 5, вряд ли 2 |
| 3048 | 6160 | verb | 13 | лезть 13, развести 11, проводить 10 |
| 20 | 20 | acronym | 1 | СПИД 1, Би-би-си 1, ЦРУ 1 |
| 11 | 14 | prepositional phrase | 2 | по меньшей мере 2, в ссоре 2, с первого взгляда 2 |
| 67 | 122 | participle | 7 | отклоняющийся 7, отходя 6, засевшей 4 |
| 17 | 23 | determiner | 3 | весь 3, всякий 3, иной 2 |
| 112 | 186 | preposition | 18 | по 18, за 6, через 6 |
| 76 | 104 | prefix | 5 | за- 5, раз- 5, под- 3 |
Polysemy information
[edit]Rows in the table: 27
| POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
|---|---|---|---|---|---|
| initialism | 12 | 1 | 2 | 1,08 | 2,0 |
| proper noun | 1658 | 150 | 329 | 1,1 | 2,19 |
| suffix | 36 | 6 | 16 | 1,24 | 2,67 |
| predicative | 21 | 20 | 56 | 1,88 | 2,8 |
| symbol | 2 | 0 | 0 | 1,0 | -1,0 |
| adverb | 835 | 259 | 634 | 1,34 | 2,45 |
| conjunction | 48 | 15 | 34 | 1,3 | 2,27 |
| letter | 40 | 6 | 13 | 1,15 | 2,17 |
| noun | 9664 | 3226 | 8579 | 1,42 | 2,66 |
| idiom | 33 | 3 | 6 | 1,08 | 2,0 |
| interjection | 203 | 43 | 104 | 1,25 | 2,42 |
| affix | 1 | 0 | 0 | 1,0 | -1,0 |
| abbreviation | 181 | 24 | 68 | 1,21 | 2,83 |
| phrase | 169 | 10 | 22 | 1,07 | 2,2 |
| proverb | 79 | 2 | 4 | 1,02 | 2,0 |
| numeral | 71 | 2 | 9 | 1,1 | 4,5 |
| syllable | 1 | 0 | 0 | 1,0 | -1,0 |
| pronoun | 121 | 62 | 140 | 1,43 | 2,26 |
| adjective | 2090 | 749 | 2160 | 1,5 | 2,88 |
| particle | 25 | 8 | 24 | 1,48 | 3,0 |
| verb | 1510 | 1538 | 4650 | 2,02 | 3,02 |
| acronym | 20 | 0 | 0 | 1,0 | -1,0 |
| prepositional phrase | 8 | 3 | 6 | 1,27 | 2,0 |
| participle | 38 | 29 | 84 | 1,82 | 2,9 |
| determiner | 13 | 4 | 10 | 1,35 | 2,5 |
| preposition | 82 | 30 | 104 | 1,66 | 3,47 |
| prefix | 58 | 18 | 46 | 1,37 | 2,56 |
Finnish entries
[edit]Number of words with unknown POS: 20
Number of words and senses
[edit]Rows in the table: 23
| Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
|---|---|---|---|---|
| 52 | 55 | initialism | 2 | BKT 2, DI 2, NKT 2 |
| 2272 | 2634 | proper noun | 4 | Jumalan Karitsa 4, Aura 4, Lappi 4 |
| 155 | 295 | suffix | 27 | -nne 27, -si 9, -nsä 9 |
| 48 | 52 | proverb | 2 | parempi katsoa kuin katua 2, poissa silmistä, poissa mielestä 2, yhteistyö on voimaa 2 |
| 348 | 380 | phrase | 4 | närhen munat 4, ottaa lämpöä 4, sen paremmin 4 |
| 202 | 207 | numeral | 3 | seiska 3, yhdes 2, kybä 2 |
| 3 | 3 | symbol | 1 | Gt 1, Mt 1, kt 1 |
| 2445 | 2789 | adverb | 8 | hajalle 8, hajallaan 6, niin 6 |
| 56 | 69 | conjunction | 5 | kun 5, niin kuin 3, sekä 2 |
| 37 | 37 | contraction | 1 | son 1, ehkei 1, ehken 1 |
| 34 | 34 | letter | 1 | I 1, C 1, X 1 |
| 148 | 230 | pronoun | 10 | niiden 10, kuka 4, itse 4 |
| 4 | 4 | acronym | 1 | Kela 1, SKY 1, STT 1 |
| 11 | 39 | particle | 6 | -hän 6, -kaan 5, -kö 4 |
| 11085 | 13865 | verb | 17 | avata 17, purkaa 16, kiertää 16 |
| 6637 | 7916 | adjective | 15 | sopiva 15, heikko 11, tavallinen 8 |
| 41603 | 48241 | noun | 13 | kuori 13, juttu 12, varsi 11 |
| 14 | 14 | idiom | 1 | ennen aikojaan 1, koko nimi 1, saada vihiä 1 |
| 16 | 19 | preposition | 2 | kiinni 2, kautta 2, kuin mitä 2 |
| 253 | 274 | interjection | 4 | älä 4, hei 3, terve 2 |
| 177 | 199 | prefix | 4 | yli- 4, avo- 3, vasta- 3 |
| 114 | 157 | postposition | 7 | vasten 7, vastaan 4, asti 4 |
| 75 | 86 | abbreviation | 3 | jkn 3, kk 3, huom 2 |
Polysemy information
[edit]Rows in the table: 23
| POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
|---|---|---|---|---|---|
| initialism | 49 | 3 | 6 | 1,06 | 2,0 |
| proper noun | 1937 | 335 | 697 | 1,16 | 2,08 |
| suffix | 114 | 41 | 181 | 1,9 | 4,41 |
| proverb | 44 | 4 | 8 | 1,08 | 2,0 |
| phrase | 322 | 26 | 58 | 1,09 | 2,23 |
| numeral | 198 | 4 | 9 | 1,02 | 2,25 |
| symbol | 3 | 0 | 0 | 1,0 | -1,0 |
| adverb | 2185 | 260 | 604 | 1,14 | 2,32 |
| conjunction | 48 | 8 | 21 | 1,23 | 2,62 |
| contraction | 37 | 0 | 0 | 1,0 | -1,0 |
| letter | 34 | 0 | 0 | 1,0 | -1,0 |
| pronoun | 94 | 54 | 136 | 1,55 | 2,52 |
| acronym | 4 | 0 | 0 | 1,0 | -1,0 |
| particle | 1 | 10 | 38 | 3,55 | 3,8 |
| verb | 9455 | 1630 | 4410 | 1,25 | 2,71 |
| adjective | 5741 | 896 | 2175 | 1,19 | 2,43 |
| noun | 36961 | 4642 | 11280 | 1,16 | 2,43 |
| idiom | 14 | 0 | 0 | 1,0 | -1,0 |
| preposition | 13 | 3 | 6 | 1,19 | 2,0 |
| interjection | 236 | 17 | 38 | 1,08 | 2,24 |
| prefix | 161 | 16 | 38 | 1,12 | 2,38 |
| postposition | 92 | 22 | 65 | 1,38 | 2,95 |
| abbreviation | 66 | 9 | 20 | 1,15 | 2,22 |
Ukrainian entries
[edit]Number of words with unknown POS: 1
Number of words and senses
[edit]Rows in the table: 19
| Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
|---|---|---|---|---|
| 1 | 1 | initialism | 1 | НКАУ 1 |
| 228 | 240 | proper noun | 3 | Русь 3, Борис 2, Брест 2 |
| 8 | 10 | phrase | 3 | будь ласка 3, у мене є питання 1, кожному своє 1 |
| 36 | 36 | numeral | 1 | один 1, вісім 1, вісімдесят 1 |
| 1 | 2 | symbol | 2 | ’ 2 |
| 35 | 54 | adverb | 11 | навмисно 11, дальше 4, зараз 3 |
| 10 | 10 | conjunction | 1 | а 1, і 1, й 1 |
| 34 | 51 | pronoun | 9 | себе 9, ніхто 2, своє 2 |
| 10 | 10 | letter | 1 | а 1, й 1, є 1 |
| 1 | 1 | acronym | 1 | ЗМІ 1 |
| 146 | 189 | adjective | 4 | гарний 4, минулий 4, м'який 4 |
| 4 | 5 | particle | 2 | чи 2, і 1, не 1 |
| 87 | 128 | verb | 6 | припадати 6, іти 4, бувати 4 |
| 1787 | 2327 | noun | 8 | міць 8, дід 6, керівник 6 |
| 13 | 14 | determiner | 2 | чий 2, четверо 1, ваш 1 |
| 5 | 6 | preposition | 2 | серед 2, під 1, для 1 |
| 1 | 1 | prefix | 1 | авіа- 1 |
| 16 | 18 | interjection | 3 | гей 3, а 1, ах 1 |
| 1 | 1 | abbreviation | 1 | КПРС 1 |
Polysemy information
[edit]Rows in the table: 19
| POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
|---|---|---|---|---|---|
| initialism | 1 | 0 | 0 | 1,0 | -1,0 |
| proper noun | 217 | 11 | 23 | 1,05 | 2,09 |
| phrase | 7 | 1 | 3 | 1,25 | 3,0 |
| numeral | 36 | 0 | 0 | 1,0 | -1,0 |
| symbol | 0 | 1 | 2 | 2,0 | 2,0 |
| adverb | 28 | 7 | 26 | 1,54 | 3,71 |
| conjunction | 10 | 0 | 0 | 1,0 | -1,0 |
| pronoun | 24 | 10 | 27 | 1,5 | 2,7 |
| letter | 10 | 0 | 0 | 1,0 | -1,0 |
| acronym | 1 | 0 | 0 | 1,0 | -1,0 |
| adjective | 117 | 29 | 72 | 1,29 | 2,48 |
| particle | 3 | 1 | 2 | 1,25 | 2,0 |
| verb | 64 | 23 | 64 | 1,47 | 2,78 |
| noun | 1451 | 336 | 876 | 1,3 | 2,61 |
| determiner | 12 | 1 | 2 | 1,08 | 2,0 |
| preposition | 4 | 1 | 2 | 1,2 | 2,0 |
| prefix | 1 | 0 | 0 | 1,0 | -1,0 |
| interjection | 15 | 1 | 3 | 1,12 | 3,0 |
| abbreviation | 1 | 0 | 0 | 1,0 | -1,0 |
French entries
[edit]Number of words with unknown POS: 336
Number of words and senses
[edit]Rows in the table: 27
| Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
|---|---|---|---|---|
| 116 | 123 | initialism | 3 | PQ 3, IA 2, CPM 2 |
| 2381 | 2736 | proper noun | 5 | Atlas 5, Mérida 4, Katanga 4 |
| 172 | 208 | suffix | 6 | -is 6, -ant 3, -aud 3 |
| 8 | 8 | symbol | 1 | a 1, Go 1, Mo 1 |
| 2819 | 3201 | adverb | 8 | grossièrement 8, singulièrement 6, carrément 5 |
| 72 | 86 | conjunction | 5 | que 5, comme 3, afin 2 |
| 21 | 21 | letter | 1 | a 1, x 1, S 1 |
| 26101 | 33173 | noun | 18 | tampon 18, verre 14, masse 11 |
| 10 | 20 | article | 5 | des 5, de la 2, les 2 |
| 7 | 9 | idiom | 2 | et ainsi de suite 2, coude à coude 2, il n'y a pas mort d'homme 1 |
| 231 | 257 | interjection | 3 | beuh 3, tiens 3, ouais 3 |
| 1 | 1 | affix | 1 | -un- 1 |
| 94 | 101 | abbreviation | 5 | AP 5, cie 2, g 2 |
| 2 | 2 | infix | 1 | -iss- 1, t 1 |
| 57 | 60 | proverb | 2 | à la guerre comme à la guerre 2, l'habit ne fait pas le moine 2, les chiens aboient, la caravane passe 2 |
| 217 | 250 | phrase | 4 | dans la foulée 4, huis clos 4, avant la lettre 2 |
| 49 | 49 | numeral | 1 | billion 1, trillion 1, deux 1 |
| 64 | 67 | contraction | 2 | du 2, c'est 2, t'es 2 |
| 132 | 191 | pronoun | 7 | se 7, vous 7, y 3 |
| 6058 | 9802 | verb | 31 | passer 31, pogner 14, relever 12 |
| 9943 | 11579 | adjective | 8 | grossier 8, chaleureux 8, fini 7 |
| 11 | 15 | particle | 4 | ne 4, est-ce que 2, genre 1 |
| 1 | 1 | acronym | 1 | PEB 1 |
| 7 | 9 | prepositional phrase | 2 | en tête 2, à gauche 2, de guingois 1 |
| 19 | 19 | determiner | 1 | son 1, ton 1, ma 1 |
| 109 | 190 | preposition | 14 | à 14, sur 5, dans 4 |
| 157 | 164 | prefix | 2 | beau- 2, dys- 2, photo- 2 |
Polysemy information
[edit]Rows in the table: 27
| POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
|---|---|---|---|---|---|
| initialism | 110 | 6 | 13 | 1,06 | 2,17 |
| proper noun | 2066 | 315 | 670 | 1,15 | 2,13 |
| suffix | 148 | 24 | 60 | 1,21 | 2,5 |
| symbol | 8 | 0 | 0 | 1,0 | -1,0 |
| adverb | 2515 | 304 | 686 | 1,14 | 2,26 |
| conjunction | 63 | 9 | 23 | 1,19 | 2,56 |
| letter | 21 | 0 | 0 | 1,0 | -1,0 |
| noun | 21353 | 4748 | 11820 | 1,27 | 2,49 |
| article | 5 | 5 | 15 | 2,0 | 3,0 |
| idiom | 5 | 2 | 4 | 1,29 | 2,0 |
| interjection | 208 | 23 | 49 | 1,11 | 2,13 |
| affix | 1 | 0 | 0 | 1,0 | -1,0 |
| abbreviation | 90 | 4 | 11 | 1,07 | 2,75 |
| infix | 2 | 0 | 0 | 1,0 | -1,0 |
| proverb | 54 | 3 | 6 | 1,05 | 2,0 |
| phrase | 189 | 28 | 61 | 1,15 | 2,18 |
| numeral | 49 | 0 | 0 | 1,0 | -1,0 |
| contraction | 61 | 3 | 6 | 1,05 | 2,0 |
| pronoun | 89 | 43 | 102 | 1,45 | 2,37 |
| verb | 3935 | 2123 | 5867 | 1,62 | 2,76 |
| adjective | 8639 | 1304 | 2940 | 1,16 | 2,25 |
| particle | 9 | 2 | 6 | 1,36 | 3,0 |
| acronym | 1 | 0 | 0 | 1,0 | -1,0 |
| prepositional phrase | 5 | 2 | 4 | 1,29 | 2,0 |
| determiner | 19 | 0 | 0 | 1,0 | -1,0 |
| preposition | 78 | 31 | 112 | 1,74 | 3,61 |
| prefix | 150 | 7 | 14 | 1,04 | 2,0 |
German entries
[edit]Number of words with unknown POS: 48
Number of words and senses
[edit]Rows in the table: 29
| Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
|---|---|---|---|---|
| 82 | 88 | initialism | 3 | BVG 3, VfB 3, PM 2 |
| 2654 | 2970 | proper noun | 34 | Neustadt 34, Atlas 5, Bosch 4 |
| 87 | 110 | suffix | 6 | -e 6, -en 5, -ig 3 |
| 9 | 10 | symbol | 2 | – 2, J 1, ☉ 1 |
| 984 | 1253 | adverb | 6 | momentan 6, gar nicht 5, rechts 5 |
| 65 | 81 | conjunction | 4 | als 4, bis 2, infolgedessen 2 |
| 4 | 4 | interfix | 1 | -en- 1, -es- 1, -n- 1 |
| 29 | 29 | letter | 1 | I 1, a 1, C 1 |
| 31690 | 36279 | noun | 24 | H. 24, Minderung 15, c. 15 |
| 28 | 37 | article | 2 | 'n 2, den 2, n 2 |
| 27 | 27 | idiom | 1 | arm wie eine Kirchenmaus 1, Kopf und Kragen 1, Reiz und Reaktion 1 |
| 140 | 155 | interjection | 5 | ach 5, bitte 3, servus 3 |
| 206 | 224 | abbreviation | 3 | OP 3, Fr. 3, KG 3 |
| 151 | 162 | phrase | 3 | Holzweg 3, Das Reich 2, Kraut und Rüben 2 |
| 35 | 35 | proverb | 1 | Arbeit macht frei 1, Blut ist dicker als Wasser 1, Geduld ist eine Tugend 1 |
| 1 | 1 | circumfix | 1 | ge- -t 1 |
| 155 | 161 | numeral | 4 | einer 4, einem 2, einen 2 |
| 22 | 22 | contraction | 1 | am 1, ans 1, aufs 1 |
| 147 | 263 | pronoun | 26 | anderen 26, deiner 5, meiner 5 |
| 6091 | 9961 | adjective | 27 | gelsten 27, hässlichsten 27, langsamsten 27 |
| 4276 | 6387 | verb | 22 | abkommen 22, brennen 12, lösen 9 |
| 6 | 7 | particle | 2 | oder 2, zu 1, dritt 1 |
| 8 | 10 | acronym | 2 | BND 2, DAU 2, CDU 1 |
| 1 | 1 | prepositional phrase | 1 | von der Stange 1 |
| 3 | 5 | participle | 3 | verzaubert 3, gewesen 1 |
| 11 | 14 | determiner | 2 | sein 2, ihr 2, welche 2 |
| 91 | 146 | preposition | 8 | bei 8, nach 5, durch 5 |
| 97 | 123 | prefix | 4 | ab- 4, um- 4, nach- 4 |
| 7 | 8 | postposition | 2 | durch 2, nach 1, entlang 1 |
Polysemy information
[edit]Rows in the table: 29
| POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
|---|---|---|---|---|---|
| initialism | 78 | 4 | 10 | 1,07 | 2,5 |
| proper noun | 2398 | 256 | 572 | 1,12 | 2,23 |
| suffix | 74 | 13 | 36 | 1,26 | 2,77 |
| symbol | 8 | 1 | 2 | 1,11 | 2,0 |
| adverb | 784 | 200 | 469 | 1,27 | 2,35 |
| conjunction | 51 | 14 | 30 | 1,25 | 2,14 |
| interfix | 4 | 0 | 0 | 1,0 | -1,0 |
| letter | 29 | 0 | 0 | 1,0 | -1,0 |
| noun | 28484 | 3206 | 7795 | 1,14 | 2,43 |
| article | 19 | 9 | 18 | 1,32 | 2,0 |
| idiom | 27 | 0 | 0 | 1,0 | -1,0 |
| interjection | 130 | 10 | 25 | 1,11 | 2,5 |
| abbreviation | 191 | 15 | 33 | 1,09 | 2,2 |
| phrase | 141 | 10 | 21 | 1,07 | 2,1 |
| proverb | 35 | 0 | 0 | 1,0 | -1,0 |
| circumfix | 1 | 0 | 0 | 1,0 | -1,0 |
| numeral | 151 | 4 | 10 | 1,04 | 2,5 |
| contraction | 22 | 0 | 0 | 1,0 | -1,0 |
| pronoun | 92 | 55 | 171 | 1,79 | 3,11 |
| adjective | 4836 | 1255 | 5125 | 1,64 | 4,08 |
| verb | 3007 | 1269 | 3380 | 1,49 | 2,66 |
| particle | 5 | 1 | 2 | 1,17 | 2,0 |
| acronym | 6 | 2 | 4 | 1,25 | 2,0 |
| prepositional phrase | 1 | 0 | 0 | 1,0 | -1,0 |
| participle | 2 | 1 | 3 | 1,67 | 3,0 |
| determiner | 8 | 3 | 6 | 1,27 | 2,0 |
| preposition | 67 | 24 | 79 | 1,6 | 3,29 |
| prefix | 80 | 17 | 43 | 1,27 | 2,53 |
| postposition | 6 | 1 | 2 | 1,14 | 2,0 |
Serbian entries
[edit]Tatar entries
[edit]Number of words with unknown POS: null
Number of words and senses
[edit]Rows in the table: 11
| Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
|---|---|---|---|---|
| 1 | 1 | letter | 1 | ү 1 |
| 1 | 1 | pronoun | 1 | sin 1 |
| 5 | 5 | verb | 1 | абын 1, сабир 1, сабыр 1 |
| 58 | 59 | adjective | 2 | авыр 2, яшь 1, фәрештә 1 |
| 55 | 56 | proper noun | 2 | Neptun 2, Austria 1, Bangladesh 1 |
| 366 | 392 | noun | 3 | çirek 3, дәвер 3, заман 3 |
| 2 | 3 | suffix | 2 | -ле 2, -лы 1 |
| 60 | 61 | numeral | 2 | nol 2, өч йөз 1, öç yöz 1 |
| 3 | 5 | interjection | 3 | әйдә 3, әйе 1 |
| 11 | 13 | adverb | 2 | зерә 2, әрмәнчә 2, әкрен 1 |
| 2 | 2 | conjunction | 1 | белән 1, билан 1 |
Polysemy information
[edit]Rows in the table: 11
| POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
|---|---|---|---|---|---|
| letter | 1 | 0 | 0 | 1,0 | -1,0 |
| pronoun | 1 | 0 | 0 | 1,0 | -1,0 |
| verb | 5 | 0 | 0 | 1,0 | -1,0 |
| adjective | 57 | 1 | 2 | 1,02 | 2,0 |
| proper noun | 54 | 1 | 2 | 1,02 | 2,0 |
| noun | 343 | 23 | 49 | 1,07 | 2,13 |
| suffix | 1 | 1 | 2 | 1,5 | 2,0 |
| numeral | 59 | 1 | 2 | 1,02 | 2,0 |
| interjection | 2 | 1 | 3 | 1,67 | 3,0 |
| adverb | 9 | 2 | 4 | 1,18 | 2,0 |
| conjunction | 2 | 0 | 0 | 1,0 | -1,0 |
Esperanto entries
[edit]Number of words with unknown POS: 118
Number of words and senses
[edit]Rows in the table: 21
| Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
|---|---|---|---|---|
| 4 | 4 | initialism | 1 | UAE 1, MEP 1, IFA 1 |
| 838 | 869 | proper noun | 3 | Rioĥo 3, Venuso 2, Britio 2 |
| 105 | 132 | suffix | 4 | -iĝi 4, -ad- 3, -ant- 3 |
| 49 | 50 | phrase | 2 | mi petas 2, mi malsatas 1, mi soifas 1 |
| 114 | 114 | numeral | 1 | dek 1, tricent 1, cent 1 |
| 749 | 817 | adverb | 3 | nepre 3, tiel 3, okaze 3 |
| 23 | 24 | conjunction | 2 | ĉar 2, plus 1, kaj 1 |
| 67 | 67 | letter | 1 | i 1, I 1, a 1 |
| 62 | 71 | pronoun | 3 | kiun 3, vi 2, ĉiuj 2 |
| 11 | 16 | particle | 3 | ne 3, ĉi 3, ĉu 2 |
| 1821 | 1966 | adjective | 4 | morta 4, feliĉa 4, sama 3 |
| 1760 | 1973 | verb | 4 | pagi 4, valori 4, zorgi 4 |
| 3 | 3 | expression | 1 | aŭ 1, ĉu 1, ĉu 1 |
| 7097 | 7603 | noun | 4 | amaso 4, baskulo 4, batilo 4 |
| 8 | 8 | correlative | 1 | nenial 1, neniel 1, nenies 1 |
| 2 | 2 | article | 1 | la 1, l' 1 |
| 39 | 43 | determiner | 3 | kelkaj 3, ĉia 3, ties 1 |
| 51 | 75 | preposition | 4 | kun 4, dum 4, antaŭ 3 |
| 43 | 45 | interjection | 2 | he 2, gesinjoroj 2, ek 1 |
| 23 | 29 | prefix | 3 | ge- 3, pra- 3, eks- 2 |
| 6 | 6 | abbreviation | 1 | 2-a 1, V 1, n.b. 1 |
Polysemy information
[edit]Rows in the table: 21
| POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
|---|---|---|---|---|---|
| initialism | 4 | 0 | 0 | 1,0 | -1,0 |
| proper noun | 808 | 30 | 61 | 1,04 | 2,03 |
| suffix | 86 | 19 | 46 | 1,26 | 2,42 |
| phrase | 48 | 1 | 2 | 1,02 | 2,0 |
| numeral | 114 | 0 | 0 | 1,0 | -1,0 |
| adverb | 687 | 62 | 130 | 1,09 | 2,1 |
| conjunction | 22 | 1 | 2 | 1,04 | 2,0 |
| letter | 67 | 0 | 0 | 1,0 | -1,0 |
| pronoun | 54 | 8 | 17 | 1,15 | 2,12 |
| particle | 8 | 3 | 8 | 1,45 | 2,67 |
| adjective | 1696 | 125 | 270 | 1,08 | 2,16 |
| verb | 1571 | 189 | 402 | 1,12 | 2,13 |
| expression | 3 | 0 | 0 | 1,0 | -1,0 |
| noun | 6643 | 454 | 960 | 1,07 | 2,11 |
| correlative | 8 | 0 | 0 | 1,0 | -1,0 |
| article | 2 | 0 | 0 | 1,0 | -1,0 |
| determiner | 37 | 2 | 6 | 1,1 | 3,0 |
| preposition | 34 | 17 | 41 | 1,47 | 2,41 |
| interjection | 41 | 2 | 4 | 1,05 | 2,0 |
| prefix | 19 | 4 | 10 | 1,26 | 2,5 |
| abbreviation | 6 | 0 | 0 | 1,0 | -1,0 |
Latin entries
[edit]Number of words with unknown POS: 38
Number of words and senses
[edit]Rows in the table: 27
| Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
|---|---|---|---|---|
| 7 | 9 | initialism | 2 | RIP 2, IHS 2, SPD 1 |
| 899 | 1010 | proper noun | 5 | peniculus 5, Smalcius 4, Uranus 3 |
| 89 | 106 | suffix | 3 | -ve 3, -icus 3, -brum 3 |
| 5 | 5 | symbol | 1 | HS 1, Ↄ 1, Ⅎ 1 |
| 1117 | 1682 | adverb | 12 | etiam 12, ultro 7, continenter 5 |
| 64 | 140 | conjunction | 29 | an 29, nam 5, autem 4 |
| 1 | 1 | interfix | 1 | -o- 1 |
| 17 | 24 | letter | 5 | ꝑ 5, C 1, Q 1 |
| 9143 | 14845 | noun | 16 | manus 16, terra 9, caput 9 |
| 6 | 6 | idiom | 1 | a lacte cunisque 1, ab acia et acu 1, semel pro semper 1 |
| 51 | 57 | interjection | 3 | en 3, o 2, hui 2 |
| 40 | 45 | abbreviation | 2 | R 2, S. 2, p 2 |
| 1 | 2 | infix | 2 | -n- 2 |
| 30 | 31 | proverb | 2 | obsta principiis 2, albus an ater sit 1, asinus in tegulis 1 |
| 154 | 181 | phrase | 3 | Socratici viri 3, gratia gratiam parit 3, in medio 3 |
| 103 | 111 | numeral | 3 | secundum 3, sexagesima 3, sexagesimum 3 |
| 8 | 8 | contraction | 1 | vin 1, viden 1, eapse 1 |
| 107 | 141 | pronoun | 3 | sui 3, ea 3, memet 3 |
| 5417 | 11569 | verb | 30 | agar 30, iacuerint 28, iacueris 28 |
| 7264 | 10043 | adjective | 10 | raptus 10, residuus 8, socius 6 |
| 3 | 5 | particle | 2 | in- 2, -ne 2, non 1 |
| 2 | 2 | prepositional phrase | 1 | a posteriori 1, sub clave 1 |
| 7 | 15 | gerund | 4 | laborandum 4, definiendum 3, sufflaminandum 2 |
| 7684 | 11018 | participle | 14 | iacens 14, iaciturus 14, productus 9 |
| 8 | 12 | determiner | 3 | ille 3, ambo 2, idem 1 |
| 60 | 120 | preposition | 8 | pro 8, erga 7, iuxta 4 |
| 28 | 38 | prefix | 6 | a- 6, se- 2, co- 2 |
Polysemy information
[edit]Rows in the table: 27
| POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
|---|---|---|---|---|---|
| initialism | 5 | 2 | 4 | 1,29 | 2,0 |
| proper noun | 808 | 91 | 202 | 1,12 | 2,22 |
| suffix | 77 | 12 | 29 | 1,19 | 2,42 |
| symbol | 5 | 0 | 0 | 1,0 | -1,0 |
| adverb | 725 | 392 | 957 | 1,51 | 2,44 |
| conjunction | 34 | 30 | 106 | 2,19 | 3,53 |
| interfix | 1 | 0 | 0 | 1,0 | -1,0 |
| letter | 15 | 2 | 9 | 1,41 | 4,5 |
| noun | 5641 | 3502 | 9204 | 1,62 | 2,63 |
| idiom | 6 | 0 | 0 | 1,0 | -1,0 |
| interjection | 46 | 5 | 11 | 1,12 | 2,2 |
| abbreviation | 35 | 5 | 10 | 1,12 | 2,0 |
| infix | 0 | 1 | 2 | 2,0 | 2,0 |
| proverb | 29 | 1 | 2 | 1,03 | 2,0 |
| phrase | 133 | 21 | 48 | 1,18 | 2,29 |
| numeral | 99 | 4 | 12 | 1,08 | 3,0 |
| contraction | 8 | 0 | 0 | 1,0 | -1,0 |
| pronoun | 77 | 30 | 64 | 1,32 | 2,13 |
| verb | 2453 | 2964 | 9116 | 2,14 | 3,08 |
| adjective | 5359 | 1905 | 4684 | 1,38 | 2,46 |
| particle | 1 | 2 | 4 | 1,67 | 2,0 |
| prepositional phrase | 2 | 0 | 0 | 1,0 | -1,0 |
| gerund | 3 | 4 | 12 | 2,14 | 3,0 |
| participle | 5698 | 1986 | 5320 | 1,43 | 2,68 |
| determiner | 5 | 3 | 7 | 1,5 | 2,33 |
| preposition | 35 | 25 | 85 | 2,0 | 3,4 |
| prefix | 23 | 5 | 15 | 1,36 | 3,0 |
Italian entries
[edit]Number of words with unknown POS: 8
Number of words and senses
[edit]Rows in the table: 23
| Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
|---|---|---|---|---|
| 41 | 47 | initialism | 3 | PG 3, GdF 2, P.G. 2 |
| 3762 | 4231 | proper noun | 4 | Virgilio 4, Annunziata 3, Bodoni 3 |
| 321 | 384 | suffix | 5 | -ata 5, -ite 5, -i 4 |
| 30 | 35 | proverb | 3 | non piangere sul latte versato 3, più siamo meglio è 3, meglio un uovo oggi che una gallina domani 1 |
| 81 | 87 | phrase | 2 | che figata 2, detto, fatto 2, non c'è che dire 2 |
| 2 | 2 | numeral | 1 | uno 1, otto 1 |
| 3 | 3 | symbol | 1 | × 1, Z 1, W 1 |
| 3918 | 4844 | adverb | 10 | immediatamente 10, subito 8, di basso rango 5 |
| 128 | 159 | conjunction | 3 | se 3, che 3, se non 3 |
| 62 | 64 | contraction | 2 | dal 2, dai 2, della 1 |
| 158 | 220 | pronoun | 4 | ci 4, qualcuno 4, il quale 4 |
| 27 | 27 | letter | 1 | I 1, C 1, X 1 |
| 30185 | 36855 | verb | 12 | abbattere 12, sbattere 9, caricare 9 |
| 22194 | 26376 | adjective | 10 | minore 10, brado 8, grossolano 8 |
| 7 | 7 | prepositional phrase | 1 | a mano armata 1, a rischio 1, in ballo 1 |
| 50216 | 63158 | noun | 12 | titolo 12, tiro 9, fascio 9 |
| 11 | 11 | article | 1 | i 1, le 1, lo 1 |
| 6 | 6 | idiom | 1 | andare coi piedi di piombo 1, la quiete prima della tempesta 1, campa cavallo 1 |
| 309 | 375 | preposition | 9 | di 9, per 6, su 5 |
| 193 | 251 | interjection | 4 | buon giorno 4, accidenti 3, d'accordo 3 |
| 417 | 458 | prefix | 4 | filo- 4, acro- 3, ana- 3 |
| 68 | 72 | abbreviation | 2 | AC 2, PS 2, S. 2 |
| 1 | 1 | affix | 1 | -un- 1 |
Polysemy information
[edit]Rows in the table: 23
| POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
|---|---|---|---|---|---|
| initialism | 36 | 5 | 11 | 1,15 | 2,2 |
| proper noun | 3323 | 439 | 908 | 1,12 | 2,07 |
| suffix | 273 | 48 | 111 | 1,2 | 2,31 |
| proverb | 27 | 3 | 8 | 1,17 | 2,67 |
| phrase | 75 | 6 | 12 | 1,07 | 2,0 |
| numeral | 2 | 0 | 0 | 1,0 | -1,0 |
| symbol | 3 | 0 | 0 | 1,0 | -1,0 |
| adverb | 3186 | 732 | 1658 | 1,24 | 2,27 |
| conjunction | 100 | 28 | 59 | 1,24 | 2,11 |
| contraction | 60 | 2 | 4 | 1,03 | 2,0 |
| pronoun | 110 | 48 | 110 | 1,39 | 2,29 |
| letter | 27 | 0 | 0 | 1,0 | -1,0 |
| verb | 25710 | 4475 | 11145 | 1,22 | 2,49 |
| adjective | 18944 | 3250 | 7432 | 1,19 | 2,29 |
| prepositional phrase | 7 | 0 | 0 | 1,0 | -1,0 |
| noun | 40782 | 9434 | 22376 | 1,26 | 2,37 |
| article | 11 | 0 | 0 | 1,0 | -1,0 |
| idiom | 6 | 0 | 0 | 1,0 | -1,0 |
| preposition | 276 | 33 | 99 | 1,21 | 3,0 |
| interjection | 148 | 45 | 103 | 1,3 | 2,29 |
| prefix | 384 | 33 | 74 | 1,1 | 2,24 |
| abbreviation | 64 | 4 | 8 | 1,06 | 2,0 |
| affix | 1 | 0 | 0 | 1,0 | -1,0 |
Swedish entries
[edit]Number of words with unknown POS: 11
Number of words and senses
[edit]Rows in the table: 26
| Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
|---|---|---|---|---|
| 20 | 22 | initialism | 2 | HD 2, OS 2, SAP 1 |
| 1769 | 1881 | proper noun | 3 | Hanna 3, Europa 3, Ester 3 |
| 80 | 110 | suffix | 5 | -t 5, -e 5, -en 4 |
| 711 | 820 | adverb | 7 | tillgodo 7, precis 5, så 5 |
| 46 | 57 | conjunction | 6 | och 6, emellertid 3, då 2 |
| 1 | 1 | interfix | 1 | -s- 1 |
| 11 | 11 | letter | 1 | a 1, o 1, w 1 |
| 9814 | 12429 | noun | 11 | bryt 11, rot 9, tur 9 |
| 7 | 7 | article | 1 | de 1, en 1, den 1 |
| 36 | 38 | idiom | 2 | måla fan på väggen 2, tårta på tårta 2, det är rena grekiskan 1 |
| 108 | 121 | interjection | 3 | god fortsättning 3, varsågod 3, ha 2 |
| 288 | 311 | abbreviation | 5 | BK 5, f. 3, tekn. 3 |
| 89 | 90 | phrase | 2 | sida upp och sida ned 2, glad påsk 1, gott nytt år 1 |
| 28 | 28 | proverb | 1 | Gå inte över ån efter vatten 1, Rom byggdes inte på en dag 1, nära skjuter ingen hare 1 |
| 156 | 157 | numeral | 2 | artonhundra 2, tionde 1, tolfte 1 |
| 1 | 1 | contraction | 1 | venne 1 |
| 135 | 159 | pronoun | 4 | vilken 4, det 3, som 2 |
| 2541 | 3723 | verb | 23 | gå 23, hålla 14, lägga 13 |
| 2226 | 2712 | adjective | 8 | hård 8, öppen 8, lös 7 |
| 10 | 10 | acronym | 1 | ABF 1, EU 1, DI 1 |
| 2 | 2 | particle | 1 | om 1, att 1 |
| 1 | 1 | prepositional phrase | 1 | i mannaminne 1 |
| 11 | 12 | determiner | 2 | var 2, de där 1, dessa 1 |
| 79 | 123 | preposition | 6 | om 6, på 5, runt 5 |
| 44 | 52 | prefix | 3 | över- 3, bi- 2, engångs- 2 |
| 1 | 1 | postposition | 1 | ut 1 |
Polysemy information
[edit]Rows in the table: 26
| POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
|---|---|---|---|---|---|
| initialism | 18 | 2 | 4 | 1,1 | 2,0 |
| proper noun | 1665 | 104 | 216 | 1,06 | 2,08 |
| suffix | 63 | 17 | 47 | 1,38 | 2,76 |
| adverb | 638 | 73 | 182 | 1,15 | 2,49 |
| conjunction | 40 | 6 | 17 | 1,24 | 2,83 |
| interfix | 1 | 0 | 0 | 1,0 | -1,0 |
| letter | 11 | 0 | 0 | 1,0 | -1,0 |
| noun | 8173 | 1641 | 4256 | 1,27 | 2,59 |
| article | 7 | 0 | 0 | 1,0 | -1,0 |
| idiom | 34 | 2 | 4 | 1,06 | 2,0 |
| interjection | 97 | 11 | 24 | 1,12 | 2,18 |
| abbreviation | 271 | 17 | 40 | 1,08 | 2,35 |
| phrase | 88 | 1 | 2 | 1,01 | 2,0 |
| proverb | 28 | 0 | 0 | 1,0 | -1,0 |
| numeral | 155 | 1 | 2 | 1,01 | 2,0 |
| contraction | 1 | 0 | 0 | 1,0 | -1,0 |
| pronoun | 115 | 20 | 44 | 1,18 | 2,2 |
| verb | 1853 | 688 | 1870 | 1,47 | 2,72 |
| adjective | 1898 | 328 | 814 | 1,22 | 2,48 |
| acronym | 10 | 0 | 0 | 1,0 | -1,0 |
| particle | 2 | 0 | 0 | 1,0 | -1,0 |
| prepositional phrase | 1 | 0 | 0 | 1,0 | -1,0 |
| determiner | 10 | 1 | 2 | 1,09 | 2,0 |
| preposition | 60 | 19 | 63 | 1,56 | 3,32 |
| prefix | 37 | 7 | 15 | 1,18 | 2,14 |
| postposition | 1 | 0 | 0 | 1,0 | -1,0 |
Spanish entries
[edit]Number of words with unknown POS: 245
Number of words and senses
[edit]Rows in the table: 25
| Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
|---|---|---|---|---|
| 52 | 53 | initialism | 2 | RAE 2, ARN 1, AXJ 1 |
| 1848 | 2075 | proper noun | 5 | Amazonas 5, Mérida 4, Jacobo 4 |
| 189 | 270 | suffix | 12 | -ón 12, -ón 7, -azo 4 |
| 10 | 11 | symbol | 2 | @ 2, ∃ 1, ⸘ 1 |
| 1343 | 1521 | adverb | 9 | ya 9, encima 6, al revés 5 |
| 55 | 80 | conjunction | 6 | que 6, no obstante 4, mientras 4 |
| 62 | 62 | letter | 1 | i 1, I 1, a 1 |
| 20475 | 26618 | noun | 15 | medio 15, casco 13, tope 10 |
| 8 | 8 | article | 1 | lo 1, la 1, el 1 |
| 4 | 5 | idiom | 2 | en vilo 2, no se ganó Zamora en una hora 1, quisqui 1 |
| 197 | 242 | interjection | 6 | ojalá 6, hombre 3, bueno 3 |
| 93 | 98 | abbreviation | 3 | SL 3, dpto 2, s/n 2 |
| 1 | 1 | affix | 1 | -un- 1 |
| 193 | 207 | phrase | 4 | a la buena de Dios 4, al fin y al cabo 4, de pe a pa 4 |
| 58 | 60 | proverb | 2 | el hábito no hace al monje 2, del dicho al hecho hay mucho trecho 2, a caballo regalado no le mires el diente 1 |
| 59 | 59 | numeral | 1 | trescientos 1, quince 1, once 1 |
| 7 | 8 | contraction | 2 | desdel 2, na 1, del 1 |
| 127 | 167 | pronoun | 4 | se 4, les 4, me 3 |
| 6537 | 9155 | verb | 17 | dar 17, sacar 15, tirar 14 |
| 6809 | 8379 | adjective | 8 | duro 8, verde 7, pegado 6 |
| 9 | 11 | acronym | 2 | AFI 2, PAN 2, OTAN 1 |
| 1 | 2 | participle | 2 | amado 2 |
| 4 | 4 | determiner | 1 | uno 1, cada 1, cierto 1 |
| 57 | 86 | preposition | 5 | en 5, por 5, según 3 |
| 110 | 113 | prefix | 2 | re- 2, bis- 2, ferro- 2 |
Polysemy information
[edit]Rows in the table: 25
| POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
|---|---|---|---|---|---|
| initialism | 51 | 1 | 2 | 1,02 | 2,0 |
| proper noun | 1659 | 189 | 416 | 1,12 | 2,2 |
| suffix | 144 | 45 | 126 | 1,43 | 2,8 |
| symbol | 9 | 1 | 2 | 1,1 | 2,0 |
| adverb | 1211 | 132 | 310 | 1,13 | 2,35 |
| conjunction | 41 | 14 | 39 | 1,45 | 2,79 |
| letter | 62 | 0 | 0 | 1,0 | -1,0 |
| noun | 16493 | 3982 | 10125 | 1,3 | 2,54 |
| article | 8 | 0 | 0 | 1,0 | -1,0 |
| idiom | 3 | 1 | 2 | 1,25 | 2,0 |
| interjection | 163 | 34 | 79 | 1,23 | 2,32 |
| abbreviation | 89 | 4 | 9 | 1,05 | 2,25 |
| affix | 1 | 0 | 0 | 1,0 | -1,0 |
| phrase | 185 | 8 | 22 | 1,07 | 2,75 |
| proverb | 56 | 2 | 4 | 1,03 | 2,0 |
| numeral | 59 | 0 | 0 | 1,0 | -1,0 |
| contraction | 6 | 1 | 2 | 1,14 | 2,0 |
| pronoun | 102 | 25 | 65 | 1,31 | 2,6 |
| verb | 5045 | 1492 | 4110 | 1,4 | 2,75 |
| adjective | 5675 | 1134 | 2704 | 1,23 | 2,38 |
| acronym | 7 | 2 | 4 | 1,22 | 2,0 |
| participle | 0 | 1 | 2 | 2,0 | 2,0 |
| determiner | 4 | 0 | 0 | 1,0 | -1,0 |
| preposition | 40 | 17 | 46 | 1,51 | 2,71 |
| prefix | 107 | 3 | 6 | 1,03 | 2,0 |
Mandarin entries
[edit]Number of words with unknown POS: 10629
Number of words and senses
[edit]Rows in the table: 23
| Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
|---|---|---|---|---|
| 4 | 4 | measure word | 1 | 遍 1, 杯 1, 盆 1 |
| 339 | 395 | proper noun | 4 | 姑苏 4, 姑蘇 4, 二里岗 3 |
| 23900 | 24110 | hanzi | 10 | 即 10, 场 7, 托 7 |
| 1 | 4 | suffix | 4 | 者 4 |
| 415 | 452 | proverb | 4 | 三十年河东,三十年河西 4, 三十年河東,三十年河西 4, 十年河东,十年河西 3 |
| 21 | 21 | phrase | 1 | PK就PK 1, 己立立人,己達達人 1, 己立立人,己达达人 1 |
| 21 | 26 | classifier | 3 | 組 3, 组 3, 条 1 |
| 3 | 3 | symbol | 1 | ; 1, { 1, ¥ 1 |
| 34 | 44 | adverb | 4 | 剛 4, 刚 4, 便 2 |
| 4 | 4 | conjunction | 1 | 似 1, 如 1, 併 1 |
| 25 | 25 | letter | 1 | ㄅ 1, ㄆ 1, ㄇ 1 |
| 11 | 18 | pronoun | 3 | 誰 3, 谁 3, 幾 3 |
| 163 | 239 | adjective | 4 | 洋洋 4, 美 3, 倭 3 |
| 17 | 24 | particle | 5 | 啊 5, 唄 2, 哩 2 |
| 184 | 320 | verb | 7 | 叫 7, 搞 7, 开 7 |
| 327 | 543 | noun | 12 | 帅 12, 烟 6, 點 6 |
| 3 | 3 | determiner | 1 | 更多 1, 誰家 1, 谁家 1 |
| 150 | 163 | idiom | 2 | 三下五除二 2, 东逃西窜 2, 周瑜打黃蓋 2 |
| 9 | 11 | preposition | 2 | 裏 2, 裡 2, 叫 1 |
| 1 | 1 | pinyin | 1 | Bìxiù 1 |
| 5 | 7 | prefix | 2 | 超 2, 新 2, 沒 1 |
| 21 | 22 | interjection | 2 | 哦 2, 555 1, 88 1 |
| 3 | 3 | postposition | 1 | 間 1, 以來 1, 间 1 |
Polysemy information
[edit]Rows in the table: 23
| POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
|---|---|---|---|---|---|
| measure word | 4 | 0 | 0 | 1,0 | -1,0 |
| proper noun | 297 | 42 | 98 | 1,17 | 2,33 |
| hanzi | 23761 | 139 | 349 | 1,01 | 2,51 |
| suffix | 0 | 1 | 4 | 4,0 | 4,0 |
| proverb | 385 | 30 | 67 | 1,09 | 2,23 |
| phrase | 21 | 0 | 0 | 1,0 | -1,0 |
| classifier | 18 | 3 | 8 | 1,24 | 2,67 |
| symbol | 3 | 0 | 0 | 1,0 | -1,0 |
| adverb | 28 | 6 | 16 | 1,29 | 2,67 |
| conjunction | 4 | 0 | 0 | 1,0 | -1,0 |
| letter | 25 | 0 | 0 | 1,0 | -1,0 |
| pronoun | 7 | 4 | 11 | 1,64 | 2,75 |
| adjective | 110 | 53 | 129 | 1,47 | 2,43 |
| particle | 13 | 4 | 11 | 1,41 | 2,75 |
| verb | 116 | 68 | 204 | 1,74 | 3,0 |
| noun | 217 | 110 | 326 | 1,66 | 2,96 |
| determiner | 3 | 0 | 0 | 1,0 | -1,0 |
| idiom | 137 | 13 | 26 | 1,09 | 2,0 |
| preposition | 7 | 2 | 4 | 1,22 | 2,0 |
| pinyin | 1 | 0 | 0 | 1,0 | -1,0 |
| prefix | 3 | 2 | 4 | 1,4 | 2,0 |
| interjection | 20 | 1 | 2 | 1,05 | 2,0 |
| postposition | 3 | 0 | 0 | 1,0 | -1,0 |
References
[edit]- ^ This (or more recent) database would be available at the project site wikokit, see Download section at page whinger.krc.karelia.ru.