WJanbaz / Uyghur-Wordlist

Uyghur Word List

This repository contains various word lists and related information.

Files

  • wordlist-Internet-[version].zip

    A list of more than 2 million unique words, automatically extracted from the HTML content of many popular Uyghur websites as well as Wikipedia. This word list contains the majority of Uyghur words used on the Internet. Note that it may contain many misspelled or erroneous words. The list also includes statistical information indicating how frequently each word is used across the processed web pages. However, this frequency information may not accurately represent the real usage of Uyghur words in literature. Each line of the file consists of three fields separated by commas:

[word],[number of appearances across all web pages],[number of web pages in which the word appeared]
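
As a rough illustration of the three-field line format described above, the following Python sketch parses lines into structured entries and ranks them by how many pages they appeared in. The names `WordEntry`, `parse_line`, `read_entries`, and `top_by_page_count` are hypothetical and not part of this repository. The sketch also assumes the extracted file is UTF-8 text with one entry per line, which the README does not state.

```python
from typing import Iterable, Iterator, List, NamedTuple


class WordEntry(NamedTuple):
    word: str
    total_count: int  # appearances across all web pages
    page_count: int   # number of web pages containing the word


def parse_line(line: str) -> WordEntry:
    """Parse one '[word],[total],[pages]' line into a WordEntry."""
    # Split from the right so only the last two commas act as separators,
    # in case an extracted "word" itself contains a comma.
    word, total, pages = line.rstrip("\r\n").rsplit(",", 2)
    return WordEntry(word, int(total), int(pages))


def read_entries(lines: Iterable[str]) -> Iterator[WordEntry]:
    """Yield entries from an iterable of lines, skipping blank lines."""
    for line in lines:
        if line.strip():
            yield parse_line(line)


def top_by_page_count(entries: Iterable[WordEntry], n: int) -> List[WordEntry]:
    """Return the n entries that appeared in the most web pages."""
    return sorted(entries, key=lambda e: e.page_count, reverse=True)[:n]


# Example usage (the file name is an assumption; use whatever the zip extracts to):
#
#     with open("wordlist-Internet.txt", encoding="utf-8") as f:
#         for entry in top_by_page_count(read_entries(f), 20):
#             print(entry.word, entry.total_count, entry.page_count)
```

Ranking by the page count rather than the total count may reduce the influence of a single page that repeats a word many times, though either field can be used depending on the application.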

Licence

This work is licensed under a Creative Commons Attribution 4.0 International License.
