LDKR-Group / UzWordnet

UzWordnet is a lexical-semantic database, or a “word-net”, for the Uzbek language (native: O’zbek till) compatible with Princeton WordNet.

Home Page:http://uzwordnet.ldkr.org

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

The Uzbek Wordnet (UzWordnet)

UzWordnet is a lexical-semantic database, or a “word-net”, for the (Northern) Uzbek language (native: O’zbek till) compatible with Princeton Wordnet. By providing it open source (see License), we aim to motivate, support, and increase the application of database and knowledge graphs principles and techniques to the study of computational aspects of the (Northern) Uzbek language and, more generally, the usability of Uzbek within IT applications and the Internet.

The (Northern) Uzbek language is (the) statutory national language in Uzbekistan. It is a Turkic language spoken by approximately 26.8 million people around the world, remarkably by a large group of ethnic Uzbeks residing abroad, cf. Wikipedia.

See also Reference.

Current status (version 1.0)

  • 28149 synsets
  • 64389 senses
  • 20683 words
  • 71.79% (see Reference for details)

Release and Format

UzWordnet is released through the Uzbek Wordnet's website. The version released are:

  • Version 1.0 — Released 17th June 2021 in the following formats:

    • RDF (size 67.0 MB)
    • JSON (size 37.6 MB)
  • Version 1.1 — Released 29th November 2021 in the following formats:

    • XML (size 14.6 MB)

Note on format and conversions

[2021-11-29] UzWordnet [XML] developed to comply with Global WordNet Association's (lemon-based) Resource Description Framework (RDF) for which a wordnet can be published and submitted to the Inter-Lingual-Index (ILI).

More formats can be generated by using the Global WordNet Converter and Validator, available here.

License

UzWordnet was initially derived by "expansion" from Princeton WordNet under the WordNet License and further developed under the Creative Commons Attribution 4.0 International License CC BY-SA 4.0. You can read more about this license here.

You may use, share and adapt UzWordnet providing attribution is given to Princeton WordNet and explicit reference is made to UzWordnet and the UzWordnet Team using the citation appopriate to your project or paper.

In particular, when writing a paper or producing a software application based on UzWordnet, please use the following citations for hardcopy and the online version of your project or paper.

Hardcopy

See Reference.

Online

Publications should cite the official website of UzWordnet, that is: https://uzwordnet.ldkr.org/.

Contributors

  • Alessandro Agostini (Project Leader - email here)
  • Timur Usmanov (Research and Development)
  • Ulugbek Khamdamov (Research and Validation)
  • Nilufar Abdurakhmonova (Research and Validation)
  • Mukhammadsaid Mamasaidov (Research and Validation)
  • Enver Menadjiev (Development and Website)

Reference

A. Agostini, T. Usmanov, U. Khamdamov, N. Abdurakhmonova, M. Mamasaidov, “UZWORDNET: A Lexical- Semantic Database for the Uzbek Language. In S. Bosch, C. Fellbaum, M. Griesel, A. Rademaker and P. Vossen, editors, Proceedings of the Eleventh International Global Wordnet Conference (GWC-2021), pp. 8–19, Potchefstroom, South Africa, 2021. Available online here. Video-talk here.

About

UzWordnet is a lexical-semantic database, or a “word-net”, for the Uzbek language (native: O’zbek till) compatible with Princeton WordNet.

http://uzwordnet.ldkr.org