There are 1 repository under language-data topic.
Language definitions used by Weblate
a High Agreement Multi-lingual Outlier Detection dataset
A collection of language information tracked by the linguist project.
Embeddable submodule of parallel/monolingual text data, for use in testing code and sanity checks
k-sncacs dataset for Universal Depdencies
Collection of approximately 20K German texts from the 2010s: User written texts in form of personal stories, poems, poetry, articles and opinion pieces pulled from archives of the Stern NEON Community website.
Language identification with as few characters as possible
several flex databases to be merged
Languages Sql Table - Diller Sql Tablosu