MuffinLinwist / saenkoromance

CLDF Dataset derived from Saenko's "Annotated Swadesh wordlists for the Romance group" from 2015


How to cite

If you use these data please cite

Description

This dataset is licensed under a CC-BY-4.0 license

Conceptlists in Concepticon:

Statistics

CLDF validation; Glottolog: 100%; Concepticon: 100%; Source: 100%; BIPA: 100%; CLTS SoundClass: 100%

  • Varieties: 43
  • Concepts: 110
  • Lexemes: 4,853
  • Sources: 1
  • Synonymy: 1.03
  • Cognacy: 4,853 cognates in 465 cognate sets (241 singletons)
  • Cognate Diversity: 0.07
  • Invalid lexemes: 0
  • Tokens: 24,768
  • Segments: 128 (0 BIPA errors, 0 CLTS sound class errors, 128 CLTS modified)
  • Inventory size (avg): 43.35
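
The derived figures in this list can be roughly cross-checked against the raw counts. The sketch below is only an illustration: the formulas are assumptions chosen because they reproduce the reported values, not necessarily the way the statistics were generated, and the function names are hypothetical. In particular, the synonymy formula assumes every variety has an entry for every concept.

```python
# Raw counts copied from the statistics list.
VARIETIES = 43
CONCEPTS = 110
LEXEMES = 4853
COGNATES = 4853
COGNATE_SETS = 465
TOKENS = 24768


def synonymy(lexemes, varieties, concepts):
    """Average number of lexemes per (variety, concept) slot.

    Assumption: full coverage, i.e. no variety lacks a concept.
    """
    return lexemes / (varieties * concepts)


def cognate_diversity(cognate_sets, cognates, concepts):
    """Assumed aggregate formula: 0 if each concept has a single
    cognate set, 1 if every word forms its own set."""
    return (cognate_sets - concepts) / (cognates - concepts)


def tokens_per_lexeme(tokens, lexemes):
    """Average number of sound tokens per lexeme."""
    return tokens / lexemes


syn = round(synonymy(LEXEMES, VARIETIES, CONCEPTS), 2)
div = round(cognate_diversity(COGNATE_SETS, COGNATES, CONCEPTS), 2)
avg_len = round(tokens_per_lexeme(TOKENS, LEXEMES), 2)

print(syn, div, avg_len)
```

Under these assumptions the synonymy and cognate diversity values agree with the 1.03 and 0.07 reported above, and the average lexeme comes out at about 5.1 tokens.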

Contributors

Name | GitHub user | Description | Role
Johann-Mattis List | @LinguList | maintainer | Other
Mikhail Saenko | | data collection | Author

CLDF Datasets

The following CLDF datasets are available in cldf:
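
As a rough illustration of how a wordlist stored in the cldf directory might be inspected, the sketch below uses only Python's standard library. It assumes the forms table is a CSV file with the standard CLDF column names `ID`, `Language_ID`, `Parameter_ID`, and `Form`. The file name `cldf/forms.csv`, the helper `count_forms_per_language`, and the sample rows are hypothetical and not taken from this repository.

```python
import csv
import io
from collections import Counter


def count_forms_per_language(handle):
    """Count rows per Language_ID in a CLDF-style forms table."""
    reader = csv.DictReader(handle)
    return Counter(row["Language_ID"] for row in reader)


# Made-up sample rows standing in for the real forms table.
SAMPLE = """ID,Language_ID,Parameter_ID,Form
1,lang_a,concept_1,foo
2,lang_a,concept_2,bar
3,lang_b,concept_1,baz
"""

counts = count_forms_per_language(io.StringIO(SAMPLE))
for language, n in sorted(counts.items()):
    print(language, n)
```

To run this against a real file, the sample could be replaced with a handle such as `open("cldf/forms.csv", encoding="utf-8", newline="")`, assuming the file exists under that name.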

About

License: Creative Commons Attribution 4.0 International


Languages

Python 82.5%, TeX 17.5%