MuffinLinwist / chaconnorthwestarawakan

CLDF datasets accompanying Chacon's "Studies on Tukanoan Dialects" from 2022

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

CLDF dataset accompanying Chacon's "Annotated Swadesh Wordlists for Northwest Arawakan Languages" from 2022

How to cite

If you use these data please cite

  • the original source

    Chacon, Thiago C. (2022): Annotated Swadesh wordlists for Northwest Arawakan languages. Leipzig: Max Planck Institute for Evolutionary Anthropology.

  • the derived dataset using the DOI of the particular released version you were using

Description

This dataset is licensed under a CC-BY-4.0 license

Statistics

Glottolog: 100% Concepticon: 0% Source: 95% BIPA: 100% CLTS SoundClass: 100%

  • Varieties: 26
  • Concepts: 94
  • Lexemes: 2,390
  • Sources: 14
  • Synonymy: 1.13
  • Cognacy: 2,390 cognates in 561 cognate sets (277 singletons)
  • Cognate Diversity: 0.20
  • Invalid lexemes: 0
  • Tokens: 13,827
  • Segments: 129 (0 BIPA errors, 0 CTLS sound class errors, 129 CLTS modified)
  • Inventory size (avg): 38.08

Possible Improvements:

  • Entries missing sources: 125/2390 (5.23%)

Contributors

Name GitHub user Description Role
Thiago Chacon @thiagochacon main annotator Author
Johann-Mattis List @LinguList maintainer, patron other

CLDF Datasets

The following CLDF datasets are available in cldf:

About

CLDF datasets accompanying Chacon's "Studies on Tukanoan Dialects" from 2022

License:Creative Commons Attribution 4.0 International


Languages

Language:TeX 88.4%Language:Python 11.6%