lexibank / hubercolumbian

Dataset of Huber and Reed's "Comparative Vocabulary"

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

CLDF dataset derived from Huber and Reed's "Comparative Vocabulary" from 1992

CLDF validation

How to cite

If you use these data please cite

  • the original source

    Huber, R. Q. and Reed, R. B. 1992. Vocabulario comparativo: palabras selectas de lenguas indígenas de Colombia [Comparative vocabulary. Selected words from the indigeneous languages of Columbia]. Santafé de Bogota: Asociatión Instituto Lingüístico de Verano.

  • the derived dataset using the DOI of the particular released version you were using

Description

This dataset is licensed under a CC-BY-4.0 license

Available online at https://gist.github.com/LinguList/7481097

Conceptlists in Concepticon:

Notes

This dataset comprises 69 language varieties spoken in Columbia. The orthography profile was originally created by Jelena Prokić and further modified for our purposes.

Statistics

CLDF validation Glottolog: 100% Concepticon: 96% Source: 100% BIPA: 100% CLTS SoundClass: 100%

  • Varieties: 69
  • Concepts: 366
  • Lexemes: 26,723
  • Sources: 1
  • Synonymy: 1.18
  • Invalid lexemes: 0
  • Tokens: 158,489
  • Segments: 109 (0 BIPA errors, 0 CLTS sound class errors, 109 CLTS modified)
  • Inventory size (avg): 32.99

Contributors

Name GitHub user Description Role
Huber, R. Q. Author
Reed, R. B. Author
Johann-Mattis List @LinguList Other
Jelena Prokić orthography profile Other
Michael Cysouw @cysouw Digitization Other
Peter Bouda Digitization Other

CLDF Datasets

The following CLDF datasets are available in cldf:

About

Dataset of Huber and Reed's "Comparative Vocabulary"

License:Creative Commons Attribution 4.0 International


Languages

Language:Python 86.6%Language:TeX 13.4%