Create dataset loader for IndoModal
SamuelCahyawijaya opened this issue · comments
Dataloader name: indomodal/indomodal.py
DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?indomodal
Dataset | indomodal |
---|---|
Description | IndoModal is a dataset consisting of annotations related to modality (meanings concerning possibility and necessity) in Indonesian, more specifically "harus", "mesti" and their derivatives "harusnya", "seharusnya", "mestinya" and "semestnya". These words were annotated with regard to various linguistic features, including semantics (e.g. "flavour" [epistemic vs. root]) and syntax (e.g. "level" [main clause vs. non-main clause]). The sentences were taken from three Indonesian subcorpora in the Leipzig Corpora Collection. The suffixes in the file names denote the annotators. |
Subsets | - |
Languages | ind |
Tasks | Word Sense Disambiguation |
License | Creative Commons Attribution 4.0 (cc-by-4.0) |
Homepage | https://github.com/matbahasa/IndoModal |
HF URL | - |
Paper URL | https://www.anlp.jp/proceedings/annual_meeting/2024/pdf_dir/E9-5.pdf |