CAMeL Lab's repositories
camel_tools
A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.
Arabic_ALA-LC_Romanization
Romanizing Arabic bibliographic records in the ALA-LC standard.
arabic-gec
Code, models, and data for "Advancements in Arabic Grammatical Error Detection and Correction: An Empirical Investigation". EMNLP 2023.
arabic_error_type_annotation
The Arabic Error Type Annotation tool annotates Arabic error types following the ALC tagset.
camel_morph
Camel Morph’s goal is to build large open-source morphological models for Arabic and its dialects across many genres and domains.
gender-reinflection
Code, models, and data for "Gender-Aware Reinflection using Linguistically Enhanced Neural Models". COLING 2020, GeBNLP.
camel-tools-data
Repo containing data packages and catalogues used by CAMeL Tools.
Camel_Arabic_Frequency_Lists
The repository for the CAMeL Arabic Frequency Lists dataset.
ced_word_alignment
A character edit distance based word aligner.
samer-arabic-readability
Code, models, and data for "Strategies for Arabic Readability Modelling". ArabicNLP 2024, ACL.
camel-kenlm
KenLM: Faster and Smaller Language Model Queries
gender-rewriting
Code, models, and data for "User-Centric Gender Rewriting". NAACL 2022.
wild_diacritics
Wild Diacritics paper repo.
gender-rewriting-shared-task
Evaluation code and data for the gender rewriting shared task.
conllx_evaluation
Evaluates the accuracy of CoNLL-X annotations produced by annotators.
palmyra_server
A server that adds extra functionality to Palmyra.
camel_tools_updates
This page will have the latest updates on the different components of CAMeL Tools.
codafication
Code, models, and data for "Exploiting Dialect Identification in Automatic Dialectal Text Normalization". ArabicNLP 2024, ACL.