Anoop Kunchukuttan's repositories
indic_nlp_library
Resources and tools for Indian language Natural Language Processing
indic_nlp_resources
Resources to go with the Indic NLP Library
multinmt_tutorial_coling2020
Material for the COLING 2020 Tutorial on Multilingual NMT
indowordnet_parallel
Parallel corpus mined from IndoWordnet synset glosses and examples
moses_job_scripts
A simple experiment management system for Moses
news_evaluation_script
NEWS shared task evaluation script (ported to Python 3)
DataAugForLRL
Generalized Data Augmentation for Low-Resource Translation
huggingface_notebooks
Notebooks using the Hugging Face libraries 🤗
OpenNMT-tf
My customizations to OpenNMT-tf
sacreBLEU-Indic
Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
UnsupervisedMT
Phrase-Based & Neural Unsupervised Machine Translation
dolly
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
indicnlp.ai4bharat.org
Archived old website for AI4Bhārat Indic-NLP
Instruction-Tuning-Papers
Reading list on instruction tuning, a trend that started with Natural-Instruction (ACL 2022), FLAN (ICLR 2022), and T0 (ICLR 2022).
lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
ml_timeline
Latest developments in LLM space
MT-Reading-List
A machine translation reading list maintained by Tsinghua Natural Language Processing Group
NER_Open_Data
This repo contains regularly updated NER-tagged data collected through the a-mma NER data collection programme
open-instruct
Fork of the work in the paper "How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources."
The-NLP-Pandect
A comprehensive reference for all topics related to Natural Language Processing
yanmtt
Yet Another Neural Machine Translation Toolkit