Alham Fikri Aji's repositories
paracotta-paraphrase
Synthetic multilingual paraphrase data
summerschool-KD-PEFT
Mexican NLP 2024 Summer School Tutorial on Knowledge Distillation and Parameter-Efficient Finetuning
Marian-transfer
Transfer learning experiment demo with Marian
acl-anthology
Data and software for building the ACL Anthology.
data_tooling
Tools for managing datasets for governance and training.
evaluation-robustness-consistency
Tools for evaluating model robustness and consistency
id-nlp-resource
A list of Indonesian NLP resources.
indolem
IndoLEM is a comprehensive Indonesian NLU benchmark covering three pillars of NLP tasks: morpho-syntax, semantics, and discourse. Presented at COLING 2020.
indonesian-mt-data
Benchmarking Multidomain English-Indonesian Machine Translation
intgemm
int8_t and int16_t matrix multiply based on https://arxiv.org/abs/1705.01991
lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
mosesdecoder
Moses, the machine translation system
nusa-catalogue
Dataset Catalogue Homepage for Indonesian Languages
promptsource
Toolkit for creating, sharing and using natural language prompts.
Semantic_Relatedness_SemEval2024
SemEval 2024 Task 1: Textual Semantic Relatedness
stif-indonesia
Implementation of "Semi-Supervised Low-Resource Style Transfer of Indonesian Informal to Formal Language with Iterative Forward-Translation". TBD
variant-lite
variant lite - A C++17-like variant, a type-safe union for C++98, C++11 and later in a single-file header-only library
xmtf
Crosslingual Generalization through Multitask Finetuning