Rifki Afina Putri (rifkiaputri)

rifkiaputri

Geek Repo

Company:KAIST

Home Page:rifkiaputri.github.io

Github PK Tool:Github PK Tool

Rifki Afina Putri's starred repositories

RAGatouille

Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.

Language:PythonLicense:Apache-2.0Stargazers:2532Issues:0Issues:0

BLEnD

BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages

Language:PythonStargazers:15Issues:0Issues:0

guidance

A guidance language for controlling large language models.

Language:Jupyter NotebookLicense:MITStargazers:18284Issues:0Issues:0

Open-Instruction-Generalist

Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks

Language:PythonLicense:Apache-2.0Stargazers:203Issues:0Issues:0

warta-scrap

Indonesia Index News Crawler, including 10 online media

Language:PythonStargazers:76Issues:0Issues:0

nlp-phd-global-equality

A repo for open resources & information for people to succeed in PhD in CS & career in AI / NLP

Stargazers:808Issues:0Issues:0

LaMini-LM

LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions

Stargazers:806Issues:0Issues:0

expand-via-lexicon-based-adaptation

Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"

Language:PythonStargazers:30Issues:0Issues:0

evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Language:PythonLicense:NOASSERTIONStargazers:14432Issues:0Issues:0

lang2vec

A simple library for querying the URIEL typological database.

Language:PythonLicense:CC-BY-SA-4.0Stargazers:85Issues:0Issues:0

acd

Austronesian Comparative Dictionary

Language:TeXLicense:CC-BY-4.0Stargazers:11Issues:0Issues:0

kbbi-python

A Python module that fetches a page of a word/phrase from the Online Indonesian Dictionary (https://kbbi.kemdikbud.go.id).

Language:PythonLicense:MITStargazers:82Issues:0Issues:0

indonesian-nlp

A curated list of research papers and resources on Indonesian languages

License:Apache-2.0Stargazers:39Issues:0Issues:0

indolem

IndoLEM is a comprehensive Indonesian NLU benchmark, comprising three pillars NLP task: morpho-syntax, semantic, and discourse. Presented in COLING 2020.

Language:PythonStargazers:90Issues:0Issues:0

id-multi-label-hate-speech-and-abusive-language-detection

The Dataset for Multi Label Hate Speech and Abusive Language Detection in Indonesian Twitter

Language:TeXStargazers:61Issues:0Issues:0

deepchecks

Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and models from research to production.

Language:PythonLicense:NOASSERTIONStargazers:3497Issues:0Issues:0

malaysian-dataset

We gather Malaysian dataset! https://malaysian-dataset.readthedocs.io/

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:293Issues:0Issues:0

question-generator

An NLP research mainly exploring sequence-to-sequence (s2s) architecture to build Indonesian Automatic Question Generator (AQG). You can check the paper publication in README.

Language:Jupyter NotebookStargazers:23Issues:0Issues:0

qa-dataset-converter

Code from the paper "What do Models Learn from Question Answering Datasets?" (EMNLP 2020)

Language:PythonLicense:Apache-2.0Stargazers:53Issues:0Issues:0

knockknock

🚪✊Knock Knock: Get notified when your training ends with only two additional lines of code

Language:PythonLicense:MITStargazers:2771Issues:0Issues:0

NL-Augmenter

NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations

Language:PythonLicense:MITStargazers:766Issues:0Issues:0

KPQA

KPQA is an evaluation metric for generative question answering. (NAACL-21)

Language:Jupyter NotebookStargazers:34Issues:0Issues:0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:33995Issues:0Issues:0

KoBERT-NER

NER Task with KoBERT (with Naver NLP Challenge dataset)

Language:PythonLicense:Apache-2.0Stargazers:94Issues:0Issues:0

pytorch-balanced-sampler

PyTorch implementations of `BatchSampler` that under/over sample according to a chosen parameter alpha, in order to create a balanced training distribution.

Language:PythonStargazers:83Issues:0Issues:0

py-googletrans

(unofficial) Googletrans: Free and Unlimited Google translate API for Python. Translates totally free of charge.

Language:PythonLicense:MITStargazers:3801Issues:0Issues:0

pyserini

Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.

Language:PythonLicense:Apache-2.0Stargazers:1567Issues:0Issues:0
Language:PythonLicense:MITStargazers:76Issues:0Issues:0

spacy-clausie

Implementation of the ClausIE information extraction system for python+spacy

Language:PythonLicense:GPL-3.0Stargazers:217Issues:0Issues:0

Question-Generation-Paper-List

A summary of must-read papers for Neural Question Generation (NQG)

Stargazers:582Issues:0Issues:0