Genta Indra Winata (gentaiscool)

gentaiscool

Geek Repo

Company:Capital One AI Foundations

Location:New York

Home Page:https://gentawinata.com

Twitter:@gentaiscool

Github PK Tool:Github PK Tool


Organizations
audioku
HLTCHKUST
indobenchmark

Genta Indra Winata's repositories

miners

MINERS ⛏️: The semantic retrieval benchmark for evaluating multilingual language models.

Language:PythonLicense:Apache-2.0Stargazers:9Issues:0Issues:0
Language:HTMLLicense:MITStargazers:7Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

acl-anthology

Data and software for building the ACL Anthology.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

code-switching-papers

A curated list of research papers and resources on code-switching

License:Apache-2.0Stargazers:289Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

distfuse

A library to calculate similarity scores between two collections of text sequences encoded using transformer models for bitext mining, dense retrieval, retrieval-based classification, and retrieval-augmented generation (RAG).

Language:PythonLicense:Apache-2.0Stargazers:4Issues:0Issues:0

mteb

MTEB: Massive Text Embedding Benchmark

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:TeXStargazers:0Issues:0Issues:0

indonesian-nlp

A curated list of research papers and resources on Indonesian languages

License:Apache-2.0Stargazers:39Issues:0Issues:0
Language:TeXLicense:MITStargazers:0Issues:0Issues:0

matrix_fact

Matrix Factorization Library

Language:PythonLicense:BSD-3-ClauseStargazers:7Issues:0Issues:0

mt-metrics-eval

Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

lstm-attention

Attention-based bidirectional LSTM for Classification Task (ICASSP)

Language:PythonStargazers:107Issues:0Issues:0

DataLab

The unified platform for data-related resources.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

meta-emb

Multilingual Meta-Embeddings for Named Entity Recognition (RepL4NLP & EMNLP 2019)

Language:PythonStargazers:32Issues:0Issues:0

lm-evaluation-harness

A framework for few-shot evaluation of autoregressive language models.

License:MITStargazers:0Issues:0Issues:0

promptsource

Toolkit for creating, sharing and using natural language prompts.

License:Apache-2.0Stargazers:0Issues:0Issues:0

few-shot-lm

The source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021)

Language:PythonLicense:Apache-2.0Stargazers:52Issues:0Issues:0
Stargazers:0Issues:0Issues:0

end2end-asr-pytorch

End-to-End Automatic Speech Recognition on PyTorch

Language:PythonLicense:MITStargazers:293Issues:0Issues:0

BIG-bench

Beyond the Imitation Game collaborative benchmark for enormous language models

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:1Issues:0Issues:0

PromptPapers

Must-read papers on prompt-based tuning for pre-trained language models.

Stargazers:0Issues:0Issues:0

NER-datasets

Datasets to train supervised classifiers for Named-Entity Recognition in different languages (Portuguese, German, Dutch, French, English)

Stargazers:0Issues:0Issues:0

NL-Augmenter

NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

mesh-transformer-jax

Model parallel transformers in JAX and Haiku

License:Apache-2.0Stargazers:0Issues:0Issues:0