Genta Indra Winata (gentaiscool)

Company: Bloomberg LP

Location: New York

Home Page: https://gentawinata.com

Twitter: @gentaiscool


Organizations
audioku
HLTCHKUST
indobenchmark

Genta Indra Winata's starred repositories

llama

Inference code for LLaMA models

Language: Python | License: NOASSERTION | Stargazers: 50895 | Issues: 499 | Issues: 872

machine_learning_complete

A comprehensive machine learning repository containing 30+ notebooks on different concepts, algorithms and techniques.

Language: Jupyter Notebook | License: MIT | Stargazers: 4501 | Issues: 84 | Issues: 4

s4

Structured state space sequence models

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2088 | Issues: 47 | Issues: 127

nusa-crowd

A collaborative project to collect datasets in Indonesian languages.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 253 | Issues: 6 | Issues: 191

Open-Instruction-Generalist

Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks

Language: Python | License: Apache-2.0 | Stargazers: 203 | Issues: 13 | Issues: 9

ACL-anthology-corpus

This repository provides details and links to the ACL Anthology corpus/collection, including .bib, .pdf, and GROBID extractions of the PDFs

Language: Jupyter Notebook | Stargazers: 159 | Issues: 7 | Issues: 3

DeepCubeA

Code for DeepCubeA, a Deep Reinforcement Learning algorithm that can learn to solve the Rubik's cube.

DataLab

The unified platform for data-related resources.

Language: Python | License: Apache-2.0 | Stargazers: 125 | Issues: 11 | Issues: 129

nusax

High-quality parallel resource on sentiment analysis for 10 low-resource Indonesian languages, English, and Indonesian (Outstanding Paper at EACL 2023)

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 80 | Issues: 8 | Issues: 0

minilmv2.bb

Our open source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188)

Language: Python | License: Apache-2.0 | Stargazers: 60 | Issues: 8 | Issues: 2

indonesian-nlp

A curated list of research papers and resources on Indonesian languages

License: Apache-2.0 | Stargazers: 39 | Issues: 6 | Issues: 0

kbir_keybart

Experimental code used in pre-training the KBIR and KeyBART models

Language: Python | License: Apache-2.0 | Stargazers: 26 | Issues: 6 | Issues: 1

nusa-writes

NusaWrites is an in-depth analysis of corpus collection strategies and a comprehensive language modeling benchmark for underrepresented and extremely low-resource Indonesian local languages.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 26 | Issues: 5 | Issues: 2

mot

Multilingual Open Text

Language: Python | License: MIT | Stargazers: 24 | Issues: 3 | Issues: 4

english-speaker-friendly-korean-companies

Repository to aggregate data about Korean companies that work with English as an official language or accept non-Korean-speaking members

paranames

ParaNames: A multilingual resource for parallel names

Language: Jupyter Notebook | License: MIT | Stargazers: 22 | Issues: 2 | Issues: 4

ds2

Code for DS2 paper

KnowExpert

The implementation of the paper "Retrieval-Free Knowledge-Grounded Dialogue Response Generation with Adapters".

Language: Python | License: MIT | Stargazers: 17 | Issues: 6 | Issues: 2

LLM-Code-Mixing

Can LLMs generate code-mixed sentences through zero-shot prompting?

License: NOASSERTION | Stargazers: 10 | Issues: 3 | Issues: 0

code-mixed-lid

Word-level language identification for Bangla-English code-mixed social media data, using a BiLSTM with subword embeddings.

Language: Python | Stargazers: 7 | Issues: 2 | Issues: 0

nusa-catalogue

Dataset Catalogue Homepage for Indonesian Languages

Language: JavaScript | License: Apache-2.0 | Stargazers: 6 | Issues: 5 | Issues: 5

globalbench

GlobalBench: A Benchmark for Global Progress in Language Technology

Language: Python | Stargazers: 6 | Issues: 2 | Issues: 0

rubik

Solve a Rubik's Cube with neural networks

HopeEDI

HopeEDI: A Multilingual Hope Speech Detection Dataset for Equality, Diversity, and Inclusion

Language: Python | Stargazers: 1 | Issues: 1 | Issues: 0

emoji-GAN

HKUST's ELEC5680/COMP5214 Advanced Deep Learning Architectures Assignment 3

Language: Python | License: MIT | Stargazers: 1 | Issues: 2 | Issues: 3

.github

Landing page

Language: SCSS | License: Apache-2.0 | Stargazers: 1 | Issues: 4 | Issues: 0

Weakly-Supervised-Multitask-MAR

Weakly-supervised Multitask Multimodal Affect Recognition.