Doohae Jung (wavy.ocean)'s repositories
DenseRetrieval
Implementations of DPR, GradCache, DSI, etc.
accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, and mixed precision
DeepSpeedExamples
Example models using DeepSpeed
GradCache
Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraints
Megatron-LM
Ongoing research training transformer models at scale
RETRO-pytorch
Implementation of RETRO, DeepMind's retrieval-based attention net, in PyTorch
Building-Python-Web-APIs-with-FastAPI
Building Python Web APIs with FastAPI, published by Packt
Chatbot_data
Chatbot data for Korean
course
The Hugging Face course
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
FiD
Fusion-in-Decoder
kolmev
Evaluation for Korean language models (e.g. BERT, RoBERTa, BART, T5, GPT-2, ...)
lassl
Easy Language Model Pretraining leveraging Hugging Face's Transformers and Datasets
llmss
Simple LLM serving (tensor model parallelism, pub/sub, gRPC)
lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
marcopolo
Marco Emilio Polo (/ˈmɑːrkoʊ ˈpoʊloʊ/, Venetian: [ˈmaɾko ˈpolo], Italian: [ˈmarko ˈpɔːlo]; c. 1254 – January 8, 1324) was a Venetian merchant, explorer, and writer from the Republic of Venice who travelled through Asia along the Silk Road between 1271 and 1295.
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
tppys
Text processing with PySpark (a sample project)
tpu-starter
Everything you want to know about Google Cloud TPU
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
trlx
A repo for distributed training of language models with Reinforcement Learning from Human Feedback (RLHF)
YaLM-100B
Pretrained language model with 100B parameters