Sehyun Choi's repositories
litgpt
Pretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.
pretraining
Pretraining
bittensor
Internet-scale Neural Networks
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
syncdoth.github.io
A beautiful, simple, clean, and responsive Jekyll theme for academics
ColossalAI
Making large AI models cheaper, faster and more accessible
ecco
Visualize and explore NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2).
comet-rl
A WIP project of training COMET model using RL.
tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
RMT
Implementation of Recurrent Memory Transformer + Gating algorithms applied on long-context dialogue setting.
Chain-of-Hindsight-PyTorch
Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using pytorch and huggingface Trainers.
DSI-transformers
A huggingface transformers implementation of "Transformer Memory as a Differentiable Search Index"
prompt-compression
This is an unofficial implementation of prompt-compression for my own research.
HKUST_PhD_MPhil_thesis_Latex
The Hong Kong University of Science and Technology PhD/MPhil thesis latex template based on the latest official sample (http://pg.ust.hk/guides_n_forms/students/thesis_sample_page_phd.pdf)
NLP-TDSC
COMP 5331 HKUST Team 16
jekyll-pygments-themes
CSS themes for Pygments syntax highlighter, ready to drop into Jekyll. Also includes a custom theme builder!
Diffusion-LM
Diffusion-LM
An-Application-of-Deep-Reinforcement-Learning-to-Algorithmic-Trading
Experimental code supporting the results presented in the scientific research paper entitled "An Application of Deep Reinforcement Learning to Algorithmic Trading"
MAFIA-Explanation-NLI
Official code for MAFIA (MAsk-based Feature Interaction Attribution): Explain BERT model's decision in NLI task.
vinyl
vinyl playlist
gradio_serving
Simple example of using gradio for serving ML models.
HKUST-KSA-ML-Study
HKUST KSA 2020 ML study repository
face_mask_inpaint
GAN-based project of removing facial masks from face images.
predicting-business-popularity
About Research project on predicting business popularity using GNNs in location-based social networks.