Long Phan's repositories
deepspeed_lora
Training LORA with Deepspeed for pretraining purpose
BioNERBERT
How Language Models teach Language Models to do NER in Biomedical Domain
vietai-research-blog
VietAI Research Scientific Blog Frontend
alpaca-lora
Instruct-tune LLaMA on consumer hardware
campuskvetch
complains a great deal about campus issue and we will fix it
CWRUPS
EECS341 Final Project
DeepSpeedExamples
Example models using DeepSpeed
EECS397-Project
Projects for CWRU's EECS397:System Programming (Spring 2020)
efficient_alpaca
The aim of this repository is to utilize LLaMA to reproduce and enhance the Stanford Alpaca
FastChat
The release repo for "Vicuna: An Open Chatbot Impressing GPT-4"
gec_with_backtrans
Grammatical Error Correction (GEC) with Back Translation
gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
lm-evaluation-harness
A framework for few-shot evaluation of language models.
loconotion
📄 Python tool to turn Notion.so pages into lightweight, customizable static websites
Prompt-Engineering-Guide
:octopus: Guides, papers, lecture, and resources for prompt engineering
representation-engineering
Representation Engineering: A Top-Down Approach to AI Transparency
tianshou
An elegant PyTorch deep reinforcement learning library.
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.