Afra Feyza Akyürek's starred repositories
Megatron-LM
Ongoing research training transformer models at scale
knowledge_distillation
Repository for "Propagating Knowledge Updates to LMs Through Distillation" (NeurIPS 2023).
llama-recipes
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.
direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
deprem_openai_apis
Extract addresses and intents from tweet texts
ThoughtSource
A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research group: https://samwald.info/
stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
google-research
Google Research
GEM-metrics
Automatic metrics for GEM tasks
interscript
The InterScript dataset contains interactive user feedback on scripts generated by a T5-XXL model.