ColdFusion2001's repositories
schedule_free
Schedule-Free Optimization in PyTorch
sae
Sparse autoencoders
xlstm-cuda
Cuda implementation of Extended Long Short Term Memory (xLSTM) with C++ and PyTorch ports
bonito
A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.
llm_distillation_playbook
Best practices for distilling large language models.
OpenMoE
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
MeZO
[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333
mamba.py
A Mamba with parallel scan in PyTorch.
awesome-llm-interpretability
A curated list of Large Language Model (LLM) Interpretability resources.
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data
trl
Train transformer language models with reinforcement learning.
hallucination-leaderboard
Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents
Ladder-Side-Tuning
PyTorch codes for "LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning"
flores
Facebook Low Resource (FLoRes) MT Benchmark