Lucas O. Souza's starred repositories
Mixture-of-depths
Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
LLM-RLHF-Tuning-with-PPO-and-DPO
Comprehensive toolkit for Reinforcement Learning from Human Feedback (RLHF) training, featuring instruction fine-tuning, reward model training, and support for PPO and DPO algorithms with various configurations for the Alpaca, LLaMA, and LLaMA2 models.
optimum-habana
Easy and lightning-fast training of 🤗 Transformers on Habana Gaudi processors (HPU)
openplayground
An LLM playground you can run on your laptop
streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
curated-transformers
🤖 A PyTorch library of curated Transformer models and their composable components
graph-of-thoughts
Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"
Alpaca-CoT
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs, and parameter-efficient methods (e.g., LoRA, p-tuning) for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM-related technologies as possible. We built this fine-tuning platform to make it easy for researchers to get started with and use large models.
awesome-totally-open-chatgpt
A list of totally open alternatives to ChatGPT
awesome-instruction-dataset
A collection of open-source datasets to train instruction-following LLMs (ChatGPT, LLaMA, Alpaca)
openai-cookbook
Examples and guides for using the OpenAI API
lm-evaluation-harness
A framework for few-shot evaluation of language models.
data-preparation
Code used for sourcing and cleaning the BigScience ROOTS corpus
olm-datasets
Pipeline for pulling and processing online language model pretraining data from the web
x-transformers
A simple but complete full-attention transformer with a set of promising experimental features from various papers
diffusion-models-class
Materials for the Hugging Face Diffusion Models Course
uvadlc_notebooks
Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023
WeightWatcher
The WeightWatcher tool for predicting the accuracy of Deep Neural Networks
the-incredible-pytorch
The Incredible PyTorch: a curated list of tutorials, papers, projects, communities and more relating to PyTorch.
deep_learning_curriculum
Language model alignment-focused deep learning curriculum
Tensor-Puzzles
Solve puzzles. Improve your PyTorch.