lucasosouza

Lucas O. Souza's starred repositories

Mixture-of-depths

Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"

Language:Python11200

LLM-RLHF-Tuning-with-PPO-and-DPO

Comprehensive toolkit for Reinforcement Learning from Human Feedback (RLHF) training, featuring instruction fine-tuning, reward model training, and support for PPO and DPO algorithms with various configurations for the Alpaca, LLaMA, and LLaMA2 models.

Language:Python9500

optimum-habana

Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)

Language:PythonApache-2.013000

openplayground

An LLM playground you can run on your laptop

Language:TypeScriptMIT616500

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language:PythonMIT637300

curated-transformers

🤖 A PyTorch library of curated Transformer models and their composable components

Language:PythonMIT85500

graph-of-thoughts

Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"

Language:PythonNOASSERTION199000

Promptify

Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research

Language:Jupyter NotebookApache-2.0312500

Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. 我们打造了方便研究人员上手和使用大模型等微调平台，我们欢迎开源爱好者发起任何有意义的pr！

Language:Jupyter NotebookApache-2.0252600

awesome-totally-open-chatgpt

A list of totally open alternatives to ChatGPT

CC0-1.0446900

awesome-instruction-dataset

A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)

104400

pandas-ai

Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.

Language:PythonNOASSERTION1205600

PaLM

An open-source implementation of Google's PaLM models

Language:PythonMIT80300

openai-cookbook

Examples and guides for using the OpenAI API

Language:MDXMIT5770400

JARVIS

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Language:PythonMIT2337900

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonMIT587900

evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Language:PythonNOASSERTION1439700

llama

Inference code for Llama models

Language:PythonNOASSERTION5427100

composer

Supercharge Your Model Training

Language:PythonApache-2.0507400

data-preparation

Code used for sourcing and cleaning the BigScience ROOTS corpus

Language:Jupyter NotebookApache-2.029100

olm-datasets

Pipeline for pulling and processing online language model pretraining data from the web

Language:PythonApache-2.017000

x-transformers

A simple but complete full-attention transformer with a set of promising experimental features from various papers

Language:PythonMIT439300

diffusion-models-class

Materials for the Hugging Face Diffusion Models Course

Language:Jupyter NotebookApache-2.0340000

bertviz

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

Language:PythonApache-2.0660800

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language:PythonNOASSERTION494100

uvadlc_notebooks

Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023

Language:Jupyter NotebookMIT233600

WeightWatcher

The WeightWatcher tool for predicting the accuracy of Deep Neural Networks

Language:PythonApache-2.0141500

the-incredible-pytorch

The Incredible PyTorch: a curated list of tutorials, papers, projects, communities and more relating to PyTorch.

MIT1120000

deep_learning_curriculum

Language model alignment-focused deep learning curriculum

116700

Tensor-Puzzles

Solve puzzles. Improve your pytorch.

Language:Jupyter NotebookMIT293400