alexrame's starred repositories

llama

Inference code for LLaMA models

Language:PythonLicense:NOASSERTIONStargazers:50895Issues:499Issues:872

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonLicense:Apache-2.0Stargazers:38382Issues:384Issues:1617

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:18422Issues:154Issues:468

dolly

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Language:PythonLicense:Apache-2.0Stargazers:10807Issues:136Issues:162

FlexGen

Running large language models on a single GPU for throughput-oriented scenarios.

Language:PythonLicense:Apache-2.0Stargazers:9084Issues:108Issues:81

nebuly

The user analytics platform for LLMs

Language:PythonLicense:Apache-2.0Stargazers:8366Issues:93Issues:202

trl

Train transformer language models with reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:4589Issues:53Issues:321

acme

A library of reinforcement learning components and agents

Language:PythonLicense:Apache-2.0Stargazers:3183Issues:83Issues:253

hh-rlhf

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

License:MITStargazers:1510Issues:19Issues:0

prismer

The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".

Language:PythonLicense:NOASSERTIONStargazers:1292Issues:15Issues:19

Dromedary

Dromedary: towards helpful, ethical and reliable LLMs.

Language:PythonLicense:GPL-3.0Stargazers:1102Issues:21Issues:12

alpaca_farm

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

Language:PythonLicense:Apache-2.0Stargazers:741Issues:8Issues:41

cabrita

Finetuning InstructLLaMA with portuguese data

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:551Issues:10Issues:14

git-theta

git extension for {collaborative, communal, continual} model development

Language:PythonLicense:Apache-2.0Stargazers:198Issues:8Issues:135

PreferenceTransformer

Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)

Language:PythonLicense:MITStargazers:142Issues:3Issues:7

catwalk

This project studies the performance and robustness of language models and task-adaptation methods.

Language:PythonLicense:Apache-2.0Stargazers:138Issues:7Issues:24
Language:PythonLicense:BSD-3-ClauseStargazers:124Issues:2Issues:7

cbtm

Code repository for the c-BTM paper

Language:PythonLicense:Apache-2.0Stargazers:105Issues:5Issues:3

ELM

[ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning

ExpansionNet_v2

Implementation code of the work "Exploiting Multiple Sequence Lengths in Fast End to End Training for Image Captioning"

Language:PythonLicense:MITStargazers:82Issues:5Issues:11

tangent_task_arithmetic

Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".

Language:PythonLicense:MITStargazers:74Issues:1Issues:2

rewardedsoups

Rewarded soups official implementation

weather4cast-2022

WeatherFusionNet - our solution to the NeurIPS 2022 Weather4cast competition

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:30Issues:0Issues:0

Robust_Weight_Signatures

[ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang

Language:PythonLicense:MITStargazers:16Issues:11Issues:0

iclmlp

Experiments for "A Closer Look at In-Context Learning under Distribution Shifts"

Language:PythonLicense:NOASSERTIONStargazers:15Issues:0Issues:0