Beast code in Giters

alexrame's starred repositories

llama

Inference code for LLaMA models

Language:PythonNOASSERTION50895 499 872

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonApache-2.038382 384 1617

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

NOASSERTION25896 282 39

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language:Jupyter NotebookApache-2.018422 154 468

dolly

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Language:PythonApache-2.010807 136 162

FlexGen

Running large language models on a single GPU for throughput-oriented scenarios.

Language:PythonApache-2.09084 108 81

nebuly

The user analytics platform for LLMs

Language:PythonApache-2.08366 93 202

trl

Train transformer language models with reinforcement learning.

Language:PythonApache-2.04589 53 321

acme

A library of reinforcement learning components and agents

Language:PythonApache-2.03183 83 253

hh-rlhf

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

MIT1510 190

prismer

The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".

Language:PythonNOASSERTION1292 15 19

Dromedary

Dromedary: towards helpful, ethical and reliable LLMs.

Language:PythonGPL-3.01102 21 12

alpaca_farm

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

Language:PythonApache-2.0741 8 41

cabrita

Finetuning InstructLLaMA with portuguese data

Language:Jupyter NotebookApache-2.0551 10 14

minimal-llama

Language:Python452 17 11

ConstitutionalHarmlessnessPaper

203 50

git-theta

git extension for {collaborative, communal, continual} model development

Language:PythonApache-2.0198 8 135

PreferenceTransformer

Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)

Language:PythonMIT142 3 7

catwalk

This project studies the performance and robustness of language models and task-adaptation methods.

Language:PythonApache-2.0138 7 24

ties-merging

Language:PythonBSD-3-Clause124 2 7

machiavelli

Language:PythonMIT113 4 10

cbtm

Code repository for the c-BTM paper

Language:PythonApache-2.0105 5 3

ELM

[ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning

Language:Python97 3 4

ExpansionNet_v2

Implementation code of the work "Exploiting Multiple Sequence Lengths in Fast End to End Training for Image Captioning"

Language:PythonMIT82 5 11

tangent_task_arithmetic

Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".

Language:PythonMIT74 1 2

rewardedsoups

Rewarded soups official implementation

Language:HTML41 1 2

weather4cast-2022

WeatherFusionNet - our solution to the NeurIPS 2022 Weather4cast competition

Language:Jupyter NotebookApache-2.03000

Robust_Weight_Signatures

[ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang

Language:PythonMIT16 110

iclmlp

Experiments for "A Closer Look at In-Context Learning under Distribution Shifts"

Language:PythonNOASSERTION1500

erm_plusplus

Language:PythonMIT15 1 2