alexrame's starred repositories
ColossalAI
Making large AI models cheaper, faster and more accessible
tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
alpaca-lora
Instruct-tune LLaMA on consumer hardware
alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
PreferenceTransformer
Preference Transformer: Modeling Human Preferences using Transformers for RL (accepted at ICLR 2023)
ExpansionNet_v2
Implementation of the paper "Exploiting Multiple Sequence Lengths in Fast End to End Training for Image Captioning"
tangent_task_arithmetic
Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".
rewardedsoups
Rewarded soups official implementation
weather4cast-2022
WeatherFusionNet - our solution to the NeurIPS 2022 Weather4cast competition
Robust_Weight_Signatures
[ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang