Rohit Gupta's starred repositories
LLaMA-Factory
Unify Efficient Fine-Tuning of 100+ LLMs
text-generation-inference
Large Language Model Text Generation Inference
mountpoint-s3
A simple, high-throughput file client for mounting an Amazon S3 bucket as a local file system.
alignment-handbook
Robust recipes to align language models with human and AI preferences
consistencydecoder
Consistency Distilled Diff VAE
big_vision
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
imageqa-public
Code for paper "Exploring Models and Data for Image Question Answering"
vision-for-action
Code to accompany "Does computer vision matter for action?"
ChannelViT
Channel Vision Transformers: An Image Is Worth C x 16 x 16 Words
Type-to-Track
[NeurIPS 2023] Type-to-Track: Retrieve Any Object via Prompt-based Tracking