Vishaal Udandarao's starred repositories
search-agents
Code for the paper 🌳 Tree Search for Language Model Agents
richhf-18k
RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with the file name of the associated labeled images (no urls or images are included in this dataset).
ml-mobileclip
This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training" CVPR 2024
goldfish-loss
Official implementation of Goldfish Loss: Mitigating Memorization in Generative LLMs
Recap-DataComp-1B
This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3 ?"
OmniCorpus
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
llm_dataset_inference
Official Repository for Dataset Inference for LLMs
matmulfreellm
Implementation for MatMul-free LM.
clip-beyond-tail
Generalization Beyond Data Imbalance: A Controlled Study on CLIP for Transferable Insights
Flickr30k-Image-Viewer
Small Flask-based apps to browse the Flickr30k dataset.
OCRDatasets
A collection of OCR-related datasets
foildataset
Experiments on Foil Dataset
svo_probes
The SVO-Probes Dataset for Verb Understanding
pointingqa
Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"