Vishaal Udandarao's starred repositories
routerbench
The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System
Visual-Table
Stay tuned!
imagenet_d
[CVPR2024 Highlight] Official Code for "ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object"
Inflection-Benchmarks
Public Inflection Benchmarks
attention-interpolation-diffusion
Interpolation Between Text-to-Image Generation!
modelgauge
Make it easy to automatically and uniformly measure the behavior of many AI Systems.
LLM-SLERP-Merge
Spherical merge of PyTorch/HF-format language models with minimal feature loss.
Visual-CoT
Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning
visual_diversity_budget
Annotations on a Budget: Leveraging Geo-Data Similarity to Balance Model Performance and Annotation Cost