Vishaal Udandarao (vishaal27)

Company: University of Tübingen | University of Cambridge

Location: Tübingen, Germany

Home Page: https://vishaal27.github.io/

Twitter: @vishaal_urao

Vishaal Udandarao's starred repositories

tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Language: Python | License: MIT | Stargazers: 11,585 | Watchers: 167 | Issues: 229
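
For reference, a minimal usage sketch of tiktoken's public API; the encoding name and sample string below are illustrative:

    import tiktoken

    # Load the BPE encoding used by GPT-4 / GPT-3.5-class models
    enc = tiktoken.get_encoding("cl100k_base")

    token_ids = enc.encode("hello world")          # text -> list of integer token ids
    assert enc.decode(token_ids) == "hello world"  # ids decode back to the original text
    print(len(token_ids))                          # token count, useful for context-window budgeting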

dbrx

Code examples and resources for DBRX, a large language model developed by Databricks

Language: Python | License: NOASSERTION | Stargazers: 2,492 | Watchers: 40 | Issues: 23

img2img-turbo

One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more

Language: Python | License: MIT | Stargazers: 1,409 | Watchers: 18 | Issues: 74

mup

Maximal Update Parametrization (µP)

Language: Jupyter Notebook | License: MIT | Stargazers: 1,319 | Watchers: 29 | Issues: 61
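
A rough sketch of how the mup package is typically wired in, assuming the API from the microsoft/mup README; the layer sizes and learning rate here are illustrative:

    import torch.nn as nn
    from mup import MuReadout, set_base_shapes, MuAdam

    class MLP(nn.Module):
        def __init__(self, width):
            super().__init__()
            self.body = nn.Sequential(nn.Linear(32, width), nn.ReLU())
            self.head = MuReadout(width, 10)  # µP-aware drop-in for the output nn.Linear
        def forward(self, x):
            return self.head(self.body(x))

    base = MLP(width=64)     # narrow "base" model fixes the reference shapes
    model = MLP(width=1024)  # the wide model actually being trained
    set_base_shapes(model, base)               # record base-vs-target widths on each param
    opt = MuAdam(model.parameters(), lr=1e-3)  # optimizer applies per-layer µP lr scaling

The point of µP is that hyperparameters tuned on the narrow base model transfer to the wide one.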

evolutionary-model-merge

Official repository of "Evolutionary Optimization of Model Merging Recipes"

Language: Python | License: Apache-2.0 | Stargazers: 1,168 | Watchers: 40 | Issues: 11

lilac

Curate better data for LLMs

Language: Python | License: Apache-2.0 | Stargazers: 924 | Watchers: 13 | Issues: 291

Mind2Web

[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web"

Language: Jupyter Notebook | License: MIT | Stargazers: 644 | Watchers: 22 | Issues: 41

Long-CLIP

[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"

Language: Python | License: Apache-2.0 | Stargazers: 559 | Watchers: 12 | Issues: 64

HPT

HPT - Open Multimodal LLMs from HyperGAI

Language: Python | License: Apache-2.0 | Stargazers: 305 | Watchers: 7 | Issues: 11

scaling_on_scales

When do we not need larger vision models?

Language: Python | License: MIT | Stargazers: 299 | Watchers: 7 | Issues: 14

LAMM

[NeurIPS 2023 Datasets and Benchmarks Track] LAMM: Multi-Modal Large Language Models and Applications as AI Agents

ALLaVA

Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model

Language: Python | License: Apache-2.0 | Stargazers: 234 | Watchers: 11 | Issues: 11

LLM-SLERP-Merge

SLERP (spherical linear interpolation) merging of PyTorch/HF-format language models with minimal feature loss.
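
Independent of this repository's implementation, the underlying operation is spherical linear interpolation between weight tensors; a minimal sketch, with the tolerance and t value chosen for illustration:

    import torch

    def slerp(w0: torch.Tensor, w1: torch.Tensor, t: float) -> torch.Tensor:
        # Interpolate along the great circle between two weight tensors; unlike
        # plain averaging, this preserves the angular structure of the weights.
        v0, v1 = w0.flatten().double(), w1.flatten().double()
        u0, u1 = v0 / v0.norm(), v1 / v1.norm()
        omega = torch.arccos(torch.clamp(u0 @ u1, -1.0, 1.0))  # angle between the two
        if omega < 1e-6:
            out = (1 - t) * v0 + t * v1  # nearly parallel: fall back to linear interpolation
        else:
            s = torch.sin(omega)
            out = (torch.sin((1 - t) * omega) / s) * v0 + (torch.sin(t * omega) / s) * v1
        return out.reshape(w0.shape).to(w0.dtype)

    # Usage: merge two fine-tunes of the same base model, key by key
    # merged = {k: slerp(sd_a[k], sd_b[k], t=0.5) for k in sd_a}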

ml-tic-clip

Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models".

Language: Python | License: NOASSERTION | Stargazers: 88 | Watchers: 15 | Issues: 0

Visual-CoT

Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning

Language: Python | License: Apache-2.0 | Stargazers: 84 | Watchers: 1 | Issues: 6

Chain-of-Spot

Chain-of-Spot: Interactive Reasoning Improves Large Vision-language Models

Language: Python | License: Apache-2.0 | Stargazers: 80 | Watchers: 5 | Issues: 7

DreamLIP

[ECCV 2024] Official PyTorch implementation of DreamLIP: Language-Image Pre-training with Long Captions

Language: Python | License: NOASSERTION | Stargazers: 78 | Watchers: 8 | Issues: 8

attention-interpolation-diffusion

Interpolation between text-to-image generations.

Inflection-Benchmarks

Public Inflection Benchmarks

skerch

Sketched matrix decompositions for PyTorch

Language: Python | License: MIT | Stargazers: 63 | Watchers: 2 | Issues: 2

mnms

m&ms: A Benchmark to Evaluate Tool Use for Multi-Step, Multi-Modal Tasks

modelgauge

Make it easy to automatically and uniformly measure the behavior of many AI systems.

Language: Python | License: Apache-2.0 | Stargazers: 25 | Watchers: 17 | Issues: 121

VL-ICL

Code for the paper "VL-ICL Bench: The Devil in the Details of Benchmarking Multimodal In-Context Learning"

Meta-Prompting

Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs (ECCV 2024)

Language: Python | License: MIT | Stargazers: 11 | Watchers: 3 | Issues: 1

visual_diversity_budget

Annotations on a Budget: Leveraging Geo-Data Similarity to Balance Model Performance and Annotation Cost

clipcov-data-efficient-clip

Code for the AISTATS paper on efficient multimodal learning (Eff MML)

Language: Jupyter Notebook | Stargazers: 7 | Watchers: 0 | Issues: 0