Vishaal Udandarao (vishaal27)

Company: University of Tübingen | University of Cambridge

Location: Tübingen, Germany

Home Page: https://vishaal27.github.io/

Twitter: @vishaal_urao

Vishaal Udandarao's starred repositories

tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Language: Python | License: MIT | Stargazers: 11005 | Issues: 162 | Issues: 217

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language: Python | License: Apache-2.0 | Stargazers: 3072 | Issues: 25 | Issues: 126

dbrx

Code examples and resources for DBRX, a large language model developed by Databricks

Language: Python | License: NOASSERTION | Stargazers: 2473 | Issues: 40 | Issues: 22

mup

maximal update parametrization (µP)

Language: Jupyter Notebook | License: MIT | Stargazers: 1226 | Issues: 29 | Issues: 58

T-GATE

T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!

Language: Python | License: MIT | Stargazers: 301 | Issues: 11 | Issues: 14

LAMM

[NeurIPS 2023 Datasets and Benchmarks Track] LAMM: Multi-Modal Large Language Models and Applications as AI Agents

ALLaVA

Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model

Language: Python | License: Apache-2.0 | Stargazers: 214 | Issues: 11 | Issues: 11

ViTamin

[CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"

Language: Python | License: Apache-2.0 | Stargazers: 147 | Issues: 5 | Issues: 8

chug

Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.

Language: Python | License: Apache-2.0 | Stargazers: 137 | Issues: 11 | Issues: 3
Language: Python | License: MIT | Stargazers: 130 | Issues: 4 | Issues: 0

LLM-SLERP-Merge

Spherical Merge PyTorch/HF format Language Models with minimal feature loss.

Language: Python | License: BSD-3-Clause | Stargazers: 90 | Issues: 4 | Issues: 7

attention-interpolation-diffusion

Interpolation Between Text-to-Image Generation!

routerbench

The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System

Language: Python | License: MIT | Stargazers: 75 | Issues: 6 | Issues: 5

Inflection-Benchmarks

Public Inflection Benchmarks

Language: Python | License: MIT | Stargazers: 65 | Issues: 4 | Issues: 5

Visual-CoT

Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning

Language: Python | License: Apache-2.0 | Stargazers: 63 | Issues: 1 | Issues: 4

skerch

Sketched matrix decompositions for PyTorch

Language: Python | License: MIT | Stargazers: 62 | Issues: 2 | Issues: 2

DreamLIP

[arXiv 2024] Official PyTorch implementation of DreamLIP: Language-Image Pre-training with Long Captions

imagenet_d

[CVPR2024 Highlight] Official Code for "ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object"

Language: Python | License: MIT | Stargazers: 36 | Issues: 2 | Issues: 4

modelgauge

Make it easy to automatically and uniformly measure the behavior of many AI Systems.

Language: Python | License: Apache-2.0 | Stargazers: 22 | Issues: 17 | Issues: 98

coco-rem

Code for the paper "Benchmarking Object Detectors with COCO: A New Path Forward."

Language: Python | License: MIT | Stargazers: 11 | Issues: 1 | Issues: 0

ex2

If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions

visual_diversity_budget

Annotations on a Budget: Leveraging Geo-Data Similarity to Balance Model Performance and Annotation Cost

Language: JavaScript | License: Apache-2.0 | Stargazers: 5 | Issues: 0 | Issues: 0

imagenot

The accompanying code of "ImageNot: A contrast with ImageNet preserves model rankings"

Stargazers: 2 | Issues: 0 | Issues: 0