Vishaal Udandarao (vishaal27)

Company: University of Tübingen | University of Cambridge

Location: Tübingen, Germany

Home Page: https://vishaal27.github.io/

Twitter: @vishaal_urao

Vishaal Udandarao's starred repositories

search-agents

Code for the paper 🌳 Tree Search for Language Model Agents

Language: Python | License: MIT | Stargazers: 30 | Issues: 0

richhf-18k

RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with the file name of the associated labeled images (no urls or images are included in this dataset).

Stargazers: 21 | Issues: 0
Language: Dockerfile | License: Apache-2.0 | Stargazers: 25 | Issues: 0

MINT-1T

MINT-1T: A one trillion token multimodal interleaved dataset.

Stargazers: 53 | Issues: 0

ml-mobileclip

This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training" CVPR 2024

Language: Python | License: NOASSERTION | Stargazers: 429 | Issues: 0

dclm

DataComp for Language Models

Language: HTML | License: MIT | Stargazers: 42 | Issues: 0

ilid

Industrial Language-Image Dataset (ILID), a web-crawled dataset containing language-image samples from various web catalogs, representing parts/components from the industrial domain.

Language: Python | License: Apache-2.0 | Stargazers: 3 | Issues: 0
Language: Python | License: NOASSERTION | Stargazers: 14 | Issues: 0
Language: Python | License: MIT | Stargazers: 2 | Issues: 0

goldfish-loss

Official implementation of Goldfish Loss: Mitigating Memorization in Generative LLMs

Language: Python | License: Apache-2.0 | Stargazers: 45 | Issues: 0
Stargazers: 5 | Issues: 0

Recap-DataComp-1B

This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3 ?"

Stargazers: 85 | Issues: 0

icai

Inverse Constitutional AI: compressing pairwise preference data into a short constitution of principles.

Language: Python | License: Apache-2.0 | Stargazers: 7 | Issues: 0

OmniCorpus

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Stargazers: 140 | Issues: 0
Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 5 | Issues: 0

llm_dataset_inference

Official Repository for Dataset Inference for LLMs

Language: Python | License: MIT | Stargazers: 12 | Issues: 0

CLoG

✌ CLoG: Benchmarking Continual Learning of Image Generation Models

Language: Python | License: MIT | Stargazers: 10 | Issues: 0

matmulfreellm

Implementation for MatMul-free LM.

Language: Python | License: Apache-2.0 | Stargazers: 2162 | Issues: 0

videophy

Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics

Language: Python | License: MIT | Stargazers: 17 | Issues: 0

ReNO

ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization

Language: Python | License: MIT | Stargazers: 46 | Issues: 0

clip-beyond-tail

Generalization Beyond Data Imbalance: A Controlled Study on CLIP for Transferable Insights

Language: Jupyter Notebook | License: MIT | Stargazers: 13 | Issues: 0

Flickr30k-Image-Viewer

Small Flask-based apps to browse the Flickr30k dataset.

Language: Python | License: Apache-2.0 | Stargazers: 20 | Issues: 0
Language: Python | License: MIT | Stargazers: 34 | Issues: 0
Language: HTML | Stargazers: 10 | Issues: 0

OCRDatasets

A collection of OCR-related datasets

Stargazers: 67 | Issues: 0

foildataset

Experiments on Foil Dataset

Language: Jupyter Notebook | Stargazers: 7 | Issues: 0
Language: Python | License: MIT | Stargazers: 3 | Issues: 0

BLA

Benchmark for Basic Language Abilities of Multimodal Pretrained Transformers

Language: Jupyter Notebook | License: MIT | Stargazers: 2 | Issues: 0

svo_probes

The SVO-Probes Dataset for Verb Understanding

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 28 | Issues: 0

pointingqa

Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"

Language: Python | Stargazers: 18 | Issues: 0