Vishaal Udandarao (vishaal27)

vishaal27

Geek Repo

Company:University of Tübingen | University of Cambridge

Location:Tübingen, Germany

Home Page:https://vishaal27.github.io/

Twitter:@vishaal_urao

Github PK Tool:Github PK Tool

Vishaal Udandarao's starred repositories

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:25706Issues:211Issues:229

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:11404Issues:91Issues:315

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonLicense:MITStargazers:6217Issues:35Issues:1020

GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language:PythonLicense:Apache-2.0Stargazers:6074Issues:37Issues:292

mteb

MTEB: Massive Text Embedding Benchmark

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1747Issues:11Issues:378

llm2vec

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Language:PythonLicense:MITStargazers:1076Issues:18Issues:105

LLaVA-pp

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

arena-hard-auto

Arena-Hard-Auto: An automatic LLM benchmark.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:381Issues:5Issues:24

better_profanity

Blazingly fast cleaning swear words (and their leetspeak) in strings

Language:PythonLicense:MITStargazers:202Issues:6Issues:34
Language:PythonLicense:Apache-2.0Stargazers:158Issues:14Issues:4

gogetcrawl

Extract web archive data using Wayback Machine and Common Crawl

Language:GoLicense:MITStargazers:141Issues:5Issues:1

ID-Aligner

Official implement of ID-Aligner

SemDeDup

Code for "SemDeDup", a simple method for identifying and removing semantic duplicates from a dataset (data pairs which are semantically similar, but not exactly identical).

Language:PythonLicense:NOASSERTIONStargazers:97Issues:3Issues:9

improved-t5

Experiments for efforts to train a new and improved t5

tofu

Landing Page for TOFU

Language:PythonLicense:MITStargazers:74Issues:4Issues:34

prepacking

The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models"

Language:Jupyter NotebookStargazers:55Issues:2Issues:1

modelcomponents

Decomposing and Editing Predictions by Modeling Model Computation

Language:Jupyter NotebookLicense:MITStargazers:48Issues:3Issues:1

VisualWebBench

Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"

Contamination_Detector

Lightweight tool to identify Data Contamination in LLMs evaluation

imgsys-public

imgsys backend

foundational_fsod

This repository contains the implementation for the paper "Revisiting Few Shot Object Detection with Vision-Language Models"

Language:PythonLicense:Apache-2.0Stargazers:14Issues:3Issues:1

whatsinthebox

analysis of public NLP corpora

Language:Jupyter NotebookLicense:MITStargazers:12Issues:2Issues:0
Language:PythonLicense:GPL-3.0Stargazers:12Issues:1Issues:1

generating-illustrated-instructions-reproduction

Code for reproducing the paper "Generating Illustrated Instructions."

sparo-clip

Separate-head attention read-out

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:4Issues:0Issues:0
Stargazers:3Issues:0Issues:0

probing-resamplers

Code & data for our NAACL 2024 paper

License:MITStargazers:2Issues:2Issues:0
Language:JavaScriptStargazers:2Issues:0Issues:0