Zhenhailong Wang (MikeWangWZHL)

MikeWangWZHL

Geek Repo

Company:UIUC

Location:Champaign, Illinois

Home Page:https://mikewangwzhl.github.io/

Twitter:@zhenhailongW

Github PK Tool:Github PK Tool

Zhenhailong Wang's starred repositories

qwqjsq

qwqjsq.com 的 最新地址

Stargazers:259Issues:0Issues:0

swift

ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)

Language:PythonLicense:Apache-2.0Stargazers:2622Issues:0Issues:0

anole

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Language:PythonStargazers:557Issues:0Issues:0
Language:PythonStargazers:75Issues:0Issues:0

Open-MAGVIT2

Open-MAGVIT2: Democratizing Autoregressive Visual Generation

Language:PythonLicense:Apache-2.0Stargazers:348Issues:0Issues:0

1d-tokenizer

This repo contains the code for our paper An Image is Worth 32 Tokens for Reconstruction and Generation

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:300Issues:0Issues:0
Stargazers:1062Issues:0Issues:0

Multimodal-AND-Large-Language-Models

Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.

Stargazers:483Issues:0Issues:0

VDLM

Repo for paper: Text-based Reasoning About Vector Graphics

Language:PythonStargazers:18Issues:0Issues:0
Language:JavaScriptStargazers:2176Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:3924Issues:0Issues:0

CHOCOLATE

Code and data for the ACL 2024 Findings paper "Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning"

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:23Issues:0Issues:0
Language:PythonStargazers:81Issues:0Issues:0

maze-dataset

maze datasets for investigating OOD behavior of ML systems

Language:Jupyter NotebookStargazers:14Issues:0Issues:0

torchgeo

TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data

Language:PythonLicense:MITStargazers:2361Issues:0Issues:0

Tracking-Anything-with-DEVA

[ICCV 2023] Tracking Anything with Decoupled Video Segmentation

Language:PythonLicense:NOASSERTIONStargazers:1179Issues:0Issues:0
Language:PythonStargazers:5Issues:0Issues:0

atp-video-language

Official repo for CVPR 2022 (Oral) paper: Revisiting the "Video" in Video-Language Understanding. Contains code for the Atemporal Probe (ATP).

Language:PythonLicense:MITStargazers:47Issues:0Issues:0

Solo-Performance-Prompting

Repo for paper "Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration"

Language:PythonStargazers:298Issues:0Issues:0

tree-of-thought-llm

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Language:PythonLicense:MITStargazers:4449Issues:0Issues:0
Language:Jupyter NotebookStargazers:1012Issues:0Issues:0

Paxion

Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" Neurips 23 Spotlight

Language:PythonStargazers:32Issues:0Issues:0

viper

Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:1639Issues:0Issues:0

yt-dlp

A feature-rich command-line audio/video downloader

Language:PythonLicense:UnlicenseStargazers:78130Issues:0Issues:0

InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Language:PythonLicense:Apache-2.0Stargazers:1162Issues:0Issues:0

dalle2-laion

Pretrained Dalle2 from laion

Language:PythonStargazers:500Issues:0Issues:0

imagen-pytorch

Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch

Language:PythonLicense:MITStargazers:7914Issues:0Issues:0

DALLE2-pytorch

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch

Language:PythonLicense:MITStargazers:10987Issues:0Issues:0

LookForTheChange

Code for Look for the Change paper published at CVPR 2022

Language:PythonLicense:MITStargazers:35Issues:0Issues:0

procthor-10k

The ProcTHOR-10K Houses Dataset

Language:PythonLicense:Apache-2.0Stargazers:70Issues:0Issues:0