Zhenhailong Wang (MikeWangWZHL)

MikeWangWZHL

Geek Repo

Company:UIUC

Location:Champaign, Illinois

Home Page:https://mikewangwzhl.github.io/

Twitter:@zhenhailongW

Github PK Tool:Github PK Tool

Zhenhailong Wang's repositories

Solo-Performance-Prompting

Repo for paper "Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration"

EEG-To-Text

code for AAAI2022 paper "Open Vocabulary Electroencephalography-To-Text Decoding and Zero-shot Sentiment Classification"

VidIL

Pytorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners

Language:PythonLicense:MITStargazers:110Issues:5Issues:11

Paxion

Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" Neurips 23 Spotlight

Language:PythonStargazers:31Issues:1Issues:0

VDLM

Repo for paper: Text-based Reasoning About Vector Graphics

Language:PythonStargazers:16Issues:1Issues:0

Zemi

Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings

Multitask-Finetuning_CLIP

Code for paper "Rethinking Task Sampling for Few-shot Vision-Language Transfer Learning" COLING 2022 workshop

Language:PythonStargazers:3Issues:3Issues:0

Wikinews_Pipeline

Get news from Wikipedia page's reference section

Language:PythonStargazers:3Issues:0Issues:0
Language:PythonStargazers:1Issues:2Issues:0

MikeWangWZHL.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language:JavaScriptLicense:MITStargazers:1Issues:1Issues:0

1d-tokenizer

This repo contains the code for our paper An Image is Worth 32 Tokens for Reconstruction and Generation

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

Cutie

[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

diffusers

šŸ¤— Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

Grounded-Segment-Anything

Grounded-SAM: Marrying Grounding-DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0

LaVIT

LaVIT: Empower the Large Language Model to Understand and Generate Visual Content

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0

LLaVA

[NeurIPS 2023 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards multimodal GPT-4 level capabilities.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

MathVista

MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts

Language:Jupyter NotebookLicense:CC-BY-SA-4.0Stargazers:0Issues:0Issues:0

maze-dataset

maze datasets for investigating OOD behavior of ML systems

Language:Jupyter NotebookStargazers:0Issues:0Issues:0
License:BSD-3-ClauseStargazers:0Issues:0Issues:0

parti-pytorch

Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch

License:MITStargazers:0Issues:0Issues:0

rq-vae-transformer

The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0

sam-hq

Segment Anything in High Quality [NeurIPS 2023]

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

self-refine

LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.

License:Apache-2.0Stargazers:0Issues:0Issues:0

singularity

Official PyTorch code for Singularity model in the paper "Revealing Single Frame Bias for Video-and-Language Learning"

License:MITStargazers:0Issues:0Issues:0

Tracking-Anything-with-DEVA

Forked from paper [ICCV 2023] Tracking Anything with Decoupled Video Segmentation

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

VAR

[GPT beats diffusionšŸ”„] [scaling laws in visual generationšŸ“ˆ] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Video-ChatGPT

"Video-ChatGPT" is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

License:CC-BY-4.0Stargazers:0Issues:0Issues:0

viper

Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0