Zilin Wang (Wayne2Wang)

Wayne2Wang

Geek Repo

Company:University of Michigan

Location:Ann Arbor, MI

Twitter:@zilinwan

Github PK Tool:Github PK Tool

Zilin Wang's starred repositories

ai-for-grant-writing

A curated list of resources for using LLMs to develop more competitive grant applications.

Language:PythonLicense:CC-BY-4.0Stargazers:2010Issues:0Issues:0

rococo

Robust Benchmark MS-COCO to Stress-test Robustness of Image-Text Matching Models

Language:PythonLicense:MITStargazers:7Issues:0Issues:0

FiT3D

[ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning

Language:Jupyter NotebookLicense:MITStargazers:205Issues:0Issues:0

dust3r

DUSt3R: Geometric 3D Vision Made Easy

Language:PythonLicense:NOASSERTIONStargazers:5117Issues:0Issues:0

sam2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:11358Issues:0Issues:0

SpLiCE

Sparse Linear Concept Embeddings

Language:PythonLicense:Apache-2.0Stargazers:54Issues:0Issues:0

Transformer-Explainability

[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.

Language:Jupyter NotebookLicense:MITStargazers:1773Issues:0Issues:0

clip_text_span

official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition"

Language:Jupyter NotebookLicense:MITStargazers:151Issues:0Issues:0

gufi-archive

Public Repo of documentation and scripts how to use GUFI to generate reports to identify data suitable for archive

Language:ShellStargazers:10Issues:0Issues:0

UnSAM

[NeurIPS 2024] Code release for "Segment Anything without Supervision"

Language:Jupyter NotebookStargazers:360Issues:0Issues:0

RPO

Official Implementation of "Read-only Prompt Optimization for Vision-Language Few-shot Learning", ICCV 2023

Language:PythonLicense:MITStargazers:49Issues:0Issues:0

Awesome_Prompting_Papers_in_Computer_Vision

A curated list of prompt-based paper in computer vision and vision-language learning.

Stargazers:892Issues:0Issues:0

clevr4

Starter notebook and utilities for the Clevr-4 dataset

Language:Jupyter NotebookLicense:CC-BY-4.0Stargazers:16Issues:0Issues:0

ICTC

This is a public repository for Image Clustering Conditioned on Text Criteria (IC|TC)

Language:PythonLicense:Apache-2.0Stargazers:76Issues:0Issues:0
Language:PythonStargazers:8Issues:0Issues:0

projUNN

Fast training of unitary deep network layers from low-rank updates

Language:PythonLicense:MITStargazers:28Issues:0Issues:0
Language:PythonLicense:MITStargazers:6Issues:0Issues:0

images-that-sound

Official repo for Images that sound: a special spectrogram that can be seen as images and played as sound generated by diffusions

Language:PythonLicense:MITStargazers:208Issues:0Issues:0

RADIO

Official repository for "AM-RADIO: Reduce All Domains Into One"

Language:PythonLicense:NOASSERTIONStargazers:637Issues:0Issues:0

FineR

[ICLR'24] Democratizing Fine-grained Visual Recognition with Large Language Models

Language:PythonLicense:Apache-2.0Stargazers:35Issues:0Issues:0

vic

Code implementation of our NeurIPS 2023 paper: Vocabulary-free Image Classification

Language:PythonLicense:MITStargazers:100Issues:0Issues:0

LLM-in-Vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

Stargazers:832Issues:0Issues:0

U2Seg

[CVPR 2024] Code release for "Unsupervised Universal Image Segmentation"

Language:PythonLicense:Apache-2.0Stargazers:167Issues:0Issues:0

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

Stargazers:12033Issues:0Issues:0

probe3d

[CVPR 2024] Probing the 3D Awareness of Visual Foundation Models

Language:PythonLicense:MITStargazers:252Issues:0Issues:0

annotated_deep_learning_paper_implementations

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language:PythonLicense:MITStargazers:54516Issues:0Issues:0

igligen

Improved Implementation for Training GLIGEN: Open-Set Grounded Text-to-Image Generation

Language:PythonStargazers:34Issues:0Issues:0

CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Language:PythonLicense:Apache-2.0Stargazers:5921Issues:0Issues:0

b2t

Bias-to-Text: Debiasing Unknown Visual Biases through Language Interpretation

Language:PythonStargazers:25Issues:0Issues:0

generalized-category-discovery

Code for our CVPR 2022 paper 'Generalized Category Discovery'. Project page: https://www.robots.ox.ac.uk/~vgg/research/gcd/

Language:PythonLicense:MITStargazers:197Issues:0Issues:0