Wayne2Wang

Zilin Wang's starred repositories

ai-for-grant-writing

A curated list of resources for using LLMs to develop more competitive grant applications.

Language:PythonCC-BY-4.0201000

rococo

Robust Benchmark MS-COCO to Stress-test Robustness of Image-Text Matching Models

Language:PythonMIT700

FiT3D

[ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning

Language:Jupyter NotebookMIT20500

dust3r

DUSt3R: Geometric 3D Vision Made Easy

Language:PythonNOASSERTION511700

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookApache-2.01135800

SpLiCE

Sparse Linear Concept Embeddings

Language:PythonApache-2.05400

Transformer-Explainability

[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.

Language:Jupyter NotebookMIT177300

clip_text_span

official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition"

Language:Jupyter NotebookMIT15100

gufi-archive

Public Repo of documentation and scripts how to use GUFI to generate reports to identify data suitable for archive

Language:Shell1000

UnSAM

[NeurIPS 2024] Code release for "Segment Anything without Supervision"

Language:Jupyter Notebook36000

RPO

Official Implementation of "Read-only Prompt Optimization for Vision-Language Few-shot Learning", ICCV 2023

Language:PythonMIT4900

Awesome_Prompting_Papers_in_Computer_Vision

A curated list of prompt-based paper in computer vision and vision-language learning.

89200

clevr4

Starter notebook and utilities for the Clevr-4 dataset

Language:Jupyter NotebookCC-BY-4.01600

ICTC

This is a public repository for Image Clustering Conditioned on Text Criteria (IC|TC)

Language:PythonApache-2.07600

hands23_data

Language:Python800

projUNN

Fast training of unitary deep network layers from low-rank updates

Language:PythonMIT2800

guided-cluster-aggregation

Language:PythonMIT600

images-that-sound

Official repo for Images that sound: a special spectrogram that can be seen as images and played as sound generated by diffusions

Language:PythonMIT20800

RADIO

Official repository for "AM-RADIO: Reduce All Domains Into One"

Language:PythonNOASSERTION63700

FineR

[ICLR'24] Democratizing Fine-grained Visual Recognition with Large Language Models

Language:PythonApache-2.03500

vic

Code implementation of our NeurIPS 2023 paper: Vocabulary-free Image Classification

Language:PythonMIT10000

LLM-in-Vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

83200

U2Seg

[CVPR 2024] Code release for "Unsupervised Universal Image Segmentation"

Language:PythonApache-2.016700

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

1203300

probe3d

[CVPR 2024] Probing the 3D Awareness of Visual Foundation Models

Language:PythonMIT25200

annotated_deep_learning_paper_implementations

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language:PythonMIT5451600