eric-xw

Xin (Eric) Wang's starred repositories

stable-diffusion

A latent text-to-image diffusion model

Language:Jupyter NotebookNOASSERTION66606 557 702

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonMIT64265 5330

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookApache-2.045474 302 658

stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language:PythonMIT37517 442 294

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonApache-2.023981 192 3765

Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Language:Jupyter NotebookApache-2.014126 116 373

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:Jupyter NotebookBSD-3-Clause9196 96 626

instruct-pix2pix

Language:PythonNOASSERTION6098 70 115

multinerf

A Code Release for Mip-NeRF 360, Ref-NeRF, and RawNeRF

Language:PythonApache-2.03564 48 147

awesome-tips

MIT3329 98 4

code_contests

Language:C++Apache-2.02038 39 35

OpenAGI

OpenAGI: When LLM Meets Domain Experts

Language:PythonMIT1825 26 16

Transformer-in-Vision

Recent Transformer-based CV and related works.

1305 87 5

aclpubcheck

Tools for checking ACL paper submissions

Language:PythonMIT551 5 45

massive

Tools and Modeling Code for the MASSIVE dataset

Language:PythonNOASSERTION537 17 24

teach

Official PyTorch implementation of the paper "TEACH: Temporal Action Compositions for 3D Humans"

Language:PythonNOASSERTION379 15 47

awesome-vision-language-navigation

A curated list for vision-and-language navigation. ACL 2022 paper "Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions"

MIT304 130

Structured-Diffusion-Guidance

Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis

Language:Jupyter NotebookNOASSERTION298 7 14

nsf-proposal-latex-samples

LaTeX samples for NSF Research.gov Proposal Submission. For more information about Research.gov Proposal Submission visit https://www.research.gov/research-web/content/aboutpsm Feedback syee@nsf.gov

Language:TeXAGPL-3.0203 15 1

habitat-matterport3d-dataset

This repository contains code to reproduce experimental results from our HM3D paper in NeurIPS 2021.

Language:PythonMIT132 10 5

PEViT

Official implementation of AAAI 2023 paper "Parameter-efficient Model Adaptation for Vision Transformers"

Language:PythonMIT94 6 8

VLMbench

NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation"

Language:PythonMIT75 4 12

Aerial-Vision-and-Dialog-Navigation

Codebase of ACL 2023 Findings "Aerial Vision-and-Dialog Navigation"

Language:Python32 2 13

CPL

Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"

Language:PythonMIT31 3 9

pytorch_ldast

A PyTorch implementation of LDAST

Language:Python25 4 5

teach_tatc

Language:Jupyter Notebook22 3 12

IACE-NLU

Official repo for "Imagination-Augmented Natural Language Understanding", NAACL 2022.

Language:PythonMIT17 4 4

FedVLN

[ECCV 2022] Official pytorch implementation of the paper "FedVLN: Privacy-preserving Federated Vision-and-Language Navigation"

Language:C++MIT12 30

ACLToolBox

Language:PythonMIT8 20

Diagnose_VLN

Code for "Diagnosing Vision-and-language Navigation: What Really Matters"

Language:PythonMIT7 1 1