Xin (Eric) Wang (eric-xw)

eric-xw

Geek Repo

Company:University of California, Santa Cruz

Github PK Tool:Github PK Tool


Organizations
eric-ai-lab

Xin (Eric) Wang's starred repositories

stable-diffusion

A latent text-to-image diffusion model

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:66606Issues:557Issues:702

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:64265Issues:533Issues:0

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:45474Issues:302Issues:658

stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language:PythonLicense:MITStargazers:37517Issues:442Issues:294

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonLicense:Apache-2.0Stargazers:23981Issues:192Issues:3765

Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:14126Issues:116Issues:373

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:9196Issues:96Issues:626
Language:PythonLicense:NOASSERTIONStargazers:6098Issues:70Issues:115

multinerf

A Code Release for Mip-NeRF 360, Ref-NeRF, and RawNeRF

Language:PythonLicense:Apache-2.0Stargazers:3564Issues:48Issues:147

OpenAGI

OpenAGI: When LLM Meets Domain Experts

Language:PythonLicense:MITStargazers:1825Issues:26Issues:16

Transformer-in-Vision

Recent Transformer-based CV and related works.

aclpubcheck

Tools for checking ACL paper submissions

Language:PythonLicense:MITStargazers:551Issues:5Issues:45

massive

Tools and Modeling Code for the MASSIVE dataset

Language:PythonLicense:NOASSERTIONStargazers:537Issues:17Issues:24

teach

Official PyTorch implementation of the paper "TEACH: Temporal Action Compositions for 3D Humans"

Language:PythonLicense:NOASSERTIONStargazers:379Issues:15Issues:47

awesome-vision-language-navigation

A curated list for vision-and-language navigation. ACL 2022 paper "Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions"

License:MITStargazers:304Issues:13Issues:0

Structured-Diffusion-Guidance

Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:298Issues:7Issues:14

nsf-proposal-latex-samples

LaTeX samples for NSF Research.gov Proposal Submission. For more information about Research.gov Proposal Submission visit https://www.research.gov/research-web/content/aboutpsm Feedback syee@nsf.gov

Language:TeXLicense:AGPL-3.0Stargazers:203Issues:15Issues:1

habitat-matterport3d-dataset

This repository contains code to reproduce experimental results from our HM3D paper in NeurIPS 2021.

Language:PythonLicense:MITStargazers:132Issues:10Issues:5

PEViT

Official implementation of AAAI 2023 paper "Parameter-efficient Model Adaptation for Vision Transformers"

Language:PythonLicense:MITStargazers:94Issues:6Issues:8

VLMbench

NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation"

Language:PythonLicense:MITStargazers:75Issues:4Issues:12

Aerial-Vision-and-Dialog-Navigation

Codebase of ACL 2023 Findings "Aerial Vision-and-Dialog Navigation"

CPL

Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"

Language:PythonLicense:MITStargazers:31Issues:3Issues:9

pytorch_ldast

A PyTorch implementation of LDAST

IACE-NLU

Official repo for "Imagination-Augmented Natural Language Understanding", NAACL 2022.

Language:PythonLicense:MITStargazers:17Issues:4Issues:4

FedVLN

[ECCV 2022] Official pytorch implementation of the paper "FedVLN: Privacy-preserving Federated Vision-and-Language Navigation"

Language:C++License:MITStargazers:12Issues:3Issues:0
Language:PythonLicense:MITStargazers:8Issues:2Issues:0

Diagnose_VLN

Code for "Diagnosing Vision-and-language Navigation: What Really Matters"

Language:PythonLicense:MITStargazers:7Issues:1Issues:1