Xin (Eric) Wang's starred repositories
Aerial-Vision-and-Dialog-Navigation
Codebase of ACL 2023 Findings "Aerial Vision-and-Dialog Navigation"
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Structured-Diffusion-Guidance
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis
Transformer-in-Vision
Recent Transformer-based CV and related works.
stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
habitat-matterport3d-dataset
This repository contains code to reproduce experimental results from our HM3D paper in NeurIPS 2021.
stable-diffusion
A latent text-to-image diffusion model
Diagnose_VLN
Code for "Diagnosing Vision-and-language Navigation: What Really Matters"
pytorch_ldast
A PyTorch implementation of LDAST
nsf-proposal-latex-samples
LaTeX samples for NSF Research.gov Proposal Submission. For more information about Research.gov Proposal Submission visit https://www.research.gov/research-web/content/aboutpsm Feedback syee@nsf.gov
awesome-vision-language-navigation
A curated list for vision-and-language navigation. ACL 2022 paper "Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions"
aclpubcheck
Tools for checking ACL paper submissions