Xingrui Wang (XingruiWang)

XingruiWang

Geek Repo

Company:Johns Hopkins University

Location:Baltimore, MD

Home Page:https://xingruiwang.github.io/

Twitter:@XingruiWang

Github PK Tool:Github PK Tool

Xingrui Wang's starred repositories

stable-diffusion

A latent text-to-image diffusion model

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:66878Issues:555Issues:705

HowToCook

程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).

Language:DockerfileLicense:UnlicenseStargazers:65387Issues:396Issues:657

docusaurus

Easy to maintain open source documentation websites.

Language:TypeScriptLicense:MITStargazers:54535Issues:408Issues:3047

slidev

Presentation Slides for Developers

Language:TypeScriptLicense:MITStargazers:32173Issues:134Issues:994

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language:Jupyter NotebookLicense:MITStargazers:23881Issues:317Issues:388

sd-webui-controlnet

WebUI extension for ControlNet

Language:PythonLicense:GPL-3.0Stargazers:16576Issues:149Issues:1472

deepmind-research

This repository contains implementations and illustrative code to accompany DeepMind publications

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:12984Issues:328Issues:313

stable-dreamfusion

Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.

Language:PythonLicense:Apache-2.0Stargazers:8052Issues:123Issues:298

DiffusionDet

[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)

Language:PythonLicense:NOASSERTIONStargazers:2041Issues:17Issues:112

CVinW_Readings

A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''

iris

Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.

Language:PythonLicense:GPL-3.0Stargazers:782Issues:22Issues:22

tdw

ThreeDWorld simulation environment

Language:PythonLicense:BSD-2-ClauseStargazers:485Issues:19Issues:141

vstar

PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"

Language:PythonLicense:MITStargazers:477Issues:11Issues:16

pixel-profile

Generate a pixel art style profile card from your GitHub data! ✨

Language:TypeScriptLicense:MITStargazers:410Issues:1Issues:3

pytorch_kinematics

Robot kinematics implemented in pytorch

Language:PythonLicense:MITStargazers:369Issues:19Issues:26

PyTorch-Simple-MaskRCNN

A PyTorch implementation of simple Mask R-CNN

Language:PythonLicense:MITStargazers:294Issues:3Issues:26

lcp-physics

A differentiable LCP physics engine in PyTorch.

Language:PythonLicense:Apache-2.0Stargazers:290Issues:14Issues:6

ns-vqa

Neural-symbolic visual question answering

diffusion_distiller

🚀 PyTorch Implementation of "Progressive Distillation for Fast Sampling of Diffusion Models(v-diffusion)"

Language:PythonLicense:MITStargazers:199Issues:6Issues:7

video-question-answering

Video Question Answering via Gradually Refined Attention over Appearance and Motion

Language:PythonLicense:MITStargazers:142Issues:4Issues:0

discoscene

CVPR 2023 Highlight: DiscoScene

Language:PythonLicense:NOASSERTIONStargazers:138Issues:25Issues:5

stannum

Fusing Taichi into PyTorch

Language:PythonLicense:MITStargazers:131Issues:6Issues:13

Humanoid-Vision-Engine

code for [ECCV 2022 paper] Contributions of Shape, Texture, and Color in Visual Recognition

Language:PythonLicense:MITStargazers:98Issues:3Issues:8

EgoVLPv2

Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023]

Language:PythonLicense:MITStargazers:83Issues:5Issues:10

TUVF

[ICLR'24] This repository is the implementation of "TUVF: Learning Generalizable Texture UV Radiance Fields"

Language:PythonStargazers:57Issues:9Issues:0

VRDP

[NeurIPS 2021] Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language

Language:PythonLicense:MITStargazers:45Issues:2Issues:3

PhaseCam3D

Code release for PhaseCam3D at ICCP 2019 https://yichengwu.github.io/PhaseCam3D/

NeMo

Neural mesh models for 3D reasoning.

DSCI553

Final letter grade: A

Language:PythonStargazers:5Issues:2Issues:0

SimVQA

[CVPR 2022] SimVQA: Exploring Simulated Environments for Visual Question Answering

Language:PythonLicense:MITStargazers:5Issues:1Issues:2