Xingrui Wang (XingruiWang)

XingruiWang

Geek Repo

Company:Johns Hopkins University

Location:Baltimore, MD

Home Page:https://xingruiwang.github.io/

Twitter:@XingruiWang

Github PK Tool:Github PK Tool

Xingrui Wang's starred repositories

stable-diffusion

A latent text-to-image diffusion model

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:66415Issues:560Issues:701

HowToCook

程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).

Language:DockerfileLicense:UnlicenseStargazers:62858Issues:390Issues:641

docusaurus

Easy to maintain open source documentation websites.

Language:TypeScriptLicense:MITStargazers:53953Issues:408Issues:3019

slidev

Presentation Slides for Developers

Language:TypeScriptLicense:MITStargazers:31849Issues:132Issues:963

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language:Jupyter NotebookLicense:MITStargazers:23334Issues:319Issues:385

deepmind-research

This repository contains implementations and illustrative code to accompany DeepMind publications

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:12930Issues:329Issues:312

stable-dreamfusion

Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.

Language:PythonLicense:Apache-2.0Stargazers:7972Issues:124Issues:298

roughViz

Reusable JavaScript library for creating sketchy/hand-drawn styled charts in the browser.

Language:JavaScriptLicense:MITStargazers:6664Issues:54Issues:33

DiffusionDet

[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)

Language:PythonLicense:NOASSERTIONStargazers:2022Issues:17Issues:112

CVinW_Readings

A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''

iris

Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.

Language:PythonLicense:GPL-3.0Stargazers:770Issues:22Issues:22

tdw

ThreeDWorld simulation environment

Language:PythonLicense:BSD-2-ClauseStargazers:476Issues:19Issues:141

vstar

PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"

Language:PythonLicense:MITStargazers:465Issues:10Issues:14

pixel-profile

Generate a pixel art style profile card from your GitHub data! ✨

Language:TypeScriptLicense:MITStargazers:408Issues:1Issues:3

pytorch_kinematics

Robot kinematics implemented in pytorch

Language:PythonLicense:MITStargazers:334Issues:20Issues:25

PyTorch-Simple-MaskRCNN

A PyTorch implementation of simple Mask R-CNN

Language:PythonLicense:MITStargazers:290Issues:3Issues:26

lcp-physics

A differentiable LCP physics engine in PyTorch.

Language:PythonLicense:Apache-2.0Stargazers:289Issues:15Issues:6

ns-vqa

Neural-symbolic visual question answering

diffusion_distiller

🚀 PyTorch Implementation of "Progressive Distillation for Fast Sampling of Diffusion Models(v-diffusion)"

Language:PythonLicense:MITStargazers:194Issues:6Issues:7

video-question-answering

Video Question Answering via Gradually Refined Attention over Appearance and Motion

Language:PythonLicense:MITStargazers:142Issues:4Issues:0

discoscene

CVPR 2023 Highlight: DiscoScene

Language:PythonLicense:NOASSERTIONStargazers:137Issues:25Issues:5

stannum

Fusing Taichi into PyTorch

Language:PythonLicense:MITStargazers:129Issues:6Issues:13

Humanoid-Vision-Engine

code for [ECCV 2022 paper] Contributions of Shape, Texture, and Color in Visual Recognition

Language:PythonLicense:MITStargazers:97Issues:3Issues:8

EgoVLPv2

Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023]

Language:PythonLicense:MITStargazers:79Issues:5Issues:10

TUVF

[ICLR'24] This repository is the implementation of "TUVF: Learning Generalizable Texture UV Radiance Fields"

Language:PythonStargazers:57Issues:9Issues:0

VRDP

[NeurIPS 2021] Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language

Language:PythonLicense:MITStargazers:45Issues:2Issues:3

PhaseCam3D

Code release for PhaseCam3D at ICCP 2019 https://yichengwu.github.io/PhaseCam3D/

NeMo

Neural mesh models for 3D reasoning.

DSCI553

Final letter grade: A

Language:PythonStargazers:5Issues:0Issues:0

SimVQA

[CVPR 2022] SimVQA: Exploring Simulated Environments for Visual Question Answering

Language:PythonLicense:MITStargazers:4Issues:0Issues:0