josh-zhu's repositories
ArtiBoost
[CVPR 2022 Oral] ArtiBoost: Boosting Articulated 3D Hand-Object Pose Estimation via Online Exploration and Synthesis
chatgpt-on-wechat
A WeChat chatbot built with ChatGPT, based on the OpenAI API and the itchat library.
custom-diffusion
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion
DenseMutualAttention
[WACV2023] Interacting Hand-Object Pose Estimation via Dense Mutual Attention
dgrasp
Official code release for CVPR 2022 paper D-Grasp: Physically Plausible Dynamic Grasp Synthesis for Hand-Object Interactions
Ditto
Code for Ditto: Building Digital Twins of Articulated Objects from Interaction
douyin_crawl
Batch crawler for Douyin videos
EasySynth
Unreal Engine plugin for easy creation of synthetic image datasets
EasyVC
A comprehensive comparison of voice conversion techniques
HOIG
[NeurIPS 2022 Spotlight] Hand-Object Interaction Image Generation
hyperreel
Code release for HyperReel: High-Fidelity 6-DoF Video with Ray-Conditioned Sampling
langchain
⚡ Building applications with LLMs through composability ⚡
langflow
⛓️ LangFlow is a UI for LangChain, designed with react-flow to provide an effortless way to experiment and prototype flows.
MidJourney-Wrapper
MidJourney wrapper in Discord.
NeRF-Art
NeRF-Art: Text-Driven Neural Radiance Fields Stylization
pixano-app
Pixano App is a web-based smart-annotation tool for computer vision applications.
react-nice-avatar
React library for generating avatars
SemanticGuidedHumanMatting
Robust Human Matting via Semantic Guidance, ACCV 2022.
so-vits-svc
SoftVC VITS Singing Voice Conversion
TextBox
TextBox 2.0 is a text generation library with pre-trained language models
torch-ngp
A PyTorch CUDA extension implementation of instant-ngp (SDF and NeRF), with a GUI.
v4l2loopback
v4l2-loopback device
vall-e
An unofficial PyTorch implementation of the audio LM VALL-E, WIP
vall-e-1
PyTorch implementation of VALL-E (zero-shot text-to-speech) that can be trained on a single GPU.
visual-chatgpt
Official repo for the paper: Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models
whisperX
WhisperX: Automatic Speech Recognition with Accurate Word-level Timestamps.