josh-zhu's repositories
ArtiBoost
[CVPR 2022 Oral] ArtiBoost: Boosting Articulated 3D Hand-Object Pose Estimation via Online Exploration and Synthesis
chatgpt-on-wechat
A WeChat chatbot built with ChatGPT, implemented using the OpenAI API and the itchat library.
custom-diffusion
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion
DenseMutualAttention
[WACV2023] Interacting Hand-Object Pose Estimation via Dense Mutual Attention
dgrasp
Official code release for CVPR 2022 paper D-Grasp: Physically Plausible Dynamic Grasp Synthesis for Hand-Object Interactions
douyin_crawl
Batch crawler for Douyin videos
EasySynth
Unreal Engine plugin for easy creation of synthetic image datasets
EasyVC
A comprehensive comparison of voice conversion techniques
fish-speech
Brand new TTS solution
GaussianAvatars
[CVPR 2024 (Highlight)] The official repo for "GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians"
hyperreel
Code release for HyperReel: High-Fidelity 6-DoF Video with Ray-Conditioned Sampling
langchain
⚡ Building applications with LLMs through composability ⚡
langflow
⛓️ LangFlow is a UI for LangChain, designed with react-flow to provide an effortless way to experiment and prototype flows.
LivePortrait
Make one portrait alive!
MidJourney-Wrapper
MidJourney wrapper in Discord.
pixano-app
Pixano App is a web-based smart-annotation tool for computer vision applications.
react-nice-avatar
React library for generating avatars
so-vits-svc
SoftVC VITS Singing Voice Conversion
talking-face-arxiv-daily
🎓 Update Talking-Face Research Papers Daily, Now Integrated with LLM Analysis.
TextBox
TextBox 2.0 is a text generation library with pre-trained language models
torch-ngp
A pytorch CUDA extension implementation of instant-ngp (sdf and nerf), with a GUI.
TTS-arxiv-daily
Automatically update text-to-speech (TTS) papers using GitHub Actions (updates every 12 hours)
v4l2loopback
v4l2-loopback device
vall-e
An unofficial PyTorch implementation of the audio LM VALL-E, WIP
vall-e-1
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Can be trained on a single GPU!
visual-chatgpt
Official repo for the paper: Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models
whisperX
WhisperX: Automatic Speech Recognition with Accurate Word-level Timestamps.