Michael's repositories
Text2Video
ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with phonetic dictionary".
Speech2Video
Code for ACCV 2020 "Speech2Video Synthesis with 3D Skeleton Regularization and Expressive Body Poses"
Depth-Guided-Inpainting
Code for ECCV 2020 "DVI: Depth Guided Video Inpainting for Autonomous Driving"
dataset-api
Api for visualize sample data, evaluation of different tasks
TrafficPredict
Code for AAAI 2019 (Oral) "TrafficPredict: Trajectory Prediction for Heterogeneous Traffic-Agents"
Event-Radar
Event-Radar: Real-time Local Event Detection System for Geo-Tagged Tweet Streams
RealChar
🎙️🤖Create, Customize and Talk to your AI Character/Companion in Realtime(All in One Codebase!). Have a natural seamless conversation with AI everywhere(mobile, web and terminal) using LLM OpenAI GPT3.5/4, Anthropic Claude2, Chroma Vector DB, Whisper Speech2Text, ElevenLabs Text2Speech🎙️🤖
sibozhang-page.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
autodistill
Images to inference with no labeling (use foundation models to train supervised models)
dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
insightface
State-of-the-art 2D and 3D Face Analysis Project
mmaction2
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
mmdetection
OpenMMLab Detection Toolbox and Benchmark
mmpose
OpenMMLab Pose Estimation Toolbox and Benchmark.
mmsegmentation
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
ultralytics
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
yolov5
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite