sibozhang

Michael's repositories

Text2Video

ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with phonetic dictionary".

Language:Python414 12 22

Speech2Video

Code for ACCV 2020 "Speech2Video Synthesis with 3D Skeleton Regularization and Expressive Body Poses"

98 9 10

Depth-Guided-Inpainting

Code for ECCV 2020 "DVI: Depth Guided Video Inpainting for Autonomous Driving"

Language:C++64 2 3

dataset-api

Api for visualize sample data, evaluation of different tasks

Language:Jupyter NotebookApache-2.036 10

vid2vid

A modified version of vid2vid for Speech2Video, Text2Video Paper

Language:Python35 4 8

TrafficPredict

Code for AAAI 2019 (Oral) "TrafficPredict: Trajectory Prediction for Heterogeneous Traffic-Agents"

Language:Python7 1 1

Deep-Learning-Based-Food-Recognition

Language:Python2 30

Event-Radar

Event-Radar: Real-time Local Event Detection System for Geo-Tagged Tweet Streams

Language:Java2 3 1

🎙️🤖Create, Customize and Talk to your AI Character/Companion in Realtime(All in One Codebase!). Have a natural seamless conversation with AI everywhere(mobile, web and terminal) using LLM OpenAI GPT3.5/4, Anthropic Claude2, Chroma Vector DB, Whisper Speech2Text, ElevenLabs Text2Speech🎙️🤖

Language:SwiftMIT100