Xupeng (Tony) Tong's repositories
multimedia-gpt
Empowering your ChatGPT with image, video, and audio inputs.
bark
🔊 Text-Prompted Generative Audio Model
cytev2
macOS background screen recorder/reader for easy history search
FRESCO
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
GeneFace
GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code
hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
HumanGaussian
GitHub repo for "HumanGaussian: Text-Driven 3D Human Generation with Gaussian Splatting"
langflow
⛓️ LangFlow is a UI for LangChain, designed with react-flow to provide an effortless way to experiment and prototype flows.
midjourney-proxy
Proxies MidJourney's Discord channel to enable API-style calls for AI image generation
next-saas-stripe-starter
An open-source SaaS Starter built using Next.js 14, Prisma, Neon, Auth.js v5, Resend, React Email, Shadcn/ui, Stripe and Server Actions.
novel
Notion-style WYSIWYG editor with AI-powered autocompletions
productgpt
An open-source AI product commercial photo generator
roomGPT
Upload a photo of your room to generate your dream room with AI.
roop
one-click deepfake (face swap)
sglang
SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.
taxyai-browser-extension
Automate your browser with GPT-4
video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
voltron-robotics
Voltron: Language-Driven Representation Learning for Robotics