Tingfeng Cao's starred repositories
ddpo-pytorch
DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support
ttt-lm-pytorch
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
Streamer-Sales
Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️
Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
InstantStyle
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
Draw-and-Understand
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
ParaDiffusion
Official code for 'Paragraph-to-Image Generation with Information-Enriched Diffusion Model'
BarrageGPT
弹幕AI问答互动,支持抖音、虎牙、哔哩哔哩平台。通过弹幕进行ChatGPT问答,然后使用OBS推流进行无人直播。Interactive AI Q&A with barrage, supporting platforms like Douyin, Huya, and Bilibili. Conduct Q&A sessions with ChatGPT through barrage and use OBS for unattended live streaming.
MediaCrawler
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫
SUR-adapter
ACM MM'23 (oral), SUR-adapter for pre-trained diffusion models can acquire the powerful semantic understanding and reasoning capabilities from large language models to build a high-quality textual semantic representation for text-to-image generation.
LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Paint-by-Example
Paint by Example: Exemplar-based Image Editing with Diffusion Models
clip-interrogator
Image to prompt with BLIP and CLIP
PhotoMaker
PhotoMaker [CVPR 2024]