Tingfeng Cao (NicholasCao)

NicholasCao

Geek Repo

Company:SCUT

Location:Guangzhou

Github PK Tool:Github PK Tool


Organizations
goa-go

Tingfeng Cao's starred repositories

ddpo-pytorch

DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support

Language:PythonLicense:MITStargazers:374Issues:0Issues:0

d3po

[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"

Language:PythonLicense:MITStargazers:145Issues:0Issues:0

ttt-lm-pytorch

Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Language:PythonLicense:MITStargazers:826Issues:0Issues:0

Streamer-Sales

Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️

Language:PythonLicense:Apache-2.0Stargazers:1988Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:196Issues:0Issues:0

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonLicense:MITStargazers:11040Issues:0Issues:0

InstantStyle

InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥

Language:Jupyter NotebookStargazers:1501Issues:0Issues:0

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonLicense:MITStargazers:3887Issues:0Issues:0

Draw-and-Understand

Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want

Language:PythonLicense:Apache-2.0Stargazers:47Issues:0Issues:0

InstantID

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Language:PythonLicense:Apache-2.0Stargazers:10615Issues:0Issues:0

ParaDiffusion

Official code for 'Paragraph-to-Image Generation with Information-Enriched Diffusion Model'

Language:PythonStargazers:91Issues:0Issues:0

Long-CLIP

[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"

Language:PythonLicense:Apache-2.0Stargazers:496Issues:0Issues:0

BarrageGPT

弹幕AI问答互动,支持抖音、虎牙、哔哩哔哩平台。通过弹幕进行ChatGPT问答,然后使用OBS推流进行无人直播。Interactive AI Q&A with barrage, supporting platforms like Douyin, Huya, and Bilibili. Conduct Q&A sessions with ChatGPT through barrage and use OBS for unattended live streaming.

Language:PythonStargazers:170Issues:0Issues:0

MediaCrawler

小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫

Language:PythonLicense:NOASSERTIONStargazers:15494Issues:0Issues:0

AnyText

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

Language:PythonLicense:Apache-2.0Stargazers:4079Issues:0Issues:0

SUR-adapter

ACM MM'23 (oral), SUR-adapter for pre-trained diffusion models can acquire the powerful semantic understanding and reasoning capabilities from large language models to build a high-quality textual semantic representation for text-to-image generation.

Language:PythonLicense:MITStargazers:109Issues:0Issues:0

LayerDiffuse

Transparent Image Layer Diffusion using Latent Transparency

License:Apache-2.0Stargazers:1933Issues:0Issues:0

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language:PythonLicense:AGPL-3.0Stargazers:2592Issues:0Issues:0

Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Language:PythonLicense:NOASSERTIONStargazers:4449Issues:0Issues:0

EMO

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

Stargazers:7270Issues:0Issues:0

Paint-by-Example

Paint by Example: Exemplar-based Image Editing with Diffusion Models

Language:PythonLicense:NOASSERTIONStargazers:1036Issues:0Issues:0
Language:PythonLicense:MITStargazers:11Issues:0Issues:0

Vlogger

[CVPR2024] Make Your Dream A Vlog

Language:PythonLicense:Apache-2.0Stargazers:394Issues:0Issues:0

X-Adapter

[CVPR 2024] X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model

Language:PythonLicense:Apache-2.0Stargazers:704Issues:0Issues:0

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonLicense:NOASSERTIONStargazers:5785Issues:0Issues:0

clip-interrogator

Image to prompt with BLIP and CLIP

Language:PythonLicense:MITStargazers:2601Issues:0Issues:0

GradCache

Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint

Language:PythonLicense:Apache-2.0Stargazers:335Issues:0Issues:0

PhotoMaker

PhotoMaker [CVPR 2024]

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:8987Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:8951Issues:0Issues:0