PixelArtAI's repositories
Video-Frame-Interpolation-Summary
Video Frame Interpolation Summary and Infer
Baidu-netdisk-AI-Image-processing-Challenge-handwriting
手写文字擦除第1名方案,水印智能消除赛第1名
NTIRE2024-Blind-Compressed-Image-Enhancement-Second
Blind JPEG Artifacts Removal via Enhanced Swin-Conv-UNet
AI-Vtuber
AI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain(本地/llm)/chatglm/text-generation-webui/闻达/千问/kimi】 驱动的虚拟主播【Live2D/UE/xuniren】,可以在 【Bilibili/抖音/快手/微信视频号/斗鱼/YouTube/twitch/TikTok】 直播中与观众实时互动 或 直接在本地进行聊天。它使用TTS技术【edge-tts/VITS/elevenlabs/bark/bert-vits2/睿声】生成回答并可以选择【so-vits-svc/DDSP-SVC】变声;指令协同SD画图。
al-folio-web
A beautiful, simple, clean, and responsive Jekyll theme for academics
BSCV-Dataset
Official repository for the paper titled "Bitstream-corrupted Video Recovery: A Novel Benchmark Dataset and Method", accepted by NeurIPS 2023 Dataset and Benchmark Track
cali.so
Cali 的个人官网开源项目
CloserLookBlindSR
Paper accepted by CVPR Workshop, NTIRE 2022
daclip-uir
[ICLR 2024] Controlling Vision-Language Models for Universal Image Restoration. 5th place in the NTIRE 2024 Restore Any Image Model in the Wild Challenge.
data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大语言模型提供更高质量、更丰富、更易”消化“的数据!
depth-fm
DepthFM: Fast Monocular Depth Estimation with Flow Matching
dreamtalk
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
FunClip
一款基于FunASR高准确率开源语音识别模型的智能视频剪辑工具 / A video clipping tool based on FunASR open source model and Gradio.
GeneFacePlusPlus
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
IOPaint
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
MiraData
Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"
python-ffmpegio
Python package to read/write media files with FFmpeg
Real3DPortrait
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code
realsr
neosr is a framework for training real-world single-image super-resolution networks.
sd-webui-fastblend
Make videos smooth!
SVD_Xtend
Stable Video Diffusion Training Code and Extensions.
SyncTalk
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
VideoPipe
跨平台的视频结构化(视频分析)框架,觉得有帮助的请给个星星:)。
ZoomGS
[arxiv2024] Dual-Camera Smoooth Zoom on Mobile Phones