ZihengWu's starred repositories
professional-programming
A collection of learning resources for curious software engineers
MockingBird
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
MediaCrawler
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫
gemma_pytorch
The official PyTorch implementation of Google's Gemma models
InternLM-XComposer
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!
Awesome-Video-Diffusion-Models
[Arxiv] A Survey on Video Diffusion Models
EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
Make-An-Audio
PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model
animate-anything
Fine-Grained Open Domain Image Animation with Motion Guidance
HuggingFace-Download-Accelerator
利用HuggingFace的官方下载工具从镜像网站进行高速下载。
distrifuser
[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
video2numpy
Optimized library for large-scale extraction of frames and audio from video.
coze-beautify
针对 coze (目前可免费使用 GPT-4)https://www.coze.com (海外版) 和 https://www.coze.cn (大陆版) 的 bot 界面优化的 Chrome 插件
llm-scheduling-artifact
Artifact of OSDI '24 paper, ”Llumnix: Dynamic Scheduling for Large Language Model Serving“