Besharp's starred repositories
AnimateDiff
Official implementation of AnimateDiff.
clone-voice
A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频
ChatTTS-ui
一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces.
pyvideotrans
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,并添加配音
FlameGraph
Stack trace visualizer
HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
LLaMA-Factory
Unify Efficient Fine-Tuning of 100+ LLMs
DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
Megatron-LM
Ongoing research training transformer models at scale
CodeFormer_GUI
CodeFormer人脸清晰化工具图形界面版,自带环境解压即用
CodeFormer
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
rust-book-chinese
rust 程序设计语言 中文版
comfyui-portrait-master-zh-cn
肖像大师 中文版 comfyui-portrait-master
DiffSynth-Studio
Enjoy the magic of Diffusion models!
RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)
RoFormer_pytorch
RoFormer V1 & V2 pytorch