Pan's repositories
Awesome-Multimodal-Large-Language-Models
Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
canvas-lms
The open LMS by Instructure, Inc.
ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model
ChatGPT-Next-Web
Deploy a well-designed ChatGPT web UI on Vercel in one click and run your own ChatGPT web service.
clash-for-linux
Use Clash as a proxy tool on Linux.
homepage
My homepage.
CPM-Bee
A Chinese-English bilingual foundation model with ten billion parameters.
faster-whisper
Faster Whisper transcription with CTranslate2
Langchain-Chatchat
Langchain-Chatchat (formerly Langchain-ChatGLM): a local knowledge-base question-answering app built on Langchain and language models such as ChatGLM.
muzic
Muzic: Music Understanding and Generation with Artificial Intelligence
Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
parrot_drone
Simple instructions to control the parrot drone in Sphinx/real environment.
Prometheus
Open source software for autonomous drones.
read-papers
Record the contents of the paper in a concise form.
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
SadTalker-Video-Lip-Sync
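This project implements Wav2lip-style video lip synthesis based on SadTalker. It generates lip movements driven by audio, taking a video file as input, and applies a configurable enhancement to the synthesized lip (face) region to sharpen the generated lips. It then uses the DAIN deep-learning frame-interpolation algorithm to fill in frames of the generated video, smoothing the transitions between synthesized lip movements so that the result looks more fluid, realistic, and natural.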
so-vits-svc
SoftVC VITS Singing Voice Conversion
so-vits-svc-5.0
Core Engine of Singing Voice Conversion & Singing Voice Clone
tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
VITS-fast-fine-tuning
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Whisper-Finetune
Fine-tune the Whisper speech recognition model and accelerate inference, with support for Web and Android deployment.