hly990's starred repositories
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
one-api
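OpenAI API management & redistribution system supporting Azure, Anthropic Claude, Google PaLM 2 & Gemini, Zhipu ChatGLM, Baidu ERNIE Bot, iFlytek Spark, Alibaba Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan. It can be used to manage and redistribute keys through a single API for all LLMs, ships as a single executable with a prebuilt Docker image for one-click, out-of-the-box deployment, and features an English UI.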
pyvideotrans
Translate a video from one language to another and add dubbing, with support for API calls.
Bert-VITS2
vits2 backbone with multilingual-bert
modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
StoryDiffusion
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
ProPainter
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
gluestack-ui
React & React Native Components & Patterns (copy-paste components & patterns crafted with Tailwind CSS (NativeWind))
ailia-models
The collection of pre-trained, state-of-the-art AI models for ailia SDK
cog-face-to-many
Turn any face into a video game character, pixel art, claymation, 3D or toy
VideoLLaMA2
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
hallo-for-windows
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
JianYingApi
Third-party JianYing (剪映) API.
Advanced-QA-and-RAG-Series
This repository contains advanced LLM-based chatbots for Q&A using LLM agents and Retrieval Augmented Generation (RAG) with different databases (VectorDB, GraphDB, SQLite, CSV, XLSX, etc.).
JianYingSrt
Simulates JianYing (剪映) subtitle conversion.
bark-rvc-pipeline
TTS pipeline that uses RVC to enhance Bark audio quality and cloning
Lets_Build_Market_Analysis_Team_w_AI_Agents
Let's Build Market Analysis Team w/ AI Agents