hly990's starred repositories

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language: Python | License: MIT | Stargazers: 33314 | Issues: 204 | Issues: 1233

roop

one-click face swap

Language: Python | License: GPL-3.0 | Stargazers: 28133 | Issues: 253 | Issues: 0

chatwoot

Open-source live-chat, email support, omni-channel desk. An alternative to Intercom, Zendesk, Salesforce Service Cloud etc. 🔥💬

Language: Ruby | License: NOASSERTION | Stargazers: 20645 | Issues: 232 | Issues: 4316

one-api

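OpenAI API management and distribution system supporting Azure, Anthropic Claude, Google PaLM 2 & Gemini, Zhipu ChatGLM, Baidu ERNIE Bot (Wenxin Yiyan), iFlytek Spark, Alibaba Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan. It can be used to manage and redistribute keys, ships as a single executable with a prebuilt Docker image, and deploys in one click, ready to use out of the box. It exposes a single API for all LLMs and features an English UI.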

Language: JavaScript | License: MIT | Stargazers: 18210 | Issues: 99 | Issues: 1426

pyvideotrans

Translate a video from one language to another and add dubbing. API calls are also supported.

Language: Python | License: GPL-3.0 | Stargazers: 10137 | Issues: 65 | Issues: 519

Bert-VITS2

vits2 backbone with multilingual-bert

Language: Python | License: AGPL-3.0 | Stargazers: 7845 | Issues: 47 | Issues: 0

modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Language: Python | License: Apache-2.0 | Stargazers: 6854 | Issues: 71 | Issues: 587

StoryDiffusion

Accepted as [NeurIPS 2024] Spotlight Presentation Paper

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 5828 | Issues: 86 | Issues: 143

ProPainter

[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting

Language: Python | License: NOASSERTION | Stargazers: 5497 | Issues: 55 | Issues: 87

JioNLP

A Chinese NLP preprocessing and parsing toolkit: accurate, efficient, and easy to use. www.jionlp.com

Language: Python | License: Apache-2.0 | Stargazers: 3299 | Issues: 34 | Issues: 211

RouteLLM

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!

Language: Python | License: Apache-2.0 | Stargazers: 2959 | Issues: 25 | Issues: 46

MuseTalk

MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting

Language: Python | License: NOASSERTION | Stargazers: 2502 | Issues: 49 | Issues: 186

gluestack-ui

React & React Native Components & Patterns (copy-paste components & patterns crafted with Tailwind CSS (NativeWind))

Language: TypeScript | License: MIT | Stargazers: 2434 | Issues: 20 | Issues: 449

Qwen2-VL

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Language: Python | License: Apache-2.0 | Stargazers: 2400 | Issues: 25 | Issues: 267

ailia-models

The collection of pre-trained, state-of-the-art AI models for ailia SDK

TokenFlow

Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024)

Language: Python | License: MIT | Stargazers: 1557 | Issues: 77 | Issues: 43

cog-face-to-many

Turn any face into a video game character, pixel art, claymation, 3D or toy

Language: Python | License: NOASSERTION | Stargazers: 1250 | Issues: 9 | Issues: 48

VideoLLaMA2

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Language: Python | License: Apache-2.0 | Stargazers: 755 | Issues: 9 | Issues: 82

Uni-TTS

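This project aims to unify the way different speech synthesis engines are used. It supports adapters for multiple speech synthesis engines and can be used directly as a module or started as a backend service.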

Language: Python | License: MIT | Stargazers: 631 | Issues: 8 | Issues: 31

sentient

The framework/SDK that lets you build browser-controlling agents in 3 lines of code. Join the chat at https://discord.gg/umgnyQU2K8

Language: Python | License: MIT | Stargazers: 366 | Issues: 5 | Issues: 19

chat-api

Secondary development built on top of One API and New API.

Language: JavaScript | License: NOASSERTION | Stargazers: 365 | Issues: 8 | Issues: 81

hallo-for-windows

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Language: Python | License: MIT | Stargazers: 183 | Issues: 6 | Issues: 6

JianYingApi

Third-party JianYing API.

Language: Python | License: MIT | Stargazers: 147 | Issues: 5 | Issues: 9

Advanced-QA-and-RAG-Series

This repository contains advanced LLM-based chatbots for Q&A that use LLM agents and Retrieval Augmented Generation (RAG) with different databases (VectorDB, GraphDB, SQLite, CSV, XLSX, etc.).

Language: Jupyter Notebook | Stargazers: 124 | Issues: 5 | Issues: 4

Language: Python | Stargazers: 42 | Issues: 0 | Issues: 0

font

OWenT's Utils -- Font branch

Language: Python | Stargazers: 39 | Issues: 2 | Issues: 0

JianYingSrt

Simulates JianYing's subtitle conversion.

Language: Python | License: GPL-3.0 | Stargazers: 35 | Issues: 1 | Issues: 13

bark-rvc-pipeline

TTS pipeline that uses RVC to enhance Bark audio quality and cloning

Language: Python | License: MIT | Stargazers: 6 | Issues: 0 | Issues: 0

Lets_Build_Market_Analysis_Team_w_AI_Agents

Let's Build Market Analysis Team w/ AI Agents

Language: Python | License: Apache-2.0 | Stargazers: 3 | Issues: 0 | Issues: 0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language: Python | License: MIT | Stargazers: 1 | Issues: 1 | Issues: 0