Beast code in Giters

xcarson's starred repositories

VideoLingo

Netflix级字幕切割翻译、精确对齐和个性化配音，一键全自动视频搬运

Language:PythonMIT78700

Omost

Your image is almost there!

Language:PythonApache-2.0712900

ReHiFace-S

Real Time High-Fidelity Faceswap

Language:PythonNOASSERTION22000

fabric

fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.

Language:Go2146100

generative-models

Generative Models by Stability AI

Language:PythonMIT2388500

Deep-Live-Cam

real time face swap and one-click video deepfake with only a single image

Language:PythonAGPL-3.02975300

一个基于Entity-Component模式的灵活、通用、可扩展的轻量战斗（技能）框架，配置可选使用ScriptableObject或是Excel表格. A flexible, generic, easy to extend, lightweight combat (skills) framework based on Entity-Component pattern. Configuration can choose to use ScriptableObject or Excel tables.

Language:C#MIT190100

flux

Official inference repo for FLUX.1 models

Language:PythonApache-2.01187000

AutoLOD

Automatic LOD generation + scene optimization

Language:C#NOASSERTION179400

stack

Open-source Clerk/Auth0 alternative

Language:TypeScriptNOASSERTION270100

parler-tts

Inference and training library for high-quality TTS models.

Language:PythonApache-2.0395300

EchoMimic

Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Language:PythonApache-2.0210100

wiseflow

Wiseflow is an agile information mining tool that extracts concise messages from various sources such as websites, WeChat official accounts, social platforms, etc. It automatically categorizes and uploads them to the database.

Language:JavaScriptNOASSERTION329500

exo

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Language:PythonGPL-3.0603000

mem0

The memory layer for Personalized AI

Language:PythonApache-2.02016000

CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Language:PythonApache-2.0417000

Qwen2-Audio

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Language:Python93300

EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Language:PythonApache-2.0710200

MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

Language:Jupyter NotebookAGPL-3.0238100

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonMIT6655500

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language:PythonBSD-2-Clause1062800

PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Language:PythonApache-2.01081800

xcarson