neargostudio's repositories
AI-Vtuber
AI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain(本地/llm)/chatglm/text-generation-webui/闻达/文心一言】 驱动的虚拟主播【Live2D/UE/xuniren】,可以在 【Bilibili/抖音/快手/斗鱼/YouTube/twitch】 直播中与观众实时互动 或 直接在本地进行聊天。它使用自然语言处理和文本转语音技术【edge-tts/VITS/elevenlabs/bark/VALL-E-X】生成回答并可以选择【so-vits-svc/DDSP-SVC】变声;通过特定指令协同SD进行画图。
AnimatedDrawings
Code to accompany "A Method for Animating Children's Drawings of the Human Figure"
Auto-Synced-Translated-Dubs
Automatically translates the text of a video based on a subtitle file, and also uses AI voice to dub the video, and synced using the subtitle's timings
bark
🔊 Text-Prompted Generative Audio Model
CodeFormer
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
douyin-Fay
抖音[直播伴侣]推流密钥获取工具 抖音直播间弹幕、进入房间等数据通过Websocket对接Fay
douyin-signature
douyin signature
DouyinLiveRecorder
可循环值守和多人录制的直播录制软件,支持抖音、TikTok、快手、虎牙、斗鱼、B站、小红书、网易cc、pandaTV等平台直播录制,抓取多平台直播源地址,抖音无水印解析,快手无水印解析
duix.ai
offline 2d digitalhuman demo for edge devices (android/ios/etc.)
elevenlabs-python
The official Python API for ElevenLabs text-to-speech.
facefusion
Next generation face swapper and enhancer
faster-whisper
Faster Whisper transcription with CTranslate2
FastGPT
FastGPT is a knowledge-based QA system built on the LLM, offers out-of-the-box data processing and model invocation capabilities, allows for workflow orchestration through Flow visualization!
fxsound-app
FxSound application and DSP source code
gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
gptsovits-api
适用于 GPT-SoVITS 的api调用接口
HeyGenClone
A simple and open-source analogue of the HeyGen system
HotPatcher
Unreal Engine hot update manage and package plugin.
metaperson-ue-sample
MetaPerson - sample for Unreal Engine 5
MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
NativeSpeaker
make your Speaker talking as Native style with own voice!
one-api
OpenAI 接口管理 & 分发系统,支持 Azure、Anthropic Claude、Google PaLM 2、智谱 ChatGLM、百度文心一言、讯飞星火认知、阿里通义千问、360 智脑以及腾讯混元,可用于二次分发管理 key,仅单可执行文件,已打包好 Docker 镜像,一键部署,开箱即用. OpenAI key management & redistribution system, using a single API for all LLMs, and features an English UI.
PHP-FFMpeg
An object oriented PHP driver for FFMpeg binary
promptbase
All things prompt engineering
spleeter
Deezer source separation library including pretrained models.
TalkingHead
Talking Head (3D): A JavaScript class for real-time lip-sync using Ready Player Me full-body 3D avatars.
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
workerman
An asynchronous event driven PHP socket framework. Supports HTTP, Websocket, SSL and other custom protocols.