scottsln's repositories
audio-slicer
Python script that slices audio with silence detection
AudioSlicer
Audio Slicer that uses silence detection to split .wav audio files into several .wav samples.
bark
🔊 Text-Prompted Generative Audio Model
ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
chatgpt-mirror
A mirror of ChatGPT based on the gpt-3.5-turbo model.
chatgpt-web
用 Express 和 Vue3 搭建的 ChatGPT 演示网页
DDSP-SVC
Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)
DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
HairCLIPv2
[ICCV 2023] HairCLIPv2: Unifying Hair Editing via Proxy Feature Blending
langchain-ChatGLM
langchain-ChatGLM, local knowledge based ChatGLM with langchain | 基于本地知识库的 ChatGLM 问答
mmdetection
OpenMMLab Detection Toolbox and Benchmark
MoeVoiceStudio
一个使用C++编写的音频处理软件
Openai-whisper
Robust Speech Recognition via Large-Scale Weak Supervision
PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
piper
A fast, local neural text to speech system
rustdesk
An open-source remote desktop, and alternative to TeamViewer.
SadTalker
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
so-vits-svc
SoftVC VITS Singing Voice Conversion
so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
stylegan2
StyleGAN2 - Official TensorFlow Implementation
stylegan2-ada-pytorch
StyleGAN2-ADA - Official PyTorch implementation
stylegan3
Official PyTorch implementation of StyleGAN3
ultimatevocalremovergui
GUI for a Vocal Remover that uses Deep Neural Networks.
vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
VITS-fast-fine-tuning
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
voice-changer
リアルタイムボイスチェンジャー Realtime Voice Changer
Yi
A series of large language models trained from scratch by developers @01-ai
yt-dlp
A youtube-dl fork with additional features and fixes