Beast code in Giters

lxz's repositories

stable-diffusion-webui

Stable Diffusion web UI

Language:Python1 10

tacotronv2_wavernn_chinese

tacotronV2 + wavernn 实现中文语音合成(Tensorflow + pytorch)

Language:Python1 10

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonMIT000

bark-training-cloning

for training the model

NOASSERTION000

carefree-creator

An AI-powered creator for everyone.

000

DiffSinger

PyTorch Implementation of DiffSinger: Diffusion Acoustic Model for Singing Voice Synthesis (TTS Extension)

Language:PythonMIT010

DiffSinger-1

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Forked and maintained by the OpenVPI community

MIT000

disable-flutter-tls-verification

A Frida script that disables Flutter's TLS verification

000

dream-textures

Stable Diffusion built-in to the Blender shader editor

GPL-3.0000

facechain

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

Language:PythonApache-2.0000

Games

Home Page Link:

000

lobe-chat

🤖 Lobe Chat - an open-source, high-performance chatbot framework that supports speech synthesis, multimodal, and extensible Function Call plugin system. Supports one-click free deployment of your private ChatGPT/LLM web application.

MIT000

MDM

MIT000

metahuman-stream

Real time interactive streaming digital human

MIT000

midi-js-soundfonts

Pre-rendered General MIDI soundfonts that can be used immediately with MIDI.js

MIT010

muzic

Muzic: Music Understanding and Generation with Artificial Intelligence

MIT000

OpenVoice

Instant voice cloning by MyShell.

MIT000

PaddleSpeech

Easy-to-use Speech Toolkit including SOTA/Streaming ASR witch punctuation, influential TTS with text frontend, Speaker Verification System and End-to-End Speech Simultaneous Translation.

Apache-2.0000

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

MIT000

ppg-vc

PPG-Based Voice Conversion

Apache-2.0000

roop

one-click deepfake (face swap)

AGPL-3.0000

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Apache-2.0000

singing_transcription_ICASSP2021

The source code and pre-trained model of the paper "On the Preparation and Validation of a Large-scale Dataset"

Language:Python010

so-vits-svc

SoftVC VITS Singing Voice Conversion

AGPL-3.0000

spleeter

Deezer source separation library including pretrained models.

Language:PythonMIT010

test_push

020

UniAudio

The Open Source Code of UniAudio

Language:Python000

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

MIT000

vits

VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai

MIT000

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Apache-2.0000

daxiangpanda

lxz's repositories

stable-diffusion-webui

tacotronv2_wavernn_chinese

audiocraft_plus

bark-training-cloning

carefree-creator

DiffSinger

DiffSinger-1

disable-flutter-tls-verification

dream-textures

facechain

Games

lobe-chat

MDM

metahuman-stream

midi-js-soundfonts

muzic

OpenVoice

PaddleSpeech

ParallelWaveGAN

ppg-vc

roop

segment-anything

singing_transcription_ICASSP2021

so-vits-svc

spleeter

test_push

UniAudio

unilm

vits

wenet