lxz (daxiangpanda)

daxiangpanda

Geek Repo

Company:UESTC

Location:Sichuan Province,China

Github PK Tool:Github PK Tool

lxz's repositories

stable-diffusion-webui

Stable Diffusion web UI

Language:PythonStargazers:1Issues:1Issues:0

tacotronv2_wavernn_chinese

tacotronV2 + wavernn 实现中文语音合成(Tensorflow + pytorch)

Language:PythonStargazers:1Issues:1Issues:0

audiocraft_plus

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

bark-training-cloning

for training the model

License:NOASSERTIONStargazers:0Issues:0Issues:0

carefree-creator

An AI-powered creator for everyone.

Stargazers:0Issues:0Issues:0

DiffSinger

PyTorch Implementation of DiffSinger: Diffusion Acoustic Model for Singing Voice Synthesis (TTS Extension)

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

DiffSinger-1

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Forked and maintained by the OpenVPI community

License:MITStargazers:0Issues:0Issues:0

disable-flutter-tls-verification

A Frida script that disables Flutter's TLS verification

Stargazers:0Issues:0Issues:0

dream-textures

Stable Diffusion built-in to the Blender shader editor

License:GPL-3.0Stargazers:0Issues:0Issues:0

Equalizer

Equalizer on python

Language:PythonStargazers:0Issues:1Issues:0

facechain

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Games

Home Page Link:

Stargazers:0Issues:0Issues:0

lobe-chat

🤖 Lobe Chat - an open-source, high-performance chatbot framework that supports speech synthesis, multimodal, and extensible Function Call plugin system. Supports one-click free deployment of your private ChatGPT/LLM web application.

License:MITStargazers:0Issues:0Issues:0

MDM

MDM

License:MITStargazers:0Issues:0Issues:0

midi-js-soundfonts

Pre-rendered General MIDI soundfonts that can be used immediately with MIDI.js

License:MITStargazers:0Issues:1Issues:0

muzic

Muzic: Music Understanding and Generation with Artificial Intelligence

License:MITStargazers:0Issues:0Issues:0

OpenVoice

Instant voice cloning by MyShell.

License:MITStargazers:0Issues:0Issues:0

PaddleSpeech

Easy-to-use Speech Toolkit including SOTA/Streaming ASR witch punctuation, influential TTS with text frontend, Speaker Verification System and End-to-End Speech Simultaneous Translation.

License:Apache-2.0Stargazers:0Issues:0Issues:0

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

License:MITStargazers:0Issues:0Issues:0

ppg-vc

PPG-Based Voice Conversion

License:Apache-2.0Stargazers:0Issues:0Issues:0

roop

one-click deepfake (face swap)

License:AGPL-3.0Stargazers:0Issues:0Issues:0

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

License:Apache-2.0Stargazers:0Issues:0Issues:0

singing_transcription_ICASSP2021

The source code and pre-trained model of the paper "On the Preparation and Validation of a Large-scale Dataset"

Language:PythonStargazers:0Issues:1Issues:0

so-vits-svc

SoftVC VITS Singing Voice Conversion

License:AGPL-3.0Stargazers:0Issues:0Issues:0

spleeter

Deezer source separation library including pretrained models.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Stargazers:0Issues:2Issues:0

UniAudio

The Open Source Code of UniAudio

Language:PythonStargazers:0Issues:0Issues:0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

License:MITStargazers:0Issues:0Issues:0

vits

VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai

License:MITStargazers:0Issues:0Issues:0

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

License:Apache-2.0Stargazers:0Issues:0Issues:0