lvzhiqiang's repositories
Anomaly-Transformer
About Code release for "Anomaly Transformer: Time Series Anomaly Detection with Association Discrepancy" (ICLR 2022 Spotlight), https://openreview.net/forum?id=LzQQ89U1qm_
audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
bark
🔊 Text-Prompted Generative Audio Model
best-rq-pytorch
Implementation of BEST-RQ - a model for self-supervised learning of speech signals using a random projection quantizer, in Pytorch.
client
Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.
codon
A high-performance, zero-overhead, extensible Python compiler using LLVM
datasets_emotion
This repository collects information about different data sets for Music Emotion Recognition.
DeepAFx-ST
DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/
espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
espnet_onnx
Onnx wrapper for espnet infrernce model
FastSpeech2
PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech
gpuRIR
Python library for Room Impulse Response (RIR) simulation with GPU acceleration
gtn
Automatic differentiation with weighted finite-state transducers.
gtn_applications
Applications using the GTN library and code to reproduce experiments in "Differentiable Weighted Finite-State Transducers"
laion-prepro
Get hundred of million of image+url from the crawling at home dataset and preprocess them
llama.cpp
My develoopment fork of llama.cpp. For now working on RK3588 NPU backend
mfa
About how to use 'Montreal Forced Aligner'.
MockingBird
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
Multi-Singer
PyTorch Implementation of Multi-Singer (ACM-MM'21)
pkwrap
A pytorch wrapper for LF-MMI training and parallel training in Kaldi
riffusion-app
Stable diffusion for real-time music generation (web app)
SpeechAlgorithms
Speech Algorithms Collections
Squeezeformer
Squeezeformer: An Efficient Transformer for Automatic Speech Recognition
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit