lvzhiqiang's repositories

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

Anomaly-Transformer

About Code release for "Anomaly Transformer: Time Series Anomaly Detection with Association Discrepancy" (ICLR 2022 Spotlight), https://openreview.net/forum?id=LzQQ89U1qm_

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

bark

🔊 Text-Prompted Generative Audio Model

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

best-rq-pytorch

Implementation of BEST-RQ - a model for self-supervised learning of speech signals using a random projection quantizer, in Pytorch.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

client

Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.

Language:C++License:BSD-3-ClauseStargazers:0Issues:0Issues:0

codon

A high-performance, zero-overhead, extensible Python compiler using LLVM

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

datasets_emotion

This repository collects information about different data sets for Music Emotion Recognition.

Stargazers:0Issues:1Issues:0

DeepAFx-ST

DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

deepvac

PyTorch python project standard.

Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0

espeak-ng

eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

License:GPL-3.0Stargazers:0Issues:0Issues:0

espnet_onnx

Onnx wrapper for espnet infrernce model

License:MITStargazers:0Issues:0Issues:0

FastSpeech2

PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech

License:Apache-2.0Stargazers:0Issues:0Issues:0

gpuRIR

Python library for Room Impulse Response (RIR) simulation with GPU acceleration

License:AGPL-3.0Stargazers:0Issues:0Issues:0

gtn

Automatic differentiation with weighted finite-state transducers.

License:MITStargazers:0Issues:0Issues:0

gtn_applications

Applications using the GTN library and code to reproduce experiments in "Differentiable Weighted Finite-State Transducers"

License:MITStargazers:0Issues:0Issues:0

HanLP

中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析 语义角色标注 指代消解 风格转换 语义相似度 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

laion-prepro

Get hundred of million of image+url from the crawling at home dataset and preprocess them

Stargazers:0Issues:0Issues:0

llama.cpp

My develoopment fork of llama.cpp. For now working on RK3588 NPU backend

License:MITStargazers:0Issues:0Issues:0

mfa

About how to use 'Montreal Forced Aligner'.

Stargazers:0Issues:0Issues:0

MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

License:NOASSERTIONStargazers:0Issues:0Issues:0

Multi-Singer

PyTorch Implementation of Multi-Singer (ACM-MM'21)

License:MITStargazers:0Issues:0Issues:0
License:NOASSERTIONStargazers:0Issues:0Issues:0

pkwrap

A pytorch wrapper for LF-MMI training and parallel training in Kaldi

License:NOASSERTIONStargazers:0Issues:0Issues:0

PSL

Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"

Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0

riffusion-app

Stable diffusion for real-time music generation (web app)

License:MITStargazers:0Issues:0Issues:0

rVADfast

This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

SpeechAlgorithms

Speech Algorithms Collections

Language:CLicense:Apache-2.0Stargazers:0Issues:1Issues:0

Squeezeformer

Squeezeformer: An Efficient Transformer for Automatic Speech Recognition

License:Apache-2.0Stargazers:0Issues:0Issues:0

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

License:Apache-2.0Stargazers:0Issues:0Issues:0