Beast code in Giters

lvzhiqiang's repositories

3m-asr

Language:PythonApache-2.0010

Anomaly-Transformer

About Code release for "Anomaly Transformer: Time Series Anomaly Detection with Association Discrepancy" (ICLR 2022 Spotlight), https://openreview.net/forum?id=LzQQ89U1qm_

Language:PythonMIT010

audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Language:PythonMIT000

bark

🔊 Text-Prompted Generative Audio Model

Language:PythonNOASSERTION000

best-rq-pytorch

Implementation of BEST-RQ - a model for self-supervised learning of speech signals using a random projection quantizer, in Pytorch.

Language:PythonMIT000

client

Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.

Language:C++BSD-3-Clause000

codon

A high-performance, zero-overhead, extensible Python compiler using LLVM

Language:C++NOASSERTION000

datasets_emotion

This repository collects information about different data sets for Music Emotion Recognition.

010

DeepAFx-ST

DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/

Language:PythonNOASSERTION010

deepvac

PyTorch python project standard.

Language:PythonGPL-3.0010

espeak-ng

eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

Language:CGPL-3.0000

espnet_onnx

Onnx wrapper for espnet infrernce model

Language:PythonMIT010

FastSpeech2

PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech

Language:Jupyter NotebookApache-2.0010

gpuRIR

Python library for Room Impulse Response (RIR) simulation with GPU acceleration

AGPL-3.0000

gtn_applications

Applications using the GTN library and code to reproduce experiments in "Differentiable Weighted Finite-State Transducers"

Language:PythonMIT010

HanLP

中文分词词性标注命名实体识别依存句法分析成分句法分析语义依存分析语义角色标注指代消解风格转换语义相似度新词发现关键词短语提取自动摘要文本分类聚类拼音简繁转换自然语言处理

Language:PythonApache-2.0010

laion-prepro

Get hundred of million of image+url from the crawling at home dataset and preprocess them

Language:Python010

llama.cpp

My develoopment fork of llama.cpp. For now working on RK3588 NPU backend

Language:CMIT000

mfa

About how to use 'Montreal Forced Aligner'.

010

MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:JavaScriptNOASSERTION010

Multi-Singer

PyTorch Implementation of Multi-Singer (ACM-MM'21)

MIT000

music_source_separation

Language:PythonNOASSERTION010

pkwrap

A pytorch wrapper for LF-MMI training and parallel training in Kaldi

Language:PythonNOASSERTION010

PSL

Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"

Language:PythonGPL-3.0010

riffusion-app

Stable diffusion for real-time music generation (web app)

Language:TypeScriptMIT000

rVADfast

This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.

Language:PythonMIT010

SFANC-Window

Real-time Implementation of CNN-based selective fixed-filter active noise control and effectiveness analysis using explainable AI

Language:Python000

SpeechAlgorithms

Speech Algorithms Collections

Language:CApache-2.0010

Squeezeformer

Squeezeformer: An Efficient Transformer for Automatic Speech Recognition

Language:PythonApache-2.0010

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language:C++Apache-2.0010