runngezhang-jx

runngezhang-jx's repositories

kissdsp

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Apache-2.0

CTCNet

An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits

Apache-2.0

phasen

An unofficial PyTorch implementation of Microsoft's PHASEN

Language: Python

silero-vad

Language: C++ · MIT

VideoGPT

MIT

mlx-examples

Examples in the MLX framework

MIT

txtai

💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows

Apache-2.0

basic-pitch

A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

Apache-2.0

awesome-production-machine-learning

A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning

MIT

pybind11

Seamless operability between C++11 and Python

NOASSERTION

WavLM-DIHARD3

mvdrpf

MIT

llm-action

This project shares the technical principles behind large language models along with hands-on experience.

Apache-2.0

KWS_NLTM

silero-vad2

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

MIT

dlrover

DLRover: An Automatic Distributed Deep Learning System

NOASSERTION

TNN

TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile, desktop, and server. TNN is distinguished by several outstanding features, including its cross-platform capability, high performance, model compression, and code pruning.

NOASSERTION

AP-BWE

Towards Efficient and High-Quality Bandwidth Extension with Parallel Amplitude-Phase Prediction

MIT

zhconv

Simple conversion and localization between simplified and traditional Chinese using tables from MediaWiki.

MIT

python

Boost.org python module

BSL-1.0

MoeVoiceStudio

An audio processing application written in C++

AGPL-3.0

IF

NOASSERTION

SRP-DNN

A Python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022]

MIT

fastRAG

Efficient Retrieval Augmentation and Generation Framework

Apache-2.0

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language: Python · Apache-2.0

DTLN_pytorch

Dual-signal Transformation LSTM Network, PyTorch, NCNN

Apache-2.0

ECAPA-TDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 on Vox1_O when trained only on Vox2)

MIT

odas

ODAS: Open embeddeD Audition System

MIT

DNS-Challenge

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

Language: Python · CC-BY-4.0