runngezhang-jx's repositories

Stargazers:0Issues:0Issues:0

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

License:Apache-2.0Stargazers:0Issues:0Issues:0

CTCNet

An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits

License:Apache-2.0Stargazers:0Issues:0Issues:0

phasen

A unofficial Pytorch implementation of Microsoft's PHASEN

Language:PythonStargazers:0Issues:0Issues:0
Language:C++License:MITStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

mlx-examples

Examples in the MLX framework

License:MITStargazers:0Issues:0Issues:0

txtai

💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows

License:Apache-2.0Stargazers:0Issues:0Issues:0

basic-pitch

A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

License:Apache-2.0Stargazers:0Issues:0Issues:0

awesome-production-machine-learning

A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning

License:MITStargazers:0Issues:0Issues:0

pybind11

Seamless operability between C++11 and Python

License:NOASSERTIONStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

llm-action

本项目旨在分享大模型相关技术原理以及实战经验。

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

silero-vad2

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

License:MITStargazers:0Issues:0Issues:0

dlrover

DLRover: An Automatic Distributed Deep Learning System

License:NOASSERTIONStargazers:0Issues:0Issues:0

TNN

TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile、desktop and server. TNN is distinguished by several outstanding features, including its cross-platform capability, high performance, model compression and code pruning.

License:NOASSERTIONStargazers:0Issues:0Issues:0

AP-BWE

Towards Efficient and High-Quality Bandwidth Extension with Parallel Amplitude-Phase Prediction

License:MITStargazers:0Issues:0Issues:0

zhconv

Simple conversion and localization between simplified and traditional Chinese using tables from MediaWiki.

License:MITStargazers:0Issues:0Issues:0

python

Boost.org python module

License:BSL-1.0Stargazers:0Issues:0Issues:0

MoeVoiceStudio

一个使用C++编写的音频处理软件

License:AGPL-3.0Stargazers:0Issues:0Issues:0
License:NOASSERTIONStargazers:0Issues:0Issues:0

SRP-DNN

A python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022]

License:MITStargazers:0Issues:0Issues:0

fastRAG

Efficient Retrieval Augmentation and Generation Framework

License:Apache-2.0Stargazers:0Issues:0Issues:0

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

DTLN_pytorch

Dual-signal Transformation LSTM Network, PyTorch,NCNN

License:Apache-2.0Stargazers:0Issues:0Issues:0

ECAPA-TDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

License:MITStargazers:0Issues:0Issues:0

odas

ODAS: Open embeddeD Audition System

License:MITStargazers:0Issues:0Issues:0

DNS-Challenge

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

Language:PythonLicense:CC-BY-4.0Stargazers:0Issues:0Issues:0