zcy618's repositories

speechbrain

A PyTorch-based Speech Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

pyroomacoustics

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

DeepFilterNet

Noise suppression using deep filtering

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0

RapidOCR

Awesome multi-language OCR toolkits based on ONNXRuntime, OpenVINO and PaddlePaddle.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

room-impulse-responses

A list of publicly available room impulse response datasets and scripts to download them.

Stargazers: 0 | Issues: 0

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0

VoiceprintRecognition-Pytorch

This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net and CAM++. More models may be supported in the future. The project also supports the MelSpectrogram and Spectrogram data preprocessing methods.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

asteroid

The PyTorch-based audio source separation toolkit for researchers

License: MIT | Stargazers: 0 | Issues: 0

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

License: Apache-2.0 | Stargazers: 0 | Issues: 0

Yuzukilizard

Yuzukilizard is a Small Heterogeneous & AI Powered Dev Board Based on Allwinner V851S

License: CERN-OHL-S-2.0 | Stargazers: 0 | Issues: 0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

License: MIT | Stargazers: 0 | Issues: 0

torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

License: MIT | Stargazers: 0 | Issues: 0

InternLM

InternLM has open-sourced a 7 billion parameter base model, a chat model tailored for practical scenarios and the training system.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

MNBVC

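MNBVC (Massive Never-ending BT Vast Chinese corpus) is a very large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) text. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wiki pages, classical poetry, song lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs and more.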

License: MIT | Stargazers: 0 | Issues: 0

biquad

Collection of alterable digital biquad filters for dynamic audio effect creation

License: MIT | Stargazers: 0 | Issues: 0

UIT_Mobile

Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"

License: GPL-3.0 | Stargazers: 0 | Issues: 0

ComputeLibrary

The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.

License: MIT | Stargazers: 0 | Issues: 0

tvm

Open deep learning compiler stack for CPU, GPU and specialized accelerators

License: Apache-2.0 | Stargazers: 0 | Issues: 0

nnabla

Neural Network Libraries

License: Apache-2.0 | Stargazers: 0 | Issues: 0

cpp_torch

A tiny-dnn-style deep learning framework based on libtorch. Header-only, with no dependencies other than libtorch.

License: MIT | Stargazers: 0 | Issues: 0

tflite-micro

TensorFlow Lite for Microcontrollers

License: Apache-2.0 | Stargazers: 0 | Issues: 0

QRSolutionToMatrixInverse

We use the C language to implement QRSolutionToMatrixInverse.

Stargazers: 0 | Issues: 0

IS2022-CVQ

Samples for Complex VQ-VAE speech enhancement - ICASSP2021

Stargazers: 0 | Issues: 0

TaylorSENet

This is the implementation of the paper "Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement", which was accepted by IJCAI-ECAI 2022 (long oral).

License: MIT | Stargazers: 0 | Issues: 0

android-cmake-sample

Android and CMake sample - learn how to compile native code inside an Android app with CMake

Stargazers: 0 | Issues: 0

Speech-enhancement

Deep learning for audio denoising

License: MIT | Stargazers: 0 | Issues: 0

DTLN

TensorFlow 2.x implementation of the DTLN real-time speech denoising model, with TF-Lite, ONNX and real-time audio processing support.

License: MIT | Stargazers: 0 | Issues: 0

pytorch-distributed

A quickstart and benchmark for PyTorch distributed training.

License: MIT | Stargazers: 0 | Issues: 0

voice_activity_detection-1

Voice Activity Detection based on Deep Learning & TensorFlow

License: GPL-3.0 | Stargazers: 0 | Issues: 0