Beast code in Giters

zcy618's repositories

speechbrain

A PyTorch-based Speech Toolkit

Language:PythonApache-2.0000

pyroomacoustics

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Language:PythonMIT000

DeepFilterNet

Noise supression using deep filtering

Language:PythonNOASSERTION000

RapidOCR

Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVION and PaddlePaddle.

Apache-2.0000

room-impulse-responses

A list of publicly available room impulse response datasets and scripts to download them.

000

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.

Language:PythonNOASSERTION000

This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not excluded that more models will be supported in the future. At the same time, this project also supports MelSpectrogram, Spectrogram data preprocessing methods

Apache-2.0000

asteroid

The PyTorch-based audio source separation toolkit for researchers

MIT000

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Apache-2.0000

Yuzukilizard

Yuzukilizard is a Small Heterogeneous & AI Powered Dev Board Based on Allwinner V851S

CERN-OHL-S-2.0000

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

MIT000

torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

MIT000

InternLM

InternLM has open-sourced a 7 billion parameter base model, a chat model tailored for practical scenarios and the training system.

Apache-2.0000

MNBVC

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化，也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

MIT000

biquad

Collection of alterable digital biquad filters for dynamic audio effect creation

MIT000

UIT_Mobile

Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"

GPL-3.0000

ComputeLibrary

The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.

MIT000

tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Apache-2.0000

nnabla

Neural Network Libraries

Apache-2.0000

cpp_torch

It is tiny-dnn based on libtorch. Only headers without dependencies other than libtorch, deep learning framework

MIT000

tflite-micro

TensorFlow Lite for Microcontrollers

Apache-2.0000

QRSolutionToMatrixInverse

We use c languafe to implememnt the QRSolutionToMatrixInverse.

000

IS2022-CVQ

Samples for Complex VQ-VAE speech enhancement - ICASSP2021

000

TaylorSENet

This is the implementation of the paper ''Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement'', which was accepted by IJCAI-ECAI2022 (Long oral)

MIT000