Beast code in Giters

zcy618's repositories

android-cmake-sample

Android and CMake sample - learn how to compile native code inside an Android app with CMake

Language:Kotlin000

asteroid

The PyTorch-based audio source separation toolkit for researchers

Language:PythonMIT000

biquad

Collection of alterable digital biquad filters for dynamic audio effect creation

MIT000

ComputeLibrary

The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.

MIT000

cpp_torch

It is tiny-dnn based on libtorch. Only headers without dependencies other than libtorch, deep learning framework

MIT000

DeepFilterNet

Noise supression using deep filtering

Language:PythonNOASSERTION000

DTLN

Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.

MIT000

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.

Language:PythonNOASSERTION000

InternLM

InternLM has open-sourced a 7 billion parameter base model, a chat model tailored for practical scenarios and the training system.

Apache-2.0000

IS2022-CVQ

Samples for Complex VQ-VAE speech enhancement - ICASSP2021

000

MNBVC

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化，也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

MIT000

pyroomacoustics

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Language:PythonMIT000

pytorch-distributed

A quickstart and benchmark for pytorch distributed training.

MIT000

QRSolutionToMatrixInverse

We use c languafe to implememnt the QRSolutionToMatrixInverse.

000

RapidOCR

Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVION and PaddlePaddle.

Apache-2.0000

room-impulse-responses

A list of publicly available room impulse response datasets and scripts to download them.

000

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Apache-2.0000

speechbrain

A PyTorch-based Speech Toolkit

Language:PythonApache-2.0000

TaylorSENet

This is the implementation of the paper ''Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement'', which was accepted by IJCAI-ECAI2022 (Long oral)

MIT000

tflite-micro

TensorFlow Lite for Microcontrollers

Apache-2.0000

torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

MIT000

tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Apache-2.0000

UIT_Mobile

Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"

GPL-3.0000

voice_activity_detection-1

Voice Activity Detection based on Deep Learning & TensorFlow

GPL-3.0000

This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not excluded that more models will be supported in the future. At the same time, this project also supports MelSpectrogram, Spectrogram data preprocessing methods

Apache-2.0000

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

MIT000

Yuzukilizard

Yuzukilizard is a Small Heterogeneous & AI Powered Dev Board Based on Allwinner V851S

CERN-OHL-S-2.0000

zcy618

zcy618's repositories

aispeech-earbuds

android-cmake-sample

asteroid

biquad

ComputeLibrary

cpp_torch

DeepFilterNet

DTLN

FunASR

InternLM

IS2022-CVQ

MNBVC

nnabla

pyroomacoustics

pytorch-distributed

QRSolutionToMatrixInverse

RapidOCR

room-impulse-responses

s3prl

Speech-enhancement

speechbrain

TaylorSENet

tflite-micro

torch-audiomentations

tvm

UIT_Mobile

voice_activity_detection-1

VoiceprintRecognition-Pytorch

whisper

Yuzukilizard