garrywrj's starred repositories

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter NotebookLicense:MITStargazers:33208Issues:309Issues:414

google-research

Google Research

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:33143Issues:749Issues:1191

transferlearning

Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-迁移学习

Language:PythonLicense:MITStargazers:12989Issues:341Issues:334

AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Language:PythonLicense:NOASSERTIONStargazers:9834Issues:131Issues:46

cupy

NumPy & SciPy for GPU

Language:PythonLicense:MITStargazers:7868Issues:127Issues:2175

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language:PythonLicense:NOASSERTIONStargazers:3955Issues:48Issues:841

awesome-speech-recognition-speech-synthesis-papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

nlp-competitions-list-review

复盘所有NLP比赛的TOP方案,只关注NLP比赛,持续更新中!

DeepFilterNet

Noise supression using deep filtering

Language:PythonLicense:NOASSERTIONStargazers:2036Issues:30Issues:257
Language:PythonLicense:Apache-2.0Stargazers:802Issues:47Issues:589

MQBench

Model Quantization Benchmark

Language:ShellLicense:Apache-2.0Stargazers:730Issues:14Issues:195

ssast

Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".

Language:PythonLicense:BSD-3-ClauseStargazers:347Issues:8Issues:33

onnx-tool

A parser, editor and profiler tool for ONNX models.

Language:PythonLicense:MITStargazers:321Issues:6Issues:63

pyaec

simple and efficient python implemention of a series of adaptive filters. including time domain adaptive filters(lms、nlms、rls、ap、kalman)、nonlinear adaptive filters(volterra filter、functional link adaptive filters)、frequency domain adaptive filters(frequency domain adaptive filter、frequency domain kalman filter) for acoustic echo cancellation.

Language:PythonLicense:Apache-2.0Stargazers:286Issues:5Issues:4

awesome-keyword-spotting

This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).

License:MITStargazers:228Issues:11Issues:0

EfficientWord-Net

OneShot Learning-based hotword detection.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:211Issues:12Issues:36

malaya-speech

Speech Toolkit for Malaysian language, https://malaya-speech.readthedocs.io/

Language:Jupyter NotebookLicense:MITStargazers:195Issues:18Issues:41

EfficientAT

This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training and extraction of audio embeddings.

Language:PythonLicense:MITStargazers:191Issues:5Issues:25
Language:PythonLicense:Apache-2.0Stargazers:161Issues:8Issues:7

3m-asr

3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition

Language:PythonLicense:Apache-2.0Stargazers:116Issues:6Issues:5

ssspy

A Python toolkit for sound source separation.

Language:PythonLicense:Apache-2.0Stargazers:112Issues:6Issues:103

Wav2Keyword

Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.

Language:PythonLicense:MITStargazers:95Issues:5Issues:11

LLaMA-Pruning

Structural Pruning for LLaMA

Language:PythonLicense:GPL-3.0Stargazers:53Issues:9Issues:3

unsup_speech_enh_adaptation

Unsupervised domain adaptation for conversational speech enhancement using RemixIT

Language:Jupyter NotebookLicense:MITStargazers:50Issues:3Issues:5
Language:HTMLLicense:CC0-1.0Stargazers:31Issues:2Issues:0

NeSsi

Keras/Pytorch neural network size, operations and parameters counter

Language:MATLABStargazers:7Issues:0Issues:0