Steven Wang's starred repositories

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonLicense:MITStargazers:20353Issues:0Issues:0

e2e_lfmmi

E2E system with LF-MMI; word N-gram for Mandarin

Language:PythonStargazers:162Issues:0Issues:0

kaldifst

Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files

Language:C++License:NOASSERTIONStargazers:47Issues:0Issues:0

BeamformIt

BeamformIt acoustic beamforming software

Language:C++Stargazers:337Issues:0Issues:0

NotepadNext

A cross-platform, reimplementation of Notepad++

Language:C++License:GPL-3.0Stargazers:8822Issues:0Issues:0

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language:PythonLicense:NOASSERTIONStargazers:5273Issues:0Issues:0

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language:PythonLicense:BSD-2-ClauseStargazers:10334Issues:0Issues:0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:65563Issues:0Issues:0

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonLicense:Apache-2.0Stargazers:11157Issues:0Issues:0

CppCoreGuidelines-zh-CN

Translation of C++ Core Guidelines [https://github.com/isocpp/CppCoreGuidelines] into Simplified Chinese.

License:NOASSERTIONStargazers:2092Issues:0Issues:0

cs344

Introduction to Parallel Programming class code

Language:CudaStargazers:1289Issues:0Issues:0

awesome-neovim

Collections of awesome neovim plugins.

License:CC0-1.0Stargazers:15161Issues:0Issues:0

Book4_Power-of-Matrix

Book_4_《矩阵力量》 | 鸢尾花书:从加减乘除到机器学习;上架!

Language:PythonStargazers:8233Issues:0Issues:0

CUDA-Programming-Guide-in-Chinese

This is a Chinese translation of the CUDA programming guide

Stargazers:1080Issues:0Issues:0

kaldifeat

Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API

Language:C++License:NOASSERTIONStargazers:178Issues:0Issues:0

CL_Anthology

An anthology of recent continual learning papers, where people interested in this fascinating topic can start discovering its multidimensional representations.

Stargazers:9Issues:0Issues:0

The-Art-of-Linear-Algebra

Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"

Language:PostScriptLicense:CC0-1.0Stargazers:16719Issues:0Issues:0

INTERSPEECH-2023-Papers

INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!

License:MITStargazers:616Issues:0Issues:0

fast_rnnt

A torch implementation of a recursion which turns out to be useful for RNN-T.

Language:PythonLicense:NOASSERTIONStargazers:136Issues:0Issues:0

sherpa-onnx

Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter

Language:C++License:Apache-2.0Stargazers:2670Issues:0Issues:0

awesome-multi-task-learning

2024 up-to-date list of DATASETS, CODEBASES and PAPERS on Multi-Task Learning (MTL), from Machine Learning perspective.

Stargazers:603Issues:0Issues:0

LibMTL

A PyTorch Library for Multi-Task Learning

Language:PythonLicense:MITStargazers:1893Issues:0Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:130177Issues:0Issues:0

audiomentations

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Language:PythonLicense:MITStargazers:1780Issues:0Issues:0

riva-asrlib-decoder

Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva

Language:PythonStargazers:78Issues:0Issues:0

one-python-craftsman

来自一位 Pythonista 的编程经验分享,内容涵盖编码技巧、最佳实践与思维模式等方面。

License:Apache-2.0Stargazers:6769Issues:0Issues:0

python3-cookbook

《Python Cookbook》 3rd Edition Translation

Language:Jupyter NotebookStargazers:11563Issues:0Issues:0

python_cn_resouce

python书籍免费下载;python库参考;Django、Flask、FastAPI资源大全、DevOps资源大全、python测试库资源大全。公众号:pythontesting

Language:PythonStargazers:226Issues:0Issues:0

tinydiarize

Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens

Language:PythonLicense:MITStargazers:401Issues:0Issues:0