weimeng23

Nuitka is a Python compiler written in Python. It's fully compatible with Python 2.6, 2.7, 3.4-3.12. You feed it your Python app, it does a lot of clever things, and spits out an executable or extension module.

Language:PythonApache-2.011844 136 2421

IDM-Activation-Script

IDM Activation & Trail Reset Script

Language:BatchfileGPL-3.09479 77 25

copilot.vim

Neovim plugin for GitHub Copilot

Language:Vim ScriptNOASSERTION8363 1220

RWKV-Runner

A RWKV management and startup tool, full automation, only 8MB. And provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for commercial use.

Language:TypeScriptMIT5126 43 359

noise-suppression-for-voice

Noise suppression plugin based on Xiph's RNNoise

Language:C++GPL-3.04882 57 169

MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

Language:PythonMIT4594 39 171

CTranslate2

Fast inference engine for Transformer models

Language:C++MIT3296 57 696

DeepFilterNet

Noise supression using deep filtering

Language:PythonNOASSERTION2418 31 276

ngram

The n-gram Language Model

Language:C1320 500

onnx-modifier

A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.

Language:JavaScriptMIT1302 12 103

speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Language:PythonMIT897 23 33

ai00_server

A localized open-source AI server that is better than ChatGPT.

Language:RustMIT468 15 64

bytepiece

更纯粹、更高压缩率的Tokenizer

Language:PythonApache-2.0443 9 16

voice_activity_detection

Voice Activity Detection based on Deep Learning & TensorFlow

Language:PythonGPL-3.0352 12 15

kaldi-active-grammar

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

Language:PythonAGPL-3.0337 30 62

onnxruntime-extensions

onnxruntime-extensions: A specialized pre- and post- processing library for ONNX Runtime

Language:C++MIT325 30 159

g2pW

Chinese Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音 (INTERSPEECH 2022)

Language:PythonApache-2.0278 5 17

NeMo-text-processing

NeMo text processing for ASR and TTS

Language:PythonApache-2.0270 15 35

NOTSOFAR1-Challenge

NOTSOFAR-1 Challenge: Distant Diarization and ASR

Language:PythonMIT42 14 9

Voice-Privacy-Challenge-2024

Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software

Language:PythonNOASSERTION39 3 13

kaldi-decoder

Decoders from Kaldi using OpenFst

Language:C++Apache-2.024 5 3

audio-speech-datasets

:scroll: A list of various Audio/Speech datasets about Speech Recognition, Speech Synthesis, Noise, Audio Tagging/Sound Event Detection, Speaker Diarization, Speaker Recognition, (Inverse) Text normalization, Speech Translation, Multilingual, etc. (continuously update)

CC-BY-SA-4.02 10