Meng Wei's starred repositories
Microsoft-Activation-Scripts
Open-source Windows and Office activator featuring HWID, Ohook, KMS38, and Online KMS activation methods, along with advanced troubleshooting.
generative-ai-for-beginners
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
IDM-Activation-Script
IDM Activation & Trail Reset Script
copilot.vim
Neovim plugin for GitHub Copilot
RWKV-Runner
A RWKV management and startup tool, full automation, only 8MB. And provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for commercial use.
noise-suppression-for-voice
Noise suppression plugin based on Xiph's RNNoise
CTranslate2
Fast inference engine for Transformer models
DeepFilterNet
Noise supression using deep filtering
onnx-modifier
A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.
speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
ai00_server
A localized open-source AI server that is better than ChatGPT.
voice_activity_detection
Voice Activity Detection based on Deep Learning & TensorFlow
kaldi-active-grammar
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
onnxruntime-extensions
onnxruntime-extensions: A specialized pre- and post- processing library for ONNX Runtime
NeMo-text-processing
NeMo text processing for ASR and TTS
NOTSOFAR1-Challenge
NOTSOFAR-1 Challenge: Distant Diarization and ASR
Voice-Privacy-Challenge-2024
Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software
kaldi-decoder
Decoders from Kaldi using OpenFst
audio-speech-datasets
:scroll: A list of various Audio/Speech datasets about Speech Recognition, Speech Synthesis, Noise, Audio Tagging/Sound Event Detection, Speaker Diarization, Speaker Recognition, (Inverse) Text normalization, Speech Translation, Multilingual, etc. (continuously update)