windowxiaoming's starred repositories

llama.cpp

LLM inference in C/C++

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python | License: MPL-2.0 | Stargazers: 31873 | Issues: 273 | Issues: 1056

label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

Language: JavaScript | License: Apache-2.0 | Stargazers: 17370 | Issues: 174 | Issues: 2103

conv_arithmetic

A technical report on convolution arithmetic in the context of deep learning

luci

LuCI - OpenWrt Configuration Interface

Language: JavaScript | License: Apache-2.0 | Stargazers: 6087 | Issues: 269 | Issues: 2654

rnnoise

Recurrent neural network for audio noise reduction

Language: C | License: BSD-3-Clause | Stargazers: 3845 | Issues: 148 | Issues: 194

glow

Code for reproducing results in "Glow: Generative Flow with Invertible 1x1 Convolutions"

Language: Python | License: MIT | Stargazers: 3101 | Issues: 232 | Issues: 97

UER-py

Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo

Language: Python | License: Apache-2.0 | Stargazers: 2938 | Issues: 75 | Issues: 263

libqrencode

A fast and compact QR Code encoding library

Language: C | License: LGPL-2.1 | Stargazers: 2487 | Issues: 128 | Issues: 139

NJUCS

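Study materials for the 2023 Nanjing University Computer Science and Technology (845) postgraduate entrance exam, covering public and specialized subjects: Mathematics I, English I, Politics, Data Structures, Computer Networks, Fundamentals of Computer Systems, Operating Systems, and Algorithm Design and Analysis. Includes past exam papers, final exams, PPT slides, mock exams, reference textbooks with solutions, applicant-to-admission ratios, study experience notes, and more.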

mellotron

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 852 | Issues: 31 | Issues: 95

glow-tts

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

Language: Python | License: MIT | Stargazers: 651 | Issues: 20 | Issues: 73

melgan

MelGAN vocoder (compatible with NVIDIA/tacotron2)

Language: Python | License: BSD-3-Clause | Stargazers: 630 | Issues: 30 | Issues: 59

PESQ

PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band)

vadnet

Real-time Voice Activity Detection in Noisy Environments using Deep Neural Networks

Language: Python | License: LGPL-3.0 | Stargazers: 414 | Issues: 20 | Issues: 32

voicebook

🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).

Language: Python | License: Apache-2.0 | Stargazers: 371 | Issues: 25 | Issues: 25
Language: Python | License: Apache-2.0 | Stargazers: 341 | Issues: 31 | Issues: 39

DailyTalk

Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023 (Oral)

Language: Python | License: MIT | Stargazers: 190 | Issues: 7 | Issues: 3

resemble-unity-text-to-speech

Resemble's voice cloning engine within Unity

tacotron2-mandarin

TensorFlow implementation of Chinese/Mandarin TTS (Text-to-Speech) based on the Tacotron-2 model.

Language: Python | License: MIT | Stargazers: 127 | Issues: 7 | Issues: 11

online_calibration

An online calibration system for multiple sensors (camera, lidar, IMU). It is still under development.

Language: C++ | License: GPL-3.0 | Stargazers: 64 | Issues: 5 | Issues: 1

Vinorm

Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syllables

Language: Makefile | License: NOASSERTION | Stargazers: 46 | Issues: 3 | Issues: 9

video_decode_ffmpeg

Video decoder built on FFmpeg; solves the problem of an RTSP stream delivering invalid frames when the network is unstable.

EASY-EAI-Toolkit-C-Demo

EASY-EAI-Toolkit for EASY EAI NANO

Language: C | License: BSD-3-Clause | Stargazers: 22 | Issues: 1 | Issues: 0

ffmepgRtmp

Live stream pushing implemented with OpenCV + FFmpeg

Language: C | Stargazers: 8 | Issues: 0 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 1 | Issues: 0

Langchain-Chatchat

Langchain-Chatchat (formerly Langchain-ChatGLM): a local knowledge-base question-answering app built on Langchain and language models such as ChatGLM

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

learnopencv

Learn OpenCV: C++ and Python Examples

Stargazers: 1 | Issues: 0 | Issues: 0

Parallel-Tacotron2

PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Language: Python | License: MIT | Stargazers: 1 | Issues: 1 | Issues: 0