Mingli Wu (wmlgl)

wmlgl

Geek Repo

Location:GuangZhou, CN

Github PK Tool:Github PK Tool

Mingli Wu's starred repositories

LaikeTui

来客推商城系统, [ 微信 + 支付宝 + 百度 + 头条 ] 小程序 + APP + 公众号 + PC + H5,注重界面美感与用户体验,打造独特电商系统生态圈,不可多得的二开神器。 【JAVA商城 PHP商城系统 uniapp商城系统 分销商城 多用户商城 SaaS O2O商城 B2B2C S2B2C 小程序直播 商城源码 跨境电商系统 社区团购】

Language:PLpgSQLLicense:Apache-2.0Stargazers:812Issues:0Issues:0

ray-so

Create code snippets, browse AI prompts, create extension icons and more.

Language:TypeScriptLicense:MITStargazers:1167Issues:0Issues:0

floating_ball

基于pyside6开发的windows平台悬浮球工具

Language:PythonLicense:MITStargazers:12Issues:0Issues:0

OpenHMD

Free and Open Source API and drivers for immersive technology.

Language:CLicense:BSL-1.0Stargazers:1212Issues:0Issues:0

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language:PythonLicense:NOASSERTIONStargazers:5529Issues:0Issues:0

whisper-export

openvino version of openai/whisper

Language:Jupyter NotebookLicense:MITStargazers:10Issues:0Issues:0

whisper-openvino

openvino version of openai/whisper

Language:Jupyter NotebookLicense:MITStargazers:153Issues:0Issues:0

ailia-models

The collection of pre-trained, state-of-the-art AI models for ailia SDK

Language:PythonStargazers:1953Issues:0Issues:0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:66163Issues:0Issues:0

sherpa-onnx

Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter

Language:C++License:Apache-2.0Stargazers:2949Issues:0Issues:0

sherpa-ncnn

Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, LicheePi4A etc.

Language:C++License:Apache-2.0Stargazers:949Issues:0Issues:0

whisper.cpp

Port of OpenAI's Whisper model in C/C++

Language:C++License:MITStargazers:33831Issues:0Issues:0

sonic

Simple library to speed up or slow down speech

Language:CLicense:Apache-2.0Stargazers:606Issues:0Issues:0

ComfyUI-Impact-Pack

Custom nodes pack for ComfyUI This custom node helps to conveniently enhance images through Detector, Detailer, Upscaler, Pipe, and more.

Language:PythonLicense:GPL-3.0Stargazers:1600Issues:0Issues:0

CushyStudio

🛋 The AI and Generative Art platform for everyone

Language:TypeScriptLicense:AGPL-3.0Stargazers:643Issues:0Issues:0

StreamDiffusion

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Language:PythonLicense:Apache-2.0Stargazers:9395Issues:0Issues:0

ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Language:PythonLicense:GPL-3.0Stargazers:47162Issues:0Issues:0

latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Language:PythonLicense:MITStargazers:4250Issues:0Issues:0

subclipse

Subclipse - Eclipse SVN Provider

Language:JavaLicense:EPL-1.0Stargazers:455Issues:0Issues:0

web-voice-changer

web voice changer sample by web api and tone.js

Language:TypeScriptStargazers:2Issues:0Issues:0

VirtualWife

VirtualWife是一个虚拟数字人项目,支持B站直播,支持openai、ollama

Language:PythonLicense:MITStargazers:1393Issues:0Issues:0

OpenLive3D.core

The core of the motion capture part of OpenLive3D

Language:JavaScriptLicense:Apache-2.0Stargazers:6Issues:0Issues:0

kalidoface-3d

Face and Body Tracking for VRM 3D models on the web.

Language:HTMLLicense:NOASSERTIONStargazers:422Issues:0Issues:0

KAN-TTS

KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech

Language:PythonLicense:MITStargazers:476Issues:0Issues:0

PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Language:PythonLicense:Apache-2.0Stargazers:10794Issues:0Issues:0

vits-simple-api

A simple VITS HTTP API, developed by extending Moegoe with additional features.

Language:PythonLicense:AGPL-3.0Stargazers:767Issues:0Issues:0

vits-finetuning

Fine-Tuning your VITS model using a pre-trained model

Language:PythonLicense:MITStargazers:540Issues:0Issues:0

vrm-dance-viewer

VRM HTML5 Viewer with VMD motion files support

Language:TypeScriptLicense:MITStargazers:50Issues:0Issues:0

Bert-VITS2

vits2 backbone with multilingual-bert

Language:PythonLicense:AGPL-3.0Stargazers:7672Issues:0Issues:0

VITS-fast-fine-tuning

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Language:PythonLicense:Apache-2.0Stargazers:4683Issues:0Issues:0