Beast code in Giters

Mingli Wu's starred repositories

LaikeTui

来客推商城系统， [ 微信 + 支付宝 + 百度 + 头条 ] 小程序 + APP + 公众号 + PC + H5，注重界面美感与用户体验，打造独特电商系统生态圈，不可多得的二开神器。【JAVA商城 PHP商城系统 uniapp商城系统分销商城多用户商城 SaaS O2O商城 B2B2C S2B2C 小程序直播商城源码跨境电商系统社区团购】

Language:PLpgSQLApache-2.081200

ray-so

Create code snippets, browse AI prompts, create extension icons and more.

Language:TypeScriptMIT116700

floating_ball

基于pyside6开发的windows平台悬浮球工具

Language:PythonMIT1200

OpenHMD

Free and Open Source API and drivers for immersive technology.

Language:CBSL-1.0121200

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language:PythonNOASSERTION552900

whisper-export

openvino version of openai/whisper

Language:Jupyter NotebookMIT1000

whisper-openvino

openvino version of openai/whisper

Language:Jupyter NotebookMIT15300

ailia-models

The collection of pre-trained, state-of-the-art AI models for ailia SDK

Language:Python195300

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonMIT6616300

Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter

Language:C++Apache-2.0294900

sherpa-ncnn

Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, LicheePi4A etc.

Language:C++Apache-2.094900

whisper.cpp

Port of OpenAI's Whisper model in C/C++

Language:C++MIT3383100

sonic

Simple library to speed up or slow down speech

Language:CApache-2.060600

ComfyUI-Impact-Pack

Custom nodes pack for ComfyUI This custom node helps to conveniently enhance images through Detector, Detailer, Upscaler, Pipe, and more.

Language:PythonGPL-3.0160000

CushyStudio

🛋 The AI and Generative Art platform for everyone

Language:TypeScriptAGPL-3.064300

StreamDiffusion

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Language:PythonApache-2.0939500

ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Language:PythonGPL-3.04716200

latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Language:PythonMIT425000

subclipse

Subclipse - Eclipse SVN Provider

Language:JavaEPL-1.045500

web-voice-changer

web voice changer sample by web api and tone.js

Language:TypeScript200

VirtualWife

VirtualWife是一个虚拟数字人项目，支持B站直播，支持openai、ollama

Language:PythonMIT139300

OpenLive3D.core

The core of the motion capture part of OpenLive3D

Language:JavaScriptApache-2.0600

kalidoface-3d

Face and Body Tracking for VRM 3D models on the web.

Language:HTMLNOASSERTION42200

KAN-TTS

KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech

Language:PythonMIT47600

PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Language:PythonApache-2.01079400

wmlgl