w-okada's repositories

voice-changer

リアルタイムボイスチェンジャー Realtime Voice Changer

Language:PythonLicense:NOASSERTIONStargazers:18800Issues:134Issues:1145

image-analyze-workers

The zoo of image processing webworkers for javascript or typescript.

Language:PythonLicense:NOASSERTIONStargazers:151Issues:4Issues:8
Language:TypeScriptLicense:NOASSERTIONStargazers:29Issues:1Issues:0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:3Issues:0Issues:0
Language:TypeScriptLicense:MITStargazers:3Issues:0Issues:0

tinyvc

a lightweight voice conversion

Language:PythonLicense:Apache-2.0Stargazers:3Issues:1Issues:0
Language:PythonStargazers:3Issues:0Issues:0
Language:Jupyter NotebookStargazers:2Issues:2Issues:0

SenseVoice

Multilingual Voice Understanding Model

Language:PythonLicense:NOASSERTIONStargazers:2Issues:0Issues:0

APNet2

Source code of APNet2, a vocoder

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

beatrice-vst

声質変換 VST

Language:C++License:MITStargazers:1Issues:1Issues:0
Language:PythonLicense:MITStargazers:1Issues:1Issues:0

Real-Time-Latent-Consistency-Model

Demo showcasing ~real-time Latent Consistency Model pipeline with Diffusers and a MJPEG stream server

Language:PythonStargazers:1Issues:1Issues:0

seed-vc

State-of-the-Art zero-shot voice conversion & singing voice conversion, with real-time support

Language:PythonLicense:GPL-3.0Stargazers:1Issues:0Issues:0

StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Language:PythonLicense:MITStargazers:1Issues:1Issues:0
Language:TypeScriptStargazers:0Issues:0Issues:0

EfficientWord-Net

OneShot Learning-based hotword detection.

License:Apache-2.0Stargazers:0Issues:0Issues:0

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

LibreChat

Enhanced ChatGPT Clone: Features Agents, DeepSeek, Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code Interpreter, langchain, DALL-E-3, OpenAPI Actions, Functions, Secure Multi-User Auth, Presets, open-source for self-hosting. Active project.

License:MITStargazers:0Issues:0Issues:0

mastra

The TypeScript AI agent framework. ⚡ Assistants, RAG, observability. Supports any LLM: GPT-4, Claude, Gemini, Llama.

Language:TypeScriptLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:TypeScriptStargazers:0Issues:0Issues:0

obs-studio

OBS Studio - Free and open source software for live streaming and screen recording

Language:CLicense:GPL-2.0Stargazers:0Issues:1Issues:0

pdf.js

PDF Reader in JavaScript

Language:JavaScriptLicense:Apache-2.0Stargazers:0Issues:0Issues:0

porcupine

On-device wake word detection powered by deep learning

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

vad-web

Voice activity detector (VAD) for the browser

License:MITStargazers:0Issues:0Issues:0

Windows-driver-samples

This repo contains driver samples prepared for use with Microsoft Visual Studio and the Windows Driver Kit (WDK). It contains both Universal Windows Driver and desktop-only driver samples.

Language:CLicense:MS-PLStargazers:0Issues:1Issues:0
Language:TypeScriptStargazers:0Issues:2Issues:0