w-okada's repositories
voice-changer
リアルタイムボイスチェンジャー Realtime Voice Changer
image-analyze-workers
The zoo of image processing webworkers for javascript or typescript.
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
SenseVoice
Multilingual Voice Understanding Model
beatrice-vst
声質変換 VST
Real-Time-Latent-Consistency-Model
Demo showcasing ~real-time Latent Consistency Model pipeline with Diffusers and a MJPEG stream server
EfficientWord-Net
OneShot Learning-based hotword detection.
FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
LibreChat
Enhanced ChatGPT Clone: Features Agents, DeepSeek, Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code Interpreter, langchain, DALL-E-3, OpenAPI Actions, Functions, Secure Multi-User Auth, Presets, open-source for self-hosting. Active project.
mastra
The TypeScript AI agent framework. ⚡ Assistants, RAG, observability. Supports any LLM: GPT-4, Claude, Gemini, Llama.
obs-studio
OBS Studio - Free and open source software for live streaming and screen recording
pdf.js
PDF Reader in JavaScript
porcupine
On-device wake word detection powered by deep learning
vad-web
Voice activity detector (VAD) for the browser
Windows-driver-samples
This repo contains driver samples prepared for use with Microsoft Visual Studio and the Windows Driver Kit (WDK). It contains both Universal Windows Driver and desktop-only driver samples.