w-okada

User data from GitHub https://github.com/w-okada

w-okada's repositories

voice-changer

Realtime Voice Changer

Language: Python | License: NOASSERTION | 18800 134 1145

image-analyze-workers

The zoo of image processing webworkers for javascript or typescript.

Language: TypeScript | 291 19 37

ttsclient

Language: Python | License: NOASSERTION | 151 4 8

beatrice-trainer-colab

Language: Jupyter Notebook | 30 1 2

asrclient

Language: TypeScript | License: NOASSERTION | 29 10

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language: Python | License: MIT | 300

mastra-realtime-voice-api-demo

Language: TypeScript | License: MIT | 300

tinyvc

a lightweight voice conversion

Language: Python | License: Apache-2.0 | 3 10

vcclient-for-zeroshot-server

Language: Python | 300

colab-file-uploader

Language: Jupyter Notebook | 2 20

SenseVoice

Multilingual Voice Understanding Model

Language: Python | License: NOASSERTION | 200

APNet2

Source code of APNet2, a vocoder

Language: Python | License: MIT | 1 10

beatrice-vst

Voice conversion VST

Language: C++ | License: MIT | 1 10

LLVC

Language: Python | License: MIT | 1 10

Real-Time-Latent-Consistency-Model

Demo showcasing ~real-time Latent Consistency Model pipeline with Diffusers and a MJPEG stream server

Language: Python | 1 10

seed-vc

State-of-the-Art zero-shot voice conversion & singing voice conversion, with real-time support

Language: Python | License: GPL-3.0 | 100

StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Language: Python | License: MIT | 1 10

beatrice-dataset-generator

Language: TypeScript | 000

EfficientWord-Net

OneShot Learning-based hotword detection.

License: Apache-2.0 | 000

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language: Python | License: NOASSERTION | 000

LibreChat

Enhanced ChatGPT Clone: Features Agents, DeepSeek, Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code Interpreter, langchain, DALL-E-3, OpenAPI Actions, Functions, Secure Multi-User Auth, Presets, open-source for self-hosting. Active project.

License: MIT | 000

mastra

The TypeScript AI agent framework. ⚡ Assistants, RAG, observability. Supports any LLM: GPT-4, Claude, Gemini, Llama.

Language: TypeScript | License: NOASSERTION | 000

mastra-realtime-voice-issue

Language: TypeScript | 000

obs-studio

OBS Studio - Free and open source software for live streaming and screen recording

Language: C | License: GPL-2.0 | 010

pdf.js

PDF Reader in JavaScript

Language: JavaScript | License: Apache-2.0 | 000

porcupine

On-device wake word detection powered by deep learning

License: Apache-2.0 | 000

RMVPE

Language: Python | 010

vad-web

Voice activity detector (VAD) for the browser

License: MIT | 000

Windows-driver-samples

This repo contains driver samples prepared for use with Microsoft Visual Studio and the Windows Driver Kit (WDK). It contains both Universal Windows Driver and desktop-only driver samples.

Language: C | License: MS-PL | 010

worker-manager

Language: TypeScript | 020