Yeshua WB III's repositories
AI-Waifu-Vtuber
AI VTuber for streaming on YouTube/Twitch
AutobiographyApp
My biographical app uses an offline-first architecture with MVVM and the repository pattern, along with popular libraries such as Retrofit, Glide, and Room. I also use LiveData, Flow, ViewBinding, and Data Binding.
Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
Bark_text-to-speech
Playground with Bark
controllable_talknet_server
A server for providing a webservice interface to Controllable TalkNet
ControllableTalkNet
A web app that lets you play around with TalkNet models
InvokeAI
InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, supports terminal use through a CLI, and serves as the foundation for multiple commercial products.
silero-models
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
SillyTavern-extras
Extensions API for SillyTavern
speech-generation-webui
A simple web UI for Suno-AI Bark
talknet-hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
teach-anything
Teaches you anything in seconds (by OpenAI)
TTS-Voice-Wizard
Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System)
Vlad-O-matic
Opinionated fork/implementation of Stable Diffusion
voicevox_engine
The speech synthesis engine of VOICEVOX, a free-to-use, medium-quality text-to-speech software