Donny's starred repositories
stable-diffusion
A latent text-to-image diffusion model
realtime-transcribe
Transcribe your speech or the audio playing on your computer with Whisper in realtime, and show the captions on your screen.
fsdp_qlora
Training LLMs with QLoRA + FSDP
voice-changer
リアルタイムボイスチェンジャー Realtime Voice Changer
taming-transformers
Taming Transformers for High-Resolution Image Synthesis
gpt2-japanese
Japanese GPT2 Generation Model
sketch_simplification
Models and code related to sketch simplification of rough sketches.
DeepSpeech-examples
Examples of how to use or integrate DeepSpeech
edge-connect
EdgeConnect: Structure Guided Image Inpainting using Edge Prediction, ICCV 2019 https://arxiv.org/abs/1901.00212
nlp-survey-text2image
NLP & imageについて幅広くサーベイする。
multiple-objects-gan
Implementation for "Generating Multiple Objects at Spatially Distinct Locations" (ICLR 2019)