Sani's starred repositories
Semantic-Segment-Anything
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
anylabeling
Effortless AI-assisted data labeling with AI support from YOLO, Segment Anything, MobileSAM!!
StableDiffusion-CheatSheet
A list of StableDiffusion styles and some notes for offline use. Pure HTML, CSS and a bit of JS.
sqlite-vss
A SQLite extension for efficient vector search, based on Faiss!
sd-webui-inpaint-anything
Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
All-In-One-Deflicker
[CVPR2023] Blind Video Deflickering by Neural Filtering with a Flawed Atlas
speech-to-text
Real-time transcription using faster-whisper
XPhoneBERT
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)
nerf_bridge
ROS streaming of images and poses to nerfstudio.
aitools_client
Seth's AI Tools: A Unity based Stable Diffusion front-end for AUTOMATIC1111's WebUI focused on gamedev
nn-zero-to-hero-notes
Jupyter Notebook notes on Andrej Karpathy's tutorial series, "Neural Networks: Zero to Hero."
whisper-onnx-cpu
ONNX implementation of Whisper. PyTorch free.
zm-text-tts
[IJCAI'23] Learning to Speak from Text for Low-Resource TTS
PaddleOCR-ONNX-Sample
PaddleOCRのPythonでのONNX推論サンプル
VoicevoxPlayer
VoicevoxのUnreal Engine 4.27.2 ~ / Unreal Engine 5 プラグイン