Abul22's starred repositories
supervision
We write your reusable computer vision tools. 💜
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
spotify-downloader
Download your Spotify playlists and songs along with album art and metadata (from YouTube if a match is found).
EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
GooglePhotosTakeoutHelper
Script that organizes the Google Takeout archive into one big chronological folder
StableSwarmUI
StableSwarmUI, A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
clarity-upscaler
Clarity AI | AI Image Upscaler & Enhancer - free and open-source Magnific Alternative
WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.
sqlite-web
Web-based SQLite database browser written in Python
SecureAI-Tools
Private and secure AI tools for everyone's productivity.
WSLHostPatcher
Dynamic patch WSL2 to listen port on any interface.
ComfyTextures
Unreal Engine ⚔️ ComfyUI - Automatic texturing using generative diffusion models
quickviewer
A image/comic viewer application for Windows, Mac and Linux, it can show images very fast
BrowserGPT
Command your browser with GPT
Auto1111SDK
An SDK/Python library for Automatic 1111 to run state-of-the-art diffusion models
libav.wasm
libav WebAssembly port
Thinkremote
Personal cloud computing is the technology stack that allows you to access a computer remotely