Matin Mahmood's starred repositories
screenshot-to-code
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
activitywatch
The best free and open-source automated time tracker. Cross-platform, extensible, privacy-focused.
labelformat
A tool for converting computer vision label formats.
ydata-profiling
One line of code for data quality profiling and exploratory data analysis of Pandas and Spark DataFrames.
gpt-code-ui
An open source implementation of OpenAI's ChatGPT Code interpreter
embedchain
Memory for AI agents
dalle-flow
A Human-in-the-Loop workflow for creating HD images from text
WavJourney
WavJourney: Compositional Audio Creation with LLMs
video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
video-clip
Let's make a video clip
faster-whisper
Faster Whisper transcription with CTranslate2
customized-voice-text-bot-for-whatsapp-telegram
Customized Voice and Text Chatbot fully integrated with a database (Cloudant and COS), using Watson Services and deployed to IBM Cloud with Code Engine.
naturalspeech
A fully working PyTorch implementation of NaturalSpeech (Tan et al., 2022)
audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in PyTorch
vector-quantize-pytorch
Vector (and Scalar) Quantization, in PyTorch
PowerGridworld
PowerGridworld provides users with a lightweight, modular, and customizable framework for creating power-systems-focused, multi-agent Gym environments that readily integrate with existing training frameworks for reinforcement learning (RL). https://arxiv.org/abs/2111.05969
naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in PyTorch
XPhoneBERT
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)
python-jobsearch
AI-based job search in Python
babyagi-asi
BabyAGI: an Autonomous and Self-Improving agent, or BASI