Muhammad Shifa's repositories
awesome-yolo-object-detection
๐๐๐ A collection of some awesome public YOLO object detection series projects.
CodeFormer
PyTorch codes for "Towards Robust Blind Face Restoration with Codebook Lookup Transformer" (NeurIPS 2022)
tts-arabic-pytorch
TTS models for Arabic (Tacotron2, FastPitch)
90DaysOfDevOps
This repository is my documenting repository for learning the world of DevOps. I started this journey on the 1st January 2022 and I plan to run to March 31st for a complete 90-day romp on spending an hour a day including weekends to get a foundational knowledge across a lot of different areas that make up DevOps.
ChatWithOllama
let's chat/talk with Ollama models through browser based application.
aiortc
WebRTC and ORTC implementation for Python using asyncio
colorful
Terminal string styling done right, in Python :snake: :tada:
fcdd
Repository for the Explainable Deep One-Class Classification paper
generative-ai-for-beginners
18 Lessons, Get Started Building with Generative AI ๐ https://microsoft.github.io/generative-ai-for-beginners/
hand-gesture-recognition-using-mediapipe
MediaPipe(Python็)ใ็จใใฆๆใฎๅงฟๅขๆจๅฎใ่กใใๆคๅบใใใญใผใใคใณใใ็จใใฆใ็ฐกๆใชMLPใงใใณใใตใคใณใจใใฃใณใฌใผใธใงในใใฃใผใ่ช่ญใใใตใณใใซใใญใฐใฉใ ใงใใ๏ผEstimate hand pose using MediaPipe(Python version). This is a sample program that recognizes hand signs and finger gestures with a simple MLP using the detected key points.๏ผ
INTERSPEECH-2023-Papers
INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
learnopencv
Learn OpenCV : C++ and Python Examples
llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
MuhammadShifa
My Personal Profile
NeMo
NeMo: a toolkit for conversational AI
portfolio
Website URL
py.gpt.prompt
PyGPTPrompt: A CLI tool that manages context windows for AI models, facilitating user interaction and data ingestion for optimized long-term memory and task automation.
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
serverless-ml-course
Serverless ML Course for building AI-enabled Prediction Services from models and features
silero-models
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
stable-diffusion-webui
Stable Diffusion web UI
streamlit-webrtc
Real-time video and audio streams over the network, with Streamlit.
vall-e
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
Visual-Inspection
Explainable Defect Detection using Convolutional Neural Networks: Case Study
vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
website
Source for https://fullstackdeeplearning.com
wunjo.wladradchenko.ru
Wunjo AI: Synthesize & clone voices in English, Russian & Chinese, real-time speech recognition, deepfake face & lips animation, face swap with one photo, change video by text prompts, segmentation, and retouching. Open-source, local & free.