MuhammadShifa

Muhammad Shifa's repositories

awesome-yolo-object-detection

🚀🚀🚀 A collection of awesome public projects from the YOLO object detection series.

1 · 0 · 0

CodeFormer

PyTorch code for "Towards Robust Blind Face Restoration with Codebook Lookup Transformer" (NeurIPS 2022)

Language: Python · License: NOASSERTION · 1 · 0 · 0

Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and deployment on server, mobile, embedded, and IoT devices)

Language: Python · License: Apache-2.0 · 1 · 0 · 0

tts-arabic-pytorch

TTS models for Arabic (Tacotron2, FastPitch)

Language: Jupyter Notebook · 1 · 0 · 0

90DaysOfDevOps

This repository documents my journey into the world of DevOps. I started on 1 January 2022 and plan to run until 31 March, a complete 90-day romp of an hour a day, weekends included, to build foundational knowledge across the many areas that make up DevOps.

Language: Shell · License: NOASSERTION · 0 · 0 · 0

ChatWithOllama

Chat with Ollama models through a browser-based application.

Language: Python · 0 · 0 · 0

aiortc

WebRTC and ORTC implementation for Python using asyncio

Language: Python · License: BSD-3-Clause · 0 · 0 · 0

buns_counter

0 · 0 · 0

colorful

Terminal string styling done right, in Python 🐍 🎉

Language: Python · License: MIT · 0 · 0 · 0

fcdd

Repository for the Explainable Deep One-Class Classification paper

Language: Python · License: MIT · 0 · 0 · 0

generative-ai-for-beginners

18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

License: MIT · 0 · 0 · 0

hand-gesture-recognition-using-mediapipe

Estimate hand pose using MediaPipe (Python version). This is a sample program that recognizes hand signs and finger gestures with a simple MLP using the detected key points.

Language: Jupyter Notebook · License: Apache-2.0 · 0 · 0 · 0

INTERSPEECH-2023-Papers

INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!

License: MIT · 0 · 0 · 0

learnopencv

Learn OpenCV: C++ and Python Examples

Language: Jupyter Notebook · 0 · 0 · 0

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

License: Apache-2.0 · 0 · 0 · 0

MuhammadShifa

My Personal Profile

0 · 1 · 0

NeMo

NeMo: a toolkit for conversational AI

Language: Python · License: Apache-2.0 · 0 · 0 · 0

portfolio

Website URL

Language: HTML · 0 · 0 · 0

py.gpt.prompt

PyGPTPrompt: A CLI tool that manages context windows for AI models, facilitating user interaction and data ingestion for optimized long-term memory and task automation.

License: NOASSERTION · 0 · 0 · 0

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language: Python · License: NOASSERTION · 0 · 0 · 0

serverless-ml-course

Serverless ML Course for building AI-enabled Prediction Services from models and features

Language: Jupyter Notebook · License: CC0-1.0 · 0 · 0 · 0

silero-models

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Language: Jupyter Notebook · License: NOASSERTION · 0 · 0 · 0

stable-diffusion-webui

Stable Diffusion web UI

Language: Python · License: AGPL-3.0 · 0 · 0 · 0

streamlit-webrtc

Real-time video and audio streams over the network, with Streamlit.

Language: Python · License: MIT · 0 · 0 · 0

vall-e

PyTorch implementation of VALL-E (zero-shot text-to-speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html

Language: Python · License: Apache-2.0 · 0 · 0 · 0

VALL-E-X

An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io

Language: Python · License: MIT · 0 · 0 · 0

Visual-Inspection

Explainable Defect Detection using Convolutional Neural Networks: Case Study

Language: Python · 0 · 0 · 0

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Language: Python · License: MIT · 0 · 0 · 0

website

Source for https://fullstackdeeplearning.com

Language: HTML · 0 · 0 · 0

wunjo.wladradchenko.ru

Wunjo AI: Synthesize & clone voices in English, Russian & Chinese, real-time speech recognition, deepfake face & lips animation, face swap with one photo, change video by text prompts, segmentation, and retouching. Open-source, local & free.

Language: Python · License: MIT · 0 · 0 · 0