Md Sumsuddin's repositories
ATIS_dataset
The ATIS (Airline Travel Information System) Dataset
CompreFace
Leading free and open-source face recognition system
devika
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.
doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
Face-Liveness-Detection-SDK-Android
Robust, Realtime, On-Device Face Liveness Detection (Face Anti-Spoofing) for Android
facefusion
Next generation face swapper and enhancer
llm-numbers
Numbers every LLM developer should know
magika
Detect file content types with deep learning
PhotoMaker
PhotoMaker
privateGPT
Interact with your documents using the power of GPT, 100% privately, with no data leaks
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
quivr
🧠 Dump all your files and chat with them using your Generative AI Second Brain, powered by LLMs (GPT 3.5/4, Private, Anthropic, VertexAI) & Embeddings 🧠
resumake.io
📝 A website for automatically generating elegant LaTeX resumes.
segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
whisper.cpp
Port of OpenAI's Whisper model in C/C++