Akash Sonowal's starred repositories
Made-With-ML
Learn how to design, develop, deploy and iterate on production-grade ML applications.
whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
llm-applications
A comprehensive guide to building RAG-based LLM applications for production.
ctransformers
Python bindings for the Transformer models implemented in C/C++ using GGML library.
devika
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.
lightning-thunder
Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors at once; across one or thousands of GPUs.
segment-anything-fast
A batched offline inference oriented version of segment-anything
DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
kubernetes
Production-Grade Container Scheduling and Management
ml-deployment-k8s-fastapi
This project shows how to serve an ONNX-optimized image classification model as a web service with FastAPI, Docker, and Kubernetes.
deploy-hf-tf-vision-models
This repository shows various ways of deploying a vision model (TensorFlow) from 🤗 Transformers.
you-dont-know-tensorflow
Contains materials for my talk "You don't know TensorFlow".
diffusion-fast
Faster generation with text-to-image diffusion models.
full-stack-fastapi-template
Full stack, modern web application template. Using FastAPI, React, SQLModel, PostgreSQL, Docker, GitHub Actions, automatic HTTPS and more.
distributed-churn-prediction
End-to-end customer churn prediction pipeline using Spark.