Akash Mahajan's starred repositories
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
promptbase
All things prompt engineering
alignment-handbook
Robust recipes to align language models with human and AI preferences
deepdoctection
A Repo For Document AI
webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
tamil-llama
A New Tamil Large Language Model (LLM) Based on Llama 2
vidore-benchmark
Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.
openhathi_instruct
This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resulting model is meant to follow instructions and chat in Hindi and Hinglish.