Ankush Malaker's starred repositories
DeepSpeech
DeepSpeech is an open-source embedded (offline, on-device) speech-to-text engine that can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers.
activitywatch
The best free and open-source automated time tracker. Cross-platform, extensible, privacy-focused.
segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
arithmetic
Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al. (2024)
annotated_research_papers
This repo contains annotated research papers that I found really good and useful
Phi-3CookBook
This is a cookbook for getting started with Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and the next size up across a variety of language, reasoning, coding, and math benchmarks.
awesome-llm-apps
Collection of awesome LLM apps with RAG using OpenAI, Anthropic, Gemini, and open-source models.
khoj
Your AI second brain. Get answers to your questions, whether they are online or in your own notes. Use online AI models (e.g. gpt4) or private, local LLMs (e.g. llama3). Self-host locally or use our cloud instance. Access from Obsidian, Emacs, the desktop app, the web, or WhatsApp.
dynamic-superb
The official repository of Dynamic-SUPERB.
vision-agent
Vision agent
granite-code-models
Granite Code Models: A Family of Open Foundation Models for Code Intelligence
open-webui
User-friendly WebUI for LLMs (formerly Ollama WebUI)
wyoming-satellite
Remote voice satellite using Wyoming protocol
Neighborhood-Attention-Transformer
Neighborhood Attention Transformer, arXiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arXiv 2022
mmdetection
OpenMMLab Detection Toolbox and Benchmark