Shubham Kushwaha's repositories
bench-llama
benchmarking sota inference speedups for Apple Silicon first citizens
Hand-Pose-Estimation
This project is centered around hand pose estimation and real time sign language translator using Google's MediaPipe library.
prayog
An LLM eval suite with Indic evals
prayog-IndicInstruct
Indic evals for quantised models AWQ / GPTQ / EXL2
AI-Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
arxiv-style
A Latex style and template for paper preprints (based on NIPS style)
Awesome-EdgeAI
Resources of our survey paper "A Systematic Review of AI Deployment on Resource-Constrained Edge Devices: Challenges, Techniques, and Applications"
DALM
Domain Adapted Language Modeling Toolkit
edge-ai
A curated list of resources for embedded AI
FLAP
Pruned Transformers - beta
LLM-Pruner
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support LLaMA, Llama-2, BLOOM, Vicuna, Baichuan, etc.
LLMs-from-scratch
Implementing a ChatGPT-like LLM from scratch, step by step
local.ai
🎒 local.ai - Run AI locally on your PC!
LocalAI
:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed inference
mindgraph
proof of concept prototype for generating and querying against a large knowledge graph with ai
python-project
A clean, extensible, python project template
repo_level_retrieval
Repository of the code base for KT Generation process that we worked at Google Cloud and Searce GenAI Hackathon.
WebextLLM
Web extension that embeds LLMs in your browser to power AI in web apps
window.ai
Use your own AI models on the web
wllama
WebAssembly binding for llama.cpp - Enabling in-browser LLM inference