Mahmoud Zamani's repositories
amazon-scraper
Free Trial Amazon Scraper API for extracting search, product, offer listing, reviews, question and answers, best sellers and sellers data.
autogen
A programming framework for agentic AI 🤖
Autogen_GraphRAG_Ollama
Microsoft's GraphRAG + AutoGen + Ollama + Chainlit = Fully Local & Free Multi-Agent RAGÂ Superbot
AutoGPTQ
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
aws-neuron-sdk
Powering AWS purpose-built machine learning chips. Blazing fast and cost effective, natively integrated into PyTorch and TensorFlow and integrated with your favorite AWS services
bringup-bench
Bringup-Bench is a collection of standalone minimal library and system dependence benchmarks useful for bringing up newly designed CPUs, accelerators, compilers and operating systems. You probably don't need Bringup-Bench, but if you do, you probably need it badly!
causality-lab
Causal discovery algorithms and tools for implementing new ones
cutlass
CUDA Templates for Linear Algebra Subroutines
dbrx
Code examples and resources for DBRX, a large language model developed by Databricks
EconML
ALICE (Automated Learning and Intelligence for Causation and Economics) is a Microsoft Research project aimed at applying Artificial Intelligence concepts to economic decision making.
garnet
Garnet is a remote cache-store from Microsoft Research that offers strong performance (throughput and latency), scalability, storage, recovery, cluster sharding, key migration, and replication features. Garnet can work with existing Redis clients.
Gaudi-tutorials
Tutorials for running models on First-gen Gaudi and Gaudi2 for Training and Inference. The source files for the tutorials on https://developer.habana.ai/
glow
Compiler for Neural Network hardware accelerators
gorilla
Gorilla: An API store for LLMs
ibex
Ibex is a small 32 bit RISC-V CPU core, previously known as zero-riscy.
kineto
A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.
LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
loghub
A large collection of system log datasets for AI-driven log analytics [ISSRE'23]
metropolis-nim-workflows
Collection of reference workflows for building intelligent agents with NIMs
Model-References
TensorFlow and PyTorch Reference models for Gaudi(R)
MultiCoT
Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph
parler-tts
Inference and training library for high-quality TTS models.
qdrant
Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
serve
Serve, optimize and scale PyTorch models in production
spacy-llm
🦙 Integrating LLMs into structured NLP pipelines
spark-rapids
Spark RAPIDS plugin - accelerate Apache Spark with GPUs
targeted-causal-reduction
TCR is a method for explaining a phenomenon (called target) in high-dimensional simulations (low-level model) by learning a low-dimensional causal model (high-level model) that captures the most important causes of the target. It uses shift interventions in the low-level model and its effects on the target to learn the high-level model.
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs