Hussein Lezzaik's starred repositories
llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
StreamDiffusion
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
mlx-examples
Examples in the MLX framework
Awesome-LLM-Robotics
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
mamba-minimal
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
Pytorch-Project-Template
A scalable template for PyTorch projects, with examples in Image Segmentation, Object classification, GANs and Reinforcement Learning.
LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
mixtral-inference
inference code for mixtral-8x7b-32kseqlen
nanoGPT_mlx
Port of Andrej Karpathy's nanoGPT to Apple MLX framework.
taylor-series-linear-attention
Explorations into the recently proposed Taylor Series Linear Attention
Together-API-Basics
Some information for working with the Together inference API for Open Source AI models