abdul dakkak's starred repositories
zsh-autosuggestions
Fish-like autosuggestions for zsh
lm-evaluation-harness
A framework for few-shot evaluation of language models.
awesome-generative-ai-guide
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
llama2.mojo
Inference Llama 2 in one file of pure 🔥
ctransformers
Python bindings for the Transformer models implemented in C/C++ using GGML library.
TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
distribution
Home of the JELOS Linux distribution.
how-to-optimize-gemm
row-major matmul optimization
metal-benchmarks
Apple GPU microarchitecture
YHs_Sample
Yinghan's Code Sample
MatmulTutorial
A Easy-to-understand TensorOp Matmul Tutorial
NVIDIA_SGEMM_PRACTICE
Step-by-step optimization of CUDA SGEMM
NBAssembler
Assembler and Decompiler for NVIDIA (Maxwell Pascal Volta Turing Ampere) GPUs.
gccontent-benchmark
Benchmarking different languages for a simple bioinformatics task (Counting the GC fraction of DNA in a FASTA file)
amd_matrix_instruction_calculator
A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators
llvm-tutor
A collection of out-of-tree LLVM passes for teaching and learning
MojoPkgWorkflow
This Repository shows how to use a simple GitHub Action script for compiling a mojo directory into a package.