Daniel-NJ's starred repositories
everyone-can-use-english
人人都能用英语
awesome-machine-learning-in-compilers
Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation
RDMA-Tutorial
A tutorial on RDMA based programming using code examples
text-generation-inference
Large Language Model Text Generation Inference
whisper.cpp
Port of OpenAI's Whisper model in C/C++
hyperqueue
Scheduler for sub-node tasks for HPC systems with batch scheduling
gpgpu-sim_distribution
GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as well as a performance visualization tool, AerialVisoin, and an integrated energy model, GPUWattch.
perf-ninja
This is an online course where you can learn and master the skill of low-level performance analysis and tuning.
uarch-bench
A benchmark for low-level CPU micro-architectural features
Faithful-and-Efficient-Simulation-of-High-Performance-Linpack
Artifacts for the eponymous paper
compilerbook
compilerbook