Daniel-NJ

followers

following

stars

Daniel-NJ's starred repositories

everyone-can-use-english

人人都能用英语

Language:TypeScriptMPL-2.02168000

awesome-machine-learning-in-compilers

Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation

CC0-1.0139300

RDMA-Tutorial

A tutorial on RDMA based programming using code examples

Language:CApache-2.047300

text-generation-inference

Large Language Model Text Generation Inference

Language:PythonApache-2.0840700

mlc-llm

Universal LLM Deployment Engine with ML Compilation

Language:PythonApache-2.01780400

rocHPL

High Performance Linpack for Next-Generation AMD HPC Accelerators

Language:C++NOASSERTION3900

server

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Language:PythonBSD-3-Clause778400

whisper.cpp

Port of OpenAI's Whisper model in C/C++

Language:C++MIT3317100

llama.cpp

LLM inference in C/C++

Language:C++MIT6178900

hyperqueue

Scheduler for sub-node tasks for HPC systems with batch scheduling

Language:RustMIT26800

Graph500

Repository contains scripts to run Graph500 benchmark on Salomon cluster

Language:Shell100

dalai

The simplest way to run LLaMA on your local machine

Language:CSS1309800

paper-qa

LLM Chain for answering questions from documents with citations

Language:PythonApache-2.0379200

zstd

Zstandard - Fast real-time compression algorithm

Language:CNOASSERTION2282900

gpt4all

GPT4All: Chat with Local LLMs on Any Device

Language:C++MIT6747600

gpgpu-sim_distribution

GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as well as a performance visualization tool, AerialVisoin, and an integrated energy model, GPUWattch.

Language:C++NOASSERTION103500

perf-ninja

This is an online course where you can learn and master the skill of low-level performance analysis and tuning.

Language:C++231400

perf-book

The book "Performance Analysis and Tuning on Modern CPU"

Language:TeXCC0-1.0199900

uarch-bench

A benchmark for low-level CPU micro-architectural features

Language:C++MIT66900

langchain

🦜🔗 Build context-aware reasoning applications

Language:PythonMIT8929500

simulating_mpi_applications_at_scale

Language:TeX200

simgrid

MIRROR of the SimGrid framework, for the simulation of distributed applications (Clouds, HPC, Grids, IoT and others). Most of the dev occurs on FramaGit.

Language:C++NOASSERTION16200

Faithful-and-Efficient-Simulation-of-High-Performance-Linpack

Artifacts for the eponymous paper

Language:Jupyter NotebookMIT100

sst-dumpi

SST DUMPI Trace Library

Language:CNOASSERTION1400

sst-macro

SST Macro Element Library

Language:C++NOASSERTION3300

vimllearn

A book for VimL Script language

Language:Vim ScriptNOASSERTION88600

callpath

Library for representing callpaths consistently in distributed-memory performance tools.

Language:ShellNOASSERTION700

codon

A high-performance, zero-overhead, extensible Python compiler using LLVM

Language:C++NOASSERTION1398100

HPCInfo

Information about many aspects of high-performance computing. Wiki content moved to ~/docs.

Language:CMIT26500

compilerbook

compilerbook

4300