Daniel-NJ's starred repositories

everyone-can-use-english

人人都能用英语

Language:TypeScriptLicense:MPL-2.0Stargazers:21680Issues:0Issues:0

awesome-machine-learning-in-compilers

Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation

License:CC0-1.0Stargazers:1393Issues:0Issues:0

RDMA-Tutorial

A tutorial on RDMA based programming using code examples

Language:CLicense:Apache-2.0Stargazers:473Issues:0Issues:0

text-generation-inference

Large Language Model Text Generation Inference

Language:PythonLicense:Apache-2.0Stargazers:8407Issues:0Issues:0

mlc-llm

Universal LLM Deployment Engine with ML Compilation

Language:PythonLicense:Apache-2.0Stargazers:17804Issues:0Issues:0

rocHPL

High Performance Linpack for Next-Generation AMD HPC Accelerators

Language:C++License:NOASSERTIONStargazers:39Issues:0Issues:0

server

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Language:PythonLicense:BSD-3-ClauseStargazers:7784Issues:0Issues:0

whisper.cpp

Port of OpenAI's Whisper model in C/C++

Language:C++License:MITStargazers:33171Issues:0Issues:0

llama.cpp

LLM inference in C/C++

Language:C++License:MITStargazers:61789Issues:0Issues:0

hyperqueue

Scheduler for sub-node tasks for HPC systems with batch scheduling

Language:RustLicense:MITStargazers:268Issues:0Issues:0

Graph500

Repository contains scripts to run Graph500 benchmark on Salomon cluster

Language:ShellStargazers:1Issues:0Issues:0

dalai

The simplest way to run LLaMA on your local machine

Language:CSSStargazers:13098Issues:0Issues:0

paper-qa

LLM Chain for answering questions from documents with citations

Language:PythonLicense:Apache-2.0Stargazers:3792Issues:0Issues:0

zstd

Zstandard - Fast real-time compression algorithm

Language:CLicense:NOASSERTIONStargazers:22829Issues:0Issues:0

gpt4all

GPT4All: Chat with Local LLMs on Any Device

Language:C++License:MITStargazers:67476Issues:0Issues:0

gpgpu-sim_distribution

GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as well as a performance visualization tool, AerialVisoin, and an integrated energy model, GPUWattch.

Language:C++License:NOASSERTIONStargazers:1035Issues:0Issues:0

perf-ninja

This is an online course where you can learn and master the skill of low-level performance analysis and tuning.

Language:C++Stargazers:2314Issues:0Issues:0

perf-book

The book "Performance Analysis and Tuning on Modern CPU"

Language:TeXLicense:CC0-1.0Stargazers:1999Issues:0Issues:0

uarch-bench

A benchmark for low-level CPU micro-architectural features

Language:C++License:MITStargazers:669Issues:0Issues:0

langchain

🦜🔗 Build context-aware reasoning applications

Language:PythonLicense:MITStargazers:89295Issues:0Issues:0

simgrid

MIRROR of the SimGrid framework, for the simulation of distributed applications (Clouds, HPC, Grids, IoT and others). Most of the dev occurs on FramaGit.

Language:C++License:NOASSERTIONStargazers:162Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:1Issues:0Issues:0

sst-dumpi

SST DUMPI Trace Library

Language:CLicense:NOASSERTIONStargazers:14Issues:0Issues:0

sst-macro

SST Macro Element Library

Language:C++License:NOASSERTIONStargazers:33Issues:0Issues:0

vimllearn

A book for VimL Script language

Language:Vim ScriptLicense:NOASSERTIONStargazers:886Issues:0Issues:0

callpath

Library for representing callpaths consistently in distributed-memory performance tools.

Language:ShellLicense:NOASSERTIONStargazers:7Issues:0Issues:0

codon

A high-performance, zero-overhead, extensible Python compiler using LLVM

Language:C++License:NOASSERTIONStargazers:13981Issues:0Issues:0

HPCInfo

Information about many aspects of high-performance computing. Wiki content moved to ~/docs.

Language:CLicense:MITStargazers:265Issues:0Issues:0

compilerbook

compilerbook

Stargazers:43Issues:0Issues:0