Honggyu Kim's starred repositories
open-gpu-kernel-modules
NVIDIA Linux open GPU kernel module source
onnxruntime
ONNX Runtime: cross-platform, high-performance ML inferencing and training accelerator
llama-cpp-python
Python bindings for llama.cpp
ipex-llm
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, Phi, etc.) on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, DeepSpeed, vLLM, FastChat, Axolotl, etc.
FasterTransformer
Transformer-related optimizations, including BERT and GPT
kernel-development
Presentation on how the Linux kernel is developed
llama3.cuda
llama3.cuda is a pure C/CUDA implementation of the Llama 3 model.
optimum-benchmark
A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of Optimum's hardware optimizations & quantization schemes.
precise-leak-sanitizer
A dynamic memory leak detector that can pinpoint where memory is lost, using an LLVM pass
grace-kernel
Upstream kernel with pending Grace patches for partners. Patches include bug fixes made during Grace production while they await upstreaming.