Yanming W.'s repositories
transformers
🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch, TensorFlow, and JAX.
accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, and mixed-precision support
AITemplate
AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
alpa
Training and serving large-scale neural networks
ColossalAI
Making large AI models cheaper, faster, and more accessible
ColossalAI-Documentation
Documentation for Colossal-AI
compiler-explorer
Run compilers interactively from your web browser and interact with the assembly
detr
End-to-End Object Detection with Transformers
djl-serving
A universal, scalable machine learning model deployment solution
flash-attention
Fast and memory-efficient exact attention
llama.cpp
Port of Facebook's LLaMA model in C/C++
llm-awq
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
lm-evaluation-harness
A framework for few-shot evaluation of language models.
maskrcnn-benchmark
Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.
PipeEdge
PipeEdge: Pipeline Parallelism for Large-Scale Model Inference on Heterogeneous Edge Devices
tensorflow-fork
An Open Source Machine Learning Framework for Everyone
text-generation-webui
A Gradio web UI for Large Language Models. Supports transformers, GPTQ, llama.cpp (GGML/GGUF), and Llama models.
vllm-test
Miscellaneous test and benchmark code for vLLM