Beast code in Giters

distributed-llama

Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and increase inference speed.

Language: C++ | MIT | 1279 25 49

flash-linear-attention

Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton

Language: Python | MIT | 1103 22 36

granite-code-models

Granite Code Models: A Family of Open Foundation Models for Code Intelligence

Apache-2.0 | 1036 21 10

PiPPy

Pipeline Parallelism for PyTorch

Language: Python | BSD-3-Clause | 697 37 258

unet.cu

UNet diffusion model in pure CUDA

Language: Cuda | 556 20

RepoAgent

An LLM-powered repository agent designed to assist developers and teams in generating documentation and understanding repositories quickly.

Language: Python | Apache-2.0 | 282 9 26

awesome-emulators-simulators

A curated list of software emulators and simulators of PCs, home computers, mainframes, consoles, robots and much more...

180 21 17

CEPE

[ACL 2024] Long-Context Language Modeling with Parallel Encodings

Language: Python | MIT | 126 5 5

bpe.c

Simple byte pair encoding mechanism used for the tokenization process, written purely in C

Language: C | MIT | 111 0 0

Awesome-Mainframes

Awesome list of mainframe related resources & projects

78 16 7

bark.cpp

Port of Suno AI's Bark in C/C++ for fast inference

Language: C++ | MIT | 49 0 0

farel-bench

Testing LLM reasoning abilities with family relationship quizzes.

Language: Python | MIT | 40 0 0

PLTranslationEmpirical

Artifact repository for the paper "Lost in Translation: A Study of Bugs Introduced by Large Language Models while Translating Code", in Proceedings of the 46th IEEE/ACM International Conference on Software Engineering (ICSE 2024), Lisbon, Portugal, April 2024

Language: Python | MIT | 37 2 1

zfan20

Ziwei Fan's starred repositories

LLM101n

llm.c

unsloth

Perplexica

ggml

tiny-gpu

ToolBench

matmulfreellm

LLM-Agent-Survey

reflexion

c-style

ThunderKittens

GaLore

distributed-llama

flash-linear-attention

granite-code-models

ToolLearningPapers

PiPPy

unet.cu

RepoAgent

awesome-emulators-simulators

CEPE

bpe.c

Awesome-Mainframes

bark.cpp

farel-bench

PLTranslationEmpirical

llama_duo

Retroformer

proxy_based_uncertainty