Yard1

Antoni Baum's repositories

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonApache-2.0800

pdx-steam-workshop-publisher-action

Automatically upload your Hearts of Iron IV (support for more Paradox games to come) mod to Steam Workshop.

Language:ShellMIT3 30

HoI4-GFX-Search

This tool helps you easily find GFX from unmodded (vanilla) Hearts of Iron 4 1.14.

Language:HTML2 2 6

hoi4_headless

A Docker container to upload Hearts of Iron IV mods automatically

Language:ShellMIT2 30

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonMIT200

text-generation-inference

Large Language Model Text Generation Inference

Language:PythonApache-2.0200

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonApache-2.0100

lm-format-enforcer

Enforce the output format (JSON Schema, Regex etc) of a language model

Language:PythonMIT100

A fast and simple framework for building and running distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

Language:PythonApache-2.01 10

Ray-LLM-script

Language:Python1 10

ray-serve-deepspeed

Run deepspeed on ray serve

Language:Python100

accelerate

🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision

Language:PythonApache-2.0000

cupy

NumPy & SciPy for GPU

Language:PythonMIT000

flashinfer

FlashInfer: Kernel Library for LLM Serving

Language:CudaApache-2.0000

gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.

Language:PythonApache-2.0000

gradio

Create UIs for your machine learning model in Python in 3 minutes

Language:PythonApache-2.0000

joblib

Computing with Python functions.

Language:PythonBSD-3-Clause010

lightgbm_ray

LightGBM on Ray

Language:PythonApache-2.0010

llama-cpp-python

Python bindings for llama.cpp

Language:PythonMIT000

LLM-Ray

Language:Jupyter Notebook010

pyod

A Comprehensive and Scalable Python Library for Outlier Detection (Anomaly Detection)

Language:PythonBSD-2-Clause000

RWKV-LM

RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

Language:PythonApache-2.0000

Yard1

Antoni Baum's repositories

Ray-DeepSpeed-Inference

vllm

Ray-StableDiffusion

pdx-steam-workshop-publisher-action

HoI4-GFX-Search

hoi4_headless

lm-evaluation-harness

text-generation-inference

DeepSpeed

lm-format-enforcer

megablocks

ray

Ray-LLM-script

ray-serve-deepspeed

accelerate

cupy

flashinfer

gpt-neox

gradio

joblib

lightgbm_ray

llama-cpp-python

LLM-Ray

pyod

RWKV-LM

trlx

vllm-flash-attention

Whisper-Ray

xgboost_ray

Yard1