Dip_an's repositories
accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
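A minimal sketch of the training-loop pattern Accelerate documents: wrap a plain PyTorch loop with `prepare()` and `accelerator.backward()`. The toy model and synthetic data below are stand-in assumptions.

```python
import torch
from accelerate import Accelerator

accelerator = Accelerator()  # picks up the device / distributed config

model = torch.nn.Linear(10, 2)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3)
dataloader = torch.utils.data.DataLoader(
    torch.utils.data.TensorDataset(
        torch.randn(64, 10), torch.randint(0, 2, (64,))
    ),
    batch_size=8,
)

# prepare() moves everything to the right device(s) and wraps for DDP/FSDP
model, optimizer, dataloader = accelerator.prepare(model, optimizer, dataloader)

for inputs, targets in dataloader:
    optimizer.zero_grad()
    loss = torch.nn.functional.cross_entropy(model(inputs), targets)
    accelerator.backward(loss)  # replaces loss.backward()
    optimizer.step()
```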
halutmatmul_for_windows
Stella Nera is the first Maddness accelerator achieving 15x higher area efficiency (GMAC/s/mm^2) and 25x higher energy efficiency (TMAC/s/W) than direct MatMul accelerators in the same technology
Open-Llama
Complete training code for an open-source, high-performance Llama-style model, covering the full process from pre-training to RLHF.
flash-attention
Fast and memory-efficient exact attention
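A minimal sketch of calling `flash_attn_func` from this package; the shapes are arbitrary assumptions, and the kernel requires fp16/bf16 tensors on a CUDA device.

```python
import torch
from flash_attn import flash_attn_func

batch, seqlen, nheads, headdim = 2, 1024, 8, 64
q = torch.randn(batch, seqlen, nheads, headdim, dtype=torch.float16, device="cuda")
k = torch.randn_like(q)
v = torch.randn_like(q)

# Exact (not approximate) attention, computed without materializing
# the seqlen x seqlen score matrix
out = flash_attn_func(q, k, v, causal=True)  # (batch, seqlen, nheads, headdim)
```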
flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
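The repo ships Triton kernels; as a plain-PyTorch illustration (not this repo's API) of the associativity trick that makes attention linear in sequence length, with a simple elu+1 feature map as in Katharopoulos et al. (2020):

```python
import torch

def linear_attention(q, k, v):
    # q, k, v: (batch, heads, seqlen, dim)
    phi = lambda x: torch.nn.functional.elu(x) + 1   # positive feature map
    q, k = phi(q), phi(k)
    # Contract K with V first: O(seqlen * dim^2) instead of O(seqlen^2 * dim)
    kv = torch.einsum("bhsd,bhse->bhde", k, v)
    # Per-position normalizer (non-causal variant, for simplicity)
    z = 1 / (torch.einsum("bhsd,bhd->bhs", q, k.sum(dim=2)) + 1e-6)
    return torch.einsum("bhsd,bhde,bhs->bhse", q, kv, z)

q = k = v = torch.randn(2, 8, 4096, 64)
out = linear_attention(q, k, v)  # (2, 8, 4096, 64)
```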
gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
grok-1
Grok open release
lightning-attention
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models
linear_open_lm
A repository for research on medium-sized language models.
llama3
The official Meta Llama 3 GitHub site
llamafile
Distribute and run LLMs with a single file.
LLM-Agents-Papers
A repo listing papers related to LLM-based agents
llm-foundry
LLM training code for Databricks foundation models
LLMTest_NeedleInAHaystack
Simple retrieval from LLMs at various context lengths to measure accuracy
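The idea, sketched with a hypothetical `generate(prompt) -> str` model call: bury a fact ("needle") at a chosen depth in a long context ("haystack") and check whether the model retrieves it.

```python
def needle_test(generate, haystack: str, depth: float) -> bool:
    needle = "The secret passcode is 7421."
    pos = int(len(haystack) * depth)               # bury the needle at a given depth
    context = haystack[:pos] + " " + needle + " " + haystack[pos:]
    answer = generate(context + "\n\nWhat is the secret passcode?")
    return "7421" in answer                        # did retrieval succeed?

# Sweep depths (and, in the real repo, context lengths) to map accuracy:
# results = {d: needle_test(generate, long_doc, d) for d in (0.0, 0.25, 0.5, 0.75, 1.0)}
```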
MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
minLlama3
A quick & complete guide to Llama 3's architecture
nanoGPT-TK
The simplest, fastest repository for training/finetuning medium-sized GPTs. Now, with kittens!
ollama
Get up and running with Llama 3, Mistral, Gemma, and other large language models.
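A minimal sketch against Ollama's documented local REST API (default port 11434); it assumes the server is running and the `llama3` model has been pulled.

```python
import requests

resp = requests.post(
    "http://localhost:11434/api/generate",
    json={"model": "llama3", "prompt": "Why is the sky blue?", "stream": False},
)
print(resp.json()["response"])
```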
othello_mamba
Evaluating the Mamba architecture on the Othello game
pykan
Kolmogorov-Arnold Networks
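pykan's own API differs; below is only a toy illustration of the KAN idea, where every edge carries a learnable univariate function, with a sine basis standing in for pykan's B-splines.

```python
import torch
import torch.nn as nn

class ToyKANLayer(nn.Module):
    """Each edge (input i -> output j) gets its own learnable univariate
    function, here a sum of sine basis functions; pykan uses B-splines."""
    def __init__(self, in_dim, out_dim, n_basis=8):
        super().__init__()
        # Per-edge basis coefficients: (out_dim, in_dim, n_basis)
        self.coef = nn.Parameter(0.1 * torch.randn(out_dim, in_dim, n_basis))
        self.register_buffer("freqs", torch.arange(1, n_basis + 1).float())

    def forward(self, x):                                  # x: (batch, in_dim)
        basis = torch.sin(x.unsqueeze(-1) * self.freqs)    # (batch, in_dim, n_basis)
        # Each output node sums its in_dim edge functions (Kolmogorov-Arnold form)
        return torch.einsum("bik,oik->bo", basis, self.coef)

model = nn.Sequential(ToyKANLayer(2, 5), ToyKANLayer(5, 1))
out = model(torch.rand(16, 2))                             # (16, 1)
```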
pythia
The hub for EleutherAI's work on interpretability and learning dynamics
ThunderKittens
Tile primitives for speedy kernels
tiny-gpu
A minimal GPU design in Verilog to learn how GPUs work from the ground up
torchscale
Foundation Architecture for (M)LLMs
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
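A minimal sketch of vLLM's offline inference API; the model name is an example placeholder and must be available locally or on the Hugging Face Hub.

```python
from vllm import LLM, SamplingParams

llm = LLM(model="meta-llama/Meta-Llama-3-8B-Instruct")
params = SamplingParams(temperature=0.8, max_tokens=128)

outputs = llm.generate(["Explain PagedAttention in one sentence."], params)
print(outputs[0].outputs[0].text)
```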
X_net
A new transformer architecture