okoge-kaz

Kazuki Fujii's repositories

wandb_watcher

A tool for monitoring wandb jobs under the ABCI large language model building support program

Language: Python · 2 · 0 · 0

accelerate

🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision

Apache-2.0 · 0 · 0 · 0

OLMo

Modeling, training, eval, and inference code for OLMo

Apache-2.0 · 0 · 0 · 0

llm-jp-sakura-ansible

Language: Jinja · 4 · 0 · 0

mistral-src

Reference implementation of Mistral AI 7B v0.1 model.

Language: Jupyter Notebook · Apache-2.0 · 0 · 0 · 0

t5x

Apache-2.0 · 0 · 0 · 0

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Apache-2.0 · 0 · 0 · 0

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Apache-2.0 · 0 · 0 · 0

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Apache-2.0 · 0 · 0 · 0

nvtop

GPUs process monitoring for AMD, Intel and NVIDIA

NOASSERTION · 0 · 0 · 0

Awesome-LLM

Awesome-LLM: a curated list of Large Language Model

CC0-1.0 · 0 · 0 · 0

epochraft-hf-fsdp

Example of using Epochraft to train HuggingFace transformers models with PyTorch FSDP

Language: Python · MIT · 0 · 0 · 0

gpu-burn

Multi-GPU CUDA stress test

BSD-2-Clause · 0 · 0 · 0

relora

Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates

Apache-2.0 · 0 · 0 · 0

yarn

YaRN: Efficient Context Window Extension of Large Language Models

Language: Python · MIT · 0 · 0 · 0

Yi

A series of large language models trained from scratch by developers @01-ai

Apache-2.0 · 0 · 0 · 0

abci-unhealthy-nodes

Language: Python · 0 · 0 · 0

open-llms

📋 A list of open LLMs available for commercial use.

Apache-2.0 · 0 · 0 · 0

m2

Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"

0 · 0 · 0

math-lm

MIT · 0 · 0 · 0

Megatron-LLM

distributed trainer for LLMs

Language: Python · NOASSERTION · 0 · 0 · 0

DeepSpeedExamples

Example models using DeepSpeed

Language: Python · Apache-2.0 · 0 · 0 · 0

open_lm

A repository for research on medium sized language models.

MIT · 0 · 0 · 0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python · Apache-2.0 · 0 · 0 · 0

attention

several types of attention modules written in PyTorch

0 · 0 · 0

python-fire

Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.

NOASSERTION · 0 · 0 · 0

simple-simcse-ja

Japanese Simple-SimCSE

0 · 0 · 0

fmengine

Utilities for Training Very Large Models

Apache-2.0 · 0 · 0 · 0

apex

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

BSD-3-Clause · 0 · 0 · 0

GenerativeImage2Text

GIT: A Generative Image-to-text Transformer for Vision and Language

MIT · 0 · 0 · 0