YOKOTA Laboratory at Tokyo Tech's repositories
Megatron-Llama2
2023 ABCI Llama-2 continual-learning project
DeepSpeedFugaku
main: microsoft/Megatron-DeepSpeed, cpu: stable branch for running on Fugaku
PixPro-with-OpticalFlow
Pixel-level Contrastive Learning of Driving Videos with Optical Flow, CVPR 2023 Workshop
Megatron-DeepSpeed-Ylab
Ongoing research training transformer language models at scale, including: BERT & GPT-2
cutlass
CUDA Templates for Linear Algebra Subroutines
detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
FederatedLearning
An adaptable federated learning framework with a central server, supporting diverse datasets, models, and optimizers. Facilitates collaborative, yet private, data training with customizable aggregation algorithms.
gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
grok-1
Grok open release
lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
m2
Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Megatron-LLaMA
Best practice for training LLaMA models in Megatron-LM
PixPro
Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised Visual Representation Learning, CVPR 2021
stars-h
Software for Testing Accuracy, Reliability and Scalability of Hierarchical computations.
STRUMPACK
Structured Matrix Package (LBNL)
ylab_server_public
How to use the Hinadori cluster (for public)
zero-bubble-pipeline-parallelism
Zero Bubble Pipeline Parallelism