Kazuki Fujii (okoge-kaz)

Company: Tokyo Institute of Technology

Location: Tokyo, Japan

Twitter: @okoge_kaz


Organizations
llm-jp
rioyokotalab
SakanaAI
sbintuitions
turingmotors

Kazuki Fujii's repositories

Megatron-LM

Ongoing research training transformer models at scale

Language: Python | License: NOASSERTION | Stargazers: 1 | Issues: 0

NeMo-Aligner

Scalable toolkit for efficient model alignment

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

llm-recipes

Ongoing research project for continual pre-training of LLMs (dense mode)

Language: Python | Stargazers: 15 | Issues: 0

moe-recipes

Ongoing research training Mixture of Experts models.

Language: Python | Stargazers: 16 | Issues: 0
Language: Shell | Stargazers: 0 | Issues: 0
Language: C | Stargazers: 0 | Issues: 0

nanotron

Minimalistic large language model 3D-parallelism training

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

torchtitan

A native PyTorch Library for large model training

License: BSD-3-Clause | Stargazers: 0 | Issues: 0

llama3v

A SOTA vision model built on top of llama3 8B.

Stargazers: 0 | Issues: 0
Stargazers: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0

grouped_gemm

PyTorch bindings for CUTLASS grouped GEMM.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

llama-recipes

Examples and recipes for Llama 2 model

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0

deploymentmanager-samples

Deployment Manager samples and templates.

License: Apache-2.0 | Stargazers: 0 | Issues: 0
Stargazers: 0 | Issues: 0
Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0

levanter

Legible, Scalable, Reproducible Foundation Models with Named Tensors and JAX

License: Apache-2.0 | Stargazers: 0 | Issues: 0

ml-engineering

Machine Learning Engineering Open Book

License: CC-BY-SA-4.0 | Stargazers: 0 | Issues: 0
Language: Shell | Stargazers: 1 | Issues: 0
License: Apache-2.0 | Stargazers: 0 | Issues: 0
License: Apache-2.0 | Stargazers: 0 | Issues: 0

multi-gpu-programming-models

Examples demonstrating available options to program multiple GPUs in a single node or a cluster

License: BSD-3-Clause | Stargazers: 0 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

Megatron-LM-ABCI

NVIDIA Megatron-LM fork

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0

NeMo

NeMo: a toolkit for conversational AI

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

NeMo-Megatron-Launcher

NeMo Megatron launcher and tools

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0
Stargazers: 0 | Issues: 0
License: Apache-2.0 | Stargazers: 0 | Issues: 0