Kazuki Fujii (okoge-kaz)


Company: Tokyo Institute of Technology

Location: Tokyo, Japan

Twitter: @okoge_kaz



Organizations
llm-jp
rioyokotalab
SakanaAI
sbintuitions
turingmotors

Kazuki Fujii's repositories

moe-recipes

Ongoing research on training Mixture of Experts (MoE) models.

Language: Python, Stargazers: 16, Issues: 3, Issues: 0

llm-recipes

Ongoing research project for continual pre-training of LLMs (dense mode)

wandb_watcher

A tool for monitoring wandb jobs in the ABCI Large Language Model Building Support program

Language: Python, Stargazers: 2, Issues: 1, Issues: 0

Megatron-LM

Ongoing research training transformer models at scale

Language: Python, License: NOASSERTION, Stargazers: 1, Issues: 0, Issues: 0

deploymentmanager-samples

Deployment Manager samples and templates.

License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

grouped_gemm

PyTorch bindings for CUTLASS grouped GEMM.

Language: Cuda, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

levanter

Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

llama-recipes

Examples and recipes for Llama 2 model

Language: Jupyter Notebook, License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

llama3v

A SOTA vision model built on top of llama3 8B.

Stargazers: 0, Issues: 0, Issues: 0

Megatron-LM-ABCI

NVIDIA Megatron-LM fork

Language: Python, License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

ml-engineering

Machine Learning Engineering Open Book

Language: Python, License: CC-BY-SA-4.0, Stargazers: 0, Issues: 0, Issues: 0

nanotron

Minimalistic large language model 3D-parallelism training

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

NeMo

NeMo: a toolkit for conversational AI

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

NeMo-Aligner

Scalable toolkit for efficient model alignment

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

NeMo-Megatron-Launcher

NeMo Megatron launcher and tools

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

torchtitan

A native PyTorch Library for large model training

License: BSD-3-Clause, Stargazers: 0, Issues: 0, Issues: 0

TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0