Kazuki Fujii (okoge-kaz)

okoge-kaz

Geek Repo

Company:Tokyo Institute of Technology

Location:Tokyo Japan

Twitter:@okoge_kaz

Github PK Tool:Github PK Tool


Organizations
llm-jp
rioyokotalab
SakanaAI
sbintuitions
turingmotors

Kazuki Fujii's repositories

wandb_watcher

ABCI 大規模言語モデル構築支援にてwandbのジョブを監視するためのツール

Language:PythonStargazers:2Issues:0Issues:0

accelerate

🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision

License:Apache-2.0Stargazers:0Issues:0Issues:0

OLMo

Modeling, training, eval, and inference code for OLMo

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:JinjaStargazers:4Issues:0Issues:0

mistral-src

Reference implementation of Mistral AI 7B v0.1 model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

License:Apache-2.0Stargazers:0Issues:0Issues:0

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

License:Apache-2.0Stargazers:0Issues:0Issues:0

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

License:Apache-2.0Stargazers:0Issues:0Issues:0

nvtop

GPUs process monitoring for AMD, Intel and NVIDIA

License:NOASSERTIONStargazers:0Issues:0Issues:0

Awesome-LLM

Awesome-LLM: a curated list of Large Language Model

License:CC0-1.0Stargazers:0Issues:0Issues:0

epochraft-hf-fsdp

Example of using Epochraft to train HuggingFace transformers models with PyTorch FSDP

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

gpu-burn

Multi-GPU CUDA stress test

License:BSD-2-ClauseStargazers:0Issues:0Issues:0

relora

Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates

License:Apache-2.0Stargazers:0Issues:0Issues:0

yarn

YaRN: Efficient Context Window Extension of Large Language Models

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Yi

A series of large language models trained from scratch by developers @01-ai

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

open-llms

📋 A list of open LLMs available for commercial use.

License:Apache-2.0Stargazers:0Issues:0Issues:0

m2

Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"

Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

Megatron-LLM

distributed trainer for LLMs

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

DeepSpeedExamples

Example models using DeepSpeed

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

open_lm

A repository for research on medium sized language models.

License:MITStargazers:0Issues:0Issues:0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

attention

several types of attention modules written in PyTorch

Stargazers:0Issues:0Issues:0

python-fire

Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.

License:NOASSERTIONStargazers:0Issues:0Issues:0

simple-simcse-ja

Japanese Simple-SimCSE

Stargazers:0Issues:0Issues:0

fmengine

Utilities for Training Very Large Models

License:Apache-2.0Stargazers:0Issues:0Issues:0

apex

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

GenerativeImage2Text

GIT: A Generative Image-to-text Transformer for Vision and Language

License:MITStargazers:0Issues:0Issues:0