Taishi Nakamura's repositories
llama-recipes
Examples and recipes for the Llama 2 model
multimodal
An implementation of model-parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
alignment-handbook
Robust recipes to align language models with human and AI preferences
bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
FIN-bench
Evaluation of Finnish generative models
Megatron-LLM
Distributed trainer for LLMs
Megatron-LM
Ongoing research training transformer models at scale
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
llm-leaderboard
A project for evaluating LLMs on Japanese tasks
long-context
YaRN: Efficient Context Window Extension of Large Language Models
Megatron-LM-LUMI
Ongoing research training transformer models at scale
mlmm-evaluation
Multilingual Large Language Models Evaluation Benchmark
moe-recipes
Mixture of Experts library, forked from kotoba-recipes
nccl-tests
NCCL Tests
rome
Locating and editing factual associations in GPT (NeurIPS 2022)
SEED
Empowers LLMs with the ability to see and draw.
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
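
For orientation on the last entry, a minimal usage sketch of the 🤗 Transformers pipeline API (the task string, example text, and default model download are illustrative, not specific to this fork):

```python
# Minimal sketch: build a default sentiment-analysis pipeline and classify a sentence.
# The task name and input text here are illustrative; other pipeline tasks work the same way.
from transformers import pipeline

classifier = pipeline("sentiment-analysis")  # downloads a default model on first use
result = classifier("🤗 Transformers makes state-of-the-art NLP accessible.")
print(result)  # e.g. [{'label': 'POSITIVE', 'score': 0.99...}]
```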