muupan

followers

following

stars

@pfnet

https://github.com/muupan/resume

Organizations

chainer

pfnet

Yasuhiro Fujita's starred repositories

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language:PythonApache-2.031895 204 4919

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonMIT20705 205 377

Karabiner-Elements

Karabiner-Elements is a powerful utility for keyboard customization on macOS Sierra (10.12) or later.

Language:C++Unlicense18654 206 3744

unsloth

Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonApache-2.016468 116 863

Vim

:star: Vim for Visual Studio Code

Language:TypeScriptMIT13832 125 5903

flash-attention

Fast and memory-efficient exact attention

Language:PythonBSD-3-Clause13644 115 1047

Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Language:PythonMIT6948 38 450

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonMIT6612 37 1095

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language:PythonApache-2.04539 109 134

llm-numbers

Numbers every LLM developer should know

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language:PythonApache-2.02061 19 81

mathematics_dataset

This dataset code generates mathematical question and answer pairs, from a range of question types at roughly school-level difficulty.

Language:PythonApache-2.01784 65 13

hh-rlhf

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Language:Jupyter NotebookApache-2.01464 7 142

CameraController

📷 Control USB Cameras from an app

Language:SwiftGPL-3.01409 32 98

safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Language:PythonApache-2.01311 18 84

open-instruct

Language:PythonApache-2.01215 14 112

awesome-japanese-nlp-resources

A curated list of resources dedicated to Python libraries, LLMs, dictionaries, and corpora of NLP for Japanese

CC0-1.0675 19 4

NeMo-Aligner

Scalable toolkit for efficient model alignment

Language:PythonApache-2.0529 16 70

SPPO

The official implementation of Self-Play Preference Optimization (SPPO)

Language:PythonApache-2.0477 29 18

Online-RLHF

A recipe for online RLHF and online iterative DPO.

Language:Python384 18 21

qv

Quickly view your data

Language:RustApache-2.0276 3 12

evals

CC-BY-4.0233 70

vscode-journal

Lightweight journal and simple notes support for Visual Studio Code

Language:TypeScriptGPL-3.0233 10 96

llm-japanese-dataset

LLM構築用の日本語チャットデータセット

Language:Python77 3 14

EasyLM

Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

Language:PythonApache-2.051 10

japanese-llm-ranking

Language:Jupyter NotebookApache-2.047 1 5

exact-optimization

ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment

Language:PythonMIT45 4 5

CPO_SIMPO

This repository contains the joint use of CPO and SimPO method for better reference-free preference learning methods.

Language:Python30 2 3

instruction_ja

Japanese instruction data (日本語指示データ)

Language:PythonMIT22 30