Yasuhiro Fujita (muupan)

Company: @pfnet

Home Page: https://github.com/muupan/resume


Organizations
chainer
pfnet

Yasuhiro Fujita's starred repositories

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language: Python | License: Apache-2.0 | Stargazers: 31895 | Issues: 204 | Issues: 4919

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language: Python | License: MIT | Stargazers: 20705 | Issues: 205 | Issues: 377

Karabiner-Elements

Karabiner-Elements is a powerful utility for keyboard customization on macOS Sierra (10.12) or later.

Language: C++ | License: Unlicense | Stargazers: 18654 | Issues: 206 | Issues: 3744

unsloth

Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language: Python | License: Apache-2.0 | Stargazers: 16468 | Issues: 116 | Issues: 863

Vim

:star: Vim for Visual Studio Code

Language: TypeScript | License: MIT | Stargazers: 13832 | Issues: 125 | Issues: 5903

flash-attention

Fast and memory-efficient exact attention

Language: Python | License: BSD-3-Clause | Stargazers: 13644 | Issues: 115 | Issues: 1047

Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Language: Python | License: MIT | Stargazers: 6948 | Issues: 38 | Issues: 450

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language: Python | License: MIT | Stargazers: 6612 | Issues: 37 | Issues: 1095

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language: Python | License: Apache-2.0 | Stargazers: 4539 | Issues: 109 | Issues: 134

llm-numbers

Numbers every LLM developer should know

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language: Python | License: Apache-2.0 | Stargazers: 2061 | Issues: 19 | Issues: 81

mathematics_dataset

This dataset code generates mathematical question and answer pairs, from a range of question types at roughly school-level difficulty.

Language: Python | License: Apache-2.0 | Stargazers: 1784 | Issues: 65 | Issues: 13

hh-rlhf

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

License: MIT | Stargazers: 1575 | Issues: 20 | Issues: 0

alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1464 | Issues: 7 | Issues: 142

CameraController

📷 Control USB Cameras from an app

Language: Swift | License: GPL-3.0 | Stargazers: 1409 | Issues: 32 | Issues: 98

safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Language: Python | License: Apache-2.0 | Stargazers: 1311 | Issues: 18 | Issues: 84
Language: Python | License: Apache-2.0 | Stargazers: 1215 | Issues: 14 | Issues: 112

awesome-japanese-nlp-resources

A curated list of resources dedicated to Python libraries, LLMs, dictionaries, and corpora of NLP for Japanese

NeMo-Aligner

Scalable toolkit for efficient model alignment

Language: Python | License: Apache-2.0 | Stargazers: 529 | Issues: 16 | Issues: 70

SPPO

The official implementation of Self-Play Preference Optimization (SPPO)

Language: Python | License: Apache-2.0 | Stargazers: 477 | Issues: 29 | Issues: 18

Online-RLHF

A recipe for online RLHF and online iterative DPO.

qv

Quickly view your data

Language: Rust | License: Apache-2.0 | Stargazers: 276 | Issues: 3 | Issues: 12
License: CC-BY-4.0 | Stargazers: 233 | Issues: 7 | Issues: 0

vscode-journal

Lightweight journal and simple notes support for Visual Studio Code

Language: TypeScript | License: GPL-3.0 | Stargazers: 233 | Issues: 10 | Issues: 96

llm-japanese-dataset

Japanese chat dataset for building LLMs

EasyLM

Large language models (LLMs) made easy: EasyLM is a one-stop solution for pre-training, fine-tuning, evaluating, and serving LLMs in JAX/Flax.

Language: Python | License: Apache-2.0 | Stargazers: 51 | Issues: 1 | Issues: 0
Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 47 | Issues: 1 | Issues: 5

exact-optimization

ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment

Language: Python | License: MIT | Stargazers: 45 | Issues: 4 | Issues: 5

CPO_SIMPO

This repository contains the joint use of CPO and SimPO method for better reference-free preference learning methods.

instruction_ja

Japanese instruction data (日本語指示データ)

Language: Python | License: MIT | Stargazers: 22 | Issues: 3 | Issues: 0