Yasuhiro Fujita (muupan)

muupan

Geek Repo

Company:@pfnet

Home Page:https://github.com/muupan/resume

Github PK Tool:Github PK Tool


Organizations
chainer
pfnet

Yasuhiro Fujita's starred repositories

SPPO

The official implementation of Self-Play Preference Optimization (SPPO)

Language:PythonLicense:Apache-2.0Stargazers:415Issues:0Issues:0

CPO_SIMPO

This repository contains the joint use of CPO and SimPO method for better reference-free preference learning methods.

Language:PythonStargazers:18Issues:0Issues:0

EasyLM

Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

Language:PythonLicense:Apache-2.0Stargazers:43Issues:0Issues:0

Online-RLHF

A recipe for online RLHF.

Language:PythonStargazers:342Issues:0Issues:0

CameraController

📷 Control USB Cameras from an app

Language:SwiftLicense:GPL-3.0Stargazers:1361Issues:0Issues:0

alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1343Issues:0Issues:0

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonLicense:MITStargazers:20297Issues:0Issues:0

Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Language:PythonLicense:MITStargazers:6370Issues:0Issues:0

NeMo-Aligner

Scalable toolkit for efficient model alignment

Language:PythonLicense:Apache-2.0Stargazers:456Issues:0Issues:0

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language:PythonLicense:Apache-2.0Stargazers:4275Issues:0Issues:0

mathematics_dataset

This dataset code generates mathematical question and answer pairs, from a range of question types at roughly school-level difficulty.

Language:PythonLicense:Apache-2.0Stargazers:1765Issues:0Issues:0

Vim

:star: Vim for Visual Studio Code

Language:TypeScriptLicense:MITStargazers:13577Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:1114Issues:0Issues:0

llm-japanese-dataset

LLM構築用の日本語チャットデータセット

Language:PythonStargazers:75Issues:0Issues:0

safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Language:PythonLicense:Apache-2.0Stargazers:1259Issues:0Issues:0

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:27785Issues:0Issues:0

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:12622Issues:0Issues:0

vscode-journal

Lightweight journal and simple notes support for Visual Studio Code

Language:TypeScriptLicense:GPL-3.0Stargazers:232Issues:0Issues:0
License:CC-BY-4.0Stargazers:224Issues:0Issues:0

instruction_ja

Japanese instruction data (日本語指示データ)

Language:PythonLicense:MITStargazers:22Issues:0Issues:0

unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonLicense:Apache-2.0Stargazers:13135Issues:0Issues:0

awesome-japanese-nlp-resources

A curated list of resources dedicated to Python libraries, LLMs, dictionaries, and corpora of NLP for Japanese

License:CC0-1.0Stargazers:632Issues:0Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:47Issues:0Issues:0

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language:PythonLicense:Apache-2.0Stargazers:1905Issues:0Issues:0

Karabiner-Elements

Karabiner-Elements is a powerful utility for keyboard customization on macOS Sierra (10.12) or later.

Language:C++License:UnlicenseStargazers:18328Issues:0Issues:0

hh-rlhf

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

License:MITStargazers:1522Issues:0Issues:0

llm-numbers

Numbers every LLM developer should know

Stargazers:4006Issues:0Issues:0

YouTube-Blocker

A Chrome Extension that blocks non-educational YouTube videos

Language:JavaScriptStargazers:17Issues:0Issues:0

big-list-of-naughty-strings

The Big List of Naughty Strings is a list of strings which have a high probability of causing issues when used as user-input data.

Language:PythonLicense:MITStargazers:46048Issues:0Issues:0

vimGPT

Browse the web with GPT-4V and Vimium

Language:PythonLicense:MITStargazers:2562Issues:0Issues:0