Yasuhiro Fujita's starred repositories
Online-RLHF
A recipe for online RLHF.
CameraController
📷 Control USB Cameras from an app
alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
NeMo-Aligner
Scalable toolkit for efficient model alignment
alignment-handbook
Robust recipes to align language models with human and AI preferences
mathematics_dataset
This dataset code generates mathematical question and answer pairs, from a range of question types at roughly school-level difficulty.
llm-japanese-dataset
LLM構築用の日本語チャットデータセット
LLaMA-Factory
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
flash-attention
Fast and memory-efficient exact attention
vscode-journal
Lightweight journal and simple notes support for Visual Studio Code
instruction_ja
Japanese instruction data (日本語指示データ)
awesome-japanese-nlp-resources
A curated list of resources dedicated to Python libraries, LLMs, dictionaries, and corpora of NLP for Japanese
direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
Karabiner-Elements
Karabiner-Elements is a powerful utility for keyboard customization on macOS Sierra (10.12) or later.
llm-numbers
Numbers every LLM developer should know
YouTube-Blocker
A Chrome Extension that blocks non-educational YouTube videos
big-list-of-naughty-strings
The Big List of Naughty Strings is a list of strings which have a high probability of causing issues when used as user-input data.