Yoshinari Fujinuma's starred repositories
function_vectors
Function Vectors in Large Language Models (ICLR 2024)
MLLM-Judge
[ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.
recurrentgemma
Open weights language model from Google DeepMind, based on Griffin.
deep_learning_curriculum
Language model alignment-focused deep learning curriculum
Triton-Puzzles
Puzzles for learning Triton
mamba-minimal
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
alignment-handbook
Robust recipes to align language models with human and AI preferences
gpt_paper_assistant
GPT-4-based personalized arXiv paper assistant bot
mistral-inference
Official inference library for Mistral models
CoLT5-attention
Implementation of the conditionally routed attention in the CoLT5 architecture, in PyTorch
mixture-of-attention
Some personal experiments around routing tokens to different autoregressive attention modules, akin to mixture-of-experts