Yoshinari Fujinuma (akkikiki)

Company: AWS AI Labs

Location: New York, USA

Home Page: http://akkikiki.github.io

Twitter: @akkikiki

Yoshinari Fujinuma's starred repositories

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python | License: MIT | Stargazers: 34020 | Issues: 356 | Issues: 299

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python | License: Apache-2.0 | Stargazers: 17617 | Issues: 156 | Issues: 1361

mamba

Mamba SSM architecture

Language: Python | License: Apache-2.0 | Stargazers: 11254 | Issues: 99 | Issues: 370

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 10209 | Issues: 82 | Issues: 288

ggml

Tensor library for machine learning

mistral-src

Reference implementation of Mistral AI 7B v0.1 model.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 8772 | Issues: 116 | Issues: 115

trl

Train transformer language models with reinforcement learning.

Language: Python | License: Apache-2.0 | Stargazers: 8557 | Issues: 78 | Issues: 956

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language: Python | License: NOASSERTION | Stargazers: 5525 | Issues: 46 | Issues: 73

composer

Supercharge Your Model Training

Language: Python | License: Apache-2.0 | Stargazers: 5056 | Issues: 51 | Issues: 528

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language: Python | License: Apache-2.0 | Stargazers: 4120 | Issues: 112 | Issues: 119

Vim

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Language: Python | License: Apache-2.0 | Stargazers: 2501 | Issues: 32 | Issues: 90

mamba-minimal

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Language: Python | License: Apache-2.0 | Stargazers: 2405 | Issues: 23 | Issues: 23

mPLUG-Owl

mPLUG-Owl & mPLUG-Owl2: Modularized Multimodal Large Language Model

Language: Python | License: MIT | Stargazers: 1998 | Issues: 26 | Issues: 204

deep_learning_curriculum

Language model alignment-focused deep learning curriculum

shell-ai

LangChain powered shell command generator and runner CLI

Language: Python | License: MIT | Stargazers: 968 | Issues: 14 | Issues: 20

summarize-from-feedback

Code for "Learning to summarize from human feedback"

Language: Python | License: NOASSERTION | Stargazers: 963 | Issues: 147 | Issues: 21

Triton-Puzzles

Puzzles for learning Triton

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 781 | Issues: 6 | Issues: 7

RRHF

[NIPS2023] RRHF & Wombat

mamba.py

A simple and efficient Mamba implementation in PyTorch and MLX.

Language: Python | License: MIT | Stargazers: 740 | Issues: 4 | Issues: 23

gpt_paper_assistant

GPT4 based personalized ArXiv paper assistant bot

Language: Python | License: Apache-2.0 | Stargazers: 432 | Issues: 6 | Issues: 10

open_lm

A repository for research on medium sized language models.

Language: Python | License: MIT | Stargazers: 334 | Issues: 21 | Issues: 59

CoLT5-attention

Implementation of the conditionally routed attention in the CoLT5 architecture, in Pytorch

Language: Python | License: MIT | Stargazers: 217 | Issues: 8 | Issues: 7

Language: Python | License: NOASSERTION | Stargazers: 127 | Issues: 5 | Issues: 7

mixture-of-attention

Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts

Language: Python | License: MIT | Stargazers: 100 | Issues: 8 | Issues: 0

Language: Python | License: MIT | Stargazers: 84 | Issues: 2 | Issues: 13

deepform

Experimental form data extraction for journalism

Language: Python | License: MIT | Stargazers: 75 | Issues: 2 | Issues: 31

kotomamba

Mamba training library developed by kotoba technologies

Language: Python | License: Apache-2.0 | Stargazers: 59 | Issues: 5 | Issues: 0

rl4f

Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.

Language: Python | License: MIT | Stargazers: 56 | Issues: 3 | Issues: 3

Language: Python | License: Apache-2.0 | Stargazers: 49 | Issues: 3 | Issues: 1