Fariz Ikhwantri (farizikhwantri)

farizikhwantri

Geek Repo

Company:Tokyo Institute of Technology

Location:Tokyo, Japan

Home Page:https://sites.google.com/view/farizikhwantri/home

Twitter:@farizikhwantri

Github PK Tool:Github PK Tool

Fariz Ikhwantri's starred repositories

llama

Inference code for LLaMA models

Language:PythonLicense:NOASSERTIONStargazers:50895Issues:499Issues:872

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:44915Issues:299Issues:646

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonLicense:Apache-2.0Stargazers:14532Issues:108Issues:923

codon

A high-performance, zero-overhead, extensible Python compiler using LLVM

Language:C++License:NOASSERTIONStargazers:13936Issues:133Issues:394

llama-recipes

Scripts for fine-tuning Llama2 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization & question answering. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment.Demo apps to showcase Llama2 for WhatsApp & Messenger

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:7850Issues:68Issues:227

trl

Train transformer language models with reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:4589Issues:53Issues:321

fairscale

PyTorch extensions for high performance and large scale training.

Language:PythonLicense:NOASSERTIONStargazers:2961Issues:44Issues:357

RL4LMs

A modular RL library to fine-tune language models to human preferences

Language:PythonLicense:Apache-2.0Stargazers:2114Issues:26Issues:54

pystack

🔍 🐍 Like pstack but for Python!

Language:PythonLicense:Apache-2.0Stargazers:965Issues:12Issues:49

curated-transformers

🤖 A PyTorch library of curated Transformer models and their composable components

Language:PythonLicense:MITStargazers:849Issues:14Issues:31

the-story-of-heads

This is a repository with the code for the ACL 2019 paper "Analyzing Multi-Head Self-Attention: Specialized Heads Do the Heavy Lifting, the Rest Can Be Pruned" and the ACL 2021 paper "Analyzing Source and Target Contributions to NMT Predictions".

Diffusion-BERT

ACL'2023: DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models

Language:PythonLicense:Apache-2.0Stargazers:276Issues:13Issues:30

Physics-Aware-Training

Instructional implementation of Physics-Aware Training (PAT) with demonstrations on simulated experiments.

Language:Jupyter NotebookLicense:CC-BY-4.0Stargazers:273Issues:15Issues:5

xl-sum

This repository contains the code, data, and models of the paper titled "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 44 Languages" published in Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021.

trak

A fast, effective data attribution method for neural networks in PyTorch

Language:PythonLicense:MITStargazers:145Issues:9Issues:42

eraserbenchmark

A benchmark for understanding and evaluating rationales: http://www.eraserbenchmark.com/

Language:PythonLicense:Apache-2.0Stargazers:97Issues:10Issues:9

jestimator

Amos optimizer with JEstimator lib.

Language:PythonLicense:Apache-2.0Stargazers:77Issues:5Issues:3

brmp

Bayesian Regression Models in Pyro

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:68Issues:12Issues:51

backpacks-flash-attn

The original Backpack Language Model implementation, a fork of FlashAttention

Language:PythonLicense:BSD-3-ClauseStargazers:62Issues:2Issues:4
Language:PythonLicense:NOASSERTIONStargazers:49Issues:6Issues:6

torchscale

Transformers at any scale

Language:PythonLicense:MITStargazers:40Issues:0Issues:0

time_interpret

Unified Model Interpretability Library for Time Series

Language:PythonLicense:MITStargazers:29Issues:3Issues:2

tokenizations

Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/

Language:RustLicense:MITStargazers:25Issues:2Issues:0

learning-scaffold

This is the official implementation for the paper "Learning to Scaffold: Optimizing Model Explanations for Teaching"

Language:Jupyter NotebookStargazers:19Issues:3Issues:0

robust-attribution-regularization

Robust Attribution Regularization

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:16Issues:2Issues:2

DiffuSum

codebase for paper DiffuSum: Generation Enhanced Extractive Summarization with Diffusion

Language:PythonLicense:Apache-2.0Stargazers:15Issues:2Issues:5
Language:PythonStargazers:8Issues:1Issues:0

autoalign

Align then Summarize: Automatic Alignment Methods for Summarization Corpus Creation

Language:PythonLicense:MITStargazers:6Issues:3Issues:0

WebQAmGaze

WebQAmGaze, a multilingual low-cost eye-tracking dataset (using webgazer)

Language:JavaScriptLicense:NOASSERTIONStargazers:6Issues:2Issues:1
Language:Jupyter NotebookLicense:MITStargazers:5Issues:0Issues:0