farizikhwantri

Fariz Ikhwantri's starred repositories

llama

Inference code for LLaMA models

Language: Python | License: NOASSERTION | 50895 499 872

segment-anything

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook | License: Apache-2.0 | 44915 299 646

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language: Python | License: Apache-2.0 | 14532 108 923

codon

A high-performance, zero-overhead, extensible Python compiler using LLVM

Language: C++ | License: NOASSERTION | 13936 133 394

Scripts for fine-tuning Llama2 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization & question answering. Supports a number of candidate inference solutions, such as HF TGI and vLLM, for local or cloud deployment. Demo apps showcase Llama2 for WhatsApp & Messenger.

Language: Jupyter Notebook | License: NOASSERTION | 7850 68 227

trl

Train transformer language models with reinforcement learning.

Language: Python | License: Apache-2.0 | 4589 53 321

fairscale

PyTorch extensions for high performance and large scale training.

Language: Python | License: NOASSERTION | 2961 44 357

RL4LMs

A modular RL library to fine-tune language models to human preferences

Language: Python | License: Apache-2.0 | 2114 26 54

pystack

🔍 🐍 Like pstack but for Python!

Language: Python | License: Apache-2.0 | 965 12 49

curated-transformers

🤖 A PyTorch library of curated Transformer models and their composable components

Language: Python | License: MIT | 849 14 31

the-story-of-heads

This is a repository with the code for the ACL 2019 paper "Analyzing Multi-Head Self-Attention: Specialized Heads Do the Heavy Lifting, the Rest Can Be Pruned" and the ACL 2021 paper "Analyzing Source and Target Contributions to NMT Predictions".

Language: Python | 284 8 8

Diffusion-BERT

ACL'2023: DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models

Language: Python | License: Apache-2.0 | 276 13 30

Physics-Aware-Training

Instructional implementation of Physics-Aware Training (PAT) with demonstrations on simulated experiments.

Language: Jupyter Notebook | License: CC-BY-4.0 | 273 15 5

xl-sum

This repository contains the code, data, and models of the paper titled "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 44 Languages" published in Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021.

Language: Python | 247 6 15