Shahul ES (shahules786)

Location: Terra

Home Page: shahules786.github.io

Twitter: @shahules786


Organizations
explodinggradients

Shahul ES's starred repositories

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Language: Python | License: Apache-2.0 | Stargazers: 23820 | Issues: 162 | Issues: 3752

axolotl

Go ahead and axolotl questions

Language: Python | License: Apache-2.0 | Stargazers: 6459 | Issues: 48 | Issues: 579

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language: Python | License: MIT | Stargazers: 6293 | Issues: 61 | Issues: 76

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language: Python | License: MIT | Stargazers: 5519 | Issues: 33 | Issues: 778

openchat

OpenChat: Advancing Open-source Language Models with Imperfect Data

Language: Python | License: Apache-2.0 | Stargazers: 5071 | Issues: 51 | Issues: 180

OpenCopilot

🤖 🔥 Language-to-actions engine

Language: TypeScript | License: MIT | Stargazers: 4949 | Issues: 46 | Issues: 99

LongLoRA

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Language: Python | License: Apache-2.0 | Stargazers: 2507 | Issues: 13 | Issues: 168

deepeval

The LLM Evaluation Framework

Language: Python | License: Apache-2.0 | Stargazers: 2061 | Issues: 15 | Issues: 191

1password-teams-open-source

Get a free 1Password Teams membership for your open source project

fastmoe

A fast MoE impl for PyTorch

Language: Python | License: Apache-2.0 | Stargazers: 1430 | Issues: 12 | Issues: 113

OpenMoE

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

mixture-of-experts

PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538

Language: Python | License: GPL-3.0 | Stargazers: 870 | Issues: 3 | Issues: 25
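
As a rough illustration of what a re-implemented sparsely-gated layer like the one above does, here is a minimal top-k gated MoE sketch in PyTorch. It is not the repository's code; the class name `SparseMoE`, the expert architecture, and all hyperparameters are assumptions made for this example.

```python
# Minimal sketch of a sparsely-gated MoE layer in the spirit of Shazeer et al. (2017).
# Illustrative toy, not the starred repository's implementation.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SparseMoE(nn.Module):
    def __init__(self, d_model: int, d_hidden: int, n_experts: int = 8, k: int = 2):
        super().__init__()
        self.k = k
        self.gate = nn.Linear(d_model, n_experts, bias=False)
        self.experts = nn.ModuleList([
            nn.Sequential(nn.Linear(d_model, d_hidden), nn.ReLU(), nn.Linear(d_hidden, d_model))
            for _ in range(n_experts)
        ])

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, d_model). Route each token to its top-k experts.
        logits = self.gate(x)                              # (batch, n_experts)
        topk_vals, topk_idx = logits.topk(self.k, dim=-1)  # (batch, k)
        weights = F.softmax(topk_vals, dim=-1)             # renormalise over the chosen experts
        out = torch.zeros_like(x)
        for slot in range(self.k):
            idx = topk_idx[:, slot]                        # expert id per token for this slot
            w = weights[:, slot].unsqueeze(-1)             # gate weight per token
            for e, expert in enumerate(self.experts):
                mask = idx == e
                if mask.any():
                    out[mask] += w[mask] * expert(x[mask])
        return out

# Usage: moe = SparseMoE(d_model=64, d_hidden=256); y = moe(torch.randn(4, 64))
```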

punica

Serving multiple LoRA-finetuned LLMs as one

Language: Python | License: Apache-2.0 | Stargazers: 862 | Issues: 14 | Issues: 37

SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Language: Python | License: Apache-2.0 | Stargazers: 853 | Issues: 10 | Issues: 26

awesome-mixture-of-experts

A collection of AWESOME things about mixture-of-experts

Language: Python | License: Apache-2.0 | Stargazers: 765 | Issues: 12 | Issues: 34

vec2text

utilities for decoding deep representations (like sentence embeddings) back to text

Language: Python | License: NOASSERTION | Stargazers: 625 | Issues: 13 | Issues: 38

zeno-build

Build, evaluate, understand, and fix LLM-based apps

Language: Jupyter Notebook | License: MIT | Stargazers: 480 | Issues: 9 | Issues: 69

Awesome-Mixture-of-Experts-Papers

A curated reading list of research in Mixture-of-Experts (MoE).

MS-AMP

Microsoft Automatic Mixed Precision Library

Language: Python | License: MIT | Stargazers: 471 | Issues: 11 | Issues: 57

pykoi-rlhf-finetuned-transformers

pykoi: Active learning in one unified interface

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 403 | Issues: 10 | Issues: 4

AnglE

Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard

Language: Python | License: MIT | Stargazers: 380 | Issues: 8 | Issues: 40

st-moe-pytorch

Implementation of ST-MoE, the latest incarnation of MoE after years of research at Brain, in PyTorch

Language: Python | License: MIT | Stargazers: 241 | Issues: 5 | Issues: 11

MoEBERT

This PyTorch package implements MoEBERT: from BERT to Mixture-of-Experts via Importance-Guided Adaptation (NAACL 2022).

Language: Python | License: Apache-2.0 | Stargazers: 93 | Issues: 1 | Issues: 6

LLM-SLERP-Merge

Spherical merge of PyTorch/HF-format language models with minimal feature loss.
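
For reference, spherical linear interpolation (SLERP) of two weight tensors, the operation that SLERP-style model merging applies parameter by parameter, looks roughly like the sketch below. This is a generic illustration, not the repository's implementation; the function name `slerp`, the epsilon, and the fallback-to-lerp threshold are assumptions.

```python
# Minimal sketch of SLERP between two weight tensors of the same shape.
import torch

def slerp(a: torch.Tensor, b: torch.Tensor, t: float, eps: float = 1e-8) -> torch.Tensor:
    a_flat, b_flat = a.flatten().float(), b.flatten().float()
    a_dir = a_flat / (a_flat.norm() + eps)
    b_dir = b_flat / (b_flat.norm() + eps)
    omega = torch.arccos(torch.clamp(a_dir @ b_dir, -1.0, 1.0))  # angle between the two vectors
    so = torch.sin(omega)
    if so.abs() < eps:
        # Nearly parallel vectors: fall back to plain linear interpolation.
        merged = (1.0 - t) * a_flat + t * b_flat
    else:
        merged = (torch.sin((1.0 - t) * omega) / so) * a_flat + (torch.sin(t * omega) / so) * b_flat
    return merged.reshape(a.shape).to(a.dtype)

# Usage: merged_weight = slerp(model_a_state["w"], model_b_state["w"], t=0.5)
```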

MDEL

Multi-Domain Expert Learning

Language: Python | License: Apache-2.0 | Stargazers: 68 | Issues: 21 | Issues: 29

soft-mixture-of-experts

PyTorch implementation of Soft MoE by Google Brain in "From Sparse to Soft Mixtures of Experts" (https://arxiv.org/pdf/2308.00951.pdf)

Language: Python | License: MIT | Stargazers: 60 | Issues: 3 | Issues: 0

decontamination

This repository contains code for cleaning your training data of benchmark data to help combat data snooping.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 25 | Issues: 3 | Issues: 0
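
The general n-gram overlap idea behind this kind of decontamination can be sketched as follows. This is a generic illustration, not the repository's pipeline; the 13-gram window, whitespace tokenization, and helper names are assumptions.

```python
# Minimal sketch of n-gram based decontamination: drop training documents that
# share any long n-gram with a benchmark document. Illustrative only.
def ngrams(text: str, n: int = 13) -> set:
    tokens = text.lower().split()
    return {" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}

def is_contaminated(train_doc: str, benchmark_docs: list, n: int = 13) -> bool:
    """Flag a training document if it shares any n-gram with a benchmark document."""
    bench_grams = set()
    for doc in benchmark_docs:
        bench_grams |= ngrams(doc, n)
    return bool(ngrams(train_doc, n) & bench_grams)

# Usage: keep only training docs where is_contaminated(doc, benchmark_docs) is False.
```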

spade-experiments

Experiments to assess SPADE on different LLM pipelines.

Language: Python | Stargazers: 15 | Issues: 0 | Issues: 0