DewEfresh's repositories

BitNet

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in PyTorch

Language: Python | License: MIT

BitNet-Transformers

0️⃣1️⃣🤗 BitNet-Transformers: Hugging Face Transformers implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in PyTorch with the Llama(2) architecture

Language: Python

graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Language: Python | License: MIT

Kosmos2.5

My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"

Language: Python | License: MIT

Magic-AI-Wiki

Language: Python

mamba-1.58bits

Language: Python | License: Apache-2.0

matmulfreellm

Implementation for MatMul-free LM.

Language: Python | License: Apache-2.0

MS-AMP

Microsoft Automatic Mixed Precision Library

Language: Python | License: MIT

qmoe

Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".

Language: Python | License: Apache-2.0

relora

Official code for ReLoRA from the paper "Stack More Layers Differently: High-Rank Training Through Low-Rank Updates"

Language: Jupyter Notebook | License: Apache-2.0

RWKV-infctx-trainer

RWKV infctx trainer, for training arbitrary context sizes, to 10k and beyond!

Language: Jupyter Notebook | License: Apache-2.0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language: Python | License: MIT