Beast code in Giters

Pumpkin's starred repositories

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonApache-2.01095100

GaLore

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Language:PythonApache-2.0126600

MAC

Online Adaptation of Language Models with a Memory of Amortized Contexts

Language:Python4700

ChatDev

Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)

Language:ShellApache-2.02450400

tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Language:PythonMIT1115500

Efficient-LLMs-Survey

[TMLR 2024] Efficient Large Language Models: A Survey

85600

LongLoRA

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Language:PythonApache-2.0255400

LongChat

Official repository for LongChat and LongEval

Language:PythonApache-2.050000

LLM-Conversation-Safety

[NAACL2024] Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey

5000

ReAct

[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models

Language:Jupyter NotebookMIT175800

mamba

Mamba SSM architecture

Language:PythonApache-2.01171200

distrifuser

[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

Language:PythonMIT50600

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language:PythonAGPL-3.0255400

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonApache-2.01091900

llm-awq

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Language:PythonMIT213800

BitNet

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch

Language:PythonMIT145600

llama_parse

Parse files for optimal RAG

Language:PythonMIT189000

ring-flash-attention

Ring attention implementation with flash attention

Language:Python44400

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

Language:PythonApache-2.0165200

faiss

A library for efficient similarity search and clustering of dense vectors.

Language:C++MIT2953000

chatgpt-retrieval-plugin

The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.

Language:PythonMIT2097000

plugins-quickstart

Get a ChatGPT plugin up and running in under 5 minutes!

Language:PythonMIT425000

evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Language:PythonNOASSERTION1438900

ringattention

Transformers with Arbitrarily Large Context

Language:PythonApache-2.057500

LWM

Language:PythonApache-2.0701300

guided-diffusion

Language:PythonMIT588900

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonNOASSERTION569100

yet-another-applied-llm-benchmark

A benchmark to evaluate language models on questions I've previously asked them to solve.

Language:PythonGPL-3.080700

Neko9810