z562's starred repositories

DeepSeek-Coder-V2

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

License: MIT · Stars: 1746 · Issues: 0

LLMTest_NeedleInAHaystack

Simple retrieval tests against LLMs at various context lengths to measure accuracy

Language: Jupyter Notebook · License: NOASSERTION · Stars: 1400 · Issues: 0

ChunkLlama

[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"

Language: Python · License: Apache-2.0 · Stars: 323 · Issues: 0

Awesome-Multimodal-Large-Language-Models

✨✨ Latest Advances on Multimodal Large Language Models

Stars: 11285 · Issues: 0

flash-attention

Fast and memory-efficient exact attention

Language: Python · License: BSD-3-Clause · Stars: 13055 · Issues: 0

pandas-ai

Chat with your database (SQL, CSV, pandas, Polars, MongoDB, NoSQL, etc.). PandasAI makes data analysis conversational using LLMs (GPT-3.5 / 4, Anthropic, Vertex AI) and RAG.

Language: Python · License: NOASSERTION · Stars: 12396 · Issues: 0

RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Language: Python · License: Apache-2.0 · Stars: 4491 · Issues: 0

open_llama

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

License: Apache-2.0 · Stars: 7329 · Issues: 0

LEval

[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark

Language: Python · License: GPL-3.0 · Stars: 332 · Issues: 0

llm-foundry

LLM training code for Databricks foundation models

Language: Python · License: Apache-2.0 · Stars: 3927 · Issues: 0

chain-of-thought-hub

Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

Language: Jupyter Notebook · License: MIT · Stars: 2482 · Issues: 0

FewCLUE

FewCLUE: a few-shot learning evaluation benchmark for Chinese

Language: Python · Stars: 489 · Issues: 0

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools so that you can focus on what matters.

Language: Python · License: MIT · Stars: 165960 · Issues: 0

LLaMA-Adapter

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Language: Python · License: GPL-3.0 · Stars: 5669 · Issues: 0

Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs, and parameter-efficient methods (e.g., LoRA, P-Tuning) for easy use. We welcome open-source enthusiasts to open any meaningful PR on this repo and help integrate as many LLM-related techniques as possible. (We have built a fine-tuning platform that makes it easy for researchers to get started with and use large models; we welcome open-source enthusiasts to submit any meaningful PR!)

Language: Jupyter Notebook · License: Apache-2.0 · Stars: 2545 · Issues: 0

triton

Development repository for the Triton language and compiler

Language: C++ · License: MIT · Stars: 12302 · Issues: 0

stanford_alpaca

Code and documentation to train Stanford's Alpaca models and generate the data.

Language: Python · License: Apache-2.0 · Stars: 29284 · Issues: 0

FLASHQuad_pytorch

FLASHQuad_pytorch

Language: Python · License: MIT · Stars: 65 · Issues: 0

Tnn

[ICLR 2023] Official implementation of Transnormer in our ICLR 2023 paper - Toeplitz Neural Network for Sequence Modeling

Language: Python · Stars: 70 · Issues: 0

RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be trained directly like a GPT (parallelizable), combining the best of RNNs and transformers: great performance, fast inference, low VRAM use, fast training, "infinite" ctx_len, and free sentence embeddings.

Language: Python · License: Apache-2.0 · Stars: 12131 · Issues: 0

knn-box

An easy-to-use kNN-MT toolkit

Language: Python · License: MIT · Stars: 101 · Issues: 0

Image-Super-Resolution-via-Iterative-Refinement

Unofficial PyTorch implementation of Image Super-Resolution via Iterative Refinement

Language: Python · License: Apache-2.0 · Stars: 3501 · Issues: 0

stable-diffusion

A latent text-to-image diffusion model

Language: Jupyter Notebook · License: NOASSERTION · Stars: 67197 · Issues: 0

transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Language: Python · License: Apache-2.0 · Stars: 130945 · Issues: 0

DiffuSeq

[ICLR'23] DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models

Language: Python · License: MIT · Stars: 705 · Issues: 0

CoNT

[NeurIPS'22 Spotlight] Data and code for our paper CoNT: Contrastive Neural Text Generation

Language: Python · Stars: 150 · Issues: 0

ParaSCI

A large scientific paraphrase dataset for longer paraphrase generation

Stars: 37 · Issues: 0

cosFormer

[ICLR 2022] Official implementation of cosformer-attention in cosFormer: Rethinking Softmax in Attention

Language: Python · License: Apache-2.0 · Stars: 176 · Issues: 0