StickCui

followers

following

stars

Institute of Computing Technology Chinese Academy of Sciences

Beijing, China

https://stickcui.github.io/

Stick Cui's starred repositories

llama

Inference code for LLaMA models

Language:PythonNOASSERTION50895 499 872

text-generation-webui

A Gradio web UI for Large Language Models.

Language:PythonAGPL-3.038612 329 3519

GPTs

leaked prompts of GPTs

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language:PythonApache-2.027657 187 4381

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonApache-2.023577 218 3597

simdjson

Parsing gigabytes of JSON per second : used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks

Language:C++Apache-2.018863 241 816

btop

A monitor of resources

Language:C++Apache-2.018391 105 562

HandBrake

HandBrake's main development repository

Language:CNOASSERTION16764 287 4591

ChatGLM2-6B

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

Language:PythonNOASSERTION15646 134 615

ChatGLM3

ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型

Language:PythonApache-2.013157 99 758

unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonApache-2.013055 90 611

flash-attention

Fast and memory-efficient exact attention

Language:PythonBSD-3-Clause12594 117 916

eza

A modern alternative to ls

Language:RustMIT9797 20 396

PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

Language:C++MIT7687 75 152

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++Apache-2.07660 89 1627

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language:PythonApache-2.07391 110 150

InternLM

Official release of InternLM2.5 7B base and chat models. 1M context support

Language:PythonApache-2.05878 54 306

Baichuan-7B

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Language:PythonApache-2.05662 66 127

YAYI

雅意大模型：为客户打造安全可靠的专属大模型，基于大规模中英文多领域指令数据训练的 LlaMA 2 & BLOOM 系列模型，由中科闻歌算法团队研发。(Repo for YaYi Chinese LLMs based on LlaMA2 & BLOOM)

Language:PythonApache-2.03245 12 11

LLMDataHub

A quick guide (especially) for trending instruction finetuning datasets

Vary

[ECCV2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

Language:Python1671 53 121

long_llama

LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.

Language:PythonApache-2.01443 26 24

FastEdit

🩹Editing large language models within 10 seconds⚡

Language:PythonApache-2.01246 14 27

MOSS-RLHF

MOSS-RLHF

Language:PythonApache-2.01235 34 51

XVERSE-13B

XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.

Language:PythonApache-2.0649 18 31

tensorrtllm_backend

The Triton TensorRT-LLM Backend

Language:PythonApache-2.0611 21 428

WanJuan1.0

万卷1.0多模态语料

CC-BY-4.0446 9 28

CValues

面向中文大模型价值观的评估与对齐研究

Language:PythonApache-2.0443 1 7

collie

Collaborative Training of Large Language Models in an Efficient Way

Language:PythonApache-2.0352 9 62

MMMU

This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"

Language:PythonApache-2.0301 4 26