Stick Cui (StickCui)

Company: Institute of Computing Technology, Chinese Academy of Sciences

Location: Beijing, China

Home Page: https://stickcui.github.io/

Stick Cui's starred repositories

llama

Inference code for LLaMA models

Language: Python | License: NOASSERTION | Stargazers: 50895 | Issues: 499 | Issues: 872

text-generation-webui

A Gradio web UI for Large Language Models.

Language: Python | License: AGPL-3.0 | Stargazers: 38612 | Issues: 329 | Issues: 3519

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language: Python | License: Apache-2.0 | Stargazers: 27657 | Issues: 187 | Issues: 4381

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python | License: Apache-2.0 | Stargazers: 23577 | Issues: 218 | Issues: 3597

simdjson

Parsing gigabytes of JSON per second: used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks

Language: C++ | License: Apache-2.0 | Stargazers: 18863 | Issues: 241 | Issues: 816

btop

A monitor of resources

Language: C++ | License: Apache-2.0 | Stargazers: 18391 | Issues: 105 | Issues: 562

HandBrake

HandBrake's main development repository

Language: C | License: NOASSERTION | Stargazers: 16764 | Issues: 287 | Issues: 4591

ChatGLM2-6B

ChatGLM2-6B: An Open Bilingual Chat LLM

Language: Python | License: NOASSERTION | Stargazers: 15646 | Issues: 134 | Issues: 615

ChatGLM3

ChatGLM3 series: Open Bilingual Chat LLMs

Language: Python | License: Apache-2.0 | Stargazers: 13157 | Issues: 99 | Issues: 758

unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language: Python | License: Apache-2.0 | Stargazers: 13055 | Issues: 90 | Issues: 611

flash-attention

Fast and memory-efficient exact attention

Language: Python | License: BSD-3-Clause | Stargazers: 12594 | Issues: 117 | Issues: 916

eza

A modern alternative to ls

Language: Rust | License: MIT | Stargazers: 9797 | Issues: 20 | Issues: 396

PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

Language: C++ | License: MIT | Stargazers: 7687 | Issues: 75 | Issues: 152

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language: C++ | License: Apache-2.0 | Stargazers: 7660 | Issues: 89 | Issues: 1627

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language: Python | License: Apache-2.0 | Stargazers: 7391 | Issues: 110 | Issues: 150

InternLM

Official release of InternLM2.5 7B base and chat models. 1M context support

Language: Python | License: Apache-2.0 | Stargazers: 5878 | Issues: 54 | Issues: 306

Baichuan-7B

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Language: Python | License: Apache-2.0 | Stargazers: 5662 | Issues: 66 | Issues: 127

YAYI

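YaYi large language models: secure, reliable, customized models built for clients. LlaMA 2 & BLOOM series models trained on large-scale Chinese and English multi-domain instruction data, developed by the Wenge (中科闻歌) algorithm team. (Repo for YaYi Chinese LLMs based on LlaMA2 & BLOOM)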

Language: Python | License: Apache-2.0 | Stargazers: 3245 | Issues: 12 | Issues: 11

LLMDataHub

A quick guide (especially) for trending instruction finetuning datasets

Vary

[ECCV2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

long_llama

LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.

Language: Python | License: Apache-2.0 | Stargazers: 1443 | Issues: 26 | Issues: 24

FastEdit

🩹Editing large language models within 10 seconds⚡

Language: Python | License: Apache-2.0 | Stargazers: 1246 | Issues: 14 | Issues: 27

MOSS-RLHF

MOSS-RLHF

Language: Python | License: Apache-2.0 | Stargazers: 1235 | Issues: 34 | Issues: 51

XVERSE-13B

XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.

Language: Python | License: Apache-2.0 | Stargazers: 649 | Issues: 18 | Issues: 31

tensorrtllm_backend

The Triton TensorRT-LLM Backend

Language: Python | License: Apache-2.0 | Stargazers: 611 | Issues: 21 | Issues: 428

WanJuan1.0

WanJuan 1.0 multimodal corpus

CValues

Research on evaluating and aligning the values of Chinese large language models

Language: Python | License: Apache-2.0 | Stargazers: 443 | Issues: 1 | Issues: 7

collie

Collaborative Training of Large Language Models in an Efficient Way

Language: Python | License: Apache-2.0 | Stargazers: 352 | Issues: 9 | Issues: 62

MMMU

This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"

Language: Python | License: Apache-2.0 | Stargazers: 301 | Issues: 4 | Issues: 26