Lianke Qin (brucechin)

brucechin

Geek Repo

Company:Bytedance Inc

Home Page:liankeqin.com

Github PK Tool:Github PK Tool

Lianke Qin's starred repositories

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:35354Issues:346Issues:1707

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:33463Issues:339Issues:2616

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Language:PythonLicense:Apache-2.0Stargazers:24667Issues:168Issues:3978

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:21364Issues:197Issues:3114

chatbox

User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)

Language:TypeScriptLicense:GPL-3.0Stargazers:19497Issues:123Issues:1336

awesome-quant

A curated list of insanely awesome libraries, packages and resources for Quants (Quantitative Finance)

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:11641Issues:104Issues:834

triton

Development repository for the Triton language and compiler

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonLicense:Apache-2.0Stargazers:10557Issues:195Issues:2121

FlexGen

Running large language models on a single GPU for throughput-oriented scenarios.

Language:PythonLicense:Apache-2.0Stargazers:9062Issues:109Issues:80

text-generation-inference

Large Language Model Text Generation Inference

Language:PythonLicense:Apache-2.0Stargazers:8269Issues:101Issues:1155

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++License:Apache-2.0Stargazers:7228Issues:84Issues:1477

lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6730Issues:59Issues:137

FasterTransformer

Transformer related optimization, including BERT, GPT

Language:C++License:Apache-2.0Stargazers:5604Issues:65Issues:623

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonLicense:NOASSERTIONStargazers:5521Issues:46Issues:73

FlagAI

FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model.

Language:PythonLicense:Apache-2.0Stargazers:3795Issues:43Issues:210

beringei

Beringei is a high performance, in-memory storage engine for time series data.

Language:C++License:NOASSERTIONStargazers:3166Issues:201Issues:0

deepsparse

Sparsity-aware deep learning inference runtime for CPUs

Language:PythonLicense:NOASSERTIONStargazers:2917Issues:55Issues:129

risc0

RISC Zero is a zero-knowledge verifiable general computing platform based on zk-STARKs and the RISC-V microarchitecture.

Language:C++License:Apache-2.0Stargazers:1528Issues:53Issues:488

DeepSeek-LLM

DeepSeek LLM: Let there be answers

Language:MakefileLicense:MITStargazers:1296Issues:21Issues:32

LLMs_interview_notes

该仓库主要记录 大模型(LLMs) 算法工程师相关的面试题

MetaCLIP

ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering

Language:PythonLicense:NOASSERTIONStargazers:1093Issues:13Issues:22

OpenDiT

OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference

Language:PythonLicense:Apache-2.0Stargazers:1050Issues:20Issues:51

llm-hallucination-survey

Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"

llm-engine

Scale LLM Engine public repository

Language:PythonLicense:Apache-2.0Stargazers:752Issues:21Issues:52

veScale

A PyTorch Native LLM Training Framework

Language:PythonLicense:Apache-2.0Stargazers:466Issues:34Issues:5

snowflake-jdbc

Snowflake JDBC Driver

Language:JavaLicense:Apache-2.0Stargazers:169Issues:22Issues:655
Stargazers:82Issues:0Issues:0