Lianke Qin's starred repositories
LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
awesome-quant
A curated list of insanely awesome libraries, packages and resources for Quants (Quantitative Finance)
flash-attention
Fast and memory-efficient exact attention
text-generation-inference
Large Language Model Text Generation Inference
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
FasterTransformer
Transformer-related optimization, including BERT and GPT
deepsparse
Sparsity-aware deep learning inference runtime for CPUs
DeepSeek-LLM
DeepSeek LLM: Let there be answers
LLMs_interview_notes
This repository mainly collects interview questions for large language model (LLM) algorithm engineers
llm-hallucination-survey
Reading list on hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"
llm-engine
Scale LLM Engine public repository