brucechin

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonApache-2.011682 206 2247

FlexGen

Running large language models on a single GPU for throughput-oriented scenarios.

Language:PythonApache-2.09124 111 81

text-generation-inference

Large Language Model Text Generation Inference

Language:PythonApache-2.08847 99 1314

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++Apache-2.08309 89 1826

lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Language:Jupyter NotebookApache-2.06990 59 138

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonNOASSERTION6070 45 80

FasterTransformer

Transformer related optimization, including BERT, GPT

Language:C++Apache-2.05805 62 625

FlagAI

FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model.

Language:PythonApache-2.03816 43 210

beringei

Beringei is a high performance, in-memory storage engine for time series data.

Language:C++NOASSERTION3173 2000

deepsparse

Sparsity-aware deep learning inference runtime for CPUs

Language:PythonNOASSERTION2986 56 136

risc0

RISC Zero is a zero-knowledge verifiable general computing platform based on zk-STARKs and the RISC-V microarchitecture.

Language:C++Apache-2.01632 52 530

OpenDiT

OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference

Language:PythonApache-2.01413 23 60

DeepSeek-LLM

DeepSeek LLM: Let there be answers

Language:MakefileMIT1404 24 32

LLMs_interview_notes

该仓库主要记录大模型（LLMs）算法工程师相关的面试题

Apache-2.01399 10 1

MetaCLIP

ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering

Language:PythonNOASSERTION1181 12 27

llm-hallucination-survey

Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"

908 11 3