samsara1995

samsara's starred repositories

NumCpp

C++ implementation of the Python Numpy library

Language:C++MIT341100

pytorch-lightning

Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.

Language:PythonApache-2.02718500

BladeDISC

BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.

Language:C++Apache-2.076200

llamafile

Distribute and run LLMs with a single file.

Language:C++NOASSERTION1612400

tensorrtx

Implementation of popular deep learning networks with TensorRT network definition API

Language:C++MIT665700

onnx-mlir

Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure

Language:C++Apache-2.069600

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language:PythonApache-2.01173000

mistral-inference

Official inference library for Mistral models

Language:Jupyter NotebookApache-2.0892500

llm-action

本项目旨在分享大模型相关技术原理以及实战经验。

Language:HTMLApache-2.0692700

ControlNet_TensorRT

天池 NVIDIA TensorRT Hackathon 2023 —— 生成式AI模型优化赛初赛第三名方案

Language:Python4300

ControlNet

Let us control diffusion models!

Language:PythonApache-2.02838400

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language:Jupyter NotebookMIT2286600

optimum

🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools

Language:PythonApache-2.0222600

ChatGLM3

ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型

Language:PythonApache-2.01268900

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonApache-2.02317300

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Language:PythonApache-2.03965200

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonApache-2.02033500

torch-mlir

The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.

Language:C++NOASSERTION121200

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++Apache-2.0699600

buddy-mlir

An MLIR-based compiler framework bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).

Language:C++Apache-2.043000

s4

Structured state space sequence models

Language:Jupyter NotebookApache-2.0218100

Compass_Optimizer

Compass Optimizer (OPT for short), is part of the Zhouyi Compass Neural Network Compiler. The OPT is designed for converting the float Intermediate Representation (IR) generated by the Compass Unified Parser to an optimized quantized or mixed IR which is suited for Zhouyi NPU hardware platforms.

Language:PythonApache-2.02300