Yan Yucheng (EzioZz)

Company: University of Chinese Academy of Sciences

Location: Beijing

Yan Yucheng's starred repositories

triton

Development repository for the Triton language and compiler

TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

Language: C++ | License: Apache-2.0 | Stargazers: 9341 | Issues: 150 | Issues: 3498

LLMsPracticalGuide

A curated list of practical guide resources for LLMs (LLMs Tree, Examples, Papers)

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language: C++ | License: Apache-2.0 | Stargazers: 7090 | Issues: 82 | Issues: 1437

ROCm

AMD ROCm™ Software - GitHub Home

Language: Shell | License: MIT | Stargazers: 4222 | Issues: 211 | Issues: 2231

ucasthesis

LaTeX Thesis Template for the University of Chinese Academy of Sciences

iree

A retargetable MLIR-based machine learning compiler and runtime toolkit.

Language: C++ | License: Apache-2.0 | Stargazers: 2377 | Issues: 84 | Issues: 3522

starcoder2

Home of StarCoder2!

Language: Python | License: Apache-2.0 | Stargazers: 1538 | Issues: 18 | Issues: 17

torch-mlir

The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.

Language: C++ | License: NOASSERTION | Stargazers: 1218 | Issues: 250 | Issues: 638

lagent

A lightweight framework for building LLM-based agents

Language: Python | License: Apache-2.0 | Stargazers: 965 | Issues: 11 | Issues: 44

maxas

Assembler for NVIDIA Maxwell architecture

Language: Sass | License: MIT | Stargazers: 921 | Issues: 88 | Issues: 11

PatrickStar

PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP and democratizes AI for everyone.

Language: Python | License: BSD-3-Clause | Stargazers: 742 | Issues: 16 | Issues: 56

buddy-mlir

An MLIR-based compiler framework that bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).

Language: C++ | License: Apache-2.0 | Stargazers: 432 | Issues: 12 | Issues: 49

rocBLAS

Next generation BLAS implementation for ROCm platform

Language: C++ | License: NOASSERTION | Stargazers: 326 | Issues: 59 | Issues: 151

composable_kernel

Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators

Language: C++ | License: NOASSERTION | Stargazers: 237 | Issues: 25 | Issues: 195

xdsl

A Python Compiler Design Toolkit

Language: Python | License: NOASSERTION | Stargazers: 207 | Issues: 18 | Issues: 393

Language: Python | License: Apache-2.0 | Stargazers: 193 | Issues: 28 | Issues: 115

LightSeq

Official repository for LightSeq: Sequence Level Parallelism for Distributed Training of Long Context Transformers

Language: C++ | License: Apache-2.0 | Stargazers: 126 | Issues: 4 | Issues: 37

ppcg

Polyhedral Parallel Code Generation (source repository: http://repo.or.cz/ppcg.git)

Language: C | License: MIT | Stargazers: 112 | Issues: 13 | Issues: 2

rocSPARSE

Next generation SPARSE implementation for ROCm platform

Language: C++ | License: MIT | Stargazers: 110 | Issues: 36 | Issues: 38

flash-attention

Fast and memory-efficient exact attention

Language: Python | License: BSD-3-Clause | Stargazers: 102 | Issues: 9 | Issues: 29

isl

Integer Set Library (source repository: http://repo.or.cz/w/isl.git)

Language: C | License: MIT | Stargazers: 62 | Issues: 0 | Issues: 0

MISA

Machine Intelligence Shader Autogen. AMDGPU ML shader code generator. (previously iGEMMgen)

Language: Python | License: MIT | Stargazers: 32 | Issues: 26 | Issues: 15

Triton-Compiler

Triton Compiler related materials.

License: MIT | Stargazers: 25 | Issues: 0 | Issues: 0

tvm-gdb-commands

Small set of gdb commands for useful tasks in tvm

Language: Python | License: MIT | Stargazers: 15 | Issues: 1 | Issues: 0

swDNN

A highly efficient library for deep neural networks based on the Sunway TaihuLight supercomputer.

Language: Roff | Stargazers: 14 | Issues: 3 | Issues: 0

swGEMM

A highly efficient library for GEMM operations on Sunway TaihuLight

Language: Jupyter Notebook | License: GPL-3.0 | Stargazers: 9 | Issues: 1 | Issues: 0

antlr-4

Learn ANTLR 4 (with C++ examples and CMake)

Language: C++ | Stargazers: 5 | Issues: 2 | Issues: 0