matrix97317's starred repositories

leetcode-master

《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀

run

润学全球官方指定GITHUB,整理润学宗旨、纲领、理论和各类润之实例;解决为什么润,润去哪里,怎么润三大问题; 并成为新**人的核心宗教,核心信念。

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++License:Apache-2.0Stargazers:8325Issues:89Issues:1829

MatX

An efficient C++17 GPU numerical computing library with Python-like syntax

Language:C++License:BSD-3-ClauseStargazers:1194Issues:24Issues:190

How_to_optimize_in_GPU

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.

Language:CudaLicense:Apache-2.0Stargazers:808Issues:13Issues:15

hidet

An open-source efficient deep learning framework/compiler, written in python.

Language:PythonLicense:Apache-2.0Stargazers:648Issues:17Issues:84

ss-python

An evolving Python project template that covers the full development lifecycle.

Language:JinjaLicense:MITStargazers:74Issues:4Issues:180

OneNeuralNetwork

This is a cross-chip platform collection of operators and a unified neural network library.

Language:PythonLicense:Apache-2.0Stargazers:12Issues:2Issues:1

Awesome-Embodied-AI

This repository mainly organizes resources related to embodied intelligence, including data, models, hardware, and software infrastructure.

License:MITStargazers:9Issues:0Issues:0

pig-solver

This is a toy deep-learning computing framework( such as Pytorch,Caffe etc.).

Language:C++Stargazers:1Issues:1Issues:0