HanBing Guo (T800GHB)

T800GHB

Geek Repo

Location:Shanghai,China

Github PK Tool:Github PK Tool

HanBing Guo's starred repositories

geektime-books

:books: 极客时间电子书

Stargazers:8292Issues:0Issues:0

tpu-mlir

Machine learning compiler based on MLIR for Sophgo TPU.

Language:C++License:NOASSERTIONStargazers:495Issues:0Issues:0

llama2.c

Inference Llama 2 in one file of pure C

Language:CLicense:MITStargazers:16542Issues:0Issues:0

Cpp-Templates-2ed

C++11/14/17/20 templates and generic programming, the most complex and difficult technical details of C++, indispensable in building infrastructure libraries.

Language:C++License:Apache-2.0Stargazers:1587Issues:0Issues:0

iree

A retargetable MLIR-based machine learning compiler and runtime toolkit.

Language:C++License:Apache-2.0Stargazers:2446Issues:0Issues:0

mlir-tutorial

MLIR For Beginners tutorial

Language:C++Stargazers:605Issues:0Issues:0

trtllm-llama

☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化

Language:C++License:Apache-2.0Stargazers:35Issues:0Issues:0

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++License:Apache-2.0Stargazers:7083Issues:0Issues:0

ComputerArchitectureAndCppBooks

📚 计算机体系结构与C++书籍收集(持续更新)

License:MITStargazers:248Issues:0Issues:0

TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

Language:C++License:Apache-2.0Stargazers:9333Issues:0Issues:0

byteir

A model compilation solution for various hardware

Language:MLIRLicense:Apache-2.0Stargazers:320Issues:0Issues:0

folly

An open-source C++ library developed and used at Facebook.

Language:C++License:Apache-2.0Stargazers:27300Issues:0Issues:0

code_generator

Simple and straightforward code generator for creating program code. At the moment offers support for C++, Java and HTML5 for generating reports.

Language:PythonLicense:MITStargazers:97Issues:0Issues:0

tvm_mlir_learn

compiler learning resources collect.

Language:PythonStargazers:1871Issues:0Issues:0

trt-samples-for-hackathon-cn

Simple samples for TensorRT programming

Language:PythonLicense:Apache-2.0Stargazers:1389Issues:0Issues:0

yoloair

🔥🔥🔥 专注于YOLOv5,YOLOv7、YOLOv8、YOLOv9改进模型,Support to improve backbone, neck, head, loss, IoU, NMS and other modules🚀

Language:PythonLicense:GPL-3.0Stargazers:2388Issues:0Issues:0

AITemplate

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Language:PythonLicense:Apache-2.0Stargazers:4472Issues:0Issues:0

doctest

The fastest feature-rich C++11/14/17/20/23 single-header testing framework

Language:C++License:MITStargazers:5660Issues:0Issues:0

cuda-samples

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

Language:CLicense:NOASSERTIONStargazers:5571Issues:0Issues:0

inter-operator-scheduler

[MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration

Language:C++License:MITStargazers:188Issues:0Issues:0

stb

stb single-file public domain libraries for C/C++

Language:CLicense:NOASSERTIONStargazers:25553Issues:0Issues:0

cutlass

CUDA Templates for Linear Algebra Subroutines

Language:C++License:NOASSERTIONStargazers:4730Issues:0Issues:0

tiny-cuda-nn

Lightning fast C++/CUDA neural network framework

Language:C++License:NOASSERTIONStargazers:3494Issues:0Issues:0

MonoDTR

MonoDTR: Monocular 3D Object Detection with Depth-Aware Transformer (CVPR 2022)

Language:PythonLicense:MITStargazers:122Issues:0Issues:0

lcm

Lightweight Communications and Marshalling

Language:JavaLicense:LGPL-2.1Stargazers:935Issues:0Issues:0

Point-Cloud-Processing-example

点云库PCL从入门到精通 书中配套案例

Language:C++Stargazers:472Issues:0Issues:0

pycls

Codebase for Image Classification Research, written in PyTorch.

Language:PythonLicense:MITStargazers:2117Issues:0Issues:0

ConvNeXt

Code release for ConvNeXt model

Language:PythonLicense:MITStargazers:5598Issues:0Issues:0

RODNet

RODNet: Radar object detection network

Language:PythonLicense:MITStargazers:222Issues:0Issues:0