j4yan / How_to_optimize_in_GPU

This is a series of GPU optimization topics. Here we will introduce how to optimize the program on the GPU in detail. The reduce optimization has been completed. The optimization of GEMM has completed the CUDA C code. The assembler is currently being used to tune the code, and the code will be issued later.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

This repository is not active

About

This is a series of GPU optimization topics. Here we will introduce how to optimize the program on the GPU in detail. The reduce optimization has been completed. The optimization of GEMM has completed the CUDA C code. The assembler is currently being used to tune the code, and the code will be issued later.

License:Apache License 2.0


Languages

Language:Cuda 97.6%Language:Shell 2.0%Language:Makefile 0.4%