Weikai Tang (yofufufufu)

yofufufufu

Geek Repo

Company:Jilin University

Github PK Tool:Github PK Tool

Weikai Tang's repositories

ATOS

Multi-GPU dynamic scheduler using PGAS style cross-GPU communication

Language:CudaStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

code-samples

Source code examples from the Parallel Forall Blog

Language:HTMLLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

dgl

Python package built to ease deep learning on graph, on top of existing DL frameworks.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

discrete-mathematics-vocabulary

discrete mathematics vocabulary

Stargazers:0Issues:1Issues:0

How_to_optimize_in_GPU

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.

Language:CudaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

SGEMM_CUDA

Fast CUDA matrix multiplication from scratch

Language:CudaLicense:MITStargazers:0Issues:0Issues:0

stdgpu

stdgpu: Efficient STL-like Data Structures on the GPU

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

VBlog

V部落,Vue+SpringBoot实现的多用户博客管理平台!

Language:JavaStargazers:0Issues:0Issues:0