HuaYangCheng

HuaYangCheng's starred repositories

PTX-ISA

Chinese translation of the CUDA PTX ISA documentation

License: Apache-2.0 | Stargazers: 22 | Issues: 0

Stargazers: 31 | Issues: 0

nncpp

[WIP] NNCpp - Neural Networks in C++ with CUDA ops

Language: Cuda | License: MIT | Stargazers: 1 | Issues: 0

CogDL

CogDL: A Comprehensive Library for Graph Deep Learning (WWW 2023)

Language: Python | License: MIT | Stargazers: 1709 | Issues: 0

Language: C++ | Stargazers: 56 | Issues: 0

NiuTensor

NiuTensor is an open-source toolkit developed by a joint team from NLP Lab. at Northeastern University and the NiuTrans Team. It provides tensor utilities to create and train neural networks.

Language: C++ | License: Apache-2.0 | Stargazers: 380 | Issues: 0

Computer-Science-Textbooks

A collection of CS textbooks for learning.

Stargazers: 457 | Issues: 0

REKCARC-TSC-UHT

Guidance for courses in the Department of Computer Science and Technology, Tsinghua University

Language: HTML | License: CC-BY-SA-4.0 | Stargazers: 32910 | Issues: 0

CUDALibrarySamples

CUDA Library Samples

Language: Cuda | License: NOASSERTION | Stargazers: 1451 | Issues: 0

Simple_CUDA_GEMM

SGEMM kernel for NVIDIA Pascal GPUs, able to reach 60% of theoretical peak performance.

Language: Cuda | License: GPL-3.0 | Stargazers: 5 | Issues: 0

SGEMM-Implementation-and-Optimization

Source code for matrix multiplication implementations in CUDA

Language: Cuda | Stargazers: 35 | Issues: 0

Language: CSS | Stargazers: 17 | Issues: 0

SGEMM_CUDA

Fast CUDA matrix multiplication from scratch

Language: Cuda | License: MIT | Stargazers: 375 | Issues: 0

LLVM_for_cpu0

A tutorial for learning LLVM that implements a backend generating machine code for cpu0, a simple RISC CPU.

Language: C++ | Stargazers: 193 | Issues: 0

llvm-ir-tutorial

A beginner's guide to LLVM IR

Language: LLVM | License: CC-BY-4.0 | Stargazers: 1268 | Issues: 0

Professional-CUDA-C-Programming-Code-and-Notes

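Code implementations for Professional CUDA C Programming. Covers most of the code from chapters 2 through 8 of the book, along with the author's notes. Everything was implemented by hand by the author, so mistakes are inevitable; please use it with caution, and corrections are very welcome. If you find it helpful, please star the repository; it helps the author a lot. Thank you!

Language: Cuda | Stargazers: 247 | Issues: 0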

tvm_walk_through

Code reading for TVM

Language: Python | Stargazers: 69 | Issues: 0

tvm_mlir_learn

A collection of compiler learning resources.

Language: Python | Stargazers: 1984 | Issues: 0

zju-icicles

Zhejiang University course guide sharing project

Language: HTML | Stargazers: 36729 | Issues: 0

CuAssembler

An unofficial CUDA assembler, for all generations of SASS, hopefully :)

Language: Python | License: MIT | Stargazers: 375 | Issues: 0

gdev

First-Class GPU Resource Management: Device Drivers, Runtimes, and CUDA Compilers for Nouveau.

Language: C | License: MIT | Stargazers: 338 | Issues: 0

kernelgen

A prototype of LLVM-based auto-parallelizing Fortran/C compiler for NVIDIA GPUs, targeting numerical modeling code

Language: C++ | License: NOASSERTION | Stargazers: 4 | Issues: 0

Language: C | License: MIT | Stargazers: 1 | Issues: 0

gdev

First-Class GPU Resource Management: Device Drivers, Runtimes, and CUDA Compilers for Nouveau.

Language: C | License: MIT | Stargazers: 44 | Issues: 0

Language: C | License: MIT | Stargazers: 1 | Issues: 0

ImageNet-Datasets-Downloader

ImageNet dataset downloader. Creates a custom dataset by specifying the required number of classes and images in a class.

Language: Python | Stargazers: 495 | Issues: 0

ImageNet-1K

ImageNet-1K data download and processing for use as a dataset

Language: Python | Stargazers: 48 | Issues: 0

pytorch_quantization

A PyTorch implementation of DoReFa quantization

Language: Python | License: MIT | Stargazers: 111 | Issues: 0

DFQ

PyTorch implementation of Data Free Quantization Through Weight Equalization and Bias Correction.

Language: Python | License: MIT | Stargazers: 254 | Issues: 0