Liaukx's starred repositories

How_to_optimize_in_GPU

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.

Language:CudaLicense:Apache-2.0Stargazers:766Issues:0Issues:0

Examples

LaTeX Examples Document Source

Language:TeXLicense:NOASSERTIONStargazers:224Issues:0Issues:0

Examples

LaTeX Examples Document Source

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:8Issues:0Issues:0

kokkos

Kokkos C++ Performance Portability Programming Ecosystem: The Programming Model - Parallel Execution and Memory Abstraction

Language:C++License:NOASSERTIONStargazers:1811Issues:0Issues:0

hello-algo

《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing

Language:JavaLicense:NOASSERTIONStargazers:89501Issues:0Issues:0

LeetcodeTop

汇总各大互联网公司容易考察的高频leetcode题🔥

Stargazers:18377Issues:0Issues:0

ncnn-with-cuda

Tencent NCNN with added CUDA support

Language:C++License:NOASSERTIONStargazers:66Issues:0Issues:0

ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

Language:C++License:NOASSERTIONStargazers:19829Issues:0Issues:0

kernel_memory_management

总结整理linux内核的内存管理的资料,包含论文,文章,视频,以及应用程序的内存泄露,内存池相关

Stargazers:909Issues:0Issues:0

aimto408

🇨🇳🇨🇳🇨🇳这个repo是为了那些准备死磕 计算机考研 4️⃣0️⃣8️⃣的考研党准备的,当然你如果4门课中的部分也可以看看,欢迎star📝📝📝,祝你们一战成硕🏆🏆🏆~~(更新23年大纲变化----2023年408和数学基本无变化)

Stargazers:3942Issues:0Issues:0

how-to-optimize-gemm

row-major matmul optimization

Language:C++License:GPL-3.0Stargazers:570Issues:0Issues:0

cuda-samples

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

Language:CLicense:NOASSERTIONStargazers:5843Issues:0Issues:0
Language:PythonStargazers:77Issues:0Issues:0

Megatron-LM

Ongoing research training transformer models at scale

Language:PythonLicense:NOASSERTIONStargazers:9495Issues:0Issues:0

Coursera-ML-AndrewNg-Notes

吴恩达老师的机器学习课程个人笔记

Language:HTMLStargazers:31112Issues:0Issues:0

machine-learning-toy-code

《机器学习》(西瓜书)代码实战

Language:Jupyter NotebookLicense:MITStargazers:592Issues:0Issues:0

tutorials

This repository contains tutorials and examples for Triton Inference Server

Language:PythonLicense:BSD-3-ClauseStargazers:484Issues:0Issues:0

Zero-DCE

Zero-DCE code and model

Language:HTMLStargazers:766Issues:0Issues:0

Awesome-Low-Light-Enhancement

Awesome Low-light Enhancement

Stargazers:116Issues:0Issues:0

stylegan2

StyleGAN2 - Official TensorFlow Implementation

Language:PythonLicense:NOASSERTIONStargazers:10897Issues:0Issues:0

KinD

Kindling the Darkness: a Practical Low-light Image Enhancer

Language:PythonStargazers:280Issues:0Issues:0

cmake-examples-Chinese

快速入门CMake,通过例程学习语法。在线阅读地址:https://sfumecjf.github.io/cmake-examples-Chinese/

Stargazers:1Issues:0Issues:0

translation-Introduction-to-HPC

为 Eijhout 教授的Introduction to HPC提供中文翻译、 PPT和Lab。

Language:CStargazers:255Issues:0Issues:0

parsl

Parsl - a Python parallel scripting library

Language:PythonLicense:Apache-2.0Stargazers:470Issues:0Issues:0
Language:CudaStargazers:2024Issues:0Issues:0

Cplusplus-Concurrency-In-Practice

A Detailed Cplusplus Concurrency Tutorial 《C++ 并发编程指南》

Language:C++License:MITStargazers:5252Issues:0Issues:0

tensorflow

图解tensorflow 源码

Stargazers:2182Issues:0Issues:0

awesome-database-learning

A list of learning materials to understand databases internals

Stargazers:9047Issues:0Issues:0

learning-growth

「一叶知秋」集散地,主要是我的一些阅读、学习、社交、研究、思考、放松娱乐记录整理。

License:MITStargazers:99Issues:0Issues:0

mpi-boruvka-mst

Get Minimum Spanning Tree of a connected Graph using Boruvka's Algorithm and MPI.

Language:CLicense:MITStargazers:1Issues:0Issues:0