yawen_Li's starred repositories
YHs_Sample
Yinghan's Code Sample
NVIDIA-OpenCL-Samples
Compilable official NVIDIA OpenCL sample code, https://developer.nvidia.com/opencl
iphone_dcim_backup
Back up iPhone photos
How_to_optimize_in_GPU
A series of GPU optimization topics explaining in detail how to optimize CUDA kernels. It covers several basic kernel optimizations, including elementwise, reduce, sgemv, and sgemm. The performance of these kernels is at or near the theoretical limit.
QualcommOpenCLSDKNote
Notes on the Qualcomm OpenCL SDK
cpu-cache-test
CPU cache latency experiments
OpenCL-correlation-using-local-memory
Correlation demo in OpenCL that uses local memory.
ArmNeonOptimization
arm-neon
Cplusplus-Concurrency-In-Practice
A Detailed Cplusplus Concurrency Tutorial (C++ Concurrency Programming Guide)
mmpretrain
OpenMMLab Pre-training Toolbox and Benchmark
mobilenet-ssd-snpe
MobileNet-SSD SNPE demo
TNN
TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile, desktop, and server. TNN is distinguished by several outstanding features, including its cross-platform capability, high performance, model compression, and code pruning. Based on ncnn and Rapidnet, TNN further strengthens the support and performance optimization for mobile devices, and also draws on the good extensibility and high performance of existing open source efforts. TNN has been deployed in multiple apps from Tencent, such as Mobile QQ, Weishi, and Pitu. Contributions are welcome; collaborate with us to make TNN a better framework.
CPlusPlusThings
Things about C++