GPUs's repositories
caffe-opencl
Deep learning with Caffe on phones, with OpenCL support for CPU and GPU devices.
darknet
Convolutional Neural Networks
gemm_optimization
The repository targets the OpenCL gemm function performance optimization. It compares several libraries clBLAS, clBLAST, MIOpenGemm, Intel MKL(CPU) and cuBLAS(CUDA) on different matrix sizes/vendor's hardwares/OS. Out-of-the-box easy as MSVC, MinGW, Linux(CentOS) x86_64 binary provided. 在不同矩阵大小/硬件/操作系统下比较几个BLAS库的sgemm函数性能,提供binary,开盒即用。
maxas
Assembler for NVIDIA Maxwell architecture
Simd
C++ image processing and machine learning library with using of SIMD: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX-512, VMX(Altivec) and VSX(Power7), NEON for ARM.
SSE-convolution
A demonstration of speeding up a 1D convolution using SSE
tensor
A Modern C++ Heterogeneous Computing Library
ucc162.3
A lightweight open-source C compiler for research and education.
VKL
An abstraction layer on-top of Vulkan to help reduce boiler-plate code.
vulkan_minimal_compute
Minimal Example of Using Vulkan for Compute Operations. Only ~400LOC.
VulkanSubgroups
vulkan subgroups example for reduce and scan
Winograd-OpenCL
Winograd-based convolution implementation in OpenCL
XNet
Simple CuDNN wrapper
6.824-2017
:zap: 6.824: Distributed Systems (Spring 2017). A course which present abstractions and implementation techniques for engineering distributed systems.
Binary-Convolutional-Neural-Network-Inference-on-GPU
GPU implementation of Xnor network on inference level.
build-scripts-of-ffmpeg-x264-for-android-ndk
ffmpeg build scripts for android ndk usage (including x264)
caffe-int8-convert-tools
Generate a quantization parameter file for ncnn framework int8 inference
Depth_conv-for-mobileNet
Depth_conv for MobileNet
Distributed-Systems
MIT课程《Distributed Systems 》学习和翻译
GLSL-Card
着色器语言 GLSL (opengl-shader-language)入门大全
goVideoCompressor
video distributed compressor with ffmpeg
Lee-SLAM-source
SLAM 开发学习资源与经验分享
ROCm_Documentation
ROCm Software Platform Documentation
slam-python
用python学习rgbd-slam系列