GPUs

GPUs

Geek Repo

Github PK Tool:Github PK Tool

GPUs's repositories

caffe-opencl

Deep learning with Caffe on phones, with OpenCL support for CPU and GPU devices.

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

darknet

Convolutional Neural Networks

Language:CLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:C++Stargazers:0Issues:0Issues:0

gemm_optimization

The repository targets the OpenCL gemm function performance optimization. It compares several libraries clBLAS, clBLAST, MIOpenGemm, Intel MKL(CPU) and cuBLAS(CUDA) on different matrix sizes/vendor's hardwares/OS. Out-of-the-box easy as MSVC, MinGW, Linux(CentOS) x86_64 binary provided. 在不同矩阵大小/硬件/操作系统下比较几个BLAS库的sgemm函数性能,提供binary,开盒即用。

Language:CLicense:MITStargazers:0Issues:0Issues:0

maxas

Assembler for NVIDIA Maxwell architecture

Language:CSSLicense:MITStargazers:0Issues:0Issues:0

Simd

C++ image processing and machine learning library with using of SIMD: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX-512, VMX(Altivec) and VSX(Power7), NEON for ARM.

Language:C++License:MITStargazers:0Issues:0Issues:0

SSE-convolution

A demonstration of speeding up a 1D convolution using SSE

Language:CStargazers:0Issues:0Issues:0

tensor

A Modern C++ Heterogeneous Computing Library

Language:C++License:MITStargazers:0Issues:0Issues:0
Language:C++License:BSD-2-ClauseStargazers:0Issues:0Issues:0

ucc162.3

A lightweight open-source C compiler for research and education.

Language:CStargazers:0Issues:0Issues:0

VKL

An abstraction layer on-top of Vulkan to help reduce boiler-plate code.

Language:CLicense:MITStargazers:0Issues:0Issues:0

vulkan_minimal_compute

Minimal Example of Using Vulkan for Compute Operations. Only ~400LOC.

Language:C++License:MITStargazers:0Issues:0Issues:0
Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

VulkanSubgroups

vulkan subgroups example for reduce and scan

Language:C++License:MITStargazers:0Issues:0Issues:0

Winograd-OpenCL

Winograd-based convolution implementation in OpenCL

Language:CStargazers:0Issues:0Issues:0

XNet

Simple CuDNN wrapper

Language:C++Stargazers:0Issues:0Issues:0

6.824-2017

:zap: 6.824: Distributed Systems (Spring 2017). A course which present abstractions and implementation techniques for engineering distributed systems.

Language:GoStargazers:0Issues:0Issues:0

Binary-Convolutional-Neural-Network-Inference-on-GPU

GPU implementation of Xnor network on inference level.

Stargazers:0Issues:0Issues:0

build-scripts-of-ffmpeg-x264-for-android-ndk

ffmpeg build scripts for android ndk usage (including x264)

Stargazers:0Issues:0Issues:0
Language:C++Stargazers:0Issues:0Issues:0

caffe-int8-convert-tools

Generate a quantization parameter file for ncnn framework int8 inference

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0
Language:C++Stargazers:0Issues:0Issues:0

Depth_conv-for-mobileNet

Depth_conv for MobileNet

Language:CudaStargazers:0Issues:0Issues:0

Distributed-Systems

MIT课程《Distributed Systems 》学习和翻译

Language:GoStargazers:0Issues:0Issues:0

GLSL-Card

着色器语言 GLSL (opengl-shader-language)入门大全

Stargazers:0Issues:0Issues:0

goVideoCompressor

video distributed compressor with ffmpeg

Stargazers:0Issues:0Issues:0

Lee-SLAM-source

SLAM 开发学习资源与经验分享

Stargazers:0Issues:0Issues:0
Language:CLicense:Apache-2.0Stargazers:0Issues:0Issues:0

ROCm_Documentation

ROCm Software Platform Documentation

Language:C++Stargazers:0Issues:0Issues:0

slam-python

用python学习rgbd-slam系列

Language:PythonStargazers:0Issues:0Issues:0