杨现's repositories
model_cryptor
深度学习模型加解密工具
ComputeLibrary
The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
ThreadPool
A simple C++11 Thread Pool implementation
ArchProbe
A profiler to disclose and quantify hardware features on GPUs.
bark
🔊 Text-Prompted Generative Audio Model
ffmpeg_beginner
食铁兽(feater.top)ffmpeg4入门系列教程代码
How_to_optimize_in_GPU
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.
neon
neon优化实例代码
opencv-mobile
The minimal opencv for Android, iOS, ARM Linux, Windows, Linux, MacOS, WebAssembly
tensorrtx
Implementation of popular deep learning networks with TensorRT network definition API
tvm_phone
tvm arm gpu opencl
yangxianpku
Config files for my GitHub profile.