Liang Geng's starred repositories
networking
Enhanced networking support for TensorFlow. Maintained by SIG-networking.
Ring-Buffer
simple C++11 ring buffer implementation, allocated and evaluated at compile time
Tensorflow-RDMA
Tensorflow is a computational library using data flow graphs for scalable machine learning, and Tensorflow-RDMA is the implementation over RDMA, which can get about 4.5x speedup on two nodes comparing with TCP/IP.
cuda-training-series
Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)
rdma-example
RDMA exmaple
RDMA-Tutorial
A tutorial on RDMA based programming using code examples
BullshitGenerator
Needs to generate some texts to test if my GUI rendering codes good or not. so I made this.
awesome-tensor-compilers
A list of awesome compiler projects and papers for tensor computation and deep learning.
variadic_table
Formatted Table For Printing To Console
hpctoolkit
HPCToolkit performance tools: measurement and analysis components
cuda_benchmark
A library to benchmark CUDA code, similar to google benchmark.
AdaptiveCpp
Implementation of SYCL and C++ standard parallelism for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications adapt themselves to all the hardware in the system - even at runtime!
aliyundrive
阿里云盘api
computer_expert_paper
1000+份计算机paper,卡耐基梅隆大学,哈佛,斯坦福,芝加哥大学,MIT,facebook,google,微软,Amazon,twitter等大牛一作,持续更新中