pgplus1628 / argmax-perf

Performance of argmax in different systems/libraries

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Compare argmax op in different system/libries

  • Eigen : tensorflow's choice, with tensor support
  • mxnet : handcrafted argmax, with tensor support
  • cublas : amax, absolute max, not argmax, only support vector
  • cub : nvidia-cub, support vector and batch-vector

cub has the best performance than others

About

Performance of argmax in different systems/libraries


Languages

Language:Cuda 56.3%Language:C++ 27.4%Language:Python 11.2%Language:Makefile 4.7%Language:Shell 0.3%