There are 2 repositories under the bitonic-merge-sort topic.
This is a very simple tool for in-place sorting of ComputeBuffers of ints on the GPU, meant for use in Unity.
Bitonic sort using SIMD (AVX/NEON) instructions.
An example implementation of a parallel bitonic sort algorithm using an Open MPI CPU cluster.
Network packet classification on an FPGA.
Parallel bitonic sort for CUDA that works with arbitrary inputs.
A demonstration of bitonic sort implemented in both plain C and OpenCL.
Sorting arbitrary sequences using a bitonic sort algorithm distributed with MPI.
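
For readers unfamiliar with the algorithm these repositories share, the following is a minimal sequential sketch of a bitonic sorting network in Python. It is not taken from any of the listed projects. The function name `bitonic_sort` is hypothetical, and padding the input to the next power of two with `math.inf` sentinels is one assumed way to handle arbitrary input lengths; the repositories above may use a different approach. The sketch assumes numeric inputs.

```python
import math


def bitonic_sort(values):
    """Return a sorted copy of `values` using an iterative bitonic network.

    Assumption: inputs are numbers, so math.inf can serve as a padding
    sentinel that always sorts to the end.
    """
    n = len(values)
    if n <= 1:
        return list(values)

    # Bitonic networks need a power-of-two length, so pad with sentinels.
    size = 1 << (n - 1).bit_length()
    data = list(values) + [math.inf] * (size - n)

    k = 2  # size of the bitonic sequences being merged in this stage
    while k <= size:
        j = k // 2  # distance between compared elements in this pass
        while j >= 1:
            # Every iteration of this loop is independent of the others,
            # which is why it maps well onto GPU threads or SIMD lanes.
            for i in range(size):
                partner = i ^ j
                if partner > i:
                    ascending = (i & k) == 0
                    a, b = data[i], data[partner]
                    if (ascending and a > b) or (not ascending and a < b):
                        data[i], data[partner] = b, a
            j //= 2
        k *= 2

    # Sentinels have sorted to the end; drop them.
    return data[:n]
```

The sequence of compare-and-swap pairs depends only on the input length, not on the values, so the inner `for` loop can be run in parallel without synchronization inside a single pass.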