[runtime][hal][hip] Implement collectives HAL via RCCL
sogartar opened this issue · comments
Similar to how NCCL is used in the CUDA driver we can implement collectives for the HIP driver.
It is going to be mostly copy-paste driven development since it seems RCCL's API is a copy of NCCL's API. Down to the exact same symbol names.