pcie_throughput.py: measure the total PCIe throughput when copying data to multiple GPUs simultaneously.
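
The measurement idea can be sketched as follows. This is not the contents of pcie_throughput.py, only a minimal illustration under stated assumptions: the names `host_copy` and `measure_total_throughput` are hypothetical, and the copy step is a host-memory stand-in so that the sketch runs without a GPU. A real measurement would replace `copy_fn` with a call that issues a copy to the given GPU and blocks until it completes. The key point is that all copies are started together and the total bytes moved are divided by the shared wall-clock time, rather than summing per-device rates measured separately.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def host_copy(device_id, payload):
    """Hypothetical stand-in for a copy to GPU `device_id`.

    It only duplicates the buffer in host memory and returns the number of
    bytes copied. A real version would perform the GPU copy and wait for it.
    """
    return len(bytes(payload))


def measure_total_throughput(num_devices, nbytes, copy_fn=host_copy, repeats=3):
    """Return the best aggregate throughput over `repeats` runs, in GB/s.

    GB here means 1e9 bytes. One buffer of `nbytes` is copied per device,
    and all copies in a run are launched concurrently.
    """
    payloads = [bytearray(nbytes) for _ in range(num_devices)]
    best = 0.0
    with ThreadPoolExecutor(max_workers=num_devices) as pool:
        for _ in range(repeats):
            start = time.perf_counter()
            # Start one copy per device, then wait for every copy to finish.
            copied = list(pool.map(copy_fn, range(num_devices), payloads))
            # Guard against a zero reading on very coarse timers.
            elapsed = max(time.perf_counter() - start, 1e-9)
            best = max(best, sum(copied) / elapsed / 1e9)
    return best


if __name__ == "__main__":
    gbps = measure_total_throughput(num_devices=2, nbytes=16 * 1024 * 1024)
    print(f"total throughput: {gbps:.2f} GB/s")
```

With the stand-in copy, the printed figure reflects host memory bandwidth, not PCIe bandwidth. It becomes a PCIe measurement only once `copy_fn` performs actual transfers to the GPUs.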