GPU memory in searching

Question

GPU memory in searching

wangq95 opened this issue 4 years ago · comments

Hi, may I ask a question about the GPU memory occupation during training? Is that smaller than DARTS as only one block was actived for each layer? But the $mi$ in Eq.5 is relexed by Gumbel Softmax so that every $mi$ is greater than zero, so every block should be calculated, how it different from DARTS? Thank you.