pytorch / FBGEMM

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/


uvm cache

MichoChan opened this issue

Why does the UVM cache in FBGEMM use its own LRU/LFU?
As far as I know, UVM relies on page faults, so doesn't the OS already provide a page replacement algorithm like LRU/LFU?

commented

Hi @MichoChan, FBGEMM-GPU's table batched embedding (TBE) supports four table placements (see the sketch after the list):

  1. GPU's HBM memory (DEVICE)
  2. GPU's UVM memory (MANAGED)
  3. GPU's UVM memory with software managed cache (MANAGED_CACHING)
  4. Host's memory (HOST)
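As a minimal sketch, the placement is chosen per table via the `EmbeddingLocation` enum when constructing the TBE module. The module path below is from the `fbgemm_gpu` Python package; it may differ across releases, so treat this as illustrative rather than authoritative:

```python
from fbgemm_gpu.split_table_batched_embeddings_ops import (
    ComputeDevice,
    EmbeddingLocation,
    SplitTableBatchedEmbeddingBagsCodegen,
)

# EmbeddingLocation.DEVICE          -> 1. GPU's HBM memory
# EmbeddingLocation.MANAGED         -> 2. GPU's UVM memory
# EmbeddingLocation.MANAGED_CACHING -> 3. UVM + software-managed HBM cache
# EmbeddingLocation.HOST            -> 4. Host's memory

emb = SplitTableBatchedEmbeddingBagsCodegen(
    embedding_specs=[
        # (num_embeddings, embedding_dim, placement, compute device), one per table
        (1_000_000, 128, EmbeddingLocation.DEVICE, ComputeDevice.CUDA),
        (50_000_000, 128, EmbeddingLocation.MANAGED_CACHING, ComputeDevice.CUDA),
    ],
)
```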

What you are referring to is Option 3, MANAGED_CACHING. The full embedding table is stored in UVM, while the cache holds only a subset of the table's rows and resides in HBM. Rows are staged from UVM into the cache (HBM) in every iteration via the prefetch function, and LRU/LFU is the policy used to pick which cached rows to evict. The reason for not relying on the OS is that UVM demand paging would stall the lookup kernels on page faults, whereas TBE knows exactly which rows the next batch needs and can stage them into HBM ahead of time, so it manages replacement itself with LRU/LFU.
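Here is a rough sketch of that per-iteration flow. The `cache_algorithm` argument and `prefetch` method are from the `fbgemm_gpu` Python API, but the module path and defaults can vary between releases, and the tensor shapes here are only illustrative:

```python
import torch
from fbgemm_gpu.split_table_batched_embeddings_ops import (
    CacheAlgorithm,
    ComputeDevice,
    EmbeddingLocation,
    SplitTableBatchedEmbeddingBagsCodegen,
)

emb = SplitTableBatchedEmbeddingBagsCodegen(
    embedding_specs=[
        (50_000_000, 128, EmbeddingLocation.MANAGED_CACHING, ComputeDevice.CUDA),
    ],
    cache_algorithm=CacheAlgorithm.LRU,  # eviction policy: LRU or LFU
)

# Bag-format lookup: 4 indices grouped into 2 bags by offsets.
indices = torch.tensor([1, 42, 7, 42], dtype=torch.long, device="cuda")
offsets = torch.tensor([0, 2, 4], dtype=torch.long, device="cuda")

emb.prefetch(indices, offsets)  # stage needed rows UVM -> HBM cache, evicting per policy
out = emb(indices, offsets)     # lookups are now served from the HBM cache
```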

For more information, please refer to https://arxiv.org/pdf/2010.11305.pdf

Hope this helps.