[QUESTION] Quotesion related to HSA_OVERRIDE_GFX_VERSION and optimization?

Question

[QUESTION] Quotesion related to HSA_OVERRIDE_GFX_VERSION and optimization?

serhii-nakon opened this issue 4 months ago · comments

I not fully understand does this variable is solution or just trick/workaround. I mean if we can use gfx1100 code for gfx1103 or gfx1102 why just not link all this codes internally inside ROCM why it force users to use this variable? Does it produce some problem (like performance or etc) of using existing code but for another compatible GPU

Also I want to know what difference in case if I rebuild ROCM especially for gfx1102 or gfx1103, does it will produce exactly the same files like for gfx1100 but with another name or it will produce optimized code/files for specific architecture?

If it produce optimized code, what parts of already prebuilt ROCM need to rebuild - if I correctly understand it should be enough to rebuild only this packages: rocBLAS, rocFFT, rocSPARSE, MIOpen, rocRAND, rccl... is not it?

serhii-nakon · Answer 1 · Mon Mar 25 2024 21:06:38 GMT+0800 (China Standard Time)

I ask about it because this https://salsa.debian.org/rocm-team/community/team-project/-/wikis/Supported-GPU-list#architecture-notes show that ROCM can technically work with almost all GPUs. And I need to know what the best way to use ROCM for unofficially supported GPUs. Does better to rebuild or just use env variable and it will do exactly the same as for rebuild...

Gavin Zhao · Answer 2 · Tue Mar 26 2024 04:07:11 GMT+0800 (China Standard Time)

Perhaps this thread on Reddit may answer some of your questions.

tl;dr: HSA_OVERRIDE_GFX_VERSION shouldn't bring any noticeable performance regression and I don't recall ever seeing anyone reporting so. If you do suspect there are significant regressions, please benchmark and file a bug report :)

serhii-nakon · Answer 3 · Tue Mar 26 2024 05:59:12 GMT+0800 (China Standard Time)

@GZGavinZhao Hello
Hmm, I just now have laptop with gfx1012 with pre-compiled ROCm 5.4 for especially for it, if it true that 6.1 will released with gfx1010 I will benchmark it.

serhii-nakon · Answer 4 · Tue Mar 26 2024 06:51:25 GMT+0800 (China Standard Time)

@GZGavinZhao One more question where I can check all available gfx101* in new ROCm release? Also does it mean that Docker container will with already prebuild ML frameworks?
Because I don't see here gfx101* https://hub.docker.com/layers/rocm/pytorch-nightly/latest/images/sha256-9520c161d80bc72132b54ebcd32bd1ac842d18ecd73099dcab81c1d5f416c7ab?context=explore

ppanchad-amd · Answer 5 · Thu Jul 04 2024 23:56:33 GMT+0800 (China Standard Time)

@serhii-nakon gfx101* series are not supported in the latest ROCm 6.1.2 release. Please let me know if this ticket can be closed. Thanks!

serhii-nakon · Answer 6 · Wed Jul 17 2024 02:11:36 GMT+0800 (China Standard Time)

@ppanchad-amd Hello, yes you can