Since kernel 5.1, and still on kernel 5.2, my Vega 56 card's memory clock is stuck at 167MHz and no longer boosts.
The exact same setup boosts fine to 1000MHz memclk when running Kernel 5.0.13.
Is there any info I can provide to get this fixed?
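For anyone who wants to attach the current clock state to a report, here is a rough Python sketch that reads a DPM level table and picks out the active level. It assumes the usual amdgpu layout of one `index: NNNMhz` line per level with a `*` marking the active one. The `pp_dpm_mclk` path and the function names are assumptions for illustration, not something taken from the reports in this thread.

```python
import re
from pathlib import Path

# Assumed line format: "0: 167Mhz *" (the trailing "*" marks the active level).
LEVEL_RE = re.compile(r"^\s*(\d+):\s*(\d+)\s*mhz\s*(\*)?\s*$", re.IGNORECASE)


def parse_dpm_levels(text):
    """Return (levels, active): levels maps level index -> MHz,
    active is the index marked with '*' or None if no line is marked."""
    levels = {}
    active = None
    for line in text.splitlines():
        match = LEVEL_RE.match(line)
        if not match:
            continue  # skip anything that does not look like a level line
        index, mhz = int(match.group(1)), int(match.group(2))
        levels[index] = mhz
        if match.group(3):
            active = index
    return levels, active


def read_dpm_levels(path="/sys/class/drm/card0/device/pp_dpm_mclk"):
    """Read and parse a DPM table; the default path is an assumption."""
    return parse_dpm_levels(Path(path).read_text())
```

Running `read_dpm_levels()` while idle and again under load should show whether the active memory level ever leaves level 0.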
The issue is fully fixed on kernel master (I am currently using commit 460b48a0fefce25beb0fc0139e721c5691d65d7f) when drivers/gpu/drm/amd/powerplay/hwmgr/vega10_hwmgr.c is reverted to the state it was in around kernel 5.0.13.
I can confirm this issue with kernel 5.2.
HBM2 clocks stay at 167MHz if I try to overclock the memory by writing to:
/sys/class/drm/card0/device/pp_od_clk_voltage or
/sys/class/drm/card0/device/pp_table
Sometimes MCLK gets stuck at 500MHz and SCLK at 879MHz. With these clocks, after some time under load, my laptop plays the sound notification it makes when the power cord is disconnected and the power LED switches off. Then the screen looks like a blank TV screen showing white noise, and I have to reboot.
Same behaviour using pptables: memory gets stuck at either 167MHz (level 0) or 800MHz (level 2). This is on 5.2 with https://aur.archlinux.org/packages/amdgpu-dkms/ which, from what I understand, should pull the latest changes to amdgpu.
If I use the classic echo "m 3 200 1050" | sudo tee /sys/class/drm/card0/device/pp_od_clk_voltage I am able to set an absurd memory frequency like 1400MHz, which my various CLI tools report as being in use, but it doesn't look like it does anything.
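For reference, the string written to pp_od_clk_voltage on Vega10 follows the pattern `<s|m> <level> <clock MHz> <voltage mV>`, and as far as I understand the kernel documentation, edits only take effect after a separate "c" (commit) is written. Below is a minimal Python sketch of that sequence. The helper names are made up for illustration, and whether a missing commit has anything to do with the behaviour described above is only a guess.

```python
from pathlib import Path


def od_command(table, level, clock_mhz, voltage_mv):
    """Build one overdrive edit string, e.g. "m 3 200 1050".
    table is "s" (SCLK table) or "m" (MCLK table)."""
    if table not in ("s", "m"):
        raise ValueError("table must be 's' or 'm'")
    return f"{table} {int(level)} {int(clock_mhz)} {int(voltage_mv)}"


def apply_od(commands, path="/sys/class/drm/card0/device/pp_od_clk_voltage"):
    """Write each edit as its own write, then "c" to commit.
    Needs root on a real system; the default path is the one quoted above."""
    node = Path(path)
    for command in list(commands) + ["c"]:
        node.write_text(command + "\n")


# The edit from the post above, followed by a commit (run as root):
# apply_od([od_command("m", 3, 200, 1050)])
```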
Yes, I also sometimes observe unrealistic frequencies like 2131MHz or so. But I've noticed that when I plug in my laptop's power cord after the kernel has been booted from the bootloader, MCLK is set to the 500MHz level and SCLK to 879MHz, which is enough for all my games. If a game is more demanding, the whole system may fail with the TV static effect. This can be worked around by writing "manual" to /sys/class/drm/card0/device/power_dpm_force_performance_level (or using corectrl https://aur.archlinux.org/packages/corectrl/) and setting level 5 in /sys/class/drm/card0/device/pp_dpm_socclk, which is 847MHz. With that setting my system is stable.
The TV static can also be observed on resume after suspend2ram.
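The workaround described above comes down to two sysfs writes, sketched here in Python. The function name and the `device_dir` parameter are made up for illustration. It has to run as root, and level 5 is simply the value that worked on the laptop above, so other cards may need a different one.

```python
from pathlib import Path


def force_socclk_level(level, device_dir="/sys/class/drm/card0/device"):
    """Switch DPM to manual control, then pin SOCCLK to one level.
    device_dir can be overridden if the GPU is not card0."""
    device = Path(device_dir)
    # Step 1: take the performance level out of automatic control.
    (device / "power_dpm_force_performance_level").write_text("manual\n")
    # Step 2: select the wanted SOCCLK level by its index.
    (device / "pp_dpm_socclk").write_text(f"{int(level)}\n")


# On the laptop described above (run as root):
# force_socclk_level(5)
```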
I don't know if it changes anything but I'm on a desktop system.
Thanks, I see very similar values. I thought adjusting the OD_MCLK voltages would only set the core voltage floor, but I'm not sure what this means in practice.
Yes, the OD_MCLK voltage values are (somewhat misleadingly) actually core voltages, linked by index to the MCLK table.