Thank you so much for doing this, Bruce!
This also happens on my Vega 56, for what it's worth, but seemingly at random. This time it took over 12 hours to occur, and I had been gaming for several hours within that period (though not at the time of the crash).
Aug 13 21:24:13 main kernel: amdgpu 0000:09:00.0: amdgpu: [gfxhub0] no-retry page fault (src_id:0 ring:40 vmid:4 pasid:32783, for process firefox pid 62950 thread firefox:cs0 pid 63047)
Aug 13 21:24:13 main kernel: amdgpu 0000:09:00.0: amdgpu: in page starting at address 0x000080010a0cb000 from IH client 0x1b (UTCL2)
Aug 13 21:24:13 main kernel: amdgpu 0000:09:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00441051
Aug 13 21:24:13 main kernel: amdgpu 0000:09:00.0: amdgpu: Faulty UTCL2 client ID: TCP (0x8)
Aug 13 21:24:13 main kernel: amdgpu 0000:09:00.0: amdgpu: MORE_FAULTS: 0x1
Aug 13 21:24:13 main kernel: amdgpu 0000:09:00.0: amdgpu: WALKER_ERROR: 0x0
Aug 13 21:24:13 main kernel: amdgpu 0000:09:00.0: amdgpu: PERMISSION_FAULTS: 0x5
Aug 13 21:24:13 main kernel: amdgpu 0000:09:00.0: amdgpu: MAPPING_ERROR: 0x0
Aug 13 21:24:13 main kernel: amdgpu 0000:09:00.0: amdgpu: RW: 0x1
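For anyone comparing their own logs: the individual fields printed above can be recovered from the raw VM_L2_PROTECTION_FAULT_STATUS value. This is a rough sketch only. The bit positions are my assumption based on the GFX9 register layout, not something stated in the log, although they do reproduce the values the kernel printed here (client ID 0x8, PERMISSION_FAULTS 0x5, RW 0x1, vmid 4).

```python
# Bit layout is an ASSUMPTION (GFX9-style VM_L2_PROTECTION_FAULT_STATUS),
# given as name -> (shift, width). Check it against your kernel's headers.
FIELDS = {
    "MORE_FAULTS": (0, 1),
    "WALKER_ERROR": (1, 3),
    "PERMISSION_FAULTS": (4, 4),
    "MAPPING_ERROR": (8, 1),
    "CID": (9, 9),      # the "Faulty UTCL2 client ID"
    "RW": (18, 1),
    "VMID": (20, 4),
}


def decode_fault_status(status):
    """Split a raw status word into the named bitfields above."""
    return {
        name: (status >> shift) & ((1 << width) - 1)
        for name, (shift, width) in FIELDS.items()
    }


decoded = decode_fault_status(0x00441051)
for name, value in decoded.items():
    print(f"{name}: {value:#x}")
```

If the decoded values ever disagree with what the kernel prints next to the raw status, trust the kernel output and treat the table above as wrong for that hardware generation.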
This also happens on my Vega 56. I've tried the patch linked above, but I'm getting the same results as Kassin.
Attaching my dmesg (with patch applied) for good measure: dmesg.log
EDIT: It seems that controlling it with commands works, e.g. echo "s 7 1600 1000" > /sys/class/drm/card0/device/pp_od_clk_voltage
etc., so now at least I can avoid grilling my GPU (it was sitting at 100 °C+ under load at stock voltage).
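In case it helps anyone script this, here is a minimal sketch of building those entries instead of typing each echo by hand. The helper names od_entry and apply_entries are made up for illustration. The final "c" (commit) write and the "m" (memory clock) entry type are assumptions based on my reading of the amdgpu overdrive documentation; neither appears in the command above. Writing to the file needs root, so the sketch only prints the entry.

```python
from pathlib import Path

# Path taken from the command above; card0 may differ on other systems.
OD_PATH = Path("/sys/class/drm/card0/device/pp_od_clk_voltage")


def od_entry(kind, level, clock_mhz, millivolts):
    """Format one table entry, e.g. 's 7 1600 1000' (hypothetical helper)."""
    if kind not in ("s", "m"):  # ASSUMPTION: s = core clock, m = memory clock
        raise ValueError(f"unknown entry type: {kind!r}")
    return f"{kind} {level} {clock_mhz} {millivolts}"


def apply_entries(entries, path=OD_PATH):
    """Write each entry, then 'c' to commit (ASSUMPTION). Needs root."""
    for line in [*entries, "c"]:
        path.write_text(line + "\n")


entry = od_entry("s", 7, 1600, 1000)
print(entry)
# apply_entries([entry])  # uncomment only if you know the values are safe
```

The values 7, 1600 and 1000 are just the ones from my command; they are not a recommendation for any other card.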
Done, I just assumed it'd be for all AMD GPUs, but I couldn't test any other.
Version, Distribution, Desktop Environment:
Pipewire 99e6c775, Arch Linux, KDE Plasma 5.22.5
Description of Problem:
Audio through a monitor or TV with speakers connected to an HDMI port on my AMD Vega 56 suddenly stopped working; I bisected it to 99e6c775. Software still seems to act like everything is fine, but no sound is coming out and I haven't gone deaf just yet. Other devices (such as my USB DAC) still work fine.
How Reproducible:
Every time after 99e6c775
Steps to Reproduce:
Actual Results:
No audio is being played through the speakers
Expected Results:
Sound entering my ears
Version, Distribution, Desktop Environment:
9a65d90e and beyond, Arch Linux, chroot
Description of Problem:
As of 9a65d90e (bisected), the final test fails when running meson test -C build --print-errorlogs
and is reported as killed by signal 6 SIGABRT
(full output)
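For context on what that message means: on POSIX, a test runner sees a child killed by a signal as a negative exit status. The sketch below is not meson's implementation, just an illustration using Python's subprocess module; describe_exit is a made-up helper, and the child process is a stand-in for the failing test binary.

```python
import signal
import subprocess
import sys


def describe_exit(returncode):
    """Turn a subprocess return code into a human-readable summary."""
    if returncode < 0:
        # POSIX: a negative return code means "terminated by signal -N".
        sig = signal.Signals(-returncode)
        return f"killed by signal {sig.value} {sig.name}"
    return f"exited with status {returncode}"


# Stand-in for the crashing test: a child interpreter that calls abort().
child = subprocess.run(
    [sys.executable, "-c", "import os; os.abort()"],
    stderr=subprocess.DEVNULL,
)
print(describe_exit(child.returncode))
```

SIGABRT usually points at a failed assert() or an explicit abort() in the test binary rather than at the test harness itself.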
How Reproducible:
Every time
Steps to Reproduce:
Actual Results:
Checks fail, build fails, packaging stops.
Expected Results:
Checks complete, or are marked "Expected Fail", so that packaging can continue.
Additional Info Eg. pw-dump -N > file
(As Attachment Please):
testlog.txt: https://gist.github.com/kattjevfel/2b22c3666277e17e2318166867edf23b
9a65d90e88f4bc40cb1504901ad151a67b8a25f2 is the first bad commit
commit 9a65d90e88f4bc40cb1504901ad151a67b8a25f2
Author: Peter Hutterer <peter.hutterer@who-t.net>
Date: Tue Jun 8 20:00:36 2021 +1000
test: move the spa tests to pwtest
Move the spa tests to the pwtest framework. The pod tests have only been
wrapped in the function callers, they don't use the variuos pwtest helpers -
too much work for very little gain here. Can be done incrementally if needed.
Note that this removes the spa tests from the installed tests. Arguably,
installing those tests was unnecessary anyway since they are static binaries
and don't load anything. So having them installed runs the same tests as
having them run in the source tree.
Goal for the pwtest framework is to allow for installed tests, just not there
yet.
spa/tests/meson.build | 29 -
spa/tests/test-node.c | 236 -------
spa/tests/test-pod.c | 1607 -----------------------------------------------
spa/tests/test-utils.c | 842 -------------------------
test/meson.build | 3 +
test/test-spa-node.c | 251 ++++++++
test/test-spa-pod.c | 1626 ++++++++++++++++++++++++++++++++++++++++++++++++
test/test-spa-utils.c | 872 ++++++++++++++++++++++++++
8 files changed, 2752 insertions(+), 2714 deletions(-)
delete mode 100644 spa/tests/test-node.c
delete mode 100644 spa/tests/test-pod.c
delete mode 100644 spa/tests/test-utils.c
create mode 100644 test/test-spa-node.c
create mode 100644 test/test-spa-pod.c
create mode 100644 test/test-spa-utils.c