BeamNG Linux Chrash [gfxhub] page fault
While trying to play BeamNG in native linux version, it's not longer possible under
- mesa 23.3~git and
- mesa 23.2.0-rc3
It doesn't matter, if i use linux 6.5.4 or 6.6-rc2.
Game is loading the menu and then, after i try to start the racing itself, after a while of loading, screen becomes black, then it comes back freezed. I can get the system back with sysrq r, e, i.
If i use amdgpu.mcbp=0 the freeze does not occur, but the log is showing the same messages
[Fr Sep 22 11:57:45 2023] amdgpu 0000:0b:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:6 pasid:32794, for process BeamNG.drive.x6 pid 2828 thread BeamNG.dri:cs0 pid 2829)
[Fr Sep 22 11:57:45 2023] amdgpu 0000:0b:00.0: amdgpu: in page starting at address 0x0000760000000000 from client 0x1b (UTCL2)
[Fr Sep 22 11:57:45 2023] amdgpu 0000:0b:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00601431
[Fr Sep 22 11:57:45 2023] amdgpu 0000:0b:00.0: amdgpu: Faulty UTCL2 client ID: SQC (data) (0xa)
[Fr Sep 22 11:57:45 2023] amdgpu 0000:0b:00.0: amdgpu: MORE_FAULTS: 0x1
[Fr Sep 22 11:57:45 2023] amdgpu 0000:0b:00.0: amdgpu: WALKER_ERROR: 0x0
[Fr Sep 22 11:57:45 2023] amdgpu 0000:0b:00.0: amdgpu: PERMISSION_FAULTS: 0x3
[Fr Sep 22 11:57:45 2023] amdgpu 0000:0b:00.0: amdgpu: MAPPING_ERROR: 0x0
[Fr Sep 22 11:57:45 2023] amdgpu 0000:0b:00.0: amdgpu: RW: 0x0
[Fr Sep 22 11:57:45 2023] amdgpu 0000:0b:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:6 pasid:32794, for process BeamNG.drive.x6 pid 2828 thread BeamNG.dri:cs0 pid 2829)
[Fr Sep 22 11:57:45 2023] amdgpu 0000:0b:00.0: amdgpu: in page starting at address 0x0000760000000000 from client 0x1b (UTCL2)
[Fr Sep 22 11:57:45 2023] amdgpu 0000:0b:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00000000
[Fr Sep 22 11:57:45 2023] amdgpu 0000:0b:00.0: amdgpu: Faulty UTCL2 client ID: CB/DB (0x0)
[Fr Sep 22 11:57:45 2023] amdgpu 0000:0b:00.0: amdgpu: MORE_FAULTS: 0x0
[Fr Sep 22 11:57:45 2023] amdgpu 0000:0b:00.0: amdgpu: WALKER_ERROR: 0x0
[Fr Sep 22 11:57:45 2023] amdgpu 0000:0b:00.0: amdgpu: PERMISSION_FAULTS: 0x0
[Fr Sep 22 11:57:45 2023] amdgpu 0000:0b:00.0: amdgpu: MAPPING_ERROR: 0x0
[Fr Sep 22 11:57:45 2023] amdgpu 0000:0b:00.0: amdgpu: RW: 0x0
[Fr Sep 22 11:57:45 2023] amdgpu 0000:0b:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:6 pasid:32794, for process BeamNG.drive.x6 pid 2828 thread BeamNG.dri:cs0 pid 2829)
[Fr Sep 22 11:57:45 2023] amdgpu 0000:0b:00.0: amdgpu: in page starting at address 0x0000760000000000 from client 0x1b (UTCL2
and so on until i kill the game. But no blackscreen or freeze.
Log without amdgpu.mcbp=0
Sep 21 19:18:16 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:2 pasid:32783, for process BeamNG.drive.x6 pid 4987 thread BeamNG.dri:cs0 pid 4988)
Sep 21 19:18:16 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: in page starting at address 0x00001101fa000000 from client 0x1b (UTCL2)
Sep 21 19:18:16 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00201431
Sep 21 19:18:16 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: Faulty UTCL2 client ID: SQC (data) (0xa)
Sep 21 19:18:16 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: MORE_FAULTS: 0x1
Sep 21 19:18:16 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: WALKER_ERROR: 0x0
Sep 21 19:18:16 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: PERMISSION_FAULTS: 0x3
Sep 21 19:18:16 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: MAPPING_ERROR: 0x0
Sep 21 19:18:16 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: RW: 0x0
Sep 21 19:18:16 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:2 pasid:32783, for process BeamNG.drive.x6 pid 4987 thread BeamNG.dri:cs0 pid 4988)
Sep 21 19:18:16 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: in page starting at address 0x00001101fa000000 from client 0x1b (UTCL2)
Sep 21 19:18:16 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00000000
Sep 21 19:18:16 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: Faulty UTCL2 client ID: CB/DB (0x0)
Sep 21 19:18:16 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: MORE_FAULTS: 0x0
Sep 21 19:18:16 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: WALKER_ERROR: 0x0
Sep 21 19:18:16 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: PERMISSION_FAULTS: 0x0
Sep 21 19:18:16 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: MAPPING_ERROR: 0x0
Sep 21 19:18:16 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: RW: 0x0
Sep 21 19:18:26 Defiant kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, signaled seq=278240, emitted seq=278242
Sep 21 19:18:26 Defiant kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process BeamNG.drive.x6 pid 4987 thread BeamNG.dri:cs0 pid 4988
Sep 21 19:18:26 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: GPU reset begin!
Sep 21 19:18:26 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: MODE1 reset
Sep 21 19:18:26 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: GPU mode1 reset
Sep 21 19:18:26 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: GPU smu mode1 reset
Sep 21 19:18:26 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: GPU reset succeeded, trying to resume
Sep 21 19:18:27 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: RAS: optional ras ta ucode is not available
Sep 21 19:18:27 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
Sep 21 19:18:27 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: SMU is resuming...
Sep 21 19:18:27 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: smu driver if version = 0x0000000f, smu fw if version = 0x00000013, smu fw program = 0, version = 0x003b2b00 (59.43.0)
Sep 21 19:18:27 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: SMU driver if version not matched
Sep 21 19:18:27 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: use vbios provided pptable
Sep 21 19:18:27 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: SMU is resumed successfully!
Sep 21 19:18:27 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
Sep 21 19:18:27 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
Sep 21 19:18:27 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
Sep 21 19:18:27 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 5 on hub 0
Sep 21 19:18:27 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 6 on hub 0
Sep 21 19:18:27 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 7 on hub 0
Sep 21 19:18:27 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 8 on hub 0
Sep 21 19:18:27 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 9 on hub 0
Sep 21 19:18:27 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 10 on hub 0
Sep 21 19:18:27 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: ring kiq_0.2.1.0 uses VM inv eng 11 on hub 0
Sep 21 19:18:27 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: ring sdma0 uses VM inv eng 12 on hub 0
Sep 21 19:18:27 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: ring sdma1 uses VM inv eng 13 on hub 0
Sep 21 19:18:27 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: ring vcn_dec_0 uses VM inv eng 0 on hub 8
Sep 21 19:18:27 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: ring vcn_enc_0.0 uses VM inv eng 1 on hub 8
Sep 21 19:18:27 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: ring vcn_enc_0.1 uses VM inv eng 4 on hub 8
Sep 21 19:18:27 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: ring jpeg_dec uses VM inv eng 5 on hub 8
Sep 21 19:18:27 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: recover vram bo from shadow start
Sep 21 19:18:27 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: recover vram bo from shadow done
Sep 21 19:18:27 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: GPU reset(2) succeeded!
Sep 21 19:24:37 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: failed to get a new IB (-512)
Sep 21 19:24:37 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: failed to clear page tables on GEM object close (-512)
Sep 21 19:24:37 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: failed to get a new IB (-512)
Sep 21 19:24:37 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: failed to clear page tables on GEM object close (-512)
Sep 21 19:24:37 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: failed to get a new IB (-512)
Sep 21 19:24:37 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: failed to clear page tables on GEM object close (-512)
Sep 21 19:24:37 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: failed to get a new IB (-512)
Sep 21 19:24:37 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: failed to clear page tables on GEM object close (-512)
Sep 21 19:24:37 Defiant kernel: amdgpu 0000:0b:00.0: amdgpu: failed to get a new IB (-512)
Edited by Torsten Wohlfarth