Whole system freeze from isolate_migratepages_block / ttm_pool_alloc
Whole system freeze from isolate_migratepages_block / ttm_pool_alloc
When I played minecract 1.19 for nearly 6 hours, I was trying to open the journalmap map mod, the whole system was frozen, lost the response, and had to restart the computer forcibly.
I found some of the last logs, which show that it is related to amdgpu, so report here This is the first time I have encountered this situation
- r9-5950x
- rx7900xtx
- Fedora 37
- linux 6.1.14
- xorg-x11-server-Xorg-1.20.14-18.fc37.x86_64
- mesa 22.3.6
plasma x11 desktop environment
The related log is as follows
Hardware description:
- CPU: AMD Ryzen 9 5950X (32) @ 3.400GHz
- GPU: 2d:00.0 VGA compatible controller [0300]: Advanced Micro Devices, Inc. [AMD/ATI] Navi 31 [Radeon RX 7900 XT/7900 XTX] [1002:744c] (rev c8)
- System Memory: 32G
- Display(s): 4K 60hz
- Type of Display Connection: DP
System information:
- Distro name and Version: Fedora 37
- Kernel version: Linux nihui-pc 6.1.14-200.fc37.x86_64 #1 (closed) SMP PREEMPT_DYNAMIC Sun Feb 26 00:13:26 UTC 2023 x86_64 x86_64 x86_64 GNU/Linux
- AMD official driver version: N/A
Log files (for system lockups / game freezes / crashes)
3月 11 17:21:29 nihui-pc kernel: ------------[ cut here ]------------
3月 11 17:21:29 nihui-pc kernel: kernel BUG at mm/zsmalloc.c:1793!
3月 11 17:21:29 nihui-pc kernel: invalid opcode: 0000 [#1] PREEMPT SMP NOPTI
3月 11 17:21:29 nihui-pc kernel: CPU: 29 PID: 1645 Comm: Xorg Tainted: G OE 6.1.14-200.fc37.x86_64 #1
3月 11 17:21:29 nihui-pc kernel: Hardware name: Micro-Star International Co., Ltd. MS-7C94/MAG B550M MORTAR WIFI (MS-7C94), BIOS 1.80 07/01/2021
3月 11 17:21:29 nihui-pc kernel: RIP: 0010:zs_page_putback+0x87/0x90
3月 11 17:21:29 nihui-pc kernel: Code: 5d e9 7d 4d a4 00 48 c7 c6 a8 d5 70 9b 48 89 df e8 9e 2d f6 ff 0f 0b 48 c7 c6 80 47 71 9b 48 89 df e8 8d 2d f6 ff 0f 0b 0f 0b <0f> 0b 0f 1f 80 00 00 00 00 0f 1f 44 00 00 55 53 48 89 fb e8 e1 c3
3月 11 17:21:29 nihui-pc kernel: RSP: 0018:ffffb3edc7fab4b8 EFLAGS: 00010246
3月 11 17:21:29 nihui-pc kernel: RAX: 0000000000000000 RBX: ffff90966e88cc08 RCX: ffffb3edc7fab690
3月 11 17:21:29 nihui-pc kernel: RDX: 00000000000000ff RSI: fffff6908c7b8108 RDI: 0000000000000000
3月 11 17:21:29 nihui-pc kernel: RBP: ffff90966e88cc38 R08: 000000000000006c R09: ffffffffffffffff
3月 11 17:21:29 nihui-pc kernel: R10: 00000000000389c0 R11: ffff909b1f2d5000 R12: ffffb3edc7fab690
3月 11 17:21:29 nihui-pc kernel: R13: dead000000000122 R14: dead000000000100 R15: fffff6908c7b8108
3月 11 17:21:29 nihui-pc kernel: FS: 00007fcd991ffa80(0000) GS:ffff909aff140000(0000) knlGS:0000000000000000
3月 11 17:21:29 nihui-pc kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
3月 11 17:21:29 nihui-pc kernel: CR2: 00007fcd8dc2c0b2 CR3: 0000000110204000 CR4: 0000000000750ee0
3月 11 17:21:29 nihui-pc kernel: PKRU: 55555554
3月 11 17:21:29 nihui-pc kernel: Call Trace:
3月 11 17:21:29 nihui-pc kernel: <TASK>
3月 11 17:21:29 nihui-pc kernel: putback_movable_pages+0x2b1/0x310
3月 11 17:21:29 nihui-pc kernel: isolate_migratepages_block+0x32f/0x1840
3月 11 17:21:29 nihui-pc kernel: ? __compaction_suitable+0x74/0xb0
3月 11 17:21:29 nihui-pc kernel: compact_zone+0x378/0xdd0
3月 11 17:21:29 nihui-pc kernel: compact_zone_order+0xaa/0x100
3月 11 17:21:29 nihui-pc kernel: try_to_compact_pages+0xf0/0x2f0
3月 11 17:21:29 nihui-pc kernel: __alloc_pages_direct_compact+0x85/0x270
3月 11 17:21:29 nihui-pc kernel: __alloc_pages_slowpath.constprop.0+0x6c3/0xe20
3月 11 17:21:29 nihui-pc kernel: ? prepare_alloc_pages.constprop.0+0xf6/0x1a0
3月 11 17:21:29 nihui-pc kernel: __alloc_pages+0x209/0x230
3月 11 17:21:29 nihui-pc kernel: ttm_pool_alloc+0x2af/0x5a0 [ttm]
3月 11 17:21:29 nihui-pc kernel: amdgpu_ttm_tt_populate+0x35/0x90 [amdgpu]
3月 11 17:21:29 nihui-pc kernel: ttm_tt_populate+0x9d/0x140 [ttm]
3月 11 17:21:29 nihui-pc kernel: ttm_bo_handle_move_mem+0x15f/0x170 [ttm]
3月 11 17:21:29 nihui-pc kernel: ttm_mem_evict_first+0x204/0x490 [ttm]
3月 11 17:21:29 nihui-pc kernel: ttm_bo_mem_space+0x1c9/0x220 [ttm]
3月 11 17:21:29 nihui-pc kernel: ttm_bo_validate+0x97/0x120 [ttm]
3月 11 17:21:29 nihui-pc kernel: ? drm_vma_offset_add+0x59/0x60
3月 11 17:21:29 nihui-pc kernel: ttm_bo_init_reserved+0x15f/0x1d0 [ttm]
3月 11 17:21:29 nihui-pc kernel: amdgpu_bo_create+0x1c0/0x480 [amdgpu]
3月 11 17:21:29 nihui-pc kernel: ? amdgpu_bo_vm_destroy+0x80/0x80 [amdgpu]
3月 11 17:21:29 nihui-pc kernel: amdgpu_bo_create_user+0x2c/0x50 [amdgpu]
3月 11 17:21:29 nihui-pc kernel: amdgpu_gem_create_ioctl+0x138/0x370 [amdgpu]
3月 11 17:21:29 nihui-pc kernel: ? amdgpu_bo_vm_destroy+0x80/0x80 [amdgpu]
3月 11 17:21:29 nihui-pc kernel: ? amdgpu_gem_force_release+0x140/0x140 [amdgpu]
3月 11 17:21:29 nihui-pc kernel: drm_ioctl_kernel+0xa9/0x150
3月 11 17:21:29 nihui-pc kernel: drm_ioctl+0x22f/0x410
3月 11 17:21:29 nihui-pc kernel: ? amdgpu_gem_force_release+0x140/0x140 [amdgpu]
3月 11 17:21:29 nihui-pc kernel: amdgpu_drm_ioctl+0x4a/0x80 [amdgpu]
3月 11 17:21:29 nihui-pc kernel: __x64_sys_ioctl+0x90/0xd0
3月 11 17:21:29 nihui-pc kernel: do_syscall_64+0x5b/0x80
3月 11 17:21:29 nihui-pc kernel: ? __rseq_handle_notify_resume+0x96/0x460
3月 11 17:21:29 nihui-pc kernel: ? fpregs_restore_userregs+0x12/0xe0
3月 11 17:21:29 nihui-pc kernel: ? exit_to_user_mode_prepare+0x18f/0x1f0
3月 11 17:21:29 nihui-pc kernel: ? syscall_exit_to_user_mode+0x17/0x40
3月 11 17:21:29 nihui-pc kernel: ? do_syscall_64+0x67/0x80
3月 11 17:21:29 nihui-pc kernel: ? exit_to_user_mode_prepare+0x180/0x1f0
3月 11 17:21:29 nihui-pc kernel: entry_SYSCALL_64_after_hwframe+0x63/0xcd
3月 11 17:21:29 nihui-pc kernel: RIP: 0033:0x7fcd9988bd6f
3月 11 17:21:29 nihui-pc kernel: Code: 00 48 89 44 24 18 31 c0 48 8d 44 24 60 c7 04 24 10 00 00 00 48 89 44 24 08 48 8d 44 24 20 48 89 44 24 10 b8 10 00 00 00 0f 05 <89> c2 3d 00 f0 ff ff 77 18 48 8b 44 24 18 64 48 2b 04 25 28 00 00
3月 11 17:21:29 nihui-pc kernel: RSP: 002b:00007fffef91c6b0 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
3月 11 17:21:29 nihui-pc kernel: RAX: ffffffffffffffda RBX: 0000555c28e9e670 RCX: 00007fcd9988bd6f
3月 11 17:21:29 nihui-pc kernel: RDX: 00007fffef91c750 RSI: 00000000c0206440 RDI: 0000000000000012
3月 11 17:21:29 nihui-pc kernel: RBP: 00007fffef91c750 R08: 0000000000000007 R09: 0000000000000010
3月 11 17:21:29 nihui-pc kernel: R10: 0000555c276b0010 R11: 0000000000000246 R12: 00000000c0206440
3月 11 17:21:29 nihui-pc kernel: R13: 0000000000000012 R14: 0000555c2776e360 R15: 0000000000000211
3月 11 17:21:29 nihui-pc kernel: </TASK>
3月 11 17:21:29 nihui-pc kernel: Modules linked in: tls uinput exfat rfcomm snd_seq_dummy snd_hrtimer nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat ip6table_nat ip6table_mangle ip6table_raw ip6table_security iptable_nat nf_nat nf_conntra>
3月 11 17:21:29 nihui-pc kernel: videodev bluetooth snd_timer cfg80211 mc joydev rapl snd wmi_bmof pcspkr i2c_piix4 k10temp soundcore rfkill gpio_amdpt gpio_generic acpi_cpufreq zram amdgpu drm_ttm_helper ttm iommu_v2 video crct10dif_pclmul crc32_pclmul crc32c_intel gpu_sched polyval_clmulni polyval_generic nvme d>
3月 11 17:21:29 nihui-pc kernel: ---[ end trace 0000000000000000 ]---
3月 11 17:21:29 nihui-pc kernel: RIP: 0010:zs_page_putback+0x87/0x90
3月 11 17:21:29 nihui-pc kernel: Code: 5d e9 7d 4d a4 00 48 c7 c6 a8 d5 70 9b 48 89 df e8 9e 2d f6 ff 0f 0b 48 c7 c6 80 47 71 9b 48 89 df e8 8d 2d f6 ff 0f 0b 0f 0b <0f> 0b 0f 1f 80 00 00 00 00 0f 1f 44 00 00 55 53 48 89 fb e8 e1 c3
3月 11 17:21:29 nihui-pc kernel: RSP: 0018:ffffb3edc7fab4b8 EFLAGS: 00010246
3月 11 17:21:29 nihui-pc kernel: RAX: 0000000000000000 RBX: ffff90966e88cc08 RCX: ffffb3edc7fab690
3月 11 17:21:29 nihui-pc kernel: RDX: 00000000000000ff RSI: fffff6908c7b8108 RDI: 0000000000000000
3月 11 17:21:29 nihui-pc kernel: RBP: ffff90966e88cc38 R08: 000000000000006c R09: ffffffffffffffff
3月 11 17:21:29 nihui-pc kernel: R10: 00000000000389c0 R11: ffff909b1f2d5000 R12: ffffb3edc7fab690
3月 11 17:21:29 nihui-pc kernel: R13: dead000000000122 R14: dead000000000100 R15: fffff6908c7b8108
3月 11 17:21:29 nihui-pc kernel: FS: 00007fcd991ffa80(0000) GS:ffff909aff140000(0000) knlGS:0000000000000000
3月 11 17:21:29 nihui-pc kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
3月 11 17:21:29 nihui-pc kernel: CR2: 00007fcd8dc2c0b2 CR3: 0000000110204000 CR4: 0000000000750ee0
3月 11 17:21:29 nihui-pc kernel: PKRU: 55555554
3月 11 17:21:29 nihui-pc kernel: note: Xorg[1645] exited with preempt_count 1
3月 11 17:21:30 nihui-pc abrt-dump-journal-oops[1511]: abrt-dump-journal-oops: Found oopses: 1
3月 11 17:21:30 nihui-pc abrt-dump-journal-oops[1511]: abrt-dump-journal-oops: Creating problem directories
3月 11 17:21:30 nihui-pc abrt-server[41279]: Can't find a meaningful backtrace for hashing in '.'
3月 11 17:21:30 nihui-pc abrt-server[41279]: Preserving oops '.' because DropNotReportableOopses is 'no'
3月 11 17:21:31 nihui-pc abrt-notification[41296]: [🡕] System encountered a non-fatal error in ??()
3月 11 17:21:31 nihui-pc abrt-dump-journal-oops[1511]: Reported 1 kernel oopses to Abrt
3月 11 17:21:33 nihui-pc kernel: sched: RT throttling activated
3月 11 17:21:56 nihui-pc kernel: watchdog: BUG: soft lockup - CPU#1 stuck for 26s! [Port0:3055]