Kernel warning "WQ_MEM_RECLAIM sdma1:drm_sched_run_job_work [gpu_sched] is flushing !WQ_MEM_RECLAIM events:amdgpu_device_delay_enable_gfx_off [amdgpu]"
Brief summary of the problem:
Nothing particularly happens, but I thought it's important to track as it's a kernel oops anyway.
Hardware description:
- CPU: Ryzen 9 5950X
- GPU: 10:00.0 VGA compatible controller [0300]: Advanced Micro Devices, Inc. [AMD/ATI] Navi 21 [Radeon RX 6800/6800 XT / 6900 XT] [1002:73bf] (rev c0)
- System Memory: 64GB RAM DDR4 3600MHz
- Display(s): LG 27GR93U as main, LG 27GR75Q as side
- Type of Display Connection: DP
System information:
- Distro name and Version: Artix Linux
- Kernel version: Linux naomi-pc 6.12.1-artix1-1 #1 (closed) SMP PREEMPT_DYNAMIC Fri, 22 Nov 2024 22:38:24 +0000 x86_64 GNU/Linux
How to reproduce the issue:
The simplest way I can reproduce it is by mounting a drive and copying files to it on my KDE Plasma 6 installation.
Attached files:
Screenshots/video files
Log files (for system lockups / game freezes / crashes)
The same screenshot but as text
[Dec 1 11:26] ------------[ cut here ]------------
[ +0,000005] workqueue: WQ_MEM_RECLAIM sdma1:drm_sched_run_job_work [gpu_sched] is flushing !WQ_MEM_RECLAIM events:amdgpu_device_delay_enable_gfx_off [amdgpu]
[ +0,000226] WARNING: CPU: 24 PID: 27441 at kernel/workqueue.c:3704 check_flush_dependency+0xfc/0x120
[ +0,000007] Modules linked in: snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_seq rfcomm zram 842_decompress 842_compress lz4hc_compress lz4_compress uhid cmac algif_hash algif_skcipher af_alg bnep qrtr vfat fat amd_atl intel_rapl_msr intel_rapl_common uinput nct6775 kvm_amd snd_hda_codec_realtek nct6775_core iwlmvm snd_hda_codec_generic hwmon_vid kvm snd_hda_scodec_component snd_hda_codec_hdmi drivetemp crct10dif_pclmul crc32_pclmul mac80211 snd_hda_intel polyval_clmulni uvcvideo i2c_dev snd_intel_dspcfg polyval_generic videobuf2_vmalloc crypto_user snd_usb_audio uvc snd_intel_sdw_acpi ghash_clmulni_intel libarc4 snd_usbmidi_lib videobuf2_memops sha512_ssse3 snd_hda_codec btusb videobuf2_v4l2 snd_ump sha1_ssse3 snd_hda_core snd_rawmidi btrtl videobuf2_common aesni_intel iwlwifi snd_hwdep snd_seq_device btintel gf128mul snd_pcm crypto_simd btbcm videodev btmtk ee1004 mousedev joydev cryptd mc sp5100_tco snd_timer igb rapl cfg80211 bluetooth snd i2c_piix4 wmi_bmof ptp mxm_wmi i2c_smbus pps_core soundcore
[ +0,000097] ccp zenpower(OE) dca rfkill mac_hid hid_generic usbhid amdgpu btrfs crc16 video amdxcp i2c_algo_bit drm_ttm_helper ttm blake2b_generic drm_exec libcrc32c crc32c_generic gpu_sched xor drm_suballoc_helper raid6_pq drm_buddy nvme crc32c_intel drm_display_helper sha256_ssse3 nvme_core cec nvme_auth wmi
[ +0,000035] CPU: 24 UID: 0 PID: 27441 Comm: kworker/u128:2 Tainted: G OE 6.12.1-artix1-1 #1 8b9d87d8fa650f97631bc0b3278186da6e75cf40
[ +0,000005] Tainted: [O]=OOT_MODULE, [E]=UNSIGNED_MODULE
[ +0,000001] Hardware name: To Be Filled By O.E.M. X570 Taichi/X570 Taichi, BIOS P5.60 01/18/2024
[ +0,000002] Workqueue: sdma1 drm_sched_run_job_work [gpu_sched]
[ +0,000005] RIP: 0010:check_flush_dependency+0xfc/0x120
[ +0,000003] Code: 8b 45 18 48 8d b2 c0 00 00 00 49 89 e8 48 8d 8b c0 00 00 00 48 c7 c7 50 8b ae 92 c6 05 a8 04 16 02 01 48 89 c2 e8 04 8e fd ff <0f> 0b e9 1f ff ff ff 80 3d 93 04 16 02 00 75 93 e9 4a ff ff ff 66
[ +0,000002] RSP: 0018:ffffb7f8dc7a7c60 EFLAGS: 00010082
[ +0,000003] RAX: 0000000000000000 RBX: ffff998cc0050c00 RCX: 0000000000000027
[ +0,000002] RDX: ffff999bfee218c8 RSI: 0000000000000001 RDI: ffff999bfee218c0
[ +0,000002] RBP: ffffffffc0865a60 R08: 0000000000000000 R09: ffffb7f8dc7a7ae0
[ +0,000001] R10: ffff999bfe1fffa8 R11: 0000000000000003 R12: ffff998ffd1c1b00
[ +0,000002] R13: ffff998d6963f800 R14: ffffb7f8dc7a7c90 R15: 0000000000000001
[ +0,000001] FS: 0000000000000000(0000) GS:ffff999bfee00000(0000) knlGS:0000000000000000
[ +0,000002] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ +0,000002] CR2: 00007af95c75f000 CR3: 0000000107cae000 CR4: 0000000000f50ef0
[ +0,000002] PKRU: 55555554
[ +0,000002] Call Trace:
[ +0,000001] <TASK>
[ +0,000002] ? check_flush_dependency+0xfc/0x120
[ +0,000003] ? __warn.cold+0x93/0xf6
[ +0,000003] ? check_flush_dependency+0xfc/0x120
[ +0,000003] ? report_bug+0xff/0x140
[ +0,000004] ? handle_bug+0x58/0x90
[ +0,000003] ? exc_invalid_op+0x17/0x70
[ +0,000002] ? asm_exc_invalid_op+0x1a/0x20
[ +0,000004] ? __pfx_amdgpu_device_delay_enable_gfx_off+0x10/0x10 [amdgpu a23f830851c51aa39e7594eed4736a88404a55c5]
[ +0,000179] ? check_flush_dependency+0xfc/0x120
[ +0,000004] __flush_work+0x110/0x2c0
[ +0,000005] cancel_delayed_work_sync+0x5e/0x80
[ +0,000004] amdgpu_gfx_off_ctrl+0xad/0x140 [amdgpu a23f830851c51aa39e7594eed4736a88404a55c5]
[ +0,000218] amdgpu_ring_alloc+0x43/0x60 [amdgpu a23f830851c51aa39e7594eed4736a88404a55c5]
[ +0,000193] amdgpu_ib_schedule+0xf0/0x730 [amdgpu a23f830851c51aa39e7594eed4736a88404a55c5]
[ +0,000193] amdgpu_job_run+0x8c/0x170 [amdgpu a23f830851c51aa39e7594eed4736a88404a55c5]
[ +0,000244] ? mod_delayed_work_on+0xa4/0xb0
[ +0,000005] drm_sched_run_job_work+0x25c/0x3f0 [gpu_sched c9bc3beb3d41f07197b72f2619d66593d5d285ba]
[ +0,000006] process_one_work+0x17e/0x330
[ +0,000004] worker_thread+0x2ce/0x3f0
[ +0,000003] ? __pfx_worker_thread+0x10/0x10
[ +0,000003] kthread+0xd2/0x100
[ +0,000004] ? __pfx_kthread+0x10/0x10
[ +0,000003] ret_from_fork+0x34/0x50
[ +0,000004] ? __pfx_kthread+0x10/0x10
[ +0,000003] ret_from_fork_asm+0x1a/0x30
[ +0,000006] </TASK>
[ +0,000002] ---[ end trace 0000000000000000 ]---