igt@gem_eio@hibernate|igt@gem_exec_suspend@basic-s4-devices- incomplete/dmesg-warn/dmesg-fail -clocksource: timekeeping watchdog on CPU\d+: Marking clocksource 'tsc' as unstable because the skew is too large| PM: hibernation: hibernation (exit|entry)
7> [519.119188] i915 0000:00:02.0: [drm:intel_power_well_disable [i915]] disabling power well 2
<6> [519.198269] smsc75xx 3-9:1.0 enx803f5d70659a: resuming from SUSPEND2
<7> [522.120093] i915 0000:00:02.0: [drm:intel_pps_vdd_off_sync_unlocked [i915]] Turning [ENCODER:235:DDI A/PHY A] VDD off
<7> [522.120468] i915 0000:00:02.0: [drm:intel_pps_vdd_off_sync_unlocked [i915]] PP_STATUS: 0x80000008 PP_CONTROL: 0x00000067
<7> [522.120632] i915 0000:00:02.0: [drm:intel_power_well_disable [i915]] disabling DC off
<7> [522.120827] i915 0000:00:02.0: [drm:skl_enable_dc6 [i915]] Enabling DC6
<7> [522.121002] i915 0000:00:02.0: [drm:gen9_set_dc_state.part.15 [i915]] Setting DC state from 00 to 02
<6> [523.148054] acpi LNXPOWER:00: Turning OFF
<7> [523.148157] i915 0000:00:02.0: [drm:i915_hdcp_component_bind [i915]] I915 HDCP comp bind
<6> [523.148312] mei_hdcp 0000:00:16.0-b638ab7e-94e2-4ea2-a552-d1c54b627f04: bound 0000:00:02.0 (ops i915_hdcp_component_ops [i915])
<7> [523.148597] PM: hibernation: Basic memory bitmaps freed
<6> [523.148600] OOM killer enabled.
<6> [523.148602] Restarting tasks ... done.
<6> [523.212422] PM: hibernation: hibernation exit
- Show closed items
Activity
-
Newest first Oldest first
-
Show all activity Show comments only Show history only
- Reporter
The CI Bug Log issue associated to this bug has been updated by Lakshmi Vudum.
New filters associated
- ADL_P: igt@gem_exec_suspend@basic-s4-devices - incomplete - PM: hibernation: hibernation exit
- https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_10304/re-adlp-pub1/igt@gem_exec_suspend@basic-s4-devices.html
- https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_10305/re-adlp-pub1/igt@gem_exec_suspend@basic-s4-devices.html
- https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_10306/re-adlp-pub1/igt@gem_exec_suspend@basic-s4-devices.html
- ADL_P: igt@gem_exec_suspend@basic-s4-devices - incomplete - PM: hibernation: hibernation exit
- LAKSHMINARAYANA VUDUM changed title from igt@gem_exec_suspend@basic-s4-devices - incomplete - PM: hibernation: hibernation exit to igt@gem_exec_suspend@basic-s4-devices - incomplete - PM: hibernation: hibernation (exit|entry)
changed title from igt@gem_exec_suspend@basic-s4-devices - incomplete - PM: hibernation: hibernation exit to igt@gem_exec_suspend@basic-s4-devices - incomplete - PM: hibernation: hibernation (exit|entry)
- Reporter
A CI Bug Log filter associated to this bug has been updated by Lakshmi Vudum:
Description: ADL_P: igt@gem_exec_suspend@basic-s4-devices - incomplete - PM: hibernation: hibernation (exit|entry)
Equivalent query: runconfig_tag IS IN ["DRM-TIP"] AND (machine_name IS IN ["re-adlp-pub1", "bat-adlp-4"] OR machine_tag IS IN ["ADL-P"]) AND ((testsuite_name = "IGT" AND test_name IS IN ["igt@gem_exec_suspend@basic-s4-devices"])) AND ((testsuite_name = "IGT" AND status_name IS IN ["incomplete"])) AND dmesg ~= 'PM: hibernation: hibernation (exit|entry)'
New failures caught by the filter:
- Ashutosh Dixit mentioned in issue #3640 (closed)
mentioned in issue #3640 (closed)
- Ashutosh Dixit mentioned in issue #3666 (closed)
mentioned in issue #3666 (closed)
- Developer
#assessment
igt@gem_exec_suspend@basic-s4-devices submits some batches, then puts the system into hibernate and then resumes the system after 5 seconds. On ADLP we see hang/incomplete (without any further trace in dmesg) on both hibernate entry and exit.
This issue appears identical to the one we see in #3640 (closed). I am proposing we close one of #3640 (closed) and #3749 (closed) as a duplicate.
- LAKSHMINARAYANA VUDUM marked #3640 (closed) as a duplicate of this issue
marked #3640 (closed) as a duplicate of this issue
- LAKSHMINARAYANA VUDUM marked this issue as related to #3640 (closed)
marked this issue as related to #3640 (closed)
- Reporter
The CI Bug Log issue associated to this bug has been updated by Lakshmi Vudum.
New filters associated
- ADL_P: igt@gem_eio@hibernate - incomplete (No new failures associated)
- Developer
While local reproduction observed issue with network. This seems a issue from network interface a softlockup in network transmit queue, network watchdog is triggering with different types of network card (PCI and USB) while resuming from hibernate.
[ 1567.788400] ------------[ cut here ]------------ [ 1567.788436] NETDEV WATCHDOG: eth1 (e1000e): transmit queue 0 timed out [ 1567.788467] WARNING: CPU: 0 PID: 0 at /home/anshuma1/drm-intel/drm-tip/net/sched/sch_generic.c:478 dev_watchdog+0x290/0x2d0 [ 1567.788480] Modules linked in: x86_pkg_temp_thermal coretemp mei_hdcp fuse snd_hda_codec_hdmi snd_hda_intel snd_intel_dspcfg snd_hda_codec snd_hwdep snd_hda_core snd_pcm mei_me mei crct10dif_pclmul crc32_pclmul e1000e ghash_clmulni_intel ptp i2c_i801 pps_core i2c_smbus i915 intel_lpss_pci ttm prime_numbers [ 1567.788567] CPU: 0 PID: 0 Comm: swapper/0 Tainted: G U 5.14.0-rc1-test-CI-CI_DRM_10499+ #146 (closed) [ 1567.788572] Hardware name: Intel Corporation Alder Lake Client Platform/AlderLake-P DDR5 RVP, BIOS ADLPFWI1.R00.2227.A00.2105311135 05/31/2021 [ 1567.788577] RIP: 0010:dev_watchdog+0x290/0x2d0 [ 1567.788583] Code: 66 ff e9 5e ff ff ff 4c 89 ef c6 05 e1 a6 e9 00 01 e8 a4 03 fc ff 89 d9 48 89 c2 4c 89 ee 48 c7 c7 48 ca 3a 82 e8 90 74 71 ff <0f> 0b e9 41 ff ff ff e8 f4 54 14 00 85 c0 74 b6 80 3d 04 a1 e9 00 [ 1567.788587] RSP: 0018:ffffc90000003e68 EFLAGS: 00010282 [ 1567.788596] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000103 [ 1567.788599] RDX: 0000000000000103 RSI: 00000000000000f6 RDI: 00000000ffffffff [ 1567.788604] RBP: ffff88810ff803e0 R08: 0000000000000001 R09: 0000000000000001 [ 1567.788608] R10: 0000000000000001 R11: ffffc90000003c68 R12: ffff88810ff80438 [ 1567.788611] R13: ffff88810ff80000 R14: 0000000000000001 R15: ffff88810fddee80 [ 1567.788616] FS: 0000000000000000(0000) GS:ffff88849f200000(0000) knlGS:0000000000000000 [ 1567.788620] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [ 1567.788623] CR2: 00007fbe3431a000 CR3: 000000010581c004 CR4: 0000000000770ef0 [ 1567.788627] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 [ 1567.788631] DR3: 0000000000000000 DR6: 00000000ffff07f0 DR7: 0000000000000400 [ 1567.788635] PKRU: 55555554 [ 1567.788639] Call Trace: [ 1567.788642] [ 1567.788647] ? qdisc_destroy+0x110/0x110 [ 1567.788655] call_timer_fn+0x9c/0x2c0 [ 1567.788667] ? qdisc_destroy+0x110/0x110 [ 1567.788675] run_timer_softirq+0x4ab/0x580 [ 1567.788690] __do_softirq+0xe3/0x492 [ 1567.788703] irq_exit_rcu+0xe8/0xf0 [ 1567.788709] sysvec_apic_timer_interrupt+0x8a/0xb0 [ 1567.788717] [ 1567.788721] asm_sysvec_apic_timer_interrupt+0x12/0x20 [ 1567.788728] RIP: 0010:mwait_idle+0x50/0x70 [ 1567.788732] Code: 31 d2 65 48 8b 04 25 40 6f 01 00 48 89 d1 0f 01 c8 48 8b 00 a8 08 75 21 eb 07 0f 00 2d c1 ae 51 00 31 c0 48 89 c1 fb 0f 01 c9 <65> 48 8b 04 25 40 6f 01 00 f0 80 60 02 df c3 fb eb ee 0f ae f0 0f [ 1567.788737] RSP: 0018:ffffffff82603e98 EFLAGS: 00000246 [ 1567.788744] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000 [ 1567.788747] RDX: 0000000000000000 RSI: ffffffff8238ece7 RDI: ffffffff8232aa7f [ 1567.788752] RBP: ffffffff8286cef8 R08: 0000000000000001 R09: 0000000000000001 [ 1567.788755] R10: 0000000000000001 R11: 0000000000000000 R12: 0000000000000000 [ 1567.788759] R13: 0000000000000000 R14: ffffffffffffffff R15: ffffffff826199c0
[ 1567.788400] ------------[ cut here ]------------ [ 1567.788436] NETDEV WATCHDOG: eth1 (e1000e): transmit queue 0 timed out [ 1567.788467] WARNING: CPU: 0 PID: 0 at /home/anshuma1/drm-intel/drm-tip/net/sched/sch_generic.c:478 dev_watchdog+0x290/0x2d0 [ 1567.788480] Modules linked in: x86_pkg_temp_thermal coretemp mei_hdcp fuse snd_hda_codec_hdmi snd_hda_intel snd_intel_dspcfg snd_hda_codec snd_hwdep snd_hda_core snd_pcm mei_me mei crct10dif_pclmul crc32_pclmul e1000e ghash_clmulni_intel ptp i2c_i801 pps_core i2c_smbus i915 intel_lpss_pci ttm prime_numbers [ 1567.788567] CPU: 0 PID: 0 Comm: swapper/0 Tainted: G U 5.14.0-rc1-test-CI-CI_DRM_10499+ #146 (closed) [ 1567.788572] Hardware name: Intel Corporation Alder Lake Client Platform/AlderLake-P DDR5 RVP, BIOS ADLPFWI1.R00.2227.A00.2105311135 05/31/2021 [ 1567.788577] RIP: 0010:dev_watchdog+0x290/0x2d0 [ 1567.788583] Code: 66 ff e9 5e ff ff ff 4c 89 ef c6 05 e1 a6 e9 00 01 e8 a4 03 fc ff 89 d9 48 89 c2 4c 89 ee 48 c7 c7 48 ca 3a 82 e8 90 74 71 ff <0f> 0b e9 41 ff ff ff e8 f4 54 14 00 85 c0 74 b6 80 3d 04 a1 e9 00 [ 1567.788587] RSP: 0018:ffffc90000003e68 EFLAGS: 00010282 [ 1567.788596] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000103 [ 1567.788599] RDX: 0000000000000103 RSI: 00000000000000f6 RDI: 00000000ffffffff [ 1567.788604] RBP: ffff88810ff803e0 R08: 0000000000000001 R09: 0000000000000001 [ 1567.788608] R10: 0000000000000001 R11: ffffc90000003c68 R12: ffff88810ff80438 [ 1567.788611] R13: ffff88810ff80000 R14: 0000000000000001 R15: ffff88810fddee80 [ 1567.788616] FS: 0000000000000000(0000) GS:ffff88849f200000(0000) knlGS:0000000000000000 [ 1567.788620] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [ 1567.788623] CR2: 00007fbe3431a000 CR3: 000000010581c004 CR4: 0000000000770ef0 [ 1567.788627] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 [ 1567.788631] DR3: 0000000000000000 DR6: 00000000ffff07f0 DR7: 0000000000000400 [ 1567.788635] PKRU: 55555554 [ 1567.788639] Call Trace: [ 1567.788642] [ 1567.788647] ? qdisc_destroy+0x110/0x110 [ 1567.788655] call_timer_fn+0x9c/0x2c0 [ 1567.788667] ? qdisc_destroy+0x110/0x110 [ 1567.788675] run_timer_softirq+0x4ab/0x580 [ 1567.788690] __do_softirq+0xe3/0x492 [ 1567.788703] irq_exit_rcu+0xe8/0xf0 [ 1567.788709] sysvec_apic_timer_interrupt+0x8a/0xb0 [ 1567.788717] [ 1567.788721] asm_sysvec_apic_timer_interrupt+0x12/0x20 [ 1567.788728] RIP: 0010:mwait_idle+0x50/0x70 [ 1567.788732] Code: 31 d2 65 48 8b 04 25 40 6f 01 00 48 89 d1 0f 01 c8 48 8b 00 a8 08 75 21 eb 07 0f 00 2d c1 ae 51 00 31 c0 48 89 c1 fb 0f 01 c9 <65> 48 8b 04 25 40 6f 01 00 f0 80 60 02 df c3 fb eb ee 0f ae f0 0f [ 1567.788737] RSP: 0018:ffffffff82603e98 EFLAGS: 00000246 [ 1567.788744] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000 [ 1567.788747] RDX: 0000000000000000 RSI: ffffffff8238ece7 RDI: ffffffff8232aa7f [ 1567.788752] RBP: ffffffff8286cef8 R08: 0000000000000001 R09: 0000000000000001 [ 1567.788755] R10: 0000000000000001 R11: 0000000000000000 R12: 0000000000000000 [ 1567.788759] R13: 0000000000000000 R14: ffffffffffffffff R15: ffffffff826199c0
- Developer
Use markdown so we can see the oops message in the above comment.
[ 1567.788400] -----------[ cut here ]----------- [ 1567.788436] NETDEV WATCHDOG: eth1 (e1000e): transmit queue 0 timed out [ 1567.788467] WARNING: CPU: 0 PID: 0 at /home/anshuma1/drm-intel/drm-tip/net/sched/sch_generic.c:478 dev_watchdog+0x290/0x2d0 [ 1567.788480] Modules linked in: x86_pkg_temp_thermal coretemp mei_hdcp fuse snd_hda_codec_hdmi snd_hda_intel snd_intel_dspcfg snd_hda_codec snd_hwdep snd_hda_core snd_pcm mei_me mei crct10dif_pclmul crc32_pclmul e1000e ghash_clmulni_intel ptp i2c_i801 pps_core i2c_smbus i915 intel_lpss_pci ttm prime_numbers [ 1567.788567] CPU: 0 PID: 0 Comm: swapper/0 Tainted: G U 5.14.0-rc1-test-CI-CI_DRM_10499+ #146 [ 1567.788572] Hardware name: Intel Corporation Alder Lake Client Platform/AlderLake-P DDR5 RVP, BIOS ADLPFWI1.R00.2227.A00.2105311135 05/31/2021 [ 1567.788577] RIP: 0010:dev_watchdog+0x290/0x2d0 [ 1567.788583] Code: 66 ff e9 5e ff ff ff 4c 89 ef c6 05 e1 a6 e9 00 01 e8 a4 03 fc ff 89 d9 48 89 c2 4c 89 ee 48 c7 c7 48 ca 3a 82 e8 90 74 71 ff <0f> 0b e9 41 ff ff ff e8 f4 54 14 00 85 c0 74 b6 80 3d 04 a1 e9 00 [ 1567.788587] RSP: 0018:ffffc90000003e68 EFLAGS: 00010282 [ 1567.788596] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000103 [ 1567.788599] RDX: 0000000000000103 RSI: 00000000000000f6 RDI: 00000000ffffffff [ 1567.788604] RBP: ffff88810ff803e0 R08: 0000000000000001 R09: 0000000000000001 [ 1567.788608] R10: 0000000000000001 R11: ffffc90000003c68 R12: ffff88810ff80438 [ 1567.788611] R13: ffff88810ff80000 R14: 0000000000000001 R15: ffff88810fddee80 [ 1567.788616] FS: 0000000000000000(0000) GS:ffff88849f200000(0000) knlGS:0000000000000000 [ 1567.788620] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [ 1567.788623] CR2: 00007fbe3431a000 CR3: 000000010581c004 CR4: 0000000000770ef0 [ 1567.788627] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 [ 1567.788631] DR3: 0000000000000000 DR6: 00000000ffff07f0 DR7: 0000000000000400 [ 1567.788635] PKRU: 55555554 [ 1567.788639] Call Trace: [ 1567.788642] <IRQ> [ 1567.788647] ? qdisc_destroy+0x110/0x110 [ 1567.788655] call_timer_fn+0x9c/0x2c0 [ 1567.788667] ? qdisc_destroy+0x110/0x110 [ 1567.788675] run_timer_softirq+0x4ab/0x580 [ 1567.788690] __do_softirq+0xe3/0x492 [ 1567.788703] irq_exit_rcu+0xe8/0xf0 [ 1567.788709] sysvec_apic_timer_interrupt+0x8a/0xb0 [ 1567.788717] </IRQ> [ 1567.788721] asm_sysvec_apic_timer_interrupt+0x12/0x20 [ 1567.788728] RIP: 0010:mwait_idle+0x50/0x70 [ 1567.788732] Code: 31 d2 65 48 8b 04 25 40 6f 01 00 48 89 d1 0f 01 c8 48 8b 00 a8 08 75 21 eb 07 0f 00 2d c1 ae 51 00 31 c0 48 89 c1 fb 0f 01 c9 <65> 48 8b 04 25 40 6f 01 00 f0 80 60 02 df c3 fb eb ee 0f ae f0 0f [ 1567.788737] RSP: 0018:ffffffff82603e98 EFLAGS: 00000246 [ 1567.788744] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000 [ 1567.788747] RDX: 0000000000000000 RSI: ffffffff8238ece7 RDI: ffffffff8232aa7f [ 1567.788752] RBP: ffffffff8286cef8 R08: 0000000000000001 R09: 0000000000000001 [ 1567.788755] R10: 0000000000000001 R11: 0000000000000000 R12: 0000000000000000 [ 1567.788759] R13: 0000000000000000 R14: ffffffffffffffff R15: ffffffff826199c0
- Ashutosh Dixit mentioned in issue #4168 (closed)
mentioned in issue #4168 (closed)
- Reporter
The CI Bug Log issue associated to this bug has been updated by Lakshmi Vudum.
New filters associated
- RKL ADL_P : igt@gem_eio@hibernate - fail - Failed assertion: igt_sysfs_set(power_dir, "state", suspend_state_name[state]) (No new failures associated)
- Developer
There is a display related failure being seen here:
<3> [374.336396] i915 0000:00:02.0: [drm] *ERROR* Link Training Unsuccessful <3> [374.376771] i915 0000:00:02.0: [drm] *ERROR* AUX B/DDI B/PHY B: did not complete or timeout within 10ms (status 0xad4003ff) <7> [375.034907] i915 0000:00:02.0: [drm:drm_dp_dpcd_access] AUX B/DDI B/PHY B: Too many retries, giving up. First error: -110 <3> [387.287910] i915 0000:00:02.0: [drm] *ERROR* AUX USBC4/DDI TC4/PHY TC4: did not complete or timeout within 10ms (status 0xac1003ff)
Edited by Ashutosh Dixit - Reporter
The CI Bug Log issue associated to this bug has been updated by Lakshmi Vudum.
Removed filters
- ADL_P: igt@gem_eio@hibernate - incomplete (added on 2 months, 2 weeks ago)
- Reporter
The CI Bug Log issue associated to this bug has been updated by Lakshmi Vudum.
New filters associated
- ADL_P: igt@gem_eio@hibernate - dmesg-fail - Failed assertion: igt_sysfs_set(power_dir, "state", suspend_state_name[state]), INFO: task gem_eio:\d+ blocked for more than 30 seconds.
- Author Reporter
New filter is added to this issue ADL_P: igt@gem_eio@hibernate - dmesg-fail - Failed assertion: igt_sysfs_set(power_dir, "state", suspend_state_name[state]), INFO: task gem_eio:\d+ blocked for more than 30 seconds.
- Reporter
The CI Bug Log issue associated to this bug has been updated by Lakshmi Vudum.
New filters associated
- ADL_P: igt@gem_eio@hibernate - incomplete -clocksource: timekeeping watchdog on CPU\d: Marking clocksource 'tsc' as unstable because the skew is too large:
- Reporter
A CI Bug Log filter associated to this bug has been updated by Lakshmi Vudum:
Description: ADL_P: igt@gem_eio@hibernate - incomplete/dmesg-warn -clocksource: timekeeping watchdog on CPU\d: Marking clocksource 'tsc' as unstable because the skew is too large:
Equivalent query: runconfig_tag IS IN ["DRM-TIP"] AND (machine_name IS IN ["re-adlp-pub3", "re-adlp-pub2", "re-adlp-5", "bat-adlp-4", "re-adlp-pub1"] OR machine_tag IS IN ["ADL-P"]) AND ((testsuite_name = "IGT" AND test_name IS IN ["igt@gem_eio@hibernate"])) AND ((testsuite_name = "IGT" AND status_name IS IN ["incomplete", "dmesg-warn"])) AND dmesg ~= 'clocksource: timekeeping watchdog on CPU\d: Marking clocksource 'tsc' as unstable because the skew is too large'
New failures caught by the filter: