Results for igt@xe_render_copy@render-stress-2-copies

Result: Abort, 41 Warning(s)

i915_display_info3 igt_runner3 results3.json results3-xe-load.json guc_logs3.tar i915_display_info_post_exec3 boot3 dmesg3

Duration: unknown
Hostname: shard-bmg-2
IGT-Version: 2.4-g561940047 (x86_64) (Linux: 7.1.0-rc1-lgci-xe-xe-4950-9ee30ac229d686465-debug+ x86_64)
Out
Using IGT_SRANDOM=1777510116 for randomisation
Opened device: /dev/dri/card0
Starting subtest: render-stress-2-copies
[thread:9976] Stack trace:
[thread:9977] Stack trace:
[thread:9977]   #0 ../lib/igt_core.c:2075 __igt_fail_assert()
[thread:9976]   #0 ../lib/igt_core.c:2075 __igt_fail_assert()
[thread:9977]   #1 ../lib/xe/xe_ioctl.c:556 xe_bo_mmap_offset()
[thread:9977]   #2 ../lib/xe/xe_ioctl.c:564 xe_bo_map()
[thread:9977]   #3 ../tests/intel/xe_render_copy.c:488 run_thread_mem_copy()
[thread:9976]   #1 [syncobj_wait+0xab]
[thread:9976]   #2 ../lib/xe/xe_ioctl.c:319 __xe_vm_bind_sync()
[thread:9976]   #3 ../lib/xe/xe_ioctl.c:327 xe_vm_bind_sync()
[thread:9976]   #4 ../tests/intel/xe_render_copy.c:491 run_thread_mem_copy()
runner: This test was killed due to a kernel taint (0x244).

This test caused an abort condition: Kernel badly tainted (0x244, 0x200) (check dmesg for details):
	TAINT_WARN: WARN_ON has happened.
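
The two masks in the abort message can be decoded by hand. The following is a minimal sketch, assuming the bit-to-letter assignments from the kernel's tainted-kernels documentation. Only bits 0 to 9 are listed, and the helper name `decode_taint` is made up for illustration.

```python
# Taint bit positions and letters, as listed in the kernel's
# Documentation/admin-guide/tainted-kernels.rst (bits 0-9 only).
TAINT_FLAGS = {
    0: ("P", "proprietary module loaded"),
    1: ("F", "module force loaded"),
    2: ("S", "running on an out-of-specification system"),
    3: ("R", "module force unloaded"),
    4: ("M", "machine check exception reported"),
    5: ("B", "bad page referenced"),
    6: ("U", "taint requested by userspace"),
    7: ("D", "kernel died recently (OOPS or BUG)"),
    8: ("A", "ACPI table overridden"),
    9: ("W", "kernel issued a warning"),
}


def decode_taint(mask):
    """Return the taint letters for every bit set in mask, lowest bit first."""
    letters = []
    bit = 0
    while mask >> bit:
        if mask & (1 << bit):
            # Bits outside this abbreviated table are reported as "?".
            letters.append(TAINT_FLAGS.get(bit, ("?", "not listed here"))[0])
        bit += 1
    return letters


for mask in (0x244, 0x200):
    print(hex(mask), decode_taint(mask))
```

Under these assumptions 0x244 decodes to S, U and W, which agrees with the `Tainted: G S U W` lines in dmesg, and 0x200 is the W bit alone, which the runner reports as TAINT_WARN.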
Err
Starting subtest: render-stress-2-copies
(xe_render_copy:9972) [thread:9976] igt_syncobj-CRITICAL: Test assertion failure function syncobj_wait, file ../lib/igt_syncobj.c:240:
(xe_render_copy:9972) [thread:9976] igt_syncobj-CRITICAL: Failed assertion: ret == 0
(xe_render_copy:9972) [thread:9976] igt_syncobj-CRITICAL: error: -125 != 0
Received signal SIGQUIT.
Stack trace: 
(xe_render_copy:9972) [thread:9977] xe/xe_ioctl-CRITICAL: Test assertion failure function xe_bo_mmap_offset, file ../lib/xe/xe_ioctl.c:553:
(xe_render_copy:9972) [thread:9977] xe/xe_ioctl-CRITICAL: Failed assertion: igt_ioctl(fd, (((2U|1U) << (((0+8)+8)+14)) | ((('d')) << (0+8)) | (((0x40 + 0x02)) << 0) | ((((sizeof(struct drm_xe_gem_mmap_offset)))) << ((0+8)+8))), &mmo) == 0
(xe_render_copy:9972) [thread:9977] xe/xe_ioctl-CRITICAL: Last errno: 125, Operation canceled
(xe_render_copy:9972) [thread:9977] xe/xe_ioctl-CRITICAL: error: -1 != 0
 #0 [fatal_sig_handler+0x17b]
 #1 [__sigaction+0x50]
 #2 [__lll_lock_wait_private+0x8e]
 #3 [pthread_mutex_lock+0x111]
 #4 [__igt_unique____real_main681+0x530]
 #5 [main+0x3a]
 #6 [__libc_init_first+0x8a]
 #7 [__libc_start_main+0x8b]
 #8 [_start+0x25]
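
The long expression in the `xe_bo_mmap_offset` assertion above is a preprocessor-expanded ioctl request number. Below is a rough sketch of how its fields unpack, assuming the generic Linux `_IOC` layout that the shifts in the expression imply: number in bits 7:0, type in bits 15:8, size in bits 29:16, direction in bits 31:30. The 40-byte struct size is an assumption for the example (the log only shows `sizeof(struct drm_xe_gem_mmap_offset)`), and the function names are illustrative.

```python
def ioc(direction, type_char, nr, size):
    """Pack an ioctl request number the way the expanded expression does."""
    return (direction << 30) | (size << 16) | (ord(type_char) << 8) | nr


def decode_ioc(request):
    """Split an ioctl request number back into its four fields."""
    return {
        "direction": (request >> 30) & 0x3,   # 2 = read, 1 = write
        "size": (request >> 16) & 0x3FFF,     # argument struct size in bytes
        "type": chr((request >> 8) & 0xFF),   # 'd' in the logged expression
        "nr": request & 0xFF,                 # 0x40 + 0x02 in the logged expression
    }


# ASSUMPTION: 40 stands in for sizeof(struct drm_xe_gem_mmap_offset).
request = ioc(2 | 1, "d", 0x40 + 0x02, 40)
print(hex(request), decode_ioc(request))
```

Read this way, the failing call is a read/write ioctl of type 'd', number 0x42, taking a `struct drm_xe_gem_mmap_offset`; it failed with errno 125 (Operation canceled).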
Dmesg

<6> [479.621636] Console: switching to colour dummy device 80x25
<6> [479.621943] [IGT] xe_render_copy: executing
<6> [479.631369] [IGT] xe_render_copy: starting subtest render-stress-2-copies
<6> [479.735893] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [479.735919] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [479.735929] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [479.735939] nvme 0000:05:00.0: [ 0] RxErr (First)
<7> [480.395510] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x2d2d292a
<7> [480.395665] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x2c2d2c2d
<3> [481.916772] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=60968 recv=60967
<3> [484.220050] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=60968 recv=60967
<7> [484.563573] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x2d2d2a2b
<7> [484.563724] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x2d2d2c2d
<3> [486.524933] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=60969 recv=60967
<7> [488.154906] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x2d2d2a2b
<7> [488.155192] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x2d2d2d2d
<3> [488.827498] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=60969 recv=60967
<3> [491.131543] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=60970 recv=60967
<3> [491.140441] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=60971 recv=60967
<6> [491.258936] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [491.258961] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [491.258971] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [491.258981] nvme 0000:05:00.0: [ 0] RxErr (First)
<3> [493.435525] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=60970 recv=60967
<3> [493.444421] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=60971 recv=60967
<3> [495.739491] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=60972 recv=60967
<3> [495.748394] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=60973 recv=60967
<3> [498.043535] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=60972 recv=60967
<3> [498.052443] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=60973 recv=60967
<7> [503.235143] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x2e2d2b2b
<7> [503.235627] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x2e2d2d2e
<7> [503.484854] xe 0000:03:00.0: [drm:xe_hw_engine_snapshot_capture [xe]] Tile0: GT0: Proceeding with manual engine snapshot
<4> [503.485472] xe 0000:03:00.0: [drm] Tile0: GT0: Check job timeout: seqno=95569, lrc_seqno=95569, guc_id=0, not started
<4> [508.604018] xe 0000:03:00.0: [drm] Tile0: GT0: Check job timeout: seqno=95569, lrc_seqno=95569, guc_id=0, not started
<4> [513.723513] xe 0000:03:00.0: [drm] Tile0: GT0: Check job timeout: seqno=95569, lrc_seqno=95569, guc_id=0, not started
<6> [518.138196] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [518.138211] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [518.138217] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [518.138222] nvme 0000:05:00.0: [ 0] RxErr (First)
<7> [518.204991] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x2e2e2b2c
<7> [518.205196] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x2e2e2d2e
<4> [518.843402] xe 0000:03:00.0: [drm] Tile0: GT0: Schedule disable failed to respond, guc_id=0
<6> [519.031477] xe 0000:03:00.0: [drm] Xe device coredump has been created
<6> [519.031497] xe 0000:03:00.0: [drm] Check your /sys/class/drm/card0/device/devcoredump/data
<6> [519.031499] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [519.031592] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<6> [519.031724] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<7> [519.031876] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [519.032721] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [519.032819] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [519.032912] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x00800000
<7> [519.033001] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x11111440
<7> [519.033121] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [519.033220] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [519.033315] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [519.033408] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00000100
<7> [519.033496] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [519.033592] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [519.033682] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7> [519.034932] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<3> [519.045629] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: load failed: status = 0x400000A0, time = 10ms, freq = 2150MHz (req 2133MHz)
<3> [519.057481] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: load failed: status: Reset = 0, BootROM = 0x50, UKernel = 0x00, MIA = 0x00, Auth = 0x01
<3> [519.070266] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: firmware signature verification failed
<3> [519.079144] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: reset failed (-EPROTO)
<3> [519.086293] xe 0000:03:00.0: [drm] *ERROR* CRITICAL: Xe has declared device 0000:03:00.0 as wedged.
IOCTLs and executions are blocked.
For recovery procedure, refer to https://docs.kernel.org/gpu/drm-uapi.html#device-wedging
Please file a _new_ bug report at https://gitlab.freedesktop.org/drm/xe/kernel/issues/new
<7> [519.118159] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [519.118380] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT1: GuC CT communication channel stopped
<3> [519.181991] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT1: GuC mmio request 0x5507: no reply 0x5507
<6> [519.190814] xe 0000:03:00.0: [drm] device wedged, needs recovery
<4> [519.194894] ------------[ cut here ]------------
<4> [519.194902] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [519.194910] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1626 at guc_exec_queue_timedout_job+0x141a/0x2400 [xe], CPU#3: kworker/u64:4/9946
<4> [519.196126] Modules linked in: xe_vfio_pci vfio_pci_core vfio_iommu_type1 vfio iommufd snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_gsc_proxy mei_lb mei_gsc mtd_intel_dg xe drm_gpuvm drm_gpusvm_helper drm_buddy gpu_sched drm_ttm_helper ttm drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp hid_generic cmdlinepart coretemp mei_pxp mei_hdcp spi_nor eeepc_wmi mtd asus_wmi sparse_keymap wmi_bmof binfmt_misc kvm_intel usbhid hid kvm snd_intel_dspcfg irqbypass aesni_intel snd_hda_codec gf128mul snd_hda_core r8169 snd_hwdep rapl intel_cstate snd_pcm video realtek snd_timer phy_package mei_me snd idma64 spi_intel_pci i2c_i801 spi_intel i2c_mux i2c_smbus soundcore mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry acpi_tad intel_vsec acpi_pad pinctrl_alderlake wmi dm_multipath msr nvme_fabrics
<4> [519.196426] fuse efi_pstore nfnetlink autofs4 [last unloaded: snd_hda_intel]
<4> [519.196449] CPU: 3 UID: 0 PID: 9946 Comm: kworker/u64:4 Tainted: G S U 7.1.0-rc1-lgci-xe-xe-4950-9ee30ac229d686465-debug+ #1 PREEMPT(lazy)
<4> [519.196460] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER
<4> [519.196465] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [519.196470] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [519.196492] RIP: 0010:guc_exec_queue_timedout_job+0x1423/0x2400 [xe]
<4> [519.196788] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 40 d1 3c e1 48 89 c6 48 8d 3d d6 92 d9 ff 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 d0 ee ff ff 8b 70 08 49
<4> [519.196793] RSP: 0018:ffffc90003577ca0 EFLAGS: 00010246
<4> [519.196801] RAX: ffffffffa12bb978 RBX: 0000000000000000 RCX: 0000000000000000
<4> [519.196806] RDX: ffff888104347990 RSI: ffffffffa12bb978 RDI: ffffffffa0c03f10
<4> [519.196810] RBP: ffffc90003577db0 R08: 0000000000000000 R09: 0000000000000000
<4> [519.196813] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [519.196817] R13: ffff888104347990 R14: ffff888118eda018 R15: 00000000ffffffc2
<4> [519.196821] FS: 0000000000000000(0000) GS:ffff8888dae03000(0000) knlGS:0000000000000000
<4> [519.196827] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [519.196831] CR2: 000000c000496000 CR3: 000000000344a001 CR4: 0000000000f72ef0
<4> [519.196835] PKRU: 55555554
<4> [519.196839] Call Trace:
<4> [519.196843] <TASK>
<4> [519.196855] ? __pfx_lock_acquire+0x10/0x10
<4> [519.196872] ? lock_release+0xd0/0x2b0
<4> [519.196888] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [519.196907] process_one_work+0x239/0x740
<4> [519.196929] worker_thread+0x200/0x3f0
<4> [519.196939] ? __pfx_worker_thread+0x10/0x10
<4> [519.196947] kthread+0x10d/0x150
<4> [519.196954] ? __pfx_kthread+0x10/0x10
<4> [519.196963] ret_from_fork+0x3bd/0x470
<4> [519.196971] ? __pfx_kthread+0x10/0x10
<4> [519.196979] ret_from_fork_asm+0x1a/0x30
<4> [519.197003] </TASK>
<4> [519.197007] irq event stamp: 5799
<4> [519.197010] hardirqs last enabled at (5805): [<ffffffff814ab1b9>] __up_console_sem+0x79/0xa0
<4> [519.197018] hardirqs last disabled at (5810): [<ffffffff814ab19e>] __up_console_sem+0x5e/0xa0
<4> [519.197023] softirqs last enabled at (4914): [<ffffffff813d228b>] __irq_exit_rcu+0xdb/0x1c0
<4> [519.197032] softirqs last disabled at (4907): [<ffffffff813d228b>] __irq_exit_rcu+0xdb/0x1c0
<4> [519.197040] ---[ end trace 0000000000000000 ]---
<4> [519.199310] ------------[ cut here ]------------
<4> [519.199325] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [519.199337] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1626 at guc_exec_queue_timedout_job+0x141a/0x2400 [xe], CPU#14: kworker/u64:1/9945
<4> [519.199902] Modules linked in: xe_vfio_pci vfio_pci_core vfio_iommu_type1 vfio iommufd snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_gsc_proxy mei_lb mei_gsc mtd_intel_dg xe drm_gpuvm drm_gpusvm_helper drm_buddy gpu_sched drm_ttm_helper ttm drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp hid_generic cmdlinepart coretemp mei_pxp mei_hdcp spi_nor eeepc_wmi mtd asus_wmi sparse_keymap wmi_bmof binfmt_misc kvm_intel usbhid hid kvm snd_intel_dspcfg irqbypass aesni_intel snd_hda_codec gf128mul snd_hda_core r8169 snd_hwdep rapl intel_cstate snd_pcm video realtek snd_timer phy_package mei_me snd idma64 spi_intel_pci i2c_i801 spi_intel i2c_mux i2c_smbus soundcore mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry acpi_tad intel_vsec acpi_pad pinctrl_alderlake wmi dm_multipath msr nvme_fabrics
<4> [519.200206] fuse efi_pstore nfnetlink autofs4 [last unloaded: snd_hda_intel]
<4> [519.200215] CPU: 14 UID: 0 PID: 9945 Comm: kworker/u64:1 Tainted: G S U W 7.1.0-rc1-lgci-xe-xe-4950-9ee30ac229d686465-debug+ #1 PREEMPT(lazy)
<4> [519.200219] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [519.200221] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [519.200223] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [519.200232] RIP: 0010:guc_exec_queue_timedout_job+0x1423/0x2400 [xe]
<4> [519.200358] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 40 d1 3c e1 48 89 c6 48 8d 3d d6 92 d9 ff 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 d0 ee ff ff 8b 70 08 49
<4> [519.200361] RSP: 0018:ffffc90002f77ca0 EFLAGS: 00010246
<4> [519.200364] RAX: ffffffffa12bb978 RBX: 0000000000000000 RCX: 0000000000000000
<4> [519.200366] RDX: ffff888104347990 RSI: ffffffffa12bb978 RDI: ffffffffa0c03f10
<4> [519.200368] RBP: ffffc90002f77db0 R08: 0000000000000000 R09: 0000000000000000
<4> [519.200370] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [519.200372] R13: ffff888104347990 R14: ffff888118eda018 R15: 00000000ffffffc2
<4> [519.200374] FS: 0000000000000000(0000) GS:ffff8888db383000(0000) knlGS:0000000000000000
<4> [519.200376] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [519.200378] CR2: 000000c000497000 CR3: 000000000344a006 CR4: 0000000000f72ef0
<4> [519.200380] PKRU: 55555554
<4> [519.200382] Call Trace:
<4> [519.200383] <TASK>
<4> [519.200390] ? __pfx_lock_acquire+0x10/0x10
<4> [519.200398] ? lock_release+0xd0/0x2b0
<4> [519.200406] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [519.200416] process_one_work+0x239/0x740
<4> [519.200427] worker_thread+0x200/0x3f0
<4> [519.200431] ? __pfx_worker_thread+0x10/0x10
<4> [519.200435] kthread+0x10d/0x150
<4> [519.200438] ? __pfx_kthread+0x10/0x10
<4> [519.200443] ret_from_fork+0x3bd/0x470
<4> [519.200446] ? __pfx_kthread+0x10/0x10
<4> [519.200450] ret_from_fork_asm+0x1a/0x30
<4> [519.200463] </TASK>
<4> [519.200464] irq event stamp: 3215
<4> [519.200466] hardirqs last enabled at (3221): [<ffffffff814ab1b9>] __up_console_sem+0x79/0xa0
<4> [519.200469] hardirqs last disabled at (3226): [<ffffffff814ab19e>] __up_console_sem+0x5e/0xa0
<4> [519.200471] softirqs last enabled at (2650): [<ffffffff813d228b>] __irq_exit_rcu+0xdb/0x1c0
<4> [519.200475] softirqs last disabled at (2645): [<ffffffff813d228b>] __irq_exit_rcu+0xdb/0x1c0
<4> [519.200479] ---[ end trace 0000000000000000 ]---
<6> [519.200482] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<4> [519.200627] ------------[ cut here ]------------
<4> [519.200629] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [519.200631] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1626 at guc_exec_queue_timedout_job+0x141a/0x2400 [xe], CPU#14: kworker/u64:1/9945
<4> [519.200751] Modules linked in: xe_vfio_pci vfio_pci_core vfio_iommu_type1 vfio iommufd snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_gsc_proxy mei_lb mei_gsc mtd_intel_dg xe drm_gpuvm drm_gpusvm_helper drm_buddy gpu_sched drm_ttm_helper ttm drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp hid_generic cmdlinepart coretemp mei_pxp mei_hdcp spi_nor eeepc_wmi mtd asus_wmi sparse_keymap wmi_bmof binfmt_misc kvm_intel usbhid hid kvm snd_intel_dspcfg irqbypass aesni_intel snd_hda_codec gf128mul snd_hda_core r8169 snd_hwdep rapl intel_cstate snd_pcm video realtek snd_timer phy_package mei_me snd idma64 spi_intel_pci i2c_i801 spi_intel i2c_mux i2c_smbus soundcore mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry acpi_tad intel_vsec acpi_pad pinctrl_alderlake wmi dm_multipath msr nvme_fabrics
<4> [519.200842] fuse efi_pstore nfnetlink autofs4 [last unloaded: snd_hda_intel]
<4> [519.200850] CPU: 14 UID: 0 PID: 9945 Comm: kworker/u64:1 Tainted: G S U W 7.1.0-rc1-lgci-xe-xe-4950-9ee30ac229d686465-debug+ #1 PREEMPT(lazy)
<4> [519.200854] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [519.200855] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [519.200857] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [519.200865] RIP: 0010:guc_exec_queue_timedout_job+0x1423/0x2400 [xe]
<4> [519.200986] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 40 d1 3c e1 48 89 c6 48 8d 3d d6 92 d9 ff 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 d0 ee ff ff 8b 70 08 49
<4> [519.200988] RSP: 0018:ffffc90002f77ca0 EFLAGS: 00010246
<4> [519.200991] RAX: ffffffffa12bb978 RBX: 0000000000000000 RCX: 0000000000000000
<4> [519.200993] RDX: ffff888104347990 RSI: ffffffffa12bb978 RDI: ffffffffa0c03f10
<4> [519.200994] RBP: ffffc90002f77db0 R08: 0000000000000000 R09: 0000000000000000
<4> [519.200996] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [519.200998] R13: ffff888104347990 R14: ffff888118eda018 R15: 00000000ffffffc2
<4> [519.201000] FS: 0000000000000000(0000) GS:ffff8888db383000(0000) knlGS:0000000000000000
<4> [519.201002] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [519.201004] CR2: 000000c000497000 CR3: 000000000344a006 CR4: 0000000000f72ef0
<4> [519.201006] PKRU: 55555554
<4> [519.201007] Call Trace:
<4> [519.201009] <TASK>
<4> [519.201015] ? __pfx_lock_acquire+0x10/0x10
<4> [519.201022] ? lock_release+0xd0/0x2b0
<4> [519.201030] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [519.201040] process_one_work+0x239/0x740
<4> [519.201469] worker_thread+0x200/0x3f0
<4> [519.201475] ? __pfx_worker_thread+0x10/0x10
<4> [519.201479] kthread+0x10d/0x150
<4> [519.201482] ? __pfx_kthread+0x10/0x10
<4> [519.201486] ret_from_fork+0x3bd/0x470
<4> [519.201489] ? __pfx_kthread+0x10/0x10
<4> [519.201493] ret_from_fork_asm+0x1a/0x30
<4> [519.201506] </TASK>
<4> [519.201507] irq event stamp: 4075
<4> [519.201509] hardirqs last enabled at (4081): [<ffffffff814ab1b9>] __up_console_sem+0x79/0xa0
<4> [519.201512] hardirqs last disabled at (4086): [<ffffffff814ab19e>] __up_console_sem+0x5e/0xa0
<4> [519.201514] softirqs last enabled at (2650): [<ffffffff813d228b>] __irq_exit_rcu+0xdb/0x1c0
<4> [519.201518] softirqs last disabled at (2645): [<ffffffff813d228b>] __irq_exit_rcu+0xdb/0x1c0
<4> [519.201521] ---[ end trace 0000000000000000 ]---
<6> [519.201524] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<4> [519.201660] ------------[ cut here ]------------
<4> [519.201662] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [519.201664] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1626 at guc_exec_queue_timedout_job+0x141a/0x2400 [xe], CPU#14: kworker/u64:1/9945
<4> [519.201783] Modules linked in: xe_vfio_pci vfio_pci_core vfio_iommu_type1 vfio iommufd snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_gsc_proxy mei_lb mei_gsc mtd_intel_dg xe drm_gpuvm drm_gpusvm_helper drm_buddy gpu_sched drm_ttm_helper ttm drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp hid_generic cmdlinepart coretemp mei_pxp mei_hdcp spi_nor eeepc_wmi mtd asus_wmi sparse_keymap wmi_bmof binfmt_misc kvm_intel usbhid hid kvm snd_intel_dspcfg irqbypass aesni_intel snd_hda_codec gf128mul snd_hda_core r8169 snd_hwdep rapl intel_cstate snd_pcm video realtek snd_timer phy_package mei_me snd idma64 spi_intel_pci i2c_i801 spi_intel i2c_mux i2c_smbus soundcore mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry acpi_tad intel_vsec acpi_pad pinctrl_alderlake wmi dm_multipath msr nvme_fabrics
<4> [519.201873] fuse efi_pstore nfnetlink autofs4 [last unloaded: snd_hda_intel]
<4> [519.201881] CPU: 14 UID: 0 PID: 9945 Comm: kworker/u64:1 Tainted: G S U W 7.1.0-rc1-lgci-xe-xe-4950-9ee30ac229d686465-debug+ #1 PREEMPT(lazy)
<4> [519.201885] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [519.201886] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [519.201888] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [519.201896] RIP: 0010:guc_exec_queue_timedout_job+0x1423/0x2400 [xe]
<4> [519.202014] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 40 d1 3c e1 48 89 c6 48 8d 3d d6 92 d9 ff 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 d0 ee ff ff 8b 70 08 49
<4> [519.202016] RSP: 0018:ffffc90002f77ca0 EFLAGS: 00010246
<4> [519.202019] RAX: ffffffffa12bb978 RBX: 0000000000000000 RCX: 0000000000000000
<4> [519.202021] RDX: ffff888104347990 RSI: ffffffffa12bb978 RDI: ffffffffa0c03f10
<4> [519.202023] RBP: ffffc90002f77db0 R08: 0000000000000000 R09: 0000000000000000
<4> [519.202024] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [519.202026] R13: ffff888104347990 R14: ffff888118eda018 R15: 00000000ffffffc2
<4> [519.202028] FS: 0000000000000000(0000) GS:ffff8888db383000(0000) knlGS:0000000000000000
<4> [519.202030] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [519.202032] CR2: 000000c000497000 CR3: 000000000344a006 CR4: 0000000000f72ef0
<4> [519.202034] PKRU: 55555554
<4> [519.202035] Call Trace:
<4> [519.202037] <TASK>
<4> [519.202043] ? __pfx_lock_acquire+0x10/0x10
<4> [519.202058] ? lock_release+0xd0/0x2b0
<4> [519.202066] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [519.202076] process_one_work+0x239/0x740
<4> [519.202087] worker_thread+0x200/0x3f0
<4> [519.202091] ? __pfx_worker_thread+0x10/0x10
<4> [519.202095] kthread+0x10d/0x150
<4> [519.202098] ? __pfx_kthread+0x10/0x10
<4> [519.202102] ret_from_fork+0x3bd/0x470
<4> [519.202105] ? __pfx_kthread+0x10/0x10
<4> [519.202110] ret_from_fork_asm+0x1a/0x30
<4> [519.202122] </TASK>
<4> [519.202124] irq event stamp: 4939
<4> [519.202126] hardirqs last enabled at (4945): [<ffffffff814ab1b9>] __up_console_sem+0x79/0xa0
<4> [519.202129] hardirqs last disabled at (4950): [<ffffffff814ab19e>] __up_console_sem+0x5e/0xa0
<4> [519.202131] softirqs last enabled at (4872): [<ffffffff813d228b>] __irq_exit_rcu+0xdb/0x1c0
<4> [519.202135] softirqs last disabled at (4867): [<ffffffff813d228b>] __irq_exit_rcu+0xdb/0x1c0
<4> [519.202139] ---[ end trace 0000000000000000 ]---
<6> [519.208693] Console: switching to colour frame buffer device 240x67
<7> [519.226985] xe 0000:03:00.0: [drm:drm_client_dev_restore] fbdev: ret=0
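
The second `load failed` line in the dmesg above is a field-by-field breakdown of the raw `status = 0x400000A0` word printed on the line before it. The sketch below reproduces the printed fields, assuming this bit layout: Reset in bit 0, BootROM in bits 7:1, UKernel in bits 15:8, MIA in bits 18:16, Auth in bits 31:30. The layout was chosen to match the log output and should be checked against the driver's register definitions; `decode_guc_status` is a hypothetical helper name.

```python
def decode_guc_status(status):
    """Split a GuC status word into the fields printed by the driver.

    ASSUMPTION: the shifts and masks below are inferred from the log output,
    not copied from the xe driver source.
    """
    return {
        "Reset": status & 0x1,
        "BootROM": (status >> 1) & 0x7F,
        "UKernel": (status >> 8) & 0xFF,
        "MIA": (status >> 16) & 0x07,
        "Auth": (status >> 30) & 0x03,
    }


fields = decode_guc_status(0x400000A0)
print(", ".join(f"{name} = 0x{value:02x}" for name, value in fields.items()))
```

With these masks the status word yields BootROM = 0x50 and Auth = 0x01 with the other fields zero, the same values the driver logs just before `firmware signature verification failed`.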
Created at 2026-04-30 01:33:32