Results for igt@xe_exec_system_allocator@many-large-malloc-prefetch

Result: Abort 47 Warning(s)

i915_display_info24 igt_runner24 results24.json results24-xe-load.json guc_logs24.tar i915_display_info_post_exec24 boot24 dmesg24

DetailValue
Duration unknown
Hostname
shard-bmg-2
Igt-Version
IGT-Version: 2.4-gb52b42b0c (x86_64) (Linux: 7.0.0-rc7-lgci-xe-xe-pw-164701v1-debug+ x86_64)
Out
Using IGT_SRANDOM=1775877972 for randomisation
Opened device: /dev/dri/card0
Starting subtest: many-large-malloc-prefetch
runner: This test was killed due to a kernel taint (0x244).

This test caused an abort condition: Kernel badly tainted (0x244, 0x200) (check dmesg for details):
	TAINT_WARN: WARN_ON has happened.
Err
Starting subtest: many-large-malloc-prefetch
Received signal SIGQUIT.
Stack trace: 
 #0 [fatal_sig_handler+0x17b]
 #1 [__sigaction+0x50]
 #2 [ioctl+0x3d]
 #3 [drmIoctl+0x30]
 #4 [___xe_vm_bind+0xf7]
 #5 [__xe_vm_bind+0x35]
 #6 [__xe_vm_bind_assert+0x32]
 #7 [xe_vm_prefetch_async+0x2b]
 #8 [test_exec+0x111b]
 #9 [__igt_unique____real_main2349+0x2be0]
 #10 [main+0x2d]
 #11 [__libc_init_first+0x8a]
 #12 [__libc_start_main+0x8b]
 #13 [_start+0x25]
Dmesg

<6> [556.778097] Console: switching to colour dummy device 80x25
<6> [556.778447] [IGT] xe_exec_system_allocator: executing
<7> [558.660500] xe 0000:03:00.0: [drm:intel_power_well_enable [xe]] enabling AUX_TC2
<7> [558.764408] xe 0000:03:00.0: [drm:intel_power_well_disable [xe]] disabling AUX_TC2
<3> [559.045785] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=69786 recv=69785
<3> [561.348938] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=69786 recv=69785
<3> [563.653865] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=69787 recv=69785
<3> [565.956390] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=69787 recv=69785
<3> [568.260418] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=69788 recv=69785
<3> [570.564325] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=69788 recv=69785
<6> [570.574981] [IGT] xe_exec_system_allocator: starting subtest many-large-malloc-prefetch
<7> [570.576113] xe 0000:03:00.0: [drm:drm_pagemap_dev_unhold_work [drm_gpusvm_helper]] Releasing reference on provider device and module.
<6> [570.688219] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [570.688245] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [570.688255] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [570.688265] nvme 0000:05:00.0: [ 0] RxErr (First)
<7> [570.905812] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x2c2c2829
<7> [570.905957] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x2c2c2b2c
<3> [572.868271] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=69789 recv=69785
<3> [575.172351] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=69789 recv=69785
<3> [577.476268] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=69790 recv=69785
<3> [579.780280] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=69790 recv=69785
<3> [582.084228] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=69791 recv=69785
<3> [584.388171] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=69791 recv=69785
<7> [585.889774] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x2c2d2a2a
<7> [585.889927] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x2d2c2c2d
<3> [586.692218] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=69792 recv=69785
<3> [588.996207] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=69792 recv=69785
<3> [591.300141] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=69793 recv=69785
<3> [593.604114] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=69793 recv=69785
<6> [593.725087] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [593.725113] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [593.725123] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [593.725133] nvme 0000:05:00.0: [ 0] RxErr (First)
<3> [595.908082] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=69794 recv=69785
<3> [598.212184] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=69794 recv=69785
<7> [598.224083] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 512; Own pages: 0.
<7> [600.863126] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x2d2d2a2b
<7> [600.863269] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x2d2d2d2e
<4> [603.268946] xe 0000:03:00.0: [drm] Tile0: GT0: Check job timeout: seqno=80812, lrc_seqno=80812, guc_id=0, not started
<4> [608.388075] xe 0000:03:00.0: [drm] Tile0: GT0: Check job timeout: seqno=80812, lrc_seqno=80812, guc_id=0, not started
<4> [613.507974] xe 0000:03:00.0: [drm] Tile0: GT0: Check job timeout: seqno=80812, lrc_seqno=80812, guc_id=0, not started
<7> [615.864390] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x2e2d2a2b
<7> [615.864531] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x2e2d2d2e
<4> [618.627917] xe 0000:03:00.0: [drm] Tile0: GT0: Schedule disable failed to respond, guc_id=0
<7> [618.627945] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<6> [618.628385] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [618.628901] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<6> [618.628993] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<7> [618.629203] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [618.629922] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [618.630422] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [618.630951] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x00800000
<7> [618.631405] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x11111440
<7> [618.631886] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [618.632319] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [618.632770] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [618.633212] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00000100
<7> [618.633648] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [618.634210] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [618.634364] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7> [618.635366] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<3> [618.646165] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: load failed: status = 0x400000A0, time = 10ms, freq = 2150MHz (req 2133MHz)
<3> [618.658554] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: load failed: status: Reset = 0, BootROM = 0x50, UKernel = 0x00, MIA = 0x00, Auth = 0x01
<3> [618.671337] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: firmware signature verification failed
<3> [618.679886] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: reset failed (-EPROTO)
<3> [618.687287] xe 0000:03:00.0: [drm] *ERROR* CRITICAL: Xe has declared device 0000:03:00.0 as wedged.
IOCTLs and executions are blocked.
For recovery procedure, refer to https://docs.kernel.org/gpu/drm-uapi.html#device-wedging
Please file a _new_ bug report at https://gitlab.freedesktop.org/drm/xe/kernel/issues/new
<7> [618.719145] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [618.719294] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT1: GuC CT communication channel stopped
<3> [618.779162] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT1: GuC mmio request 0x5507: no reply 0x5507
<6> [618.787998] xe 0000:03:00.0: [drm] device wedged, needs recovery
<4> [618.789575] ------------[ cut here ]------------
<4> [618.789585] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [618.789594] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1626 at guc_exec_queue_timedout_job+0x1424/0x2400 [xe], CPU#5: kworker/u64:26/3793
<4> [618.790067] Modules linked in: vfio_pci_core vfio_iommu_type1 vfio iommufd xe snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp cmdlinepart hid_generic spi_nor asus_nb_wmi coretemp asus_wmi mtd sparse_keymap mei_hdcp mei_pxp platform_profile wmi_bmof binfmt_misc kvm_intel kvm irqbypass ghash_clmulni_intel aesni_intel snd_intel_dspcfg r8169 snd_hda_codec rapl snd_hda_core usbhid intel_cstate snd_hwdep hid realtek snd_pcm video snd_timer i2c_i801 snd i2c_mux spi_intel_pci idma64 spi_intel soundcore i2c_smbus intel_pmc_core nls_iso8859_1 pmt_telemetry pmt_discovery mei_me pmt_class mei intel_pmc_ssram_telemetry pinctrl_alderlake wmi acpi_pad acpi_tad intel_vsec dm_multipath
<4> [618.790417] msr nvme_fabrics fuse efi_pstore nfnetlink autofs4 [last unloaded: xe_vfio_pci]
<4> [618.790453] CPU: 5 UID: 0 PID: 3793 Comm: kworker/u64:26 Tainted: G S U 7.0.0-rc7-lgci-xe-xe-pw-164701v1-debug+ #1 PREEMPT(lazy)
<4> [618.790468] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER
<4> [618.790474] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [618.790481] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [618.790509] RIP: 0010:guc_exec_queue_timedout_job+0x142d/0x2400 [xe]
<4> [618.790919] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 06 d6 5e e1 48 89 c6 48 8d 3d 3c 96 39 00 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 c6 ee ff ff 8b 70 08 49
<4> [618.790927] RSP: 0018:ffffc9000d9dfca0 EFLAGS: 00010246
<4> [618.790937] RAX: ffffffffa1201f7b RBX: 0000000000000000 RCX: 0000000000000000
<4> [618.790943] RDX: ffff888103fd5110 RSI: ffffffffa1201f7b RDI: ffffffffa1003ee0
<4> [618.790948] RBP: ffffc9000d9dfdb0 R08: 0000000000000000 R09: 0000000000000000
<4> [618.790953] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [618.790958] R13: ffff888103fd5110 R14: ffff8881dbe20818 R15: 00000000ffffffc2
<4> [618.790963] FS: 0000000000000000(0000) GS:ffff8888daf17000(0000) knlGS:0000000000000000
<4> [618.790970] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [618.790975] CR2: 000063dcdb6f4968 CR3: 000000000344c005 CR4: 0000000000f72ef0
<4> [618.790982] PKRU: 55555554
<4> [618.790987] Call Trace:
<4> [618.790992] <TASK>
<4> [618.791009] ? lock_sync+0x100/0x100
<4> [618.791029] ? lock_release+0xd0/0x2b0
<4> [618.791053] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [618.791077] process_one_work+0x239/0x760
<4> [618.791104] worker_thread+0x200/0x3f0
<4> [618.791115] ? __pfx_worker_thread+0x10/0x10
<4> [618.791125] kthread+0x10d/0x150
<4> [618.791137] ? __pfx_kthread+0x10/0x10
<4> [618.791152] ret_from_fork+0x3d4/0x480
<4> [618.791162] ? __pfx_kthread+0x10/0x10
<4> [618.791176] ret_from_fork_asm+0x1a/0x30
<4> [618.791207] </TASK>
<4> [618.791212] irq event stamp: 1058965
<4> [618.791218] hardirqs last enabled at (1058971): [<ffffffff814aacd9>] __up_console_sem+0x79/0xa0
<4> [618.791231] hardirqs last disabled at (1058976): [<ffffffff814aacbe>] __up_console_sem+0x5e/0xa0
<4> [618.791241] softirqs last enabled at (1058196): [<ffffffff813d1e9f>] __irq_exit_rcu+0x13f/0x160
<4> [618.791252] softirqs last disabled at (1058191): [<ffffffff813d1e9f>] __irq_exit_rcu+0x13f/0x160
<4> [618.791263] ---[ end trace 0000000000000000 ]---
<6> [618.952223] Console: switching to colour frame buffer device 240x67
<4> [618.953837] ------------[ cut here ]------------
<4> [618.953840] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [618.953842] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1626 at guc_exec_queue_timedout_job+0x1424/0x2400 [xe], CPU#2: kworker/u64:3/202
<7> [618.953853] xe 0000:03:00.0: [drm:xe_svm_garbage_collector [xe]] Skipping madvise reset for vma.
<4> [618.953923] Modules linked in: vfio_pci_core vfio_iommu_type1 vfio iommufd xe snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp cmdlinepart hid_generic spi_nor asus_nb_wmi coretemp asus_wmi mtd sparse_keymap mei_hdcp mei_pxp platform_profile wmi_bmof binfmt_misc kvm_intel kvm irqbypass ghash_clmulni_intel aesni_intel snd_intel_dspcfg r8169 snd_hda_codec rapl snd_hda_core usbhid intel_cstate snd_hwdep hid realtek snd_pcm video snd_timer i2c_i801 snd i2c_mux spi_intel_pci idma64 spi_intel soundcore i2c_smbus intel_pmc_core nls_iso8859_1 pmt_telemetry pmt_discovery mei_me pmt_class mei intel_pmc_ssram_telemetry pinctrl_alderlake wmi acpi_pad acpi_tad intel_vsec dm_multipath
<4> [618.953991] msr nvme_fabrics fuse efi_pstore nfnetlink autofs4 [last unloaded: xe_vfio_pci]
<4> [618.953999] CPU: 2 UID: 0 PID: 202 Comm: kworker/u64:3 Tainted: G S U W 7.0.0-rc7-lgci-xe-xe-pw-164701v1-debug+ #1 PREEMPT(lazy)
<4> [618.954002] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [618.954003] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [618.954004] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [618.954010] RIP: 0010:guc_exec_queue_timedout_job+0x142d/0x2400 [xe]
<4> [618.954075] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 06 d6 5e e1 48 89 c6 48 8d 3d 3c 96 39 00 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 c6 ee ff ff 8b 70 08 49
<4> [618.954077] RSP: 0018:ffffc9000160fca0 EFLAGS: 00010246
<4> [618.954079] RAX: ffffffffa1201f7b RBX: 0000000000000000 RCX: 0000000000000000
<4> [618.954081] RDX: ffff888103fd5110 RSI: ffffffffa1201f7b RDI: ffffffffa1003ee0
<4> [618.954082] RBP: ffffc9000160fdb0 R08: 0000000000000000 R09: 0000000000000000
<4> [618.954083] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [618.954084] R13: ffff888103fd5110 R14: ffff8881dbe20818 R15: 00000000ffffffc2
<4> [618.954085] FS: 0000000000000000(0000) GS:ffff8888dad97000(0000) knlGS:0000000000000000
<4> [618.954087] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [618.954088] CR2: 00007d30e4605008 CR3: 000000000344c002 CR4: 0000000000f72ef0
<4> [618.954089] PKRU: 55555554
<4> [618.954090] Call Trace:
<4> [618.954091] <TASK>
<4> [618.954097] ? lock_sync+0x100/0x100
<4> [618.954121] ? lock_release+0xd0/0x2b0
<4> [618.954126] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [618.954131] process_one_work+0x239/0x760
<4> [618.954137] worker_thread+0x200/0x3f0
<4> [618.954140] ? __pfx_worker_thread+0x10/0x10
<4> [618.954142] kthread+0x10d/0x150
<4> [618.954145] ? __pfx_kthread+0x10/0x10
<4> [618.954148] ret_from_fork+0x3d4/0x480
<4> [618.954150] ? __pfx_kthread+0x10/0x10
<4> [618.954153] ret_from_fork_asm+0x1a/0x30
<4> [618.954160] </TASK>
<4> [618.954161] irq event stamp: 1576001
<4> [618.954162] hardirqs last enabled at (1576007): [<ffffffff814aacd9>] __up_console_sem+0x79/0xa0
<4> [618.954165] hardirqs last disabled at (1576012): [<ffffffff814aacbe>] __up_console_sem+0x5e/0xa0
<4> [618.954167] softirqs last enabled at (1574894): [<ffffffff82616d8e>] neigh_periodic_work+0x27e/0x360
<4> [618.954171] softirqs last disabled at (1574890): [<ffffffff82616b48>] neigh_periodic_work+0x38/0x360
<4> [618.954173] ---[ end trace 0000000000000000 ]---
<6> [618.954175] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<4> [618.954257] ------------[ cut here ]------------
<4> [618.954258] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [618.954259] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1626 at guc_exec_queue_timedout_job+0x1424/0x2400 [xe], CPU#2: kworker/u64:3/202
<4> [618.954322] Modules linked in: vfio_pci_core vfio_iommu_type1 vfio iommufd xe snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp cmdlinepart hid_generic spi_nor asus_nb_wmi coretemp asus_wmi mtd sparse_keymap mei_hdcp mei_pxp platform_profile wmi_bmof binfmt_misc kvm_intel kvm irqbypass ghash_clmulni_intel aesni_intel snd_intel_dspcfg r8169 snd_hda_codec rapl snd_hda_core usbhid intel_cstate snd_hwdep hid realtek snd_pcm video snd_timer i2c_i801 snd i2c_mux spi_intel_pci idma64 spi_intel soundcore i2c_smbus intel_pmc_core nls_iso8859_1 pmt_telemetry pmt_discovery mei_me
<7> [618.954321] xe 0000:03:00.0: [drm:xe_svm_garbage_collector [xe]] Skipping madvise reset for vma.
<4> [618.954375] pmt_class mei intel_pmc_ssram_telemetry pinctrl_alderlake wmi acpi_pad acpi_tad intel_vsec dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink autofs4 [last unloaded: xe_vfio_pci]
<4> [618.954387] CPU: 2 UID: 0 PID: 202 Comm: kworker/u64:3 Tainted: G S U W 7.0.0-rc7-lgci-xe-xe-pw-164701v1-debug+ #1 PREEMPT(lazy)
<4> [618.954389] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [618.954390] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [618.954391] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [618.954401] RIP: 0010:guc_exec_queue_timedout_job+0x142d/0x2400 [xe]
<4> [618.954472] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 06 d6 5e e1 48 89 c6 48 8d 3d 3c 96 39 00 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 c6 ee ff ff 8b 70 08 49
<4> [618.954474] RSP: 0018:ffffc9000160fca0 EFLAGS: 00010246
<4> [618.954476] RAX: ffffffffa1201f7b RBX: 0000000000000000 RCX: 0000000000000000
<4> [618.954477] RDX: ffff888103fd5110 RSI: ffffffffa1201f7b RDI: ffffffffa1003ee0
<4> [618.954479] RBP: ffffc9000160fdb0 R08: 0000000000000000 R09: 0000000000000000
<4> [618.954480] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [618.954481] R13: ffff888103fd5110 R14: ffff8881dbe20818 R15: 00000000ffffffc2
<4> [618.954482] FS: 0000000000000000(0000) GS:ffff8888dad97000(0000) knlGS:0000000000000000
<4> [618.954484] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [618.954485] CR2: 00007d30e4605008 CR3: 000000000344c002 CR4: 0000000000f72ef0
<4> [618.954486] PKRU: 55555554
<4> [618.954488] Call Trace:
<4> [618.954489] <TASK>
<4> [618.954492] ? lock_sync+0x100/0x100
<4> [618.954496] ? lock_release+0xd0/0x2b0
<4> [618.954502] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [618.954507] process_one_work+0x239/0x760
<4> [618.954513] worker_thread+0x200/0x3f0
<4> [618.954516] ? __pfx_worker_thread+0x10/0x10
<4> [618.954519] kthread+0x10d/0x150
<4> [618.954521] ? __pfx_kthread+0x10/0x10
<4> [618.954525] ret_from_fork+0x3d4/0x480
<4> [618.954527] ? __pfx_kthread+0x10/0x10
<4> [618.954530] ret_from_fork_asm+0x1a/0x30
<4> [618.954537] </TASK>
<4> [618.954538] irq event stamp: 1576863
<4> [618.954540] hardirqs last enabled at (1576869): [<ffffffff814aacd9>] __up_console_sem+0x79/0xa0
<4> [618.954542] hardirqs last disabled at (1576874): [<ffffffff814aacbe>] __up_console_sem+0x5e/0xa0
<4> [618.954544] softirqs last enabled at (1574894): [<ffffffff82616d8e>] neigh_periodic_work+0x27e/0x360
<4> [618.954547] softirqs last disabled at (1574890): [<ffffffff82616b48>] neigh_periodic_work+0x38/0x360
<4> [618.954549] ---[ end trace 0000000000000000 ]---
<6> [618.954551] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<4> [618.954626] ------------[ cut here ]------------
<4> [618.954630] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [618.954632] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1626 at guc_exec_queue_timedout_job+0x1424/0x2400 [xe], CPU#5: kworker/u64:26/3793
<4> [618.954740] Modules linked in: vfio_pci_core vfio_iommu_type1 vfio iommufd xe snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp cmdlinepart hid_generic spi_nor asus_nb_wmi coretemp asus_wmi mtd sparse_keymap mei_hdcp mei_pxp platform_profile wmi_bmof binfmt_misc kvm_intel kvm irqbypass ghash_clmulni_intel aesni_intel snd_intel_dspcfg r8169 snd_hda_codec rapl snd_hda_core usbhid intel_cstate snd_hwdep hid realtek snd_pcm video snd_timer i2c_i801 snd i2c_mux spi_intel_pci idma64 spi_intel soundcore i2c_smbus intel_pmc_core nls_iso8859_1 pmt_telemetry pmt_discovery mei_me pmt_class mei intel_pmc_ssram_telemetry pinctrl_alderlake wmi acpi_pad acpi_tad intel_vsec dm_multipath
<4> [618.954816] msr nvme_fabrics fuse efi_pstore nfnetlink autofs4 [last unloaded: xe_vfio_pci]
<4> [618.954823] CPU: 5 UID: 0 PID: 3793 Comm: kworker/u64:26 Tainted: G S U W 7.0.0-rc7-lgci-xe-xe-pw-164701v1-debug+ #1 PREEMPT(lazy)
<4> [618.954827] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [618.954828] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [618.954829] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [618.954834] RIP: 0010:guc_exec_queue_timedout_job+0x142d/0x2400 [xe]
<4> [618.954915] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 06 d6 5e e1 48 89 c6 48 8d 3d 3c 96 39 00 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 c6 ee ff ff 8b 70 08 49
<4> [618.954917] RSP: 0018:ffffc9000d9dfca0 EFLAGS: 00010246
<4> [618.954919] RAX: ffffffffa1201f7b RBX: 0000000000000000 RCX: 0000000000000000
<4> [618.954921] RDX: ffff888103fd5110 RSI: ffffffffa1201f7b RDI: ffffffffa1003ee0
<4> [618.954922] RBP: ffffc9000d9dfdb0 R08: 0000000000000000 R09: 0000000000000000
<4> [618.954923] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [618.954924] R13: ffff888103fd5110 R14: ffff8881dbe20818 R15: 00000000ffffffc2
<4> [618.954926] FS: 0000000000000000(0000) GS:ffff8888daf17000(0000) knlGS:0000000000000000
<4> [618.954927] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [618.954929] CR2: 000063dcdb6f4968 CR3: 000000000344c005 CR4: 0000000000f72ef0
<4> [618.954930] PKRU: 55555554
<4> [618.954931] Call Trace:
<4> [618.954932] <TASK>
<4> [618.954936] ? lock_sync+0x100/0x100
<4> [618.954941] ? lock_release+0xd0/0x2b0
<4> [618.954946] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [618.954952] process_one_work+0x239/0x760
<4> [618.954958] worker_thread+0x200/0x3f0
<4> [618.954961] ? __pfx_worker_thread+0x10/0x10
<4> [618.954963] kthread+0x10d/0x150
<4> [618.954966] ? __pfx_kthread+0x10/0x10
<4> [618.954970] ret_from_fork+0x3d4/0x480
<4> [618.954972] ? __pfx_kthread+0x10/0x10
<4> [618.954975] ret_from_fork_asm+0x1a/0x30
<4> [618.954983] </TASK>
<4> [618.954984] irq event stamp: 1059939
<4> [618.954985] hardirqs last enabled at (1059945): [<ffffffff814aacd9>] __up_console_sem+0x79/0xa0
<4> [618.954988] hardirqs last disabled at (1059950): [<ffffffff814aacbe>] __up_console_sem+0x5e/0xa0
<4> [618.954990] softirqs last enabled at (1059172): [<ffffffff813d1e9f>] __irq_exit_rcu+0x13f/0x160
<4> [618.954993] softirqs last disabled at (1059165): [<ffffffff813d1e9f>] __irq_exit_rcu+0x13f/0x160
<4> [618.954995] ---[ end trace 0000000000000000 ]---
<7> [618.956692] xe 0000:03:00.0: [drm:drm_pagemap_dev_unhold_work [drm_gpusvm_helper]] Releasing reference on provider device and module.
<7> [618.963340] xe 0000:03:00.0: [drm:drm_client_dev_restore] fbdev: ret=0
Created at 2026-04-11 03:30:45