Result: 67 Warning(s)
i915_display_info4 igt_runner4 results4.json results4-xe-load.json guc_logs4.tar i915_display_info_post_exec4 boot4 dmesg4
| Detail | Value |
|---|---|
| Duration | unknown |
| Hostname | shard-bmg-2 |
| Igt-Version | IGT-Version: 2.4-g0a8f2f8f5 (x86_64) (Linux: 7.0.0-rc6-lgci-xe-xe-pw-164323v2-debug+ x86_64) |
| Out | Using IGT_SRANDOM=1775523048 for randomisation Opened device: /dev/dri/card0 Starting subtest: many-64k-free Stack trace: #0 ../lib/igt_core.c:2075 __igt_fail_assert() #1 [xe_wait_ufence+0x57] #2 ../tests/intel/xe_exec_system_allocator.c:1757 test_exec() #3 ../tests/intel/xe_exec_system_allocator.c:2577 __igt_unique____real_main2349() #4 ../tests/intel/xe_exec_system_allocator.c:2349 main() #5 [__libc_init_first+0x8a] #6 [__libc_start_main+0x8b] #7 [_start+0x25] Subtest many-64k-free: FAIL (16.552s) runner: This test was killed due to a kernel taint (0x244). This test caused an abort condition: Kernel badly tainted (0x244, 0x200) (check dmesg for details): TAINT_WARN: WARN_ON has happened. |
| Err | Starting subtest: many-64k-free (xe_exec_system_allocator:3937) xe/xe_ioctl-CRITICAL: Test assertion failure function xe_wait_ufence, file ../lib/xe/xe_ioctl.c:763: (xe_exec_system_allocator:3937) xe/xe_ioctl-CRITICAL: Failed assertion: __xe_wait_ufence(fd, addr, value, exec_queue, &timeout) == 0 (xe_exec_system_allocator:3937) xe/xe_ioctl-CRITICAL: Last errno: 62, Timer expired (xe_exec_system_allocator:3937) xe/xe_ioctl-CRITICAL: error: -62 != 0 Subtest many-64k-free failed. **** DEBUG **** (xe_exec_system_allocator:3937) xe/xe_ioctl-CRITICAL: Test assertion failure function xe_wait_ufence, file ../lib/xe/xe_ioctl.c:763: (xe_exec_system_allocator:3937) xe/xe_ioctl-CRITICAL: Failed assertion: __xe_wait_ufence(fd, addr, value, exec_queue, &timeout) == 0 (xe_exec_system_allocator:3937) xe/xe_ioctl-CRITICAL: Last errno: 62, Timer expired (xe_exec_system_allocator:3937) xe/xe_ioctl-CRITICAL: error: -62 != 0 (xe_exec_system_allocator:3937) igt_core-INFO: Stack trace: (xe_exec_system_allocator:3937) igt_core-INFO: #0 ../lib/igt_core.c:2075 __igt_fail_assert() (xe_exec_system_allocator:3937) igt_core-INFO: #1 [xe_wait_ufence+0x57] (xe_exec_system_allocator:3937) igt_core-INFO: #2 ../tests/intel/xe_exec_system_allocator.c:1757 test_exec() (xe_exec_system_allocator:3937) igt_core-INFO: #3 ../tests/intel/xe_exec_system_allocator.c:2577 __igt_unique____real_main2349() (xe_exec_system_allocator:3937) igt_core-INFO: #4 ../tests/intel/xe_exec_system_allocator.c:2349 main() (xe_exec_system_allocator:3937) igt_core-INFO: #5 [__libc_init_first+0x8a] (xe_exec_system_allocator:3937) igt_core-INFO: #6 [__libc_start_main+0x8b] (xe_exec_system_allocator:3937) igt_core-INFO: #7 [_start+0x25] **** END **** Subtest many-64k-free: FAIL (16.552s) |
| Dmesg |
<6> [92.457048] Console: switching to colour dummy device 80x25
<6> [92.457244] [IGT] xe_exec_system_allocator: executing
<6> [92.467918] [IGT] xe_exec_system_allocator: starting subtest many-64k-free
<7> [92.468234] xe 0000:03:00.0: [drm:drm_pagemap_dev_unhold_work [drm_gpusvm_helper]] Releasing reference on provider device and module.
<7> [92.471742] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.473663] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.475296] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.476304] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.477292] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.478290] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.479261] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.480244] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.481192] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.482182] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.483143] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.484097] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.485583] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.486554] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.487492] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.488900] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.489906] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.490987] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.492051] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.493156] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.494247] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.495343] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.496478] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.497638] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.498766] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.499972] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.501135] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.502257] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.503384] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.504476] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.505465] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.506402] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.507204] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.508012] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.508935] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.509825] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.510752] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.511564] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.512347] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.513093] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.513815] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.514531] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.515437] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.516155] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.516871] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.517617] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.518337] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.519069] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.519843] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.520688] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.521427] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.522233] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.523007] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.523750] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.524508] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.525269] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.526123] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.526918] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.527721] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.528489] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.529239] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.529998] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.530734] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.531578] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.532335] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.533071] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.533878] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.534629] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.535391] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.536120] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.536940] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.537672] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.538449] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.539199] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.539948] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.540811] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.541757] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.542550] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.543389] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.544140] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.544862] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.545566] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.546322] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.547037] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.547852] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.548561] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.549309] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.550071] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.550851] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.551601] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.552359] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.553251] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.554084] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.554884] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [92.809892] xe 0000:03:00.0: [drm:intel_power_well_enable [xe]] enabling AUX_TC2
<7> [92.913902] xe 0000:03:00.0: [drm:intel_power_well_disable [xe]] disabling AUX_TC2
<7> [97.738734] xe 0000:03:00.0: [drm:xe_hw_engine_snapshot_capture [xe]] Tile0: GT0: Proceeding with manual engine snapshot
<4> [97.739247] ------------[ cut here ]------------
<4> [97.739259] xe 0000:03:00.0: [drm] Tile0: GT0: Unexpected engine class:instance 3:8 for context utilization
<4> [97.739273] WARNING: drivers/gpu/drm/xe/xe_lrc.c:2580 at xe_lrc_timestamp+0x196/0x460 [xe], CPU#1: kworker/u64:25/2336
<4> [97.739670] Modules linked in: xe snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mei_gsc mtd_intel_dg drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp cmdlinepart hid_generic spi_nor coretemp eeepc_wmi mei_pxp mei_hdcp mtd asus_wmi sparse_keymap platform_profile wmi_bmof binfmt_misc kvm_intel usbhid hid kvm irqbypass r8169 ghash_clmulni_intel aesni_intel snd_intel_dspcfg snd_hda_codec rapl intel_cstate snd_hda_core video snd_hwdep realtek snd_pcm snd_timer i2c_i801 spi_intel_pci mei_me snd i2c_mux idma64 spi_intel soundcore i2c_smbus mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry wmi intel_vsec pinctrl_alderlake acpi_tad acpi_pad dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink
<4> [97.740015] autofs4 [last unloaded: xe]
<4> [97.740036] CPU: 1 UID: 0 PID: 2336 Comm: kworker/u64:25 Tainted: G S U 7.0.0-rc6-lgci-xe-xe-pw-164323v2-debug+ #1 PREEMPT(lazy)
<4> [97.740050] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER
<4> [97.740056] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [97.740064] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [97.740090] RIP: 0010:xe_lrc_timestamp+0x1a7/0x460 [xe]
<4> [97.740532] Code: b6 79 26 48 89 55 c8 88 45 d0 e8 24 b5 5c e1 48 89 c6 48 8d 3d ea ba 37 00 0f b6 4d d0 41 54 45 89 e9 45 0f b6 c7 48 8b 55 c8 <67> 48 0f b9 3a 58 eb 91 41 0f b6 bd 99 0f 00 00 e8 14 e0 03 00 41
<4> [97.740541] RSP: 0018:ffffc90003fafc00 EFLAGS: 00010246
<4> [97.740553] RAX: ffffffffa1202f81 RBX: 0000000000000000 RCX: 0000000000000000
<4> [97.740560] RDX: ffff888102525010 RSI: ffffffffa1202f81 RDI: ffffffffa1004f50
<4> [97.740567] RBP: ffffc90003fafc90 R08: 0000000000000000 R09: 0000000000000003
<4> [97.740573] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000008
<4> [97.740578] R13: 0000000000000003 R14: ffff888156f88980 R15: 0000000000000000
<4> [97.740585] FS: 0000000000000000(0000) GS:ffff8888dad17000(0000) knlGS:0000000000000000
<4> [97.740593] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [97.740600] CR2: 00007f3be7de61b4 CR3: 000000000344c005 CR4: 0000000000f72ef0
<4> [97.740608] PKRU: 55555554
<4> [97.740613] Call Trace:
<4> [97.740619] <TASK>
<4> [97.740640] ? xe_lrc_start_seqno+0x33/0x70 [xe]
<4> [97.741183] guc_exec_queue_timedout_job+0xf80/0x2400 [xe]
<4> [97.741659] ? lock_acquire+0x20/0x2f0
<4> [97.741687] ? lock_release+0xd0/0x2b0
<4> [97.741715] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [97.741745] process_one_work+0x239/0x760
<4> [97.741779] worker_thread+0x200/0x3f0
<4> [97.741793] ? __pfx_worker_thread+0x10/0x10
<4> [97.741805] kthread+0x10d/0x150
<4> [97.741819] ? __pfx_kthread+0x10/0x10
<4> [97.741838] ret_from_fork+0x3d4/0x480
<4> [97.741849] ? __pfx_kthread+0x10/0x10
<4> [97.741866] ret_from_fork_asm+0x1a/0x30
<4> [97.741903] </TASK>
<4> [97.741909] irq event stamp: 88447
<4> [97.741915] hardirqs last enabled at (88453): [<ffffffff814a9ca9>] __up_console_sem+0x79/0xa0
<4> [97.741931] hardirqs last disabled at (88458): [<ffffffff814a9c8e>] __up_console_sem+0x5e/0xa0
<4> [97.741942] softirqs last enabled at (87668): [<ffffffff813d0e9f>] __irq_exit_rcu+0x13f/0x160
<4> [97.741954] softirqs last disabled at (87661): [<ffffffff813d0e9f>] __irq_exit_rcu+0x13f/0x160
<4> [97.741966] ---[ end trace 0000000000000000 ]---
<7> [97.741980] xe 0000:03:00.0: [drm:guc_exec_queue_timedout_job [xe]] Tile0: GT0: Check job timeout: seqno=8166, lrc_seqno=8166, guc_id=0, running_time_ms=234440, timeout_ms=5000, diff=0xff85658b
<6> [98.383314] xe 0000:03:00.0: [drm] Tile0: GT0: Engine reset: engine_class=bcs, logical_mask: 0x2, guc_id=0, state=0x209
<5> [98.383879] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=8166, lrc_seqno=8166, guc_id=0, flags=0x73 in no process [-1]
<6> [98.567378] xe 0000:03:00.0: [drm] Xe device coredump has been created
<6> [98.567399] xe 0000:03:00.0: [drm] Check your /sys/class/drm/card0/device/devcoredump/data
<4> [98.567402] ------------[ cut here ]------------
<4> [98.567403] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [98.567405] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1626 at guc_exec_queue_timedout_job+0x1424/0x2400 [xe], CPU#10: kworker/u64:25/2336
<4> [98.567493] Modules linked in: xe snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mei_gsc mtd_intel_dg drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp cmdlinepart hid_generic spi_nor coretemp eeepc_wmi mei_pxp mei_hdcp mtd asus_wmi sparse_keymap platform_profile wmi_bmof binfmt_misc kvm_intel usbhid hid kvm irqbypass r8169 ghash_clmulni_intel aesni_intel snd_intel_dspcfg snd_hda_codec rapl intel_cstate snd_hda_core video snd_hwdep realtek snd_pcm snd_timer i2c_i801 spi_intel_pci mei_me snd i2c_mux idma64 spi_intel soundcore i2c_smbus mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry wmi intel_vsec pinctrl_alderlake acpi_tad acpi_pad dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink
<4> [98.567558] autofs4 [last unloaded: xe]
<4> [98.567562] CPU: 10 UID: 0 PID: 2336 Comm: kworker/u64:25 Tainted: G S U W 7.0.0-rc6-lgci-xe-xe-pw-164323v2-debug+ #1 PREEMPT(lazy)
<4> [98.567565] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [98.567566] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [98.567568] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [98.567573] RIP: 0010:guc_exec_queue_timedout_job+0x142d/0x2400 [xe]
<4> [98.567636] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 46 9f 5e e1 48 89 c6 48 8d 3d ac 94 39 00 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 c6 ee ff ff 8b 70 08 49
<4> [98.567638] RSP: 0018:ffffc90003fafca0 EFLAGS: 00010246
<4> [98.567640] RAX: ffffffffa1202f81 RBX: 0000000000000000 RCX: 0000000000000000
<4> [98.567641] RDX: ffff888102525010 RSI: ffffffffa1202f81 RDI: ffffffffa1003ef0
<4> [98.567642] RBP: ffffc90003fafdb0 R08: 0000000000000000 R09: 0000000000000000
<4> [98.567643] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [98.567644] R13: ffff888102525010 R14: ffff888158138000 R15: 00000000ffffffc2
<4> [98.567646] FS: 0000000000000000(0000) GS:ffff8888db197000(0000) knlGS:0000000000000000
<4> [98.567647] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [98.567648] CR2: 000057b1322f2a78 CR3: 000000000344c001 CR4: 0000000000f72ef0
<4> [98.567650] PKRU: 55555554
<4> [98.567651] Call Trace:
<4> [98.567652] <TASK>
<4> [98.567656] ? lock_acquire+0x20/0x2f0
<4> [98.567662] ? __pfx_autoremove_wake_function+0x10/0x10
<4> [98.567667] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [98.567672] process_one_work+0x239/0x760
<4> [98.567679] worker_thread+0x200/0x3f0
<4> [98.567681] ? __pfx_worker_thread+0x10/0x10
<4> [98.567683] kthread+0x10d/0x150
<4> [98.567686] ? __pfx_kthread+0x10/0x10
<4> [98.567689] ret_from_fork+0x3d4/0x480
<4> [98.567692] ? __pfx_kthread+0x10/0x10
<4> [98.567695] ret_from_fork_asm+0x1a/0x30
<4> [98.567702] </TASK>
<4> [98.567703] irq event stamp: 90607
<4> [98.567704] hardirqs last enabled at (90613): [<ffffffff814a9ca9>] __up_console_sem+0x79/0xa0
<4> [98.567707] hardirqs last disabled at (90618): [<ffffffff814a9c8e>] __up_console_sem+0x5e/0xa0
<4> [98.567709] softirqs last enabled at (89748): [<ffffffff813d0e9f>] __irq_exit_rcu+0x13f/0x160
<4> [98.567711] softirqs last disabled at (89741): [<ffffffff813d0e9f>] __irq_exit_rcu+0x13f/0x160
<4> [98.567713] ---[ end trace 0000000000000000 ]---
<6> [98.567715] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [98.567776] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<6> [98.567842] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<7> [98.567971] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [98.568679] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [98.568805] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [98.568932] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x00800000
<7> [98.569078] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x11111440
<7> [98.569208] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [98.569292] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [98.569368] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [98.569442] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00000100
<7> [98.569510] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [98.569593] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [98.569695] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7> [98.570908] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<7> [98.581686] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 10ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034
<7> [98.581949] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled
<7> [98.583678] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: flag:0x1
<7> [98.583758] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: mocs entries: 16
<7> [98.583832] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[0] 0x4000 0xc
<7> [98.583905] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[1] 0x4004 0x10c
<7> [98.583987] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[2] 0x4008 0x130
<7> [98.584122] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[3] 0x400c 0x13c
<7> [98.584234] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[4] 0x4010 0x100
<7> [98.584340] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[5] 0x4014 0x100
<7> [98.584451] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[6] 0x4018 0x100
<7> [98.584538] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[7] 0x401c 0x100
<7> [98.584609] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[8] 0x4020 0x100
<7> [98.584680] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[9] 0x4024 0x100
<7> [98.584751] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[10] 0x4028 0x100
<7> [98.584820] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[11] 0x402c 0x100
<7> [98.584918] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[12] 0x4030 0x100
<7> [98.585049] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[13] 0x4034 0x100
<7> [98.585161] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[14] 0x4038 0x100
<7> [98.585271] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[15] 0x403c 0x100
<7> [98.585379] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying rcs0 save-restore MMIOs
<7> [98.585473] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2050] = 0x10001000
<7> [98.585549] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20c4] = 0x3f7e0306
<7> [98.585624] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20d4] = 0xc080c080
<7> [98.585698] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d0] = 0x00006210
<7> [98.585770] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d4] = 0x000062a8
<7> [98.585873] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d8] = 0x1000dafc
<7> [98.585988] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24dc] = 0x1000db01
<7> [98.586117] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24e0] = 0x0000db1c
<7> [98.586238] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe194] = 0x00400040
<7> [98.586356] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe48c] = 0x02000200
<7> [98.586435] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe49c] = 0x40004000
<7> [98.586511] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4c4] = 0x10401040
<7> [98.586583] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4f0] = 0x00020002
<7> [98.586658] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe530] = 0x00000400
<7> [98.586764] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7c8] = 0x04002000
<7> [98.586881] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00009100
<7> [98.587005] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x14800] = 0x00020002
<7> [98.587144] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs0 save-restore MMIOs
<7> [98.587260] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22050] = 0x10001000
<7> [98.587354] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220c4] = 0x3f7e0306
<7> [98.587425] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220d4] = 0xc080c080
<7> [98.587498] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs8 save-restore MMIOs
<7> [98.587568] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee050] = 0x10001000
<7> [98.587637] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0c4] = 0x3f7e0306
<7> [98.587727] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0d4] = 0xc080c080
<7> [98.587844] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs0 save-restore MMIOs
<7> [98.587954] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a050] = 0x10001000
<7> [98.588100] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0c4] = 0x3f7e0308
<7> [98.588217] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0d4] = 0xc080c080
<7> [98.588293] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d0] = 0x1000dafc
<7> [98.588361] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d4] = 0x1000db01
<7> [98.588431] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d8] = 0x0000db1c
<7> [98.588504] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs1 save-restore MMIOs
<7> [98.588585] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c050] = 0x10001000
<7> [98.588700] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0c4] = 0x3f7e0308
<7> [98.588808] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0d4] = 0xc080c080
<7> [98.588922] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d0] = 0x1000dafc
<7> [98.589064] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d4] = 0x1000db01
<7> [98.589154] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d8] = 0x0000db1c
<7> [98.589228] xe 0000:03:00.0: [drm:xe_gt_apply_ccs_mode [xe]] Tile0: GT0: CCS_MODE=fff0fc0 config:00400000, num_engines:1, num_slices:2
<6> [98.591054] xe 0000:03:00.0: [drm] Tile0: GT0: reset done
<5> [98.591063] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=4294967262, lrc_seqno=4294967262, guc_id=2, flags=0x0 in xe_exec_system_ [3937]
<7> [98.591066] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<7> [98.591378] xe 0000:03:00.0: [drm:xe_gt_sriov_pf_config_restart [xe]] PF: Tile0: GT0: pushed 0 skip 24 of 24 VFs configurations
<7> [98.591531] xe 0000:03:00.0: [drm:pf_worker_restart_func [xe]] PF: Tile0: GT0: restart completed
<6> [98.697081] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [98.697117] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [98.697130] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [98.697142] nvme 0000:05:00.0: [ 0] RxErr (First)
<5> [103.880743] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=8166, lrc_seqno=8166, guc_id=0, flags=0x73 in no process [-1]
<7> [103.880774] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [103.881217] ------------[ cut here ]------------
<4> [103.881224] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [103.881233] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1626 at guc_exec_queue_timedout_job+0x1424/0x2400 [xe], CPU#12: kworker/u64:21/2332
<4> [103.881759] Modules linked in: xe snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mei_gsc mtd_intel_dg drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp cmdlinepart hid_generic spi_nor coretemp eeepc_wmi mei_pxp mei_hdcp mtd asus_wmi sparse_keymap platform_profile wmi_bmof binfmt_misc kvm_intel usbhid hid kvm irqbypass r8169 ghash_clmulni_intel aesni_intel snd_intel_dspcfg snd_hda_codec rapl intel_cstate snd_hda_core video snd_hwdep realtek snd_pcm snd_timer i2c_i801 spi_intel_pci mei_me snd i2c_mux idma64 spi_intel soundcore i2c_smbus mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry wmi intel_vsec pinctrl_alderlake acpi_tad acpi_pad dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink
<4> [103.882155] autofs4 [last unloaded: xe]
<4> [103.882180] CPU: 12 UID: 0 PID: 2332 Comm: kworker/u64:21 Tainted: G S U W 7.0.0-rc6-lgci-xe-xe-pw-164323v2-debug+ #1 PREEMPT(lazy)
<4> [103.882198] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [103.882206] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [103.882215] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [103.882248] RIP: 0010:guc_exec_queue_timedout_job+0x142d/0x2400 [xe]
<4> [103.882736] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 46 9f 5e e1 48 89 c6 48 8d 3d ac 94 39 00 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 c6 ee ff ff 8b 70 08 49
<4> [103.882747] RSP: 0018:ffffc90003f8fca0 EFLAGS: 00010246
<4> [103.882761] RAX: ffffffffa1202f81 RBX: 0000000000000000 RCX: 0000000000000000
<4> [103.882771] RDX: ffff888102525010 RSI: ffffffffa1202f81 RDI: ffffffffa1003ef0
<4> [103.882779] RBP: ffffc90003f8fdb0 R08: 0000000000000000 R09: 0000000000000000
<4> [103.882788] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [103.882796] R13: ffff888102525010 R14: ffff888136cf5018 R15: 00000000ffffffc2
<4> [103.882805] FS: 0000000000000000(0000) GS:ffff8888db297000(0000) knlGS:0000000000000000
<4> [103.882816] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [103.882825] CR2: 0000756fec021e58 CR3: 000000000344c002 CR4: 0000000000f72ef0
<4> [103.882835] PKRU: 55555554
<4> [103.882845] Call Trace:
<4> [103.882853] <TASK>
<4> [103.882882] ? lock_acquire+0x20/0x2f0
<4> [103.882920] ? lock_release+0xd0/0x2b0
<4> [103.882955] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [103.882995] process_one_work+0x239/0x760
<4> [103.883039] worker_thread+0x200/0x3f0
<4> [103.883056] ? __pfx_worker_thread+0x10/0x10
<4> [103.883071] kthread+0x10d/0x150
<4> [103.883088] ? __pfx_kthread+0x10/0x10
<4> [103.883111] ret_from_fork+0x3d4/0x480
<4> [103.883124] ? __pfx_kthread+0x10/0x10
<4> [103.883144] ret_from_fork_asm+0x1a/0x30
<4> [103.883196] </TASK>
<4> [103.883204] irq event stamp: 48861
<4> [103.883211] hardirqs last enabled at (48867): [<ffffffff814a9ca9>] __up_console_sem+0x79/0xa0
<4> [103.883228] hardirqs last disabled at (48872): [<ffffffff814a9c8e>] __up_console_sem+0x5e/0xa0
<4> [103.883241] softirqs last enabled at (48698): [<ffffffff813d0e9f>] __irq_exit_rcu+0x13f/0x160
<4> [103.883256] softirqs last disabled at (48693): [<ffffffff813d0e9f>] __irq_exit_rcu+0x13f/0x160
<4> [103.883268] ---[ end trace 0000000000000000 ]---
<6> [103.883396] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [103.883888] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<6> [103.883937] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<7> [103.884246] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [103.884924] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [103.885412] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [103.885870] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x00800000
<7> [103.886296] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x11111440
<7> [103.886789] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [103.887187] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [103.887637] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [103.887777] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00000100
<7> [103.887853] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [103.887943] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [103.888051] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7> [103.889059] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<7> [103.899816] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 10ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034
<7> [103.900251] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled
<7> [103.902199] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: flag:0x1
<7> [103.902443] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: mocs entries: 16
<7> [103.902642] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[0] 0x4000 0xc
<7> [103.902838] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[1] 0x4004 0x10c
<7> [103.903032] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[2] 0x4008 0x130
<7> [103.903237] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[3] 0x400c 0x13c
<7> [103.903458] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[4] 0x4010 0x100
<7> [103.903655] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[5] 0x4014 0x100
<7> [103.903855] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[6] 0x4018 0x100
<7> [103.904049] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[7] 0x401c 0x100
<7> [103.904238] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[8] 0x4020 0x100
<7> [103.904452] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[9] 0x4024 0x100
<7> [103.904647] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[10] 0x4028 0x100
<7> [103.904824] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[11] 0x402c 0x100
<7> [103.905003] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[12] 0x4030 0x100
<7> [103.905176] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[13] 0x4034 0x100
<7> [103.905359] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[14] 0x4038 0x100
<7> [103.905527] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[15] 0x403c 0x100
<7> [103.905697] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying rcs0 save-restore MMIOs
<7> [103.905896] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2050] = 0x10001000
<7> [103.906084] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20c4] = 0x3f7e0306
<7> [103.906264] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20d4] = 0xc080c080
<7> [103.906462] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d0] = 0x00006210
<7> [103.906627] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d4] = 0x000062a8
<7> [103.906785] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d8] = 0x1000dafc
<7> [103.906942] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24dc] = 0x1000db01
<7> [103.907088] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24e0] = 0x0000db1c
<7> [103.907240] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe194] = 0x00400040
<7> [103.907414] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe48c] = 0x02000200
<7> [103.907571] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe49c] = 0x40004000
<7> [103.907723] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4c4] = 0x10401040
<7> [103.907870] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4f0] = 0x00020002
<7> [103.908016] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe530] = 0x00000400
<7> [103.908161] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7c8] = 0x04002000
<7> [103.908304] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00009100
<7> [103.908466] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x14800] = 0x00020002
<7> [103.908604] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs0 save-restore MMIOs
<7> [103.908737] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22050] = 0x10001000
<7> [103.908868] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220c4] = 0x3f7e0306
<7> [103.908992] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220d4] = 0xc080c080
<7> [103.909121] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs8 save-restore MMIOs
<7> [103.909248] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee050] = 0x10001000
<7> [103.909382] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0c4] = 0x3f7e0306
<7> [103.909503] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0d4] = 0xc080c080
<7> [103.909623] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs0 save-restore MMIOs
<7> [103.909736] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a050] = 0x10001000
<7> [103.909848] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0c4] = 0x3f7e0308
<7> [103.909959] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0d4] = 0xc080c080
<7> [103.910068] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d0] = 0x1000dafc
<7> [103.910173] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d4] = 0x1000db01
<7> [103.910274] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d8] = 0x0000db1c
<7> [103.910391] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs1 save-restore MMIOs
<7> [103.910496] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c050] = 0x10001000
<7> [103.910606] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0c4] = 0x3f7e0308
<7> [103.910710] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0d4] = 0xc080c080
<7> [103.910812] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d0] = 0x1000dafc
<7> [103.910910] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d4] = 0x1000db01
<7> [103.911005] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d8] = 0x0000db1c
<7> [103.911105] xe 0000:03:00.0: [drm:xe_gt_apply_ccs_mode [xe]] Tile0: GT0: CCS_MODE=fff0fc0 config:00400000, num_engines:1, num_slices:2
<6> [103.911256] xe 0000:03:00.0: [drm] Tile0: GT0: reset done
<7> [103.912637] xe 0000:03:00.0: [drm:xe_gt_sriov_pf_config_restart [xe]] PF: Tile0: GT0: pushed 0 skip 24 of 24 VFs configurations
<7> [103.912760] xe 0000:03:00.0: [drm:pf_worker_restart_func [xe]] PF: Tile0: GT0: restart completed
<7> [104.896651] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x2d2d2b2c
<7> [104.896796] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x2c2c2c2d
<5> [109.000190] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=8166, lrc_seqno=8166, guc_id=0, flags=0x73 in no process [-1]
<7> [109.000219] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [109.000606] ------------[ cut here ]------------
<4> [109.000614] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [109.000622] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1626 at guc_exec_queue_timedout_job+0x1424/0x2400 [xe], CPU#2: kworker/u64:17/2328
<4> [109.001139] Modules linked in: xe snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mei_gsc mtd_intel_dg drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp cmdlinepart hid_generic spi_nor coretemp eeepc_wmi mei_pxp mei_hdcp mtd asus_wmi sparse_keymap platform_profile wmi_bmof binfmt_misc kvm_intel usbhid hid kvm irqbypass r8169 ghash_clmulni_intel aesni_intel snd_intel_dspcfg snd_hda_codec rapl intel_cstate snd_hda_core video snd_hwdep realtek snd_pcm snd_timer i2c_i801 spi_intel_pci mei_me snd i2c_mux idma64 spi_intel soundcore i2c_smbus mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry wmi intel_vsec pinctrl_alderlake acpi_tad acpi_pad dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink
<4> [109.001583] autofs4 [last unloaded: xe]
<4> [109.001606] CPU: 2 UID: 0 PID: 2328 Comm: kworker/u64:17 Tainted: G S U W 7.0.0-rc6-lgci-xe-xe-pw-164323v2-debug+ #1 PREEMPT(lazy)
<4> [109.001622] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [109.001629] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [109.001637] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [109.001666] RIP: 0010:guc_exec_queue_timedout_job+0x142d/0x2400 [xe]
<4> [109.002102] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 46 9f 5e e1 48 89 c6 48 8d 3d ac 94 39 00 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 c6 ee ff ff 8b 70 08 49
<4> [109.002118] RSP: 0018:ffffc9000210bca0 EFLAGS: 00010246
<4> [109.002140] RAX: ffffffffa1202f81 RBX: 0000000000000000 RCX: 0000000000000000
<4> [109.002154] RDX: ffff888102525010 RSI: ffffffffa1202f81 RDI: ffffffffa1003ef0
<4> [109.002166] RBP: ffffc9000210bdb0 R08: 0000000000000000 R09: 0000000000000000
<4> [109.002177] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [109.002191] R13: ffff888102525010 R14: ffff888136cf5018 R15: 00000000ffffffc2
<4> [109.002206] FS: 0000000000000000(0000) GS:ffff8888dad97000(0000) knlGS:0000000000000000
<4> [109.002221] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [109.002234] CR2: 00007570b0db5000 CR3: 000000000344c006 CR4: 0000000000f72ef0
<4> [109.002247] PKRU: 55555554
<4> [109.002259] Call Trace:
<4> [109.002270] <TASK>
<4> [109.002308] ? lock_acquire+0x20/0x2f0
<4> [109.002357] ? lock_release+0xd0/0x2b0
<4> [109.002404] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [109.002456] process_one_work+0x239/0x760
<4> [109.002516] worker_thread+0x200/0x3f0
<4> [109.002541] ? __pfx_worker_thread+0x10/0x10
<4> [109.002562] kthread+0x10d/0x150
<4> [109.002586] ? __pfx_kthread+0x10/0x10
<4> [109.002617] ret_from_fork+0x3d4/0x480
<4> [109.002633] ? __pfx_kthread+0x10/0x10
<4> [109.002661] ret_from_fork_asm+0x1a/0x30
<4> [109.002731] </TASK>
<4> [109.002742] irq event stamp: 280821
<4> [109.002753] hardirqs last enabled at (280827): [<ffffffff814a9ca9>] __up_console_sem+0x79/0xa0
<4> [109.002769] hardirqs last disabled at (280832): [<ffffffff814a9c8e>] __up_console_sem+0x5e/0xa0
<4> [109.002781] softirqs last enabled at (280658): [<ffffffff813d0e9f>] __irq_exit_rcu+0x13f/0x160
<4> [109.002793] softirqs last disabled at (280653): [<ffffffff813d0e9f>] __irq_exit_rcu+0x13f/0x160
<4> [109.002804] ---[ end trace 0000000000000000 ]---
<5> [109.004824] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=8168, lrc_seqno=8168, guc_id=0, flags=0x73 in no process [-1]
<7> [109.004850] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [109.005298] ------------[ cut here ]------------
<4> [109.005305] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [109.005313] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1626 at guc_exec_queue_timedout_job+0x1424/0x2400 [xe], CPU#4: kworker/u64:17/2328
<4> [109.005713] Modules linked in: xe snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mei_gsc mtd_intel_dg drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp cmdlinepart hid_generic spi_nor coretemp eeepc_wmi mei_pxp mei_hdcp mtd asus_wmi sparse_keymap platform_profile wmi_bmof binfmt_misc kvm_intel usbhid hid kvm irqbypass r8169 ghash_clmulni_intel aesni_intel snd_intel_dspcfg snd_hda_codec rapl intel_cstate snd_hda_core video snd_hwdep realtek snd_pcm snd_timer i2c_i801 spi_intel_pci mei_me snd i2c_mux idma64 spi_intel soundcore i2c_smbus mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry wmi intel_vsec pinctrl_alderlake acpi_tad acpi_pad dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink
<4> [109.006082] autofs4 [last unloaded: xe]
<4> [109.006103] CPU: 4 UID: 0 PID: 2328 Comm: kworker/u64:17 Tainted: G S U W 7.0.0-rc6-lgci-xe-xe-pw-164323v2-debug+ #1 PREEMPT(lazy)
<4> [109.006118] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [109.006124] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [109.006131] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [109.006157] RIP: 0010:guc_exec_queue_timedout_job+0x142d/0x2400 [xe]
<4> [109.006535] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 46 9f 5e e1 48 89 c6 48 8d 3d ac 94 39 00 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 c6 ee ff ff 8b 70 08 49
<4> [109.006544] RSP: 0018:ffffc9000210bca0 EFLAGS: 00010246
<4> [109.006556] RAX: ffffffffa1202f81 RBX: 0000000000000000 RCX: 0000000000000000
<4> [109.006563] RDX: ffff888102525010 RSI: ffffffffa1202f81 RDI: ffffffffa1003ef0
<4> [109.006570] RBP: ffffc9000210bdb0 R08: 0000000000000000 R09: 0000000000000000
<4> [109.006576] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [109.006582] R13: ffff888102525010 R14: ffff888136cf5018 R15: 00000000ffffffc2
<4> [109.006589] FS: 0000000000000000(0000) GS:ffff8888dae97000(0000) knlGS:0000000000000000
<4> [109.006596] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [109.006603] CR2: 0000000000e2e322 CR3: 000000000344c003 CR4: 0000000000f72ef0
<4> [109.006611] PKRU: 55555554
<4> [109.006616] Call Trace:
<4> [109.006622] <TASK>
<4> [109.006643] ? lock_acquire+0x20/0x2f0
<4> [109.006671] ? lock_release+0xd0/0x2b0
<4> [109.006698] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [109.006726] process_one_work+0x239/0x760
<4> [109.006758] worker_thread+0x200/0x3f0
<4> [109.006772] ? __pfx_worker_thread+0x10/0x10
<4> [109.006784] kthread+0x10d/0x150
<4> [109.006796] ? __pfx_kthread+0x10/0x10
<4> [109.006799] ret_from_fork+0x3d4/0x480
<4> [109.006801] ? __pfx_kthread+0x10/0x10
<4> [109.006804] ret_from_fork_asm+0x1a/0x30
<4> [109.006811] </TASK>
<4> [109.006812] irq event stamp: 281779
<4> [109.006813] hardirqs last enabled at (281785): [<ffffffff814a9ca9>] __up_console_sem+0x79/0xa0
<4> [109.006815] hardirqs last disabled at (281790): [<ffffffff814a9c8e>] __up_console_sem+0x5e/0xa0
<4> [109.006817] softirqs last enabled at (281296): [<ffffffff813d0e9f>] __irq_exit_rcu+0x13f/0x160
<4> [109.006819] softirqs last disabled at (281283): [<ffffffff813d0e9f>] __irq_exit_rcu+0x13f/0x160
<4> [109.006821] ---[ end trace 0000000000000000 ]---
<6> [109.006823] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [109.006892] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<6> [109.006898] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<7> [109.006939] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [109.007149] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [109.007237] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [109.007321] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x00800000
<7> [109.007398] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x11111440
<7> [109.007474] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [109.007549] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [109.007624] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [109.007699] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00000100
<7> [109.007770] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [109.007857] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [109.007977] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7> [109.009009] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<7> [109.018955] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 9ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034
<7> [109.019235] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled
<7> [109.020519] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: flag:0x1
<6> [109.020647] [IGT] xe_exec_system_allocator: finished subtest many-64k-free, FAIL
<7> [109.020615] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: mocs entries: 16
<7> [109.020696] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[0] 0x4000 0xc
<7> [109.020775] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[1] 0x4004 0x10c
<7> [109.020850] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[2] 0x4008 0x130
<7> [109.020937] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[3] 0x400c 0x13c
<7> [109.021012] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[4] 0x4010 0x100
<7> [109.021084] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[5] 0x4014 0x100
<7> [109.021154] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[6] 0x4018 0x100
<6> [109.021203] [IGT] xe_exec_system_allocator: exiting, ret=98
<7> [109.021223] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[7] 0x401c 0x100
<7> [109.021291] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[8] 0x4020 0x100
<7> [109.021360] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[9] 0x4024 0x100
<7> [109.021428] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[10] 0x4028 0x100
<7> [109.021498] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[11] 0x402c 0x100
<7> [109.021566] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[12] 0x4030 0x100
<7> [109.021631] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[13] 0x4034 0x100
<6> [109.021688] Console: switching to colour frame buffer device 240x67
<7> [109.021696] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[14] 0x4038 0x100
<7> [109.021764] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[15] 0x403c 0x100
<7> [109.021830] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying rcs0 save-restore MMIOs
<7> [109.021919] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2050] = 0x10001000
<7> [109.022002] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20c4] = 0x3f7e0306
<7> [109.022080] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20d4] = 0xc080c080
<7> [109.022150] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d0] = 0x00006210
<7> [109.022218] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d4] = 0x000062a8
<7> [109.022286] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d8] = 0x1000dafc
<7> [109.022352] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24dc] = 0x1000db01
<7> [109.022419] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24e0] = 0x0000db1c
<7> [109.022490] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe194] = 0x00400040
<7> [109.022568] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe48c] = 0x02000200
<7> [109.022645] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe49c] = 0x40004000
<7> [109.022717] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4c4] = 0x10401040
<7> [109.022787] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4f0] = 0x00020002
<7> [109.022865] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe530] = 0x00000400
<7> [109.022958] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7c8] = 0x04002000
<7> [109.023041] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00009100
<7> [109.023121] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x14800] = 0x00020002
<7> [109.023191] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs0 save-restore MMIOs
<7> [109.023257] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22050] = 0x10001000
<7> [109.023323] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220c4] = 0x3f7e0306
<7> [109.023387] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220d4] = 0xc080c080
<7> [109.023453] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs8 save-restore MMIOs
<7> [109.023521] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee050] = 0x10001000
<7> [109.023588] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0c4] = 0x3f7e0306
<7> [109.023653] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0d4] = 0xc080c080
<7> [109.023724] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs0 save-restore MMIOs
<7> [109.023791] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a050] = 0x10001000
<7> [109.023858] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0c4] = 0x3f7e0308
<7> [109.023936] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0d4] = 0xc080c080
<7> [109.024013] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d0] = 0x1000dafc
<7> [109.024087] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d4] = 0x1000db01
<7> [109.024159] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d8] = 0x0000db1c
<7> [109.024235] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs1 save-restore MMIOs
<7> [109.024308] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c050] = 0x10001000
<7> [109.024381] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0c4] = 0x3f7e0308
<7> [109.024453] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0d4] = 0xc080c080
<7> [109.024526] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d0] = 0x1000dafc
<7> [109.024598] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d4] = 0x1000db01
<7> [109.024671] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d8] = 0x0000db1c
<7> [109.024748] xe 0000:03:00.0: [drm:xe_gt_apply_ccs_mode [xe]] Tile0: GT0: CCS_MODE=fff0fc0 config:00400000, num_engines:1, num_slices:2
<6> [109.024866] xe 0000:03:00.0: [drm] Tile0: GT0: reset done
<5> [109.024871] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=8168, lrc_seqno=8168, guc_id=0, flags=0x73 in no process [-1]
<7> [109.024873] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [109.024955] ------------[ cut here ]------------
<4> [109.024956] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [109.024958] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1626 at guc_exec_queue_timedout_job+0x1424/0x2400 [xe], CPU#8: kworker/u64:17/2328
<4> [109.025033] Modules linked in: xe snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mei_gsc mtd_intel_dg drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp cmdlinepart hid_generic spi_nor coretemp eeepc_wmi mei_pxp mei_hdcp mtd asus_wmi sparse_keymap platform_profile wmi_bmof binfmt_misc kvm_intel usbhid hid kvm irqbypass r8169 ghash_clmulni_intel aesni_intel snd_intel_dspcfg snd_hda_codec rapl intel_cstate snd_hda_core video snd_hwdep realtek snd_pcm snd_timer i2c_i801 spi_intel_pci mei_me snd i2c_mux idma64 spi_intel soundcore i2c_smbus mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry wmi intel_vsec pinctrl_alderlake acpi_tad acpi_pad dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink
<4> [109.025099] autofs4 [last unloaded: xe]
<4> [109.025103] CPU: 8 UID: 0 PID: 2328 Comm: kworker/u64:17 Tainted: G S U W 7.0.0-rc6-lgci-xe-xe-pw-164323v2-debug+ #1 PREEMPT(lazy)
<4> [109.025106] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [109.025108] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [109.025109] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [109.025115] RIP: 0010:guc_exec_queue_timedout_job+0x142d/0x2400 [xe]
<4> [109.025187] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 46 9f 5e e1 48 89 c6 48 8d 3d ac 94 39 00 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 c6 ee ff ff 8b 70 08 49
<4> [109.025189] RSP: 0000:ffffc9000210bca0 EFLAGS: 00010246
<4> [109.025191] RAX: ffffffffa1202f81 RBX: 0000000000000000 RCX: 0000000000000000
<4> [109.025193] RDX: ffff888102525010 RSI: ffffffffa1202f81 RDI: ffffffffa1003ef0
<4> [109.025194] RBP: ffffc9000210bdb0 R08: 0000000000000000 R09: 0000000000000000
<4> [109.025195] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [109.025196] R13: ffff888102525010 R14: ffff888136cf5018 R15: 00000000ffffffc2
<4> [109.025198] FS: 0000000000000000(0000) GS:ffff8888db097000(0000) knlGS:0000000000000000
<4> [109.025199] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [109.025201] CR2: 0000794c8e129c30 CR3: 00000001384d5005 CR4: 0000000000f72ef0
<4> [109.025202] PKRU: 55555554
<4> [109.025203] Call Trace:
<4> [109.025204] <TASK>
<4> [109.025208] ? lock_acquire+0x20/0x2f0
<4> [109.025214] ? lock_release+0xd0/0x2b0
<4> [109.025219] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [109.025225] process_one_work+0x239/0x760
<4> [109.025231] worker_thread+0x200/0x3f0
<4> [109.025234] ? __pfx_worker_thread+0x10/0x10
<4> [109.025237] kthread+0x10d/0x150
<4> [109.025239] ? __pfx_kthread+0x10/0x10
<4> [109.025243] ret_from_fork+0x3d4/0x480
<4> [109.025245] ? __pfx_kthread+0x10/0x10
<4> [109.025249] ret_from_fork_asm+0x1a/0x30
<4> [109.025256] </TASK>
<4> [109.025257] irq event stamp: 284903
<4> [109.025258] hardirqs last enabled at (284909): [<ffffffff814a9ca9>] __up_console_sem+0x79/0xa0
<4> [109.025261] hardirqs last disabled at (284914): [<ffffffff814a9c8e>] __up_console_sem+0x5e/0xa0
<4> [109.025264] softirqs last enabled at (283772): [<ffffffff813d0e9f>] __irq_exit_rcu+0x13f/0x160
<4> [109.025266] softirqs last disabled at (283767): [<ffffffff813d0e9f>] __irq_exit_rcu+0x13f/0x160
<4> [109.025268] ---[ end trace 0000000000000000 ]---
<6> [109.025270] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [109.025342] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<6> [109.025348] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<7> [109.025730] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [109.026073] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [109.026163] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [109.026251] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x00800000
<7> [109.026335] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x11111440
<7> [109.026416] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [109.026495] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [109.026575] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [109.026655] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00000100
<7> [109.026730] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [109.026827] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [109.026953] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7> [109.027965] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<7> [109.037883] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 9ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034
<7> [109.038134] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled
<7> [109.039331] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: flag:0x1
<7> [109.039413] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: mocs entries: 16
<7> [109.039489] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[0] 0x4000 0xc
<7> [109.039566] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[1] 0x4004 0x10c
<7> [109.039641] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[2] 0x4008 0x130
<7> [109.039716] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[3] 0x400c 0x13c
<7> [109.039790] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[4] 0x4010 0x100
<7> [109.039864] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[5] 0x4014 0x100
<7> [109.039950] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[6] 0x4018 0x100
<7> [109.040024] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[7] 0x401c 0x100
<7> [109.040097] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[8] 0x4020 0x100
<7> [109.040169] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[9] 0x4024 0x100
<7> [109.040241] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[10] 0x4028 0x100
<7> [109.040313] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[11] 0x402c 0x100
<7> [109.040384] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[12] 0x4030 0x100
<7> [109.040457] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[13] 0x4034 0x100
<7> [109.040528] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[14] 0x4038 0x100
<7> [109.040598] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[15] 0x403c 0x100
<7> [109.040669] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying rcs0 save-restore MMIOs
<7> [109.040751] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2050] = 0x10001000
<7> [109.040829] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20c4] = 0x3f7e0306
<7> [109.040923] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20d4] = 0xc080c080
<7> [109.041001] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d0] = 0x00006210
<7> [109.041075] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d4] = 0x000062a8
<7> [109.041152] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d8] = 0x1000dafc
<7> [109.041226] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24dc] = 0x1000db01
<7> [109.041299] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24e0] = 0x0000db1c
<7> [109.041374] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe194] = 0x00400040
<7> [109.041450] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe48c] = 0x02000200
<7> [109.041524] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe49c] = 0x40004000
<7> [109.041599] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4c4] = 0x10401040
<7> [109.041674] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4f0] = 0x00020002
<7> [109.041754] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe530] = 0x00000400
<7> [109.041833] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7c8] = 0x04002000
<7> [109.041920] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00009100
<7> [109.041996] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x14800] = 0x00020002
<7> [109.042072] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs0 save-restore MMIOs
<7> [109.042145] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22050] = 0x10001000
<7> [109.042217] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220c4] = 0x3f7e0306
<7> [109.042288] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220d4] = 0xc080c080
<7> [109.042363] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs8 save-restore MMIOs
<7> [109.042436] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee050] = 0x10001000
<7> [109.042508] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0c4] = 0x3f7e0306
<7> [109.042582] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0d4] = 0xc080c080
<7> [109.042658] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs0 save-restore MMIOs
<7> [109.042730] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a050] = 0x10001000
<7> [109.042803] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0c4] = 0x3f7e0308
<7> [109.042879] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0d4] = 0xc080c080
<7> [109.042953] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d0] = 0x1000dafc
<7> [109.043025] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d4] = 0x1000db01
<7> [109.043098] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d8] = 0x0000db1c
<7> [109.043173] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs1 save-restore MMIOs
<7> [109.043247] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c050] = 0x10001000
<7> [109.043319] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0c4] = 0x3f7e0308
<7> [109.043390] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0d4] = 0xc080c080
<7> [109.043464] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d0] = 0x1000dafc
<7> [109.043537] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d4] = 0x1000db01
<7> [109.043610] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d8] = 0x0000db1c
<7> [109.043687] xe 0000:03:00.0: [drm:xe_gt_apply_ccs_mode [xe]] Tile0: GT0: CCS_MODE=fff0fc0 config:00400000, num_engines:1, num_slices:2
<6> [109.043802] xe 0000:03:00.0: [drm] Tile0: GT0: reset done
<5> [109.043806] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=8168, lrc_seqno=8168, guc_id=0, flags=0x73 in no process [-1]
<7> [109.043809] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [109.043871] ------------[ cut here ]------------
<4> [109.043872] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [109.043873] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1626 at guc_exec_queue_timedout_job+0x1424/0x2400 [xe], CPU#8: kworker/u64:17/2328
<4> [109.043960] Modules linked in: xe snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mei_gsc mtd_intel_dg drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp cmdlinepart hid_generic spi_nor coretemp eeepc_wmi mei_pxp mei_hdcp mtd asus_wmi sparse_keymap platform_profile wmi_bmof binfmt_misc kvm_intel usbhid hid kvm irqbypass r8169 ghash_clmulni_intel aesni_intel snd_intel_dspcfg snd_hda_codec rapl intel_cstate snd_hda_core video snd_hwdep realtek snd_pcm snd_timer i2c_i801 spi_intel_pci mei_me snd i2c_mux idma64 spi_intel soundcore i2c_smbus mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry wmi intel_vsec pinctrl_alderlake acpi_tad acpi_pad dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink
<4> [109.044026] autofs4 [last unloaded: xe]
<4> [109.044029] CPU: 8 UID: 0 PID: 2328 Comm: kworker/u64:17 Tainted: G S U W 7.0.0-rc6-lgci-xe-xe-pw-164323v2-debug+ #1 PREEMPT(lazy)
<4> [109.044032] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [109.044033] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [109.044035] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [109.044040] RIP: 0010:guc_exec_queue_timedout_job+0x142d/0x2400 [xe]
<4> [109.044111] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 46 9f 5e e1 48 89 c6 48 8d 3d ac 94 39 00 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 c6 ee ff ff 8b 70 08 49
<4> [109.044112] RSP: 0000:ffffc9000210bca0 EFLAGS: 00010246
<4> [109.044115] RAX: ffffffffa1202f81 RBX: 0000000000000000 RCX: 0000000000000000
<4> [109.044116] RDX: ffff888102525010 RSI: ffffffffa1202f81 RDI: ffffffffa1003ef0
<4> [109.044117] RBP: ffffc9000210bdb0 R08: 0000000000000000 R09: 0000000000000000
<4> [109.044119] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [109.044120] R13: ffff888102525010 R14: ffff888136cf5018 R15: 00000000ffffffc2
<4> [109.044121] FS: 0000000000000000(0000) GS:ffff8888db097000(0000) knlGS:0000000000000000
<4> [109.044123] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [109.044124] CR2: 0000794c8e129c30 CR3: 00000001384d5005 CR4: 0000000000f72ef0
<4> [109.044125] PKRU: 55555554
<4> [109.044127] Call Trace:
<4> [109.044128] <TASK>
<4> [109.044132] ? lock_acquire+0x20/0x2f0
<4> [109.044137] ? lock_release+0xd0/0x2b0
<4> [109.044142] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [109.044148] process_one_work+0x239/0x760
<4> [109.044154] worker_thread+0x200/0x3f0
<4> [109.044157] ? __pfx_worker_thread+0x10/0x10
<4> [109.044159] kthread+0x10d/0x150
<4> [109.044162] ? __pfx_kthread+0x10/0x10
<4> [109.044166] ret_from_fork+0x3d4/0x480
<4> [109.044168] ? __pfx_kthread+0x10/0x10
<4> [109.044171] ret_from_fork_asm+0x1a/0x30
<4> [109.044178] </TASK>
<4> [109.044179] irq event stamp: 288053
<4> [109.044181] hardirqs last enabled at (288059): [<ffffffff814a9ca9>] __up_console_sem+0x79/0xa0
<4> [109.044183] hardirqs last disabled at (288064): [<ffffffff814a9c8e>] __up_console_sem+0x5e/0xa0
<4> [109.044185] softirqs last enabled at (287032): [<ffffffff813d0e9f>] __irq_exit_rcu+0x13f/0x160
<4> [109.044188] softirqs last disabled at (287025): [<ffffffff813d0e9f>] __irq_exit_rcu+0x13f/0x160
<4> [109.044190] ---[ end trace 0000000000000000 ]---
<3> [110.088080] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: Timed out wait for G2H, fence 32239, action 5503, done no
<5> [110.099483] xe 0000:03:00.0: [drm] PF: Tile0: GT0: Failed to push PF 15 config KLVs (-ETIME)
<6> [110.099488] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a0b : 32b value 0 } # begin_ctx_id
<6> [110.099491] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0004 : 32b value 65535 } # num_contexts
<6> [110.099492] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a0a : 32b value 0 } # begin_db_id
<6> [110.099494] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0006 : 32b value 256 } # num_doorbells
<6> [110.099495] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a01 : 32b value 0 } # exec_quantum
<6> [110.099497] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a02 : 32b value 0 } # preempt_timeout
<6> [110.099498] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a03 : 32b value 0 } # cat_error_count
<6> [110.099499] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a04 : 32b value 0 } # engine_reset_count
<6> [110.099501] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a05 : 32b value 0 } # page_fault_count
<6> [110.099502] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a06 : 32b value 0 } # guc_time_us
<6> [110.099503] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a07 : 32b value 0 } # irq_time_us
<6> [110.099504] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a08 : 32b value 0 } # doorbell_time_us
<6> [110.099506] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a0d : 32b value 0 } # multi_lrc_count
<6> [110.099507] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0001 : 64b value 0x400000 } # ggtt_start
<6> [110.099508] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0002 : 64b value 0xfea00000 } # ggtt_size
<3> [110.099518] xe 0000:03:00.0: [drm] *ERROR* PF: Tile0: GT0: Failed to push self configuration (-ETIME)
<7> [110.108943] xe 0000:03:00.0: [drm:xe_gt_sriov_pf_config_restart [xe]] PF: Tile0: GT0: pushed 0 skip 24 of 24 VFs configurations
<7> [110.109095] xe 0000:03:00.0: [drm:pf_worker_restart_func [xe]] PF: Tile0: GT0: restart completed
<3> [111.305605] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=12073 recv=12072
<3> [113.607907] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=12074 recv=12072
<5> [113.617385] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=8169, lrc_seqno=8169, guc_id=0, flags=0x73 in no process [-1]
<7> [113.617389] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [113.617463] ------------[ cut here ]------------
<4> [113.617465] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [113.617466] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1626 at guc_exec_queue_timedout_job+0x1424/0x2400 [xe], CPU#10: kworker/u64:30/2341
<4> [113.617537] Modules linked in: xe snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mei_gsc mtd_intel_dg drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp cmdlinepart hid_generic spi_nor coretemp eeepc_wmi mei_pxp mei_hdcp mtd asus_wmi sparse_keymap platform_profile wmi_bmof binfmt_misc kvm_intel usbhid hid kvm irqbypass r8169 ghash_clmulni_intel aesni_intel snd_intel_dspcfg snd_hda_codec rapl intel_cstate snd_hda_core video snd_hwdep realtek snd_pcm snd_timer i2c_i801 spi_intel_pci mei_me snd i2c_mux idma64 spi_intel soundcore i2c_smbus mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry wmi intel_vsec pinctrl_alderlake acpi_tad acpi_pad dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink
<4> [113.617621] autofs4 [last unloaded: xe]
<4> [113.617626] CPU: 10 UID: 0 PID: 2341 Comm: kworker/u64:30 Tainted: G S U W 7.0.0-rc6-lgci-xe-xe-pw-164323v2-debug+ #1 PREEMPT(lazy)
<4> [113.617629] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [113.617630] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [113.617632] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [113.617639] RIP: 0010:guc_exec_queue_timedout_job+0x142d/0x2400 [xe]
<4> [113.617703] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 46 9f 5e e1 48 89 c6 48 8d 3d ac 94 39 00 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 c6 ee ff ff 8b 70 08 49
<4> [113.617705] RSP: 0018:ffffc900016a7ca0 EFLAGS: 00010246
<4> [113.617707] RAX: ffffffffa1202f81 RBX: 0000000000000000 RCX: 0000000000000000
<4> [113.617709] RDX: ffff888102525010 RSI: ffffffffa1202f81 RDI: ffffffffa1003ef0
<4> [113.617710] RBP: ffffc900016a7db0 R08: 0000000000000000 R09: 0000000000000000
<4> [113.617711] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [113.617712] R13: ffff888102525010 R14: ffff888136cf5018 R15: 00000000ffffffc2
<4> [113.617714] FS: 0000000000000000(0000) GS:ffff8888db197000(0000) knlGS:0000000000000000
<4> [113.617715] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [113.617717] CR2: 00007e17140910f8 CR3: 0000000119477006 CR4: 0000000000f72ef0
<4> [113.617718] PKRU: 55555554
<4> [113.617719] Call Trace:
<4> [113.617721] <TASK>
<4> [113.617725] ? lock_acquire+0x20/0x2f0
<4> [113.617731] ? lock_release+0xd0/0x2b0
<4> [113.617737] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [113.617742] process_one_work+0x239/0x760
<4> [113.617749] worker_thread+0x200/0x3f0
<4> [113.617752] ? __pfx_worker_thread+0x10/0x10
<4> [113.617754] kthread+0x10d/0x150
<4> [113.617757] ? __pfx_kthread+0x10/0x10
<4> [113.617760] ret_from_fork+0x3d4/0x480
<4> [113.617762] ? __pfx_kthread+0x10/0x10
<4> [113.617765] ret_from_fork_asm+0x1a/0x30
<4> [113.617773] </TASK>
<4> [113.617774] irq event stamp: 48311
<4> [113.617775] hardirqs last enabled at (48317): [<ffffffff814a9ca9>] __up_console_sem+0x79/0xa0
<4> [113.617778] hardirqs last disabled at (48322): [<ffffffff814a9c8e>] __up_console_sem+0x5e/0xa0
<4> [113.617780] softirqs last enabled at (47900): [<ffffffff813d0e9f>] __irq_exit_rcu+0x13f/0x160
<4> [113.617782] softirqs last disabled at (47891): [<ffffffff813d0e9f>] __irq_exit_rcu+0x13f/0x160
<4> [113.617784] ---[ end trace 0000000000000000 ]---
<6> [113.617786] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<7> [113.617761] xe 0000:03:00.0: [drm:xe_svm_garbage_collector [xe]] Skipping madvise reset for vma.
<6> [113.617849] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<6> [113.617856] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<7> [113.617904] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [113.618204] xe 0000:03:00.0: [drm:xe_svm_garbage_collector [xe]] Skipping madvise reset for vma.
<7> [113.618298] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [113.618442] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [113.618591] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x00800000
<7> [113.618725] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x11111440
<7> [113.618856] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [113.618985] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [113.619116] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [113.619248] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00000100
<7> [113.619376] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [113.619545] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [113.619912] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7> [113.621663] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<7> [113.632669] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 10ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034
<7> [113.633203] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled
<7> [113.635152] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: flag:0x1
<7> [113.635384] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: mocs entries: 16
<7> [113.635604] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[0] 0x4000 0xc
<7> [113.635916] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[1] 0x4004 0x10c
<7> [113.636212] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[2] 0x4008 0x130
<7> [113.636489] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[3] 0x400c 0x13c
<7> [113.636834] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[4] 0x4010 0x100
<7> [113.637032] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[5] 0x4014 0x100
<7> [113.637211] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[6] 0x4018 0x100
<7> [113.637385] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[7] 0x401c 0x100
<7> [113.637556] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[8] 0x4020 0x100
<7> [113.637743] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[9] 0x4024 0x100
<7> [113.637900] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[10] 0x4028 0x100
<7> [113.638059] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[11] 0x402c 0x100
<7> [113.638216] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[12] 0x4030 0x100
<7> [113.638372] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[13] 0x4034 0x100
<7> [113.638518] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[14] 0x4038 0x100
<7> [113.638701] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[15] 0x403c 0x100
<7> [113.638844] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying rcs0 save-restore MMIOs
<7> [113.639003] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2050] = 0x10001000
<7> [113.639162] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20c4] = 0x3f7e0306
<7> [113.639315] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20d4] = 0xc080c080
<7> [113.639468] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d0] = 0x00006210
<7> [113.639618] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d4] = 0x000062a8
<7> [113.639760] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d8] = 0x1000dafc
<7> [113.639905] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24dc] = 0x1000db01
<7> [113.640043] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24e0] = 0x0000db1c
<7> [113.640178] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe194] = 0x00400040
<7> [113.640313] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe48c] = 0x02000200
<7> [113.640447] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe49c] = 0x40004000
<7> [113.640579] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4c4] = 0x10401040
<7> [113.640741] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4f0] = 0x00020002
<7> [113.640871] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe530] = 0x00000400
<7> [113.640998] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7c8] = 0x04002000
<7> [113.641121] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00009100
<7> [113.641242] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x14800] = 0x00020002
<7> [113.641373] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs0 save-restore MMIOs
<7> [113.641491] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22050] = 0x10001000
<7> [113.641605] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220c4] = 0x3f7e0306
<7> [113.641711] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220d4] = 0xc080c080
<7> [113.641821] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs8 save-restore MMIOs
<7> [113.641933] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee050] = 0x10001000
<7> [113.642041] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0c4] = 0x3f7e0306
<7> [113.642145] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0d4] = 0xc080c080
<7> [113.642256] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs0 save-restore MMIOs
<7> [113.642363] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a050] = 0x10001000
<7> [113.642468] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0c4] = 0x3f7e0308
<7> [113.642571] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0d4] = 0xc080c080
<7> [113.642693] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d0] = 0x1000dafc
<7> [113.642789] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d4] = 0x1000db01
<7> [113.642886] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d8] = 0x0000db1c
<7> [113.642983] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs1 save-restore MMIOs
<7> [113.643077] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c050] = 0x10001000
<7> [113.643174] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0c4] = 0x3f7e0308
<7> [113.643269] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0d4] = 0xc080c080
<7> [113.643363] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d0] = 0x1000dafc
<7> [113.643455] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d4] = 0x1000db01
<7> [113.643550] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d8] = 0x0000db1c
<7> [113.643657] xe 0000:03:00.0: [drm:xe_gt_apply_ccs_mode [xe]] Tile0: GT0: CCS_MODE=fff0fc0 config:00400000, num_engines:1, num_slices:2
<6> [113.643800] xe 0000:03:00.0: [drm] Tile0: GT0: reset done
<5> [113.643806] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=8169, lrc_seqno=8169, guc_id=0, flags=0x73 in no process [-1]
<7> [113.643810] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [113.643881] ------------[ cut here ]------------
<4> [113.643883] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [113.643885] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1626 at guc_exec_queue_timedout_job+0x1424/0x2400 [xe], CPU#0: kworker/u64:30/2341
<4> [113.643971] Modules linked in: xe snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mei_gsc mtd_intel_dg drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp cmdlinepart hid_generic spi_nor coretemp eeepc_wmi mei_pxp mei_hdcp mtd asus_wmi sparse_keymap platform_profile wmi_bmof binfmt_misc kvm_intel usbhid hid kvm irqbypass r8169 ghash_clmulni_intel aesni_intel snd_intel_dspcfg snd_hda_codec rapl intel_cstate snd_hda_core video snd_hwdep realtek snd_pcm snd_timer i2c_i801 spi_intel_pci mei_me snd i2c_mux idma64 spi_intel soundcore i2c_smbus mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry wmi intel_vsec pinctrl_alderlake acpi_tad acpi_pad dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink
<4> [113.644056] autofs4 [last unloaded: xe]
<4> [113.644061] CPU: 0 UID: 0 PID: 2341 Comm: kworker/u64:30 Tainted: G S U W 7.0.0-rc6-lgci-xe-xe-pw-164323v2-debug+ #1 PREEMPT(lazy)
<4> [113.644065] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [113.644066] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [113.644068] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [113.644073] RIP: 0010:guc_exec_queue_timedout_job+0x142d/0x2400 [xe]
<4> [113.644154] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 46 9f 5e e1 48 89 c6 48 8d 3d ac 94 39 00 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 c6 ee ff ff 8b 70 08 49
<4> [113.644156] RSP: 0018:ffffc900016a7ca0 EFLAGS: 00010246
<4> [113.644159] RAX: ffffffffa1202f81 RBX: 0000000000000000 RCX: 0000000000000000
<4> [113.644160] RDX: ffff888102525010 RSI: ffffffffa1202f81 RDI: ffffffffa1003ef0
<4> [113.644162] RBP: ffffc900016a7db0 R08: 0000000000000000 R09: 0000000000000000
<4> [113.644164] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [113.644165] R13: ffff888102525010 R14: ffff888136cf5018 R15: 00000000ffffffc2
<4> [113.644167] FS: 0000000000000000(0000) GS:ffff8888dac97000(0000) knlGS:0000000000000000
<4> [113.644169] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [113.644170] CR2: 000000c00047b000 CR3: 000000000344c006 CR4: 0000000000f72ef0
<4> [113.644172] PKRU: 55555554
<4> [113.644173] Call Trace:
<4> [113.644175] <TASK>
<4> [113.644180] ? lock_acquire+0x20/0x2f0
<4> [113.644186] ? lock_release+0xd0/0x2b0
<4> [113.644192] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [113.644199] process_one_work+0x239/0x760
<4> [113.644206] worker_thread+0x200/0x3f0
<4> [113.644210] ? __pfx_worker_thread+0x10/0x10
<4> [113.644213] kthread+0x10d/0x150
<4> [113.644216] ? __pfx_kthread+0x10/0x10
<4> [113.644220] ret_from_fork+0x3d4/0x480
<4> [113.644223] ? __pfx_kthread+0x10/0x10
<4> [113.644227] ret_from_fork_asm+0x1a/0x30
<4> [113.644236] </TASK>
<4> [113.644237] irq event stamp: 51469
<4> [113.644238] hardirqs last enabled at (51475): [<ffffffff814a9ca9>] __up_console_sem+0x79/0xa0
<4> [113.644242] hardirqs last disabled at (51480): [<ffffffff814a9c8e>] __up_console_sem+0x5e/0xa0
<4> [113.644244] softirqs last enabled at (50618): [<ffffffff813d0e9f>] __irq_exit_rcu+0x13f/0x160
<4> [113.644247] softirqs last disabled at (50611): [<ffffffff813d0e9f>] __irq_exit_rcu+0x13f/0x160
<4> [113.644250] ---[ end trace 0000000000000000 ]---
<6> [113.644252] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [113.644331] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<6> [113.644338] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<7> [113.644429] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [113.644665] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [113.644761] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [113.644857] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x00800000
<7> [113.644943] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x11111440
<7> [113.645030] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [113.645112] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [113.645195] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [113.645282] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00000100
<7> [113.645364] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [113.645464] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [113.645577] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7> [113.646601] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<7> [113.656589] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 9ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034
<7> [113.656858] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled
<7> [113.657946] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: flag:0x1
<7> [113.658024] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: mocs entries: 16
<7> [113.658096] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[0] 0x4000 0xc
<7> [113.658167] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[1] 0x4004 0x10c
<7> [113.658239] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[2] 0x4008 0x130
<7> [113.658308] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[3] 0x400c 0x13c
<7> [113.658377] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[4] 0x4010 0x100
<7> [113.658445] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[5] 0x4014 0x100
<7> [113.658515] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[6] 0x4018 0x100
<7> [113.658590] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[7] 0x401c 0x100
<7> [113.658676] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[8] 0x4020 0x100
<7> [113.658744] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[9] 0x4024 0x100
<7> [113.658811] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[10] 0x4028 0x100
<7> [113.658879] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[11] 0x402c 0x100
<7> [113.658947] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[12] 0x4030 0x100
<7> [113.659015] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[13] 0x4034 0x100
<7> [113.659083] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[14] 0x4038 0x100
<7> [113.659151] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[15] 0x403c 0x100
<7> [113.659219] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying rcs0 save-restore MMIOs
<7> [113.659297] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2050] = 0x10001000
<7> [113.659374] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20c4] = 0x3f7e0306
<7> [113.659447] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20d4] = 0xc080c080
<7> [113.659519] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d0] = 0x00006210
<7> [113.659594] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d4] = 0x000062a8
<7> [113.659667] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d8] = 0x1000dafc
<7> [113.659739] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24dc] = 0x1000db01
<7> [113.659809] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24e0] = 0x0000db1c
<7> [113.659880] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe194] = 0x00400040
<7> [113.659954] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe48c] = 0x02000200
<7> [113.660026] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe49c] = 0x40004000
<7> [113.660097] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4c4] = 0x10401040
<7> [113.660170] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4f0] = 0x00020002
<7> [113.660249] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe530] = 0x00000400
<7> [113.660325] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7c8] = 0x04002000
<7> [113.660400] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00009100
<7> [113.660474] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x14800] = 0x00020002
<7> [113.660547] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs0 save-restore MMIOs
<7> [113.660631] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22050] = 0x10001000
<7> [113.660702] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220c4] = 0x3f7e0306
<7> [113.660770] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220d4] = 0xc080c080
<7> [113.660841] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs8 save-restore MMIOs
<7> [113.660912] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee050] = 0x10001000
<7> [113.660981] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0c4] = 0x3f7e0306
<7> [113.661050] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0d4] = 0xc080c080
<7> [113.661124] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs0 save-restore MMIOs
<7> [113.661194] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a050] = 0x10001000
<7> [113.661262] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0c4] = 0x3f7e0308
<7> [113.661328] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0d4] = 0xc080c080
<7> [113.661396] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d0] = 0x1000dafc
<7> [113.661463] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d4] = 0x1000db01
<7> [113.661529] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d8] = 0x0000db1c
<7> [113.661630] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs1 save-restore MMIOs
<7> [113.661699] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c050] = 0x10001000
<7> [113.661768] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0c4] = 0x3f7e0308
<7> [113.661839] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0d4] = 0xc080c080
<7> [113.661911] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d0] = 0x1000dafc
<7> [113.661980] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d4] = 0x1000db01
<7> [113.662049] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d8] = 0x0000db1c
<7> [113.662122] xe 0000:03:00.0: [drm:xe_gt_apply_ccs_mode [xe]] Tile0: GT0: CCS_MODE=fff0fc0 config:00400000, num_engines:1, num_slices:2
<6> [113.662231] xe 0000:03:00.0: [drm] Tile0: GT0: reset done
<5> [113.662235] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=8169, lrc_seqno=8169, guc_id=0, flags=0x73 in no process [-1]
<7> [113.662238] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [113.662294] ------------[ cut here ]------------
<4> [113.662296] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [113.662297] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1626 at guc_exec_queue_timedout_job+0x1424/0x2400 [xe], CPU#0: kworker/u64:30/2341
<4> [113.662366] Modules linked in: xe snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mei_gsc mtd_intel_dg drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp cmdlinepart hid_generic spi_nor coretemp eeepc_wmi mei_pxp mei_hdcp mtd asus_wmi sparse_keymap platform_profile wmi_bmof binfmt_misc kvm_intel usbhid hid kvm irqbypass r8169 ghash_clmulni_intel aesni_intel snd_intel_dspcfg snd_hda_codec rapl intel_cstate snd_hda_core video snd_hwdep realtek snd_pcm snd_timer i2c_i801 spi_intel_pci mei_me snd i2c_mux idma64 spi_intel soundcore i2c_smbus mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry wmi intel_vsec pinctrl_alderlake acpi_tad acpi_pad dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink
<4> [113.662430] autofs4 [last unloaded: xe]
<4> [113.662434] CPU: 0 UID: 0 PID: 2341 Comm: kworker/u64:30 Tainted: G S U W 7.0.0-rc6-lgci-xe-xe-pw-164323v2-debug+ #1 PREEMPT(lazy)
<4> [113.662437] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [113.662438] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [113.662439] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [113.662444] RIP: 0010:guc_exec_queue_timedout_job+0x142d/0x2400 [xe]
<4> [113.662512] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 46 9f 5e e1 48 89 c6 48 8d 3d ac 94 39 00 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 c6 ee ff ff 8b 70 08 49
<4> [113.662513] RSP: 0018:ffffc900016a7ca0 EFLAGS: 00010246
<4> [113.662515] RAX: ffffffffa1202f81 RBX: 0000000000000000 RCX: 0000000000000000
<4> [113.662517] RDX: ffff888102525010 RSI: ffffffffa1202f81 RDI: ffffffffa1003ef0
<4> [113.662518] RBP: ffffc900016a7db0 R08: 0000000000000000 R09: 0000000000000000
<4> [113.662519] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [113.662520] R13: ffff888102525010 R14: ffff888136cf5018 R15: 00000000ffffffc2
<4> [113.662522] FS: 0000000000000000(0000) GS:ffff8888dac97000(0000) knlGS:0000000000000000
<4> [113.662523] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [113.662525] CR2: 000000c00047b000 CR3: 000000000344c006 CR4: 0000000000f72ef0
<4> [113.662526] PKRU: 55555554
<4> [113.662528] Call Trace:
<4> [113.662529] <TASK>
<4> [113.662532] ? lock_acquire+0x20/0x2f0
<4> [113.662538] ? lock_release+0xd0/0x2b0
<4> [113.662543] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [113.662548] process_one_work+0x239/0x760
<4> [113.662555] worker_thread+0x200/0x3f0
<4> [113.662557] ? __pfx_worker_thread+0x10/0x10
<4> [113.662560] kthread+0x10d/0x150
<4> [113.662562] ? __pfx_kthread+0x10/0x10
<4> [113.662566] ret_from_fork+0x3d4/0x480
<4> [113.662568] ? __pfx_kthread+0x10/0x10
<4> [113.662571] ret_from_fork_asm+0x1a/0x30
<4> [113.662578] </TASK>
<4> [113.662584] irq event stamp: 54555
<4> [113.662586] hardirqs last enabled at (54561): [<ffffffff814a9ca9>] __up_console_sem+0x79/0xa0
<4> [113.662590] hardirqs last disabled at (54566): [<ffffffff814a9c8e>] __up_console_sem+0x5e/0xa0
<4> [113.662593] softirqs last enabled at (53458): [<ffffffff813d0e9f>] __irq_exit_rcu+0x13f/0x160
<4> [113.662596] softirqs last disabled at (53453): [<ffffffff813d0e9f>] __irq_exit_rcu+0x13f/0x160
<4> [113.662598] ---[ end trace 0000000000000000 ]---
<3> [114.695799] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: Timed out wait for G2H, fence 32285, action 5503, done no
<5> [114.706035] xe 0000:03:00.0: [drm] PF: Tile0: GT0: Failed to push PF 15 config KLVs (-ETIME)
<6> [114.706038] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a0b : 32b value 0 } # begin_ctx_id
<6> [114.706041] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0004 : 32b value 65535 } # num_contexts
<6> [114.706042] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a0a : 32b value 0 } # begin_db_id
<6> [114.706043] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0006 : 32b value 256 } # num_doorbells
<6> [114.706045] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a01 : 32b value 0 } # exec_quantum
<6> [114.706046] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a02 : 32b value 0 } # preempt_timeout
<6> [114.706048] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a03 : 32b value 0 } # cat_error_count
<6> [114.706049] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a04 : 32b value 0 } # engine_reset_count
<6> [114.706050] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a05 : 32b value 0 } # page_fault_count
<6> [114.706051] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a06 : 32b value 0 } # guc_time_us
<6> [114.706053] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a07 : 32b value 0 } # irq_time_us
<6> [114.706054] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a08 : 32b value 0 } # doorbell_time_us
<6> [114.706055] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a0d : 32b value 0 } # multi_lrc_count
<6> [114.706056] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0001 : 64b value 0x400000 } # ggtt_start
<6> [114.706058] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0002 : 64b value 0xfea00000 } # ggtt_size
<3> [114.706072] xe 0000:03:00.0: [drm] *ERROR* PF: Tile0: GT0: Failed to push self configuration (-ETIME)
<7> [114.715320] xe 0000:03:00.0: [drm:xe_gt_sriov_pf_config_restart [xe]] PF: Tile0: GT0: pushed 0 skip 24 of 24 VFs configurations
<7> [114.715445] xe 0000:03:00.0: [drm:pf_worker_restart_func [xe]] PF: Tile0: GT0: restart completed
<3> [115.977046] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=12077 recv=12075
<5> [115.986423] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=8170, lrc_seqno=8170, guc_id=0, flags=0x73 in no process [-1]
<7> [115.986427] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [115.986539] ------------[ cut here ]------------
<4> [115.986541] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [115.986542] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1626 at guc_exec_queue_timedout_job+0x1424/0x2400 [xe], CPU#8: kworker/u64:17/2328
<4> [115.986615] Modules linked in: xe snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mei_gsc mtd_intel_dg drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp cmdlinepart hid_generic spi_nor coretemp eeepc_wmi mei_pxp mei_hdcp mtd asus_wmi sparse_keymap platform_profile wmi_bmof binfmt_misc kvm_intel usbhid hid kvm irqbypass r8169 ghash_clmulni_intel aesni_intel snd_intel_dspcfg snd_hda_codec rapl intel_cstate snd_hda_core video snd_hwdep realtek snd_pcm snd_timer i2c_i801 spi_intel_pci mei_me snd i2c_mux idma64 spi_intel soundcore i2c_smbus mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry wmi intel_vsec pinctrl_alderlake acpi_tad acpi_pad dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink
<4> [115.986681] autofs4 [last unloaded: xe]
<4> [115.986686] CPU: 8 UID: 0 PID: 2328 Comm: kworker/u64:17 Tainted: G S U W 7.0.0-rc6-lgci-xe-xe-pw-164323v2-debug+ #1 PREEMPT(lazy)
<4> [115.986689] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [115.986690] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [115.986692] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [115.986697] RIP: 0010:guc_exec_queue_timedout_job+0x142d/0x2400 [xe]
<4> [115.986766] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 46 9f 5e e1 48 89 c6 48 8d 3d ac 94 39 00 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 c6 ee ff ff 8b 70 08 49
<4> [115.986767] RSP: 0018:ffffc9000210bca0 EFLAGS: 00010246
<4> [115.986769] RAX: ffffffffa1202f81 RBX: 0000000000000000 RCX: 0000000000000000
<4> [115.986771] RDX: ffff888102525010 RSI: ffffffffa1202f81 RDI: ffffffffa1003ef0
<4> [115.986772] RBP: ffffc9000210bdb0 R08: 0000000000000000 R09: 0000000000000000
<4> [115.986773] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [115.986775] R13: ffff888102525010 R14: ffff888136cf5018 R15: 00000000ffffffc2
<4> [115.986776] FS: 0000000000000000(0000) GS:ffff8888db097000(0000) knlGS:0000000000000000
<4> [115.986777] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [115.986779] CR2: 0000794c8e129c30 CR3: 000000000344c004 CR4: 0000000000f72ef0
<4> [115.986780] PKRU: 55555554
<4> [115.986781] Call Trace:
<4> [115.986782] <TASK>
<4> [115.986786] ? lock_acquire+0x20/0x2f0
<4> [115.986792] ? lock_release+0xd0/0x2b0
<4> [115.986797] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [115.986802] process_one_work+0x239/0x760
<4> [115.986809] worker_thread+0x200/0x3f0
<4> [115.986812] ? __pfx_worker_thread+0x10/0x10
<4> [115.986814] kthread+0x10d/0x150
<4> [115.986817] ? __pfx_kthread+0x10/0x10
<4> [115.986820] ret_from_fork+0x3d4/0x480
<4> [115.986825] ? __pfx_kthread+0x10/0x10
<4> [115.986828] ret_from_fork_asm+0x1a/0x30
<4> [115.986835] </TASK>
<4> [115.986836] irq event stamp: 292705
<4> [115.986837] hardirqs last enabled at (292711): [<ffffffff814a9ca9>] __up_console_sem+0x79/0xa0
<4> [115.986840] hardirqs last disabled at (292716): [<ffffffff814a9c8e>] __up_console_sem+0x5e/0xa0
<4> [115.986842] softirqs last enabled at (291944): [<ffffffff813d0e9f>] __irq_exit_rcu+0x13f/0x160
<4> [115.986844] softirqs last disabled at (291939): [<ffffffff813d0e9f>] __irq_exit_rcu+0x13f/0x160
<4> [115.986846] ---[ end trace 0000000000000000 ]---
<6> [115.986848] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [115.986915] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<6> [115.986922] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<7> [115.986971] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [115.987170] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [115.987255] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [115.987338] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x00800000
<7> [115.987416] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x11111440
<7> [115.987511] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [115.987594] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [115.987671] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [115.987749] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00000100
<7> [115.987819] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [115.987905] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [115.988010] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7> [115.989112] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<7> [115.999496] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 9ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034
<7> [115.999942] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled
<7> [116.001366] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: flag:0x1
<7> [116.001590] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: mocs entries: 16
<7> [116.001777] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[0] 0x4000 0xc
<7> [116.001948] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[1] 0x4004 0x10c
<7> [116.002117] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[2] 0x4008 0x130
<7> [116.002285] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[3] 0x400c 0x13c
<7> [116.002452] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[4] 0x4010 0x100
<7> [116.002643] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[5] 0x4014 0x100
<7> [116.002802] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[6] 0x4018 0x100
<7> [116.002958] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[7] 0x401c 0x100
<7> [116.003104] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[8] 0x4020 0x100
<7> [116.003249] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[9] 0x4024 0x100
<7> [116.003395] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[10] 0x4028 0x100
<7> [116.003554] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[11] 0x402c 0x100
<7> [116.003695] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[12] 0x4030 0x100
<7> [116.003834] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[13] 0x4034 0x100
<7> [116.003973] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[14] 0x4038 0x100
<7> [116.004110] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[15] 0x403c 0x100
<7> [116.004240] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying rcs0 save-restore MMIOs
<7> [116.004393] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2050] = 0x10001000
<7> [116.004560] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20c4] = 0x3f7e0306
<7> [116.004700] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20d4] = 0xc080c080
<7> [116.004839] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d0] = 0x00006210
<7> [116.004975] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d4] = 0x000062a8
<7> [116.005104] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d8] = 0x1000dafc
<7> [116.005228] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24dc] = 0x1000db01
<7> [116.005352] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24e0] = 0x0000db1c
<7> [116.005486] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe194] = 0x00400040
<7> [116.005615] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe48c] = 0x02000200
<7> [116.005737] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe49c] = 0x40004000
<7> [116.005865] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4c4] = 0x10401040
<7> [116.005981] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4f0] = 0x00020002
<7> [116.006104] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe530] = 0x00000400
<7> [116.006225] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7c8] = 0x04002000
<7> [116.006347] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00009100
<7> [116.006478] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x14800] = 0x00020002
<7> [116.006594] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs0 save-restore MMIOs
<7> [116.006708] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22050] = 0x10001000
<7> [116.006817] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220c4] = 0x3f7e0306
<7> [116.006924] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220d4] = 0xc080c080
<7> [116.007032] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs8 save-restore MMIOs
<7> [116.007136] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee050] = 0x10001000
<7> [116.007239] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0c4] = 0x3f7e0306
<7> [116.007343] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0d4] = 0xc080c080
<7> [116.007456] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs0 save-restore MMIOs
<7> [116.007579] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a050] = 0x10001000
<7> [116.007685] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0c4] = 0x3f7e0308
<7> [116.007779] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0d4] = 0xc080c080
<7> [116.007875] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d0] = 0x1000dafc
<7> [116.007969] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d4] = 0x1000db01
<7> [116.008060] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d8] = 0x0000db1c
<7> [116.008156] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs1 save-restore MMIOs
<7> [116.008249] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c050] = 0x10001000
<7> [116.008341] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0c4] = 0x3f7e0308
<7> [116.008433] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0d4] = 0xc080c080
<7> [116.008536] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d0] = 0x1000dafc
<7> [116.008630] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d4] = 0x1000db01
<7> [116.008718] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d8] = 0x0000db1c
<7> [116.008808] xe 0000:03:00.0: [drm:xe_gt_apply_ccs_mode [xe]] Tile0: GT0: CCS_MODE=fff0fc0 config:00400000, num_engines:1, num_slices:2
<6> [116.008937] xe 0000:03:00.0: [drm] Tile0: GT0: reset done
<5> [116.008942] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=8170, lrc_seqno=8170, guc_id=0, flags=0x73 in no process [-1]
<7> [116.008945] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [116.009014] ------------[ cut here ]------------
<4> [116.009015] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [116.009017] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1626 at guc_exec_queue_timedout_job+0x1424/0x2400 [xe], CPU#8: kworker/u64:17/2328
<4> [116.009102] Modules linked in: xe snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mei_gsc mtd_intel_dg drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp cmdlinepart hid_generic spi_nor coretemp eeepc_wmi mei_pxp mei_hdcp mtd asus_wmi sparse_keymap platform_profile wmi_bmof binfmt_misc kvm_intel usbhid hid kvm irqbypass r8169 ghash_clmulni_intel aesni_intel snd_intel_dspcfg snd_hda_codec rapl intel_cstate snd_hda_core video snd_hwdep realtek snd_pcm snd_timer i2c_i801 spi_intel_pci mei_me snd i2c_mux idma64 spi_intel soundcore i2c_smbus mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry wmi intel_vsec pinctrl_alderlake acpi_tad acpi_pad dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink
<4> [116.009174] autofs4 [last unloaded: xe]
<4> [116.009178] CPU: 8 UID: 0 PID: 2328 Comm: kworker/u64:17 Tainted: G S U W 7.0.0-rc6-lgci-xe-xe-pw-164323v2-debug+ #1 PREEMPT(lazy)
<4> [116.009182] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [116.009183] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [116.009185] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [116.009190] RIP: 0010:guc_exec_queue_timedout_job+0x142d/0x2400 [xe]
<4> [116.009269] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 46 9f 5e e1 48 89 c6 48 8d 3d ac 94 39 00 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 c6 ee ff ff 8b 70 08 49
<4> [116.009271] RSP: 0018:ffffc9000210bca0 EFLAGS: 00010246
<4> [116.009273] RAX: ffffffffa1202f81 RBX: 0000000000000000 RCX: 0000000000000000
<4> [116.009275] RDX: ffff888102525010 RSI: ffffffffa1202f81 RDI: ffffffffa1003ef0
<4> [116.009276] RBP: ffffc9000210bdb0 R08: 0000000000000000 R09: 0000000000000000
<4> [116.009278] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [116.009279] R13: ffff888102525010 R14: ffff888136cf5018 R15: 00000000ffffffc2
<4> [116.009281] FS: 0000000000000000(0000) GS:ffff8888db097000(0000) knlGS:0000000000000000
<4> [116.009282] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [116.009284] CR2: 0000794c8e129c30 CR3: 000000000344c004 CR4: 0000000000f72ef0
<4> [116.009285] PKRU: 55555554
<4> [116.009287] Call Trace:
<4> [116.009288] <TASK>
<4> [116.009292] ? lock_acquire+0x20/0x2f0
<4> [116.009299] ? lock_release+0xd0/0x2b0
<4> [116.009304] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [116.009311] process_one_work+0x239/0x760
<4> [116.009318] worker_thread+0x200/0x3f0
<4> [116.009321] ? __pfx_worker_thread+0x10/0x10
<4> [116.009323] kthread+0x10d/0x150
<4> [116.009326] ? __pfx_kthread+0x10/0x10
<4> [116.009330] ret_from_fork+0x3d4/0x480
<4> [116.009333] ? __pfx_kthread+0x10/0x10
<4> [116.009336] ret_from_fork_asm+0x1a/0x30
<4> [116.009345] </TASK>
<4> [116.009346] irq event stamp: 295837
<4> [116.009347] hardirqs last enabled at (295843): [<ffffffff814a9ca9>] __up_console_sem+0x79/0xa0
<4> [116.009350] hardirqs last disabled at (295848): [<ffffffff814a9c8e>] __up_console_sem+0x5e/0xa0
<4> [116.009353] softirqs last enabled at (294976): [<ffffffff813d0e9f>] __irq_exit_rcu+0x13f/0x160
<4> [116.009355] softirqs last disabled at (294969): [<ffffffff813d0e9f>] __irq_exit_rcu+0x13f/0x160
<4> [116.009358] ---[ end trace 0000000000000000 ]---
<6> [116.009360] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [116.009438] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<6> [116.009445] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<6> [116.115170] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [116.115197] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [116.115207] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [116.115216] nvme 0000:05:00.0: [ 0] RxErr (First)
<3> [117.063677] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: Timed out wait for G2H, fence 32306, action 5503, done no
<5> [117.073909] xe 0000:03:00.0: [drm] PF: Tile0: GT0: Failed to push PF 15 config KLVs (-ETIME)
<6> [117.073912] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a0b : 32b value 0 } # begin_ctx_id
<6> [117.073914] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0004 : 32b value 65535 } # num_contexts
<6> [117.073916] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a0a : 32b value 0 } # begin_db_id
<6> [117.073917] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0006 : 32b value 256 } # num_doorbells
<6> [117.073918] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a01 : 32b value 0 } # exec_quantum
<6> [117.073920] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a02 : 32b value 0 } # preempt_timeout
<6> [117.073921] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a03 : 32b value 0 } # cat_error_count
<6> [117.073922] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a04 : 32b value 0 } # engine_reset_count
<6> [117.073924] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a05 : 32b value 0 } # page_fault_count
<6> [117.073925] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a06 : 32b value 0 } # guc_time_us
<6> [117.073926] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a07 : 32b value 0 } # irq_time_us
<6> [117.073928] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a08 : 32b value 0 } # doorbell_time_us
<6> [117.073929] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a0d : 32b value 0 } # multi_lrc_count
<6> [117.073930] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0001 : 64b value 0x400000 } # ggtt_start
<6> [117.073932] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0002 : 64b value 0xfea00000 } # ggtt_size
<3> [117.073946] xe 0000:03:00.0: [drm] *ERROR* PF: Tile0: GT0: Failed to push self configuration (-ETIME)
<7> [117.083196] xe 0000:03:00.0: [drm:xe_gt_sriov_pf_config_restart [xe]] PF: Tile0: GT0: pushed 0 skip 24 of 24 VFs configurations
<7> [117.083348] xe 0000:03:00.0: [drm:pf_worker_restart_func [xe]] PF: Tile0: GT0: restart completed
<7> [117.083749] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [117.083985] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [117.084069] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [117.084148] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x00800000
<7> [117.084222] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x11111440
<7> [117.084294] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [117.084364] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [117.084467] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [117.084549] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00000100
<7> [117.084620] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [117.084703] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [117.084806] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7> [117.085848] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<7> [117.096594] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 10ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034
<7> [117.096933] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled
<7> [117.098398] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: flag:0x1
<7> [117.098602] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: mocs entries: 16
<7> [117.098820] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[0] 0x4000 0xc
<7> [117.099058] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[1] 0x4004 0x10c
<7> [117.099296] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[2] 0x4008 0x130
<7> [117.099575] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[3] 0x400c 0x13c
<7> [117.099783] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[4] 0x4010 0x100
<7> [117.099940] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[5] 0x4014 0x100
<7> [117.100102] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[6] 0x4018 0x100
<7> [117.100270] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[7] 0x401c 0x100
<7> [117.100587] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[8] 0x4020 0x100
<7> [117.100808] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[9] 0x4024 0x100
<7> [117.101004] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[10] 0x4028 0x100
<7> [117.101194] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[11] 0x402c 0x100
<7> [117.101377] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[12] 0x4030 0x100
<7> [117.101590] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[13] 0x4034 0x100
<7> [117.101776] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[14] 0x4038 0x100
<7> [117.101959] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[15] 0x403c 0x100
<7> [117.102141] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying rcs0 save-restore MMIOs
<7> [117.102352] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2050] = 0x10001000
<7> [117.102589] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20c4] = 0x3f7e0306
<7> [117.102805] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20d4] = 0xc080c080
<7> [117.103020] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d0] = 0x00006210
<7> [117.103224] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d4] = 0x000062a8
<7> [117.103439] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d8] = 0x1000dafc
<7> [117.103651] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24dc] = 0x1000db01
<7> [117.103867] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24e0] = 0x0000db1c
<7> [117.104089] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe194] = 0x00400040
<7> [117.104309] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe48c] = 0x02000200
<7> [117.104566] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe49c] = 0x40004000
<7> [117.104816] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4c4] = 0x10401040
<7> [117.105060] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4f0] = 0x00020002
<7> [117.105305] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe530] = 0x00000400
<7> [117.105575] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7c8] = 0x04002000
<7> [117.105820] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00009100
<7> [117.105939] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x14800] = 0x00020002
<7> [117.106017] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs0 save-restore MMIOs
<7> [117.106091] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22050] = 0x10001000
<7> [117.106164] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220c4] = 0x3f7e0306
<7> [117.106237] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220d4] = 0xc080c080
<7> [117.106313] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs8 save-restore MMIOs
<7> [117.106390] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee050] = 0x10001000
<7> [117.106472] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0c4] = 0x3f7e0306
<7> [117.106545] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0d4] = 0xc080c080
<7> [117.106621] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs0 save-restore MMIOs
<7> [117.106693] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a050] = 0x10001000
<7> [117.106765] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0c4] = 0x3f7e0308
<7> [117.106836] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0d4] = 0xc080c080
<7> [117.106909] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d0] = 0x1000dafc
<7> [117.106980] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d4] = 0x1000db01
<7> [117.107052] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d8] = 0x0000db1c
<7> [117.107128] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs1 save-restore MMIOs
<7> [117.107201] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c050] = 0x10001000
<7> [117.107275] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0c4] = 0x3f7e0308
<7> [117.107347] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0d4] = 0xc080c080
<7> [117.107422] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d0] = 0x1000dafc
<7> [117.107495] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d4] = 0x1000db01
<7> [117.107567] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d8] = 0x0000db1c
<7> [117.107641] xe 0000:03:00.0: [drm:xe_gt_apply_ccs_mode [xe]] Tile0: GT0: CCS_MODE=fff0fc0 config:00400000, num_engines:1, num_slices:2
<6> [117.107756] xe 0000:03:00.0: [drm] Tile0: GT0: reset done
<5> [117.107763] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=8170, lrc_seqno=8170, guc_id=0, flags=0x73 in no process [-1]
<7> [117.107767] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [117.107829] ------------[ cut here ]------------
<4> [117.107830] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [117.107832] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1626 at guc_exec_queue_timedout_job+0x1424/0x2400 [xe], CPU#4: kworker/u64:21/2332
<4> [117.107904] Modules linked in: xe snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mei_gsc mtd_intel_dg drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp cmdlinepart hid_generic spi_nor coretemp eeepc_wmi mei_pxp mei_hdcp mtd asus_wmi sparse_keymap platform_profile wmi_bmof binfmt_misc kvm_intel usbhid hid kvm irqbypass r8169 ghash_clmulni_intel aesni_intel snd_intel_dspcfg snd_hda_codec rapl intel_cstate snd_hda_core video snd_hwdep realtek snd_pcm snd_timer i2c_i801 spi_intel_pci mei_me snd i2c_mux idma64 spi_intel soundcore i2c_smbus mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry wmi intel_vsec pinctrl_alderlake acpi_tad acpi_pad dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink
<4> [117.107976] autofs4 [last unloaded: xe]
<4> [117.107981] CPU: 4 UID: 0 PID: 2332 Comm: kworker/u64:21 Tainted: G S U W 7.0.0-rc6-lgci-xe-xe-pw-164323v2-debug+ #1 PREEMPT(lazy)
<4> [117.107984] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [117.107985] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [117.107987] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [117.107993] RIP: 0010:guc_exec_queue_timedout_job+0x142d/0x2400 [xe]
<4> [117.108065] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 46 9f 5e e1 48 89 c6 48 8d 3d ac 94 39 00 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 c6 ee ff ff 8b 70 08 49
<4> [117.108067] RSP: 0018:ffffc90003f8fca0 EFLAGS: 00010246
<4> [117.108069] RAX: ffffffffa1202f81 RBX: 0000000000000000 RCX: 0000000000000000
<4> [117.108071] RDX: ffff888102525010 RSI: ffffffffa1202f81 RDI: ffffffffa1003ef0
<4> [117.108072] RBP: ffffc90003f8fdb0 R08: 0000000000000000 R09: 0000000000000000
<4> [117.108073] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [117.108075] R13: ffff888102525010 R14: ffff888136cf5018 R15: 00000000ffffffc2
<4> [117.108076] FS: 0000000000000000(0000) GS:ffff8888dae97000(0000) knlGS:0000000000000000
<4> [117.108078] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [117.108079] CR2: 00007e1714092338 CR3: 0000000119477005 CR4: 0000000000f72ef0
<4> [117.108080] PKRU: 55555554
<4> [117.108082] Call Trace:
<4> [117.108083] <TASK>
<4> [117.108087] ? lock_acquire+0x20/0x2f0
<4> [117.108093] ? lock_release+0xd0/0x2b0
<4> [117.108099] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [117.108104] process_one_work+0x239/0x760
<4> [117.108112] worker_thread+0x200/0x3f0
<4> [117.108115] ? __pfx_worker_thread+0x10/0x10
<4> [117.108117] kthread+0x10d/0x150
<4> [117.108120] ? __pfx_kthread+0x10/0x10
<4> [117.108124] ret_from_fork+0x3d4/0x480
<4> [117.108127] ? __pfx_kthread+0x10/0x10
<4> [117.108130] ret_from_fork_asm+0x1a/0x30
<4> [117.108138] </TASK>
<4> [117.108139] irq event stamp: 56693
<4> [117.108141] hardirqs last enabled at (56699): [<ffffffff814a9ca9>] __up_console_sem+0x79/0xa0
<4> [117.108144] hardirqs last disabled at (56704): [<ffffffff814a9c8e>] __up_console_sem+0x5e/0xa0
<4> [117.108146] softirqs last enabled at (55670): [<ffffffff813d0e9f>] __irq_exit_rcu+0x13f/0x160
<4> [117.108149] softirqs last disabled at (55663): [<ffffffff813d0e9f>] __irq_exit_rcu+0x13f/0x160
<4> [117.108151] ---[ end trace 0000000000000000 ]---
<3> [118.151683] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: Timed out wait for G2H, fence 32328, action 5503, done no
<5> [118.161915] xe 0000:03:00.0: [drm] PF: Tile0: GT0: Failed to push PF 15 config KLVs (-ETIME)
<6> [118.161919] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a0b : 32b value 0 } # begin_ctx_id
<6> [118.161921] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0004 : 32b value 65535 } # num_contexts
<6> [118.161922] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a0a : 32b value 0 } # begin_db_id
<6> [118.161923] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0006 : 32b value 256 } # num_doorbells
<6> [118.161925] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a01 : 32b value 0 } # exec_quantum
<6> [118.161926] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a02 : 32b value 0 } # preempt_timeout
<6> [118.161927] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a03 : 32b value 0 } # cat_error_count
<6> [118.161929] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a04 : 32b value 0 } # engine_reset_count
<6> [118.161930] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a05 : 32b value 0 } # page_fault_count
<6> [118.161931] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a06 : 32b value 0 } # guc_time_us
<6> [118.161933] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a07 : 32b value 0 } # irq_time_us
<6> [118.161934] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a08 : 32b value 0 } # doorbell_time_us
<6> [118.161935] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a0d : 32b value 0 } # multi_lrc_count
<6> [118.161936] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0001 : 64b value 0x400000 } # ggtt_start
<6> [118.161938] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0002 : 64b value 0xfea00000 } # ggtt_size
<3> [118.161952] xe 0000:03:00.0: [drm] *ERROR* PF: Tile0: GT0: Failed to push self configuration (-ETIME)
<7> [118.171438] xe 0000:03:00.0: [drm:xe_gt_sriov_pf_config_restart [xe]] PF: Tile0: GT0: pushed 0 skip 24 of 24 VFs configurations
<7> [118.171570] xe 0000:03:00.0: [drm:pf_worker_restart_func [xe]] PF: Tile0: GT0: restart completed
<3> [119.367660] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=12078 recv=12077
<7> [119.891309] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x2f2f2c2d
<7> [119.891472] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x2f2f2e30
<3> [121.671588] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=12079 recv=12077
<7> [121.681575] xe 0000:03:00.0: [drm:drm_pagemap_dev_unhold_work [drm_gpusvm_helper]] Releasing reference on provider device and module.
<3> [123.975487] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=12080 recv=12077
<3> [126.279410] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=12081 recv=12077
<7> [126.304055] xe 0000:03:00.0: [drm:drm_client_dev_restore] fbdev: ret=0
|