Result: 81 Warning(s)
i915_display_info11 igt_runner11 results11.json results11-xe-load.json guc_logs11.tar i915_display_info_post_exec11 serial_data11 boot11 dmesg11
| Detail | Value |
|---|---|
| Duration | unknown |
| Hostname |
shard-bmg-2 |
| Igt-Version |
IGT-Version: 2.4-g98b65acc4 (x86_64) (Linux: 7.1.0-rc4-lgci-xe-xe-5118-60d51bdeabf700864-debug+ x86_64) |
| Out |
Using IGT_SRANDOM=1779495453 for randomisation Opened device: /dev/dri/card0 Starting subtest: many-large-execqueues-free-race runner: This test was killed due to a kernel taint (0x244). This test caused an abort condition: Kernel badly tainted (0x244, 0x200) (check dmesg for details): TAINT_WARN: WARN_ON has happened. |
| Err |
Starting subtest: many-large-execqueues-free-race Received signal SIGQUIT. Stack trace: #0 [fatal_sig_handler+0x17b] #1 [__sigaction+0x50] #2 [ioctl+0x3b] #3 [drmIoctl+0x30] #4 [__xe_wait_ufence+0x84] #5 [xe_wait_ufence+0x17] #6 [test_exec+0xe26] #7 [__igt_unique____real_main2422+0x2a3b] #8 [main+0x2d] #9 [__libc_init_first+0x8a] #10 [__libc_start_main+0x8b] #11 [_start+0x25] |
| Dmesg |
<6> [133.734333] Console: switching to colour dummy device 80x25
<6> [133.734561] [IGT] xe_exec_system_allocator: executing
<6> [133.750864] [IGT] xe_exec_system_allocator: starting subtest many-large-execqueues-free-race
<7> [133.751162] xe 0000:03:00.0: [drm:drm_pagemap_dev_unhold_work [drm_gpusvm_helper]] Releasing reference on provider device and module.
<7> [133.773879] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 512; Own pages: 0.
<7> [133.781134] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=115, gpusvm=ffff88814dda1190, errno=-EOPNOTSUPP
<7> [133.784787] xe 0000:03:00.0: [drm:xe_svm_garbage_collector [xe]] Skipping madvise reset for vma.
<7> [133.790233] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 512; Own pages: 0.
<7> [133.797551] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=115, gpusvm=ffff88814dda1190, errno=-EOPNOTSUPP
<7> [133.799597] xe 0000:03:00.0: [drm:xe_svm_garbage_collector [xe]] Skipping madvise reset for vma.
<7> [133.803982] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 512; Own pages: 0.
<7> [133.806644] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=115, gpusvm=ffff88814dda1190, errno=-EOPNOTSUPP
<7> [133.807459] xe 0000:03:00.0: [drm:xe_svm_garbage_collector [xe]] Skipping madvise reset for vma.
<7> [133.810784] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 512; Own pages: 0.
<7> [133.814818] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=115, gpusvm=ffff88814dda1190, errno=-EOPNOTSUPP
<7> [133.815853] xe 0000:03:00.0: [drm:xe_svm_garbage_collector [xe]] Skipping madvise reset for vma.
<7> [133.819398] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 512; Own pages: 0.
<7> [133.822264] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=115, gpusvm=ffff88814dda1190, errno=-EOPNOTSUPP
<7> [133.823096] xe 0000:03:00.0: [drm:xe_svm_garbage_collector [xe]] Skipping madvise reset for vma.
<7> [133.826663] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 512; Own pages: 0.
<7> [133.831097] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=115, gpusvm=ffff88814dda1190, errno=-EOPNOTSUPP
<7> [133.832405] xe 0000:03:00.0: [drm:xe_svm_garbage_collector [xe]] Skipping madvise reset for vma.
<7> [133.837887] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 512; Own pages: 0.
<7> [133.841699] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=115, gpusvm=ffff88814dda1190, errno=-EOPNOTSUPP
<7> [133.842705] xe 0000:03:00.0: [drm:xe_svm_garbage_collector [xe]] Skipping madvise reset for vma.
<7> [133.846418] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 512; Own pages: 0.
<7> [133.850401] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=115, gpusvm=ffff88814dda1190, errno=-EOPNOTSUPP
<7> [133.851829] xe 0000:03:00.0: [drm:xe_svm_garbage_collector [xe]] Skipping madvise reset for vma.
<7> [133.857550] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 512; Own pages: 0.
<7> [133.862706] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=115, gpusvm=ffff88814dda1190, errno=-EOPNOTSUPP
<7> [133.864037] xe 0000:03:00.0: [drm:xe_svm_garbage_collector [xe]] Skipping madvise reset for vma.
<7> [133.867923] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 512; Own pages: 0.
<7> [133.870946] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=115, gpusvm=ffff88814dda1190, errno=-EOPNOTSUPP
<7> [133.871748] xe 0000:03:00.0: [drm:xe_svm_garbage_collector [xe]] Skipping madvise reset for vma.
<7> [133.874268] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 512; Own pages: 0.
<7> [133.876539] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=115, gpusvm=ffff88814dda1190, errno=-EOPNOTSUPP
<7> [133.877352] xe 0000:03:00.0: [drm:xe_svm_garbage_collector [xe]] Skipping madvise reset for vma.
<7> [133.880445] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 512; Own pages: 0.
<7> [133.883052] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=115, gpusvm=ffff88814dda1190, errno=-EOPNOTSUPP
<7> [133.884000] xe 0000:03:00.0: [drm:xe_svm_garbage_collector [xe]] Skipping madvise reset for vma.
<7> [133.888582] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 512; Own pages: 0.
<7> [133.895699] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=115, gpusvm=ffff88814dda1190, errno=-EOPNOTSUPP
<7> [133.896693] xe 0000:03:00.0: [drm:xe_svm_garbage_collector [xe]] Skipping madvise reset for vma.
<7> [133.900600] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 512; Own pages: 0.
<7> [133.903591] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=115, gpusvm=ffff88814dda1190, errno=-EOPNOTSUPP
<7> [133.904398] xe 0000:03:00.0: [drm:xe_svm_garbage_collector [xe]] Skipping madvise reset for vma.
<7> [133.907360] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 512; Own pages: 0.
<7> [133.910876] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=115, gpusvm=ffff88814dda1190, errno=-EOPNOTSUPP
<7> [133.911544] xe 0000:03:00.0: [drm:xe_svm_garbage_collector [xe]] Skipping madvise reset for vma.
<7> [133.913801] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 512; Own pages: 0.
<7> [133.915972] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=115, gpusvm=ffff88814dda1190, errno=-EOPNOTSUPP
<7> [133.916689] xe 0000:03:00.0: [drm:xe_svm_garbage_collector [xe]] Skipping madvise reset for vma.
<7> [133.918937] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 512; Own pages: 0.
<7> [133.921136] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=115, gpusvm=ffff88814dda1190, errno=-EOPNOTSUPP
<7> [133.921788] xe 0000:03:00.0: [drm:xe_svm_garbage_collector [xe]] Skipping madvise reset for vma.
<7> [133.924018] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 512; Own pages: 0.
<7> [133.927207] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=115, gpusvm=ffff88814dda1190, errno=-EOPNOTSUPP
<7> [133.928288] xe 0000:03:00.0: [drm:xe_svm_garbage_collector [xe]] Skipping madvise reset for vma.
<7> [133.930866] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 512; Own pages: 0.
<7> [133.933685] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=115, gpusvm=ffff88814dda1190, errno=-EOPNOTSUPP
<7> [133.934654] xe 0000:03:00.0: [drm:xe_svm_garbage_collector [xe]] Skipping madvise reset for vma.
<7> [133.938405] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 512; Own pages: 0.
<7> [133.942188] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=115, gpusvm=ffff88814dda1190, errno=-EOPNOTSUPP
<7> [133.943514] xe 0000:03:00.0: [drm:xe_svm_garbage_collector [xe]] Skipping madvise reset for vma.
<7> [133.948114] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 512; Own pages: 0.
<7> [133.951989] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=115, gpusvm=ffff88814dda1190, errno=-EOPNOTSUPP
<7> [133.953320] xe 0000:03:00.0: [drm:xe_svm_garbage_collector [xe]] Skipping madvise reset for vma.
<7> [133.957860] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 512; Own pages: 0.
<7> [133.961495] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=115, gpusvm=ffff88814dda1190, errno=-EOPNOTSUPP
<7> [133.962495] xe 0000:03:00.0: [drm:xe_svm_garbage_collector [xe]] Skipping madvise reset for vma.
<7> [133.966409] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 512; Own pages: 0.
<7> [133.970593] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=115, gpusvm=ffff88814dda1190, errno=-EOPNOTSUPP
<7> [133.971304] xe 0000:03:00.0: [drm:xe_svm_garbage_collector [xe]] Skipping madvise reset for vma.
<7> [133.975684] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 512; Own pages: 0.
<7> [133.979708] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=115, gpusvm=ffff88814dda1190, errno=-EOPNOTSUPP
<7> [133.981138] xe 0000:03:00.0: [drm:xe_svm_garbage_collector [xe]] Skipping madvise reset for vma.
<7> [133.987335] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 512; Own pages: 0.
<7> [135.529127] xe 0000:03:00.0: [drm:intel_power_well_enable [xe]] enabling AUX_TC2
<7> [135.633011] xe 0000:03:00.0: [drm:intel_power_well_disable [xe]] disabling AUX_TC2
<7> [139.432521] xe 0000:03:00.0: [drm:xe_hw_engine_snapshot_capture [xe]] Tile0: GT0: Proceeding with manual engine snapshot
<4> [139.433872] ------------[ cut here ]------------
<4> [139.433879] xe 0000:03:00.0: [drm] Tile0: GT0: Unexpected engine class:instance 3:8 for utilization
<4> [139.433889] WARNING: drivers/gpu/drm/xe/xe_lrc.c:2627 at engine_id_to_hwe+0x7a/0xc0 [xe], CPU#0: kworker/u64:37/2785
<4> [139.434315] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy gpu_sched drm_ttm_helper ttm drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp coretemp spi_nor hid_generic mtd eeepc_wmi asus_wmi sparse_keymap mei_hdcp mei_pxp wmi_bmof kvm_intel snd_hda_intel kvm snd_intel_dspcfg irqbypass snd_hda_codec aesni_intel gf128mul snd_hda_core usbhid snd_hwdep r8169 rapl intel_cstate hid i2c_i801 snd_pcm realtek spi_intel_pci i2c_mux phy_package i2c_smbus video spi_intel snd_timer binfmt_misc snd soundcore idma64 intel_pmc_core pmt_telemetry pmt_discovery nls_iso8859_1 pmt_class pinctrl_alderlake acpi_pad intel_pmc_ssram_telemetry acpi_tad mei_me intel_vsec mei wmi dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink autofs4
<4> [139.434679] CPU: 0 UID: 0 PID: 2785 Comm: kworker/u64:37 Tainted: G S U 7.1.0-rc4-lgci-xe-xe-5118-60d51bdeabf700864-debug+ #1 PREEMPT(lazy)
<4> [139.434695] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER
<4> [139.434701] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [139.434708] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [139.434735] RIP: 0010:engine_id_to_hwe+0x89/0xc0 [xe]
<4> [139.435135] Code: 3f 45 0f b6 64 24 26 44 0f b6 70 08 e8 50 8f 3b e1 48 89 c6 48 8d 3d 66 4c d8 ff 53 41 0f b6 ce 45 89 e9 45 0f b6 c4 4c 89 fa <67> 48 0f b9 3a 45 31 f6 58 48 8d 65 d8 4c 89 f0 5b 41 5c 41 5d 41
<4> [139.435144] RSP: 0018:ffffc90005127bd8 EFLAGS: 00010246
<4> [139.435156] RAX: ffffffffa12bc20d RBX: 0000000000000008 RCX: 0000000000000000
<4> [139.435164] RDX: ffff88810379d010 RSI: ffffffffa12bc20d RDI: ffffffffa0c04940
<4> [139.435171] RBP: ffffc90005127c08 R08: 0000000000000000 R09: 0000000000000003
<4> [139.435177] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [139.435183] R13: 0000000000000003 R14: 0000000000000000 R15: ffff88810379d010
<4> [139.435189] FS: 0000000000000000(0000) GS:ffff8888dac83000(0000) knlGS:0000000000000000
<4> [139.435197] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [139.435205] CR2: 00005f4fd0729458 CR3: 000000000344a006 CR4: 0000000000f72ef0
<4> [139.435212] PKRU: 55555554
<4> [139.435218] Call Trace:
<4> [139.435224] <TASK>
<4> [139.435241] xe_lrc_context_timestamp+0x9e/0x200 [xe]
<4> [139.435882] ? xe_lrc_start_seqno+0x33/0x70 [xe]
<4> [139.435954] xe_lrc_timestamp+0x25/0x30 [xe]
<4> [139.436024] guc_exec_queue_timedout_job+0xf76/0x2400 [xe]
<4> [139.436095] ? lock_acquire+0xb0/0x300
<4> [139.436100] ? lock_release+0xd0/0x2b0
<4> [139.436104] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [139.436110] process_one_work+0x239/0x740
<4> [139.436117] worker_thread+0x200/0x3f0
<4> [139.436119] ? __pfx_worker_thread+0x10/0x10
<4> [139.436122] kthread+0x10d/0x150
<4> [139.436124] ? __pfx_kthread+0x10/0x10
<4> [139.436126] ret_from_fork+0x3bd/0x470
<4> [139.436129] ? __pfx_kthread+0x10/0x10
<4> [139.436132] ret_from_fork_asm+0x1a/0x30
<4> [139.436139] </TASK>
<4> [139.436140] irq event stamp: 5271
<4> [139.436141] hardirqs last enabled at (5277): [<ffffffff814ab629>] __up_console_sem+0x79/0xa0
<4> [139.436143] hardirqs last disabled at (5282): [<ffffffff814ab60e>] __up_console_sem+0x5e/0xa0
<4> [139.436145] softirqs last enabled at (5052): [<ffffffff813d233b>] __irq_exit_rcu+0xdb/0x1c0
<4> [139.436148] softirqs last disabled at (5047): [<ffffffff813d233b>] __irq_exit_rcu+0xdb/0x1c0
<4> [139.436150] ---[ end trace 0000000000000000 ]---
<7> [139.436155] xe 0000:03:00.0: [drm:guc_exec_queue_timedout_job [xe]] Tile0: GT0: Check job timeout: seqno=5524, lrc_seqno=5524, guc_id=0, running_time_ms=234423, timeout_ms=5000, diff=0xff8058de
<6> [140.077633] xe 0000:03:00.0: [drm] Tile0: GT0: Engine reset: engine_class=bcs, logical_mask: 0x2, guc_id=0, state=0x209
<5> [140.078244] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=5524, lrc_seqno=5524, guc_id=0, flags=0x73 in no process [-1]
<6> [140.261886] xe 0000:03:00.0: [drm] Xe device coredump has been created
<6> [140.261906] xe 0000:03:00.0: [drm] Check your /sys/class/drm/card0/device/devcoredump/data
<4> [140.261909] ------------[ cut here ]------------
<4> [140.261910] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [140.261912] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1618 at guc_exec_queue_timedout_job+0x141a/0x2400 [xe], CPU#0: kworker/u64:37/2785
<4> [140.262005] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy gpu_sched drm_ttm_helper ttm drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp coretemp spi_nor hid_generic mtd eeepc_wmi asus_wmi sparse_keymap mei_hdcp mei_pxp wmi_bmof kvm_intel snd_hda_intel kvm snd_intel_dspcfg irqbypass snd_hda_codec aesni_intel gf128mul snd_hda_core usbhid snd_hwdep r8169 rapl intel_cstate hid i2c_i801 snd_pcm realtek spi_intel_pci i2c_mux phy_package i2c_smbus video spi_intel snd_timer binfmt_misc snd soundcore idma64 intel_pmc_core pmt_telemetry pmt_discovery nls_iso8859_1 pmt_class pinctrl_alderlake acpi_pad intel_pmc_ssram_telemetry acpi_tad mei_me intel_vsec mei wmi dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink autofs4
<4> [140.262069] CPU: 0 UID: 0 PID: 2785 Comm: kworker/u64:37 Tainted: G S U W 7.1.0-rc4-lgci-xe-xe-5118-60d51bdeabf700864-debug+ #1 PREEMPT(lazy)
<4> [140.262072] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [140.262073] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [140.262075] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [140.262080] RIP: 0010:guc_exec_queue_timedout_job+0x1423/0x2400 [xe]
<4> [140.262149] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 b0 d8 3c e1 48 89 c6 48 8d 3d 06 8c d9 ff 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 d0 ee ff ff 8b 70 08 49
<4> [140.262151] RSP: 0018:ffffc90005127ca0 EFLAGS: 00010246
<4> [140.262153] RAX: ffffffffa12bc20d RBX: 0000000000000000 RCX: 0000000000000000
<4> [140.262154] RDX: ffff88810379d010 RSI: ffffffffa12bc20d RDI: ffffffffa0c03f80
<4> [140.262156] RBP: ffffc90005127db0 R08: 0000000000000000 R09: 0000000000000000
<4> [140.262157] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [140.262157] R13: ffff88810379d010 R14: ffff88811ab48000 R15: 00000000ffffffc2
<4> [140.262159] FS: 0000000000000000(0000) GS:ffff8888dac83000(0000) knlGS:0000000000000000
<4> [140.262160] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [140.262161] CR2: 00005f4fd0729458 CR3: 000000000344a006 CR4: 0000000000f72ef0
<4> [140.262162] PKRU: 55555554
<4> [140.262163] Call Trace:
<4> [140.262164] <TASK>
<4> [140.262168] ? lock_acquire+0xb0/0x300
<4> [140.262174] ? __pfx_autoremove_wake_function+0x10/0x10
<4> [140.262179] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [140.262184] process_one_work+0x239/0x740
<4> [140.262191] worker_thread+0x200/0x3f0
<4> [140.262193] ? __pfx_worker_thread+0x10/0x10
<4> [140.262196] kthread+0x10d/0x150
<4> [140.262198] ? __pfx_kthread+0x10/0x10
<4> [140.262201] ret_from_fork+0x3bd/0x470
<4> [140.262203] ? __pfx_kthread+0x10/0x10
<4> [140.262205] ret_from_fork_asm+0x1a/0x30
<4> [140.262212] </TASK>
<4> [140.262213] irq event stamp: 6921
<4> [140.262214] hardirqs last enabled at (6927): [<ffffffff814ab629>] __up_console_sem+0x79/0xa0
<4> [140.262217] hardirqs last disabled at (6932): [<ffffffff814ab60e>] __up_console_sem+0x5e/0xa0
<4> [140.262218] softirqs last enabled at (5970): [<ffffffff813d233b>] __irq_exit_rcu+0xdb/0x1c0
<4> [140.262221] softirqs last disabled at (5965): [<ffffffff813d233b>] __irq_exit_rcu+0xdb/0x1c0
<4> [140.262223] ---[ end trace 0000000000000000 ]---
<6> [140.262225] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [140.262303] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<6> [140.262580] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<7> [140.262783] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [140.263473] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [140.263580] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [140.263696] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x00800000
<7> [140.263778] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x11111440
<7> [140.263862] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [140.263949] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [140.264034] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [140.264118] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00000100
<7> [140.264201] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [140.264306] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [140.264397] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7> [140.265563] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<7> [140.276349] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 10ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034
<7> [140.276630] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled
<7> [140.278146] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: flag:0x1
<7> [140.278233] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: mocs entries: 16
<7> [140.278324] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[0] 0x4000 0xc
<7> [140.278409] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[1] 0x4004 0x10c
<7> [140.278488] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[2] 0x4008 0x130
<7> [140.278566] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[3] 0x400c 0x13c
<7> [140.278641] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[4] 0x4010 0x100
<7> [140.278719] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[5] 0x4014 0x100
<7> [140.278793] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[6] 0x4018 0x100
<7> [140.278869] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[7] 0x401c 0x100
<7> [140.278944] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[8] 0x4020 0x100
<7> [140.279017] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[9] 0x4024 0x100
<7> [140.279090] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[10] 0x4028 0x100
<7> [140.279163] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[11] 0x402c 0x100
<7> [140.279235] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[12] 0x4030 0x100
<7> [140.279322] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[13] 0x4034 0x100
<7> [140.279398] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[14] 0x4038 0x100
<7> [140.279474] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[15] 0x403c 0x100
<7> [140.279550] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying rcs0 save-restore MMIOs
<7> [140.279636] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2050] = 0x10001000
<7> [140.279719] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2098] = 0xffffffff
<7> [140.279799] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20c4] = 0x3f7e0306
<7> [140.279875] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20d4] = 0xc080c080
<7> [140.279953] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x229c] = 0x00080008
<7> [140.280030] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d0] = 0x00006210
<7> [140.280105] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d4] = 0x000062a8
<7> [140.280179] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d8] = 0x1000dafc
<7> [140.280251] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24dc] = 0x1000db01
<7> [140.280333] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24e0] = 0x0000db1c
<7> [140.280409] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe194] = 0x00400040
<7> [140.280486] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe48c] = 0x02000200
<7> [140.280562] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe49c] = 0x40004000
<7> [140.280638] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4c4] = 0x10401040
<7> [140.280714] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4f0] = 0x00020002
<7> [140.280794] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe530] = 0x00000400
<7> [140.280877] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7c8] = 0x04002000
<7> [140.280960] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00009100
<7> [140.281040] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x14800] = 0x00030003
<7> [140.281118] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs0 save-restore MMIOs
<7> [140.281193] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22050] = 0x10001000
<7> [140.281269] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22098] = 0xffffffff
<7> [140.281363] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220c4] = 0x3f7e0306
<7> [140.281439] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220d4] = 0xc080c080
<7> [140.281513] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2229c] = 0x00080008
<7> [140.281589] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs8 save-restore MMIOs
<7> [140.281666] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee050] = 0x10001000
<7> [140.281742] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee098] = 0xffffffff
<7> [140.281816] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0c4] = 0x3f7e0306
<7> [140.281889] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0d4] = 0xc080c080
<7> [140.281962] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee29c] = 0x00080008
<7> [140.282040] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs0 save-restore MMIOs
<7> [140.282116] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a050] = 0x10001000
<7> [140.282190] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a098] = 0xffffffff
<7> [140.282265] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0c4] = 0x3f7e0308
<7> [140.282356] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0d4] = 0xc080c080
<7> [140.282469] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a29c] = 0x00080008
<7> [140.282554] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d0] = 0x1000dafc
<7> [140.282627] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d4] = 0x1000db01
<7> [140.282701] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d8] = 0x0000db1c
<7> [140.282775] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs1 save-restore MMIOs
<7> [140.282848] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c050] = 0x10001000
<7> [140.282923] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c098] = 0xffffffff
<7> [140.282996] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0c4] = 0x3f7e0308
<7> [140.283069] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0d4] = 0xc080c080
<7> [140.283143] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c29c] = 0x00080008
<7> [140.283216] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d0] = 0x1000dafc
<7> [140.283300] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d4] = 0x1000db01
<7> [140.283378] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d8] = 0x0000db1c
<7> [140.283458] xe 0000:03:00.0: [drm:xe_gt_apply_ccs_mode [xe]] Tile0: GT0: CCS_MODE=fff0fc0 config:00400000, num_engines:1, num_slices:2
<6> [140.285129] xe 0000:03:00.0: [drm] Tile0: GT0: reset done
<5> [140.285135] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=4294967170, lrc_seqno=4294967170, guc_id=10, flags=0x0 in xe_exec_system_ [2807]
<7> [140.285138] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<7> [140.285313] xe 0000:03:00.0: [drm:xe_gt_sriov_pf_config_restart [xe]] PF: Tile0: GT0: pushed 0 skip 24 of 24 VFs configurations
<7> [140.285455] xe 0000:03:00.0: [drm:pf_worker_restart_func [xe]] PF: Tile0: GT0: restart completed
<7> [143.303239] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x2d2d2a2b
<7> [143.304013] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x2d2d2d2d
<5> [145.572897] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=5524, lrc_seqno=5524, guc_id=0, flags=0x73 in no process [-1]
<7> [145.572925] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [145.573368] ------------[ cut here ]------------
<4> [145.573375] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [145.573382] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1618 at guc_exec_queue_timedout_job+0x141a/0x2400 [xe], CPU#8: kworker/u64:37/2785
<4> [145.573947] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy gpu_sched drm_ttm_helper ttm drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp coretemp spi_nor hid_generic mtd eeepc_wmi asus_wmi sparse_keymap mei_hdcp mei_pxp wmi_bmof kvm_intel snd_hda_intel kvm snd_intel_dspcfg irqbypass snd_hda_codec aesni_intel gf128mul snd_hda_core usbhid snd_hwdep r8169 rapl intel_cstate hid i2c_i801 snd_pcm realtek spi_intel_pci i2c_mux phy_package i2c_smbus video spi_intel snd_timer binfmt_misc snd soundcore idma64 intel_pmc_core pmt_telemetry pmt_discovery nls_iso8859_1 pmt_class pinctrl_alderlake acpi_pad intel_pmc_ssram_telemetry acpi_tad mei_me intel_vsec mei wmi dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink autofs4
<4> [145.574285] CPU: 8 UID: 0 PID: 2785 Comm: kworker/u64:37 Tainted: G S U W 7.1.0-rc4-lgci-xe-xe-5118-60d51bdeabf700864-debug+ #1 PREEMPT(lazy)
<4> [145.574300] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [145.574307] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [145.574314] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [145.574345] RIP: 0010:guc_exec_queue_timedout_job+0x1423/0x2400 [xe]
<4> [145.574809] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 b0 d8 3c e1 48 89 c6 48 8d 3d 06 8c d9 ff 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 d0 ee ff ff 8b 70 08 49
<4> [145.574819] RSP: 0018:ffffc90005127ca0 EFLAGS: 00010246
<4> [145.574831] RAX: ffffffffa12bc20d RBX: 0000000000000000 RCX: 0000000000000000
<4> [145.574838] RDX: ffff88810379d010 RSI: ffffffffa12bc20d RDI: ffffffffa0c03f80
<4> [145.574845] RBP: ffffc90005127db0 R08: 0000000000000000 R09: 0000000000000000
<4> [145.574852] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [145.574858] R13: ffff88810379d010 R14: ffff888131518818 R15: 00000000ffffffc2
<4> [145.574865] FS: 0000000000000000(0000) GS:ffff8888db083000(0000) knlGS:0000000000000000
<4> [145.574873] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [145.574881] CR2: 000071474c60b048 CR3: 000000000344a006 CR4: 0000000000f72ef0
<4> [145.574888] PKRU: 55555554
<4> [145.574894] Call Trace:
<4> [145.574901] <TASK>
<4> [145.574923] ? lock_acquire+0xb0/0x300
<4> [145.574952] ? lock_release+0xd0/0x2b0
<4> [145.574976] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [145.575006] process_one_work+0x239/0x740
<4> [145.575041] worker_thread+0x200/0x3f0
<4> [145.575057] ? __pfx_worker_thread+0x10/0x10
<4> [145.575071] kthread+0x10d/0x150
<4> [145.575083] ? __pfx_kthread+0x10/0x10
<4> [145.575098] ret_from_fork+0x3bd/0x470
<4> [145.575111] ? __pfx_kthread+0x10/0x10
<4> [145.575126] ret_from_fork_asm+0x1a/0x30
<4> [145.575166] </TASK>
<4> [145.575171] irq event stamp: 10651
<4> [145.575177] hardirqs last enabled at (10657): [<ffffffff814ab629>] __up_console_sem+0x79/0xa0
<4> [145.575191] hardirqs last disabled at (10662): [<ffffffff814ab60e>] __up_console_sem+0x5e/0xa0
<4> [145.575199] softirqs last enabled at (9906): [<ffffffff813d233b>] __irq_exit_rcu+0xdb/0x1c0
<4> [145.575215] softirqs last disabled at (9889): [<ffffffff813d233b>] __irq_exit_rcu+0xdb/0x1c0
<4> [145.575229] ---[ end trace 0000000000000000 ]---
<6> [145.575340] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [145.575798] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<6> [145.576697] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<7> [145.577208] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [145.577959] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [145.578341] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [145.578431] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x01800000
<7> [145.578518] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x11111440
<7> [145.578614] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [145.578703] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [145.578789] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [145.578870] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00000100
<7> [145.578946] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [145.579040] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [145.579127] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7> [145.580156] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<7> [145.591009] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 10ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034
<7> [145.591533] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled
<7> [145.593394] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: flag:0x1
<7> [145.593692] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: mocs entries: 16
<7> [145.593940] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[0] 0x4000 0xc
<7> [145.594182] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[1] 0x4004 0x10c
<7> [145.594421] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[2] 0x4008 0x130
<7> [145.594702] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[3] 0x400c 0x13c
<7> [145.594953] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[4] 0x4010 0x100
<7> [145.595202] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[5] 0x4014 0x100
<7> [145.595452] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[6] 0x4018 0x100
<7> [145.595705] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[7] 0x401c 0x100
<7> [145.595931] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[8] 0x4020 0x100
<7> [145.596163] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[9] 0x4024 0x100
<7> [145.596390] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[10] 0x4028 0x100
<7> [145.596614] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[11] 0x402c 0x100
<7> [145.596826] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[12] 0x4030 0x100
<7> [145.597041] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[13] 0x4034 0x100
<7> [145.597244] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[14] 0x4038 0x100
<7> [145.597447] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[15] 0x403c 0x100
<7> [145.597661] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying rcs0 save-restore MMIOs
<7> [145.597883] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2050] = 0x10001000
<7> [145.598093] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2098] = 0xffffffff
<7> [145.598300] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20c4] = 0x3f7e0306
<7> [145.598500] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20d4] = 0xc080c080
<7> [145.598711] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x229c] = 0x00080008
<7> [145.598908] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d0] = 0x00006210
<7> [145.599098] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d4] = 0x000062a8
<7> [145.599274] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d8] = 0x1000dafc
<7> [145.599450] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24dc] = 0x1000db01
<7> [145.599629] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24e0] = 0x0000db1c
<7> [145.599795] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe194] = 0x00400040
<7> [145.599962] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe48c] = 0x02000200
<7> [145.600128] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe49c] = 0x40004000
<7> [145.600294] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4c4] = 0x10401040
<7> [145.600450] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4f0] = 0x00020002
<7> [145.600615] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe530] = 0x00000400
<7> [145.600769] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7c8] = 0x04002000
<7> [145.600926] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00009100
<7> [145.601080] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x14800] = 0x00030003
<7> [145.601229] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs0 save-restore MMIOs
<7> [145.601369] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22050] = 0x10001000
<7> [145.601507] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22098] = 0xffffffff
<7> [145.601654] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220c4] = 0x3f7e0306
<7> [145.601786] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220d4] = 0xc080c080
<7> [145.601924] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2229c] = 0x00080008
<7> [145.602061] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs8 save-restore MMIOs
<7> [145.602189] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee050] = 0x10001000
<7> [145.602316] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee098] = 0xffffffff
<7> [145.602441] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0c4] = 0x3f7e0306
<7> [145.602831] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0d4] = 0xc080c080
<7> [145.602959] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee29c] = 0x00080008
<7> [145.603081] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs0 save-restore MMIOs
<7> [145.603198] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a050] = 0x10001000
<7> [145.603315] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a098] = 0xffffffff
<7> [145.603426] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0c4] = 0x3f7e0308
<7> [145.603533] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0d4] = 0xc080c080
<7> [145.603653] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a29c] = 0x00080008
<7> [145.603760] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d0] = 0x1000dafc
<7> [145.603861] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d4] = 0x1000db01
<7> [145.603964] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d8] = 0x0000db1c
<7> [145.604072] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs1 save-restore MMIOs
<7> [145.604174] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c050] = 0x10001000
<7> [145.604273] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c098] = 0xffffffff
<7> [145.604369] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0c4] = 0x3f7e0308
<7> [145.604466] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0d4] = 0xc080c080
<7> [145.604572] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c29c] = 0x00080008
<7> [145.604677] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d0] = 0x1000dafc
<7> [145.604772] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d4] = 0x1000db01
<7> [145.604865] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d8] = 0x0000db1c
<7> [145.604959] xe 0000:03:00.0: [drm:xe_gt_apply_ccs_mode [xe]] Tile0: GT0: CCS_MODE=fff0fc0 config:00400000, num_engines:1, num_slices:2
<6> [145.605178] xe 0000:03:00.0: [drm] Tile0: GT0: reset done
<7> [145.606132] xe 0000:03:00.0: [drm:xe_gt_sriov_pf_config_restart [xe]] PF: Tile0: GT0: pushed 0 skip 24 of 24 VFs configurations
<7> [145.606257] xe 0000:03:00.0: [drm:pf_worker_restart_func [xe]] PF: Tile0: GT0: restart completed
<7> [146.602787] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x29292729
<7> [146.603222] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x29292829
<5> [150.691889] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=5524, lrc_seqno=5524, guc_id=0, flags=0x73 in no process [-1]
<7> [150.691920] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [150.692414] ------------[ cut here ]------------
<4> [150.692420] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [150.692428] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1618 at guc_exec_queue_timedout_job+0x141a/0x2400 [xe], CPU#12: kworker/u64:20/2767
<4> [150.693022] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy gpu_sched drm_ttm_helper ttm drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp coretemp spi_nor hid_generic mtd eeepc_wmi asus_wmi sparse_keymap mei_hdcp mei_pxp wmi_bmof kvm_intel snd_hda_intel kvm snd_intel_dspcfg irqbypass snd_hda_codec aesni_intel gf128mul snd_hda_core usbhid snd_hwdep r8169 rapl intel_cstate hid i2c_i801 snd_pcm realtek spi_intel_pci i2c_mux phy_package i2c_smbus video spi_intel snd_timer binfmt_misc snd soundcore idma64 intel_pmc_core pmt_telemetry pmt_discovery nls_iso8859_1 pmt_class pinctrl_alderlake acpi_pad intel_pmc_ssram_telemetry acpi_tad mei_me intel_vsec mei wmi dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink autofs4
<4> [150.693410] CPU: 12 UID: 0 PID: 2767 Comm: kworker/u64:20 Tainted: G S U W 7.1.0-rc4-lgci-xe-xe-5118-60d51bdeabf700864-debug+ #1 PREEMPT(lazy)
<4> [150.693427] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [150.693434] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [150.693442] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [150.693477] RIP: 0010:guc_exec_queue_timedout_job+0x1423/0x2400 [xe]
<4> [150.693991] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 b0 d8 3c e1 48 89 c6 48 8d 3d 06 8c d9 ff 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 d0 ee ff ff 8b 70 08 49
<4> [150.694001] RSP: 0018:ffffc90004747ca0 EFLAGS: 00010246
<4> [150.694014] RAX: ffffffffa12bc20d RBX: 0000000000000000 RCX: 0000000000000000
<4> [150.694023] RDX: ffff88810379d010 RSI: ffffffffa12bc20d RDI: ffffffffa0c03f80
<4> [150.694031] RBP: ffffc90004747db0 R08: 0000000000000000 R09: 0000000000000000
<4> [150.694038] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [150.694045] R13: ffff88810379d010 R14: ffff888131518818 R15: 00000000ffffffc2
<4> [150.694053] FS: 0000000000000000(0000) GS:ffff8888db283000(0000) knlGS:0000000000000000
<4> [150.694062] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [150.694070] CR2: 000076a6ec5bb8d0 CR3: 000000000344a006 CR4: 0000000000f72ef0
<4> [150.694079] PKRU: 55555554
<4> [150.694086] Call Trace:
<4> [150.694093] <TASK>
<4> [150.694121] ? lock_acquire+0xb0/0x300
<4> [150.694155] ? lock_release+0xd0/0x2b0
<4> [150.694187] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [150.694227] process_one_work+0x239/0x740
<4> [150.694272] worker_thread+0x200/0x3f0
<4> [150.694291] ? __pfx_worker_thread+0x10/0x10
<4> [150.694307] kthread+0x10d/0x150
<4> [150.694320] ? __pfx_kthread+0x10/0x10
<4> [150.694338] ret_from_fork+0x3bd/0x470
<4> [150.694352] ? __pfx_kthread+0x10/0x10
<4> [150.694367] ret_from_fork_asm+0x1a/0x30
<4> [150.694419] </TASK>
<4> [150.694426] irq event stamp: 35623
<4> [150.694433] hardirqs last enabled at (35629): [<ffffffff814ab629>] __up_console_sem+0x79/0xa0
<4> [150.694447] hardirqs last disabled at (35634): [<ffffffff814ab60e>] __up_console_sem+0x5e/0xa0
<4> [150.694457] softirqs last enabled at (34878): [<ffffffff813d233b>] __irq_exit_rcu+0xdb/0x1c0
<4> [150.694474] softirqs last disabled at (34871): [<ffffffff813d233b>] __irq_exit_rcu+0xdb/0x1c0
<4> [150.694488] ---[ end trace 0000000000000000 ]---
<5> [150.698449] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=5525, lrc_seqno=5525, guc_id=0, flags=0x73 in no process [-1]
<7> [150.698456] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [150.698857] ------------[ cut here ]------------
<4> [150.698859] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [150.698861] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1618 at guc_exec_queue_timedout_job+0x141a/0x2400 [xe], CPU#12: kworker/u64:37/2785
<4> [150.698992] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy gpu_sched drm_ttm_helper ttm drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp coretemp spi_nor hid_generic mtd eeepc_wmi asus_wmi sparse_keymap mei_hdcp mei_pxp wmi_bmof kvm_intel snd_hda_intel kvm snd_intel_dspcfg irqbypass snd_hda_codec aesni_intel gf128mul snd_hda_core usbhid snd_hwdep r8169 rapl intel_cstate hid i2c_i801 snd_pcm realtek spi_intel_pci i2c_mux phy_package i2c_smbus video spi_intel snd_timer binfmt_misc snd soundcore idma64 intel_pmc_core pmt_telemetry pmt_discovery nls_iso8859_1 pmt_class pinctrl_alderlake acpi_pad intel_pmc_ssram_telemetry acpi_tad mei_me intel_vsec mei wmi dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink autofs4
<4> [150.699089] CPU: 12 UID: 0 PID: 2785 Comm: kworker/u64:37 Tainted: G S U W 7.1.0-rc4-lgci-xe-xe-5118-60d51bdeabf700864-debug+ #1 PREEMPT(lazy)
<4> [150.699093] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [150.699095] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [150.699097] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [150.699105] RIP: 0010:guc_exec_queue_timedout_job+0x1423/0x2400 [xe]
<4> [150.699227] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 b0 d8 3c e1 48 89 c6 48 8d 3d 06 8c d9 ff 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 d0 ee ff ff 8b 70 08 49
<4> [150.699229] RSP: 0018:ffffc90005127ca0 EFLAGS: 00010246
<4> [150.699232] RAX: ffffffffa12bc20d RBX: 0000000000000000 RCX: 0000000000000000
<4> [150.699234] RDX: ffff88810379d010 RSI: ffffffffa12bc20d RDI: ffffffffa0c03f80
<4> [150.699236] RBP: ffffc90005127db0 R08: 0000000000000000 R09: 0000000000000000
<4> [150.699237] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [150.699239] R13: ffff88810379d010 R14: ffff888131518818 R15: 00000000ffffffc2
<4> [150.699241] FS: 0000000000000000(0000) GS:ffff8888db283000(0000) knlGS:0000000000000000
<4> [150.699243] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [150.699245] CR2: 000076a6ec5bb8d0 CR3: 000000000344a006 CR4: 0000000000f72ef0
<4> [150.699247] PKRU: 55555554
<4> [150.699248] Call Trace:
<4> [150.699250] <TASK>
<4> [150.699257] ? lock_acquire+0xb0/0x300
<4> [150.699266] ? lock_release+0xd0/0x2b0
<4> [150.699273] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [150.699283] process_one_work+0x239/0x740
<4> [150.699295] worker_thread+0x200/0x3f0
<4> [150.699299] ? __pfx_worker_thread+0x10/0x10
<4> [150.699303] kthread+0x10d/0x150
<4> [150.699306] ? __pfx_kthread+0x10/0x10
<4> [150.699310] ret_from_fork+0x3bd/0x470
<4> [150.699314] ? __pfx_kthread+0x10/0x10
<4> [150.699317] ret_from_fork_asm+0x1a/0x30
<4> [150.699330] </TASK>
<4> [150.699332] irq event stamp: 21157
<4> [150.699333] hardirqs last enabled at (21163): [<ffffffff814ab629>] __up_console_sem+0x79/0xa0
<4> [150.699336] hardirqs last disabled at (21168): [<ffffffff814ab60e>] __up_console_sem+0x5e/0xa0
<4> [150.699339] softirqs last enabled at (20334): [<ffffffff813d233b>] __irq_exit_rcu+0xdb/0x1c0
<4> [150.699343] softirqs last disabled at (20329): [<ffffffff813d233b>] __irq_exit_rcu+0xdb/0x1c0
<4> [150.699346] ---[ end trace 0000000000000000 ]---
<6> [150.699349] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [150.699471] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<6> [150.699481] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<7> [150.699542] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [150.699841] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [150.699985] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [150.700130] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x01800000
<7> [150.700270] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x11111440
<7> [150.700410] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [150.700553] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [150.700695] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [150.700836] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00000100
<7> [150.700973] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [150.701123] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [150.701269] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7> [150.702708] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<7> [150.712599] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 9ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034
<7> [150.713026] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled
<7> [150.714492] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: flag:0x1
<7> [150.714711] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: mocs entries: 16
<7> [150.714911] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[0] 0x4000 0xc
<7> [150.715113] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[1] 0x4004 0x10c
<7> [150.715320] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[2] 0x4008 0x130
<7> [150.715540] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[3] 0x400c 0x13c
<7> [150.715758] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[4] 0x4010 0x100
<7> [150.715977] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[5] 0x4014 0x100
<7> [150.716192] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[6] 0x4018 0x100
<7> [150.716398] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[7] 0x401c 0x100
<7> [150.716603] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[8] 0x4020 0x100
<7> [150.716804] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[9] 0x4024 0x100
<7> [150.717002] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[10] 0x4028 0x100
<7> [150.717201] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[11] 0x402c 0x100
<7> [150.717393] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[12] 0x4030 0x100
<7> [150.717584] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[13] 0x4034 0x100
<7> [150.717768] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[14] 0x4038 0x100
<7> [150.717942] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[15] 0x403c 0x100
<7> [150.718116] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying rcs0 save-restore MMIOs
<7> [150.718301] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2050] = 0x10001000
<7> [150.718483] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2098] = 0xffffffff
<7> [150.718668] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20c4] = 0x3f7e0306
<7> [150.718837] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20d4] = 0xc080c080
<7> [150.719006] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x229c] = 0x00080008
<7> [150.719173] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d0] = 0x00006210
<7> [150.719332] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d4] = 0x000062a8
<7> [150.719487] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d8] = 0x1000dafc
<7> [150.719644] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24dc] = 0x1000db01
<7> [150.719794] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24e0] = 0x0000db1c
<7> [150.719947] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe194] = 0x00400040
<7> [150.720098] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe48c] = 0x02000200
<7> [150.720244] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe49c] = 0x40004000
<7> [150.720387] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4c4] = 0x10401040
<7> [150.720532] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4f0] = 0x00020002
<7> [150.720678] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe530] = 0x00000400
<7> [150.720822] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7c8] = 0x04002000
<7> [150.720968] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00009100
<7> [150.721112] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x14800] = 0x00030003
<7> [150.721255] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs0 save-restore MMIOs
<7> [150.721386] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22050] = 0x10001000
<7> [150.721522] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22098] = 0xffffffff
<7> [150.721655] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220c4] = 0x3f7e0306
<7> [150.721785] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220d4] = 0xc080c080
<7> [150.721917] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2229c] = 0x00080008
<7> [150.722051] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs8 save-restore MMIOs
<7> [150.722184] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee050] = 0x10001000
<7> [150.722316] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee098] = 0xffffffff
<7> [150.722448] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0c4] = 0x3f7e0306
<7> [150.722582] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0d4] = 0xc080c080
<7> [150.722716] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee29c] = 0x00080008
<7> [150.722851] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs0 save-restore MMIOs
<7> [150.722983] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a050] = 0x10001000
<7> [150.723115] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a098] = 0xffffffff
<7> [150.723246] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0c4] = 0x3f7e0308
<7> [150.723375] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0d4] = 0xc080c080
<7> [150.723758] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a29c] = 0x00080008
<7> [150.723868] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d0] = 0x1000dafc
<7> [150.723956] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d4] = 0x1000db01
<7> [150.724037] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d8] = 0x0000db1c
<7> [150.724115] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs1 save-restore MMIOs
<7> [150.724190] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c050] = 0x10001000
<7> [150.724270] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c098] = 0xffffffff
<7> [150.724344] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0c4] = 0x3f7e0308
<7> [150.724416] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0d4] = 0xc080c080
<7> [150.724490] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c29c] = 0x00080008
<7> [150.724583] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d0] = 0x1000dafc
<7> [150.724663] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d4] = 0x1000db01
<7> [150.724738] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d8] = 0x0000db1c
<7> [150.724816] xe 0000:03:00.0: [drm:xe_gt_apply_ccs_mode [xe]] Tile0: GT0: CCS_MODE=fff0fc0 config:00400000, num_engines:1, num_slices:2
<6> [150.725241] xe 0000:03:00.0: [drm] Tile0: GT0: reset done
<5> [150.725246] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=5525, lrc_seqno=5525, guc_id=0, flags=0x73 in no process [-1]
<7> [150.725249] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [150.725323] ------------[ cut here ]------------
<4> [150.725325] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [150.725326] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1618 at guc_exec_queue_timedout_job+0x141a/0x2400 [xe], CPU#2: kworker/u64:37/2785
<4> [150.725409] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy gpu_sched drm_ttm_helper ttm drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp coretemp spi_nor hid_generic mtd eeepc_wmi asus_wmi sparse_keymap mei_hdcp mei_pxp wmi_bmof kvm_intel snd_hda_intel kvm snd_intel_dspcfg irqbypass snd_hda_codec aesni_intel gf128mul snd_hda_core usbhid snd_hwdep r8169 rapl intel_cstate hid i2c_i801 snd_pcm realtek spi_intel_pci i2c_mux phy_package i2c_smbus video spi_intel snd_timer binfmt_misc snd soundcore idma64 intel_pmc_core pmt_telemetry pmt_discovery nls_iso8859_1 pmt_class pinctrl_alderlake acpi_pad intel_pmc_ssram_telemetry acpi_tad mei_me intel_vsec mei wmi dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink autofs4
<4> [150.725475] CPU: 2 UID: 0 PID: 2785 Comm: kworker/u64:37 Tainted: G S U W 7.1.0-rc4-lgci-xe-xe-5118-60d51bdeabf700864-debug+ #1 PREEMPT(lazy)
<4> [150.725478] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [150.725479] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [150.725480] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [150.725487] RIP: 0010:guc_exec_queue_timedout_job+0x1423/0x2400 [xe]
<4> [150.725580] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 b0 d8 3c e1 48 89 c6 48 8d 3d 06 8c d9 ff 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 d0 ee ff ff 8b 70 08 49
<4> [150.725582] RSP: 0018:ffffc90005127ca0 EFLAGS: 00010246
<4> [150.725585] RAX: ffffffffa12bc20d RBX: 0000000000000000 RCX: 0000000000000000
<4> [150.725587] RDX: ffff88810379d010 RSI: ffffffffa12bc20d RDI: ffffffffa0c03f80
<4> [150.725588] RBP: ffffc90005127db0 R08: 0000000000000000 R09: 0000000000000000
<4> [150.725589] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [150.725591] R13: ffff88810379d010 R14: ffff888131518818 R15: 00000000ffffffc2
<4> [150.725592] FS: 0000000000000000(0000) GS:ffff8888dad83000(0000) knlGS:0000000000000000
<4> [150.725594] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [150.725595] CR2: 000000c0001df1a8 CR3: 000000000344a005 CR4: 0000000000f72ef0
<4> [150.725597] PKRU: 55555554
<4> [150.725598] Call Trace:
<4> [150.725599] <TASK>
<4> [150.725604] ? lock_acquire+0xb0/0x300
<4> [150.725610] ? lock_release+0xd0/0x2b0
<4> [150.725615] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [150.725621] process_one_work+0x239/0x740
<4> [150.725628] worker_thread+0x200/0x3f0
<4> [150.725631] ? __pfx_worker_thread+0x10/0x10
<4> [150.725634] kthread+0x10d/0x150
<4> [150.725636] ? __pfx_kthread+0x10/0x10
<4> [150.725639] ret_from_fork+0x3bd/0x470
<4> [150.725642] ? __pfx_kthread+0x10/0x10
<4> [150.725644] ret_from_fork_asm+0x1a/0x30
<4> [150.725653] </TASK>
<4> [150.725654] irq event stamp: 24783
<4> [150.725655] hardirqs last enabled at (24789): [<ffffffff814ab629>] __up_console_sem+0x79/0xa0
<4> [150.725657] hardirqs last disabled at (24794): [<ffffffff814ab60e>] __up_console_sem+0x5e/0xa0
<4> [150.725659] softirqs last enabled at (24622): [<ffffffff813d233b>] __irq_exit_rcu+0xdb/0x1c0
<4> [150.725662] softirqs last disabled at (24617): [<ffffffff813d233b>] __irq_exit_rcu+0xdb/0x1c0
<4> [150.725665] ---[ end trace 0000000000000000 ]---
<6> [150.725667] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [150.725747] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<6> [150.725754] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<3> [151.779620] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: Timed out wait for G2H, fence 17344, action 5503, done no
<5> [151.781430] xe 0000:03:00.0: [drm] PF: Tile0: GT0: Failed to push PF 15 config KLVs (-ETIME)
<6> [151.781459] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a0b : 32b value 0 } # begin_ctx_id
<6> [151.781470] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0004 : 32b value 65535 } # num_contexts
<6> [151.781479] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a0a : 32b value 0 } # begin_db_id
<6> [151.781487] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0006 : 32b value 256 } # num_doorbells
<6> [151.781495] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a01 : 32b value 0 } # exec_quantum
<6> [151.781502] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a02 : 32b value 0 } # preempt_timeout
<6> [151.781509] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a03 : 32b value 0 } # cat_error_count
<6> [151.781516] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a04 : 32b value 0 } # engine_reset_count
<6> [151.781523] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a05 : 32b value 0 } # page_fault_count
<6> [151.781529] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a06 : 32b value 0 } # guc_time_us
<6> [151.781536] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a07 : 32b value 0 } # irq_time_us
<6> [151.781543] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a08 : 32b value 0 } # doorbell_time_us
<6> [151.781550] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a0d : 32b value 0 } # multi_lrc_count
<6> [151.781557] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0001 : 64b value 0x400000 } # ggtt_start
<6> [151.781565] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0002 : 64b value 0xfea00000 } # ggtt_size
<3> [151.781607] xe 0000:03:00.0: [drm] *ERROR* PF: Tile0: GT0: Failed to push self configuration (-ETIME)
<7> [151.781919] xe 0000:03:00.0: [drm:xe_gt_sriov_pf_config_restart [xe]] PF: Tile0: GT0: pushed 0 skip 24 of 24 VFs configurations
<7> [151.782649] xe 0000:03:00.0: [drm:pf_worker_restart_func [xe]] PF: Tile0: GT0: restart completed
<7> [151.783767] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [151.784614] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [151.785148] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [151.785698] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x01800000
<7> [151.786160] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x11111440
<7> [151.786640] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [151.787069] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [151.787543] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [151.787994] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00000100
<7> [151.788440] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [151.788963] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [151.789398] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7> [151.790426] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<7> [151.801271] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 10ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034
<7> [151.802134] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled
<7> [151.804426] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: flag:0x1
<7> [151.804853] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: mocs entries: 16
<7> [151.805257] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[0] 0x4000 0xc
<7> [151.805692] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[1] 0x4004 0x10c
<7> [151.806066] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[2] 0x4008 0x130
<7> [151.806480] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[3] 0x400c 0x13c
<7> [151.806852] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[4] 0x4010 0x100
<7> [151.807210] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[5] 0x4014 0x100
<7> [151.807570] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[6] 0x4018 0x100
<7> [151.807887] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[7] 0x401c 0x100
<7> [151.808200] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[8] 0x4020 0x100
<7> [151.808532] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[9] 0x4024 0x100
<7> [151.808823] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[10] 0x4028 0x100
<7> [151.809104] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[11] 0x402c 0x100
<7> [151.809400] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[12] 0x4030 0x100
<7> [151.809666] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[13] 0x4034 0x100
<7> [151.809921] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[14] 0x4038 0x100
<7> [151.810169] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[15] 0x403c 0x100
<7> [151.810437] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying rcs0 save-restore MMIOs
<7> [151.810716] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2050] = 0x10001000
<7> [151.810969] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2098] = 0xffffffff
<7> [151.811216] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20c4] = 0x3f7e0306
<7> [151.811483] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20d4] = 0xc080c080
<7> [151.811708] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x229c] = 0x00080008
<7> [151.811930] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d0] = 0x00006210
<7> [151.812153] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d4] = 0x000062a8
<7> [151.812368] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d8] = 0x1000dafc
<7> [151.812574] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24dc] = 0x1000db01
<7> [151.812767] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24e0] = 0x0000db1c
<7> [151.812966] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe194] = 0x00400040
<7> [151.813162] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe48c] = 0x02000200
<7> [151.813361] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe49c] = 0x40004000
<7> [151.813548] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4c4] = 0x10401040
<7> [151.813721] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4f0] = 0x00020002
<7> [151.813902] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe530] = 0x00000400
<7> [151.814085] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7c8] = 0x04002000
<7> [151.814264] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00009100
<7> [151.814454] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x14800] = 0x00030003
<7> [151.814614] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs0 save-restore MMIOs
<7> [151.814768] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22050] = 0x10001000
<7> [151.814917] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22098] = 0xffffffff
<7> [151.815064] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220c4] = 0x3f7e0306
<7> [151.815205] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220d4] = 0xc080c080
<7> [151.815353] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2229c] = 0x00080008
<7> [151.815500] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs8 save-restore MMIOs
<7> [151.815641] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee050] = 0x10001000
<7> [151.815777] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee098] = 0xffffffff
<7> [151.815907] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0c4] = 0x3f7e0306
<7> [151.816033] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0d4] = 0xc080c080
<7> [151.816162] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee29c] = 0x00080008
<7> [151.816290] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs0 save-restore MMIOs
<7> [151.816428] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a050] = 0x10001000
<7> [151.816548] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a098] = 0xffffffff
<7> [151.816666] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0c4] = 0x3f7e0308
<7> [151.816777] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0d4] = 0xc080c080
<7> [151.816889] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a29c] = 0x00080008
<7> [151.817004] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d0] = 0x1000dafc
<7> [151.817114] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d4] = 0x1000db01
<7> [151.817217] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d8] = 0x0000db1c
<7> [151.817330] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs1 save-restore MMIOs
<7> [151.817435] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c050] = 0x10001000
<7> [151.817545] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c098] = 0xffffffff
<7> [151.817653] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0c4] = 0x3f7e0308
<7> [151.817755] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0d4] = 0xc080c080
<7> [151.817857] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c29c] = 0x00080008
<7> [151.817958] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d0] = 0x1000dafc
<7> [151.818057] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d4] = 0x1000db01
<7> [151.818150] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d8] = 0x0000db1c
<7> [151.818244] xe 0000:03:00.0: [drm:xe_gt_apply_ccs_mode [xe]] Tile0: GT0: CCS_MODE=fff0fc0 config:00400000, num_engines:1, num_slices:2
<6> [151.818432] xe 0000:03:00.0: [drm] Tile0: GT0: reset done
<5> [151.818441] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=5525, lrc_seqno=5525, guc_id=0, flags=0x73 in no process [-1]
<7> [151.818444] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [151.818520] ------------[ cut here ]------------
<4> [151.818521] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [151.818523] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1618 at guc_exec_queue_timedout_job+0x141a/0x2400 [xe], CPU#4: kworker/u64:7/206
<4> [151.818609] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy gpu_sched drm_ttm_helper ttm drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp coretemp spi_nor hid_generic mtd eeepc_wmi asus_wmi sparse_keymap mei_hdcp mei_pxp wmi_bmof kvm_intel snd_hda_intel kvm snd_intel_dspcfg irqbypass snd_hda_codec aesni_intel gf128mul snd_hda_core usbhid snd_hwdep r8169 rapl intel_cstate hid i2c_i801 snd_pcm realtek spi_intel_pci i2c_mux phy_package i2c_smbus video spi_intel snd_timer binfmt_misc snd soundcore idma64 intel_pmc_core pmt_telemetry pmt_discovery nls_iso8859_1 pmt_class pinctrl_alderlake acpi_pad intel_pmc_ssram_telemetry acpi_tad mei_me intel_vsec mei wmi dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink autofs4
<4> [151.818690] CPU: 4 UID: 0 PID: 206 Comm: kworker/u64:7 Tainted: G S U W 7.1.0-rc4-lgci-xe-xe-5118-60d51bdeabf700864-debug+ #1 PREEMPT(lazy)
<4> [151.818693] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [151.818694] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [151.818696] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [151.818703] RIP: 0010:guc_exec_queue_timedout_job+0x1423/0x2400 [xe]
<4> [151.818782] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 b0 d8 3c e1 48 89 c6 48 8d 3d 06 8c d9 ff 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 d0 ee ff ff 8b 70 08 49
<4> [151.818784] RSP: 0018:ffffc900014dbca0 EFLAGS: 00010246
<4> [151.818786] RAX: ffffffffa12bc20d RBX: 0000000000000000 RCX: 0000000000000000
<4> [151.818788] RDX: ffff88810379d010 RSI: ffffffffa12bc20d RDI: ffffffffa0c03f80
<4> [151.818789] RBP: ffffc900014dbdb0 R08: 0000000000000000 R09: 0000000000000000
<4> [151.818790] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [151.818791] R13: ffff88810379d010 R14: ffff888131518818 R15: 00000000ffffffc2
<4> [151.818792] FS: 0000000000000000(0000) GS:ffff8888dae83000(0000) knlGS:0000000000000000
<4> [151.818794] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [151.818796] CR2: 000076a6ec5a8d78 CR3: 000000000344a004 CR4: 0000000000f72ef0
<4> [151.818797] PKRU: 55555554
<4> [151.818798] Call Trace:
<4> [151.818799] <TASK>
<4> [151.818804] ? lock_acquire+0xb0/0x300
<4> [151.818811] ? lock_release+0xd0/0x2b0
<4> [151.818816] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [151.818822] process_one_work+0x239/0x740
<4> [151.818830] worker_thread+0x200/0x3f0
<4> [151.818834] ? __pfx_worker_thread+0x10/0x10
<4> [151.818836] kthread+0x10d/0x150
<4> [151.818839] ? __pfx_kthread+0x10/0x10
<4> [151.818842] ret_from_fork+0x3bd/0x470
<4> [151.818845] ? __pfx_kthread+0x10/0x10
<4> [151.818848] ret_from_fork_asm+0x1a/0x30
<4> [151.818856] </TASK>
<4> [151.818858] irq event stamp: 208707
<4> [151.818859] hardirqs last enabled at (208713): [<ffffffff814ab629>] __up_console_sem+0x79/0xa0
<4> [151.818862] hardirqs last disabled at (208718): [<ffffffff814ab60e>] __up_console_sem+0x5e/0xa0
<4> [151.818863] softirqs last enabled at (207568): [<ffffffff813d233b>] __irq_exit_rcu+0xdb/0x1c0
<4> [151.818867] softirqs last disabled at (207561): [<ffffffff813d233b>] __irq_exit_rcu+0xdb/0x1c0
<4> [151.818869] ---[ end trace 0000000000000000 ]---
<5> [151.820415] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=5526, lrc_seqno=5526, guc_id=0, flags=0x73 in no process [-1]
<7> [151.820421] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [151.820521] ------------[ cut here ]------------
<4> [151.820522] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [151.820524] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1618 at guc_exec_queue_timedout_job+0x141a/0x2400 [xe], CPU#8: kworker/u64:7/206
<4> [151.820603] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy gpu_sched drm_ttm_helper ttm drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp coretemp spi_nor hid_generic mtd eeepc_wmi asus_wmi sparse_keymap mei_hdcp mei_pxp wmi_bmof kvm_intel snd_hda_intel kvm snd_intel_dspcfg irqbypass snd_hda_codec aesni_intel gf128mul snd_hda_core usbhid snd_hwdep r8169 rapl intel_cstate hid i2c_i801 snd_pcm realtek spi_intel_pci i2c_mux phy_package i2c_smbus video spi_intel snd_timer binfmt_misc snd soundcore idma64 intel_pmc_core pmt_telemetry pmt_discovery nls_iso8859_1 pmt_class pinctrl_alderlake acpi_pad intel_pmc_ssram_telemetry acpi_tad mei_me intel_vsec mei wmi dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink autofs4
<4> [151.820663] CPU: 8 UID: 0 PID: 206 Comm: kworker/u64:7 Tainted: G S U W 7.1.0-rc4-lgci-xe-xe-5118-60d51bdeabf700864-debug+ #1 PREEMPT(lazy)
<4> [151.820666] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [151.820667] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [151.820668] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [151.820673] RIP: 0010:guc_exec_queue_timedout_job+0x1423/0x2400 [xe]
<4> [151.820747] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 b0 d8 3c e1 48 89 c6 48 8d 3d 06 8c d9 ff 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 d0 ee ff ff 8b 70 08 49
<4> [151.820748] RSP: 0018:ffffc900014dbca0 EFLAGS: 00010246
<4> [151.820751] RAX: ffffffffa12bc20d RBX: 0000000000000000 RCX: 0000000000000000
<4> [151.820752] RDX: ffff88810379d010 RSI: ffffffffa12bc20d RDI: ffffffffa0c03f80
<4> [151.820753] RBP: ffffc900014dbdb0 R08: 0000000000000000 R09: 0000000000000000
<4> [151.820754] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [151.820755] R13: ffff88810379d010 R14: ffff888131518818 R15: 00000000ffffffc2
<4> [151.820756] FS: 0000000000000000(0000) GS:ffff8888db083000(0000) knlGS:0000000000000000
<4> [151.820757] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [151.820758] CR2: 00007b7a59e33b08 CR3: 0000000118ee3001 CR4: 0000000000f72ef0
<4> [151.820760] PKRU: 55555554
<4> [151.820761] Call Trace:
<4> [151.820762] <TASK>
<4> [151.820765] ? lock_acquire+0xb0/0x300
<4> [151.820770] ? lock_release+0xd0/0x2b0
<4> [151.820774] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [151.820780] process_one_work+0x239/0x740
<4> [151.820786] worker_thread+0x200/0x3f0
<4> [151.820789] ? __pfx_worker_thread+0x10/0x10
<4> [151.820791] kthread+0x10d/0x150
<4> [151.820793] ? __pfx_kthread+0x10/0x10
<4> [151.820796] ret_from_fork+0x3bd/0x470
<4> [151.820798] ? __pfx_kthread+0x10/0x10
<4> [151.820801] ret_from_fork_asm+0x1a/0x30
<4> [151.820808] </TASK>
<4> [151.820809] irq event stamp: 209593
<4> [151.820810] hardirqs last enabled at (209599): [<ffffffff814ab629>] __up_console_sem+0x79/0xa0
<4> [151.820812] hardirqs last disabled at (209604): [<ffffffff814ab60e>] __up_console_sem+0x5e/0xa0
<4> [151.820813] softirqs last enabled at (207568): [<ffffffff813d233b>] __irq_exit_rcu+0xdb/0x1c0
<4> [151.820816] softirqs last disabled at (207561): [<ffffffff813d233b>] __irq_exit_rcu+0xdb/0x1c0
<4> [151.820819] ---[ end trace 0000000000000000 ]---
<6> [151.820820] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [151.820892] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<6> [151.820898] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<3> [152.867385] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: Timed out wait for G2H, fence 17365, action 5503, done no
<5> [152.867470] xe 0000:03:00.0: [drm] PF: Tile0: GT0: Failed to push PF 15 config KLVs (-ETIME)
<6> [152.867484] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a0b : 32b value 0 } # begin_ctx_id
<6> [152.867494] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0004 : 32b value 65535 } # num_contexts
<6> [152.867503] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a0a : 32b value 0 } # begin_db_id
<6> [152.867510] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0006 : 32b value 256 } # num_doorbells
<6> [152.867518] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a01 : 32b value 0 } # exec_quantum
<6> [152.867525] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a02 : 32b value 0 } # preempt_timeout
<6> [152.867532] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a03 : 32b value 0 } # cat_error_count
<6> [152.867538] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a04 : 32b value 0 } # engine_reset_count
<6> [152.867545] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a05 : 32b value 0 } # page_fault_count
<6> [152.867552] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a06 : 32b value 0 } # guc_time_us
<6> [152.867559] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a07 : 32b value 0 } # irq_time_us
<6> [152.867566] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a08 : 32b value 0 } # doorbell_time_us
<6> [152.867573] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a0d : 32b value 0 } # multi_lrc_count
<6> [152.867580] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0001 : 64b value 0x400000 } # ggtt_start
<6> [152.867588] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0002 : 64b value 0xfea00000 } # ggtt_size
<3> [152.867634] xe 0000:03:00.0: [drm] *ERROR* PF: Tile0: GT0: Failed to push self configuration (-ETIME)
<7> [152.867716] xe 0000:03:00.0: [drm:xe_gt_sriov_pf_config_restart [xe]] PF: Tile0: GT0: pushed 0 skip 24 of 24 VFs configurations
<7> [152.868433] xe 0000:03:00.0: [drm:pf_worker_restart_func [xe]] PF: Tile0: GT0: restart completed
<7> [152.869094] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [152.869990] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [152.870557] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [152.871059] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x01800000
<7> [152.871583] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x11111440
<7> [152.872024] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [152.872494] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [152.872928] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [152.873445] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00000100
<7> [152.873827] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [152.873914] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [152.874001] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7> [152.875029] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<7> [152.885847] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 10ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034
<7> [152.886680] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled
<7> [152.889009] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: flag:0x1
<7> [152.889473] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: mocs entries: 16
<7> [152.889864] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[0] 0x4000 0xc
<7> [152.890285] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[1] 0x4004 0x10c
<7> [152.890658] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[2] 0x4008 0x130
<7> [152.891005] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[3] 0x400c 0x13c
<7> [152.891381] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[4] 0x4010 0x100
<7> [152.891722] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[5] 0x4014 0x100
<7> [152.892049] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[6] 0x4018 0x100
<7> [152.892403] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[7] 0x401c 0x100
<7> [152.892699] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[8] 0x4020 0x100
<7> [152.892998] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[9] 0x4024 0x100
<7> [152.893309] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[10] 0x4028 0x100
<7> [152.893596] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[11] 0x402c 0x100
<7> [152.893870] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[12] 0x4030 0x100
<7> [152.894141] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[13] 0x4034 0x100
<7> [152.894465] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[14] 0x4038 0x100
<7> [152.894715] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[15] 0x403c 0x100
<7> [152.894961] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying rcs0 save-restore MMIOs
<7> [152.895326] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2050] = 0x10001000
<7> [152.895624] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2098] = 0xffffffff
<7> [152.895868] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20c4] = 0x3f7e0306
<7> [152.896101] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20d4] = 0xc080c080
<7> [152.896371] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x229c] = 0x00080008
<7> [152.896583] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d0] = 0x00006210
<7> [152.896788] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d4] = 0x000062a8
<7> [152.896993] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d8] = 0x1000dafc
<7> [152.897208] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24dc] = 0x1000db01
<7> [152.897398] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24e0] = 0x0000db1c
<7> [152.897587] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe194] = 0x00400040
<7> [152.897768] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe48c] = 0x02000200
<7> [152.897948] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe49c] = 0x40004000
<7> [152.898127] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4c4] = 0x10401040
<7> [152.898330] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4f0] = 0x00020002
<7> [152.898503] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe530] = 0x00000400
<7> [152.898666] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7c8] = 0x04002000
<7> [152.898833] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00009100
<7> [152.898997] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x14800] = 0x00030003
<7> [152.899171] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs0 save-restore MMIOs
<7> [152.899327] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22050] = 0x10001000
<7> [152.899471] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22098] = 0xffffffff
<7> [152.899611] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220c4] = 0x3f7e0306
<7> [152.899741] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220d4] = 0xc080c080
<7> [152.899880] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2229c] = 0x00080008
<7> [152.900023] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs8 save-restore MMIOs
<7> [152.900168] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee050] = 0x10001000
<7> [152.900304] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee098] = 0xffffffff
<7> [152.900432] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0c4] = 0x3f7e0306
<7> [152.900557] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0d4] = 0xc080c080
<7> [152.900682] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee29c] = 0x00080008
<7> [152.900805] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs0 save-restore MMIOs
<7> [152.900922] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a050] = 0x10001000
<7> [152.901040] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a098] = 0xffffffff
<7> [152.901172] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0c4] = 0x3f7e0308
<7> [152.901292] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0d4] = 0xc080c080
<7> [152.901403] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a29c] = 0x00080008
<7> [152.901515] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d0] = 0x1000dafc
<7> [152.901618] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d4] = 0x1000db01
<7> [152.901719] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d8] = 0x0000db1c
<7> [152.901824] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs1 save-restore MMIOs
<7> [152.901925] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c050] = 0x10001000
<7> [152.902026] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c098] = 0xffffffff
<7> [152.902126] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0c4] = 0x3f7e0308
<7> [152.902243] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0d4] = 0xc080c080
<7> [152.902339] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c29c] = 0x00080008
<7> [152.902436] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d0] = 0x1000dafc
<7> [152.902531] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d4] = 0x1000db01
<7> [152.902625] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d8] = 0x0000db1c
<7> [152.902721] xe 0000:03:00.0: [drm:xe_gt_apply_ccs_mode [xe]] Tile0: GT0: CCS_MODE=fff0fc0 config:00400000, num_engines:1, num_slices:2
<6> [152.902953] xe 0000:03:00.0: [drm] Tile0: GT0: reset done
<5> [152.902960] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=5526, lrc_seqno=5526, guc_id=0, flags=0x73 in no process [-1]
<7> [152.902963] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [152.903045] ------------[ cut here ]------------
<4> [152.903046] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [152.903048] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1618 at guc_exec_queue_timedout_job+0x141a/0x2400 [xe], CPU#8: kworker/u64:7/206
<4> [152.903557] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy gpu_sched drm_ttm_helper ttm drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp coretemp spi_nor hid_generic mtd eeepc_wmi asus_wmi sparse_keymap mei_hdcp mei_pxp wmi_bmof kvm_intel snd_hda_intel kvm snd_intel_dspcfg irqbypass snd_hda_codec aesni_intel gf128mul snd_hda_core usbhid snd_hwdep r8169 rapl intel_cstate hid i2c_i801 snd_pcm realtek spi_intel_pci i2c_mux phy_package i2c_smbus video spi_intel snd_timer binfmt_misc snd soundcore idma64 intel_pmc_core pmt_telemetry pmt_discovery nls_iso8859_1 pmt_class pinctrl_alderlake acpi_pad intel_pmc_ssram_telemetry acpi_tad mei_me intel_vsec mei wmi dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink autofs4
<4> [152.903633] CPU: 8 UID: 0 PID: 206 Comm: kworker/u64:7 Tainted: G S U W 7.1.0-rc4-lgci-xe-xe-5118-60d51bdeabf700864-debug+ #1 PREEMPT(lazy)
<4> [152.903636] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [152.903637] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [152.903639] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [152.903646] RIP: 0010:guc_exec_queue_timedout_job+0x1423/0x2400 [xe]
<4> [152.903730] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 b0 d8 3c e1 48 89 c6 48 8d 3d 06 8c d9 ff 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 d0 ee ff ff 8b 70 08 49
<4> [152.903732] RSP: 0018:ffffc900014dbca0 EFLAGS: 00010246
<4> [152.903734] RAX: ffffffffa12bc20d RBX: 0000000000000000 RCX: 0000000000000000
<4> [152.903736] RDX: ffff88810379d010 RSI: ffffffffa12bc20d RDI: ffffffffa0c03f80
<4> [152.903737] RBP: ffffc900014dbdb0 R08: 0000000000000000 R09: 0000000000000000
<4> [152.903738] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [152.903739] R13: ffff88810379d010 R14: ffff888131518818 R15: 00000000ffffffc2
<4> [152.903741] FS: 0000000000000000(0000) GS:ffff8888db083000(0000) knlGS:0000000000000000
<4> [152.903742] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [152.903743] CR2: 00007b7a59e33b08 CR3: 000000000344a006 CR4: 0000000000f72ef0
<4> [152.903745] PKRU: 55555554
<4> [152.903746] Call Trace:
<4> [152.903747] <TASK>
<4> [152.903752] ? lock_acquire+0xb0/0x300
<4> [152.903758] ? lock_release+0xd0/0x2b0
<4> [152.903763] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [152.903769] process_one_work+0x239/0x740
<4> [152.903777] worker_thread+0x200/0x3f0
<4> [152.903780] ? __pfx_worker_thread+0x10/0x10
<4> [152.903783] kthread+0x10d/0x150
<4> [152.903785] ? __pfx_kthread+0x10/0x10
<4> [152.903788] ret_from_fork+0x3bd/0x470
<4> [152.903791] ? __pfx_kthread+0x10/0x10
<4> [152.903794] ret_from_fork_asm+0x1a/0x30
<4> [152.903802] </TASK>
<4> [152.903803] irq event stamp: 213239
<4> [152.903804] hardirqs last enabled at (213245): [<ffffffff814ab629>] __up_console_sem+0x79/0xa0
<4> [152.903807] hardirqs last disabled at (213250): [<ffffffff814ab60e>] __up_console_sem+0x5e/0xa0
<4> [152.903809] softirqs last enabled at (211918): [<ffffffff813d233b>] __irq_exit_rcu+0xdb/0x1c0
<4> [152.903812] softirqs last disabled at (211911): [<ffffffff813d233b>] __irq_exit_rcu+0xdb/0x1c0
<4> [152.903814] ---[ end trace 0000000000000000 ]---
<6> [152.903817] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [152.903897] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<6> [152.903904] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<3> [153.955262] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: Timed out wait for G2H, fence 17386, action 5503, done no
<5> [153.955347] xe 0000:03:00.0: [drm] PF: Tile0: GT0: Failed to push PF 15 config KLVs (-ETIME)
<6> [153.955361] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a0b : 32b value 0 } # begin_ctx_id
<6> [153.955371] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0004 : 32b value 65535 } # num_contexts
<6> [153.955380] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a0a : 32b value 0 } # begin_db_id
<6> [153.955388] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0006 : 32b value 256 } # num_doorbells
<6> [153.955395] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a01 : 32b value 0 } # exec_quantum
<6> [153.955402] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a02 : 32b value 0 } # preempt_timeout
<6> [153.955409] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a03 : 32b value 0 } # cat_error_count
<6> [153.955416] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a04 : 32b value 0 } # engine_reset_count
<6> [153.955423] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a05 : 32b value 0 } # page_fault_count
<6> [153.955429] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a06 : 32b value 0 } # guc_time_us
<6> [153.955436] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a07 : 32b value 0 } # irq_time_us
<6> [153.955443] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a08 : 32b value 0 } # doorbell_time_us
<6> [153.955450] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a0d : 32b value 0 } # multi_lrc_count
<6> [153.955456] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0001 : 64b value 0x400000 } # ggtt_start
<6> [153.955464] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0002 : 64b value 0xfea00000 } # ggtt_size
<3> [153.955513] xe 0000:03:00.0: [drm] *ERROR* PF: Tile0: GT0: Failed to push self configuration (-ETIME)
<7> [153.955595] xe 0000:03:00.0: [drm:xe_gt_sriov_pf_config_restart [xe]] PF: Tile0: GT0: pushed 0 skip 24 of 24 VFs configurations
<7> [153.956337] xe 0000:03:00.0: [drm:pf_worker_restart_func [xe]] PF: Tile0: GT0: restart completed
<7> [153.956974] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [153.959449] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [153.960111] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [153.960762] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x01800000
<7> [153.961424] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x11111440
<7> [153.961551] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [153.961637] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [153.961721] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [153.961806] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00000100
<7> [153.961889] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [153.962001] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [153.962102] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7> [153.963161] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<7> [153.973027] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 9ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034
<7> [153.973793] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled
<7> [153.976038] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: flag:0x1
<7> [153.976551] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: mocs entries: 16
<7> [153.977048] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[0] 0x4000 0xc
<7> [153.977439] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[1] 0x4004 0x10c
<7> [153.977796] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[2] 0x4008 0x130
<7> [153.978201] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[3] 0x400c 0x13c
<7> [153.978534] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[4] 0x4010 0x100
<7> [153.978839] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[5] 0x4014 0x100
<7> [153.979177] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[6] 0x4018 0x100
<7> [153.979473] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[7] 0x401c 0x100
<7> [153.979769] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[8] 0x4020 0x100
<7> [153.980060] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[9] 0x4024 0x100
<7> [153.980326] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[10] 0x4028 0x100
<7> [153.980596] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[11] 0x402c 0x100
<7> [153.980851] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[12] 0x4030 0x100
<7> [153.981130] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[13] 0x4034 0x100
<7> [153.981371] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[14] 0x4038 0x100
<7> [153.981616] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[15] 0x403c 0x100
<7> [153.981839] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying rcs0 save-restore MMIOs
<7> [153.982136] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2050] = 0x10001000
<7> [153.982396] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2098] = 0xffffffff
<7> [153.982631] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20c4] = 0x3f7e0306
<7> [153.982859] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20d4] = 0xc080c080
<7> [153.983131] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x229c] = 0x00080008
<7> [153.983365] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d0] = 0x00006210
<7> [153.983574] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d4] = 0x000062a8
<7> [153.983770] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d8] = 0x1000dafc
<7> [153.983961] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24dc] = 0x1000db01
<7> [153.984189] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24e0] = 0x0000db1c
<7> [153.984379] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe194] = 0x00400040
<7> [153.984568] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe48c] = 0x02000200
<7> [153.984757] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe49c] = 0x40004000
<7> [153.984937] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4c4] = 0x10401040
<7> [153.985136] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4f0] = 0x00020002
<7> [153.985318] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe530] = 0x00000400
<7> [153.985493] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7c8] = 0x04002000
<7> [153.985654] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00009100
<7> [153.985811] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x14800] = 0x00030003
<7> [153.985967] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs0 save-restore MMIOs
<7> [153.986154] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22050] = 0x10001000
<7> [153.986306] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22098] = 0xffffffff
<7> [153.986452] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220c4] = 0x3f7e0306
<7> [153.986587] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220d4] = 0xc080c080
<7> [153.986727] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2229c] = 0x00080008
<7> [153.986869] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs8 save-restore MMIOs
<7> [153.987017] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee050] = 0x10001000
<7> [153.987157] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee098] = 0xffffffff
<7> [153.987286] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0c4] = 0x3f7e0306
<7> [153.987412] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0d4] = 0xc080c080
<7> [153.987534] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee29c] = 0x00080008
<7> [153.987659] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs0 save-restore MMIOs
<7> [153.987779] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a050] = 0x10001000
<7> [153.987896] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a098] = 0xffffffff
<7> [153.988024] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0c4] = 0x3f7e0308
<7> [153.988152] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0d4] = 0xc080c080
<7> [153.988279] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a29c] = 0x00080008
<7> [153.988393] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d0] = 0x1000dafc
<7> [153.988503] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d4] = 0x1000db01
<7> [153.988606] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d8] = 0x0000db1c
<7> [153.988716] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs1 save-restore MMIOs
<7> [153.988817] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c050] = 0x10001000
<7> [153.988919] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c098] = 0xffffffff
<7> [153.989034] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0c4] = 0x3f7e0308
<7> [153.989135] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0d4] = 0xc080c080
<7> [153.989238] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c29c] = 0x00080008
<7> [153.989333] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d0] = 0x1000dafc
<7> [153.989427] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d4] = 0x1000db01
<7> [153.989519] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d8] = 0x0000db1c
<7> [153.989613] xe 0000:03:00.0: [drm:xe_gt_apply_ccs_mode [xe]] Tile0: GT0: CCS_MODE=fff0fc0 config:00400000, num_engines:1, num_slices:2
<6> [153.989802] xe 0000:03:00.0: [drm] Tile0: GT0: reset done
<5> [153.989808] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=5526, lrc_seqno=5526, guc_id=0, flags=0x73 in no process [-1]
<7> [153.989812] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [153.989891] ------------[ cut here ]------------
<4> [153.989893] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [153.989894] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1618 at guc_exec_queue_timedout_job+0x141a/0x2400 [xe], CPU#8: kworker/u64:20/2767
<4> [153.989991] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy gpu_sched drm_ttm_helper ttm drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp coretemp spi_nor hid_generic mtd eeepc_wmi asus_wmi sparse_keymap mei_hdcp mei_pxp wmi_bmof kvm_intel snd_hda_intel kvm snd_intel_dspcfg irqbypass snd_hda_codec aesni_intel gf128mul snd_hda_core usbhid snd_hwdep r8169 rapl intel_cstate hid i2c_i801 snd_pcm realtek spi_intel_pci i2c_mux phy_package i2c_smbus video spi_intel snd_timer binfmt_misc snd soundcore idma64 intel_pmc_core pmt_telemetry pmt_discovery nls_iso8859_1 pmt_class pinctrl_alderlake acpi_pad intel_pmc_ssram_telemetry acpi_tad mei_me intel_vsec mei wmi dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink autofs4
<4> [153.990074] CPU: 8 UID: 0 PID: 2767 Comm: kworker/u64:20 Tainted: G S U W 7.1.0-rc4-lgci-xe-xe-5118-60d51bdeabf700864-debug+ #1 PREEMPT(lazy)
<4> [153.990078] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [153.990079] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [153.990081] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [153.990088] RIP: 0010:guc_exec_queue_timedout_job+0x1423/0x2400 [xe]
<4> [153.990177] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 b0 d8 3c e1 48 89 c6 48 8d 3d 06 8c d9 ff 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 d0 ee ff ff 8b 70 08 49
<4> [153.990179] RSP: 0018:ffffc90004747ca0 EFLAGS: 00010246
<4> [153.990181] RAX: ffffffffa12bc20d RBX: 0000000000000000 RCX: 0000000000000000
<4> [153.990183] RDX: ffff88810379d010 RSI: ffffffffa12bc20d RDI: ffffffffa0c03f80
<4> [153.990184] RBP: ffffc90004747db0 R08: 0000000000000000 R09: 0000000000000000
<4> [153.990186] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [153.990187] R13: ffff88810379d010 R14: ffff888131518818 R15: 00000000ffffffc2
<4> [153.990188] FS: 0000000000000000(0000) GS:ffff8888db083000(0000) knlGS:0000000000000000
<4> [153.990190] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [153.990192] CR2: 00007b7a59e33b08 CR3: 000000000344a006 CR4: 0000000000f72ef0
<4> [153.990193] PKRU: 55555554
<4> [153.990195] Call Trace:
<4> [153.990196] <TASK>
<4> [153.990201] ? lock_acquire+0xb0/0x300
<4> [153.990208] ? lock_release+0xd0/0x2b0
<4> [153.990215] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [153.990221] process_one_work+0x239/0x740
<4> [153.990229] worker_thread+0x200/0x3f0
<4> [153.990232] ? __pfx_worker_thread+0x10/0x10
<4> [153.990235] kthread+0x10d/0x150
<4> [153.990238] ? __pfx_kthread+0x10/0x10
<4> [153.990241] ret_from_fork+0x3bd/0x470
<4> [153.990244] ? __pfx_kthread+0x10/0x10
<4> [153.990247] ret_from_fork_asm+0x1a/0x30
<4> [153.990256] </TASK>
<4> [153.990257] irq event stamp: 40397
<4> [153.990258] hardirqs last enabled at (40403): [<ffffffff814ab629>] __up_console_sem+0x79/0xa0
<4> [153.990261] hardirqs last disabled at (40408): [<ffffffff814ab60e>] __up_console_sem+0x5e/0xa0
<4> [153.990263] softirqs last enabled at (39388): [<ffffffff813d233b>] __irq_exit_rcu+0xdb/0x1c0
<4> [153.990266] softirqs last disabled at (39383): [<ffffffff813d233b>] __irq_exit_rcu+0xdb/0x1c0
<4> [153.990269] ---[ end trace 0000000000000000 ]---
<7> [153.991254] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=115, gpusvm=ffff88814dda1190, errno=-EOPNOTSUPP
<5> [153.991833] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=5527, lrc_seqno=5527, guc_id=0, flags=0x73 in no process [-1]
<7> [153.991837] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [153.991914] ------------[ cut here ]------------
<4> [153.991916] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [153.991917] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1618 at guc_exec_queue_timedout_job+0x141a/0x2400 [xe], CPU#4: kworker/u64:7/206
<4> [153.992024] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy gpu_sched drm_ttm_helper ttm drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp coretemp spi_nor hid_generic mtd eeepc_wmi asus_wmi sparse_keymap mei_hdcp mei_pxp wmi_bmof kvm_intel snd_hda_intel kvm snd_intel_dspcfg irqbypass snd_hda_codec aesni_intel gf128mul snd_hda_core usbhid snd_hwdep r8169 rapl intel_cstate hid i2c_i801 snd_pcm realtek spi_intel_pci i2c_mux phy_package i2c_smbus video spi_intel snd_timer binfmt_misc snd soundcore idma64 intel_pmc_core pmt_telemetry pmt_discovery nls_iso8859_1 pmt_class pinctrl_alderlake acpi_pad intel_pmc_ssram_telemetry acpi_tad mei_me intel_vsec mei wmi dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink autofs4
<4> [153.992096] CPU: 4 UID: 0 PID: 206 Comm: kworker/u64:7 Tainted: G S U W 7.1.0-rc4-lgci-xe-xe-5118-60d51bdeabf700864-debug+ #1 PREEMPT(lazy)
<4> [153.992099] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [153.992100] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [153.992102] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [153.992107] RIP: 0010:guc_exec_queue_timedout_job+0x1423/0x2400 [xe]
<4> [153.992186] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 b0 d8 3c e1 48 89 c6 48 8d 3d 06 8c d9 ff 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 d0 ee ff ff 8b 70 08 49
<4> [153.992187] RSP: 0018:ffffc900014dbca0 EFLAGS: 00010246
<4> [153.992190] RAX: ffffffffa12bc20d RBX: 0000000000000000 RCX: 0000000000000000
<4> [153.992191] RDX: ffff88810379d010 RSI: ffffffffa12bc20d RDI: ffffffffa0c03f80
<4> [153.992192] RBP: ffffc900014dbdb0 R08: 0000000000000000 R09: 0000000000000000
<4> [153.992193] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [153.992194] R13: ffff88810379d010 R14: ffff888131518818 R15: 00000000ffffffc2
<4> [153.992195] FS: 0000000000000000(0000) GS:ffff8888dae83000(0000) knlGS:0000000000000000
<4> [153.992197] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [153.992198] CR2: 000076a6ec5a8d78 CR3: 000000012ba17003 CR4: 0000000000f72ef0
<4> [153.992200] PKRU: 55555554
<4> [153.992201] Call Trace:
<4> [153.992202] <TASK>
<4> [153.992205] ? lock_acquire+0xb0/0x300
<4> [153.992211] ? lock_release+0xd0/0x2b0
<4> [153.992215] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [153.992221] process_one_work+0x239/0x740
<4> [153.992228] worker_thread+0x200/0x3f0
<4> [153.992231] ? __pfx_worker_thread+0x10/0x10
<4> [153.992234] kthread+0x10d/0x150
<4> [153.992236] ? __pfx_kthread+0x10/0x10
<4> [153.992239] ret_from_fork+0x3bd/0x470
<4> [153.992241] ? __pfx_kthread+0x10/0x10
<4> [153.992243] ret_from_fork_asm+0x1a/0x30
<4> [153.992251] </TASK>
<4> [153.992252] irq event stamp: 214855
<4> [153.992253] hardirqs last enabled at (214861): [<ffffffff814ab629>] __up_console_sem+0x79/0xa0
<4> [153.992256] hardirqs last disabled at (214866): [<ffffffff814ab60e>] __up_console_sem+0x5e/0xa0
<4> [153.992257] softirqs last enabled at (214112): [<ffffffff813d233b>] __irq_exit_rcu+0xdb/0x1c0
<4> [153.992260] softirqs last disabled at (214107): [<ffffffff813d233b>] __irq_exit_rcu+0xdb/0x1c0
<4> [153.992262] ---[ end trace 0000000000000000 ]---
<6> [153.992333] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [153.992468] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<6> [153.992481] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<6> [154.488791] Console: switching to colour frame buffer device 240x67
<3> [155.043043] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: Timed out wait for G2H, fence 17407, action 5503, done no
<5> [155.043789] xe 0000:03:00.0: [drm] PF: Tile0: GT0: Failed to push PF 15 config KLVs (-ETIME)
<6> [155.043804] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a0b : 32b value 0 } # begin_ctx_id
<6> [155.043814] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0004 : 32b value 65535 } # num_contexts
<6> [155.043822] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a0a : 32b value 0 } # begin_db_id
<6> [155.043913] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0006 : 32b value 256 } # num_doorbells
<6> [155.043926] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a01 : 32b value 0 } # exec_quantum
<6> [155.043939] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a02 : 32b value 0 } # preempt_timeout
<6> [155.043952] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a03 : 32b value 0 } # cat_error_count
<6> [155.043965] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a04 : 32b value 0 } # engine_reset_count
<6> [155.043979] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a05 : 32b value 0 } # page_fault_count
<6> [155.043993] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a06 : 32b value 0 } # guc_time_us
<6> [155.044004] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a07 : 32b value 0 } # irq_time_us
<6> [155.044011] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a08 : 32b value 0 } # doorbell_time_us
<6> [155.044018] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a0d : 32b value 0 } # multi_lrc_count
<6> [155.044026] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0001 : 64b value 0x400000 } # ggtt_start
<6> [155.044035] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0002 : 64b value 0xfea00000 } # ggtt_size
<3> [155.044075] xe 0000:03:00.0: [drm] *ERROR* PF: Tile0: GT0: Failed to push self configuration (-ETIME)
<7> [155.044331] xe 0000:03:00.0: [drm:xe_gt_sriov_pf_config_restart [xe]] PF: Tile0: GT0: pushed 0 skip 24 of 24 VFs configurations
<7> [155.044954] xe 0000:03:00.0: [drm:pf_worker_restart_func [xe]] PF: Tile0: GT0: restart completed
<7> [155.045630] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [155.046588] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [155.047206] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [155.047797] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x01800000
<7> [155.048504] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x11111440
<7> [155.049229] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [155.049674] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [155.049759] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [155.049847] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00000100
<7> [155.049934] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [155.050029] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [155.050116] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7> [155.051158] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<7> [155.061954] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 10ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034
<7> [155.062735] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled
<7> [155.065430] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: flag:0x1
<7> [155.065893] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: mocs entries: 16
<7> [155.066280] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[0] 0x4000 0xc
<7> [155.066672] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[1] 0x4004 0x10c
<7> [155.067119] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[2] 0x4008 0x130
<7> [155.067505] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[3] 0x400c 0x13c
<7> [155.067938] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[4] 0x4010 0x100
<7> [155.068310] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[5] 0x4014 0x100
<7> [155.068648] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[6] 0x4018 0x100
<7> [155.069028] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[7] 0x401c 0x100
<7> [155.069375] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[8] 0x4020 0x100
<7> [155.069702] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[9] 0x4024 0x100
<7> [155.070031] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[10] 0x4028 0x100
<7> [155.070326] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[11] 0x402c 0x100
<7> [155.070621] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[12] 0x4030 0x100
<7> [155.070934] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[13] 0x4034 0x100
<7> [155.071204] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[14] 0x4038 0x100
<7> [155.071466] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[15] 0x403c 0x100
<7> [155.071729] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying rcs0 save-restore MMIOs
<7> [155.072044] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2050] = 0x10001000
<7> [155.072334] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2098] = 0xffffffff
<7> [155.072599] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20c4] = 0x3f7e0306
<7> [155.072859] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20d4] = 0xc080c080
<7> [155.073116] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x229c] = 0x00080008
<7> [155.073365] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d0] = 0x00006210
<7> [155.073586] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d4] = 0x000062a8
<7> [155.073804] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d8] = 0x1000dafc
<7> [155.074067] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24dc] = 0x1000db01
<7> [155.074290] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24e0] = 0x0000db1c
<7> [155.074503] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe194] = 0x00400040
<7> [155.074704] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe48c] = 0x02000200
<7> [155.074917] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe49c] = 0x40004000
<7> [155.075112] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4c4] = 0x10401040
<7> [155.075289] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4f0] = 0x00020002
<7> [155.075473] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe530] = 0x00000400
<7> [155.075653] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7c8] = 0x04002000
<7> [155.075832] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00009100
<7> [155.076007] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x14800] = 0x00030003
<7> [155.076178] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs0 save-restore MMIOs
<7> [155.076339] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22050] = 0x10001000
<7> [155.076497] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22098] = 0xffffffff
<7> [155.076642] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220c4] = 0x3f7e0306
<7> [155.076795] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220d4] = 0xc080c080
<7> [155.076970] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2229c] = 0x00080008
<7> [155.077123] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs8 save-restore MMIOs
<7> [155.077270] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee050] = 0x10001000
<7> [155.077413] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee098] = 0xffffffff
<7> [155.077549] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0c4] = 0x3f7e0306
<7> [155.077684] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0d4] = 0xc080c080
<7> [155.077815] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee29c] = 0x00080008
<7> [155.077987] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs0 save-restore MMIOs
<7> [155.078119] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a050] = 0x10001000
<7> [155.078247] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a098] = 0xffffffff
<7> [155.078374] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0c4] = 0x3f7e0308
<7> [155.078492] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0d4] = 0xc080c080
<7> [155.078610] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a29c] = 0x00080008
<7> [155.078725] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d0] = 0x1000dafc
<7> [155.078843] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d4] = 0x1000db01
<7> [155.078955] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d8] = 0x0000db1c
<7> [155.079072] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs1 save-restore MMIOs
<7> [155.079180] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c050] = 0x10001000
<7> [155.079284] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c098] = 0xffffffff
<7> [155.079390] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0c4] = 0x3f7e0308
<7> [155.079488] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0d4] = 0xc080c080
<7> [155.079589] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c29c] = 0x00080008
<7> [155.079693] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d0] = 0x1000dafc
<7> [155.079792] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d4] = 0x1000db01
<7> [155.079901] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d8] = 0x0000db1c
<7> [155.080011] xe 0000:03:00.0: [drm:xe_gt_apply_ccs_mode [xe]] Tile0: GT0: CCS_MODE=fff0fc0 config:00400000, num_engines:1, num_slices:2
<6> [155.080227] xe 0000:03:00.0: [drm] Tile0: GT0: reset done
<5> [155.080234] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=5527, lrc_seqno=5527, guc_id=0, flags=0x73 in no process [-1]
<7> [155.080238] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [155.080328] ------------[ cut here ]------------
<4> [155.080329] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [155.080331] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1618 at guc_exec_queue_timedout_job+0x141a/0x2400 [xe], CPU#8: kworker/u64:7/206
<4> [155.080427] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy gpu_sched drm_ttm_helper ttm drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp coretemp spi_nor hid_generic mtd eeepc_wmi asus_wmi sparse_keymap mei_hdcp mei_pxp wmi_bmof kvm_intel snd_hda_intel kvm snd_intel_dspcfg irqbypass snd_hda_codec aesni_intel gf128mul snd_hda_core usbhid snd_hwdep r8169 rapl intel_cstate hid i2c_i801 snd_pcm realtek spi_intel_pci i2c_mux phy_package i2c_smbus video spi_intel snd_timer binfmt_misc snd soundcore idma64 intel_pmc_core pmt_telemetry pmt_discovery nls_iso8859_1 pmt_class pinctrl_alderlake acpi_pad intel_pmc_ssram_telemetry acpi_tad mei_me intel_vsec mei wmi dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink autofs4
<4> [155.080517] CPU: 8 UID: 0 PID: 206 Comm: kworker/u64:7 Tainted: G S U W 7.1.0-rc4-lgci-xe-xe-5118-60d51bdeabf700864-debug+ #1 PREEMPT(lazy)
<4> [155.080521] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [155.080522] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [155.080524] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [155.080531] RIP: 0010:guc_exec_queue_timedout_job+0x1423/0x2400 [xe]
<4> [155.080619] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 b0 d8 3c e1 48 89 c6 48 8d 3d 06 8c d9 ff 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 d0 ee ff ff 8b 70 08 49
<4> [155.080621] RSP: 0018:ffffc900014dbca0 EFLAGS: 00010246
<4> [155.080623] RAX: ffffffffa12bc20d RBX: 0000000000000000 RCX: 0000000000000000
<4> [155.080625] RDX: ffff88810379d010 RSI: ffffffffa12bc20d RDI: ffffffffa0c03f80
<4> [155.080627] RBP: ffffc900014dbdb0 R08: 0000000000000000 R09: 0000000000000000
<4> [155.080628] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [155.080629] R13: ffff88810379d010 R14: ffff888131518818 R15: 00000000ffffffc2
<4> [155.080631] FS: 0000000000000000(0000) GS:ffff8888db083000(0000) knlGS:0000000000000000
<4> [155.080632] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [155.080634] CR2: 00007b7a59e33b08 CR3: 000000000344a006 CR4: 0000000000f72ef0
<4> [155.080636] PKRU: 55555554
<4> [155.080637] Call Trace:
<4> [155.080638] <TASK>
<4> [155.080643] ? lock_acquire+0xb0/0x300
<4> [155.080651] ? lock_release+0xd0/0x2b0
<4> [155.080657] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [155.080664] process_one_work+0x239/0x740
<4> [155.080672] worker_thread+0x200/0x3f0
<4> [155.080676] ? __pfx_worker_thread+0x10/0x10
<4> [155.080679] kthread+0x10d/0x150
<4> [155.080682] ? __pfx_kthread+0x10/0x10
<4> [155.080685] ret_from_fork+0x3bd/0x470
<4> [155.080688] ? __pfx_kthread+0x10/0x10
<4> [155.080691] ret_from_fork_asm+0x1a/0x30
<4> [155.080700] </TASK>
<4> [155.080702] irq event stamp: 218571
<4> [155.080703] hardirqs last enabled at (218577): [<ffffffff814ab629>] __up_console_sem+0x79/0xa0
<4> [155.080706] hardirqs last disabled at (218582): [<ffffffff814ab60e>] __up_console_sem+0x5e/0xa0
<4> [155.080708] softirqs last enabled at (217528): [<ffffffff813d233b>] __irq_exit_rcu+0xdb/0x1c0
<4> [155.080711] softirqs last disabled at (217521): [<ffffffff813d233b>] __irq_exit_rcu+0xdb/0x1c0
<4> [155.080715] ---[ end trace 0000000000000000 ]---
<6> [155.080717] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [155.081322] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<6> [155.081334] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<3> [156.130964] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: Timed out wait for G2H, fence 17428, action 5503, done no
<5> [156.131085] xe 0000:03:00.0: [drm] PF: Tile0: GT0: Failed to push PF 15 config KLVs (-ETIME)
<6> [156.131099] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a0b : 32b value 0 } # begin_ctx_id
<6> [156.131110] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0004 : 32b value 65535 } # num_contexts
<6> [156.131118] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a0a : 32b value 0 } # begin_db_id
<6> [156.131126] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0006 : 32b value 256 } # num_doorbells
<6> [156.131133] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a01 : 32b value 0 } # exec_quantum
<6> [156.131140] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a02 : 32b value 0 } # preempt_timeout
<6> [156.131147] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a03 : 32b value 0 } # cat_error_count
<6> [156.131154] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a04 : 32b value 0 } # engine_reset_count
<6> [156.131161] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a05 : 32b value 0 } # page_fault_count
<6> [156.131168] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a06 : 32b value 0 } # guc_time_us
<6> [156.131174] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a07 : 32b value 0 } # irq_time_us
<6> [156.131181] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a08 : 32b value 0 } # doorbell_time_us
<6> [156.131188] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a0d : 32b value 0 } # multi_lrc_count
<6> [156.131195] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0001 : 64b value 0x400000 } # ggtt_start
<6> [156.131203] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0002 : 64b value 0xfea00000 } # ggtt_size
<3> [156.131251] xe 0000:03:00.0: [drm] *ERROR* PF: Tile0: GT0: Failed to push self configuration (-ETIME)
<7> [156.131353] xe 0000:03:00.0: [drm:xe_gt_sriov_pf_config_restart [xe]] PF: Tile0: GT0: pushed 0 skip 24 of 24 VFs configurations
<7> [156.132119] xe 0000:03:00.0: [drm:pf_worker_restart_func [xe]] PF: Tile0: GT0: restart completed
<7> [156.132755] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [156.133543] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [156.134127] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [156.134631] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x01800000
<7> [156.135160] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x11111440
<7> [156.135604] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [156.136089] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [156.136530] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [156.137017] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00000100
<7> [156.137458] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [156.137786] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [156.137885] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7> [156.138925] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<7> [156.148822] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 9ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034
<7> [156.149584] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled
<7> [156.152133] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: flag:0x1
<7> [156.152619] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: mocs entries: 16
<7> [156.153131] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[0] 0x4000 0xc
<7> [156.153556] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[1] 0x4004 0x10c
<7> [156.154017] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[2] 0x4008 0x130
<7> [156.154414] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[3] 0x400c 0x13c
<7> [156.154797] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[4] 0x4010 0x100
<7> [156.155159] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[5] 0x4014 0x100
<7> [156.155521] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[6] 0x4018 0x100
<7> [156.155878] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[7] 0x401c 0x100
<7> [156.156198] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[8] 0x4020 0x100
<7> [156.156519] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[9] 0x4024 0x100
<7> [156.156870] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[10] 0x4028 0x100
<7> [156.157189] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[11] 0x402c 0x100
<7> [156.157487] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[12] 0x4030 0x100
<7> [156.157795] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[13] 0x4034 0x100
<7> [156.158088] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[14] 0x4038 0x100
<7> [156.158361] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[15] 0x403c 0x100
<7> [156.158627] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying rcs0 save-restore MMIOs
<7> [156.158968] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2050] = 0x10001000
<7> [156.159253] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2098] = 0xffffffff
<7> [156.159527] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20c4] = 0x3f7e0306
<7> [156.159787] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20d4] = 0xc080c080
<7> [156.160040] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x229c] = 0x00080008
<7> [156.160278] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d0] = 0x00006210
<7> [156.160499] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d4] = 0x000062a8
<7> [156.160722] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d8] = 0x1000dafc
<7> [156.160936] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24dc] = 0x1000db01
<7> [156.161143] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24e0] = 0x0000db1c
<7> [156.161351] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe194] = 0x00400040
<7> [156.161546] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe48c] = 0x02000200
<7> [156.161753] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe49c] = 0x40004000
<7> [156.161947] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4c4] = 0x10401040
<7> [156.162141] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4f0] = 0x00020002
<7> [156.162340] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe530] = 0x00000400
<7> [156.162525] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7c8] = 0x04002000
<7> [156.162706] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00009100
<7> [156.162884] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x14800] = 0x00030003
<7> [156.163066] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs0 save-restore MMIOs
<7> [156.163233] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22050] = 0x10001000
<7> [156.163388] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22098] = 0xffffffff
<7> [156.163543] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220c4] = 0x3f7e0306
<7> [156.163695] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220d4] = 0xc080c080
<7> [156.163847] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2229c] = 0x00080008
<7> [156.164001] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs8 save-restore MMIOs
<7> [156.164152] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee050] = 0x10001000
<7> [156.164297] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee098] = 0xffffffff
<7> [156.164437] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0c4] = 0x3f7e0306
<7> [156.164569] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0d4] = 0xc080c080
<7> [156.164707] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee29c] = 0x00080008
<7> [156.164845] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs0 save-restore MMIOs
<7> [156.164983] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a050] = 0x10001000
<7> [156.165118] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a098] = 0xffffffff
<7> [156.165242] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0c4] = 0x3f7e0308
<7> [156.165362] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0d4] = 0xc080c080
<7> [156.165483] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a29c] = 0x00080008
<7> [156.165598] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d0] = 0x1000dafc
<7> [156.165718] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d4] = 0x1000db01
<7> [156.165829] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d8] = 0x0000db1c
<7> [156.165941] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs1 save-restore MMIOs
<7> [156.166053] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c050] = 0x10001000
<7> [156.166166] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c098] = 0xffffffff
<7> [156.166272] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0c4] = 0x3f7e0308
<7> [156.166376] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0d4] = 0xc080c080
<7> [156.166481] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c29c] = 0x00080008
<7> [156.166586] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d0] = 0x1000dafc
<7> [156.166684] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d4] = 0x1000db01
<7> [156.166782] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d8] = 0x0000db1c
<7> [156.166882] xe 0000:03:00.0: [drm:xe_gt_apply_ccs_mode [xe]] Tile0: GT0: CCS_MODE=fff0fc0 config:00400000, num_engines:1, num_slices:2
<6> [156.167085] xe 0000:03:00.0: [drm] Tile0: GT0: reset done
<5> [156.167094] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=5527, lrc_seqno=5527, guc_id=0, flags=0x73 in no process [-1]
<7> [156.167112] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [156.167353] ------------[ cut here ]------------
<4> [156.167355] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [156.167359] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1618 at guc_exec_queue_timedout_job+0x141a/0x2400 [xe], CPU#12: kworker/u64:7/206
<4> [156.167514] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy gpu_sched drm_ttm_helper ttm drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp coretemp spi_nor hid_generic mtd eeepc_wmi asus_wmi sparse_keymap mei_hdcp mei_pxp wmi_bmof kvm_intel snd_hda_intel kvm snd_intel_dspcfg irqbypass snd_hda_codec aesni_intel gf128mul snd_hda_core usbhid snd_hwdep r8169 rapl intel_cstate hid i2c_i801 snd_pcm realtek spi_intel_pci i2c_mux phy_package i2c_smbus video spi_intel snd_timer binfmt_misc snd soundcore idma64 intel_pmc_core pmt_telemetry pmt_discovery nls_iso8859_1 pmt_class pinctrl_alderlake acpi_pad intel_pmc_ssram_telemetry acpi_tad mei_me intel_vsec mei wmi dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink autofs4
<4> [156.167645] CPU: 12 UID: 0 PID: 206 Comm: kworker/u64:7 Tainted: G S U W 7.1.0-rc4-lgci-xe-xe-5118-60d51bdeabf700864-debug+ #1 PREEMPT(lazy)
<4> [156.167651] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [156.167653] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [156.167655] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [156.167667] RIP: 0010:guc_exec_queue_timedout_job+0x1423/0x2400 [xe]
<4> [156.167820] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 b0 d8 3c e1 48 89 c6 48 8d 3d 06 8c d9 ff 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 d0 ee ff ff 8b 70 08 49
<4> [156.167824] RSP: 0018:ffffc900014dbca0 EFLAGS: 00010246
<4> [156.167828] RAX: ffffffffa12bc20d RBX: 0000000000000000 RCX: 0000000000000000
<4> [156.167831] RDX: ffff88810379d010 RSI: ffffffffa12bc20d RDI: ffffffffa0c03f80
<4> [156.167834] RBP: ffffc900014dbdb0 R08: 0000000000000000 R09: 0000000000000000
<4> [156.167836] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [156.167838] R13: ffff88810379d010 R14: ffff888131518818 R15: 00000000ffffffc2
<4> [156.167840] FS: 0000000000000000(0000) GS:ffff8888db283000(0000) knlGS:0000000000000000
<4> [156.167844] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [156.167846] CR2: 000076a6ec5bb8d0 CR3: 000000000344a006 CR4: 0000000000f72ef0
<4> [156.167849] PKRU: 55555554
<4> [156.167851] Call Trace:
<4> [156.167853] <TASK>
<4> [156.167861] ? lock_acquire+0xb0/0x300
<4> [156.167872] ? lock_release+0xd0/0x2b0
<4> [156.167881] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [156.167893] process_one_work+0x239/0x740
<4> [156.167907] worker_thread+0x200/0x3f0
<4> [156.167912] ? __pfx_worker_thread+0x10/0x10
<4> [156.167917] kthread+0x10d/0x150
<4> [156.167921] ? __pfx_kthread+0x10/0x10
<4> [156.167926] ret_from_fork+0x3bd/0x470
<4> [156.167930] ? __pfx_kthread+0x10/0x10
<4> [156.167935] ret_from_fork_asm+0x1a/0x30
<4> [156.167950] </TASK>
<4> [156.167952] irq event stamp: 222251
<4> [156.167954] hardirqs last enabled at (222257): [<ffffffff814ab629>] __up_console_sem+0x79/0xa0
<4> [156.167958] hardirqs last disabled at (222262): [<ffffffff814ab60e>] __up_console_sem+0x5e/0xa0
<4> [156.167960] softirqs last enabled at (221006): [<ffffffff813d233b>] __irq_exit_rcu+0xdb/0x1c0
<4> [156.167965] softirqs last disabled at (220999): [<ffffffff813d233b>] __irq_exit_rcu+0xdb/0x1c0
<4> [156.167968] ---[ end trace 0000000000000000 ]---
<7> [156.168937] xe 0000:03:00.0: [drm:xe_gt_sriov_pf_config_restart [xe]] PF: Tile0: GT0: pushed 0 skip 24 of 24 VFs configurations
<7> [156.169049] xe 0000:03:00.0: [drm:pf_worker_restart_func [xe]] PF: Tile0: GT0: restart completed
<3> [158.436019] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=7171 recv=7170
<5> [158.441664] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=5528, lrc_seqno=5528, guc_id=0, flags=0x73 in no process [-1]
<7> [158.441670] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [158.441801] ------------[ cut here ]------------
<4> [158.441802] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [158.441804] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1618 at guc_exec_queue_timedout_job+0x141a/0x2400 [xe], CPU#9: kworker/u64:40/2788
<4> [158.441879] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy gpu_sched drm_ttm_helper ttm drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp coretemp spi_nor hid_generic mtd eeepc_wmi asus_wmi sparse_keymap mei_hdcp mei_pxp wmi_bmof kvm_intel snd_hda_intel kvm snd_intel_dspcfg irqbypass snd_hda_codec aesni_intel gf128mul snd_hda_core
<7> [158.441853] xe 0000:03:00.0: [drm:xe_svm_garbage_collector [xe]] Skipping madvise reset for vma.
<4> [158.441915] usbhid snd_hwdep r8169 rapl intel_cstate hid i2c_i801 snd_pcm realtek spi_intel_pci i2c_mux phy_package i2c_smbus video spi_intel snd_timer binfmt_misc snd soundcore idma64 intel_pmc_core pmt_telemetry pmt_discovery nls_iso8859_1 pmt_class pinctrl_alderlake acpi_pad intel_pmc_ssram_telemetry acpi_tad mei_me intel_vsec mei wmi dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink autofs4
<4> [158.441944] CPU: 9 UID: 0 PID: 2788 Comm: kworker/u64:40 Tainted: G S U W 7.1.0-rc4-lgci-xe-xe-5118-60d51bdeabf700864-debug+ #1 PREEMPT(lazy)
<4> [158.441946] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [158.441947] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [158.441949] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [158.441954] RIP: 0010:guc_exec_queue_timedout_job+0x1423/0x2400 [xe]
<4> [158.442024] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 b0 d8 3c e1 48 89 c6 48 8d 3d 06 8c d9 ff 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 d0 ee ff ff 8b 70 08 49
<4> [158.442025] RSP: 0018:ffffc9000513fca0 EFLAGS: 00010246
<4> [158.442027] RAX: ffffffffa12bc20d RBX: 0000000000000000 RCX: 0000000000000000
<4> [158.442029] RDX: ffff88810379d010 RSI: ffffffffa12bc20d RDI: ffffffffa0c03f80
<4> [158.442030] RBP: ffffc9000513fdb0 R08: 0000000000000000 R09: 0000000000000000
<4> [158.442031] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [158.442032] R13: ffff88810379d010 R14: ffff888131518818 R15: 00000000ffffffc2
<4> [158.442033] FS: 0000000000000000(0000) GS:ffff8888db103000(0000) knlGS:0000000000000000
<4> [158.442034] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [158.442035] CR2: 0000615e7344c398 CR3: 000000000344a004 CR4: 0000000000f72ef0
<4> [158.442037] PKRU: 55555554
<4> [158.442038] Call Trace:
<4> [158.442039] <TASK>
<4> [158.442043] ? lock_acquire+0xb0/0x300
<4> [158.442048] ? lock_release+0xd0/0x2b0
<4> [158.442053] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [158.442058] process_one_work+0x239/0x740
<4> [158.442064] worker_thread+0x200/0x3f0
<4> [158.442067] ? __pfx_worker_thread+0x10/0x10
<4> [158.442069] kthread+0x10d/0x150
<4> [158.442071] ? __pfx_kthread+0x10/0x10
<4> [158.442074] ret_from_fork+0x3bd/0x470
<4> [158.442077] ? __pfx_kthread+0x10/0x10
<4> [158.442079] ret_from_fork_asm+0x1a/0x30
<4> [158.442086] </TASK>
<4> [158.442087] irq event stamp: 24635
<4> [158.442088] hardirqs last enabled at (24641): [<ffffffff814ab629>] __up_console_sem+0x79/0xa0
<4> [158.442091] hardirqs last disabled at (24646): [<ffffffff814ab60e>] __up_console_sem+0x5e/0xa0
<4> [158.442092] softirqs last enabled at (23440): [<ffffffff8133ad83>] kernel_fpu_end+0x53/0x70
<4> [158.442094] softirqs last disabled at (23438): [<ffffffff8133b494>] kernel_fpu_begin_mask+0xc4/0x120
<4> [158.442097] ---[ end trace 0000000000000000 ]---
<6> [158.442099] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [158.442169] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<6> [158.442176] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<7> [158.442243] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [158.443046] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [158.443580] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [158.444072] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x01800000
<7> [158.444566] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x11111440
<7> [158.445021] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [158.445495] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [158.445948] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [158.446437] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00000100
<7> [158.446670] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [158.446764] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [158.446852] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7> [158.447883] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<7> [158.458405] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 9ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034
<7> [158.458858] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled
<7> [158.460594] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: flag:0x1
<7> [158.460822] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: mocs entries: 16
<7> [158.461027] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[0] 0x4000 0xc
<7> [158.461228] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[1] 0x4004 0x10c
<7> [158.461451] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[2] 0x4008 0x130
<7> [158.461657] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[3] 0x400c 0x13c
<7> [158.461863] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[4] 0x4010 0x100
<7> [158.462063] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[5] 0x4014 0x100
<7> [158.462261] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[6] 0x4018 0x100
<7> [158.462468] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[7] 0x401c 0x100
<7> [158.462660] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[8] 0x4020 0x100
<7> [158.462840] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[9] 0x4024 0x100
<7> [158.463016] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[10] 0x4028 0x100
<7> [158.463192] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[11] 0x402c 0x100
<7> [158.463367] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[12] 0x4030 0x100
<7> [158.463560] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[13] 0x4034 0x100
<7> [158.463724] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[14] 0x4038 0x100
<7> [158.463893] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[15] 0x403c 0x100
<7> [158.464062] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying rcs0 save-restore MMIOs
<7> [158.464245] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2050] = 0x10001000
<7> [158.464433] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2098] = 0xffffffff
<7> [158.464612] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20c4] = 0x3f7e0306
<7> [158.464782] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20d4] = 0xc080c080
<7> [158.464944] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x229c] = 0x00080008
<7> [158.465105] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d0] = 0x00006210
<7> [158.465259] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d4] = 0x000062a8
<7> [158.465412] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d8] = 0x1000dafc
<7> [158.465556] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24dc] = 0x1000db01
<7> [158.465700] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24e0] = 0x0000db1c
<7> [158.465848] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe194] = 0x00400040
<7> [158.465995] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe48c] = 0x02000200
<7> [158.466139] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe49c] = 0x40004000
<7> [158.466277] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4c4] = 0x10401040
<7> [158.466417] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4f0] = 0x00020002
<7> [158.466556] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe530] = 0x00000400
<7> [158.466689] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7c8] = 0x04002000
<7> [158.466818] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00009100
<7> [158.466947] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x14800] = 0x00030003
<7> [158.467075] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs0 save-restore MMIOs
<7> [158.467198] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22050] = 0x10001000
<7> [158.467322] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22098] = 0xffffffff
<7> [158.467453] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220c4] = 0x3f7e0306
<7> [158.467573] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220d4] = 0xc080c080
<7> [158.467692] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2229c] = 0x00080008
<7> [158.467812] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs8 save-restore MMIOs
<7> [158.467927] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee050] = 0x10001000
<7> [158.468038] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee098] = 0xffffffff
<7> [158.468147] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0c4] = 0x3f7e0306
<7> [158.468256] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0d4] = 0xc080c080
<7> [158.468363] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee29c] = 0x00080008
<7> [158.468493] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs0 save-restore MMIOs
<7> [158.468603] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a050] = 0x10001000
<7> [158.468710] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a098] = 0xffffffff
<7> [158.468813] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0c4] = 0x3f7e0308
<7> [158.468912] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0d4] = 0xc080c080
<7> [158.469015] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a29c] = 0x00080008
<7> [158.469121] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d0] = 0x1000dafc
<7> [158.469215] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d4] = 0x1000db01
<7> [158.469309] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d8] = 0x0000db1c
<7> [158.469413] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs1 save-restore MMIOs
<7> [158.469507] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c050] = 0x10001000
<7> [158.469599] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c098] = 0xffffffff
<7> [158.469691] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0c4] = 0x3f7e0308
<7> [158.469778] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0d4] = 0xc080c080
<7> [158.469867] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c29c] = 0x00080008
<7> [158.469958] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d0] = 0x1000dafc
<7> [158.470044] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d4] = 0x1000db01
<7> [158.470130] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d8] = 0x0000db1c
<7> [158.470218] xe 0000:03:00.0: [drm:xe_gt_apply_ccs_mode [xe]] Tile0: GT0: CCS_MODE=fff0fc0 config:00400000, num_engines:1, num_slices:2
<6> [158.470414] xe 0000:03:00.0: [drm] Tile0: GT0: reset done
<5> [158.470424] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=5528, lrc_seqno=5528, guc_id=0, flags=0x73 in no process [-1]
<7> [158.470427] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [158.470503] ------------[ cut here ]------------
<4> [158.470504] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [158.470505] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1618 at guc_exec_queue_timedout_job+0x141a/0x2400 [xe], CPU#6: kworker/u64:7/206
<4> [158.470590] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy gpu_sched drm_ttm_helper ttm drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp coretemp spi_nor hid_generic mtd eeepc_wmi asus_wmi sparse_keymap mei_hdcp mei_pxp wmi_bmof kvm_intel snd_hda_intel kvm snd_intel_dspcfg irqbypass snd_hda_codec aesni_intel gf128mul snd_hda_core usbhid snd_hwdep r8169 rapl intel_cstate hid i2c_i801 snd_pcm realtek spi_intel_pci i2c_mux phy_package i2c_smbus video spi_intel snd_timer binfmt_misc snd soundcore idma64 intel_pmc_core pmt_telemetry pmt_discovery nls_iso8859_1 pmt_class pinctrl_alderlake acpi_pad intel_pmc_ssram_telemetry acpi_tad mei_me intel_vsec mei wmi dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink autofs4
<4> [158.470670] CPU: 6 UID: 0 PID: 206 Comm: kworker/u64:7 Tainted: G S U W 7.1.0-rc4-lgci-xe-xe-5118-60d51bdeabf700864-debug+ #1 PREEMPT(lazy)
<4> [158.470673] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [158.470674] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [158.470675] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [158.470681] RIP: 0010:guc_exec_queue_timedout_job+0x1423/0x2400 [xe]
<4> [158.470761] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 b0 d8 3c e1 48 89 c6 48 8d 3d 06 8c d9 ff 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 d0 ee ff ff 8b 70 08 49
<4> [158.470762] RSP: 0018:ffffc900014dbca0 EFLAGS: 00010246
<4> [158.470764] RAX: ffffffffa12bc20d RBX: 0000000000000000 RCX: 0000000000000000
<4> [158.470766] RDX: ffff88810379d010 RSI: ffffffffa12bc20d RDI: ffffffffa0c03f80
<4> [158.470767] RBP: ffffc900014dbdb0 R08: 0000000000000000 R09: 0000000000000000
<4> [158.470768] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [158.470769] R13: ffff88810379d010 R14: ffff888131518818 R15: 00000000ffffffc2
<4> [158.470771] FS: 0000000000000000(0000) GS:ffff8888daf83000(0000) knlGS:0000000000000000
<4> [158.470772] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [158.470773] CR2: 000079959bb0a000 CR3: 0000000155c6a005 CR4: 0000000000f72ef0
<4> [158.470775] PKRU: 55555554
<4> [158.470776] Call Trace:
<4> [158.470777] <TASK>
<4> [158.470781] ? lock_acquire+0xb0/0x300
<4> [158.470786] ? lock_release+0xd0/0x2b0
<4> [158.470791] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [158.470797] process_one_work+0x239/0x740
<4> [158.470804] worker_thread+0x200/0x3f0
<4> [158.470807] ? __pfx_worker_thread+0x10/0x10
<4> [158.470809] kthread+0x10d/0x150
<4> [158.470812] ? __pfx_kthread+0x10/0x10
<4> [158.470815] ret_from_fork+0x3bd/0x470
<4> [158.470817] ? __pfx_kthread+0x10/0x10
<4> [158.470820] ret_from_fork_asm+0x1a/0x30
<4> [158.470827] </TASK>
<4> [158.470828] irq event stamp: 226161
<4> [158.470829] hardirqs last enabled at (226167): [<ffffffff814ab629>] __up_console_sem+0x79/0xa0
<4> [158.470832] hardirqs last disabled at (226172): [<ffffffff814ab60e>] __up_console_sem+0x5e/0xa0
<4> [158.470833] softirqs last enabled at (225356): [<ffffffff813d233b>] __irq_exit_rcu+0xdb/0x1c0
<4> [158.470836] softirqs last disabled at (225345): [<ffffffff813d233b>] __irq_exit_rcu+0xdb/0x1c0
<4> [158.470839] ---[ end trace 0000000000000000 ]---
<6> [158.470841] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [158.470982] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<6> [158.471005] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<7> [158.471066] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [158.471865] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [158.472162] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [158.472315] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x01800000
<7> [158.472410] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x11111440
<7> [158.472543] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [158.472678] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [158.472794] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [158.472884] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00000100
<7> [158.472971] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [158.473075] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [158.473173] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7> [158.474298] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<7> [158.485030] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 10ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034
<7> [158.485280] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled
<7> [158.486307] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: flag:0x1
<7> [158.486400] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: mocs entries: 16
<7> [158.486483] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[0] 0x4000 0xc
<7> [158.486563] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[1] 0x4004 0x10c
<7> [158.486641] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[2] 0x4008 0x130
<7> [158.486720] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[3] 0x400c 0x13c
<7> [158.486798] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[4] 0x4010 0x100
<7> [158.486876] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[5] 0x4014 0x100
<7> [158.486953] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[6] 0x4018 0x100
<7> [158.487030] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[7] 0x401c 0x100
<7> [158.487107] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[8] 0x4020 0x100
<7> [158.487184] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[9] 0x4024 0x100
<7> [158.487261] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[10] 0x4028 0x100
<7> [158.487339] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[11] 0x402c 0x100
<7> [158.487425] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[12] 0x4030 0x100
<7> [158.487502] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[13] 0x4034 0x100
<7> [158.487578] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[14] 0x4038 0x100
<7> [158.487656] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[15] 0x403c 0x100
<7> [158.487731] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying rcs0 save-restore MMIOs
<7> [158.487818] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2050] = 0x10001000
<7> [158.487905] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2098] = 0xffffffff
<7> [158.487988] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20c4] = 0x3f7e0306
<7> [158.488071] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20d4] = 0xc080c080
<7> [158.488153] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x229c] = 0x00080008
<7> [158.488236] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d0] = 0x00006210
<7> [158.488317] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d4] = 0x000062a8
<7> [158.488402] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d8] = 0x1000dafc
<7> [158.488484] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24dc] = 0x1000db01
<7> [158.488566] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24e0] = 0x0000db1c
<7> [158.488648] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe194] = 0x00400040
<7> [158.488733] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe48c] = 0x02000200
<7> [158.488814] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe49c] = 0x40004000
<7> [158.488894] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4c4] = 0x10401040
<7> [158.488976] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4f0] = 0x00020002
<7> [158.489060] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe530] = 0x00000400
<7> [158.489148] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7c8] = 0x04002000
<7> [158.489232] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00009100
<7> [158.489314] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x14800] = 0x00030003
<7> [158.489400] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs0 save-restore MMIOs
<7> [158.489481] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22050] = 0x10001000
<7> [158.489560] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22098] = 0xffffffff
<7> [158.489642] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220c4] = 0x3f7e0306
<7> [158.489724] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220d4] = 0xc080c080
<7> [158.489805] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2229c] = 0x00080008
<7> [158.489886] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs8 save-restore MMIOs
<7> [158.489966] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee050] = 0x10001000
<7> [158.490044] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee098] = 0xffffffff
<7> [158.490123] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0c4] = 0x3f7e0306
<7> [158.490201] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0d4] = 0xc080c080
<7> [158.490282] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee29c] = 0x00080008
<7> [158.490365] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs0 save-restore MMIOs
<7> [158.490455] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a050] = 0x10001000
<7> [158.490535] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a098] = 0xffffffff
<7> [158.490616] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0c4] = 0x3f7e0308
<7> [158.490694] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0d4] = 0xc080c080
<7> [158.490773] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a29c] = 0x00080008
<7> [158.490853] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d0] = 0x1000dafc
<7> [158.490929] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d4] = 0x1000db01
<7> [158.491006] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d8] = 0x0000db1c
<7> [158.491088] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs1 save-restore MMIOs
<7> [158.491167] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c050] = 0x10001000
<7> [158.491247] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c098] = 0xffffffff
<7> [158.491325] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0c4] = 0x3f7e0308
<7> [158.491409] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0d4] = 0xc080c080
<7> [158.491488] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c29c] = 0x00080008
<7> [158.491568] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d0] = 0x1000dafc
<7> [158.491645] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d4] = 0x1000db01
<7> [158.491719] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d8] = 0x0000db1c
<7> [158.491800] xe 0000:03:00.0: [drm:xe_gt_apply_ccs_mode [xe]] Tile0: GT0: CCS_MODE=fff0fc0 config:00400000, num_engines:1, num_slices:2
<6> [158.491922] xe 0000:03:00.0: [drm] Tile0: GT0: reset done
<5> [158.491926] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=5528, lrc_seqno=5528, guc_id=0, flags=0x73 in no process [-1]
<7> [158.491928] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [158.491996] ------------[ cut here ]------------
<4> [158.491997] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [158.491999] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1618 at guc_exec_queue_timedout_job+0x141a/0x2400 [xe], CPU#6: kworker/u64:7/206
<4> [158.492078] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy gpu_sched drm_ttm_helper ttm drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp coretemp spi_nor hid_generic mtd eeepc_wmi asus_wmi sparse_keymap mei_hdcp mei_pxp wmi_bmof kvm_intel snd_hda_intel kvm snd_intel_dspcfg irqbypass snd_hda_codec aesni_intel gf128mul snd_hda_core usbhid snd_hwdep r8169 rapl intel_cstate hid i2c_i801 snd_pcm realtek spi_intel_pci i2c_mux phy_package i2c_smbus video spi_intel snd_timer binfmt_misc snd soundcore idma64 intel_pmc_core pmt_telemetry pmt_discovery nls_iso8859_1 pmt_class pinctrl_alderlake acpi_pad intel_pmc_ssram_telemetry acpi_tad mei_me intel_vsec mei wmi dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink autofs4
<4> [158.492140] CPU: 6 UID: 0 PID: 206 Comm: kworker/u64:7 Tainted: G S U W 7.1.0-rc4-lgci-xe-xe-5118-60d51bdeabf700864-debug+ #1 PREEMPT(lazy)
<4> [158.492143] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [158.492143] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [158.492145] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [158.492150] RIP: 0010:guc_exec_queue_timedout_job+0x1423/0x2400 [xe]
<4> [158.492233] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 b0 d8 3c e1 48 89 c6 48 8d 3d 06 8c d9 ff 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 d0 ee ff ff 8b 70 08 49
<4> [158.492234] RSP: 0018:ffffc900014dbca0 EFLAGS: 00010246
<4> [158.492236] RAX: ffffffffa12bc20d RBX: 0000000000000000 RCX: 0000000000000000
<4> [158.492238] RDX: ffff88810379d010 RSI: ffffffffa12bc20d RDI: ffffffffa0c03f80
<4> [158.492239] RBP: ffffc900014dbdb0 R08: 0000000000000000 R09: 0000000000000000
<4> [158.492240] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [158.492241] R13: ffff88810379d010 R14: ffff888131518818 R15: 00000000ffffffc2
<4> [158.492242] FS: 0000000000000000(0000) GS:ffff8888daf83000(0000) knlGS:0000000000000000
<4> [158.492244] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [158.492245] CR2: 000079959bb0a000 CR3: 000000000344a001 CR4: 0000000000f72ef0
<4> [158.492246] PKRU: 55555554
<4> [158.492247] Call Trace:
<4> [158.492248] <TASK>
<4> [158.492252] ? lock_acquire+0xb0/0x300
<4> [158.492257] ? lock_release+0xd0/0x2b0
<4> [158.492262] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [158.492268] process_one_work+0x239/0x740
<4> [158.492274] worker_thread+0x200/0x3f0
<4> [158.492277] ? __pfx_worker_thread+0x10/0x10
<4> [158.492280] kthread+0x10d/0x150
<4> [158.492282] ? __pfx_kthread+0x10/0x10
<4> [158.492285] ret_from_fork+0x3bd/0x470
<4> [158.492287] ? __pfx_kthread+0x10/0x10
<4> [158.492289] ret_from_fork_asm+0x1a/0x30
<4> [158.492297] </TASK>
<4> [158.492298] irq event stamp: 229443
<4> [158.492299] hardirqs last enabled at (229449): [<ffffffff814ab629>] __up_console_sem+0x79/0xa0
<4> [158.492301] hardirqs last disabled at (229454): [<ffffffff814ab60e>] __up_console_sem+0x5e/0xa0
<4> [158.492302] softirqs last enabled at (228302): [<ffffffff813d233b>] __irq_exit_rcu+0xdb/0x1c0
<4> [158.492305] softirqs last disabled at (228295): [<ffffffff813d233b>] __irq_exit_rcu+0xdb/0x1c0
<4> [158.492307] ---[ end trace 0000000000000000 ]---
<3> [159.522566] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: Timed out wait for G2H, fence 17493, action 5503, done no
<5> [159.522686] xe 0000:03:00.0: [drm] PF: Tile0: GT0: Failed to push PF 15 config KLVs (-ETIME)
<6> [159.522700] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a0b : 32b value 0 } # begin_ctx_id
<6> [159.522710] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0004 : 32b value 65535 } # num_contexts
<6> [159.522719] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a0a : 32b value 0 } # begin_db_id
<6> [159.522727] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0006 : 32b value 256 } # num_doorbells
<6> [159.522734] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a01 : 32b value 0 } # exec_quantum
<6> [159.522742] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a02 : 32b value 0 } # preempt_timeout
<6> [159.522749] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a03 : 32b value 0 } # cat_error_count
<6> [159.522756] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a04 : 32b value 0 } # engine_reset_count
<6> [159.522763] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a05 : 32b value 0 } # page_fault_count
<6> [159.522770] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a06 : 32b value 0 } # guc_time_us
<6> [159.522776] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a07 : 32b value 0 } # irq_time_us
<6> [159.522783] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a08 : 32b value 0 } # doorbell_time_us
<6> [159.522790] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x8a0d : 32b value 0 } # multi_lrc_count
<6> [159.522797] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0001 : 64b value 0x400000 } # ggtt_start
<6> [159.522805] xe 0000:03:00.0: [drm] Tile0: GT0: { key 0x0002 : 64b value 0xfea00000 } # ggtt_size
<3> [159.522852] xe 0000:03:00.0: [drm] *ERROR* PF: Tile0: GT0: Failed to push self configuration (-ETIME)
<7> [159.522953] xe 0000:03:00.0: [drm:xe_gt_sriov_pf_config_restart [xe]] PF: Tile0: GT0: pushed 0 skip 24 of 24 VFs configurations
<7> [159.523709] xe 0000:03:00.0: [drm:pf_worker_restart_func [xe]] PF: Tile0: GT0: restart completed
<3> [160.803659] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=7204 recv=7173
<7> [161.592196] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x2c2c282a
<7> [161.592524] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x2c2c2b2c
<3> [163.106137] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=7205 recv=7173
<7> [163.109809] xe 0000:03:00.0: [drm:drm_pagemap_dev_unhold_work [drm_gpusvm_helper]] Releasing reference on provider device and module.
<3> [165.410406] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=7206 recv=7173
<3> [167.714648] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=7207 recv=7173
<7> [167.721282] xe 0000:03:00.0: [drm:drm_client_dev_restore] fbdev: ret=0
|