Oops#2 Part16 <7>[ 378.033084] xe 0000:03:00.0: [drm:guc_print_params [xe]] Tile0: GT0: GuC param[10] = 0x00000000 <7>[ 378.033137] xe 0000:03:00.0: [drm:guc_print_params [xe]] Tile0: GT0: GuC param[11] = 0x00000000 <7>[ 378.033189] xe 0000:03:00.0: [drm:guc_print_params [xe]] Tile0: GT0: GuC param[12] = 0x00000000 <7>[ 378.033243] xe 0000:03:00.0: [drm:guc_print_params [xe]] Tile0: GT0: GuC param[13] = 0x00000000 <7>[ 378.033321] xe 0000:03:00.0: [drm:xe_guc_id_mgr_init [xe]] Tile0: GT0: using 65535 GuC IDs <7>[ 378.033398] xe 0000:03:00.0: [drm:xe_guc_db_mgr_init [xe]] Tile0: GT0: using 256 doorbells <7>[ 378.034481] xe 0000:03:00.0: [drm:guc_buf_cache_init [xe]] Tile0: GT0: reusable buffer with 2097152 dwords at 0xe8c000 for xe_guc_buf_cache_init_with_size [xe] <7>[ 378.035369] xe 0000:03:00.0: [drm:xe_migrate_init [xe]] Migrate min chunk size is 0x00010000 <7>[ 378.036349] xe 0000:03:00.0: [drm:xe_guc_capture_steered_list_init [xe]] Tile0: GT0: capture found 120 ext-regs. <7>[ 378.058293] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152) <7>[ 378.069364] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 10ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034 <7>[ 378.069635] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled <7>[ 378.070351] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC rcs0 WA job: 4146 dwords <7>[ 378.070427] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x5588] = 0x04000400 Oops#2 Part15 <7>[ 378.070488] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6204] = 0x00400040 <7>[ 378.070546] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6208] = 0x00200020 <7>[ 378.070606] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x62a8] = 0x02400240 <7>[ 378.070664] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x7010] = 0x40004000 <7>[ 378.070720] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x7044] = 0x04200420 <7>[ 378.070777] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x7300] = 0x10001000 <7>[ 378.070836] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x83a8] = 0x20002000 <7>[ 378.070898] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6210] = ~0x3f18000|0x3f18000 (MCR) <7>[ 378.072716] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC bcs0 WA job: 27 dwords <7>[ 378.072782] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x22204] = ~0x7e7e|0x606 <7>[ 378.072840] xe 0000:03:00.0: [drm:xe_lrc_emit_hwe_state_instructions [xe]] Tile0: GT0: No non-register state to emit on graphics ver 20.01 <7>[ 378.074507] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC ccs0 WA job: 0 dwords <7>[ 378.074581] xe 0000:03:00.0: [drm:xe_lrc_emit_hwe_state_instructions [xe]] Tile0: GT0: No non-register state to emit on graphics ver 20.01 <5>[ 378.076104] FAULT_INJECTION: forcing a failure. <5>[ 378.076104] name fail_function, interval 0, probability 100, space 1, times 100 Oops#2 Part14 <3>[ 378.076120] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: GuC PC query task state failed: -ENOMEM <4>[ 378.076222] ------------[ cut here ]------------ <4>[ 378.076224] xe 0000:03:00.0: [drm] Assertion `ct->g2h_outstanding == 0 || state == XE_GUC_CT_STATE_STOPPED` failed! <4>[ 378.076224] platform: BATTLEMAGE subplatform: 7 <4>[ 378.076224] graphics: Xe2_HPG 20.01 step A0 <4>[ 378.076224] media: Xe2_HPM 13.01 step A1 <4>[ 378.076224] tile: 0 VRAM 12.0 GiB <4>[ 378.076224] GT: 0 type 1 <4>[ 378.076228] WARNING: drivers/gpu/drm/xe/xe_guc_ct.c:541 at guc_ct_change_state+0x264/0x330 [xe], CPU#11: xe_fault_inject/12964 <4>[ 378.076323] Modules linked in: xe snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_gsc_proxy mei_lb mei_gsc mtd_intel_dg drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp coretemp spi_nor mtd eeepc_wmi hid_generic asus_wmi sparse_keymap platform_profile mei_hdcp mei_pxp kvm_intel wmi_bmof kvm snd_intel_dspcfg irqbypass snd_hda_codec ghash_clmulni_intel aesni_intel r8169 snd_hda_core usbhid rapl binfmt_misc intel_cstate snd_hwdep i2c_i801 hid spi_intel_pci snd_pcm i2c_mux spi_intel realtek i2c_smbus snd_timer snd soundcore idma64 intel_pmc_core video pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class mei_me intel_pmc_ssram_telemetry mei intel_vsec wmi pinctrl_alderlake acpi_pad acpi_tad dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink Oops#2 Part13 <4>[ 378.076400] autofs4 [last unloaded: xe] <4>[ 378.076404] CPU: 11 UID: 0 PID: 12964 Comm: xe_fault_inject Tainted: G S U W 7.0.0-rc3-lgci-xe-xe-4688-7282a0941df77adea-debug+ #1 PREEMPT(lazy) <4>[ 378.076408] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN <4>[ 378.076409] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024 <4>[ 378.076411] RIP: 0010:guc_ct_change_state+0x2d8/0x330 [xe] <4>[ 378.076483] Code: 51 48 c1 ea 25 44 6b ca 64 44 29 c9 51 48 c7 c1 20 7f 18 a1 52 4c 8b 55 88 41 52 44 8b 4d 9c 4c 8b 45 90 48 8b 95 78 ff ff ff <67> 48 0f b9 3a 8b 8b 50 01 00 00 48 83 c4 60 85 c9 75 13 44 89 bb <4>[ 378.076485] RSP: 0018:ffffc9000607b548 EFLAGS: 00010002 <4>[ 378.076487] RAX: ffffffffa11fd651 RBX: ffff8883fccb0738 RCX: ffffffffa1187f20 <4>[ 378.076489] RDX: ffff888104b83b90 RSI: ffffffffa11fd651 RDI: ffffffffa1002f00 Oops#2 Part12 <4>[ 378.076490] RBP: ffffc9000607b630 R08: ffffffffa11fd6a1 R09: 0000000000000007 <4>[ 378.076491] R10: ffffffffa11fd752 R11: 0000000000000514 R12: ffff8883fccb07c8 <4>[ 378.076492] R13: 0000000000000001 R14: 0000000000000000 R15: 0000000000000001 <4>[ 378.076494] FS: 00007addb9db6980(0000) GS:ffff8888db21b000(0000) knlGS:0000000000000000 <4>[ 378.076495] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 378.076496] CR2: 00005c8278fce350 CR3: 00000003fc5bf005 CR4: 0000000000f72ef0 <4>[ 378.076498] PKRU: 55555554 <4>[ 378.076499] Call Trace: <4>[ 378.076501] <4>[ 378.076510] ? xe_guc_submit_enable+0xa8/0xf0 [xe] <4>[ 378.076587] xe_guc_ct_disable+0x17/0x80 [xe] <4>[ 378.076656] xe_guc_sanitize+0x2a/0x50 [xe] <4>[ 378.076722] xe_uc_load_hw+0x19a/0x2b0 [xe] <4>[ 378.076813] ? xe_migrate_init+0x277/0x2d0 [xe] <4>[ 378.076892] xe_gt_init+0x35d/0xab0 [xe] <4>[ 378.076956] ? _raw_spin_unlock_irqrestore+0x51/0x80 <4>[ 378.076961] ? __devm_add_action+0x70/0xa0 <4>[ 378.076968] ? xe_irq_install+0x11a/0x490 [xe] <4>[ 378.077105] xe_device_probe+0x32c/0xbe0 [xe] <4>[ 378.077232] ? __drm_dev_dbg+0x7d/0xb0 <4>[ 378.077239] ? __drmm_add_action_or_reset+0x1e/0x50 <4>[ 378.077248] xe_pci_probe+0x39b/0x620 [xe] <4>[ 378.077378] ? trace_hardirqs_on+0x22/0x100 <4>[ 378.077390] local_pci_probe+0x47/0xb0 <4>[ 378.077396] pci_call_probe+0x6c/0x360 <4>[ 378.077405] ? _raw_spin_unlock+0x22/0x50 <4>[ 378.077412] pci_device_probe+0xae/0x110 <4>[ 378.077418] really_probe+0xf1/0x410 <4>[ 378.077424] __driver_probe_device+0x8c/0x190 <4>[ 378.077428] device_driver_attach+0x57/0xd0 <4>[ 378.077433] bind_store+0x77/0xd0 Oops#2 Part11 <4>[ 378.077439] drv_attr_store+0x24/0x50 <4>[ 378.077442] sysfs_kf_write+0x4d/0x80 <4>[ 378.077449] kernfs_fop_write_iter+0x188/0x240 <4>[ 378.077456] vfs_write+0x283/0x540 <4>[ 378.077469] ksys_write+0x6f/0xf0 <4>[ 378.077474] __x64_sys_write+0x19/0x30 <4>[ 378.077477] x64_sys_call+0x259/0x26e0 <4>[ 378.077481] do_syscall_64+0xdd/0x1470 <4>[ 378.077489] ? __slab_free+0x129/0x2b0 <4>[ 378.077496] ? __pcs_replace_full_main+0x2ad/0x710 <4>[ 378.077502] ? putname+0x41/0x90 <4>[ 378.077505] ? kmem_cache_free+0x165/0x510 <4>[ 378.077513] ? putname+0x41/0x90 <4>[ 378.077516] ? do_sys_openat2+0x85/0xd0 <4>[ 378.077524] ? __x64_sys_openat+0x54/0xa0 <4>[ 378.077528] ? trace_hardirqs_on_prepare+0xe1/0x100 <4>[ 378.077533] ? do_syscall_64+0x22e/0x1470 <4>[ 378.077539] ? trace_hardirqs_on_prepare+0xe1/0x100 <4>[ 378.077544] ? do_syscall_64+0x22e/0x1470 <4>[ 378.077548] ? fput_close_sync+0x3d/0xa0 <4>[ 378.077552] ? trace_hardirqs_on_prepare+0xe1/0x100 <4>[ 378.077557] ? do_syscall_64+0x22e/0x1470 <4>[ 378.077560] ? do_syscall_64+0x22e/0x1470 <4>[ 378.077564] ? do_syscall_64+0x22e/0x1470 <4>[ 378.077567] ? exc_page_fault+0xbd/0x2c0 <4>[ 378.077573] entry_SYSCALL_64_after_hwframe+0x76/0x7e <4>[ 378.077575] RIP: 0033:0x7addbbf1c5a4 <4>[ 378.077579] Code: c7 00 16 00 00 00 b8 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 f3 0f 1e fa 80 3d a5 ea 0e 00 00 74 13 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 54 c3 0f 1f 00 55 48 89 e5 48 83 ec 20 48 89 <4>[ 378.077581] RSP: 002b:00007ffc9f4aca88 EFLAGS: 00000202 ORIG_RAX: 0000000000000001 <4>[ 378.077584] RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 00007addbbf1c5a4 Oops#2 Part10 <4>[ 378.077586] RDX: 000000000000000c RSI: 00007ffc9f4acf50 RDI: 0000000000000007 <4>[ 378.077588] RBP: 000000000000000c R08: 0000000000000073 R09: 0000000000000000 <4>[ 378.077590] R10: 0000000000000000 R11: 0000000000000202 R12: 00007ffc9f4acf50 <4>[ 378.077591] R13: 0000000000000007 R14: 0000000000000006 R15: 00007ffc9f4acc00 <4>[ 378.077605] <4>[ 378.077606] irq event stamp: 962426 <4>[ 378.077608] hardirqs last enabled at (962425): [] _raw_spin_unlock_irqrestore+0x51/0x80 <4>[ 378.077612] hardirqs last disabled at (962426): [] _raw_spin_lock_irq+0x6f/0x80 <4>[ 378.077615] softirqs last enabled at (962062): [] __irq_exit_rcu+0x13f/0x160 <4>[ 378.077619] softirqs last disabled at (962057): [] __irq_exit_rcu+0x13f/0x160 <4>[ 378.077621] ---[ end trace 0000000000000000 ]--- <7>[ 378.077624] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled <3>[ 378.077774] xe 0000:03:00.0: probe with driver xe failed with error -12 <3>[ 378.078426] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: GuC RC setup HOST_CONTROL(0) failed (-ENODEV) <7>[ 378.078788] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled <7>[ 378.080107] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled <7>[ 378.169245] xe 0000:03:00.0: [drm:drm_pagemap_cache_fini [drm_gpusvm_helper]] Destroying dpagemap cache. <7>[ 378.170567] xe 0000:03:00.0: [drm:drm_pagemap_shrinker_fini [drm_gpusvm_helper]] Destroying dpagemap shrinker. Oops#2 Part9 <3>[ 380.346538] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=50 recv=0 <1>[ 380.348373] BUG: unable to handle page fault for address: ffffc9002138a188 <1>[ 380.348393] #PF: supervisor write access in kernel mode <1>[ 380.348406] #PF: error_code(0x0002) - not-present page <6>[ 380.348419] PGD 100000067 P4D 100000067 PUD 100ad1067 PMD 0 <4>[ 380.348442] Oops: Oops: 0002 [#1] SMP NOPTI <4>[ 380.348459] CPU: 10 UID: 0 PID: 643 Comm: kworker/10:3 Tainted: G S U W 7.0.0-rc3-lgci-xe-xe-4688-7282a0941df77adea-debug+ #1 PREEMPT(lazy) <4>[ 380.348488] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN <4>[ 380.348498] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024 <4>[ 380.348513] Workqueue: xe-destroy-wq __guc_exec_queue_destroy_async [xe] <4>[ 380.348964] RIP: 0010:xe_mmio_write32+0x58/0x2b0 [xe] <4>[ 380.349444] Code: 24 66 90 65 8b 05 1c 4c 2e e3 48 0f a3 05 c0 b2 d0 e2 0f 82 1d 01 00 00 41 f7 c5 00 00 00 01 0f 84 b7 00 00 00 49 03 5c 24 08 <44> 89 3b 48 8d 65 d8 5b 41 5c 41 5d 41 5e 41 5f 5d 31 c0 31 d2 31 <4>[ 380.349480] RSP: 0018:ffffc900019037e0 EFLAGS: 00010086 <4>[ 380.349498] RAX: 0000000000000002 RBX: ffffc9002138a188 RCX: 0000000000000000 <4>[ 380.349516] RDX: 0000000000010001 RSI: 000000000000a188 RDI: ffff888417fd0060 <4>[ 380.349533] RBP: ffffc90001903858 R08: 0000000000000000 R09: 0000000000000000 <4>[ 380.349551] R10: ffff8883dfe50000 R11: 0000000000000001 R12: ffff888417fd0060 <4>[ 380.349567] R13: 000000000000a188 R14: ffff8883dfe50000 R15: 0000000000010001 <4>[ 380.349584] FS: 0000000000000000(0000) GS:ffff8888db19b000(0000) knlGS:0000000000000000 Oops#2 Part8 <4>[ 380.349604] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 380.349620] CR2: ffffc9002138a188 CR3: 000000000344c002 CR4: 0000000000f72ef0 <4>[ 380.349637] PKRU: 55555554 <4>[ 380.349647] Call Trace: <4>[ 380.349657] <4>[ 380.349676] xe_force_wake_get+0x2a5/0x940 [xe] <4>[ 380.350084] ? _raw_spin_unlock_irqrestore+0x27/0x80 <4>[ 380.350113] ? mark_held_locks+0x46/0x90 <4>[ 380.350137] send_tlb_inval_ggtt+0xfa/0x270 [xe] <4>[ 380.350565] ? trace_hardirqs_on+0x22/0x100 <4>[ 380.350576] ? _raw_spin_unlock_irq+0x27/0x70 <4>[ 380.350580] ? xe_tlb_inval_fence_prep+0xce/0x1e0 [xe] <4>[ 380.350668] xe_tlb_inval_ggtt+0x73/0x250 [xe] <4>[ 380.350752] ? xelpg_ggtt_pte_flags+0x27/0x1a0 [xe] <4>[ 380.350821] ? find_held_lock+0x31/0x90 <4>[ 380.350824] ? ggtt_node_remove+0xcb/0x140 [xe] <4>[ 380.350895] ggtt_invalidate_gt_tlb.part.0+0x1f/0xb0 [xe] <4>[ 380.350964] ggtt_node_remove+0x12c/0x140 [xe] <4>[ 380.351033] xe_ggtt_node_remove+0x40/0xa0 [xe] <4>[ 380.351103] xe_ggtt_remove_bo+0x87/0x250 [xe] <4>[ 380.351171] ? _raw_write_unlock+0x22/0x50 <4>[ 380.351175] ? drm_vma_offset_remove+0x65/0x80 <4>[ 380.351180] xe_ttm_bo_destroy+0xa2/0x2d0 [xe] <4>[ 380.351247] ? lock_is_held_type+0xa3/0x130 <4>[ 380.351252] ttm_bo_release+0x70/0x310 [ttm] <4>[ 380.351260] ? xe_ggtt_might_lock+0x29/0x60 [xe] <4>[ 380.351327] ? lock_release+0xd0/0x2b0 <4>[ 380.351331] ttm_bo_fini+0x3c/0x70 [ttm] <4>[ 380.351337] xe_gem_object_free+0x1a/0x30 [xe] <4>[ 380.351404] drm_gem_object_free+0x1d/0x40 <4>[ 380.351408] xe_bo_put+0x12a/0x190 [xe] Oops#2 Part7 <4>[ 380.351475] xe_lrc_destroy+0x49/0x90 [xe] <4>[ 380.351553] __xe_exec_queue_fini+0x6b/0xa0 [xe] <4>[ 380.351620] xe_exec_queue_fini+0x2b/0x60 [xe] <4>[ 380.351688] __guc_exec_queue_destroy_async+0x6c/0x1a0 [xe] <4>[ 380.351761] process_one_work+0x22e/0x740 <4>[ 380.351765] worker_thread+0x1e8/0x3d0 <4>[ 380.351768] ? __pfx_worker_thread+0x10/0x10 <4>[ 380.351770] kthread+0x10d/0x150 <4>[ 380.351774] ? __pfx_kthread+0x10/0x10 <4>[ 380.351777] ret_from_fork+0x3d4/0x480 <4>[ 380.351780] ? __pfx_kthread+0x10/0x10 <4>[ 380.351783] ret_from_fork_asm+0x1a/0x30 <4>[ 380.351788] <4>[ 380.351790] Modules linked in: xe snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_gsc_proxy mei_lb mei_gsc mtd_intel_dg drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp coretemp spi_nor mtd eeepc_wmi hid_generic asus_wmi sparse_keymap platform_profile mei_hdcp mei_pxp kvm_intel wmi_bmof kvm snd_intel_dspcfg irqbypass snd_hda_codec ghash_clmulni_intel aesni_intel r8169 snd_hda_core usbhid rapl binfmt_misc intel_cstate snd_hwdep i2c_i801 hid spi_intel_pci snd_pcm i2c_mux spi_intel realtek i2c_smbus snd_timer snd soundcore idma64 intel_pmc_core video pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class mei_me intel_pmc_ssram_telemetry mei intel_vsec wmi pinctrl_alderlake acpi_pad acpi_tad dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink Oops#2 Part6 <4>[ 380.351825] autofs4 [last unloaded: xe] <4>[ 380.351851] CR2: ffffc9002138a188 <4>[ 380.351854] ---[ end trace 0000000000000000 ]--- <4>[ 380.495014] RIP: 0010:xe_mmio_write32+0x58/0x2b0 [xe] <4>[ 380.495105] Code: 24 66 90 65 8b 05 1c 4c 2e e3 48 0f a3 05 c0 b2 d0 e2 0f 82 1d 01 00 00 41 f7 c5 00 00 00 01 0f 84 b7 00 00 00 49 03 5c 24 08 <44> 89 3b 48 8d 65 d8 5b 41 5c 41 5d 41 5e 41 5f 5d 31 c0 31 d2 31 <4>[ 380.495112] RSP: 0018:ffffc900019037e0 EFLAGS: 00010086 <4>[ 380.495115] RAX: 0000000000000002 RBX: ffffc9002138a188 RCX: 0000000000000000 <4>[ 380.495118] RDX: 0000000000010001 RSI: 000000000000a188 RDI: ffff888417fd0060 <4>[ 380.495121] RBP: ffffc90001903858 R08: 0000000000000000 R09: 0000000000000000 <4>[ 380.495124] R10: ffff8883dfe50000 R11: 0000000000000001 R12: ffff888417fd0060 <4>[ 380.495126] R13: 000000000000a188 R14: ffff8883dfe50000 R15: 0000000000010001 <4>[ 380.495129] FS: 0000000000000000(0000) GS:ffff8888db19b000(0000) knlGS:0000000000000000 <4>[ 380.495132] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 380.495135] CR2: ffffc9002138a188 CR3: 000000000344c002 CR4: 0000000000f72ef0 Oops#2 Part5 <4>[ 380.495138] PKRU: 55555554 <6>[ 380.495139] note: kworker/10:3[643] exited with irqs disabled <6>[ 380.495149] note: kworker/10:3[643] exited with preempt_count 1 <3>[ 382.649151] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=26229568 recv=0 <4>[ 382.649191] non-slab/vmalloc memory <4>[ 382.649205] ------------[ cut here ]------------ <4>[ 382.649215] list_del corruption. prev->next should be ffffc90001903a90, but was 103d48d44db60f44. (prev=ffffffff81391a0e) <4>[ 382.649234] WARNING: lib/list_debug.c:62 at __list_del_entry_valid_or_report+0xd9/0x120, CPU#8: kworker/u64:44/5962 <4>[ 382.649264] Modules linked in: xe snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_gsc_proxy mei_lb mei_gsc mtd_intel_dg drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp coretemp spi_nor mtd eeepc_wmi hid_generic asus_wmi sparse_keymap platform_profile mei_hdcp mei_pxp kvm_intel wmi_bmof kvm snd_intel_dspcfg irqbypass snd_hda_codec ghash_clmulni_intel aesni_intel r8169 snd_hda_core usbhid rapl binfmt_misc intel_cstate snd_hwdep i2c_i801 hid spi_intel_pci snd_pcm i2c_mux spi_intel realtek i2c_smbus snd_timer snd soundcore idma64 intel_pmc_core video pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class mei_me intel_pmc_ssram_telemetry mei intel_vsec wmi pinctrl_alderlake acpi_pad acpi_tad dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink Oops#2 Part4 <4>[ 382.649425] autofs4 [last unloaded: xe] <4>[ 382.649534] CPU: 8 UID: 0 PID: 5962 Comm: kworker/u64:44 Tainted: G S UD W 7.0.0-rc3-lgci-xe-xe-4688-7282a0941df77adea-debug+ #1 PREEMPT(lazy) <4>[ 382.649564] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [D]=DIE, [W]=WARN <4>[ 382.649576] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024 <4>[ 382.649591] Workqueue: gt-ordered-wq xe_tlb_inval_fence_timeout [xe] <4>[ 382.650063] RIP: 0010:__list_del_entry_valid_or_report+0xe3/0x120 <4>[ 382.650085] Code: b5 01 4c 89 ea 48 89 de 67 48 0f b9 3a 31 c0 eb 8b 4c 89 ef e8 be de 8e ff 48 8d 3d a7 29 b5 01 49 8b 55 00 4c 89 e9 48 89 de <67> 48 0f b9 3a 31 c0 e9 66 ff ff ff 4c 89 e7 e8 99 de 8e ff 48 8d <4>[ 382.650119] RSP: 0018:ffffc9000949bd58 EFLAGS: 00010046 <4>[ 382.650135] RAX: 0000000000000000 RBX: ffffc90001903a90 RCX: ffffffff81391a0e <4>[ 382.650152] RDX: 103d48d44db60f44 RSI: ffffc90001903a90 RDI: ffffffff839e1480 Oops#2 Part3 <4>[ 382.650167] RBP: ffffc9000949bd70 R08: 0000000000000000 R09: 0000000000000000 <4>[ 382.650183] R10: 0000000000000000 R11: 0000000000000000 R12: ffff888417fd0500 <4>[ 382.650198] R13: ffffffff81391a0e R14: 0000000190335d75 R15: ffff888417fd0480 <4>[ 382.650214] FS: 0000000000000000(0000) GS:ffff8888db09b000(0000) knlGS:0000000000000000 <4>[ 382.650234] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 382.650248] CR2: 00007c1b7ce8e000 CR3: 000000000344c006 CR4: 0000000000f72ef0 <4>[ 382.650264] PKRU: 55555554 <4>[ 382.650273] Call Trace: <4>[ 382.650283] <4>[ 382.650297] xe_tlb_inval_fence_signal+0x40/0x200 [xe] <4>[ 382.650770] xe_tlb_inval_fence_timeout+0xb9/0x220 [xe] <4>[ 382.651209] process_one_work+0x22e/0x740 <4>[ 382.651234] worker_thread+0x1e8/0x3d0 <4>[ 382.651249] ? __pfx_worker_thread+0x10/0x10 <4>[ 382.651264] kthread+0x10d/0x150 <4>[ 382.651281] ? __pfx_kthread+0x10/0x10 <4>[ 382.651300] ret_from_fork+0x3d4/0x480 <4>[ 382.651314] ? __pfx_kthread+0x10/0x10 <4>[ 382.651332] ret_from_fork_asm+0x1a/0x30 <4>[ 382.651358] <4>[ 382.651367] irq event stamp: 88556 <4>[ 382.651377] hardirqs last enabled at (88555): [] _raw_spin_unlock_irq+0x27/0x70 <4>[ 382.651406] hardirqs last disabled at (88556): [] __schedule+0x11e7/0x1dd0 <4>[ 382.651431] softirqs last enabled at (86974): [] __irq_exit_rcu+0x13f/0x160 <4>[ 382.651455] softirqs last disabled at (86967): [] __irq_exit_rcu+0x13f/0x160 <4>[ 382.651477] ---[ end trace 0000000000000000 ]--- <1>[ 382.651500] BUG: unable to handle page fault for address: ffffffff0afd0510 Oops#2 Part2 <1>[ 382.651515] #PF: supervisor read access in kernel mode <1>[ 382.651529] #PF: error_code(0x0000) - not-present page <6>[ 382.651542] PGD 344f067 P4D 344f067 PUD 0 <4>[ 382.651560] Oops: Oops: 0000 [#2] SMP NOPTI <4>[ 382.651575] CPU: 8 UID: 0 PID: 5962 Comm: kworker/u64:44 Tainted: G S UD W 7.0.0-rc3-lgci-xe-xe-4688-7282a0941df77adea-debug+ #1 PREEMPT(lazy) <4>[ 382.651606] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [D]=DIE, [W]=WARN <4>[ 382.651620] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024 <4>[ 382.651638] Workqueue: gt-ordered-wq xe_tlb_inval_fence_timeout [xe] <4>[ 382.652072] RIP: 0010:xe_tlb_inval_fence_signal+0x75/0x200 [xe] <4>[ 382.652498] Code: 48 8b 83 88 00 00 00 48 89 42 08 48 89 10 48 b8 00 01 00 00 00 00 ad de 48 89 83 80 00 00 00 48 83 c0 22 48 89 83 88 00 00 00 <49> 8b 95 b8 00 00 00 49 8d 85 b8 00 00 00 48 39 c2 0f 84 53 01 00 <4>[ 382.652532] RSP: 0018:ffffc9000949bd80 EFLAGS: 00010086 <4>[ 382.652547] RAX: dead000000000122 RBX: ffffc90001903a10 RCX: 0000000000000000 <4>[ 382.652563] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000 <4>[ 382.652578] RBP: ffffc9000949bda0 R08: 0000000000000000 R09: 0000000000000000 <4>[ 382.652593] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000 <4>[ 382.652608] R13: ffffffff0afd0458 R14: 0000000190335d75 R15: ffff888417fd0480 <4>[ 382.652624] FS: 0000000000000000(0000) GS:ffff8888db09b000(0000) knlGS:0000000000000000 <4>[ 382.652643] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 382.652657] CR2: ffffffff0afd0510 CR3: 000000000344c006 CR4: 0000000000f72ef0 <4>[ 382.652673] PKRU: 55555554 Oops#2 Part1 <4>[ 382.652682] Call Trace: <4>[ 382.652690] <4>[ 382.652701] xe_tlb_inval_fence_timeout+0xb9/0x220 [xe] <4>[ 382.653121] process_one_work+0x22e/0x740 <4>[ 382.653143] worker_thread+0x1e8/0x3d0 <4>[ 382.653157] ? __pfx_worker_thread+0x10/0x10 <4>[ 382.653171] kthread+0x10d/0x150 <4>[ 382.653187] ? __pfx_kthread+0x10/0x10 <4>[ 382.653205] ret_from_fork+0x3d4/0x480 <4>[ 382.653217] ? __pfx_kthread+0x10/0x10 <4>[ 382.653235] ret_from_fork_asm+0x1a/0x30 <4>[ 382.653260] <4>[ 382.653268] Modules linked in: xe snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_gsc_proxy mei_lb mei_gsc mtd_intel_dg drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp coretemp spi_nor mtd eeepc_wmi hid_generic asus_wmi sparse_keymap platform_profile mei_hdcp mei_pxp kvm_intel wmi_bmof kvm snd_intel_dspcfg irqbypass snd_hda_codec ghash_clmulni_intel aesni_intel r8169 snd_hda_core usbhid rapl binfmt_misc intel_cstate snd_hwdep i2c_i801 hid spi_intel_pci snd_pcm i2c_mux spi_intel realtek i2c_smbus snd_timer snd soundcore idma64 intel_pmc_core video pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class mei_me intel_pmc_ssram_telemetry mei intel_vsec wmi pinctrl_alderlake acpi_pad acpi_tad dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink <4>[ 382.653443] autofs4 [last unloaded: xe] <4>[ 382.653591] CR2: ffffffff0afd0510 <4>[ 382.653604] ---[ end trace 0000000000000000 ]---