Oops#2 Part16 <7>[ 117.497648] xe 0000:03:00.0: [drm:guc_print_params [xe]] Tile0: GT0: GuC param[10] = 0x00000000 <7>[ 117.497699] xe 0000:03:00.0: [drm:guc_print_params [xe]] Tile0: GT0: GuC param[11] = 0x00000000 <7>[ 117.497750] xe 0000:03:00.0: [drm:guc_print_params [xe]] Tile0: GT0: GuC param[12] = 0x00000000 <7>[ 117.497801] xe 0000:03:00.0: [drm:guc_print_params [xe]] Tile0: GT0: GuC param[13] = 0x00000000 <7>[ 117.497875] xe 0000:03:00.0: [drm:xe_guc_id_mgr_init [xe]] Tile0: GT0: using 65535 GuC IDs <7>[ 117.497950] xe 0000:03:00.0: [drm:xe_guc_db_mgr_init [xe]] Tile0: GT0: using 256 doorbells <7>[ 117.499040] xe 0000:03:00.0: [drm:guc_buf_cache_init [xe]] Tile0: GT0: reusable buffer with 2097152 dwords at 0xe8c000 for xe_guc_buf_cache_init_with_size [xe] <7>[ 117.500112] xe 0000:03:00.0: [drm:xe_migrate_init [xe]] Migrate min chunk size is 0x00010000 <7>[ 117.501112] xe 0000:03:00.0: [drm:xe_guc_capture_steered_list_init [xe]] Tile0: GT0: capture found 120 ext-regs. <7>[ 117.522916] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152) <7>[ 117.533705] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 10ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034 <7>[ 117.533981] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled <7>[ 117.534737] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC rcs0 WA job: 4146 dwords <7>[ 117.534815] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x5588] = 0x04000400 Oops#2 Part15 <7>[ 117.534880] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6204] = 0x00400040 <7>[ 117.534940] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6208] = 0x00200020 <7>[ 117.535004] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x62a8] = 0x02400240 <7>[ 117.535065] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x7010] = 0x40004000 <7>[ 117.535153] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x7044] = 0x04200420 <7>[ 117.535213] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x7300] = 0x10001000 <7>[ 117.535276] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x83a8] = 0x20002000 <7>[ 117.535342] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6210] = ~0x3f18000|0x3f18000 (MCR) <7>[ 117.537284] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC bcs0 WA job: 27 dwords <7>[ 117.537356] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x22204] = ~0x7e7e|0x606 <7>[ 117.537421] xe 0000:03:00.0: [drm:xe_lrc_emit_hwe_state_instructions [xe]] Tile0: GT0: No non-register state to emit on graphics ver 20.01 <7>[ 117.539831] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC ccs0 WA job: 0 dwords <7>[ 117.539940] xe 0000:03:00.0: [drm:xe_lrc_emit_hwe_state_instructions [xe]] Tile0: GT0: No non-register state to emit on graphics ver 20.01 <5>[ 117.541358] FAULT_INJECTION: forcing a failure. <5>[ 117.541358] name fail_function, interval 0, probability 100, space 1, times 100 Oops#2 Part14 <3>[ 117.541374] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: GuC PC query task state failed: -ENOMEM <4>[ 117.541499] ------------[ cut here ]------------ <4>[ 117.541501] xe 0000:03:00.0: [drm] Assertion `ct->g2h_outstanding == 0 || state == XE_GUC_CT_STATE_STOPPED` failed! <4>[ 117.541501] platform: BATTLEMAGE subplatform: 7 <4>[ 117.541501] graphics: Xe2_HPG 20.01 step A0 <4>[ 117.541501] media: Xe2_HPM 13.01 step A1 <4>[ 117.541501] tile: 0 VRAM 12.0 GiB <4>[ 117.541501] GT: 0 type 1 <4>[ 117.541504] WARNING: drivers/gpu/drm/xe/xe_guc_ct.c:541 at guc_ct_change_state+0x264/0x330 [xe], CPU#7: xe_fault_inject/2960 <4>[ 117.541572] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_gsc_proxy mei_lb mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp cmdlinepart hid_generic coretemp asus_nb_wmi spi_nor asus_wmi mei_hdcp mtd mei_pxp sparse_keymap platform_profile wmi_bmof kvm_intel usbhid kvm hid irqbypass ghash_clmulni_intel aesni_intel snd_intel_dspcfg rapl snd_hda_codec intel_cstate r8169 snd_hda_core video snd_hwdep realtek snd_pcm binfmt_misc snd_timer i2c_i801 idma64 spi_intel_pci i2c_mux mei_me snd spi_intel i2c_smbus soundcore mei intel_pmc_core pmt_telemetry pmt_discovery pmt_class nls_iso8859_1 intel_pmc_ssram_telemetry intel_vsec pinctrl_alderlake acpi_tad acpi_pad wmi dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink Oops#2 Part13 <4>[ 117.541627] autofs4 [last unloaded: snd_hda_intel] <4>[ 117.541630] CPU: 7 UID: 0 PID: 2960 Comm: xe_fault_inject Tainted: G S U W 7.0.0-rc3-lgci-xe-xe-4712-4082c266f2930288f-debug+ #1 PREEMPT(lazy) <4>[ 117.541633] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN <4>[ 117.541634] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024 <4>[ 117.541635] RIP: 0010:guc_ct_change_state+0x2d8/0x330 [xe] <4>[ 117.541701] Code: 51 48 c1 ea 25 44 6b ca 64 44 29 c9 51 48 c7 c1 d8 7f 18 a1 52 4c 8b 55 88 41 52 44 8b 4d 9c 4c 8b 45 90 48 8b 95 78 ff ff ff <67> 48 0f b9 3a 8b 8b 50 01 00 00 48 83 c4 60 85 c9 75 13 44 89 bb <4>[ 117.541703] RSP: 0018:ffffc90004c33628 EFLAGS: 00010002 <4>[ 117.541705] RAX: ffffffffa11fd88f RBX: ffff8881741c0738 RCX: ffffffffa1187fd8 <4>[ 117.541706] RDX: ffff888104d13010 RSI: ffffffffa11fd88f RDI: ffffffffa1002f20 Oops#2 Part12 <4>[ 117.541707] RBP: ffffc90004c33710 R08: ffffffffa11fd8df R09: 0000000000000007 <4>[ 117.541708] R10: ffffffffa11fd990 R11: 0000000000000514 R12: ffff8881741c07c8 <4>[ 117.541709] R13: 0000000000000001 R14: 0000000000000000 R15: 0000000000000001 <4>[ 117.541710] FS: 000075a5b948b980(0000) GS:ffff8888db01b000(0000) knlGS:0000000000000000 <4>[ 117.541712] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 117.541713] CR2: 0000600ac2df7bb8 CR3: 000000015534b003 CR4: 0000000000f72ef0 <4>[ 117.541714] PKRU: 55555554 <4>[ 117.541715] Call Trace: <4>[ 117.541716] <4>[ 117.541723] ? xe_guc_submit_enable+0xa8/0xf0 [xe] <4>[ 117.541793] xe_guc_ct_disable+0x17/0x80 [xe] <4>[ 117.541866] xe_guc_sanitize+0x2a/0x50 [xe] <4>[ 117.541928] xe_uc_load_hw+0x19a/0x2b0 [xe] <4>[ 117.542017] ? xe_migrate_init+0x277/0x2d0 [xe] <4>[ 117.542094] xe_gt_init+0x3ae/0xdd0 [xe] <4>[ 117.542170] ? _raw_spin_unlock_irqrestore+0x51/0x80 <4>[ 117.542175] ? __devm_add_action+0x70/0xa0 <4>[ 117.542179] ? xe_irq_install+0x11a/0x490 [xe] <4>[ 117.542263] xe_device_probe+0x32c/0xbe0 [xe] <4>[ 117.542338] ? __drm_dev_dbg+0x7d/0xb0 <4>[ 117.542342] ? __drmm_add_action_or_reset+0x1e/0x50 <4>[ 117.542348] xe_pci_probe+0x39b/0x620 [xe] <4>[ 117.542433] ? trace_hardirqs_on+0x22/0x100 <4>[ 117.542440] local_pci_probe+0x47/0xb0 <4>[ 117.542445] pci_call_probe+0x6c/0x360 <4>[ 117.542451] ? _raw_spin_unlock+0x22/0x50 <4>[ 117.542455] pci_device_probe+0xae/0x110 <4>[ 117.542459] really_probe+0xf1/0x410 <4>[ 117.542463] __driver_probe_device+0x8c/0x190 <4>[ 117.542466] device_driver_attach+0x57/0xd0 <4>[ 117.542469] bind_store+0x77/0xd0 Oops#2 Part11 <4>[ 117.542473] drv_attr_store+0x24/0x50 <4>[ 117.542475] sysfs_kf_write+0x4d/0x80 <4>[ 117.542480] kernfs_fop_write_iter+0x188/0x240 <4>[ 117.542485] vfs_write+0x283/0x540 <4>[ 117.542487] ? trace_hardirqs_on_prepare+0xe1/0x100 <4>[ 117.542495] ksys_write+0x6f/0xf0 <4>[ 117.542498] __x64_sys_write+0x19/0x30 <4>[ 117.542500] x64_sys_call+0x259/0x26e0 <4>[ 117.542503] do_syscall_64+0xdd/0x1470 <4>[ 117.542506] ? fput_close_sync+0x3d/0xa0 <4>[ 117.542509] ? trace_hardirqs_on_prepare+0xe1/0x100 <4>[ 117.542512] ? do_syscall_64+0x22e/0x1470 <4>[ 117.542515] ? do_syscall_64+0x22e/0x1470 <4>[ 117.542517] ? do_sys_openat2+0x85/0xd0 <4>[ 117.542523] ? __x64_sys_openat+0x54/0xa0 <4>[ 117.542525] ? trace_hardirqs_on_prepare+0xe1/0x100 <4>[ 117.542529] ? do_syscall_64+0x22e/0x1470 <4>[ 117.542532] ? trace_hardirqs_on_prepare+0xe1/0x100 <4>[ 117.542535] ? do_syscall_64+0x22e/0x1470 <4>[ 117.542538] ? __x64_sys_openat+0x54/0xa0 <4>[ 117.542541] ? trace_hardirqs_on_prepare+0xe1/0x100 <4>[ 117.542544] ? do_syscall_64+0x22e/0x1470 <4>[ 117.542547] ? do_syscall_64+0x22e/0x1470 <4>[ 117.542549] ? exc_page_fault+0xbd/0x2c0 <4>[ 117.542553] entry_SYSCALL_64_after_hwframe+0x76/0x7e <4>[ 117.542555] RIP: 0033:0x75a5bb71c5a4 <4>[ 117.542557] Code: c7 00 16 00 00 00 b8 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 f3 0f 1e fa 80 3d a5 ea 0e 00 00 74 13 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 54 c3 0f 1f 00 55 48 89 e5 48 83 ec 20 48 89 <4>[ 117.542559] RSP: 002b:00007ffe89aa2d58 EFLAGS: 00000202 ORIG_RAX: 0000000000000001 <4>[ 117.542561] RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 000075a5bb71c5a4 Oops#2 Part10 <4>[ 117.542562] RDX: 000000000000000c RSI: 00007ffe89aa3220 RDI: 0000000000000007 <4>[ 117.542563] RBP: 000000000000000c R08: 0000000000000073 R09: 0000000000000000 <4>[ 117.542564] R10: 0000000000000000 R11: 0000000000000202 R12: 00007ffe89aa3220 <4>[ 117.542565] R13: 0000000000000007 R14: 0000000000000006 R15: 00007ffe89aa2ed0 <4>[ 117.542573] <4>[ 117.542574] irq event stamp: 963438 <4>[ 117.542575] hardirqs last enabled at (963437): [] _raw_spin_unlock_irqrestore+0x51/0x80 <4>[ 117.542578] hardirqs last disabled at (963438): [] _raw_spin_lock_irq+0x6f/0x80 <4>[ 117.542581] softirqs last enabled at (961472): [] __irq_exit_rcu+0x13f/0x160 <4>[ 117.542584] softirqs last disabled at (961463): [] __irq_exit_rcu+0x13f/0x160 <4>[ 117.542586] ---[ end trace 0000000000000000 ]--- <7>[ 117.542588] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled <3>[ 117.542869] xe 0000:03:00.0: probe with driver xe failed with error -12 <3>[ 117.543444] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: GuC RC setup HOST_CONTROL(0) failed (-ENODEV) <7>[ 117.543805] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled <7>[ 117.545104] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled <7>[ 117.625329] xe 0000:03:00.0: [drm:drm_pagemap_cache_fini [drm_gpusvm_helper]] Destroying dpagemap cache. <7>[ 117.628181] xe 0000:03:00.0: [drm:drm_pagemap_shrinker_fini [drm_gpusvm_helper]] Destroying dpagemap shrinker. Oops#2 Part9 <3>[ 119.828876] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=51 recv=0 <1>[ 119.830736] BUG: unable to handle page fault for address: ffffc9000838a188 <1>[ 119.830770] #PF: supervisor write access in kernel mode <1>[ 119.830784] #PF: error_code(0x0002) - not-present page <6>[ 119.830796] PGD 100000067 P4D 100000067 PUD 100ac8067 PMD 0 <4>[ 119.830815] Oops: Oops: 0002 [#1] SMP NOPTI <4>[ 119.830832] CPU: 10 UID: 0 PID: 2863 Comm: kworker/10:6 Tainted: G S U W 7.0.0-rc3-lgci-xe-xe-4712-4082c266f2930288f-debug+ #1 PREEMPT(lazy) <4>[ 119.830860] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN <4>[ 119.830870] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024 <4>[ 119.830885] Workqueue: xe-destroy-wq __guc_exec_queue_destroy_async [xe] <4>[ 119.831336] RIP: 0010:xe_mmio_write32+0x58/0x2b0 [xe] <4>[ 119.831799] Code: 24 66 90 65 8b 05 6c 46 2e e3 48 0f a3 05 10 ad d0 e2 0f 82 1d 01 00 00 41 f7 c5 00 00 00 01 0f 84 b7 00 00 00 49 03 5c 24 08 <44> 89 3b 48 8d 65 d8 5b 41 5c 41 5d 41 5e 41 5f 5d 31 c0 31 d2 31 <4>[ 119.831835] RSP: 0018:ffffc90004a8b7e0 EFLAGS: 00010086 <4>[ 119.831851] RAX: 0000000000000002 RBX: ffffc9000838a188 RCX: 0000000000000000 <4>[ 119.831868] RDX: 0000000000010001 RSI: 000000000000a188 RDI: ffff8881727c8060 <4>[ 119.831884] RBP: ffffc90004a8b858 R08: 0000000000000000 R09: 0000000000000000 <4>[ 119.831899] R10: ffff888110a60000 R11: 0000000000000001 R12: ffff8881727c8060 <4>[ 119.831914] R13: 000000000000a188 R14: ffff888110a60000 R15: 0000000000010001 <4>[ 119.831931] FS: 0000000000000000(0000) GS:ffff8888db19b000(0000) knlGS:0000000000000000 Oops#2 Part8 <4>[ 119.831950] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 119.831964] CR2: ffffc9000838a188 CR3: 000000000344c005 CR4: 0000000000f72ef0 <4>[ 119.831980] PKRU: 55555554 <4>[ 119.831990] Call Trace: <4>[ 119.831999] <4>[ 119.832016] xe_force_wake_get+0x2a5/0x940 [xe] <4>[ 119.832416] ? _raw_spin_unlock_irqrestore+0x27/0x80 <4>[ 119.832445] ? mark_held_locks+0x46/0x90 <4>[ 119.832467] send_tlb_inval_ggtt+0xfa/0x270 [xe] <4>[ 119.832876] ? trace_hardirqs_on+0x22/0x100 <4>[ 119.832897] ? _raw_spin_unlock_irq+0x27/0x70 <4>[ 119.832915] ? xe_tlb_inval_fence_prep+0xce/0x1e0 [xe] <4>[ 119.833385] xe_tlb_inval_ggtt+0x73/0x250 [xe] <4>[ 119.833844] ? xelpg_ggtt_pte_flags+0x27/0x1a0 [xe] <4>[ 119.834234] ? find_held_lock+0x31/0x90 <4>[ 119.834249] ? ggtt_node_remove+0xcb/0x140 [xe] <4>[ 119.834647] ggtt_invalidate_gt_tlb.part.0+0x1f/0xb0 [xe] <4>[ 119.835037] ggtt_node_remove+0x12c/0x140 [xe] <4>[ 119.835424] xe_ggtt_node_remove+0x40/0xa0 [xe] <4>[ 119.835827] xe_ggtt_remove_bo+0x87/0x250 [xe] <4>[ 119.836008] ? _raw_write_unlock+0x22/0x50 <4>[ 119.836011] ? drm_vma_offset_remove+0x65/0x80 <4>[ 119.836016] xe_ttm_bo_destroy+0xa2/0x2d0 [xe] <4>[ 119.836083] ? lock_is_held_type+0xa3/0x130 <4>[ 119.836088] ttm_bo_release+0x70/0x310 [ttm] <4>[ 119.836094] ? xe_ggtt_might_lock+0x29/0x60 [xe] <4>[ 119.836161] ? lock_release+0xd0/0x2b0 <4>[ 119.836165] ttm_bo_fini+0x3c/0x70 [ttm] <4>[ 119.836170] xe_gem_object_free+0x1a/0x30 [xe] <4>[ 119.836236] drm_gem_object_free+0x1d/0x40 <4>[ 119.836240] xe_bo_put+0x12a/0x190 [xe] Oops#2 Part7 <4>[ 119.836307] xe_lrc_destroy+0x74/0x90 [xe] <4>[ 119.836380] __xe_exec_queue_fini+0x6b/0xa0 [xe] <4>[ 119.836449] xe_exec_queue_fini+0x2b/0x60 [xe] <4>[ 119.836517] __guc_exec_queue_destroy_async+0x6c/0x1a0 [xe] <4>[ 119.836590] process_one_work+0x22e/0x740 <4>[ 119.836594] worker_thread+0x1e8/0x3d0 <4>[ 119.836596] ? __pfx_worker_thread+0x10/0x10 <4>[ 119.836599] kthread+0x10d/0x150 <4>[ 119.836602] ? __pfx_kthread+0x10/0x10 <4>[ 119.836605] ret_from_fork+0x3d4/0x480 <4>[ 119.836608] ? __pfx_kthread+0x10/0x10 <4>[ 119.836611] ret_from_fork_asm+0x1a/0x30 <4>[ 119.836616] <4>[ 119.836618] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_gsc_proxy mei_lb mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp cmdlinepart hid_generic coretemp asus_nb_wmi spi_nor asus_wmi mei_hdcp mtd mei_pxp sparse_keymap platform_profile wmi_bmof kvm_intel usbhid kvm hid irqbypass ghash_clmulni_intel aesni_intel snd_intel_dspcfg rapl snd_hda_codec intel_cstate r8169 snd_hda_core video snd_hwdep realtek snd_pcm binfmt_misc snd_timer i2c_i801 idma64 spi_intel_pci i2c_mux mei_me snd spi_intel i2c_smbus soundcore mei intel_pmc_core pmt_telemetry pmt_discovery pmt_class nls_iso8859_1 intel_pmc_ssram_telemetry intel_vsec pinctrl_alderlake acpi_tad acpi_pad wmi dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink Oops#2 Part6 <4>[ 119.836653] autofs4 [last unloaded: snd_hda_intel] <4>[ 119.836680] CR2: ffffc9000838a188 <4>[ 119.836682] ---[ end trace 0000000000000000 ]--- <4>[ 122.859742] RIP: 0010:xe_mmio_write32+0x58/0x2b0 [xe] <4>[ 122.859843] Code: 24 66 90 65 8b 05 6c 46 2e e3 48 0f a3 05 10 ad d0 e2 0f 82 1d 01 00 00 41 f7 c5 00 00 00 01 0f 84 b7 00 00 00 49 03 5c 24 08 <44> 89 3b 48 8d 65 d8 5b 41 5c 41 5d 41 5e 41 5f 5d 31 c0 31 d2 31 <4>[ 122.859850] RSP: 0018:ffffc90004a8b7e0 EFLAGS: 00010086 <4>[ 122.859853] RAX: 0000000000000002 RBX: ffffc9000838a188 RCX: 0000000000000000 <4>[ 122.859857] RDX: 0000000000010001 RSI: 000000000000a188 RDI: ffff8881727c8060 <4>[ 122.859860] RBP: ffffc90004a8b858 R08: 0000000000000000 R09: 0000000000000000 <4>[ 122.859862] R10: ffff888110a60000 R11: 0000000000000001 R12: ffff8881727c8060 <4>[ 122.859865] R13: 000000000000a188 R14: ffff888110a60000 R15: 0000000000010001 <4>[ 122.859868] FS: 0000000000000000(0000) GS:ffff8888db19b000(0000) knlGS:0000000000000000 <4>[ 122.859871] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 122.859874] CR2: ffffc9000838a188 CR3: 000000000344c005 CR4: 0000000000f72ef0 Oops#2 Part5 <4>[ 122.859877] PKRU: 55555554 <6>[ 122.859879] note: kworker/10:6[2863] exited with irqs disabled <6>[ 122.859902] note: kworker/10:6[2863] exited with preempt_count 1 <3>[ 122.860008] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=78166848 recv=0 <4>[ 122.860019] non-slab/vmalloc memory <4>[ 122.860023] ------------[ cut here ]------------ <4>[ 122.860027] list_del corruption. prev->next should be ffffc90004a8ba90, but was 103d48d44db60f44. (prev=ffffffff81391a0e) <4>[ 122.860033] WARNING: lib/list_debug.c:62 at __list_del_entry_valid_or_report+0xd9/0x120, CPU#13: kworker/u64:41/2336 <4>[ 122.860043] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_gsc_proxy mei_lb mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp cmdlinepart hid_generic coretemp asus_nb_wmi spi_nor asus_wmi mei_hdcp mtd mei_pxp sparse_keymap platform_profile wmi_bmof kvm_intel usbhid kvm hid irqbypass ghash_clmulni_intel aesni_intel snd_intel_dspcfg rapl snd_hda_codec intel_cstate r8169 snd_hda_core video snd_hwdep realtek snd_pcm binfmt_misc snd_timer i2c_i801 idma64 spi_intel_pci i2c_mux mei_me snd spi_intel i2c_smbus soundcore mei intel_pmc_core pmt_telemetry pmt_discovery pmt_class nls_iso8859_1 intel_pmc_ssram_telemetry intel_vsec pinctrl_alderlake acpi_tad acpi_pad wmi dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink Oops#2 Part4 <4>[ 122.860091] autofs4 [last unloaded: snd_hda_intel] <4>[ 122.860128] CPU: 13 UID: 0 PID: 2336 Comm: kworker/u64:41 Tainted: G S UD W 7.0.0-rc3-lgci-xe-xe-4712-4082c266f2930288f-debug+ #1 PREEMPT(lazy) <4>[ 122.860137] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [D]=DIE, [W]=WARN <4>[ 122.860141] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024 <4>[ 122.860146] Workqueue: gt-ordered-wq xe_tlb_inval_fence_timeout [xe] <4>[ 122.860297] RIP: 0010:__list_del_entry_valid_or_report+0xe3/0x120 <4>[ 122.860303] Code: b5 01 4c 89 ea 48 89 de 67 48 0f b9 3a 31 c0 eb 8b 4c 89 ef e8 7e dd 8e ff 48 8d 3d 77 28 b5 01 49 8b 55 00 4c 89 e9 48 89 de <67> 48 0f b9 3a 31 c0 e9 66 ff ff ff 4c 89 e7 e8 59 dd 8e ff 48 8d <4>[ 122.860311] RSP: 0018:ffffc90004823d58 EFLAGS: 00010046 <4>[ 122.860315] RAX: 0000000000000000 RBX: ffffc90004a8ba90 RCX: ffffffff81391a0e <4>[ 122.860319] RDX: 103d48d44db60f44 RSI: ffffc90004a8ba90 RDI: ffffffff839e1490 <4>[ 122.860322] RBP: ffffc90004823d70 R08: 0000000000000000 R09: 0000000000000000 Oops#2 Part3 <4>[ 122.860326] R10: 0000000000000000 R11: 0000000000000000 R12: ffff8881727c8500 <4>[ 122.860330] R13: ffffffff81391a0e R14: 0000000193f19435 R15: ffff8881727c8480 <4>[ 122.860334] FS: 0000000000000000(0000) GS:ffff8888db31b000(0000) knlGS:0000000000000000 <4>[ 122.860338] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 122.860342] CR2: 000079b760eb4000 CR3: 0000000130969006 CR4: 0000000000f72ef0 <4>[ 122.860346] PKRU: 55555554 <4>[ 122.860348] Call Trace: <4>[ 122.860350] <4>[ 122.860354] xe_tlb_inval_fence_signal+0x40/0x200 [xe] <4>[ 122.860482] xe_tlb_inval_fence_timeout+0xb9/0x220 [xe] <4>[ 122.860607] process_one_work+0x22e/0x740 <4>[ 122.860614] worker_thread+0x1e8/0x3d0 <4>[ 122.860617] ? __pfx_worker_thread+0x10/0x10 <4>[ 122.860621] kthread+0x10d/0x150 <4>[ 122.860625] ? __pfx_kthread+0x10/0x10 <4>[ 122.860630] ret_from_fork+0x3d4/0x480 <4>[ 122.860634] ? __pfx_kthread+0x10/0x10 <4>[ 122.860638] ret_from_fork_asm+0x1a/0x30 <4>[ 122.860644] <4>[ 122.860646] irq event stamp: 20422 <4>[ 122.860649] hardirqs last enabled at (20421): [] irqentry_exit+0x6a/0x780 <4>[ 122.860656] hardirqs last disabled at (20422): [] __schedule+0x11e7/0x1dd0 <4>[ 122.860663] softirqs last enabled at (20368): [] neigh_periodic_work+0x27e/0x360 <4>[ 122.860669] softirqs last disabled at (20364): [] neigh_periodic_work+0x38/0x360 <4>[ 122.860674] ---[ end trace 0000000000000000 ]--- <1>[ 122.860681] BUG: unable to handle page fault for address: ffffffff0a7c8510 <1>[ 122.860685] #PF: supervisor read access in kernel mode Oops#2 Part2 <1>[ 122.860688] #PF: error_code(0x0000) - not-present page <6>[ 122.860691] PGD 344f067 P4D 344f067 PUD 0 <4>[ 122.860695] Oops: Oops: 0000 [#2] SMP NOPTI <4>[ 122.860699] CPU: 13 UID: 0 PID: 2336 Comm: kworker/u64:41 Tainted: G S UD W 7.0.0-rc3-lgci-xe-xe-4712-4082c266f2930288f-debug+ #1 PREEMPT(lazy) <4>[ 122.860706] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [D]=DIE, [W]=WARN <4>[ 122.860710] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024 <4>[ 122.860714] Workqueue: gt-ordered-wq xe_tlb_inval_fence_timeout [xe] <4>[ 122.860842] RIP: 0010:xe_tlb_inval_fence_signal+0x75/0x200 [xe] <4>[ 122.860970] Code: 48 8b 83 88 00 00 00 48 89 42 08 48 89 10 48 b8 00 01 00 00 00 00 ad de 48 89 83 80 00 00 00 48 83 c0 22 48 89 83 88 00 00 00 <49> 8b 95 b8 00 00 00 49 8d 85 b8 00 00 00 48 39 c2 0f 84 53 01 00 <4>[ 122.860979] RSP: 0018:ffffc90004823d80 EFLAGS: 00010086 <4>[ 122.860982] RAX: dead000000000122 RBX: ffffc90004a8ba10 RCX: 0000000000000000 <4>[ 122.860986] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000 <4>[ 122.860990] RBP: ffffc90004823da0 R08: 0000000000000000 R09: 0000000000000000 <4>[ 122.860993] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000 <4>[ 122.860997] R13: ffffffff0a7c8458 R14: 0000000193f19435 R15: ffff8881727c8480 <4>[ 122.861001] FS: 0000000000000000(0000) GS:ffff8888db31b000(0000) knlGS:0000000000000000 <4>[ 122.861005] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 122.861008] CR2: ffffffff0a7c8510 CR3: 0000000130969006 CR4: 0000000000f72ef0 <4>[ 122.861012] PKRU: 55555554 <4>[ 122.861014] Call Trace: Oops#2 Part1 <4>[ 122.861016] <4>[ 122.861019] xe_tlb_inval_fence_timeout+0xb9/0x220 [xe] <4>[ 122.861146] process_one_work+0x22e/0x740 <4>[ 122.861151] worker_thread+0x1e8/0x3d0 <4>[ 122.861155] ? __pfx_worker_thread+0x10/0x10 <4>[ 122.861158] kthread+0x10d/0x150 <4>[ 122.861162] ? __pfx_kthread+0x10/0x10 <4>[ 122.861166] ret_from_fork+0x3d4/0x480 <4>[ 122.861170] ? __pfx_kthread+0x10/0x10 <4>[ 122.861174] ret_from_fork_asm+0x1a/0x30 <4>[ 122.861180] <4>[ 122.861181] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_gsc_proxy mei_lb mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp cmdlinepart hid_generic coretemp asus_nb_wmi spi_nor asus_wmi mei_hdcp mtd mei_pxp sparse_keymap platform_profile wmi_bmof kvm_intel usbhid kvm hid irqbypass ghash_clmulni_intel aesni_intel snd_intel_dspcfg rapl snd_hda_codec intel_cstate r8169 snd_hda_core video snd_hwdep realtek snd_pcm binfmt_misc snd_timer i2c_i801 idma64 spi_intel_pci i2c_mux mei_me snd spi_intel i2c_smbus soundcore mei intel_pmc_core pmt_telemetry pmt_discovery pmt_class nls_iso8859_1 intel_pmc_ssram_telemetry intel_vsec pinctrl_alderlake acpi_tad acpi_pad wmi dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink <4>[ 122.861220] autofs4 [last unloaded: snd_hda_intel] <4>[ 122.861255] CR2: ffffffff0a7c8510 <4>[ 122.861259] ---[ end trace 0000000000000000 ]---