Oops#2 Part9 <3>[ 331.822280] xe 0000:03:00.0: probe with driver xe failed with error -12 <4>[ 331.822282] drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp hid_generic cmdlinepart coretemp spi_nor asus_nb_wmi mei_pxp mei_hdcp asus_wmi mtd sparse_keymap platform_profile wmi_bmof kvm_intel usbhid kvm hid irqbypass ghash_clmulni_intel aesni_intel rapl r8169 intel_cstate binfmt_misc snd_intel_dspcfg snd_hda_codec video snd_hda_core realtek snd_hwdep snd_pcm i2c_i801 snd_timer idma64 mei_me snd i2c_mux spi_intel_pci soundcore i2c_smbus spi_intel mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry intel_vsec acpi_tad acpi_pad wmi pinctrl_alderlake dm_multipath msr nvme_fabrics efi_pstore fuse nfnetlink autofs4 [last unloaded: snd_hda_intel] <4>[ 331.822413] CPU: 10 UID: 0 PID: 2404 Comm: kworker/10:4 Tainted: G S U W N 7.0.0-rc3-lgci-xe-xe-4698-9a6bac4a4a289d3ac-debug+ #1 PREEMPT(lazy) <4>[ 331.822416] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN, [N]=TEST <4>[ 331.822417] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 0812 02/24/2023 <4>[ 331.822419] Workqueue: xe-destroy-wq __guc_exec_queue_destroy_async [xe] <4>[ 331.822496] RIP: 0010:ggtt_invalidate_gt_tlb.part.0+0x81/0xb0 [xe] <4>[ 331.822559] Code: 48 8b 7f 08 4c 8b 77 50 4d 85 f6 75 03 4c 8b 37 e8 74 99 62 e1 48 89 c6 48 8d 3d 2a c7 3d 00 4d 89 e1 45 89 e8 89 d9 4c 89 f2 <67> 48 0f b9 3a 5b 41 5c 41 5d 41 5e 5d 31 c0 31 d2 31 c9 31 f6 31 Oops#2 Part8 <4>[ 331.822561] RSP: 0018:ffffc900025e7af0 EFLAGS: 00010246 <4>[ 331.822563] RAX: ffffffffa11fd819 RBX: 0000000000000000 RCX: 0000000000000000 <4>[ 331.822564] RDX: ffff8881017dcf90 RSI: ffffffffa11fd819 RDI: ffffffffa1001fe0 <4>[ 331.822566] RBP: ffffc900025e7b10 R08: 0000000000000000 R09: ffffffffffffffed <4>[ 331.822567] R10: 0000000000000000 R11: 0000000000000000 R12: ffffffffffffffed <4>[ 331.822568] R13: 0000000000000000 R14: ffff8881017dcf90 R15: 0000000000000000 <4>[ 331.822569] FS: 0000000000000000(0000) GS:ffff8888db19b000(0000) knlGS:0000000000000000 <4>[ 331.822571] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 331.822572] CR2: 00005ead9f3ef570 CR3: 0000000125601000 CR4: 0000000000f52ef0 <4>[ 331.822573] PKRU: 55555554 <4>[ 331.822574] Call Trace: <4>[ 331.822575] <4>[ 331.822578] ggtt_node_remove+0x11a/0x140 [xe] <4>[ 331.822643] xe_ggtt_node_remove+0x40/0xa0 [xe] <4>[ 331.822705] xe_ggtt_remove_bo+0x87/0x250 [xe] <4>[ 331.822769] ? _raw_write_unlock+0x22/0x50 <4>[ 331.822773] ? drm_vma_offset_remove+0x65/0x80 <4>[ 331.822779] xe_ttm_bo_destroy+0xa2/0x2d0 [xe] <4>[ 331.822835] ? lock_is_held_type+0xa3/0x130 <4>[ 331.822841] ttm_bo_release+0x70/0x310 [ttm] <4>[ 331.822848] ? xe_ggtt_might_lock+0x29/0x60 [xe] <3>[ 331.822909] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: GuC RC setup HOST_CONTROL(0) failed (-ENODEV) <4>[ 331.822909] ? lock_release+0xd0/0x2b0 <4>[ 331.822923] ttm_bo_fini+0x3c/0x70 [ttm] <4>[ 331.822929] xe_gem_object_free+0x1a/0x30 [xe] Oops#2 Part7 <4>[ 331.822983] drm_gem_object_free+0x1d/0x40 <4>[ 331.822986] xe_bo_put+0x12a/0x190 [xe] <4>[ 331.823044] xe_lrc_destroy+0x49/0x90 [xe] <4>[ 331.823121] __xe_exec_queue_fini+0x6b/0xa0 [xe] <4>[ 331.823180] xe_exec_queue_fini+0x2b/0x60 [xe] <4>[ 331.823240] __guc_exec_queue_destroy_async+0x6c/0x1a0 [xe] <4>[ 331.823324] process_one_work+0x22e/0x740 <7>[ 331.823279] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled <4>[ 331.823331] worker_thread+0x1e8/0x3d0 <4>[ 331.823333] ? __pfx_worker_thread+0x10/0x10 <4>[ 331.823336] kthread+0x10d/0x150 <4>[ 331.823339] ? __pfx_kthread+0x10/0x10 <4>[ 331.823342] ret_from_fork+0x3d4/0x480 <4>[ 331.823344] ? __pfx_kthread+0x10/0x10 <4>[ 331.823347] ret_from_fork_asm+0x1a/0x30 <4>[ 331.823354] <4>[ 331.823355] irq event stamp: 29071 <4>[ 331.823356] hardirqs last enabled at (29077): [] __up_console_sem+0x79/0xa0 <4>[ 331.823359] hardirqs last disabled at (29082): [] __up_console_sem+0x5e/0xa0 <4>[ 331.823361] softirqs last enabled at (28246): [] __irq_exit_rcu+0x13f/0x160 <4>[ 331.823363] softirqs last disabled at (28241): [] __irq_exit_rcu+0x13f/0x160 <4>[ 331.823365] ---[ end trace 0000000000000000 ]--- <7>[ 331.824657] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled <7>[ 331.918460] xe 0000:03:00.0: [drm:drm_pagemap_cache_fini [drm_gpusvm_helper]] Destroying dpagemap cache. <7>[ 331.924579] xe 0000:03:00.0: [drm:drm_pagemap_shrinker_fini [drm_gpusvm_helper]] Destroying dpagemap shrinker. Oops#2 Part6 <3>[ 334.124868] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=48 recv=0 <1>[ 334.125970] BUG: unable to handle page fault for address: ffffc9000638a188 <1>[ 334.126002] #PF: supervisor write access in kernel mode <1>[ 334.126018] #PF: error_code(0x0002) - not-present page <6>[ 334.126032] PGD 100000067 P4D 100000067 PUD 100ac1067 PMD 0 <4>[ 334.126056] Oops: Oops: 0002 [#1] SMP NOPTI <4>[ 334.126075] CPU: 13 UID: 0 PID: 2780 Comm: kworker/13:9 Tainted: G S U W N 7.0.0-rc3-lgci-xe-xe-4698-9a6bac4a4a289d3ac-debug+ #1 PREEMPT(lazy) <4>[ 334.126113] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN, [N]=TEST <4>[ 334.126129] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 0812 02/24/2023 <4>[ 334.126150] Workqueue: xe-destroy-wq __guc_exec_queue_destroy_async [xe] <4>[ 334.126653] RIP: 0010:xe_mmio_write32+0x58/0x2b0 [xe] <4>[ 334.127148] Code: 24 66 90 65 8b 05 2c 48 2e e3 48 0f a3 05 d0 ae d0 e2 0f 82 1d 01 00 00 41 f7 c5 00 00 00 01 0f 84 b7 00 00 00 49 03 5c 24 08 <44> 89 3b 48 8d 65 d8 5b 41 5c 41 5d 41 5e 41 5f 5d 31 c0 31 d2 31 <4>[ 334.127183] RSP: 0018:ffffc90002cff7e0 EFLAGS: 00010086 <4>[ 334.127201] RAX: 0000000000000002 RBX: ffffc9000638a188 RCX: 0000000000000000 <4>[ 334.127219] RDX: 0000000000010001 RSI: 000000000000a188 RDI: ffff88810ebe8060 <4>[ 334.127237] RBP: ffffc90002cff858 R08: 0000000000000000 R09: 0000000000000000 <4>[ 334.127254] R10: ffff88814b8d0000 R11: 0000000000000001 R12: ffff88810ebe8060 <4>[ 334.127269] R13: 000000000000a188 R14: ffff88814b8d0000 R15: 0000000000010001 <4>[ 334.127287] FS: 0000000000000000(0000) GS:ffff8888db31b000(0000) knlGS:0000000000000000 Oops#2 Part5 <4>[ 334.127309] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 334.127325] CR2: ffffc9000638a188 CR3: 000000000344c000 CR4: 0000000000f52ef0 <4>[ 334.127343] PKRU: 55555554 <4>[ 334.127354] Call Trace: <4>[ 334.127365] <4>[ 334.127384] xe_force_wake_get+0x2a5/0x940 [xe] <4>[ 334.127805] ? _raw_spin_unlock_irqrestore+0x27/0x80 <4>[ 334.127834] ? mark_held_locks+0x46/0x90 <4>[ 334.127858] send_tlb_inval_ggtt+0xfa/0x270 [xe] <4>[ 334.128319] ? trace_hardirqs_on+0x22/0x100 <4>[ 334.128340] ? _raw_spin_unlock_irq+0x27/0x70 <4>[ 334.128359] ? xe_tlb_inval_fence_prep+0xce/0x1e0 [xe] <4>[ 334.128884] xe_tlb_inval_ggtt+0x73/0x250 [xe] <4>[ 334.129400] ? xelpg_ggtt_pte_flags+0x27/0x1a0 [xe] <4>[ 334.129815] ? find_held_lock+0x31/0x90 <4>[ 334.129831] ? ggtt_node_remove+0xcb/0x140 [xe] <4>[ 334.130255] ggtt_invalidate_gt_tlb.part.0+0x1f/0xb0 [xe] <4>[ 334.130672] ggtt_node_remove+0x12c/0x140 [xe] <4>[ 334.131088] xe_ggtt_node_remove+0x40/0xa0 [xe] <4>[ 334.131505] xe_ggtt_remove_bo+0x87/0x250 [xe] <4>[ 334.131830] ? _raw_write_unlock+0x22/0x50 <4>[ 334.131835] ? drm_vma_offset_remove+0x65/0x80 <4>[ 334.131841] xe_ttm_bo_destroy+0xa2/0x2d0 [xe] <4>[ 334.131938] ? lock_is_held_type+0xa3/0x130 <4>[ 334.131944] ttm_bo_release+0x70/0x310 [ttm] <4>[ 334.131954] ? xe_ggtt_might_lock+0x29/0x60 [xe] <4>[ 334.132054] ? lock_release+0xd0/0x2b0 <4>[ 334.132059] ttm_bo_fini+0x3c/0x70 [ttm] <4>[ 334.132068] xe_gem_object_free+0x1a/0x30 [xe] <4>[ 334.132163] drm_gem_object_free+0x1d/0x40 <4>[ 334.132168] xe_bo_put+0x12a/0x190 [xe] Oops#2 Part4 <4>[ 334.132264] xe_lrc_destroy+0x49/0x90 [xe] <4>[ 334.132378] __xe_exec_queue_fini+0x6b/0xa0 [xe] <4>[ 334.132477] xe_exec_queue_fini+0x2b/0x60 [xe] <4>[ 334.132576] __guc_exec_queue_destroy_async+0x6c/0x1a0 [xe] <4>[ 334.132685] process_one_work+0x22e/0x740 <4>[ 334.132691] worker_thread+0x1e8/0x3d0 <4>[ 334.132695] ? __pfx_worker_thread+0x10/0x10 <4>[ 334.132698] kthread+0x10d/0x150 <4>[ 334.132702] ? __pfx_kthread+0x10/0x10 <4>[ 334.132707] ret_from_fork+0x3d4/0x480 <4>[ 334.132711] ? __pfx_kthread+0x10/0x10 <4>[ 334.132715] ret_from_fork_asm+0x1a/0x30 <4>[ 334.132722] <4>[ 334.132724] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mei_gsc mtd_intel_dg xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp hid_generic cmdlinepart coretemp spi_nor asus_nb_wmi mei_pxp mei_hdcp asus_wmi mtd sparse_keymap platform_profile wmi_bmof kvm_intel usbhid kvm hid irqbypass ghash_clmulni_intel aesni_intel rapl r8169 intel_cstate binfmt_misc snd_intel_dspcfg snd_hda_codec video snd_hda_core realtek snd_hwdep snd_pcm i2c_i801 snd_timer idma64 mei_me snd i2c_mux spi_intel_pci soundcore i2c_smbus spi_intel mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry intel_vsec acpi_tad acpi_pad wmi pinctrl_alderlake dm_multipath msr nvme_fabrics efi_pstore fuse nfnetlink Oops#2 Part3 <4>[ 334.132767] autofs4 [last unloaded: snd_hda_intel] <4>[ 334.132802] CR2: ffffc9000638a188 <4>[ 334.132805] ---[ end trace 0000000000000000 ]--- <4>[ 334.309294] RIP: 0010:xe_mmio_write32+0x58/0x2b0 [xe] <4>[ 334.309435] Code: 24 66 90 65 8b 05 2c 48 2e e3 48 0f a3 05 d0 ae d0 e2 0f 82 1d 01 00 00 41 f7 c5 00 00 00 01 0f 84 b7 00 00 00 49 03 5c 24 08 <44> 89 3b 48 8d 65 d8 5b 41 5c 41 5d 41 5e 41 5f 5d 31 c0 31 d2 31 <4>[ 334.309444] RSP: 0018:ffffc90002cff7e0 EFLAGS: 00010086 <4>[ 334.309448] RAX: 0000000000000002 RBX: ffffc9000638a188 RCX: 0000000000000000 <4>[ 334.309452] RDX: 0000000000010001 RSI: 000000000000a188 RDI: ffff88810ebe8060 <4>[ 334.309456] RBP: ffffc90002cff858 R08: 0000000000000000 R09: 0000000000000000 <4>[ 334.309459] R10: ffff88814b8d0000 R11: 0000000000000001 R12: ffff88810ebe8060 <4>[ 334.309463] R13: 000000000000a188 R14: ffff88814b8d0000 R15: 0000000000010001 <4>[ 334.309467] FS: 0000000000000000(0000) GS:ffff8888db31b000(0000) knlGS:0000000000000000 <4>[ 334.309472] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 334.309475] CR2: ffffc9000638a188 CR3: 000000000344c000 CR4: 0000000000f52ef0 <4>[ 334.309479] PKRU: 55555554 <6>[ 334.309482] note: kworker/13:9[2780] exited with irqs disabled <6>[ 334.309497] note: kworker/13:9[2780] exited with preempt_count 1 <1>[ 336.423465] BUG: unable to handle page fault for address: ffffc90002cffa90 Oops#2 Part2 <1>[ 336.423496] #PF: supervisor read access in kernel mode <1>[ 336.423508] #PF: error_code(0x0000) - not-present page <6>[ 336.423520] PGD 100000067 P4D 100000067 PUD 100ac1067 PMD 11bb0a067 PTE 0 <4>[ 336.423543] Oops: Oops: 0000 [#2] SMP NOPTI <4>[ 336.423559] CPU: 6 UID: 0 PID: 121 Comm: kworker/u64:2 Tainted: G S UD W N 7.0.0-rc3-lgci-xe-xe-4698-9a6bac4a4a289d3ac-debug+ #1 PREEMPT(lazy) <4>[ 336.423587] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [D]=DIE, [W]=WARN, [N]=TEST <4>[ 336.423600] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 0812 02/24/2023 <4>[ 336.423615] Workqueue: gt-ordered-wq xe_tlb_inval_fence_timeout [xe] <4>[ 336.424133] RIP: 0010:xe_tlb_inval_fence_timeout+0x65/0x220 [xe] <4>[ 336.424598] Code: 89 df 48 89 45 d0 49 8b 85 08 ff ff ff 48 8b 40 20 2e 2e 2e ff d0 49 8d 45 c0 48 89 c7 48 89 45 b8 e8 bf bd c7 e1 49 8b 45 b0 <48> 8b 30 4c 8d 78 80 48 8d 5e 80 49 8d 75 b0 48 89 75 c8 48 39 c6 <4>[ 336.424633] RSP: 0018:ffffc9000056fdb0 EFLAGS: 00010046 <4>[ 336.424649] RAX: ffffc90002cffa90 RBX: ffff88810ebe8458 RCX: 0000000000000000 <4>[ 336.424666] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000 <4>[ 336.424681] RBP: ffffc9000056fdf8 R08: 0000000000000000 R09: 0000000000000000 <4>[ 336.424697] R10: 0000000000000000 R11: 0000000000000000 R12: ffff88814b8d0000 <4>[ 336.424712] R13: ffff88810ebe8560 R14: ffff88810ebe8560 R15: ffff88810150dfc0 <4>[ 336.424727] FS: 0000000000000000(0000) GS:ffff8888daf9b000(0000) knlGS:0000000000000000 <4>[ 336.424747] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 336.424761] CR2: ffffc90002cffa90 CR3: 000000000344c000 CR4: 0000000000f52ef0 Oops#2 Part1 <4>[ 336.424778] PKRU: 55555554 <4>[ 336.424787] Call Trace: <4>[ 336.424796] <4>[ 336.424812] process_one_work+0x22e/0x740 <4>[ 336.424841] worker_thread+0x1e8/0x3d0 <4>[ 336.424856] ? __pfx_worker_thread+0x10/0x10 <4>[ 336.424871] kthread+0x10d/0x150 <4>[ 336.424889] ? __pfx_kthread+0x10/0x10 <4>[ 336.424907] ret_from_fork+0x3d4/0x480 <4>[ 336.424922] ? __pfx_kthread+0x10/0x10 <4>[ 336.424939] ret_from_fork_asm+0x1a/0x30 <4>[ 336.424966] <4>[ 336.424974] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mei_gsc mtd_intel_dg xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp hid_generic cmdlinepart coretemp spi_nor asus_nb_wmi mei_pxp mei_hdcp asus_wmi mtd sparse_keymap platform_profile wmi_bmof kvm_intel usbhid kvm hid irqbypass ghash_clmulni_intel aesni_intel rapl r8169 intel_cstate binfmt_misc snd_intel_dspcfg snd_hda_codec video snd_hda_core realtek snd_hwdep snd_pcm i2c_i801 snd_timer idma64 mei_me snd i2c_mux spi_intel_pci soundcore i2c_smbus spi_intel mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry intel_vsec acpi_tad acpi_pad wmi pinctrl_alderlake dm_multipath msr nvme_fabrics efi_pstore fuse nfnetlink <4>[ 336.425159] autofs4 [last unloaded: snd_hda_intel] <4>[ 336.425309] CR2: ffffc90002cffa90 <4>[ 336.425322] ---[ end trace 0000000000000000 ]---