Oops#2 Part9 <3>[ 558.006669] xe 0000:03:00.0: probe with driver xe failed with error -12 <4>[ 558.006681] WARNING: drivers/gpu/drm/xe/xe_ggtt.c:521 at ggtt_invalidate_gt_tlb.part.0+0x76/0xb0 [xe], CPU#13: kworker/13:15/9664 <4>[ 558.006789] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mei_gsc mtd_intel_dg xe drm_gpuvm drm_gpusvm_helper drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_buddy drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp hid_generic cmdlinepart coretemp asus_nb_wmi spi_nor asus_wmi mei_hdcp mei_pxp sparse_keymap mtd platform_profile wmi_bmof kvm_intel snd_hda_intel kvm snd_intel_dspcfg usbhid irqbypass ghash_clmulni_intel snd_hda_codec hid aesni_intel snd_hda_core binfmt_misc video snd_hwdep rapl r8169 intel_cstate snd_pcm realtek snd_timer i2c_i801 snd i2c_mux spi_intel_pci mei_me idma64 soundcore i2c_smbus spi_intel intel_pmc_core mei pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry pinctrl_alderlake acpi_pad wmi intel_vsec acpi_tad dm_multipath msr nvme_fabrics fuse <4>[ 558.006884] efi_pstore nfnetlink autofs4 <4>[ 558.006890] CPU: 13 UID: 0 PID: 9664 Comm: kworker/13:15 Tainted: G S U W 6.19.0-rc8-lgci-xe-xe-4493-ff449f153b0966cfd+ #1 PREEMPT(voluntary) <4>[ 558.006894] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN <4>[ 558.006896] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024 Oops#2 Part8 <4>[ 558.006898] Workqueue: xe-destroy-wq __guc_exec_queue_destroy_async [xe] <4>[ 558.007011] RIP: 0010:ggtt_invalidate_gt_tlb.part.0+0x81/0xb0 [xe] <4>[ 558.007121] Code: 48 8b 7f 08 4c 8b 77 50 4d 85 f6 75 03 4c 8b 37 e8 74 57 5e e1 48 89 c6 48 8d 3d 6a d3 3d 00 4d 89 e1 45 89 e8 89 d9 4c 89 f2 <67> 48 0f b9 3a 5b 41 5c 41 5d 41 5e 5d 31 c0 31 d2 31 c9 31 f6 31 <4>[ 558.007123] RSP: 0018:ffffc9000fe6fb08 EFLAGS: 00010246 <4>[ 558.007127] RAX: ffffffffa11f8635 RBX: 0000000000000000 RCX: 0000000000000000 <4>[ 558.007129] RDX: ffff888103ce5810 RSI: ffffffffa11f8635 RDI: ffffffffa1001fc0 <4>[ 558.007131] RBP: ffffc9000fe6fb28 R08: 0000000000000000 R09: ffffffffffffffed <4>[ 558.007133] R10: 0000000000000000 R11: 0000000000000000 R12: ffffffffffffffed <4>[ 558.007135] R13: 0000000000000000 R14: ffff888103ce5810 R15: 0000000000000000 <4>[ 558.007137] FS: 0000000000000000(0000) GS:ffff8888db35b000(0000) knlGS:0000000000000000 <4>[ 558.007139] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 558.007141] CR2: 000057ec60c44cb0 CR3: 0000000003448006 CR4: 0000000000f72ef0 <4>[ 558.007143] PKRU: 55555554 <4>[ 558.007145] Call Trace: <4>[ 558.007146] <4>[ 558.007151] ggtt_node_remove+0x110/0x140 [xe] <4>[ 558.007253] xe_ggtt_node_remove+0x40/0xa0 [xe] <3>[ 558.007314] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: GuC RC enable mode=0 failed: -ENODEV <4>[ 558.007463] xe_ggtt_remove_bo+0x87/0x250 [xe] <4>[ 558.007565] ? _raw_write_unlock+0x22/0x50 <4>[ 558.007569] ? drm_vma_offset_remove+0x65/0x80 <4>[ 558.007576] xe_ttm_bo_destroy+0xa2/0x2d0 [xe] Oops#2 Part7 <4>[ 558.007671] ? lock_is_held_type+0xa3/0x130 <4>[ 558.007678] ttm_bo_release+0x70/0x330 [ttm] <7>[ 558.007671] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled <4>[ 558.007687] ? xe_ggtt_might_lock+0x29/0x60 [xe] <4>[ 558.007787] ? lock_release+0xce/0x280 <4>[ 558.007794] ttm_bo_fini+0x3c/0x70 [ttm] <4>[ 558.007802] xe_gem_object_free+0x1a/0x30 [xe] <4>[ 558.007896] drm_gem_object_free+0x1d/0x40 <4>[ 558.007899] xe_bo_put+0x12a/0x190 [xe] <4>[ 558.007996] xe_lrc_destroy+0x47/0x60 [xe] <4>[ 558.008118] xe_exec_queue_fini+0x85/0xd0 [xe] <4>[ 558.008217] __guc_exec_queue_destroy_async+0x6c/0x170 [xe] <4>[ 558.008327] process_one_work+0x22e/0x6b0 <4>[ 558.008336] worker_thread+0x1e8/0x3d0 <4>[ 558.008340] ? __pfx_worker_thread+0x10/0x10 <4>[ 558.008343] kthread+0x11f/0x250 <4>[ 558.008349] ? __pfx_kthread+0x10/0x10 <4>[ 558.008353] ret_from_fork+0x344/0x3a0 <4>[ 558.008357] ? __pfx_kthread+0x10/0x10 <4>[ 558.008361] ret_from_fork_asm+0x1a/0x30 <4>[ 558.008373] <4>[ 558.008375] irq event stamp: 23365 <4>[ 558.008377] hardirqs last enabled at (23371): [] __up_console_sem+0x79/0xa0 <4>[ 558.008381] hardirqs last disabled at (23376): [] __up_console_sem+0x5e/0xa0 <4>[ 558.008383] softirqs last enabled at (23116): [] __irq_exit_rcu+0x13f/0x160 <4>[ 558.008387] softirqs last disabled at (23109): [] __irq_exit_rcu+0x13f/0x160 <4>[ 558.008390] ---[ end trace 0000000000000000 ]--- <7>[ 558.008767] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled Oops#2 Part6 <7>[ 558.083357] xe 0000:03:00.0: [drm:drm_pagemap_cache_fini [drm_gpusvm_helper]] Destroying dpagemap cache. <7>[ 558.086394] xe 0000:03:00.0: [drm:drm_pagemap_shrinker_fini [drm_gpusvm_helper]] Destroying dpagemap shrinker. <3>[ 560.316175] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=38 recv=0 <1>[ 560.317360] BUG: unable to handle page fault for address: ffffc9000838a188 <1>[ 560.317390] #PF: supervisor write access in kernel mode <1>[ 560.317406] #PF: error_code(0x0002) - not-present page <6>[ 560.317420] PGD 100000067 P4D 100000067 PUD 100aad067 PMD 0 <4>[ 560.317443] Oops: Oops: 0002 [#1] SMP NOPTI <4>[ 560.317463] CPU: 13 UID: 0 PID: 9662 Comm: kworker/13:13 Tainted: G S U W 6.19.0-rc8-lgci-xe-xe-4493-ff449f153b0966cfd+ #1 PREEMPT(voluntary) <4>[ 560.317498] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN <4>[ 560.317513] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024 <4>[ 560.317534] Workqueue: xe-destroy-wq __guc_exec_queue_destroy_async [xe] <4>[ 560.318031] RIP: 0010:xe_mmio_write32+0x58/0x280 [xe] <4>[ 560.318519] Code: 24 66 90 65 8b 05 6c 7a 2a e3 48 0f a3 05 10 a3 cd e2 0f 82 ee 00 00 00 41 f7 c5 00 00 00 01 0f 84 88 00 00 00 49 03 5c 24 08 <44> 89 3b 48 8d 65 d8 5b 41 5c 41 5d 41 5e 41 5f 5d 31 c0 31 d2 31 <4>[ 560.318555] RSP: 0018:ffffc9000fe5f830 EFLAGS: 00010086 <4>[ 560.318572] RAX: 0000000000000002 RBX: ffffc9000838a188 RCX: 0000000000000000 <4>[ 560.318590] RDX: 0000000000010001 RSI: 000000000000a188 RDI: ffff8883f29281c8 <4>[ 560.318607] RBP: ffffc9000fe5f8a8 R08: 0000000000000000 R09: 0000000000000000 Oops#2 Part5 <4>[ 560.318624] R10: ffff8882eb1f8000 R11: 0000000000000001 R12: ffff8883f29281c8 <4>[ 560.318640] R13: 000000000000a188 R14: ffff8882eb1f8000 R15: 0000000000010001 <4>[ 560.318656] FS: 0000000000000000(0000) GS:ffff8888db35b000(0000) knlGS:0000000000000000 <4>[ 560.318677] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 560.318691] CR2: ffffc9000838a188 CR3: 0000000003448006 CR4: 0000000000f72ef0 <4>[ 560.318707] PKRU: 55555554 <4>[ 560.318717] Call Trace: <4>[ 560.318727] <4>[ 560.318745] xe_force_wake_get+0x417/0x950 [xe] <4>[ 560.319161] ? _raw_spin_unlock_irqrestore+0x27/0x80 <4>[ 560.319190] send_tlb_inval_ggtt+0xfa/0x270 [xe] <4>[ 560.319645] ? trace_hardirqs_on+0x63/0xd0 <4>[ 560.319665] ? _raw_spin_unlock_irq+0x27/0x70 <4>[ 560.319681] ? xe_tlb_inval_fence_prep+0xbf/0x1a0 [xe] <4>[ 560.320197] xe_tlb_inval_ggtt+0x73/0x250 [xe] <4>[ 560.320701] ? find_held_lock+0x31/0x90 <4>[ 560.320718] ? ggtt_node_remove+0xc4/0x140 [xe] <4>[ 560.321136] ggtt_invalidate_gt_tlb.part.0+0x1f/0xb0 [xe] <4>[ 560.321546] ggtt_node_remove+0x122/0x140 [xe] <4>[ 560.321957] xe_ggtt_node_remove+0x40/0xa0 [xe] <4>[ 560.322366] xe_ggtt_remove_bo+0x87/0x250 [xe] <4>[ 560.322587] ? _raw_write_unlock+0x22/0x50 <4>[ 560.322591] ? drm_vma_offset_remove+0x65/0x80 <4>[ 560.322597] xe_ttm_bo_destroy+0xa2/0x2d0 [xe] <4>[ 560.322692] ? lock_is_held_type+0xa3/0x130 <4>[ 560.322698] ttm_bo_release+0x70/0x330 [ttm] <4>[ 560.322708] ? xe_ggtt_might_lock+0x29/0x60 [xe] <4>[ 560.322807] ? lock_release+0xce/0x280 <4>[ 560.322812] ttm_bo_fini+0x3c/0x70 [ttm] <4>[ 560.322820] xe_gem_object_free+0x1a/0x30 [xe] Oops#2 Part4 <4>[ 560.322915] drm_gem_object_free+0x1d/0x40 <4>[ 560.322919] xe_bo_put+0x12a/0x190 [xe] <4>[ 560.323014] xe_lrc_destroy+0x47/0x60 [xe] <4>[ 560.323126] xe_exec_queue_fini+0x85/0xd0 [xe] <4>[ 560.323225] __guc_exec_queue_destroy_async+0x6c/0x170 [xe] <4>[ 560.323333] process_one_work+0x22e/0x6b0 <4>[ 560.323339] worker_thread+0x1e8/0x3d0 <4>[ 560.323343] ? __pfx_worker_thread+0x10/0x10 <4>[ 560.323347] kthread+0x11f/0x250 <4>[ 560.323351] ? __pfx_kthread+0x10/0x10 <4>[ 560.323356] ret_from_fork+0x344/0x3a0 <4>[ 560.323360] ? __pfx_kthread+0x10/0x10 <4>[ 560.323364] ret_from_fork_asm+0x1a/0x30 <4>[ 560.323371] <4>[ 560.323373] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mei_gsc mtd_intel_dg xe drm_gpuvm drm_gpusvm_helper drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_buddy drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp hid_generic cmdlinepart coretemp asus_nb_wmi spi_nor asus_wmi mei_hdcp mei_pxp sparse_keymap mtd platform_profile wmi_bmof kvm_intel snd_hda_intel kvm snd_intel_dspcfg usbhid irqbypass ghash_clmulni_intel snd_hda_codec hid aesni_intel snd_hda_core binfmt_misc video snd_hwdep rapl r8169 intel_cstate snd_pcm realtek snd_timer i2c_i801 snd i2c_mux spi_intel_pci mei_me idma64 soundcore i2c_smbus spi_intel intel_pmc_core mei pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry pinctrl_alderlake acpi_pad wmi intel_vsec acpi_tad dm_multipath msr nvme_fabrics fuse Oops#2 Part3 <4>[ 560.323417] efi_pstore nfnetlink autofs4 <4>[ 560.323451] CR2: ffffc9000838a188 <4>[ 560.323455] ---[ end trace 0000000000000000 ]--- <4>[ 560.480061] RIP: 0010:xe_mmio_write32+0x58/0x280 [xe] <4>[ 560.480189] Code: 24 66 90 65 8b 05 6c 7a 2a e3 48 0f a3 05 10 a3 cd e2 0f 82 ee 00 00 00 41 f7 c5 00 00 00 01 0f 84 88 00 00 00 49 03 5c 24 08 <44> 89 3b 48 8d 65 d8 5b 41 5c 41 5d 41 5e 41 5f 5d 31 c0 31 d2 31 <4>[ 560.480197] RSP: 0018:ffffc9000fe5f830 EFLAGS: 00010086 <4>[ 560.480201] RAX: 0000000000000002 RBX: ffffc9000838a188 RCX: 0000000000000000 <4>[ 560.480205] RDX: 0000000000010001 RSI: 000000000000a188 RDI: ffff8883f29281c8 <4>[ 560.480209] RBP: ffffc9000fe5f8a8 R08: 0000000000000000 R09: 0000000000000000 <4>[ 560.480212] R10: ffff8882eb1f8000 R11: 0000000000000001 R12: ffff8883f29281c8 <4>[ 560.480216] R13: 000000000000a188 R14: ffff8882eb1f8000 R15: 0000000000010001 <4>[ 560.480220] FS: 0000000000000000(0000) GS:ffff8888db35b000(0000) knlGS:0000000000000000 <4>[ 560.480224] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 560.480228] CR2: ffffc9000838a188 CR3: 0000000003448006 CR4: 0000000000f72ef0 <4>[ 560.480231] PKRU: 55555554 <6>[ 560.480234] note: kworker/13:13[9662] exited with irqs disabled <6>[ 560.480258] note: kworker/13:13[9662] exited with preempt_count 1 <1>[ 562.618379] BUG: unable to handle page fault for address: ffffc9000fe5faa8 Oops#2 Part2 <1>[ 562.618409] #PF: supervisor read access in kernel mode <1>[ 562.618423] #PF: error_code(0x0000) - not-present page <6>[ 562.618434] PGD 100000067 P4D 100000067 PUD 100aad067 PMD 43a408067 PTE 0 <4>[ 562.618457] Oops: Oops: 0000 [#2] SMP NOPTI <4>[ 562.618474] CPU: 1 UID: 0 PID: 203 Comm: kworker/u64:5 Tainted: G S UD W 6.19.0-rc8-lgci-xe-xe-4493-ff449f153b0966cfd+ #1 PREEMPT(voluntary) <4>[ 562.618500] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [D]=DIE, [W]=WARN <4>[ 562.618511] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024 <4>[ 562.618525] Workqueue: gt-ordered-wq xe_tlb_inval_fence_timeout [xe] <4>[ 562.619062] RIP: 0010:xe_tlb_inval_fence_timeout+0x65/0x1d0 [xe] <4>[ 562.619564] Code: 89 df 48 89 45 d0 49 8b 85 08 ff ff ff 48 8b 40 20 2e 2e 2e ff d0 49 8d 45 c0 48 89 c7 48 89 45 c0 e8 1f c7 c2 e1 49 8b 45 b0 <48> 8b 30 4c 8d 78 b8 48 8d 5e b8 49 8d 75 b0 48 89 75 c8 48 39 c6 <4>[ 562.619600] RSP: 0018:ffffc900014c3db0 EFLAGS: 00010046 <4>[ 562.619616] RAX: ffffc9000fe5faa8 RBX: ffff8883f29285c0 RCX: 0000000000000000 <4>[ 562.619633] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000 <4>[ 562.619648] RBP: ffffc900014c3df0 R08: 0000000000000000 R09: 0000000000000000 <4>[ 562.619663] R10: 0000000000000000 R11: 0000000000000000 R12: ffff8882eb1f8000 <4>[ 562.619678] R13: ffff8883f29286c8 R14: ffff8883f29286c8 R15: ffff888105105cc0 <4>[ 562.619694] FS: 0000000000000000(0000) GS:ffff8888dad5b000(0000) knlGS:0000000000000000 <4>[ 562.619713] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 562.619727] CR2: ffffc9000fe5faa8 CR3: 0000000003448002 CR4: 0000000000f72ef0 Oops#2 Part1 <4>[ 562.619743] PKRU: 55555554 <4>[ 562.619753] Call Trace: <4>[ 562.619762] <4>[ 562.619777] process_one_work+0x22e/0x6b0 <4>[ 562.619805] worker_thread+0x1e8/0x3d0 <4>[ 562.619821] ? __pfx_worker_thread+0x10/0x10 <4>[ 562.619837] kthread+0x11f/0x250 <4>[ 562.619855] ? __pfx_kthread+0x10/0x10 <4>[ 562.619873] ret_from_fork+0x344/0x3a0 <4>[ 562.619889] ? __pfx_kthread+0x10/0x10 <4>[ 562.619905] ret_from_fork_asm+0x1a/0x30 <4>[ 562.619935] <4>[ 562.619943] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mei_gsc mtd_intel_dg xe drm_gpuvm drm_gpusvm_helper drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_buddy drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp hid_generic cmdlinepart coretemp asus_nb_wmi spi_nor asus_wmi mei_hdcp mei_pxp sparse_keymap mtd platform_profile wmi_bmof kvm_intel snd_hda_intel kvm snd_intel_dspcfg usbhid irqbypass ghash_clmulni_intel snd_hda_codec hid aesni_intel snd_hda_core binfmt_misc video snd_hwdep rapl r8169 intel_cstate snd_pcm realtek snd_timer i2c_i801 snd i2c_mux spi_intel_pci mei_me idma64 soundcore i2c_smbus spi_intel intel_pmc_core mei pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry pinctrl_alderlake acpi_pad wmi intel_vsec acpi_tad dm_multipath msr nvme_fabrics fuse <4>[ 562.620125] efi_pstore nfnetlink autofs4 <4>[ 562.620274] CR2: ffffc9000fe5faa8 <4>[ 562.620287] ---[ end trace 0000000000000000 ]---