Oops#2 Part15 <7>[ 672.188232] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x7300] = 0x10001000 <7>[ 672.188300] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x83a8] = 0x20002000 <7>[ 672.188375] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6210] = ~0x3f18000|0x3f18000 (MCR) <7>[ 672.190008] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC bcs0 WA job: 27 dwords <7>[ 672.190079] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x22204] = ~0x7e7e|0x606 <7>[ 672.190141] xe 0000:03:00.0: [drm:xe_lrc_emit_hwe_state_instructions [xe]] Tile0: GT0: No non-register state to emit on graphics ver 20.01 <7>[ 672.191787] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC ccs0 WA job: 0 dwords <7>[ 672.191862] xe 0000:03:00.0: [drm:xe_lrc_emit_hwe_state_instructions [xe]] Tile0: GT0: No non-register state to emit on graphics ver 20.01 <5>[ 672.193584] FAULT_INJECTION: forcing a failure. <5>[ 672.193584] name fail_function, interval 0, probability 100, space 1, times 100 <3>[ 672.193604] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: GuC PC query task state failed: -ENOMEM <4>[ 672.193741] ------------[ cut here ]------------ <4>[ 672.193742] xe 0000:03:00.0: [drm] Assertion `ct->g2h_outstanding == 0 || state == XE_GUC_CT_STATE_STOPPED` failed! <4>[ 672.193742] platform: BATTLEMAGE subplatform: 7 <4>[ 672.193742] graphics: Xe2_HPG 20.01 step A0 <4>[ 672.193742] media: Xe2_HPM 13.01 step A1 <4>[ 672.193742] tile: 0 VRAM 12.0 GiB <4>[ 672.193742] GT: 0 type 1 Oops#2 Part14 <4>[ 672.193745] WARNING: drivers/gpu/drm/xe/xe_guc_ct.c:527 at guc_ct_change_state+0x279/0x350 [xe], CPU#10: xe_fault_inject/13257 <4>[ 672.193823] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling cmdlinepart x86_pkg_temp_thermal intel_powerclamp spi_nor hid_generic coretemp mtd asus_nb_wmi asus_wmi sparse_keymap mei_pxp mei_hdcp platform_profile wmi_bmof kvm_intel kvm irqbypass ghash_clmulni_intel aesni_intel r8169 usbhid rapl hid intel_cstate spi_intel_pci binfmt_misc realtek spi_intel idma64 video snd_intel_dspcfg snd_hda_codec snd_hda_core snd_hwdep snd_pcm intel_pmc_core snd_timer i2c_i801 pmt_telemetry nls_iso8859_1 i2c_mux snd pmt_discovery mei_me pmt_class i2c_smbus soundcore mei intel_pmc_ssram_telemetry wmi acpi_pad acpi_tad intel_vsec pinctrl_alderlake dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink <4>[ 672.193878] autofs4 [last unloaded: snd_hda_intel] <4>[ 672.193882] CPU: 10 UID: 0 PID: 13257 Comm: xe_fault_inject Tainted: G S U W L 7.0.0-rc1-lgci-xe-xe-4617-3b1923ab37ecd72e1-debug+ #1 PREEMPT(lazy) <4>[ 672.193885] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN, [L]=SOFTLOCKUP Oops#2 Part13 <4>[ 672.193886] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 0812 02/24/2023 <4>[ 672.193887] RIP: 0010:guc_ct_change_state+0x2ed/0x350 [xe] <4>[ 672.193958] Code: 1f 85 eb 51 48 c1 ea 25 44 6b ca 64 44 29 c9 51 48 c7 c1 c0 55 18 a1 52 ff 75 b0 44 8b 4d 94 4c 8b 45 88 48 8b 95 78 ff ff ff <67> 48 0f b9 3a 8b 8b 48 01 00 00 48 83 c4 60 85 c9 75 13 44 89 bb <4>[ 672.193960] RSP: 0018:ffffc90004eef3f8 EFLAGS: 00010002 <4>[ 672.193962] RAX: ffffffffa11fa36f RBX: ffff88828d908738 RCX: ffffffffa11855c0 <4>[ 672.193963] RDX: ffff88811052a810 RSI: ffffffffa11fa36f RDI: ffffffffa1002ee0 <4>[ 672.193964] RBP: ffffc90004eef4e0 R08: ffffffffa11fa3bf R09: 0000000000000007 <4>[ 672.193965] R10: 0000000000000001 R11: 0000000000000514 R12: ffff88828d908740 <4>[ 672.193966] R13: ffff88828d9087d0 R14: 0000000000000515 R15: 0000000000000001 <4>[ 672.193967] FS: 00007a764a69d980(0000) GS:ffff8888db19b000(0000) knlGS:0000000000000000 <4>[ 672.193969] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 672.193970] CR2: 0000585570191200 CR3: 0000000166d1d000 CR4: 0000000000f52ef0 <4>[ 672.193971] PKRU: 55555554 <4>[ 672.193972] Call Trace: <4>[ 672.193973] <4>[ 672.193980] ? xe_guc_submit_enable+0xa8/0xf0 [xe] <4>[ 672.194058] xe_guc_ct_disable+0x17/0x80 [xe] <4>[ 672.194128] xe_guc_sanitize+0x2a/0x50 [xe] <4>[ 672.194197] xe_uc_load_hw+0x19a/0x2b0 [xe] <4>[ 672.194296] ? xe_migrate_init+0x277/0x2d0 [xe] <4>[ 672.194376] xe_gt_init+0x35d/0xab0 [xe] <4>[ 672.194440] ? _raw_spin_unlock_irqrestore+0x51/0x80 <4>[ 672.194445] ? __devm_add_action+0x70/0xa0 Oops#2 Part12 <4>[ 672.194449] ? xe_irq_install+0x11a/0x490 [xe] <4>[ 672.194528] xe_device_probe+0x3c5/0xc10 [xe] <4>[ 672.194587] ? __drm_dev_dbg+0x7d/0xb0 <4>[ 672.194591] ? __drmm_add_action_or_reset+0x1e/0x50 <4>[ 672.194596] xe_pci_probe+0x396/0x610 [xe] <4>[ 672.194676] ? trace_hardirqs_on+0x22/0x100 <4>[ 672.194683] local_pci_probe+0x47/0xb0 <4>[ 672.194687] pci_call_probe+0x6c/0x360 <4>[ 672.194692] ? _raw_spin_unlock+0x22/0x50 <4>[ 672.194696] pci_device_probe+0xae/0x110 <4>[ 672.194699] really_probe+0xf1/0x410 <4>[ 672.194703] __driver_probe_device+0x8c/0x190 <4>[ 672.194706] device_driver_attach+0x57/0xd0 <4>[ 672.194708] bind_store+0x142/0x150 <4>[ 672.194712] drv_attr_store+0x24/0x50 <4>[ 672.194714] sysfs_kf_write+0x4d/0x80 <4>[ 672.194719] kernfs_fop_write_iter+0x188/0x240 <4>[ 672.194722] vfs_write+0x283/0x540 <4>[ 672.194730] ksys_write+0x6f/0xf0 <4>[ 672.194735] __x64_sys_write+0x19/0x30 <4>[ 672.194737] x64_sys_call+0x259/0x26e0 <4>[ 672.194741] do_syscall_64+0xdd/0x1470 <4>[ 672.194745] ? __pcs_replace_full_main+0x29a/0x660 <4>[ 672.194750] ? putname+0x41/0x90 <4>[ 672.194753] ? kmem_cache_free+0x165/0x510 <4>[ 672.194758] ? putname+0x41/0x90 <4>[ 672.194761] ? do_sys_openat2+0x85/0xd0 <4>[ 672.194765] ? __x64_sys_openat+0x54/0xa0 <4>[ 672.194768] ? trace_hardirqs_on_prepare+0xe1/0x100 <4>[ 672.194771] ? do_syscall_64+0x22e/0x1470 <4>[ 672.194776] ? putname+0x41/0x90 <4>[ 672.194779] ? do_sys_openat2+0x85/0xd0 <4>[ 672.194783] ? __x64_sys_openat+0x54/0xa0 <4>[ 672.194785] ? trace_hardirqs_on_prepare+0xe1/0x100 <4>[ 672.194789] ? do_syscall_64+0x22e/0x1470 Oops#2 Part11 <4>[ 672.194792] ? __pcs_replace_full_main+0x29a/0x660 <4>[ 672.194796] ? putname+0x41/0x90 <4>[ 672.194799] ? kmem_cache_free+0x165/0x510 <4>[ 672.194804] ? putname+0x41/0x90 <4>[ 672.194807] ? do_sys_openat2+0x85/0xd0 <4>[ 672.194811] ? __x64_sys_openat+0x54/0xa0 <4>[ 672.194813] ? trace_hardirqs_on_prepare+0xe1/0x100 <4>[ 672.194816] ? do_syscall_64+0x22e/0x1470 <4>[ 672.194819] ? do_syscall_64+0x22e/0x1470 <4>[ 672.194821] ? exc_page_fault+0xbd/0x2c0 <4>[ 672.194825] entry_SYSCALL_64_after_hwframe+0x76/0x7e <4>[ 672.194827] RIP: 0033:0x7a764c91c5a4 <4>[ 672.194830] Code: c7 00 16 00 00 00 b8 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 f3 0f 1e fa 80 3d a5 ea 0e 00 00 74 13 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 54 c3 0f 1f 00 55 48 89 e5 48 83 ec 20 48 89 <4>[ 672.194831] RSP: 002b:00007ffd065f57f8 EFLAGS: 00000202 ORIG_RAX: 0000000000000001 <4>[ 672.194833] RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 00007a764c91c5a4 <4>[ 672.194835] RDX: 000000000000000c RSI: 00007ffd065f5cc0 RDI: 0000000000000007 <4>[ 672.194836] RBP: 000000000000000c R08: 0000000000000073 R09: 0000000000000000 <4>[ 672.194837] R10: 0000000000000000 R11: 0000000000000202 R12: 00007ffd065f5cc0 <4>[ 672.194838] R13: 0000000000000007 R14: 0000000000000006 R15: 00007ffd065f5970 <4>[ 672.194846] <4>[ 672.194847] irq event stamp: 997374 <4>[ 672.194848] hardirqs last enabled at (997373): [] irqentry_exit+0x6a/0x7c0 <4>[ 672.194851] hardirqs last disabled at (997374): [] _raw_spin_lock_irq+0x6f/0x80 <4>[ 672.194853] softirqs last enabled at (997372): [] __irq_exit_rcu+0x13f/0x160 Oops#2 Part10 <4>[ 672.194856] softirqs last disabled at (997367): [] __irq_exit_rcu+0x13f/0x160 <4>[ 672.194858] ---[ end trace 0000000000000000 ]--- <7>[ 672.194860] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled <4>[ 672.194950] ------------[ cut here ]------------ <4>[ 672.194957] xe 0000:03:00.0: [drm] Tile0: GT0: Failed to invalidate GGTT (-ENODEV) <3>[ 672.194962] xe 0000:03:00.0: probe with driver xe failed with error -12 <4>[ 672.194959] WARNING: drivers/gpu/drm/xe/xe_ggtt.c:576 at ggtt_invalidate_gt_tlb.part.0+0x76/0xb0 [xe], CPU#9: kworker/9:4/10787 <4>[ 672.195028] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling cmdlinepart x86_pkg_temp_thermal intel_powerclamp spi_nor hid_generic coretemp mtd asus_nb_wmi asus_wmi sparse_keymap mei_pxp mei_hdcp platform_profile wmi_bmof kvm_intel kvm irqbypass ghash_clmulni_intel aesni_intel r8169 usbhid rapl hid intel_cstate spi_intel_pci binfmt_misc realtek spi_intel idma64 video snd_intel_dspcfg snd_hda_codec snd_hda_core snd_hwdep snd_pcm intel_pmc_core snd_timer i2c_i801 pmt_telemetry nls_iso8859_1 i2c_mux snd pmt_discovery mei_me pmt_class i2c_smbus soundcore mei intel_pmc_ssram_telemetry wmi acpi_pad acpi_tad intel_vsec pinctrl_alderlake dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink Oops#2 Part9 <4>[ 672.195098] autofs4 [last unloaded: snd_hda_intel] <4>[ 672.195102] CPU: 9 UID: 0 PID: 10787 Comm: kworker/9:4 Tainted: G S U W L 7.0.0-rc1-lgci-xe-xe-4617-3b1923ab37ecd72e1-debug+ #1 PREEMPT(lazy) <4>[ 672.195105] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN, [L]=SOFTLOCKUP <4>[ 672.195107] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 0812 02/24/2023 <4>[ 672.195108] Workqueue: xe-destroy-wq __guc_exec_queue_destroy_async [xe] <4>[ 672.195182] RIP: 0010:ggtt_invalidate_gt_tlb.part.0+0x81/0xb0 [xe] <4>[ 672.195245] Code: 48 8b 7f 08 4c 8b 77 50 4d 85 f6 75 03 4c 8b 37 e8 24 76 62 e1 48 89 c6 48 8d 3d 7a d0 3d 00 4d 89 e1 45 89 e8 89 d9 4c 89 f2 <67> 48 0f b9 3a 5b 41 5c 41 5d 41 5e 5d 31 c0 31 d2 31 c9 31 f6 31 Oops#2 Part8 <4>[ 672.195246] RSP: 0018:ffffc9000499fb10 EFLAGS: 00010246 <4>[ 672.195249] RAX: ffffffffa11fa36f RBX: 0000000000000000 RCX: 0000000000000000 <4>[ 672.195250] RDX: ffff88811052a810 RSI: ffffffffa11fa36f RDI: ffffffffa1001fc0 <4>[ 672.195251] RBP: ffffc9000499fb30 R08: 0000000000000000 R09: ffffffffffffffed <4>[ 672.195253] R10: 0000000000000000 R11: 0000000000000000 R12: ffffffffffffffed <4>[ 672.195254] R13: 0000000000000000 R14: ffff88811052a810 R15: 0000000000000000 <4>[ 672.195255] FS: 0000000000000000(0000) GS:ffff8888db11b000(0000) knlGS:0000000000000000 <4>[ 672.195256] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 672.195258] CR2: 000076719c032000 CR3: 000000000344a000 CR4: 0000000000f52ef0 <4>[ 672.195259] PKRU: 55555554 <4>[ 672.195260] Call Trace: <4>[ 672.195261] <4>[ 672.195264] ggtt_node_remove+0x11a/0x140 [xe] <4>[ 672.195326] xe_ggtt_node_remove+0x40/0xa0 [xe] <4>[ 672.195387] xe_ggtt_remove_bo+0x87/0x250 [xe] <4>[ 672.195447] ? _raw_write_unlock+0x22/0x50 <4>[ 672.195451] ? drm_vma_offset_remove+0x65/0x80 <4>[ 672.195456] xe_ttm_bo_destroy+0xa2/0x2d0 [xe] <3>[ 672.195506] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: GuC RC setup HOST_CONTROL(0) failed (-ENODEV) <4>[ 672.195578] ? lock_is_held_type+0xa3/0x130 <4>[ 672.195584] ttm_bo_release+0x70/0x330 [ttm] <4>[ 672.195590] ? xe_ggtt_might_lock+0x29/0x60 [xe] <4>[ 672.195650] ? lock_release+0xd0/0x2b0 <4>[ 672.195655] ttm_bo_fini+0x3c/0x70 [ttm] Oops#2 Part7 <4>[ 672.195661] xe_gem_object_free+0x1a/0x30 [xe] <4>[ 672.195714] drm_gem_object_free+0x1d/0x40 <4>[ 672.195718] xe_bo_put+0x12a/0x190 [xe] <7>[ 672.195776] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled <4>[ 672.195796] xe_lrc_destroy+0x47/0x60 [xe] <4>[ 672.195870] xe_exec_queue_fini+0x85/0xd0 [xe] <4>[ 672.195928] __guc_exec_queue_destroy_async+0x6c/0x1a0 [xe] <4>[ 672.195996] process_one_work+0x22e/0x740 <4>[ 672.196004] worker_thread+0x1e8/0x3d0 <4>[ 672.196007] ? __pfx_worker_thread+0x10/0x10 <4>[ 672.196010] kthread+0x10d/0x150 <4>[ 672.196013] ? __pfx_kthread+0x10/0x10 <4>[ 672.196016] ret_from_fork+0x3d4/0x480 <4>[ 672.196020] ? __pfx_kthread+0x10/0x10 <4>[ 672.196023] ret_from_fork_asm+0x1a/0x30 <4>[ 672.196030] <4>[ 672.196031] irq event stamp: 6611 <4>[ 672.196032] hardirqs last enabled at (6617): [] __up_console_sem+0x79/0xa0 <4>[ 672.196035] hardirqs last disabled at (6622): [] __up_console_sem+0x5e/0xa0 <4>[ 672.196037] softirqs last enabled at (4782): [] __irq_exit_rcu+0x13f/0x160 <4>[ 672.196039] softirqs last disabled at (4481): [] __irq_exit_rcu+0x13f/0x160 <4>[ 672.196041] ---[ end trace 0000000000000000 ]--- <7>[ 672.197113] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled <7>[ 672.279749] xe 0000:03:00.0: [drm:drm_pagemap_cache_fini [drm_gpusvm_helper]] Destroying dpagemap cache. <7>[ 672.279858] xe 0000:03:00.0: [drm:drm_pagemap_shrinker_fini [drm_gpusvm_helper]] Destroying dpagemap shrinker. Oops#2 Part6 <3>[ 674.455492] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=38 recv=0 <1>[ 674.458035] BUG: unable to handle page fault for address: ffffc9000c38a188 <1>[ 674.458055] #PF: supervisor write access in kernel mode <1>[ 674.458067] #PF: error_code(0x0002) - not-present page <6>[ 674.458080] PGD 100000067 P4D 100000067 PUD 100ac7067 PMD 0 <4>[ 674.458101] Oops: Oops: 0002 [#1] SMP NOPTI <4>[ 674.458118] CPU: 9 UID: 0 PID: 104 Comm: kworker/9:0 Tainted: G S U W L 7.0.0-rc1-lgci-xe-xe-4617-3b1923ab37ecd72e1-debug+ #1 PREEMPT(lazy) <4>[ 674.458147] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN, [L]=SOFTLOCKUP <4>[ 674.458160] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 0812 02/24/2023 <4>[ 674.458175] Workqueue: xe-destroy-wq __guc_exec_queue_destroy_async [xe] <4>[ 674.458583] RIP: 0010:xe_mmio_write32+0x58/0x2b0 [xe] <4>[ 674.458991] Code: 24 66 90 65 8b 05 dc 64 2e e3 48 0f a3 05 80 c4 d0 e2 0f 82 1d 01 00 00 41 f7 c5 00 00 00 01 0f 84 b7 00 00 00 49 03 5c 24 08 <44> 89 3b 48 8d 65 d8 5b 41 5c 41 5d 41 5e 41 5f 5d 31 c0 31 d2 31 <4>[ 674.459026] RSP: 0018:ffffc900004df800 EFLAGS: 00010086 <4>[ 674.459033] RAX: 0000000000000002 RBX: ffffc9000c38a188 RCX: 0000000000000000 <4>[ 674.459036] RDX: 0000000000010001 RSI: 000000000000a188 RDI: ffff888215b10060 <4>[ 674.459041] RBP: ffffc900004df878 R08: 0000000000000000 R09: 0000000000000000 <4>[ 674.459043] R10: ffff888244e60000 R11: 0000000000000001 R12: ffff888215b10060 <4>[ 674.459046] R13: 000000000000a188 R14: ffff888244e60000 R15: 0000000000010001 Oops#2 Part5 <4>[ 674.459049] FS: 0000000000000000(0000) GS:ffff8888db11b000(0000) knlGS:0000000000000000 <4>[ 674.459052] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 674.459055] CR2: ffffc9000c38a188 CR3: 000000000344a000 CR4: 0000000000f52ef0 <4>[ 674.459063] PKRU: 55555554 <4>[ 674.459065] Call Trace: <4>[ 674.459066] <4>[ 674.459070] xe_force_wake_get+0x2a5/0x940 [xe] <4>[ 674.459141] ? _raw_spin_unlock_irqrestore+0x27/0x80 <4>[ 674.459146] ? mark_held_locks+0x46/0x90 <4>[ 674.459151] send_tlb_inval_ggtt+0xfa/0x270 [xe] <4>[ 674.459225] ? trace_hardirqs_on+0x22/0x100 <4>[ 674.459229] ? _raw_spin_unlock_irq+0x27/0x70 <4>[ 674.459232] ? xe_tlb_inval_fence_prep+0xce/0x1e0 [xe] <4>[ 674.459317] xe_tlb_inval_ggtt+0x73/0x250 [xe] <4>[ 674.459397] ? xelpg_ggtt_pte_flags+0x27/0x1a0 [xe] <4>[ 674.459464] ? find_held_lock+0x31/0x90 <4>[ 674.459467] ? ggtt_node_remove+0xcb/0x140 [xe] <4>[ 674.459536] ggtt_invalidate_gt_tlb.part.0+0x1f/0xb0 [xe] <4>[ 674.459603] ggtt_node_remove+0x12c/0x140 [xe] <4>[ 674.459670] xe_ggtt_node_remove+0x40/0xa0 [xe] <4>[ 674.459737] xe_ggtt_remove_bo+0x87/0x250 [xe] <4>[ 674.459805] ? _raw_write_unlock+0x22/0x50 <4>[ 674.459808] ? drm_vma_offset_remove+0x65/0x80 <4>[ 674.459812] xe_ttm_bo_destroy+0xa2/0x2d0 [xe] <4>[ 674.459878] ? lock_is_held_type+0xa3/0x130 <4>[ 674.459883] ttm_bo_release+0x70/0x330 [ttm] <4>[ 674.459890] ? xe_ggtt_might_lock+0x29/0x60 [xe] <4>[ 674.459957] ? lock_release+0xd0/0x2b0 <4>[ 674.459960] ttm_bo_fini+0x3c/0x70 [ttm] <4>[ 674.459966] xe_gem_object_free+0x1a/0x30 [xe] <4>[ 674.460034] drm_gem_object_free+0x1d/0x40 Oops#2 Part4 <4>[ 674.460037] xe_bo_put+0x12a/0x190 [xe] <4>[ 674.460103] xe_lrc_destroy+0x47/0x60 [xe] <4>[ 674.460176] xe_exec_queue_fini+0x85/0xd0 [xe] <4>[ 674.460243] __guc_exec_queue_destroy_async+0x6c/0x1a0 [xe] <4>[ 674.460313] process_one_work+0x22e/0x740 <4>[ 674.460318] worker_thread+0x1e8/0x3d0 <4>[ 674.460321] ? __pfx_worker_thread+0x10/0x10 <4>[ 674.460325] kthread+0x10d/0x150 <4>[ 674.460328] ? __pfx_kthread+0x10/0x10 <4>[ 674.460331] ret_from_fork+0x3d4/0x480 <4>[ 674.460335] ? __pfx_kthread+0x10/0x10 <4>[ 674.460338] ret_from_fork_asm+0x1a/0x30 <4>[ 674.460343] <4>[ 674.460344] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling cmdlinepart x86_pkg_temp_thermal intel_powerclamp spi_nor hid_generic coretemp mtd asus_nb_wmi asus_wmi sparse_keymap mei_pxp mei_hdcp platform_profile wmi_bmof kvm_intel kvm irqbypass ghash_clmulni_intel aesni_intel r8169 usbhid rapl hid intel_cstate spi_intel_pci binfmt_misc realtek spi_intel idma64 video snd_intel_dspcfg snd_hda_codec snd_hda_core snd_hwdep snd_pcm intel_pmc_core snd_timer i2c_i801 pmt_telemetry nls_iso8859_1 i2c_mux snd pmt_discovery mei_me pmt_class i2c_smbus soundcore mei intel_pmc_ssram_telemetry wmi acpi_pad acpi_tad intel_vsec pinctrl_alderlake dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink Oops#2 Part3 <4>[ 674.460376] autofs4 [last unloaded: snd_hda_intel] <4>[ 674.460403] CR2: ffffc9000c38a188 <4>[ 674.460405] ---[ end trace 0000000000000000 ]--- <4>[ 674.603292] RIP: 0010:xe_mmio_write32+0x58/0x2b0 [xe] <4>[ 674.603385] Code: 24 66 90 65 8b 05 dc 64 2e e3 48 0f a3 05 80 c4 d0 e2 0f 82 1d 01 00 00 41 f7 c5 00 00 00 01 0f 84 b7 00 00 00 49 03 5c 24 08 <44> 89 3b 48 8d 65 d8 5b 41 5c 41 5d 41 5e 41 5f 5d 31 c0 31 d2 31 <4>[ 674.603392] RSP: 0018:ffffc900004df800 EFLAGS: 00010086 <4>[ 674.603395] RAX: 0000000000000002 RBX: ffffc9000c38a188 RCX: 0000000000000000 <4>[ 674.603398] RDX: 0000000000010001 RSI: 000000000000a188 RDI: ffff888215b10060 <4>[ 674.603400] RBP: ffffc900004df878 R08: 0000000000000000 R09: 0000000000000000 <4>[ 674.603403] R10: ffff888244e60000 R11: 0000000000000001 R12: ffff888215b10060 <4>[ 674.603406] R13: 000000000000a188 R14: ffff888244e60000 R15: 0000000000010001 <4>[ 674.603408] FS: 0000000000000000(0000) GS:ffff8888db11b000(0000) knlGS:0000000000000000 <4>[ 674.603412] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 674.603414] CR2: ffffc9000c38a188 CR3: 000000000344a000 CR4: 0000000000f52ef0 <4>[ 674.603417] PKRU: 55555554 <6>[ 674.603419] note: kworker/9:0[104] exited with irqs disabled <6>[ 674.603427] note: kworker/9:0[104] exited with preempt_count 1 <3>[ 676.758642] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=-2125596358 recv=0 <1>[ 676.758684] BUG: unable to handle page fault for address: 00000000004dfbd8 <1>[ 676.758698] #PF: supervisor read access in kernel mode <1>[ 676.758710] #PF: error_code(0x0000) - not-present page <6>[ 676.758721] PGD 0 P4D 0 Oops#2 Part2 <4>[ 676.758734] Oops: Oops: 0000 [#2] SMP NOPTI <4>[ 676.758751] CPU: 4 UID: 0 PID: 979 Comm: kworker/u64:13 Tainted: G S UD W L 7.0.0-rc1-lgci-xe-xe-4617-3b1923ab37ecd72e1-debug+ #1 PREEMPT(lazy) <4>[ 676.758778] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [D]=DIE, [W]=WARN, [L]=SOFTLOCKUP <4>[ 676.758792] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 0812 02/24/2023 <4>[ 676.758807] Workqueue: gt-ordered-wq xe_tlb_inval_fence_timeout [xe] <4>[ 676.759363] RIP: 0010:__list_del_entry_valid_or_report+0x3b/0x120 <4>[ 676.759389] Code: 6f 08 4d 85 e4 74 50 4d 85 ed 74 5e 48 b8 00 01 00 00 00 00 ad de 49 39 c4 74 62 48 b8 22 01 00 00 00 00 ad de 49 39 c5 74 71 <49> 39 7d 00 0f 85 85 00 00 00 49 39 7c 24 08 0f 85 9f 00 00 00 b8 <4>[ 676.759424] RSP: 0018:ffffc90001abbd58 EFLAGS: 00010003 <4>[ 676.759441] RAX: dead000000000122 RBX: ffffc900004dfab0 RCX: 0000000000000000 <4>[ 676.759457] RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffffc900004dfab0 <4>[ 676.759473] RBP: ffffc90001abbd70 R08: 0000000000000000 R09: 0000000000000000 <4>[ 676.759488] R10: 0000000000000000 R11: 0000000000000000 R12: ffffc900004dfb40 <4>[ 676.759503] R13: 00000000004dfbd8 R14: ffffffff814e0094 R15: ffffc900004dfac0 <4>[ 676.759519] FS: 0000000000000000(0000) GS:ffff8888dae9b000(0000) knlGS:0000000000000000 <4>[ 676.759539] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 676.759553] CR2: 00000000004dfbd8 CR3: 000000000344a000 CR4: 0000000000f52ef0 <4>[ 676.759569] PKRU: 55555554 <4>[ 676.759578] Call Trace: <4>[ 676.759588] <4>[ 676.759600] ? lock_acquire+0x2b3/0x2f0 <4>[ 676.759622] xe_tlb_inval_fence_signal+0x40/0x200 [xe] Oops#2 Part1 <4>[ 676.760121] ? call_rcu+0x34/0x50 <4>[ 676.760141] xe_tlb_inval_fence_timeout+0xb9/0x220 [xe] <4>[ 676.760613] process_one_work+0x22e/0x740 <4>[ 676.760643] worker_thread+0x1e8/0x3d0 <4>[ 676.760662] ? __pfx_worker_thread+0x10/0x10 <4>[ 676.760681] kthread+0x10d/0x150 <4>[ 676.760698] ? __pfx_kthread+0x10/0x10 <4>[ 676.760716] ret_from_fork+0x3d4/0x480 <4>[ 676.760736] ? __pfx_kthread+0x10/0x10 <4>[ 676.760753] ret_from_fork_asm+0x1a/0x30 <4>[ 676.760779] <4>[ 676.760787] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling cmdlinepart x86_pkg_temp_thermal intel_powerclamp spi_nor hid_generic coretemp mtd asus_nb_wmi asus_wmi sparse_keymap mei_pxp mei_hdcp platform_profile wmi_bmof kvm_intel kvm irqbypass ghash_clmulni_intel aesni_intel r8169 usbhid rapl hid intel_cstate spi_intel_pci binfmt_misc realtek spi_intel idma64 video snd_intel_dspcfg snd_hda_codec snd_hda_core snd_hwdep snd_pcm intel_pmc_core snd_timer i2c_i801 pmt_telemetry nls_iso8859_1 i2c_mux snd pmt_discovery mei_me pmt_class i2c_smbus soundcore mei intel_pmc_ssram_telemetry wmi acpi_pad acpi_tad intel_vsec pinctrl_alderlake dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink <4>[ 676.760972] autofs4 [last unloaded: snd_hda_intel] <4>[ 676.761121] CR2: 00000000004dfbd8 <4>[ 676.761134] ---[ end trace 0000000000000000 ]---