Oops#2 Part15 <7>[ 157.531110] xe 0000:03:00.0: [drm:xe_guc_db_mgr_init [xe]] Tile0: GT0: using 256 doorbells <7>[ 157.532135] xe 0000:03:00.0: [drm:guc_buf_cache_init [xe]] Tile0: GT0: reusable buffer with 2097152 dwords at 0xe8c000 for xe_guc_buf_cache_init_with_size [xe] <7>[ 157.533055] xe 0000:03:00.0: [drm:xe_migrate_init [xe]] Migrate min chunk size is 0x00010000 <7>[ 157.534037] xe 0000:03:00.0: [drm:xe_guc_capture_steered_list_init [xe]] Tile0: GT0: capture found 120 ext-regs. <7>[ 157.555706] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152) <7>[ 157.566789] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 10ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034 <7>[ 157.567071] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled <7>[ 157.567780] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC rcs0 WA job: 4146 dwords <7>[ 157.567857] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x5588] = 0x04000400 <7>[ 157.567920] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6204] = 0x00400040 <7>[ 157.567980] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6208] = 0x00200020 <7>[ 157.568041] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x62a8] = 0x02400240 <7>[ 157.568102] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x7010] = 0x40004000 <7>[ 157.568162] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x7044] = 0x04200420 Oops#2 Part14 <7>[ 157.568221] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x7300] = 0x10001000 <7>[ 157.568284] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x83a8] = 0x20002000 <7>[ 157.568350] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6210] = ~0x3f18000|0x3f18000 (MCR) <7>[ 157.570256] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC bcs0 WA job: 27 dwords <7>[ 157.570334] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x22204] = ~0x7e7e|0x606 <7>[ 157.570408] xe 0000:03:00.0: [drm:xe_lrc_emit_hwe_state_instructions [xe]] Tile0: GT0: No non-register state to emit on graphics ver 20.01 <7>[ 157.571853] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC ccs0 WA job: 0 dwords <7>[ 157.571923] xe 0000:03:00.0: [drm:xe_lrc_emit_hwe_state_instructions [xe]] Tile0: GT0: No non-register state to emit on graphics ver 20.01 <5>[ 157.573566] FAULT_INJECTION: forcing a failure. <5>[ 157.573566] name fail_function, interval 0, probability 100, space 1, times 100 <3>[ 157.573574] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: GuC PC query task state failed: -ENOMEM <4>[ 157.574198] ------------[ cut here ]------------ <4>[ 157.574200] xe 0000:03:00.0: [drm] Assertion `ct->g2h_outstanding == 0 || state == XE_GUC_CT_STATE_STOPPED` failed! <4>[ 157.574200] platform: BATTLEMAGE subplatform: 7 <4>[ 157.574200] graphics: Xe2_HPG 20.01 step A0 <4>[ 157.574200] media: Xe2_HPM 13.01 step A1 <4>[ 157.574200] tile: 0 VRAM 12.0 GiB <4>[ 157.574200] GT: 0 type 1 Oops#2 Part13 <4>[ 157.574203] WARNING: drivers/gpu/drm/xe/xe_guc_ct.c:541 at guc_ct_change_state+0x264/0x330 [xe], CPU#3: xe_fault_inject/5398 <4>[ 157.574276] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling cmdlinepart x86_pkg_temp_thermal intel_powerclamp spi_nor hid_generic mtd coretemp eeepc_wmi asus_wmi sparse_keymap mei_hdcp mei_pxp platform_profile wmi_bmof kvm_intel kvm irqbypass ghash_clmulni_intel aesni_intel snd_intel_dspcfg r8169 rapl binfmt_misc snd_hda_codec usbhid intel_cstate snd_hda_core hid spi_intel_pci snd_hwdep realtek spi_intel snd_pcm snd_timer i2c_i801 i2c_mux snd soundcore idma64 i2c_smbus intel_pmc_core video pmt_telemetry nls_iso8859_1 pmt_discovery mei_me pmt_class mei intel_pmc_ssram_telemetry wmi intel_vsec pinctrl_alderlake acpi_tad acpi_pad dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink <4>[ 157.574328] autofs4 [last unloaded: snd_hda_intel] <4>[ 157.574332] CPU: 3 UID: 0 PID: 5398 Comm: xe_fault_inject Tainted: G S U W 7.0.0-rc1-lgci-xe-xe-4628-1abdcb654ffbb08bb-debug+ #1 PREEMPT(lazy) <4>[ 157.574335] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN <4>[ 157.574336] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024 Oops#2 Part12 <4>[ 157.574338] RIP: 0010:guc_ct_change_state+0x2d8/0x330 [xe] <4>[ 157.574410] Code: 51 48 c1 ea 25 44 6b ca 64 44 29 c9 51 48 c7 c1 e0 55 18 a1 52 4c 8b 55 88 41 52 44 8b 4d 9c 4c 8b 45 90 48 8b 95 78 ff ff ff <67> 48 0f b9 3a 8b 8b 50 01 00 00 48 83 c4 60 85 c9 75 13 44 89 bb <4>[ 157.574412] RSP: 0018:ffffc9000c357588 EFLAGS: 00010002 <4>[ 157.574414] RAX: ffffffffa11fa3c7 RBX: ffff888166d50738 RCX: ffffffffa11855e0 <4>[ 157.574415] RDX: ffff888103f3b090 RSI: ffffffffa11fa3c7 RDI: ffffffffa1002ee0 <4>[ 157.574416] RBP: ffffc9000c357670 R08: ffffffffa11fa417 R09: 0000000000000007 <4>[ 157.574418] R10: ffffffffa11fa4c8 R11: 0000000000000514 R12: ffff888166d507c8 <4>[ 157.574419] R13: 0000000000000001 R14: 0000000000000000 R15: 0000000000000001 <4>[ 157.574421] FS: 000072d8bc586980(0000) GS:ffff8888dae1b000(0000) knlGS:0000000000000000 <4>[ 157.574423] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 157.574424] CR2: 000065486a3690e0 CR3: 0000000125002005 CR4: 0000000000f72ef0 <4>[ 157.574443] PKRU: 55555554 <4>[ 157.574444] Call Trace: <4>[ 157.574445] <4>[ 157.574453] ? xe_guc_submit_enable+0xa8/0xf0 [xe] <4>[ 157.574538] xe_guc_ct_disable+0x17/0x80 [xe] <4>[ 157.574616] xe_guc_sanitize+0x2a/0x50 [xe] <4>[ 157.574694] xe_uc_load_hw+0x19a/0x2b0 [xe] <4>[ 157.574794] ? xe_migrate_init+0x277/0x2d0 [xe] <4>[ 157.574882] xe_gt_init+0x35d/0xab0 [xe] <4>[ 157.574958] ? _raw_spin_unlock_irqrestore+0x51/0x80 <4>[ 157.574963] ? __devm_add_action+0x70/0xa0 <4>[ 157.574967] ? xe_irq_install+0x11a/0x490 [xe] <4>[ 157.575053] xe_device_probe+0x3c5/0xc10 [xe] Oops#2 Part11 <4>[ 157.575127] ? __drm_dev_dbg+0x7d/0xb0 <4>[ 157.575131] ? __drmm_add_action_or_reset+0x1e/0x50 <4>[ 157.575137] xe_pci_probe+0x396/0x610 [xe] <4>[ 157.575223] ? trace_hardirqs_on+0x22/0x100 <4>[ 157.575231] local_pci_probe+0x47/0xb0 <4>[ 157.575235] pci_call_probe+0x6c/0x360 <4>[ 157.575241] ? _raw_spin_unlock+0x22/0x50 <4>[ 157.575245] pci_device_probe+0xae/0x110 <4>[ 157.575249] really_probe+0xf1/0x410 <4>[ 157.575253] __driver_probe_device+0x8c/0x190 <4>[ 157.575256] device_driver_attach+0x57/0xd0 <4>[ 157.575259] bind_store+0x142/0x150 <4>[ 157.575263] drv_attr_store+0x24/0x50 <4>[ 157.575266] sysfs_kf_write+0x4d/0x80 <4>[ 157.575270] kernfs_fop_write_iter+0x188/0x240 <4>[ 157.575275] vfs_write+0x283/0x540 <4>[ 157.575283] ksys_write+0x6f/0xf0 <4>[ 157.575288] __x64_sys_write+0x19/0x30 <4>[ 157.575291] x64_sys_call+0x259/0x26e0 <4>[ 157.575294] do_syscall_64+0xdd/0x1470 <4>[ 157.575298] ? __pcs_replace_full_main+0x29a/0x660 <4>[ 157.575303] ? putname+0x41/0x90 <4>[ 157.575307] ? kmem_cache_free+0x165/0x510 <4>[ 157.575311] ? putname+0x41/0x90 <4>[ 157.575314] ? do_sys_openat2+0x85/0xd0 <4>[ 157.575319] ? __x64_sys_openat+0x54/0xa0 <4>[ 157.575321] ? trace_hardirqs_on_prepare+0xe1/0x100 <4>[ 157.575325] ? do_syscall_64+0x22e/0x1470 <4>[ 157.575330] ? trace_hardirqs_on_prepare+0xe1/0x100 <4>[ 157.575333] ? do_syscall_64+0x22e/0x1470 <4>[ 157.575335] ? fput_close_sync+0x3d/0xa0 <4>[ 157.575338] ? trace_hardirqs_on_prepare+0xe1/0x100 <4>[ 157.575341] ? do_syscall_64+0x22e/0x1470 <4>[ 157.575343] ? exc_page_fault+0xbd/0x2c0 <4>[ 157.575347] entry_SYSCALL_64_after_hwframe+0x76/0x7e Oops#2 Part10 <4>[ 157.575349] RIP: 0033:0x72d8be71c5a4 <4>[ 157.575352] Code: c7 00 16 00 00 00 b8 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 f3 0f 1e fa 80 3d a5 ea 0e 00 00 74 13 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 54 c3 0f 1f 00 55 48 89 e5 48 83 ec 20 48 89 <4>[ 157.575353] RSP: 002b:00007ffec7655ae8 EFLAGS: 00000202 ORIG_RAX: 0000000000000001 <4>[ 157.575355] RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 000072d8be71c5a4 <4>[ 157.575357] RDX: 000000000000000c RSI: 00007ffec7655fb0 RDI: 0000000000000007 <4>[ 157.575358] RBP: 000000000000000c R08: 0000000000000073 R09: 0000000000000000 <4>[ 157.575359] R10: 0000000000000000 R11: 0000000000000202 R12: 00007ffec7655fb0 <4>[ 157.575360] R13: 0000000000000007 R14: 0000000000000006 R15: 00007ffec7655c60 <4>[ 157.575368] <4>[ 157.575369] irq event stamp: 1240786 <4>[ 157.575370] hardirqs last enabled at (1240785): [] _raw_spin_unlock_irqrestore+0x51/0x80 <4>[ 157.575373] hardirqs last disabled at (1240786): [] _raw_spin_lock_irq+0x6f/0x80 <4>[ 157.575375] softirqs last enabled at (1240494): [] __irq_exit_rcu+0x13f/0x160 <4>[ 157.575378] softirqs last disabled at (1240489): [] __irq_exit_rcu+0x13f/0x160 <4>[ 157.575380] ---[ end trace 0000000000000000 ]--- <7>[ 157.575382] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled <4>[ 157.575482] ------------[ cut here ]------------ <4>[ 157.575489] xe 0000:03:00.0: [drm] Tile0: GT0: Failed to invalidate GGTT (-ENODEV) <3>[ 157.575493] xe 0000:03:00.0: probe with driver xe failed with error -12 Oops#2 Part9 <4>[ 157.575491] WARNING: drivers/gpu/drm/xe/xe_ggtt.c:576 at ggtt_invalidate_gt_tlb.part.0+0x76/0xb0 [xe], CPU#4: kworker/4:7/2349 <4>[ 157.575560] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling cmdlinepart x86_pkg_temp_thermal intel_powerclamp spi_nor hid_generic mtd coretemp eeepc_wmi asus_wmi sparse_keymap mei_hdcp mei_pxp platform_profile wmi_bmof kvm_intel kvm irqbypass ghash_clmulni_intel aesni_intel snd_intel_dspcfg r8169 rapl binfmt_misc snd_hda_codec usbhid intel_cstate snd_hda_core hid spi_intel_pci snd_hwdep realtek spi_intel snd_pcm snd_timer i2c_i801 i2c_mux snd soundcore idma64 i2c_smbus intel_pmc_core video pmt_telemetry nls_iso8859_1 pmt_discovery mei_me pmt_class mei intel_pmc_ssram_telemetry wmi intel_vsec pinctrl_alderlake acpi_tad acpi_pad dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink <4>[ 157.575627] autofs4 [last unloaded: snd_hda_intel] <4>[ 157.575631] CPU: 4 UID: 0 PID: 2349 Comm: kworker/4:7 Tainted: G S U W 7.0.0-rc1-lgci-xe-xe-4628-1abdcb654ffbb08bb-debug+ #1 PREEMPT(lazy) <4>[ 157.575634] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN <4>[ 157.575635] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024 <4>[ 157.575636] Workqueue: xe-destroy-wq __guc_exec_queue_destroy_async [xe] Oops#2 Part8 <4>[ 157.575711] RIP: 0010:ggtt_invalidate_gt_tlb.part.0+0x81/0xb0 [xe] <4>[ 157.575772] Code: 48 8b 7f 08 4c 8b 77 50 4d 85 f6 75 03 4c 8b 37 e8 24 76 62 e1 48 89 c6 48 8d 3d 7a d0 3d 00 4d 89 e1 45 89 e8 89 d9 4c 89 f2 <67> 48 0f b9 3a 5b 41 5c 41 5d 41 5e 5d 31 c0 31 d2 31 c9 31 f6 31 <4>[ 157.575774] RSP: 0018:ffffc90003e9bb08 EFLAGS: 00010246 <4>[ 157.575776] RAX: ffffffffa11fa3c7 RBX: 0000000000000000 RCX: 0000000000000000 <4>[ 157.575778] RDX: ffff888103f3b090 RSI: ffffffffa11fa3c7 RDI: ffffffffa1001fc0 <4>[ 157.575779] RBP: ffffc90003e9bb28 R08: 0000000000000000 R09: ffffffffffffffed <4>[ 157.575780] R10: 0000000000000000 R11: 0000000000000000 R12: ffffffffffffffed <4>[ 157.575781] R13: 0000000000000000 R14: ffff888103f3b090 R15: 0000000000000000 <4>[ 157.575782] FS: 0000000000000000(0000) GS:ffff8888dae9b000(0000) knlGS:0000000000000000 <4>[ 157.575784] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 157.575785] CR2: 000065486a49fde0 CR3: 000000000344a004 CR4: 0000000000f72ef0 <4>[ 157.575787] PKRU: 55555554 <4>[ 157.575788] Call Trace: <4>[ 157.575789] <4>[ 157.575792] ggtt_node_remove+0x11a/0x140 [xe] <4>[ 157.575854] xe_ggtt_node_remove+0x40/0xa0 [xe] <4>[ 157.575914] xe_ggtt_remove_bo+0x87/0x250 [xe] <4>[ 157.575975] ? _raw_write_unlock+0x22/0x50 <4>[ 157.575979] ? drm_vma_offset_remove+0x65/0x80 <4>[ 157.575984] xe_ttm_bo_destroy+0xa2/0x2d0 [xe] <4>[ 157.576039] ? lock_is_held_type+0xa3/0x130 <4>[ 157.576044] ttm_bo_release+0x70/0x330 [ttm] <3>[ 157.576072] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: GuC RC setup HOST_CONTROL(0) failed (-ENODEV) <4>[ 157.576051] ? xe_ggtt_might_lock+0x29/0x60 [xe] Oops#2 Part7 <4>[ 157.576111] ? lock_release+0xd0/0x2b0 <4>[ 157.576116] ttm_bo_fini+0x3c/0x70 [ttm] <4>[ 157.576121] xe_gem_object_free+0x1a/0x30 [xe] <4>[ 157.576174] drm_gem_object_free+0x1d/0x40 <4>[ 157.576177] xe_bo_put+0x12a/0x190 [xe] <4>[ 157.576233] xe_lrc_destroy+0x49/0x90 [xe] <4>[ 157.576308] xe_exec_queue_fini+0x85/0xd0 [xe] <4>[ 157.576367] __guc_exec_queue_destroy_async+0x6c/0x1a0 [xe] <4>[ 157.576449] process_one_work+0x22e/0x740 <4>[ 157.576457] worker_thread+0x1e8/0x3d0 <4>[ 157.576461] ? __pfx_worker_thread+0x10/0x10 <4>[ 157.576464] kthread+0x10d/0x150 <4>[ 157.576467] ? __pfx_kthread+0x10/0x10 <7>[ 157.576436] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled <4>[ 157.576471] ret_from_fork+0x3d4/0x480 <4>[ 157.576474] ? __pfx_kthread+0x10/0x10 <4>[ 157.576477] ret_from_fork_asm+0x1a/0x30 <4>[ 157.576484] <4>[ 157.576485] irq event stamp: 6641 <4>[ 157.576486] hardirqs last enabled at (6647): [] __up_console_sem+0x79/0xa0 <4>[ 157.576489] hardirqs last disabled at (6652): [] __up_console_sem+0x5e/0xa0 <4>[ 157.576491] softirqs last enabled at (6582): [] __irq_exit_rcu+0x13f/0x160 <4>[ 157.576494] softirqs last disabled at (6577): [] __irq_exit_rcu+0x13f/0x160 <4>[ 157.576495] ---[ end trace 0000000000000000 ]--- <7>[ 157.577738] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled <7>[ 157.653725] xe 0000:03:00.0: [drm:drm_pagemap_cache_fini [drm_gpusvm_helper]] Destroying dpagemap cache. Oops#2 Part6 <7>[ 157.656263] xe 0000:03:00.0: [drm:drm_pagemap_shrinker_fini [drm_gpusvm_helper]] Destroying dpagemap shrinker. <3>[ 159.876988] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=51 recv=0 <1>[ 159.877979] BUG: unable to handle page fault for address: ffffc9000838a188 <1>[ 159.878017] #PF: supervisor write access in kernel mode <1>[ 159.878039] #PF: error_code(0x0002) - not-present page <6>[ 159.878058] PGD 100000067 P4D 100000067 PUD 100ac0067 PMD 0 <4>[ 159.878089] Oops: Oops: 0002 [#1] SMP NOPTI <4>[ 159.878114] CPU: 4 UID: 0 PID: 32 Comm: kworker/4:0 Tainted: G S U W 7.0.0-rc1-lgci-xe-xe-4628-1abdcb654ffbb08bb-debug+ #1 PREEMPT(lazy) <4>[ 159.878158] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN <4>[ 159.878176] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024 <4>[ 159.878200] Workqueue: xe-destroy-wq __guc_exec_queue_destroy_async [xe] <4>[ 159.878864] RIP: 0010:xe_mmio_write32+0x58/0x2b0 [xe] <4>[ 159.879605] Code: 24 66 90 65 8b 05 2c 63 2e e3 48 0f a3 05 d0 c2 d0 e2 0f 82 1d 01 00 00 41 f7 c5 00 00 00 01 0f 84 b7 00 00 00 49 03 5c 24 08 <44> 89 3b 48 8d 65 d8 5b 41 5c 41 5d 41 5e 41 5f 5d 31 c0 31 d2 31 <4>[ 159.879668] RSP: 0018:ffffc900002637f8 EFLAGS: 00010086 <4>[ 159.879696] RAX: 0000000000000002 RBX: ffffc9000838a188 RCX: 0000000000000000 <4>[ 159.879725] RDX: 0000000000010001 RSI: 000000000000a188 RDI: ffff88817ede0060 <4>[ 159.879753] RBP: ffffc90000263870 R08: 0000000000000000 R09: 0000000000000000 <4>[ 159.879780] R10: ffff888154d60000 R11: 0000000000000001 R12: ffff88817ede0060 Oops#2 Part5 <4>[ 159.879808] R13: 000000000000a188 R14: ffff888154d60000 R15: 0000000000010001 <4>[ 159.879836] FS: 0000000000000000(0000) GS:ffff8888dae9b000(0000) knlGS:0000000000000000 <4>[ 159.879870] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 159.879895] CR2: ffffc9000838a188 CR3: 000000000344a004 CR4: 0000000000f72ef0 <4>[ 159.879924] PKRU: 55555554 <4>[ 159.879940] Call Trace: <4>[ 159.879955] <4>[ 159.879986] xe_force_wake_get+0x2a5/0x940 [xe] <4>[ 159.880654] ? _raw_spin_unlock_irqrestore+0x27/0x80 <4>[ 159.880704] ? mark_held_locks+0x46/0x90 <4>[ 159.880749] send_tlb_inval_ggtt+0xfa/0x270 [xe] <4>[ 159.881432] ? trace_hardirqs_on+0x22/0x100 <4>[ 159.881460] ? _raw_spin_unlock_irq+0x27/0x70 <4>[ 159.881486] ? xe_tlb_inval_fence_prep+0xce/0x1e0 [xe] <4>[ 159.882027] xe_tlb_inval_ggtt+0x73/0x250 [xe] <4>[ 159.882497] ? xelpg_ggtt_pte_flags+0x27/0x1a0 [xe] <4>[ 159.882894] ? find_held_lock+0x31/0x90 <4>[ 159.882911] ? ggtt_node_remove+0xcb/0x140 [xe] <4>[ 159.883308] ggtt_invalidate_gt_tlb.part.0+0x1f/0xb0 [xe] <4>[ 159.883468] ggtt_node_remove+0x12c/0x140 [xe] <4>[ 159.883541] xe_ggtt_node_remove+0x40/0xa0 [xe] <4>[ 159.883610] xe_ggtt_remove_bo+0x87/0x250 [xe] <4>[ 159.883677] ? _raw_write_unlock+0x22/0x50 <4>[ 159.883680] ? drm_vma_offset_remove+0x65/0x80 <4>[ 159.883685] xe_ttm_bo_destroy+0xa2/0x2d0 [xe] <4>[ 159.883751] ? lock_is_held_type+0xa3/0x130 <4>[ 159.883756] ttm_bo_release+0x70/0x330 [ttm] <4>[ 159.883763] ? xe_ggtt_might_lock+0x29/0x60 [xe] <4>[ 159.883829] ? lock_release+0xd0/0x2b0 <4>[ 159.883833] ttm_bo_fini+0x3c/0x70 [ttm] <4>[ 159.883838] xe_gem_object_free+0x1a/0x30 [xe] Oops#2 Part4 <4>[ 159.883905] drm_gem_object_free+0x1d/0x40 <4>[ 159.883908] xe_bo_put+0x12a/0x190 [xe] <4>[ 159.883974] xe_lrc_destroy+0x74/0x90 [xe] <4>[ 159.884049] xe_exec_queue_fini+0x85/0xd0 [xe] <4>[ 159.884115] __guc_exec_queue_destroy_async+0x6c/0x1a0 [xe] <4>[ 159.884185] process_one_work+0x22e/0x740 <4>[ 159.884191] worker_thread+0x1e8/0x3d0 <4>[ 159.884194] ? __pfx_worker_thread+0x10/0x10 <4>[ 159.884197] kthread+0x10d/0x150 <4>[ 159.884200] ? __pfx_kthread+0x10/0x10 <4>[ 159.884204] ret_from_fork+0x3d4/0x480 <4>[ 159.884207] ? __pfx_kthread+0x10/0x10 <4>[ 159.884210] ret_from_fork_asm+0x1a/0x30 <4>[ 159.884215] <4>[ 159.884217] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling cmdlinepart x86_pkg_temp_thermal intel_powerclamp spi_nor hid_generic mtd coretemp eeepc_wmi asus_wmi sparse_keymap mei_hdcp mei_pxp platform_profile wmi_bmof kvm_intel kvm irqbypass ghash_clmulni_intel aesni_intel snd_intel_dspcfg r8169 rapl binfmt_misc snd_hda_codec usbhid intel_cstate snd_hda_core hid spi_intel_pci snd_hwdep realtek spi_intel snd_pcm snd_timer i2c_i801 i2c_mux snd soundcore idma64 i2c_smbus intel_pmc_core video pmt_telemetry nls_iso8859_1 pmt_discovery mei_me pmt_class mei intel_pmc_ssram_telemetry wmi intel_vsec pinctrl_alderlake acpi_tad acpi_pad dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink Oops#2 Part3 <4>[ 159.884250] autofs4 [last unloaded: snd_hda_intel] <4>[ 159.884276] CR2: ffffc9000838a188 <4>[ 159.884279] ---[ end trace 0000000000000000 ]--- <4>[ 162.974383] RIP: 0010:xe_mmio_write32+0x58/0x2b0 [xe] <4>[ 162.974496] Code: 24 66 90 65 8b 05 2c 63 2e e3 48 0f a3 05 d0 c2 d0 e2 0f 82 1d 01 00 00 41 f7 c5 00 00 00 01 0f 84 b7 00 00 00 49 03 5c 24 08 <44> 89 3b 48 8d 65 d8 5b 41 5c 41 5d 41 5e 41 5f 5d 31 c0 31 d2 31 <4>[ 162.974503] RSP: 0018:ffffc900002637f8 EFLAGS: 00010086 <4>[ 162.974507] RAX: 0000000000000002 RBX: ffffc9000838a188 RCX: 0000000000000000 <4>[ 162.974511] RDX: 0000000000010001 RSI: 000000000000a188 RDI: ffff88817ede0060 <4>[ 162.974514] RBP: ffffc90000263870 R08: 0000000000000000 R09: 0000000000000000 <4>[ 162.974517] R10: ffff888154d60000 R11: 0000000000000001 R12: ffff88817ede0060 <4>[ 162.974520] R13: 000000000000a188 R14: ffff888154d60000 R15: 0000000000010001 <4>[ 162.974523] FS: 0000000000000000(0000) GS:ffff8888dae9b000(0000) knlGS:0000000000000000 <4>[ 162.974527] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 162.974530] CR2: ffffc9000838a188 CR3: 000000000344a004 CR4: 0000000000f72ef0 <4>[ 162.974533] PKRU: 55555554 <6>[ 162.974535] note: kworker/4:0[32] exited with irqs disabled <6>[ 162.974560] note: kworker/4:0[32] exited with preempt_count 1 Oops#2 Part2 <3>[ 162.974575] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=1699706880 recv=0 <4>[ 162.974593] Oops: general protection fault, probably for non-canonical address 0xffff3937343536e8: 0000 [#2] SMP NOPTI <4>[ 162.974601] CPU: 13 UID: 0 PID: 205 Comm: kworker/u64:5 Tainted: G S UD W 7.0.0-rc1-lgci-xe-xe-4628-1abdcb654ffbb08bb-debug+ #1 PREEMPT(lazy) <4>[ 162.974610] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [D]=DIE, [W]=WARN <4>[ 162.974614] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024 <4>[ 162.974619] Workqueue: gt-ordered-wq xe_tlb_inval_fence_timeout [xe] <4>[ 162.974763] RIP: 0010:xe_tlb_inval_fence_signal+0x75/0x200 [xe] <4>[ 162.974889] Code: 48 8b 83 88 00 00 00 48 89 42 08 48 89 10 48 b8 00 01 00 00 00 00 ad de 48 89 83 80 00 00 00 48 83 c0 22 48 89 83 88 00 00 00 <49> 8b 95 b8 00 00 00 49 8d 85 b8 00 00 00 48 39 c2 0f 84 53 01 00 <4>[ 162.974897] RSP: 0018:ffffc90001627d80 EFLAGS: 00010086 <4>[ 162.974902] RAX: dead000000000122 RBX: ffffc90000263a28 RCX: 0000000000000000 <4>[ 162.974906] RDX: ffff88817ede0510 RSI: 0000000000000000 RDI: 0000000000000000 <4>[ 162.974910] RBP: ffffc90001627da0 R08: 0000000000000000 R09: 0000000000000000 <4>[ 162.974913] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000 <4>[ 162.974917] R13: ffff393734353630 R14: 0000000000000801 R15: ffff88817ede0490 <4>[ 162.974921] FS: 0000000000000000(0000) GS:ffff8888db31b000(0000) knlGS:0000000000000000 <4>[ 162.974926] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 162.974929] CR2: 000065486a3691f0 CR3: 0000000124926002 CR4: 0000000000f72ef0 <4>[ 162.974934] PKRU: 55555554 Oops#2 Part1 <4>[ 162.974936] Call Trace: <4>[ 162.974939] <4>[ 162.974943] xe_tlb_inval_fence_timeout+0xb9/0x220 [xe] <4>[ 162.975067] process_one_work+0x22e/0x740 <4>[ 162.975075] worker_thread+0x1e8/0x3d0 <4>[ 162.975080] ? __pfx_worker_thread+0x10/0x10 <4>[ 162.975085] kthread+0x10d/0x150 <4>[ 162.975089] ? __pfx_kthread+0x10/0x10 <4>[ 162.975093] ret_from_fork+0x3d4/0x480 <4>[ 162.975098] ? __pfx_kthread+0x10/0x10 <4>[ 162.975102] ret_from_fork_asm+0x1a/0x30 <4>[ 162.975109] <4>[ 162.975111] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling cmdlinepart x86_pkg_temp_thermal intel_powerclamp spi_nor hid_generic mtd coretemp eeepc_wmi asus_wmi sparse_keymap mei_hdcp mei_pxp platform_profile wmi_bmof kvm_intel kvm irqbypass ghash_clmulni_intel aesni_intel snd_intel_dspcfg r8169 rapl binfmt_misc snd_hda_codec usbhid intel_cstate snd_hda_core hid spi_intel_pci snd_hwdep realtek spi_intel snd_pcm snd_timer i2c_i801 i2c_mux snd soundcore idma64 i2c_smbus intel_pmc_core video pmt_telemetry nls_iso8859_1 pmt_discovery mei_me pmt_class mei intel_pmc_ssram_telemetry wmi intel_vsec pinctrl_alderlake acpi_tad acpi_pad dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink <4>[ 162.975159] autofs4 [last unloaded: snd_hda_intel] <4>[ 162.975195] ---[ end trace 0000000000000000 ]---