Oops#2 Part15 <7>[ 327.126508] xe 0000:03:00.0: [drm:guc_print_params [xe]] Tile0: GT0: GuC param[10] = 0x00000000 <7>[ 327.126567] xe 0000:03:00.0: [drm:guc_print_params [xe]] Tile0: GT0: GuC param[11] = 0x00000000 <7>[ 327.126625] xe 0000:03:00.0: [drm:guc_print_params [xe]] Tile0: GT0: GuC param[12] = 0x00000000 <7>[ 327.126686] xe 0000:03:00.0: [drm:guc_print_params [xe]] Tile0: GT0: GuC param[13] = 0x00000000 <7>[ 327.126770] xe 0000:03:00.0: [drm:xe_guc_id_mgr_init [xe]] Tile0: GT0: using 65535 GuC IDs <7>[ 327.126860] xe 0000:03:00.0: [drm:xe_guc_db_mgr_init [xe]] Tile0: GT0: using 256 doorbells <7>[ 327.128062] xe 0000:03:00.0: [drm:guc_buf_cache_init [xe]] Tile0: GT0: reusable buffer with 2097152 dwords at 0x627000 for xe_guc_buf_cache_init_with_size [xe] <7>[ 327.129016] xe 0000:03:00.0: [drm:xe_migrate_init [xe]] Migrate min chunk size is 0x00010000 <7>[ 327.130022] xe 0000:03:00.0: [drm:xe_guc_capture_steered_list_init [xe]] Tile0: GT0: capture found 120 ext-regs. <7>[ 327.153129] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152) <7>[ 327.163948] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 10ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034 <7>[ 327.164257] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled <7>[ 327.164855] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC rcs0 WA job: 4140 dwords <7>[ 327.164939] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x5588] = 0x04000400 Oops#2 Part14 <7>[ 327.165007] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6204] = 0x01400140 <7>[ 327.165071] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6208] = 0x00200020 <7>[ 327.165137] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x62a8] = 0x02400240 <7>[ 327.165224] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x7010] = 0x40004000 <7>[ 327.165291] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x7044] = 0x04200420 <7>[ 327.165356] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x7300] = 0x10001000 <7>[ 327.165423] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x83a8] = 0x20002000 <7>[ 327.165495] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6210] = ~0x3f18000|0x3f18000 <7>[ 327.167013] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC bcs0 WA job: 27 dwords <7>[ 327.167090] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x22204] = ~0x7e7e|0x606 <7>[ 327.167163] xe 0000:03:00.0: [drm:xe_lrc_emit_hwe_state_instructions [xe]] Tile0: GT0: No non-register state to emit on graphics ver 20.01 <7>[ 327.168581] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC ccs0 WA job: 0 dwords <7>[ 327.168650] xe 0000:03:00.0: [drm:xe_lrc_emit_hwe_state_instructions [xe]] Tile0: GT0: No non-register state to emit on graphics ver 20.01 <5>[ 327.170753] FAULT_INJECTION: forcing a failure. <5>[ 327.170753] name fail_function, interval 0, probability 100, space 1, times 100 Oops#2 Part13 <3>[ 327.170762] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: GuC PC query task state failed: -ENOMEM <4>[ 327.170972] ------------[ cut here ]------------ <4>[ 327.170974] xe 0000:03:00.0: [drm] Assertion `ct->g2h_outstanding == 0 || state == XE_GUC_CT_STATE_STOPPED` failed! <4>[ 327.170974] platform: BATTLEMAGE subplatform: 7 <4>[ 327.170974] graphics: Xe2_HPG 20.01 step A0 <4>[ 327.170974] media: Xe2_HPM 13.01 step A1 <4>[ 327.170974] tile: 0 VRAM 12.0 GiB <4>[ 327.170974] GT: 0 type 1 <4>[ 327.170977] WARNING: drivers/gpu/drm/xe/xe_guc_ct.c:527 at guc_ct_change_state+0x279/0x350 [xe], CPU#3: xe_fault_inject/7009 <4>[ 327.171045] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_gsc_proxy mei_lb mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling cmdlinepart x86_pkg_temp_thermal spi_nor intel_powerclamp eeepc_wmi mtd asus_wmi hid_generic coretemp sparse_keymap platform_profile mei_pxp mei_hdcp wmi_bmof snd_hda_intel snd_intel_dspcfg snd_hda_codec kvm_intel snd_hda_core snd_hwdep kvm snd_pcm irqbypass ghash_clmulni_intel aesni_intel video usbhid r8169 snd_timer spi_intel_pci rapl hid intel_cstate binfmt_misc i2c_i801 spi_intel snd i2c_mux realtek i2c_smbus soundcore idma64 intel_pmc_core pmt_telemetry pmt_discovery nls_iso8859_1 pmt_class intel_pmc_ssram_telemetry mei_me mei wmi intel_vsec acpi_pad pinctrl_alderlake acpi_tad dm_multipath msr nvme_fabrics fuse Oops#2 Part12 <4>[ 327.171096] efi_pstore nfnetlink autofs4 <4>[ 327.171100] CPU: 3 UID: 0 PID: 7009 Comm: xe_fault_inject Tainted: G S U W 6.19.0-lgci-xe-xe-4561-eac0d1b8d00bd336f-debug+ #1 PREEMPT(voluntary) <4>[ 327.171103] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN <4>[ 327.171105] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024 <4>[ 327.171106] RIP: 0010:guc_ct_change_state+0x2ed/0x350 [xe] <4>[ 327.171181] Code: 1f 85 eb 51 48 c1 ea 25 44 6b ca 64 44 29 c9 51 48 c7 c1 f0 63 18 a1 52 ff 75 b0 44 8b 4d 94 4c 8b 45 88 48 8b 95 78 ff ff ff <67> 48 0f b9 3a 8b 8b 48 01 00 00 48 83 c4 60 85 c9 75 13 44 89 bb <4>[ 327.171182] RSP: 0018:ffffc9000e00b850 EFLAGS: 00010002 <4>[ 327.171184] RAX: ffffffffa11fa9f4 RBX: ffff88816e4f88a8 RCX: ffffffffa11863f0 <4>[ 327.171186] RDX: ffff888104859c10 RSI: ffffffffa11fa9f4 RDI: ffffffffa1002f80 <4>[ 327.171187] RBP: ffffc9000e00b938 R08: ffffffffa11faa44 R09: 0000000000000007 <4>[ 327.171188] R10: 0000000000000001 R11: 0000000000000514 R12: ffff88816e4f88b0 <4>[ 327.171189] R13: ffff88816e4f8940 R14: 0000000000000515 R15: 0000000000000001 <4>[ 327.171190] FS: 00007080ba82a980(0000) GS:ffff8888dae5a000(0000) knlGS:0000000000000000 <4>[ 327.171192] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Oops#2 Part11 <4>[ 327.171193] CR2: 000070118ee0b048 CR3: 000000011b1df005 CR4: 0000000000f72ef0 <4>[ 327.171195] PKRU: 55555554 <4>[ 327.171196] Call Trace: <4>[ 327.171197] <4>[ 327.171205] ? xe_guc_submit_enable+0xa8/0xf0 [xe] <4>[ 327.171288] xe_guc_ct_disable+0x17/0x80 [xe] <4>[ 327.171366] xe_guc_sanitize+0x2a/0x50 [xe] <4>[ 327.171443] xe_uc_load_hw+0x19a/0x2b0 [xe] <4>[ 327.171541] ? xe_migrate_init+0x277/0x2d0 [xe] <4>[ 327.171629] xe_gt_init+0x363/0xab0 [xe] <4>[ 327.171705] ? trace_hardirqs_on+0x63/0xd0 <4>[ 327.171709] ? _raw_spin_unlock_irqrestore+0x51/0x80 <4>[ 327.171713] ? __devm_add_action+0x70/0xa0 <4>[ 327.171717] ? xe_irq_install+0x11a/0x490 [xe] <4>[ 327.171804] xe_device_probe+0x3cc/0xc20 [xe] <4>[ 327.171879] ? __drm_dev_dbg+0x7d/0xb0 <4>[ 327.171883] ? __drmm_add_action_or_reset+0x1e/0x50 <4>[ 327.171889] xe_pci_probe+0x396/0x610 [xe] <4>[ 327.171981] local_pci_probe+0x47/0xb0 <4>[ 327.171985] pci_device_probe+0xf3/0x260 <4>[ 327.171990] really_probe+0xf1/0x410 <4>[ 327.171993] __driver_probe_device+0x8c/0x190 <4>[ 327.171996] device_driver_attach+0x57/0xd0 <4>[ 327.172000] bind_store+0x77/0xd0 <4>[ 327.172003] drv_attr_store+0x24/0x50 <4>[ 327.172006] sysfs_kf_write+0x4d/0x80 <4>[ 327.172010] kernfs_fop_write_iter+0x188/0x240 <4>[ 327.172014] vfs_write+0x283/0x540 <4>[ 327.172022] ksys_write+0x6f/0xf0 <4>[ 327.172026] __x64_sys_write+0x19/0x30 <4>[ 327.172029] x64_sys_call+0x79/0x26b0 <4>[ 327.172031] do_syscall_64+0x93/0x1470 <4>[ 327.172036] ? do_syscall_64+0x1e4/0x1470 <4>[ 327.172037] ? exc_page_fault+0xbb/0x260 <4>[ 327.172041] entry_SYSCALL_64_after_hwframe+0x76/0x7e Oops#2 Part10 <4>[ 327.172043] RIP: 0033:0x7080bcb1c5a4 <4>[ 327.172045] Code: c7 00 16 00 00 00 b8 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 f3 0f 1e fa 80 3d a5 ea 0e 00 00 74 13 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 54 c3 0f 1f 00 55 48 89 e5 48 83 ec 20 48 89 <4>[ 327.172047] RSP: 002b:00007ffc8dab7cc8 EFLAGS: 00000202 ORIG_RAX: 0000000000000001 <4>[ 327.172049] RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 00007080bcb1c5a4 <4>[ 327.172050] RDX: 000000000000000c RSI: 00007ffc8dab9180 RDI: 000000000000000b <4>[ 327.172051] RBP: 000000000000000c R08: 0000000000000073 R09: 0000000000000000 <4>[ 327.172052] R10: 0000000000000000 R11: 0000000000000202 R12: 00007ffc8dab9180 <4>[ 327.172053] R13: 000000000000000b R14: 000061256c0bd35b R15: 00007ffc8dab8e30 <4>[ 327.172061] <4>[ 327.172062] irq event stamp: 795732 <4>[ 327.172063] hardirqs last enabled at (795731): [] _raw_spin_unlock_irqrestore+0x51/0x80 <4>[ 327.172066] hardirqs last disabled at (795732): [] _raw_spin_lock_irq+0x6f/0x80 <4>[ 327.172068] softirqs last enabled at (795636): [] __irq_exit_rcu+0x13f/0x160 <4>[ 327.172071] softirqs last disabled at (795627): [] __irq_exit_rcu+0x13f/0x160 <4>[ 327.172073] ---[ end trace 0000000000000000 ]--- <7>[ 327.172075] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled <4>[ 327.172171] ------------[ cut here ]------------ <4>[ 327.172177] xe 0000:03:00.0: [drm] Tile0: GT0: Failed to invalidate GGTT (-ENODEV) <3>[ 327.172182] xe 0000:03:00.0: probe with driver xe failed with error -12 Oops#2 Part9 <4>[ 327.172179] WARNING: drivers/gpu/drm/xe/xe_ggtt.c:521 at ggtt_invalidate_gt_tlb.part.0+0x76/0xb0 [xe], CPU#5: kworker/5:4/2574 <4>[ 327.172246] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_gsc_proxy mei_lb mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling cmdlinepart x86_pkg_temp_thermal spi_nor intel_powerclamp eeepc_wmi mtd asus_wmi hid_generic coretemp sparse_keymap platform_profile mei_pxp mei_hdcp wmi_bmof snd_hda_intel snd_intel_dspcfg snd_hda_codec kvm_intel snd_hda_core snd_hwdep kvm snd_pcm irqbypass ghash_clmulni_intel aesni_intel video usbhid r8169 snd_timer spi_intel_pci rapl hid intel_cstate binfmt_misc i2c_i801 spi_intel snd i2c_mux realtek i2c_smbus soundcore idma64 intel_pmc_core pmt_telemetry pmt_discovery nls_iso8859_1 pmt_class intel_pmc_ssram_telemetry mei_me mei wmi intel_vsec acpi_pad pinctrl_alderlake acpi_tad dm_multipath msr nvme_fabrics fuse <4>[ 327.172314] efi_pstore nfnetlink autofs4 <4>[ 327.172319] CPU: 5 UID: 0 PID: 2574 Comm: kworker/5:4 Tainted: G S U W 6.19.0-lgci-xe-xe-4561-eac0d1b8d00bd336f-debug+ #1 PREEMPT(voluntary) <4>[ 327.172322] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN <4>[ 327.172323] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024 <4>[ 327.172325] Workqueue: xe-destroy-wq __guc_exec_queue_destroy_async [xe] Oops#2 Part8 <4>[ 327.172397] RIP: 0010:ggtt_invalidate_gt_tlb.part.0+0x81/0xb0 [xe] <4>[ 327.172457] Code: 48 8b 7f 08 4c 8b 77 50 4d 85 f6 75 03 4c 8b 37 e8 e4 7d 5e e1 48 89 c6 48 8d 3d fa d2 3d 00 4d 89 e1 45 89 e8 89 d9 4c 89 f2 <67> 48 0f b9 3a 5b 41 5c 41 5d 41 5e 5d 31 c0 31 d2 31 c9 31 f6 31 <4>[ 327.172459] RSP: 0018:ffffc9000428bb08 EFLAGS: 00010246 <4>[ 327.172462] RAX: ffffffffa11fa9f4 RBX: 0000000000000000 RCX: 0000000000000000 <4>[ 327.172464] RDX: ffff888104859c10 RSI: ffffffffa11fa9f4 RDI: ffffffffa1001fc0 <4>[ 327.172465] RBP: ffffc9000428bb28 R08: 0000000000000000 R09: ffffffffffffffed <4>[ 327.172467] R10: 0000000000000000 R11: 0000000000000000 R12: ffffffffffffffed <4>[ 327.172468] R13: 0000000000000000 R14: ffff888104859c10 R15: 0000000000000000 <4>[ 327.172469] FS: 0000000000000000(0000) GS:ffff8888daf5a000(0000) knlGS:0000000000000000 <4>[ 327.172471] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 327.172473] CR2: 00005b9d801f4a60 CR3: 000000010e8af003 CR4: 0000000000f72ef0 <4>[ 327.172474] PKRU: 55555554 <4>[ 327.172476] Call Trace: <4>[ 327.172477] <4>[ 327.172480] ggtt_node_remove+0x110/0x140 [xe] <4>[ 327.172542] xe_ggtt_node_remove+0x40/0xa0 [xe] <4>[ 327.172603] xe_ggtt_remove_bo+0x87/0x250 [xe] <4>[ 327.172663] ? _raw_write_unlock+0x22/0x50 <4>[ 327.172667] ? drm_vma_offset_remove+0x65/0x80 <4>[ 327.172672] xe_ttm_bo_destroy+0xa2/0x2d0 [xe] <4>[ 327.172727] ? lock_is_held_type+0xa3/0x130 <4>[ 327.172733] ttm_bo_release+0x70/0x330 [ttm] <3>[ 327.172745] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: GuC RC setup HOST_CONTROL(0) failed (-ENODEV) <4>[ 327.172739] ? xe_ggtt_might_lock+0x29/0x60 [xe] Oops#2 Part7 <4>[ 327.172799] ? lock_release+0xce/0x280 <4>[ 327.172805] ttm_bo_fini+0x3c/0x70 [ttm] <4>[ 327.172810] xe_gem_object_free+0x1a/0x30 [xe] <4>[ 327.172864] drm_gem_object_free+0x1d/0x40 <4>[ 327.172866] xe_bo_put+0x12a/0x190 [xe] <4>[ 327.172922] xe_lrc_destroy+0x47/0x60 [xe] <4>[ 327.172997] xe_exec_queue_fini+0x85/0xd0 [xe] <4>[ 327.173054] __guc_exec_queue_destroy_async+0x6c/0x170 [xe] <4>[ 327.173122] process_one_work+0x22e/0x6b0 <4>[ 327.173129] worker_thread+0x1e8/0x3d0 <4>[ 327.173131] ? __pfx_worker_thread+0x10/0x10 <4>[ 327.173134] kthread+0x11f/0x250 <4>[ 327.173137] ? __pfx_kthread+0x10/0x10 <7>[ 327.173106] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled <4>[ 327.173149] ret_from_fork+0x344/0x3a0 <4>[ 327.173152] ? __pfx_kthread+0x10/0x10 <4>[ 327.173155] ret_from_fork_asm+0x1a/0x30 <4>[ 327.173162] <4>[ 327.173163] irq event stamp: 12009 <4>[ 327.173164] hardirqs last enabled at (12015): [] __up_console_sem+0x79/0xa0 <4>[ 327.173167] hardirqs last disabled at (12020): [] __up_console_sem+0x5e/0xa0 <4>[ 327.173169] softirqs last enabled at (11984): [] __irq_exit_rcu+0x13f/0x160 <4>[ 327.173172] softirqs last disabled at (11979): [] __irq_exit_rcu+0x13f/0x160 <4>[ 327.173174] ---[ end trace 0000000000000000 ]--- <7>[ 327.174199] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled <7>[ 327.270962] xe 0000:03:00.0: [drm:drm_pagemap_cache_fini [drm_gpusvm_helper]] Destroying dpagemap cache. Oops#2 Part6 <7>[ 327.272972] xe 0000:03:00.0: [drm:drm_pagemap_shrinker_fini [drm_gpusvm_helper]] Destroying dpagemap shrinker. <3>[ 329.469629] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=38 recv=0 <1>[ 329.470690] BUG: unable to handle page fault for address: ffffc9002438a188 <1>[ 329.470719] #PF: supervisor write access in kernel mode <1>[ 329.470732] #PF: error_code(0x0002) - not-present page <6>[ 329.470743] PGD 100000067 P4D 100000067 PUD 100ab3067 PMD 0 <4>[ 329.470763] Oops: Oops: 0002 [#1] SMP NOPTI <4>[ 329.470779] CPU: 5 UID: 0 PID: 6586 Comm: kworker/5:10 Tainted: G S U W 6.19.0-lgci-xe-xe-4561-eac0d1b8d00bd336f-debug+ #1 PREEMPT(voluntary) <4>[ 329.470806] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN <4>[ 329.470818] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024 <4>[ 329.470834] Workqueue: xe-destroy-wq __guc_exec_queue_destroy_async [xe] <4>[ 329.471330] RIP: 0010:xe_mmio_write32+0x58/0x280 [xe] <4>[ 329.471782] Code: 24 66 90 65 8b 05 6c 7a 2a e3 48 0f a3 05 10 95 cd e2 0f 82 ee 00 00 00 41 f7 c5 00 00 00 01 0f 84 88 00 00 00 49 03 5c 24 08 <44> 89 3b 48 8d 65 d8 5b 41 5c 41 5d 41 5e 41 5f 5d 31 c0 31 d2 31 <4>[ 329.471818] RSP: 0018:ffffc90003163830 EFLAGS: 00010086 <4>[ 329.471835] RAX: 0000000000000002 RBX: ffffc9002438a188 RCX: 0000000000000000 <4>[ 329.471853] RDX: 0000000000010001 RSI: 000000000000a188 RDI: ffff88816def81d0 <4>[ 329.471870] RBP: ffffc900031638a8 R08: 0000000000000000 R09: 0000000000000000 <4>[ 329.471886] R10: ffff888175a90000 R11: 0000000000000001 R12: ffff88816def81d0 Oops#2 Part5 <4>[ 329.471902] R13: 000000000000a188 R14: ffff888175a90000 R15: 0000000000010001 <4>[ 329.471919] FS: 0000000000000000(0000) GS:ffff8888daf5a000(0000) knlGS:0000000000000000 <4>[ 329.471940] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 329.471955] CR2: ffffc9002438a188 CR3: 0000000003448001 CR4: 0000000000f72ef0 <4>[ 329.471972] PKRU: 55555554 <4>[ 329.471982] Call Trace: <4>[ 329.471993] <4>[ 329.472011] xe_force_wake_get+0x417/0x950 [xe] <4>[ 329.472118] ? _raw_spin_unlock_irqrestore+0x27/0x80 <4>[ 329.472124] send_tlb_inval_ggtt+0xfa/0x270 [xe] <4>[ 329.472198] ? trace_hardirqs_on+0x63/0xd0 <4>[ 329.472203] ? _raw_spin_unlock_irq+0x27/0x70 <4>[ 329.472206] ? xe_tlb_inval_fence_prep+0xbf/0x1a0 [xe] <4>[ 329.472290] xe_tlb_inval_ggtt+0x73/0x250 [xe] <4>[ 329.472371] ? find_held_lock+0x31/0x90 <4>[ 329.472375] ? ggtt_node_remove+0xc4/0x140 [xe] <4>[ 329.472444] ggtt_invalidate_gt_tlb.part.0+0x1f/0xb0 [xe] <4>[ 329.472512] ggtt_node_remove+0x122/0x140 [xe] <4>[ 329.472580] xe_ggtt_node_remove+0x40/0xa0 [xe] <4>[ 329.472648] xe_ggtt_remove_bo+0x87/0x250 [xe] <4>[ 329.472716] ? _raw_write_unlock+0x22/0x50 <4>[ 329.472718] ? drm_vma_offset_remove+0x65/0x80 <4>[ 329.472723] xe_ttm_bo_destroy+0xa2/0x2d0 [xe] <4>[ 329.472789] ? lock_is_held_type+0xa3/0x130 <4>[ 329.472794] ttm_bo_release+0x70/0x330 [ttm] <4>[ 329.472801] ? xe_ggtt_might_lock+0x29/0x60 [xe] <4>[ 329.472868] ? lock_release+0xce/0x280 <4>[ 329.472872] ttm_bo_fini+0x3c/0x70 [ttm] <4>[ 329.472878] xe_gem_object_free+0x1a/0x30 [xe] <4>[ 329.472944] drm_gem_object_free+0x1d/0x40 Oops#2 Part4 <4>[ 329.472947] xe_bo_put+0x12a/0x190 [xe] <4>[ 329.473014] xe_lrc_destroy+0x47/0x60 [xe] <4>[ 329.473195] xe_exec_queue_fini+0x85/0xd0 [xe] <4>[ 329.473583] __guc_exec_queue_destroy_async+0x6c/0x170 [xe] <4>[ 329.473990] process_one_work+0x22e/0x6b0 <4>[ 329.474014] worker_thread+0x1e8/0x3d0 <4>[ 329.474030] ? __pfx_worker_thread+0x10/0x10 <4>[ 329.474046] kthread+0x11f/0x250 <4>[ 329.474064] ? __pfx_kthread+0x10/0x10 <4>[ 329.474082] ret_from_fork+0x344/0x3a0 <4>[ 329.474098] ? __pfx_kthread+0x10/0x10 <4>[ 329.474115] ret_from_fork_asm+0x1a/0x30 <4>[ 329.474138] <4>[ 329.474146] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_gsc_proxy mei_lb mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling cmdlinepart x86_pkg_temp_thermal spi_nor intel_powerclamp eeepc_wmi mtd asus_wmi hid_generic coretemp sparse_keymap platform_profile mei_pxp mei_hdcp wmi_bmof snd_hda_intel snd_intel_dspcfg snd_hda_codec kvm_intel snd_hda_core snd_hwdep kvm snd_pcm irqbypass ghash_clmulni_intel aesni_intel video usbhid r8169 snd_timer spi_intel_pci rapl hid intel_cstate binfmt_misc i2c_i801 spi_intel snd i2c_mux realtek i2c_smbus soundcore idma64 intel_pmc_core pmt_telemetry pmt_discovery nls_iso8859_1 pmt_class intel_pmc_ssram_telemetry mei_me mei wmi intel_vsec acpi_pad pinctrl_alderlake acpi_tad dm_multipath msr nvme_fabrics fuse Oops#2 Part3 <4>[ 329.474326] efi_pstore nfnetlink autofs4 <4>[ 329.474475] CR2: ffffc9002438a188 <4>[ 329.474489] ---[ end trace 0000000000000000 ]--- <4>[ 332.592483] RIP: 0010:xe_mmio_write32+0x58/0x280 [xe] <4>[ 332.592588] Code: 24 66 90 65 8b 05 6c 7a 2a e3 48 0f a3 05 10 95 cd e2 0f 82 ee 00 00 00 41 f7 c5 00 00 00 01 0f 84 88 00 00 00 49 03 5c 24 08 <44> 89 3b 48 8d 65 d8 5b 41 5c 41 5d 41 5e 41 5f 5d 31 c0 31 d2 31 <4>[ 332.592594] RSP: 0018:ffffc90003163830 EFLAGS: 00010086 <4>[ 332.592597] RAX: 0000000000000002 RBX: ffffc9002438a188 RCX: 0000000000000000 <4>[ 332.592600] RDX: 0000000000010001 RSI: 000000000000a188 RDI: ffff88816def81d0 <4>[ 332.592603] RBP: ffffc900031638a8 R08: 0000000000000000 R09: 0000000000000000 <4>[ 332.592606] R10: ffff888175a90000 R11: 0000000000000001 R12: ffff88816def81d0 <4>[ 332.592608] R13: 000000000000a188 R14: ffff888175a90000 R15: 0000000000010001 <4>[ 332.592611] FS: 0000000000000000(0000) GS:ffff8888daf5a000(0000) knlGS:0000000000000000 <4>[ 332.592615] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 332.592617] CR2: ffffc9002438a188 CR3: 0000000003448001 CR4: 0000000000f72ef0 <4>[ 332.592620] PKRU: 55555554 <6>[ 332.592622] note: kworker/5:10[6586] exited with irqs disabled <6>[ 332.592640] note: kworker/5:10[6586] exited with preempt_count 1 <3>[ 332.592700] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=51788576 recv=0 <1>[ 332.592711] BUG: unable to handle page fault for address: 00000001000a2a40 <1>[ 332.592715] #PF: supervisor read access in kernel mode <1>[ 332.592719] #PF: error_code(0x0000) - not-present page <6>[ 332.592722] PGD 0 P4D 0 Oops#2 Part2 <4>[ 332.592726] Oops: Oops: 0000 [#2] SMP NOPTI <4>[ 332.592731] CPU: 15 UID: 0 PID: 6088 Comm: kworker/u64:55 Tainted: G S UD W 6.19.0-lgci-xe-xe-4561-eac0d1b8d00bd336f-debug+ #1 PREEMPT(voluntary) <4>[ 332.592740] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [D]=DIE, [W]=WARN <4>[ 332.592744] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024 <4>[ 332.592748] Workqueue: gt-ordered-wq xe_tlb_inval_fence_timeout [xe] <4>[ 332.592891] RIP: 0010:__list_del_entry_valid_or_report+0x3b/0x120 <4>[ 332.592899] Code: 6f 08 4d 85 e4 74 50 4d 85 ed 74 5e 48 b8 00 01 00 00 00 00 ad de 49 39 c4 74 62 48 b8 22 01 00 00 00 00 ad de 49 39 c5 74 71 <49> 39 7d 00 0f 85 85 00 00 00 49 39 7c 24 08 0f 85 9f 00 00 00 b8 <4>[ 332.592908] RSP: 0018:ffffc90007b97d60 EFLAGS: 00010017 <4>[ 332.592912] RAX: dead000000000122 RBX: ffffc90003163aa8 RCX: 0000000000000000 <4>[ 332.592916] RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffffc90003163aa8 <4>[ 332.592920] RBP: ffffc90007b97d78 R08: 0000000000000000 R09: 0000000000000000 <4>[ 332.592924] R10: 0000000000000000 R11: 0000000000000000 R12: ffffc90003163f20 <4>[ 332.592928] R13: 00000001000a2a40 R14: ffffc90003163f38 R15: ffffc90003163ed8 <4>[ 332.592932] FS: 0000000000000000(0000) GS:ffff8888db45a000(0000) knlGS:0000000000000000 <4>[ 332.592937] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 332.592940] CR2: 00000001000a2a40 CR3: 0000000110b8f001 CR4: 0000000000f72ef0 <4>[ 332.592944] PKRU: 55555554 <4>[ 332.592947] Call Trace: <4>[ 332.592950] <4>[ 332.592953] ? update_stack_state+0x10e/0x1a0 <4>[ 332.592958] xe_tlb_inval_fence_signal+0x3b/0x1b0 [xe] Oops#2 Part1 <4>[ 332.593085] xe_tlb_inval_fence_timeout+0xb6/0x1d0 [xe] <4>[ 332.593211] process_one_work+0x22e/0x6b0 <4>[ 332.593218] worker_thread+0x1e8/0x3d0 <4>[ 332.593222] ? __pfx_worker_thread+0x10/0x10 <4>[ 332.593226] kthread+0x11f/0x250 <4>[ 332.593230] ? __pfx_kthread+0x10/0x10 <4>[ 332.593235] ret_from_fork+0x344/0x3a0 <4>[ 332.593240] ? __pfx_kthread+0x10/0x10 <4>[ 332.593244] ret_from_fork_asm+0x1a/0x30 <4>[ 332.593250] <4>[ 332.593252] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_gsc_proxy mei_lb mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling cmdlinepart x86_pkg_temp_thermal spi_nor intel_powerclamp eeepc_wmi mtd asus_wmi hid_generic coretemp sparse_keymap platform_profile mei_pxp mei_hdcp wmi_bmof snd_hda_intel snd_intel_dspcfg snd_hda_codec kvm_intel snd_hda_core snd_hwdep kvm snd_pcm irqbypass ghash_clmulni_intel aesni_intel video usbhid r8169 snd_timer spi_intel_pci rapl hid intel_cstate binfmt_misc i2c_i801 spi_intel snd i2c_mux realtek i2c_smbus soundcore idma64 intel_pmc_core pmt_telemetry pmt_discovery nls_iso8859_1 pmt_class intel_pmc_ssram_telemetry mei_me mei wmi intel_vsec acpi_pad pinctrl_alderlake acpi_tad dm_multipath msr nvme_fabrics fuse <4>[ 332.593297] efi_pstore nfnetlink autofs4 <4>[ 332.593332] CR2: 00000001000a2a40 <4>[ 332.593336] ---[ end trace 0000000000000000 ]---