Oops#2 Part16 <7>[ 288.219058] xe 0000:03:00.0: [drm:guc_print_params [xe]] Tile0: GT0: GuC param[ 9] = 0x00000000 <7>[ 288.219112] xe 0000:03:00.0: [drm:guc_print_params [xe]] Tile0: GT0: GuC param[10] = 0x00000000 <7>[ 288.219166] xe 0000:03:00.0: [drm:guc_print_params [xe]] Tile0: GT0: GuC param[11] = 0x00000000 <7>[ 288.219220] xe 0000:03:00.0: [drm:guc_print_params [xe]] Tile0: GT0: GuC param[12] = 0x00000000 <7>[ 288.219274] xe 0000:03:00.0: [drm:guc_print_params [xe]] Tile0: GT0: GuC param[13] = 0x00000000 <7>[ 288.219511] xe 0000:03:00.0: [drm:xe_guc_id_mgr_init [xe]] Tile0: GT0: using 65535 GuC IDs <7>[ 288.219606] xe 0000:03:00.0: [drm:xe_guc_db_mgr_init [xe]] Tile0: GT0: using 256 doorbells <7>[ 288.220743] xe 0000:03:00.0: [drm:guc_buf_cache_init [xe]] Tile0: GT0: reusable buffer with 2097152 dwords at 0x627000 for xe_guc_buf_cache_init_with_size [xe] <7>[ 288.221506] xe 0000:03:00.0: [drm:xe_migrate_init [xe]] Migrate min chunk size is 0x00010000 <7>[ 288.222496] xe 0000:03:00.0: [drm:xe_guc_capture_steered_list_init [xe]] Tile0: GT0: capture found 120 ext-regs. <7>[ 288.244247] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 45056) <7>[ 288.254966] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 10ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034 <7>[ 288.255258] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled <7>[ 288.255908] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC rcs0 WA job: 4138 dwords Oops#2 Part15 <7>[ 288.255989] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x5588] = 0x04000400 <7>[ 288.256058] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6204] = 0x01400140 <7>[ 288.256121] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6208] = 0x00200020 <7>[ 288.256188] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x62a8] = 0x02400240 <7>[ 288.256251] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x7010] = 0x40004000 <7>[ 288.256314] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x7300] = 0x10001000 <7>[ 288.256380] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x83a8] = 0x20002000 <7>[ 288.256451] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6210] = ~0x3f18000|0x3f18000 <7>[ 288.258210] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC bcs0 WA job: 27 dwords <7>[ 288.258306] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x22204] = ~0x7e7e|0x606 <7>[ 288.258390] xe 0000:03:00.0: [drm:xe_lrc_emit_hwe_state_instructions [xe]] Tile0: GT0: No non-register state to emit on graphics ver 20.01 <7>[ 288.260220] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC ccs0 WA job: 0 dwords <7>[ 288.260289] xe 0000:03:00.0: [drm:xe_lrc_emit_hwe_state_instructions [xe]] Tile0: GT0: No non-register state to emit on graphics ver 20.01 <5>[ 288.261981] FAULT_INJECTION: forcing a failure. <5>[ 288.261981] name fail_function, interval 0, probability 100, space 1, times 100 Oops#2 Part14 <3>[ 288.262006] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: GuC PC query task state failed: -ENOMEM <4>[ 288.262141] ------------[ cut here ]------------ <4>[ 288.262143] xe 0000:03:00.0: [drm] Assertion `ct->g2h_outstanding == 0 || state == XE_GUC_CT_STATE_STOPPED` failed! <4>[ 288.262143] platform: BATTLEMAGE subplatform: 7 <4>[ 288.262143] graphics: Xe2_HPG 20.01 step A0 <4>[ 288.262143] media: Xe2_HPM 13.01 step A1 <4>[ 288.262143] tile: 0 VRAM 12.0 GiB <4>[ 288.262143] GT: 0 type 1 <4>[ 288.262146] WARNING: drivers/gpu/drm/xe/xe_guc_ct.c:526 at guc_ct_change_state+0x279/0x350 [xe], CPU#5: xe_fault_inject/4854 <4>[ 288.262212] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_gsc_proxy mei_lb mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_buddy drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp coretemp hid_generic cmdlinepart asus_nb_wmi asus_wmi spi_nor mei_hdcp sparse_keymap mei_pxp platform_profile mtd kvm_intel wmi_bmof kvm snd_hda_intel irqbypass usbhid ghash_clmulni_intel snd_intel_dspcfg hid aesni_intel snd_hda_codec r8169 video rapl snd_hda_core snd_hwdep intel_cstate binfmt_misc snd_pcm realtek snd_timer i2c_i801 snd idma64 spi_intel_pci mei_me i2c_mux spi_intel intel_pmc_core soundcore i2c_smbus mei pmt_telemetry pmt_discovery nls_iso8859_1 pmt_class intel_pmc_ssram_telemetry pinctrl_alderlake acpi_pad acpi_tad intel_vsec wmi dm_multipath msr nvme_fabrics fuse Oops#2 Part13 <4>[ 288.262265] efi_pstore nfnetlink autofs4 <4>[ 288.262269] CPU: 5 UID: 0 PID: 4854 Comm: xe_fault_inject Tainted: G S U W 6.19.0-rc5-lgci-xe-xe-4392-6226d2d655d2d5a08+ #1 PREEMPT(voluntary) <4>[ 288.262272] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN <4>[ 288.262273] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024 <4>[ 288.262274] RIP: 0010:guc_ct_change_state+0x2ed/0x350 [xe] <4>[ 288.262366] Code: 1f 85 eb 51 48 c1 ea 25 44 6b ca 64 44 29 c9 51 48 c7 c1 98 2c 18 a1 52 ff 75 b0 44 8b 4d 94 4c 8b 45 88 48 8b 95 78 ff ff ff <67> 48 0f b9 3a 8b 8b 48 01 00 00 48 83 c4 60 85 c9 75 13 44 89 bb Oops#2 Part12 <4>[ 288.262368] RSP: 0018:ffffc90004523570 EFLAGS: 00010002 <4>[ 288.262371] RAX: ffffffffa11f66c7 RBX: ffff888171e888a0 RCX: ffffffffa1182c98 <4>[ 288.262372] RDX: ffff8881040e0990 RSI: ffffffffa11f66c7 RDI: ffffffffa1002f50 <4>[ 288.262373] RBP: ffffc90004523658 R08: ffffffffa11f670c R09: 0000000000000007 <4>[ 288.262375] R10: 0000000000000001 R11: 0000000000000514 R12: ffff888171e888a8 <4>[ 288.262376] R13: ffff888171e88938 R14: 0000000000000515 R15: 0000000000000001 <4>[ 288.262377] FS: 000078a954000940(0000) GS:ffff8888daf5d000(0000) knlGS:0000000000000000 <4>[ 288.262379] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 288.262380] CR2: 00005d7f7385fb10 CR3: 0000000167839003 CR4: 0000000000f72ef0 <4>[ 288.262381] PKRU: 55555554 <4>[ 288.262382] Call Trace: <4>[ 288.262383] <4>[ 288.262392] ? xe_guc_submit_enable+0xa8/0xf0 [xe] <4>[ 288.262476] xe_guc_ct_disable+0x17/0x80 [xe] <4>[ 288.262544] xe_guc_sanitize+0x2a/0x50 [xe] <4>[ 288.262622] xe_uc_load_hw+0x187/0x2a0 [xe] <4>[ 288.262722] ? xe_migrate_init+0x277/0x2d0 [xe] <4>[ 288.262809] xe_gt_init+0x363/0xab0 [xe] <4>[ 288.262884] ? trace_hardirqs_on+0x63/0xd0 <4>[ 288.262888] ? _raw_spin_unlock_irqrestore+0x51/0x80 <4>[ 288.262891] ? __devm_add_action+0x70/0xa0 <4>[ 288.262896] ? xe_irq_install+0x11a/0x490 [xe] <4>[ 288.262981] xe_device_probe+0x3cc/0xc10 [xe] <4>[ 288.263055] ? __drm_dev_dbg+0x7d/0xb0 <4>[ 288.263059] ? __drmm_add_action_or_reset+0x1e/0x50 <4>[ 288.263065] xe_pci_probe+0x396/0x610 [xe] <4>[ 288.263157] local_pci_probe+0x47/0xb0 Oops#2 Part11 <4>[ 288.263161] pci_device_probe+0xf3/0x260 <4>[ 288.263167] really_probe+0xf1/0x410 <4>[ 288.263171] __driver_probe_device+0x8c/0x190 <4>[ 288.263174] device_driver_attach+0x57/0xd0 <4>[ 288.263178] bind_store+0x77/0xd0 <4>[ 288.263182] drv_attr_store+0x24/0x50 <4>[ 288.263184] sysfs_kf_write+0x4d/0x80 <4>[ 288.263189] kernfs_fop_write_iter+0x188/0x240 <4>[ 288.263193] vfs_write+0x283/0x540 <4>[ 288.263201] ksys_write+0x6f/0xf0 <4>[ 288.263205] __x64_sys_write+0x19/0x30 <4>[ 288.263207] x64_sys_call+0x79/0x26b0 <4>[ 288.263210] do_syscall_64+0x93/0x1470 <4>[ 288.263212] ? __x64_sys_openat+0x54/0xa0 <4>[ 288.263217] ? do_syscall_64+0x1e4/0x1470 <4>[ 288.263221] ? __x64_sys_openat+0x54/0xa0 <4>[ 288.263225] ? do_syscall_64+0x1e4/0x1470 <4>[ 288.263227] ? call_rcu+0x34/0x50 <4>[ 288.263231] ? __delete_object+0x60/0xa0 <4>[ 288.263236] ? kmem_cache_free+0x49f/0x5c0 <4>[ 288.263240] ? __fput+0x1bf/0x2f0 <4>[ 288.263244] ? __fput+0x1bf/0x2f0 <4>[ 288.263246] ? __fput+0x1bf/0x2f0 <4>[ 288.263250] ? fput_close_sync+0x3d/0xa0 <4>[ 288.263253] ? __x64_sys_close+0x3e/0x90 <4>[ 288.263256] ? do_syscall_64+0x1e4/0x1470 <4>[ 288.263259] ? do_syscall_64+0x1e4/0x1470 <4>[ 288.263263] ? do_syscall_64+0x1e4/0x1470 <4>[ 288.263265] ? do_syscall_64+0x1e4/0x1470 <4>[ 288.263266] ? exc_page_fault+0xbb/0x250 <4>[ 288.263269] entry_SYSCALL_64_after_hwframe+0x76/0x7e <4>[ 288.263271] RIP: 0033:0x78a95611c5a4 <4>[ 288.263274] Code: c7 00 16 00 00 00 b8 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 f3 0f 1e fa 80 3d a5 ea 0e 00 00 74 13 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 54 c3 0f 1f 00 55 48 89 e5 48 83 ec 20 48 89 Oops#2 Part10 <4>[ 288.263276] RSP: 002b:00007fff22850178 EFLAGS: 00000202 ORIG_RAX: 0000000000000001 <4>[ 288.263278] RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 000078a95611c5a4 <4>[ 288.263279] RDX: 000000000000000c RSI: 00007fff22851630 RDI: 000000000000000b <4>[ 288.263280] RBP: 000000000000000c R08: 0000000000000073 R09: 0000000000000000 <4>[ 288.263281] R10: 0000000000000000 R11: 0000000000000202 R12: 00007fff22851630 <4>[ 288.263282] R13: 000000000000000b R14: 0000581fb8d3f35b R15: 00007fff228512e0 <4>[ 288.263290] <4>[ 288.263291] irq event stamp: 762234 <4>[ 288.263292] hardirqs last enabled at (762233): [] _raw_spin_unlock_irqrestore+0x51/0x80 <4>[ 288.263294] hardirqs last disabled at (762234): [] _raw_spin_lock_irq+0x6f/0x80 <4>[ 288.263296] softirqs last enabled at (761410): [] __irq_exit_rcu+0x13f/0x160 <4>[ 288.263299] softirqs last disabled at (761405): [] __irq_exit_rcu+0x13f/0x160 <4>[ 288.263301] ---[ end trace 0000000000000000 ]--- <7>[ 288.263303] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled <3>[ 288.263401] xe 0000:03:00.0: probe with driver xe failed with error -12 <3>[ 288.263975] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: GuC RC enable mode=0 failed: -ENODEV <7>[ 288.264339] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled <7>[ 288.265438] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled <7>[ 288.386606] xe 0000:03:00.0: [drm:drm_pagemap_cache_fini [drm_gpusvm_helper]] Destroying dpagemap cache. Oops#2 Part9 <7>[ 288.390309] xe 0000:03:00.0: [drm:drm_pagemap_shrinker_fini [drm_gpusvm_helper]] Destroying dpagemap shrinker. <3>[ 292.675086] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=39 recv=0 <1>[ 292.676990] BUG: unable to handle page fault for address: ffffc9000a38a188 <1>[ 292.677029] #PF: supervisor write access in kernel mode <1>[ 292.677053] #PF: error_code(0x0002) - not-present page <6>[ 292.677076] PGD 100000067 P4D 100000067 PUD 100abc067 PMD 0 <4>[ 292.677115] Oops: Oops: 0002 [#1] SMP NOPTI <4>[ 292.677143] CPU: 6 UID: 0 PID: 37 Comm: kworker/6:0 Tainted: G S U W 6.19.0-rc5-lgci-xe-xe-4392-6226d2d655d2d5a08+ #1 PREEMPT(voluntary) <4>[ 292.677191] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN <4>[ 292.677210] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024 <4>[ 292.677238] Workqueue: xe-destroy-wq __guc_exec_queue_destroy_async [xe] <4>[ 292.677877] RIP: 0010:xe_mmio_write32+0x58/0x280 [xe] <4>[ 292.678611] Code: 24 66 90 65 8b 05 bc 6e 2a e3 48 0f a3 05 60 a3 cd e2 0f 82 ee 00 00 00 41 f7 c5 00 00 00 01 0f 84 88 00 00 00 49 03 5c 24 08 <44> 89 3b 48 8d 65 d8 5b 41 5c 41 5d 41 5e 41 5f 5d 31 c0 31 d2 31 <4>[ 292.678672] RSP: 0018:ffffc9000028f830 EFLAGS: 00010086 <4>[ 292.678700] RAX: 0000000000000002 RBX: ffffc9000a38a188 RCX: 0000000000000000 <4>[ 292.678729] RDX: 0000000000010001 RSI: 000000000000a188 RDI: ffff8881662101c8 <4>[ 292.678756] RBP: ffffc9000028f8a8 R08: 0000000000000000 R09: 0000000000000000 <4>[ 292.678784] R10: ffff88816df20000 R11: 0000000000000001 R12: ffff8881662101c8 Oops#2 Part8 <4>[ 292.678810] R13: 000000000000a188 R14: ffff88816df20000 R15: 0000000000010001 <4>[ 292.678838] FS: 0000000000000000(0000) GS:ffff8888dafdd000(0000) knlGS:0000000000000000 <4>[ 292.678870] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 292.678894] CR2: ffffc9000a38a188 CR3: 0000000003448006 CR4: 0000000000f72ef0 <4>[ 292.678922] PKRU: 55555554 <4>[ 292.678938] Call Trace: <4>[ 292.678953] <4>[ 292.678983] xe_force_wake_get+0x417/0x950 [xe] <4>[ 292.679674] ? _raw_spin_unlock_irqrestore+0x27/0x80 <4>[ 292.679720] send_tlb_inval_ggtt+0xfa/0x270 [xe] <4>[ 292.680428] ? trace_hardirqs_on+0x63/0xd0 <4>[ 292.680457] ? _raw_spin_unlock_irq+0x27/0x70 <4>[ 292.680483] ? xe_tlb_inval_fence_prep+0xbf/0x1a0 [xe] <4>[ 292.681245] xe_tlb_inval_ggtt+0x73/0x250 [xe] <4>[ 292.681989] ? find_held_lock+0x31/0x90 <4>[ 292.682015] ? ggtt_node_remove+0xc4/0x140 [xe] <4>[ 292.682224] ggtt_invalidate_gt_tlb.part.0+0x1f/0xb0 [xe] <4>[ 292.682344] ggtt_node_remove+0x122/0x140 [xe] <4>[ 292.682464] xe_ggtt_node_remove+0x40/0xa0 [xe] <4>[ 292.682584] xe_ggtt_remove_bo+0x87/0x250 [xe] <4>[ 292.682716] ? _raw_write_unlock+0x22/0x50 <4>[ 292.682721] ? drm_vma_offset_remove+0x65/0x80 <4>[ 292.682727] xe_ttm_bo_destroy+0xa2/0x2d0 [xe] <4>[ 292.682840] ? lock_is_held_type+0xa3/0x130 <4>[ 292.682844] ttm_bo_release+0x70/0x330 [ttm] <4>[ 292.682851] ? xe_ggtt_might_lock+0x29/0x60 [xe] <4>[ 292.682918] ? lock_release+0xce/0x280 <4>[ 292.682922] ttm_bo_fini+0x3c/0x70 [ttm] <4>[ 292.682927] xe_gem_object_free+0x1a/0x30 [xe] <4>[ 292.682994] drm_gem_object_free+0x1d/0x40 Oops#2 Part7 <4>[ 292.682996] xe_bo_put+0x12a/0x190 [xe] <4>[ 292.683063] xe_lrc_destroy+0x47/0x60 [xe] <4>[ 292.683138] xe_exec_queue_fini+0x85/0xd0 [xe] <4>[ 292.683206] __guc_exec_queue_destroy_async+0x6c/0x170 [xe] <4>[ 292.683277] process_one_work+0x22e/0x6b0 <4>[ 292.683281] worker_thread+0x1e8/0x3d0 <4>[ 292.683284] ? __pfx_worker_thread+0x10/0x10 <4>[ 292.683286] kthread+0x11f/0x250 <4>[ 292.683289] ? __pfx_kthread+0x10/0x10 <4>[ 292.683293] ret_from_fork+0x344/0x3a0 <4>[ 292.683295] ? __pfx_kthread+0x10/0x10 <4>[ 292.683298] ret_from_fork_asm+0x1a/0x30 <4>[ 292.683304] <4>[ 292.683305] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_gsc_proxy mei_lb mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_buddy drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp coretemp hid_generic cmdlinepart asus_nb_wmi asus_wmi spi_nor mei_hdcp sparse_keymap mei_pxp platform_profile mtd kvm_intel wmi_bmof kvm snd_hda_intel irqbypass usbhid ghash_clmulni_intel snd_intel_dspcfg hid aesni_intel snd_hda_codec r8169 video rapl snd_hda_core snd_hwdep intel_cstate binfmt_misc snd_pcm realtek snd_timer i2c_i801 snd idma64 spi_intel_pci mei_me i2c_mux spi_intel intel_pmc_core soundcore i2c_smbus mei pmt_telemetry pmt_discovery nls_iso8859_1 pmt_class intel_pmc_ssram_telemetry pinctrl_alderlake acpi_pad acpi_tad intel_vsec wmi dm_multipath msr nvme_fabrics fuse Oops#2 Part6 <4>[ 292.683343] efi_pstore nfnetlink autofs4 <4>[ 292.683369] CR2: ffffc9000a38a188 <4>[ 292.683372] ---[ end trace 0000000000000000 ]--- <4>[ 292.824480] RIP: 0010:xe_mmio_write32+0x58/0x280 [xe] <4>[ 292.824578] Code: 24 66 90 65 8b 05 bc 6e 2a e3 48 0f a3 05 60 a3 cd e2 0f 82 ee 00 00 00 41 f7 c5 00 00 00 01 0f 84 88 00 00 00 49 03 5c 24 08 <44> 89 3b 48 8d 65 d8 5b 41 5c 41 5d 41 5e 41 5f 5d 31 c0 31 d2 31 <4>[ 292.824585] RSP: 0018:ffffc9000028f830 EFLAGS: 00010086 <4>[ 292.824589] RAX: 0000000000000002 RBX: ffffc9000a38a188 RCX: 0000000000000000 <4>[ 292.824592] RDX: 0000000000010001 RSI: 000000000000a188 RDI: ffff8881662101c8 <4>[ 292.824595] RBP: ffffc9000028f8a8 R08: 0000000000000000 R09: 0000000000000000 <4>[ 292.824598] R10: ffff88816df20000 R11: 0000000000000001 R12: ffff8881662101c8 <4>[ 292.824601] R13: 000000000000a188 R14: ffff88816df20000 R15: 0000000000010001 <4>[ 292.824605] FS: 0000000000000000(0000) GS:ffff8888dafdd000(0000) knlGS:0000000000000000 <4>[ 292.824608] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 292.824611] CR2: ffffc9000a38a188 CR3: 0000000003448006 CR4: 0000000000f72ef0 Oops#2 Part5 <4>[ 292.824614] PKRU: 55555554 <6>[ 292.824616] note: kworker/6:0[37] exited with irqs disabled <6>[ 292.824630] note: kworker/6:0[37] exited with preempt_count 1 <3>[ 294.978799] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=0 recv=0 <4>[ 294.978839] non-slab/vmalloc memory <4>[ 294.978852] ------------[ cut here ]------------ <4>[ 294.978862] list_del corruption. prev->next should be ffffc9000028faa8, but was 850fc084a0558b44. (prev=ffffffff81484dbd) <4>[ 294.978880] WARNING: lib/list_debug.c:62 at __list_del_entry_valid_or_report+0xd9/0x120, CPU#10: kworker/u64:61/3671 <4>[ 294.978913] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_gsc_proxy mei_lb mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_buddy drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp coretemp hid_generic cmdlinepart asus_nb_wmi asus_wmi spi_nor mei_hdcp sparse_keymap mei_pxp platform_profile mtd kvm_intel wmi_bmof kvm snd_hda_intel irqbypass usbhid ghash_clmulni_intel snd_intel_dspcfg hid aesni_intel snd_hda_codec r8169 video rapl snd_hda_core snd_hwdep intel_cstate binfmt_misc snd_pcm realtek snd_timer i2c_i801 snd idma64 spi_intel_pci mei_me i2c_mux spi_intel intel_pmc_core soundcore i2c_smbus mei pmt_telemetry pmt_discovery nls_iso8859_1 pmt_class intel_pmc_ssram_telemetry pinctrl_alderlake acpi_pad acpi_tad intel_vsec wmi dm_multipath msr nvme_fabrics fuse Oops#2 Part4 <4>[ 294.979072] efi_pstore nfnetlink autofs4 <4>[ 294.979184] CPU: 10 UID: 0 PID: 3671 Comm: kworker/u64:61 Tainted: G S UD W 6.19.0-rc5-lgci-xe-xe-4392-6226d2d655d2d5a08+ #1 PREEMPT(voluntary) <4>[ 294.979213] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [D]=DIE, [W]=WARN <4>[ 294.979224] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024 <4>[ 294.979239] Workqueue: gt-ordered-wq xe_tlb_inval_fence_timeout [xe] <4>[ 294.979784] RIP: 0010:__list_del_entry_valid_or_report+0xe3/0x120 <4>[ 294.979809] Code: b5 01 4c 89 ea 48 89 de 67 48 0f b9 3a 31 c0 eb 8b 4c 89 ef e8 4e 1b 91 ff 48 8d 3d 17 7a b5 01 49 8b 55 00 4c 89 e9 48 89 de <67> 48 0f b9 3a 31 c0 e9 66 ff ff ff 4c 89 e7 e8 29 1b 91 ff 48 8d <4>[ 294.979844] RSP: 0018:ffffc9000400bd68 EFLAGS: 00010046 <4>[ 294.979860] RAX: 0000000000000000 RBX: ffffc9000028faa8 RCX: ffffffff81484dbd Oops#2 Part3 <4>[ 294.979877] RDX: 850fc084a0558b44 RSI: ffffc9000028faa8 RDI: ffffffff839ab720 <4>[ 294.979892] RBP: ffffc9000400bd80 R08: 0000000000000000 R09: 0000000000000000 <4>[ 294.979908] R10: 0000000000000000 R11: 0000000000000000 R12: ffffc9000028ff20 <4>[ 294.979923] R13: ffffffff81484dbd R14: ffffc9000028ff38 R15: ffffc9000028fed8 <4>[ 294.979939] FS: 0000000000000000(0000) GS:ffff8888db1dd000(0000) knlGS:0000000000000000 <4>[ 294.979958] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 294.979972] CR2: 00005d7f739a6be0 CR3: 0000000003448006 CR4: 0000000000f72ef0 <4>[ 294.979988] PKRU: 55555554 <4>[ 294.979998] Call Trace: <4>[ 294.980007] <4>[ 294.980021] xe_tlb_inval_fence_signal+0x35/0x190 [xe] <4>[ 294.980534] xe_tlb_inval_fence_timeout+0xb6/0x1d0 [xe] <4>[ 294.981028] process_one_work+0x22e/0x6b0 <4>[ 294.981054] worker_thread+0x1e8/0x3d0 <4>[ 294.981068] ? __pfx_worker_thread+0x10/0x10 <4>[ 294.981084] kthread+0x11f/0x250 <4>[ 294.981103] ? __pfx_kthread+0x10/0x10 <4>[ 294.981121] ret_from_fork+0x344/0x3a0 <4>[ 294.981136] ? __pfx_kthread+0x10/0x10 <4>[ 294.981153] ret_from_fork_asm+0x1a/0x30 <4>[ 294.981182] <4>[ 294.981190] irq event stamp: 96914 <4>[ 294.981200] hardirqs last enabled at (96913): [] _raw_spin_unlock_irq+0x27/0x70 <4>[ 294.981225] hardirqs last disabled at (96914): [] __schedule+0x121e/0x1d40 <4>[ 294.981248] softirqs last enabled at (96910): [] __irq_exit_rcu+0x13f/0x160 <4>[ 294.981274] softirqs last disabled at (96905): [] __irq_exit_rcu+0x13f/0x160 Oops#2 Part2 <4>[ 294.981297] ---[ end trace 0000000000000000 ]--- <4>[ 294.981326] Oops: general protection fault, probably for non-canonical address 0x468949677481674d: 0000 [#2] SMP NOPTI <4>[ 294.981352] CPU: 10 UID: 0 PID: 3671 Comm: kworker/u64:61 Tainted: G S UD W 6.19.0-rc5-lgci-xe-xe-4392-6226d2d655d2d5a08+ #1 PREEMPT(voluntary) <4>[ 294.981383] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [D]=DIE, [W]=WARN <4>[ 294.981397] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024 <4>[ 294.981414] Workqueue: gt-ordered-wq xe_tlb_inval_fence_timeout [xe] <4>[ 294.981892] RIP: 0010:xe_pm_runtime_put+0x30/0x110 [xe] <4>[ 294.982364] Code: 48 89 e5 41 54 53 48 89 fb 48 8b 55 08 66 90 65 8b 05 88 25 29 e3 48 0f a3 05 2c 5a cc e2 0f 82 94 00 00 00 f0 83 44 24 fc 00 <48> 8b 83 a0 2f 00 00 65 48 39 05 49 25 29 e3 0f 84 97 00 00 00 4c <4>[ 294.982398] RSP: 0018:ffffc9000400bd70 EFLAGS: 00010086 <4>[ 294.982413] RAX: 0000000000000001 RBX: 46894967748137ad RCX: 0000000000000000 <4>[ 294.982428] RDX: ffffffffa0cc2ad3 RSI: 46894967748137ad RDI: 46894967748137ad <4>[ 294.982443] RBP: ffffc9000400bd80 R08: 0000000000000000 R09: 0000000000000000 <4>[ 294.982458] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000 <4>[ 294.982473] R13: ffff8881662106c8 R14: ffffc9000028ff38 R15: ffffc9000028fed8 <4>[ 294.982488] FS: 0000000000000000(0000) GS:ffff8888db1dd000(0000) knlGS:0000000000000000 <4>[ 294.982507] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 294.982521] CR2: 00005d7f739a6be0 CR3: 0000000003448006 CR4: 0000000000f72ef0 <4>[ 294.982537] PKRU: 55555554 <4>[ 294.982546] Call Trace: Oops#2 Part1 <4>[ 294.982554] <4>[ 294.982564] xe_tlb_inval_fence_signal+0x93/0x190 [xe] <4>[ 294.983036] xe_tlb_inval_fence_timeout+0xb6/0x1d0 [xe] <4>[ 294.983460] process_one_work+0x22e/0x6b0 <4>[ 294.983464] worker_thread+0x1e8/0x3d0 <4>[ 294.983466] ? __pfx_worker_thread+0x10/0x10 <4>[ 294.983469] kthread+0x11f/0x250 <4>[ 294.983472] ? __pfx_kthread+0x10/0x10 <4>[ 294.983475] ret_from_fork+0x344/0x3a0 <4>[ 294.983477] ? __pfx_kthread+0x10/0x10 <4>[ 294.983480] ret_from_fork_asm+0x1a/0x30 <4>[ 294.983485] <4>[ 294.983486] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_gsc_proxy mei_lb mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_buddy drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp coretemp hid_generic cmdlinepart asus_nb_wmi asus_wmi spi_nor mei_hdcp sparse_keymap mei_pxp platform_profile mtd kvm_intel wmi_bmof kvm snd_hda_intel irqbypass usbhid ghash_clmulni_intel snd_intel_dspcfg hid aesni_intel snd_hda_codec r8169 video rapl snd_hda_core snd_hwdep intel_cstate binfmt_misc snd_pcm realtek snd_timer i2c_i801 snd idma64 spi_intel_pci mei_me i2c_mux spi_intel intel_pmc_core soundcore i2c_smbus mei pmt_telemetry pmt_discovery nls_iso8859_1 pmt_class intel_pmc_ssram_telemetry pinctrl_alderlake acpi_pad acpi_tad intel_vsec wmi dm_multipath msr nvme_fabrics fuse <4>[ 294.983517] efi_pstore nfnetlink autofs4 <4>[ 294.983543] ---[ end trace 0000000000000000 ]---