Oops#2 Part15 <7>[ 201.756720] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x7300] = 0x10001000 <7>[ 201.756776] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x83a8] = 0x20002000 <7>[ 201.756836] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6210] = ~0x3f18000|0x3f18000 (MCR) <7>[ 201.758680] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC bcs0 WA job: 27 dwords <7>[ 201.758746] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x22204] = ~0x7e7e|0x606 <7>[ 201.758805] xe 0000:03:00.0: [drm:xe_lrc_emit_hwe_state_instructions [xe]] Tile0: GT0: No non-register state to emit on graphics ver 20.01 <7>[ 201.760705] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC ccs0 WA job: 0 dwords <7>[ 201.760771] xe 0000:03:00.0: [drm:xe_lrc_emit_hwe_state_instructions [xe]] Tile0: GT0: No non-register state to emit on graphics ver 20.01 <5>[ 201.762368] FAULT_INJECTION: forcing a failure. <5>[ 201.762368] name fail_function, interval 0, probability 100, space 1, times 100 <3>[ 201.762421] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: GuC PC query task state failed: -ENOMEM <4>[ 201.762704] ------------[ cut here ]------------ <4>[ 201.762706] xe 0000:03:00.0: [drm] Assertion `ct->g2h_outstanding == 0 || state == XE_GUC_CT_STATE_STOPPED` failed! <4>[ 201.762706] platform: BATTLEMAGE subplatform: 7 <4>[ 201.762706] graphics: Xe2_HPG 20.01 step A0 <4>[ 201.762706] media: Xe2_HPM 13.01 step A1 <4>[ 201.762706] tile: 0 VRAM 12.0 GiB <4>[ 201.762706] GT: 0 type 1 Oops#2 Part14 <4>[ 201.762709] WARNING: drivers/gpu/drm/xe/xe_guc_ct.c:541 at guc_ct_change_state+0x264/0x330 [xe], CPU#1: xe_fault_inject/5505 <4>[ 201.762778] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp hid_generic cmdlinepart coretemp eeepc_wmi spi_nor mei_pxp mei_hdcp asus_wmi mtd sparse_keymap platform_profile wmi_bmof kvm_intel binfmt_misc usbhid hid kvm irqbypass snd_intel_dspcfg ghash_clmulni_intel aesni_intel rapl snd_hda_codec snd_hda_core r8169 snd_hwdep intel_cstate snd_pcm video realtek snd_timer i2c_i801 idma64 mei_me snd i2c_mux spi_intel_pci nls_iso8859_1 soundcore spi_intel i2c_smbus mei intel_pmc_core pmt_telemetry pmt_discovery pmt_class intel_pmc_ssram_telemetry pinctrl_alderlake intel_vsec wmi acpi_tad acpi_pad dm_multipath msr nvme_fabrics fuse efi_pstore <4>[ 201.762832] nfnetlink autofs4 [last unloaded: snd_hda_intel] <4>[ 201.762836] CPU: 1 UID: 0 PID: 5505 Comm: xe_fault_inject Tainted: G S U W 7.0.0-rc3-lgci-xe-xe-4716-d1b3b2cab4528a3ba-debug+ #1 PREEMPT(lazy) <4>[ 201.762839] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN Oops#2 Part13 <4>[ 201.762840] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1656 04/18/2024 <4>[ 201.762841] RIP: 0010:guc_ct_change_state+0x2d8/0x330 [xe] <4>[ 201.762905] Code: 51 48 c1 ea 25 44 6b ca 64 44 29 c9 51 48 c7 c1 68 80 18 a1 52 4c 8b 55 88 41 52 44 8b 4d 9c 4c 8b 45 90 48 8b 95 78 ff ff ff <67> 48 0f b9 3a 8b 8b 50 01 00 00 48 83 c4 60 85 c9 75 13 44 89 bb <4>[ 201.762907] RSP: 0018:ffffc9000e89f408 EFLAGS: 00010002 <4>[ 201.762909] RAX: ffffffffa11fd91f RBX: ffff88811cb28738 RCX: ffffffffa1188068 <4>[ 201.762910] RDX: ffff8881045f2690 RSI: ffffffffa11fd91f RDI: ffffffffa1002f20 <4>[ 201.762911] RBP: ffffc9000e89f4f0 R08: ffffffffa11fd96f R09: 0000000000000007 <4>[ 201.762912] R10: ffffffffa11fda20 R11: 0000000000000514 R12: ffff88811cb287c8 <4>[ 201.762913] R13: 0000000000000001 R14: 0000000000000000 R15: 0000000000000001 <4>[ 201.762914] FS: 00006fff0be48980(0000) GS:ffff8888dad1b000(0000) knlGS:0000000000000000 <4>[ 201.762916] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 201.762917] CR2: 00005eddfbd06cd8 CR3: 00000001583d3001 CR4: 0000000000f72ef0 <4>[ 201.762918] PKRU: 55555554 <4>[ 201.762919] Call Trace: <4>[ 201.762920] <4>[ 201.762927] ? xe_guc_submit_enable+0xa8/0xf0 [xe] <4>[ 201.762997] xe_guc_ct_disable+0x17/0x80 [xe] <4>[ 201.763060] xe_guc_sanitize+0x2a/0x50 [xe] <4>[ 201.763120] xe_uc_load_hw+0x19a/0x2b0 [xe] <4>[ 201.763211] ? xe_migrate_init+0x277/0x2d0 [xe] <4>[ 201.763295] xe_gt_init+0x3ae/0xdd0 [xe] <4>[ 201.763364] ? _raw_spin_unlock_irqrestore+0x51/0x80 <4>[ 201.763369] ? __devm_add_action+0x70/0xa0 <4>[ 201.763373] ? xe_irq_install+0x11a/0x490 [xe] Oops#2 Part12 <4>[ 201.763450] xe_device_probe+0x32c/0xbe0 [xe] <4>[ 201.763518] ? __drm_dev_dbg+0x7d/0xb0 <4>[ 201.763522] ? __drmm_add_action_or_reset+0x1e/0x50 <4>[ 201.763528] xe_pci_probe+0x39b/0x620 [xe] <4>[ 201.763603] ? trace_hardirqs_on+0x22/0x100 <4>[ 201.763610] local_pci_probe+0x47/0xb0 <4>[ 201.763615] pci_call_probe+0x6c/0x360 <4>[ 201.763620] ? _raw_spin_unlock+0x22/0x50 <4>[ 201.763624] pci_device_probe+0xae/0x110 <4>[ 201.763628] really_probe+0xf1/0x410 <4>[ 201.763631] __driver_probe_device+0x8c/0x190 <4>[ 201.763634] device_driver_attach+0x57/0xd0 <4>[ 201.763637] bind_store+0x77/0xd0 <4>[ 201.763641] drv_attr_store+0x24/0x50 <4>[ 201.763643] sysfs_kf_write+0x4d/0x80 <4>[ 201.763647] kernfs_fop_write_iter+0x188/0x240 <4>[ 201.763651] vfs_write+0x283/0x540 <4>[ 201.763659] ksys_write+0x6f/0xf0 <4>[ 201.763662] __x64_sys_write+0x19/0x30 <4>[ 201.763664] x64_sys_call+0x259/0x26e0 <4>[ 201.763667] do_syscall_64+0xdd/0x1470 <4>[ 201.763670] ? do_sys_openat2+0x85/0xd0 <4>[ 201.763674] ? __x64_sys_openat+0x54/0xa0 <4>[ 201.763677] ? trace_hardirqs_on_prepare+0xe1/0x100 <4>[ 201.763680] ? do_syscall_64+0x22e/0x1470 <4>[ 201.763682] ? putname+0x41/0x90 <4>[ 201.763687] ? __slab_free+0x129/0x2b0 <4>[ 201.763691] ? __pcs_replace_full_main+0x2ad/0x710 <4>[ 201.763694] ? putname+0x41/0x90 <4>[ 201.763696] ? kmem_cache_free+0x165/0x510 <4>[ 201.763700] ? putname+0x41/0x90 <4>[ 201.763702] ? do_sys_openat2+0x85/0xd0 <4>[ 201.763707] ? __x64_sys_openat+0x54/0xa0 <4>[ 201.763709] ? trace_hardirqs_on_prepare+0xe1/0x100 <4>[ 201.763712] ? do_syscall_64+0x22e/0x1470 <4>[ 201.763715] ? __fput+0x1bf/0x2f0 Oops#2 Part11 <4>[ 201.763718] ? fput_close_sync+0x3d/0xa0 <4>[ 201.763720] ? trace_hardirqs_on_prepare+0xe1/0x100 <4>[ 201.763723] ? do_syscall_64+0x22e/0x1470 <4>[ 201.763725] ? putname+0x41/0x90 <4>[ 201.763727] ? do_sys_openat2+0x85/0xd0 <4>[ 201.763731] ? __x64_sys_openat+0x54/0xa0 <4>[ 201.763734] ? trace_hardirqs_on_prepare+0xe1/0x100 <4>[ 201.763737] ? do_syscall_64+0x22e/0x1470 <4>[ 201.763739] ? do_syscall_64+0x22e/0x1470 <4>[ 201.763743] entry_SYSCALL_64_after_hwframe+0x76/0x7e <4>[ 201.763745] RIP: 0033:0x6fff0e11c5a4 <4>[ 201.763747] Code: c7 00 16 00 00 00 b8 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 f3 0f 1e fa 80 3d a5 ea 0e 00 00 74 13 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 54 c3 0f 1f 00 55 48 89 e5 48 83 ec 20 48 89 <4>[ 201.763748] RSP: 002b:00007ffe86cde688 EFLAGS: 00000202 ORIG_RAX: 0000000000000001 <4>[ 201.763750] RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 00006fff0e11c5a4 <4>[ 201.763752] RDX: 000000000000000c RSI: 00007ffe86cdeb50 RDI: 0000000000000007 <4>[ 201.763753] RBP: 000000000000000c R08: 0000000000000073 R09: 0000000000000000 <4>[ 201.763754] R10: 0000000000000000 R11: 0000000000000202 R12: 00007ffe86cdeb50 <4>[ 201.763755] R13: 0000000000000007 R14: 0000000000000006 R15: 00007ffe86cde800 <4>[ 201.763762] <4>[ 201.763763] irq event stamp: 1307090 <4>[ 201.763764] hardirqs last enabled at (1307089): [] _raw_spin_unlock_irqrestore+0x51/0x80 <4>[ 201.763767] hardirqs last disabled at (1307090): [] _raw_spin_lock_irq+0x6f/0x80 <4>[ 201.763769] softirqs last enabled at (1305990): [] __irq_exit_rcu+0x13f/0x160 Oops#2 Part10 <4>[ 201.763772] softirqs last disabled at (1305983): [] __irq_exit_rcu+0x13f/0x160 <4>[ 201.763773] ---[ end trace 0000000000000000 ]--- <7>[ 201.763775] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled <4>[ 201.763858] ------------[ cut here ]------------ <4>[ 201.763867] xe 0000:03:00.0: [drm] Tile0: GT0: Failed to invalidate GGTT (-ENODEV) <3>[ 201.763868] xe 0000:03:00.0: probe with driver xe failed with error -12 <4>[ 201.763879] WARNING: drivers/gpu/drm/xe/xe_ggtt.c:576 at ggtt_invalidate_gt_tlb.part.0+0x76/0xb0 [xe], CPU#12: kworker/12:4/2403 <4>[ 201.763991] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp hid_generic cmdlinepart coretemp eeepc_wmi spi_nor mei_pxp mei_hdcp asus_wmi mtd sparse_keymap platform_profile wmi_bmof kvm_intel binfmt_misc usbhid hid kvm irqbypass snd_intel_dspcfg ghash_clmulni_intel aesni_intel rapl snd_hda_codec snd_hda_core r8169 snd_hwdep intel_cstate snd_pcm video realtek snd_timer i2c_i801 idma64 mei_me snd i2c_mux spi_intel_pci nls_iso8859_1 soundcore spi_intel i2c_smbus mei intel_pmc_core pmt_telemetry pmt_discovery pmt_class intel_pmc_ssram_telemetry pinctrl_alderlake intel_vsec wmi acpi_tad acpi_pad dm_multipath msr nvme_fabrics fuse efi_pstore Oops#2 Part9 <4>[ 201.764089] nfnetlink autofs4 [last unloaded: snd_hda_intel] <4>[ 201.764096] CPU: 12 UID: 0 PID: 2403 Comm: kworker/12:4 Tainted: G S U W 7.0.0-rc3-lgci-xe-xe-4716-d1b3b2cab4528a3ba-debug+ #1 PREEMPT(lazy) <4>[ 201.764100] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN <4>[ 201.764102] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1656 04/18/2024 <4>[ 201.764104] Workqueue: xe-destroy-wq __guc_exec_queue_destroy_async [xe] <4>[ 201.764227] RIP: 0010:ggtt_invalidate_gt_tlb.part.0+0x81/0xb0 [xe] <4>[ 201.764329] Code: 48 8b 7f 08 4c 8b 77 50 4d 85 f6 75 03 4c 8b 37 e8 54 98 62 e1 48 89 c6 48 8d 3d ea c5 3d 00 4d 89 e1 45 89 e8 89 d9 4c 89 f2 <67> 48 0f b9 3a 5b 41 5c 41 5d 41 5e 5d 31 c0 31 d2 31 c9 31 f6 31 Oops#2 Part8 <4>[ 201.764332] RSP: 0018:ffffc90003ebfaf0 EFLAGS: 00010246 <4>[ 201.764335] RAX: ffffffffa11fd91f RBX: 0000000000000000 RCX: 0000000000000000 <4>[ 201.764337] RDX: ffff8881045f2690 RSI: ffffffffa11fd91f RDI: ffffffffa1001fe0 <4>[ 201.764339] RBP: ffffc90003ebfb10 R08: 0000000000000000 R09: ffffffffffffffed <4>[ 201.764341] R10: 0000000000000000 R11: 0000000000000000 R12: ffffffffffffffed <4>[ 201.764342] R13: 0000000000000000 R14: ffff8881045f2690 R15: 0000000000000000 <4>[ 201.764344] FS: 0000000000000000(0000) GS:ffff8888db29b000(0000) knlGS:0000000000000000 <4>[ 201.764346] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 201.764348] CR2: 00005eddfbda7d88 CR3: 000000000344c004 CR4: 0000000000f72ef0 <4>[ 201.764350] PKRU: 55555554 <4>[ 201.764352] Call Trace: <4>[ 201.764353] <4>[ 201.764358] ggtt_node_remove+0x11a/0x140 [xe] <3>[ 201.764487] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: GuC RC setup HOST_CONTROL(0) failed (-ENODEV) <4>[ 201.764461] xe_ggtt_node_remove+0x40/0xa0 [xe] <4>[ 201.764563] xe_ggtt_remove_bo+0x87/0x250 [xe] <4>[ 201.764666] ? _raw_write_unlock+0x22/0x50 <4>[ 201.764672] ? drm_vma_offset_remove+0x65/0x80 <4>[ 201.764679] xe_ttm_bo_destroy+0xa2/0x2d0 [xe] <4>[ 201.764775] ? lock_is_held_type+0xa3/0x130 <4>[ 201.764784] ttm_bo_release+0x70/0x310 [ttm] <4>[ 201.764793] ? xe_ggtt_might_lock+0x29/0x60 [xe] <7>[ 201.764846] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled <4>[ 201.764894] ? lock_release+0xd0/0x2b0 Oops#2 Part7 <4>[ 201.764901] ttm_bo_fini+0x3c/0x70 [ttm] <4>[ 201.764909] xe_gem_object_free+0x1a/0x30 [xe] <4>[ 201.765005] drm_gem_object_free+0x1d/0x40 <4>[ 201.765009] xe_bo_put+0x12a/0x190 [xe] <4>[ 201.765107] xe_lrc_destroy+0x74/0x90 [xe] <4>[ 201.765231] __xe_exec_queue_fini+0x6b/0xa0 [xe] <4>[ 201.765332] xe_exec_queue_fini+0x2b/0x60 [xe] <4>[ 201.765431] __guc_exec_queue_destroy_async+0x6c/0x1a0 [xe] <4>[ 201.765542] process_one_work+0x22e/0x740 <4>[ 201.765552] worker_thread+0x1e8/0x3d0 <4>[ 201.765556] ? __pfx_worker_thread+0x10/0x10 <4>[ 201.765559] kthread+0x10d/0x150 <4>[ 201.765563] ? __pfx_kthread+0x10/0x10 <4>[ 201.765568] ret_from_fork+0x3d4/0x480 <4>[ 201.765572] ? __pfx_kthread+0x10/0x10 <4>[ 201.765576] ret_from_fork_asm+0x1a/0x30 <4>[ 201.765588] <4>[ 201.765590] irq event stamp: 14491 <4>[ 201.765591] hardirqs last enabled at (14497): [] __up_console_sem+0x79/0xa0 <4>[ 201.765595] hardirqs last disabled at (14502): [] __up_console_sem+0x5e/0xa0 <4>[ 201.765598] softirqs last enabled at (14414): [] __irq_exit_rcu+0x13f/0x160 <4>[ 201.765602] softirqs last disabled at (14403): [] __irq_exit_rcu+0x13f/0x160 <4>[ 201.765604] ---[ end trace 0000000000000000 ]--- <7>[ 201.766107] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled <7>[ 201.846803] xe 0000:03:00.0: [drm:drm_pagemap_cache_fini [drm_gpusvm_helper]] Destroying dpagemap cache. <7>[ 201.849303] xe 0000:03:00.0: [drm:drm_pagemap_shrinker_fini [drm_gpusvm_helper]] Destroying dpagemap shrinker. Oops#2 Part6 <3>[ 204.036870] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=51 recv=50 <1>[ 204.037896] BUG: unable to handle page fault for address: ffffc9002338a188 <1>[ 204.037930] #PF: supervisor write access in kernel mode <1>[ 204.037946] #PF: error_code(0x0002) - not-present page <6>[ 204.037960] PGD 100000067 P4D 100000067 PUD 100ab9067 PMD 0 <4>[ 204.037985] Oops: Oops: 0002 [#1] SMP NOPTI <4>[ 204.038006] CPU: 12 UID: 0 PID: 2385 Comm: kworker/12:3 Tainted: G S U W 7.0.0-rc3-lgci-xe-xe-4716-d1b3b2cab4528a3ba-debug+ #1 PREEMPT(lazy) <4>[ 204.038042] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN <4>[ 204.038057] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1656 04/18/2024 <4>[ 204.038077] Workqueue: xe-destroy-wq __guc_exec_queue_destroy_async [xe] <4>[ 204.038576] RIP: 0010:xe_mmio_write32+0x58/0x2b0 [xe] <4>[ 204.039067] Code: 24 66 90 65 8b 05 dc 45 2e e3 48 0f a3 05 80 ac d0 e2 0f 82 1d 01 00 00 41 f7 c5 00 00 00 01 0f 84 b7 00 00 00 49 03 5c 24 08 <44> 89 3b 48 8d 65 d8 5b 41 5c 41 5d 41 5e 41 5f 5d 31 c0 31 d2 31 <4>[ 204.039102] RSP: 0018:ffffc900037ef7e0 EFLAGS: 00010086 <4>[ 204.039120] RAX: 0000000000000002 RBX: ffffc9002338a188 RCX: 0000000000000000 <4>[ 204.039137] RDX: 0000000000010001 RSI: 000000000000a188 RDI: ffff888155208060 <4>[ 204.039155] RBP: ffffc900037ef858 R08: 0000000000000000 R09: 0000000000000000 <4>[ 204.039172] R10: ffff8881673c0000 R11: 0000000000000001 R12: ffff888155208060 <4>[ 204.039188] R13: 000000000000a188 R14: ffff8881673c0000 R15: 0000000000010001 <4>[ 204.039204] FS: 0000000000000000(0000) GS:ffff8888db29b000(0000) knlGS:0000000000000000