Oops#1 Part9 <7>[ 653.689135] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x5588] = 0x04000400 <7>[ 653.689196] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6204] = 0x01400140 <7>[ 653.689252] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6208] = 0x00200020 <7>[ 653.689310] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x62a8] = 0x02400240 <7>[ 653.689368] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x7010] = 0x40004000 <7>[ 653.689425] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x7300] = 0x10001000 <7>[ 653.689484] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x83a8] = 0x20002000 <7>[ 653.689547] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6210] = ~0x3f18000|0x3f18000 <7>[ 653.691475] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC bcs0 WA job: 27 dwords <7>[ 653.691548] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x22204] = ~0x7e7e|0x606 <7>[ 653.691613] xe 0000:03:00.0: [drm:xe_lrc_emit_hwe_state_instructions [xe]] Tile0: GT0: No non-register state to emit on graphics ver 20.01 <7>[ 653.693289] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC ccs0 WA job: 0 dwords <7>[ 653.693360] xe 0000:03:00.0: [drm:xe_lrc_emit_hwe_state_instructions [xe]] Tile0: GT0: No non-register state to emit on graphics ver 20.01 <5>[ 653.695863] FAULT_INJECTION: forcing a failure. <5>[ 653.695863] name fail_function, interval 0, probability 100, space 1, times 100 Oops#1 Part8 <3>[ 653.695867] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: GuC PC query task state failed: -ENOMEM <4>[ 653.695941] ------------[ cut here ]------------ <4>[ 653.695942] xe 0000:03:00.0: [drm] Assertion `ct->g2h_outstanding == 0 || state == XE_GUC_CT_STATE_STOPPED` failed! <4>[ 653.695942] platform: BATTLEMAGE subplatform: 7 <4>[ 653.695942] graphics: Xe2_HPG 20.01 step A0 <4>[ 653.695942] media: Xe2_HPM 13.01 step A1 <4>[ 653.695942] tile: 0 VRAM 12.0 GiB <4>[ 653.695942] GT: 0 type 1 <4>[ 653.695945] WARNING: drivers/gpu/drm/xe/xe_guc_ct.c:527 at guc_ct_change_state+0x279/0x350 [xe], CPU#1: xe_fault_inject/17609 <4>[ 653.696018] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_buddy drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp cmdlinepart hid_generic asus_nb_wmi coretemp spi_nor asus_wmi sparse_keymap mtd platform_profile mei_pxp mei_hdcp wmi_bmof kvm_intel binfmt_misc kvm irqbypass snd_hda_intel ghash_clmulni_intel aesni_intel r8169 snd_intel_dspcfg rapl video snd_hda_codec intel_cstate usbhid snd_hda_core snd_hwdep realtek hid snd_pcm nls_iso8859_1 snd_timer i2c_i801 snd spi_intel_pci i2c_mux soundcore spi_intel i2c_smbus idma64 intel_pmc_core pmt_telemetry mei_me pmt_discovery pmt_class intel_pmc_ssram_telemetry mei wmi pinctrl_alderlake acpi_pad acpi_tad intel_vsec dm_multipath msr nvme_fabrics fuse Oops#1 Part7 <4>[ 653.696072] efi_pstore nfnetlink autofs4 [last unloaded: xe_live_test] <4>[ 653.696076] CPU: 1 UID: 0 PID: 17609 Comm: xe_fault_inject Tainted: G S U W N 6.19.0-rc8-lgci-xe-xe-4506-05230cfcdb1abc225+ #1 PREEMPT(voluntary) <4>[ 653.696079] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN, [N]=TEST <4>[ 653.696080] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024 <4>[ 653.696082] RIP: 0010:guc_ct_change_state+0x2ed/0x350 [xe] <4>[ 653.696149] Code: 1f 85 eb 51 48 c1 ea 25 44 6b ca 64 44 29 c9 51 48 c7 c1 a8 46 18 a1 52 ff 75 b0 44 8b 4d 94 4c 8b 45 88 48 8b 95 78 ff ff ff <67> 48 0f b9 3a 8b 8b 48 01 00 00 48 83 c4 60 85 c9 75 13 44 89 bb <4>[ 653.696150] RSP: 0018:ffffc900028f7590 EFLAGS: 00010002 Oops#1 Part6 <4>[ 653.696152] RAX: ffffffffa11f8635 RBX: ffff8881ddb188a0 RCX: ffffffffa11846a8 <4>[ 653.696154] RDX: ffff888103ca6910 RSI: ffffffffa11f8635 RDI: ffffffffa1002f60 <4>[ 653.696155] RBP: ffffc900028f7678 R08: ffffffffa11f867a R09: 0000000000000007 <4>[ 653.696156] R10: 0000000000000001 R11: 0000000000000514 R12: ffff8881ddb188a8 <4>[ 653.696157] R13: ffff8881ddb18938 R14: 0000000000000515 R15: 0000000000000001 <4>[ 653.696158] FS: 000070fc0fe0c980(0000) GS:ffff8888dad5b000(0000) knlGS:0000000000000000 <4>[ 653.696159] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 653.696160] CR2: 00005adf6c5c6218 CR3: 00000001e40c7001 CR4: 0000000000f72ef0 <4>[ 653.696162] PKRU: 55555554 <4>[ 653.696163] Call Trace: <4>[ 653.696164] <4>[ 653.696171] ? xe_guc_submit_enable+0xa8/0xf0 [xe] <4>[ 653.696244] xe_guc_ct_disable+0x17/0x80 [xe] <4>[ 653.696309] xe_guc_sanitize+0x2a/0x50 [xe] <4>[ 653.696373] xe_uc_load_hw+0x187/0x2a0 [xe] <4>[ 653.696463] ? xe_migrate_init+0x277/0x2d0 [xe] <4>[ 653.696540] xe_gt_init+0x363/0xab0 [xe] <4>[ 653.696602] ? trace_hardirqs_on+0x63/0xd0 <4>[ 653.696605] ? _raw_spin_unlock_irqrestore+0x51/0x80 <4>[ 653.696609] ? __devm_add_action+0x70/0xa0 <4>[ 653.696613] ? xe_irq_install+0x11a/0x490 [xe] <4>[ 653.696688] xe_device_probe+0x3cc/0xc20 [xe] <4>[ 653.696745] ? __drm_dev_dbg+0x7d/0xb0 <4>[ 653.696749] ? __drmm_add_action_or_reset+0x1e/0x50 <4>[ 653.696755] xe_pci_probe+0x396/0x610 [xe] <4>[ 653.696839] local_pci_probe+0x47/0xb0 <4>[ 653.696844] pci_device_probe+0xf3/0x260 <4>[ 653.696849] really_probe+0xf1/0x410 <4>[ 653.696852] __driver_probe_device+0x8c/0x190 Oops#1 Part5 <4>[ 653.696855] device_driver_attach+0x57/0xd0 <4>[ 653.696858] bind_store+0x77/0xd0 <4>[ 653.696861] drv_attr_store+0x24/0x50 <4>[ 653.696863] sysfs_kf_write+0x4d/0x80 <4>[ 653.696868] kernfs_fop_write_iter+0x188/0x240 <4>[ 653.696872] vfs_write+0x283/0x540 <4>[ 653.696875] ? trace_hardirqs_on+0x63/0xd0 <4>[ 653.696882] ksys_write+0x6f/0xf0 <4>[ 653.696886] __x64_sys_write+0x19/0x30 <4>[ 653.696888] x64_sys_call+0x79/0x26b0 <4>[ 653.696891] do_syscall_64+0x93/0x1470 <4>[ 653.696895] ? kmem_cache_free+0x49f/0x5c0 <4>[ 653.696897] ? putname+0x3e/0x80 <4>[ 653.696903] ? putname+0x3e/0x80 <4>[ 653.696904] ? putname+0x3e/0x80 <4>[ 653.696907] ? do_sys_openat2+0x95/0xe0 <4>[ 653.696912] ? __x64_sys_openat+0x54/0xa0 <4>[ 653.696916] ? do_syscall_64+0x1e4/0x1470 <4>[ 653.696918] ? call_rcu+0x34/0x50 <4>[ 653.696922] ? __delete_object+0x60/0xa0 <4>[ 653.696927] ? kmem_cache_free+0x49f/0x5c0 <4>[ 653.696929] ? putname+0x3e/0x80 <4>[ 653.696933] ? putname+0x3e/0x80 <4>[ 653.696935] ? putname+0x3e/0x80 <4>[ 653.696937] ? do_sys_openat2+0x95/0xe0 <4>[ 653.696942] ? __x64_sys_openat+0x54/0xa0 <4>[ 653.696946] ? do_syscall_64+0x1e4/0x1470 <4>[ 653.696950] entry_SYSCALL_64_after_hwframe+0x76/0x7e <4>[ 653.696951] RIP: 0033:0x70fc11f1c5a4 <4>[ 653.696954] Code: c7 00 16 00 00 00 b8 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 f3 0f 1e fa 80 3d a5 ea 0e 00 00 74 13 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 54 c3 0f 1f 00 55 48 89 e5 48 83 ec 20 48 89 <4>[ 653.696956] RSP: 002b:00007fffb3832f38 EFLAGS: 00000202 ORIG_RAX: 0000000000000001 <4>[ 653.696958] RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 000070fc11f1c5a4 Oops#1 Part4 <4>[ 653.696959] RDX: 000000000000000c RSI: 00007fffb38343f0 RDI: 000000000000000b <4>[ 653.696960] RBP: 000000000000000c R08: 0000000000000073 R09: 0000000000000000 <4>[ 653.696961] R10: 0000000000000000 R11: 0000000000000202 R12: 00007fffb38343f0 <4>[ 653.696962] R13: 000000000000000b R14: 000058372f10535b R15: 00007fffb38340a0 <4>[ 653.696970] <4>[ 653.696971] irq event stamp: 941374 <4>[ 653.696972] hardirqs last enabled at (941373): [] _raw_spin_unlock_irqrestore+0x51/0x80 <4>[ 653.696975] hardirqs last disabled at (941374): [] _raw_spin_lock_irq+0x6f/0x80 <4>[ 653.696976] softirqs last enabled at (941162): [] __irq_exit_rcu+0x13f/0x160 <4>[ 653.696979] softirqs last disabled at (941157): [] __irq_exit_rcu+0x13f/0x160 <4>[ 653.696982] ---[ end trace 0000000000000000 ]--- <7>[ 653.696983] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled <3>[ 653.697082] xe 0000:03:00.0: probe with driver xe failed with error -12 <3>[ 653.697667] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: GuC RC enable mode=0 failed: -ENODEV <7>[ 653.697849] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled <7>[ 653.698938] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled <7>[ 653.798807] xe 0000:03:00.0: [drm:drm_pagemap_cache_fini [drm_gpusvm_helper]] Destroying dpagemap cache. <7>[ 653.800672] xe 0000:03:00.0: [drm:drm_pagemap_shrinker_fini [drm_gpusvm_helper]] Destroying dpagemap shrinker. Oops#1 Part3 <3>[ 655.971404] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=39 recv=38 <1>[ 655.972542] BUG: unable to handle page fault for address: ffffc9000838a188 <1>[ 655.972577] #PF: supervisor write access in kernel mode <1>[ 655.972592] #PF: error_code(0x0002) - not-present page <6>[ 655.972604] PGD 100000067 P4D 100000067 PUD 100aa5067 PMD 0 <4>[ 655.972627] Oops: Oops: 0002 [#1] SMP NOPTI <4>[ 655.972645] CPU: 9 UID: 0 PID: 6982 Comm: kworker/9:66 Tainted: G S U W N 6.19.0-rc8-lgci-xe-xe-4506-05230cfcdb1abc225+ #1 PREEMPT(voluntary) <4>[ 655.972674] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN, [N]=TEST <4>[ 655.972685] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024 <4>[ 655.972700] Workqueue: xe-destroy-wq __guc_exec_queue_destroy_async [xe] <4>[ 655.973196] RIP: 0010:xe_mmio_write32+0x58/0x280 [xe] <4>[ 655.973653] Code: 24 66 90 65 8b 05 3c 7a 2a e3 48 0f a3 05 e0 a2 cd e2 0f 82 ee 00 00 00 41 f7 c5 00 00 00 01 0f 84 88 00 00 00 49 03 5c 24 08 <44> 89 3b 48 8d 65 d8 5b 41 5c 41 5d 41 5e 41 5f 5d 31 c0 31 d2 31 <4>[ 655.973689] RSP: 0018:ffffc9000fbc7830 EFLAGS: 00010086 <4>[ 655.973706] RAX: 0000000000000002 RBX: ffffc9000838a188 RCX: 0000000000000000 <4>[ 655.973723] RDX: 0000000000010001 RSI: 000000000000a188 RDI: ffff8881818d81c8 <4>[ 655.973741] RBP: ffffc9000fbc78a8 R08: 0000000000000000 R09: 0000000000000000 <4>[ 655.973757] R10: ffff88812b4f0000 R11: 0000000000000001 R12: ffff8881818d81c8 <4>[ 655.973774] R13: 000000000000a188 R14: ffff88812b4f0000 R15: 0000000000010001 <4>[ 655.973790] FS: 0000000000000000(0000) GS:ffff8888db15b000(0000) knlGS:0000000000000000 Oops#1 Part2 <4>[ 655.973810] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 655.973826] CR2: ffffc9000838a188 CR3: 0000000003448002 CR4: 0000000000f72ef0 <4>[ 655.973843] PKRU: 55555554 <4>[ 655.973854] Call Trace: <4>[ 655.973865] <4>[ 655.973884] xe_force_wake_get+0x417/0x950 [xe] <4>[ 655.974281] ? _raw_spin_unlock_irqrestore+0x27/0x80 <4>[ 655.974310] send_tlb_inval_ggtt+0xfa/0x270 [xe] <4>[ 655.974736] ? trace_hardirqs_on+0x63/0xd0 <4>[ 655.974756] ? _raw_spin_unlock_irq+0x27/0x70 <4>[ 655.974771] ? xe_tlb_inval_fence_prep+0xbf/0x1a0 [xe] <4>[ 655.975258] xe_tlb_inval_ggtt+0x73/0x250 [xe] <4>[ 655.975723] ? find_held_lock+0x31/0x90 <4>[ 655.975740] ? ggtt_node_remove+0xc4/0x140 [xe] <4>[ 655.976039] ggtt_invalidate_gt_tlb.part.0+0x1f/0xb0 [xe] <4>[ 655.976107] ggtt_node_remove+0x122/0x140 [xe] <4>[ 655.976175] xe_ggtt_node_remove+0x40/0xa0 [xe] <4>[ 655.976242] xe_ggtt_remove_bo+0x87/0x250 [xe] <4>[ 655.976309] ? _raw_write_unlock+0x22/0x50 <4>[ 655.976311] ? drm_vma_offset_remove+0x65/0x80 <4>[ 655.976316] xe_ttm_bo_destroy+0xa2/0x2d0 [xe] <4>[ 655.976382] ? lock_is_held_type+0xa3/0x130 <4>[ 655.976387] ttm_bo_release+0x70/0x330 [ttm] <4>[ 655.976393] ? xe_ggtt_might_lock+0x29/0x60 [xe] <4>[ 655.976460] ? lock_release+0xce/0x280 <4>[ 655.976464] ttm_bo_fini+0x3c/0x70 [ttm] <4>[ 655.976469] xe_gem_object_free+0x1a/0x30 [xe] <4>[ 655.976535] drm_gem_object_free+0x1d/0x40 <4>[ 655.976538] xe_bo_put+0x12a/0x190 [xe] <4>[ 655.976604] xe_lrc_destroy+0x47/0x60 [xe] <4>[ 655.976680] xe_exec_queue_fini+0x85/0xd0 [xe] Oops#1 Part1 <4>[ 655.976747] __guc_exec_queue_destroy_async+0x6c/0x170 [xe] <4>[ 655.976818] process_one_work+0x22e/0x6b0 <4>[ 655.976835] worker_thread+0x1e8/0x3d0 <4>[ 655.976838] ? __pfx_worker_thread+0x10/0x10 <4>[ 655.976840] kthread+0x11f/0x250 <4>[ 655.976844] ? __pfx_kthread+0x10/0x10 <4>[ 655.976847] ret_from_fork+0x344/0x3a0 <4>[ 655.976850] ? __pfx_kthread+0x10/0x10 <4>[ 655.976853] ret_from_fork_asm+0x1a/0x30 <4>[ 655.976859] <4>[ 655.976860] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_buddy drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp cmdlinepart hid_generic asus_nb_wmi coretemp spi_nor asus_wmi sparse_keymap mtd platform_profile mei_pxp mei_hdcp wmi_bmof kvm_intel binfmt_misc kvm irqbypass snd_hda_intel ghash_clmulni_intel aesni_intel r8169 snd_intel_dspcfg rapl video snd_hda_codec intel_cstate usbhid snd_hda_core snd_hwdep realtek hid snd_pcm nls_iso8859_1 snd_timer i2c_i801 snd spi_intel_pci i2c_mux soundcore spi_intel i2c_smbus idma64 intel_pmc_core pmt_telemetry mei_me pmt_discovery pmt_class intel_pmc_ssram_telemetry mei wmi pinctrl_alderlake acpi_pad acpi_tad intel_vsec dm_multipath msr nvme_fabrics fuse <4>[ 655.976895] efi_pstore nfnetlink autofs4 [last unloaded: xe_live_test] <4>[ 655.976923] CR2: ffffc9000838a188 <4>[ 655.976929] ---[ end trace 0000000000000000 ]---