Oops#2 Part15
<7>[ 277.551011] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6204] = 0x00400040
<7>[ 277.551075] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6208] = 0x00200020
<7>[ 277.551142] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x62a8] = 0x02400240
<7>[ 277.551209] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x7010] = 0x40004000
<7>[ 277.551275] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x7044] = 0x04200420
<7>[ 277.551341] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x7300] = 0x10001000
<7>[ 277.551407] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x83a8] = 0x20002000
<7>[ 277.551478] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6210] = ~0x3f18000|0x3f18000 (MCR)
<7>[ 277.553489] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC bcs0 WA job: 27 dwords
<7>[ 277.553568] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x22204] = ~0x7e7e|0x606
<7>[ 277.553638] xe 0000:03:00.0: [drm:xe_lrc_emit_hwe_state_instructions [xe]] Tile0: GT0: No non-register state to emit on graphics ver 20.01
<7>[ 277.555596] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC ccs0 WA job: 0 dwords
<7>[ 277.555673] xe 0000:03:00.0: [drm:xe_lrc_emit_hwe_state_instructions [xe]] Tile0: GT0: No non-register state to emit on graphics ver 20.01
<5>[ 277.557724] FAULT_INJECTION: forcing a failure.
<5>[ 277.557724] name fail_function, interval 0, probability 100, space 1, times 100
Oops#2 Part14
<3>[ 277.557728] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: GuC PC query task state failed: -ENOMEM
<4>[ 277.557784] ------------[ cut here ]------------
<4>[ 277.557785] xe 0000:03:00.0: [drm] Assertion `ct->g2h_outstanding == 0 || state == XE_GUC_CT_STATE_STOPPED` failed!
<4>[ 277.557785] platform: BATTLEMAGE subplatform: 7
<4>[ 277.557785] graphics: Xe2_HPG 20.01 step A0
<4>[ 277.557785] media: Xe2_HPM 13.01 step A1
<4>[ 277.557785] tile: 0 VRAM 12.0 GiB
<4>[ 277.557785] GT: 0 type 1
<4>[ 277.557788] WARNING: drivers/gpu/drm/xe/xe_guc_ct.c:541 at guc_ct_change_state+0x264/0x330 [xe], CPU#7: xe_fault_inject/6063
<4>[ 277.557866] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling hid_generic x86_pkg_temp_thermal intel_powerclamp cmdlinepart spi_nor mei_hdcp mei_pxp asus_nb_wmi asus_wmi mtd coretemp sparse_keymap platform_profile wmi_bmof kvm_intel usbhid binfmt_misc hid kvm irqbypass ghash_clmulni_intel aesni_intel snd_intel_dspcfg r8169 rapl snd_hda_codec video snd_hda_core intel_cstate realtek snd_hwdep snd_pcm snd_timer i2c_i801 mei_me i2c_mux spi_intel_pci snd soundcore i2c_smbus nls_iso8859_1 idma64 spi_intel mei intel_pmc_core pmt_telemetry pmt_discovery pmt_class intel_pmc_ssram_telemetry pinctrl_alderlake wmi intel_vsec acpi_tad acpi_pad dm_multipath msr nvme_fabrics fuse efi_pstore
Oops#2 Part13
<4>[ 277.557918] nfnetlink autofs4 [last unloaded: snd_hda_intel]
<4>[ 277.557922] CPU: 7 UID: 0 PID: 6063 Comm: xe_fault_inject Tainted: G S U W 7.0.0-rc2-lgci-xe-xe-4667-9565ad1c312903452-debug+ #1 PREEMPT(lazy)
<4>[ 277.557925] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4>[ 277.557926] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4>[ 277.557927] RIP: 0010:guc_ct_change_state+0x2d8/0x330 [xe]
<4>[ 277.557999] Code: 51 48 c1 ea 25 44 6b ca 64 44 29 c9 51 48 c7 c1 c0 67 18 a1 52 4c 8b 55 88 41 52 44 8b 4d 9c 4c 8b 45 90 48 8b 95 78 ff ff ff <67> 48 0f b9 3a 8b 8b 50 01 00 00 48 83 c4 60 85 c9 75 13 44 89 bb
<4>[ 277.558001] RSP: 0018:ffffc9000f9ef4e8 EFLAGS: 00010002
<4>[ 277.558003] RAX: ffffffffa11fb95a RBX: ffff8881f41b0738 RCX: ffffffffa11867c0
Oops#2 Part12
<4>[ 277.558004] RDX: ffff888104c6ae90 RSI: ffffffffa11fb95a RDI: ffffffffa1002f00
<4>[ 277.558005] RBP: ffffc9000f9ef5d0 R08: ffffffffa11fb9aa R09: 0000000000000007
<4>[ 277.558006] R10: ffffffffa11fba5b R11: 0000000000000514 R12: ffff8881f41b07c8
<4>[ 277.558007] R13: 0000000000000001 R14: 0000000000000000 R15: 0000000000000001
<4>[ 277.558008] FS: 0000701099ac5980(0000) GS:ffff8888db019000(0000) knlGS:0000000000000000
<4>[ 277.558010] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4>[ 277.558011] CR2: 00006184b72afa90 CR3: 00000001b9a0b006 CR4: 0000000000f72ef0
<4>[ 277.558012] PKRU: 55555554
<4>[ 277.558013] Call Trace:
<4>[ 277.558014]
<4>[ 277.558021] ? xe_guc_submit_enable+0xa8/0xf0 [xe]
<4>[ 277.558100] xe_guc_ct_disable+0x17/0x80 [xe]
<4>[ 277.558170] xe_guc_sanitize+0x2a/0x50 [xe]
<4>[ 277.558239] xe_uc_load_hw+0x19a/0x2b0 [xe]
<4>[ 277.558338] ? xe_migrate_init+0x277/0x2d0 [xe]
<4>[ 277.558420] xe_gt_init+0x35d/0xab0 [xe]
<4>[ 277.558485] ? _raw_spin_unlock_irqrestore+0x51/0x80
<4>[ 277.558490] ? __devm_add_action+0x70/0xa0
<4>[ 277.558494] ? xe_irq_install+0x11a/0x490 [xe]
<4>[ 277.558574] xe_device_probe+0x32c/0xbe0 [xe]
<4>[ 277.558634] ? __drm_dev_dbg+0x7d/0xb0
<4>[ 277.558638] ? __drmm_add_action_or_reset+0x1e/0x50
<4>[ 277.558644] xe_pci_probe+0x39b/0x620 [xe]
<4>[ 277.558724] ? trace_hardirqs_on+0x22/0x100
<4>[ 277.558731] local_pci_probe+0x47/0xb0
<4>[ 277.558735] pci_call_probe+0x6c/0x360
<4>[ 277.558741] ? _raw_spin_unlock+0x22/0x50
<4>[ 277.558746] pci_device_probe+0xae/0x110
<4>[ 277.558750] really_probe+0xf1/0x410
<4>[ 277.558754] __driver_probe_device+0x8c/0x190
Oops#2 Part11
<4>[ 277.558756] device_driver_attach+0x57/0xd0
<4>[ 277.558760] bind_store+0x142/0x150
<4>[ 277.558764] drv_attr_store+0x24/0x50
<4>[ 277.558767] sysfs_kf_write+0x4d/0x80
<4>[ 277.558772] kernfs_fop_write_iter+0x188/0x240
<4>[ 277.558776] vfs_write+0x283/0x540
<4>[ 277.558784] ksys_write+0x6f/0xf0
<4>[ 277.558787] __x64_sys_write+0x19/0x30
<4>[ 277.558789] x64_sys_call+0x259/0x26e0
<4>[ 277.558793] do_syscall_64+0xdd/0x1470
<4>[ 277.558797] ? free_to_partial_list+0x46d/0x640
<4>[ 277.558800] ? putname+0x41/0x90
<4>[ 277.558804] ? __slab_free+0x129/0x2b0
<4>[ 277.558808] ? __pcs_replace_full_main+0x29a/0x660
<4>[ 277.558812] ? putname+0x41/0x90
<4>[ 277.558813] ? kmem_cache_free+0x165/0x510
<4>[ 277.558818] ? putname+0x41/0x90
<4>[ 277.558820] ? do_sys_openat2+0x85/0xd0
<4>[ 277.558826] ? __x64_sys_openat+0x54/0xa0
<4>[ 277.558828] ? trace_hardirqs_on_prepare+0xe1/0x100
<4>[ 277.558832] ? do_syscall_64+0x22e/0x1470
<4>[ 277.558838] ? trace_hardirqs_on_prepare+0xe1/0x100
<4>[ 277.558841] ? do_syscall_64+0x22e/0x1470
<4>[ 277.558843] ? do_syscall_64+0x22e/0x1470
<4>[ 277.558846] ? trace_hardirqs_on_prepare+0xe1/0x100
<4>[ 277.558849] ? do_syscall_64+0x22e/0x1470
<4>[ 277.558852] ? do_syscall_64+0x22e/0x1470
<4>[ 277.558854] ? exc_page_fault+0xbd/0x2c0
<4>[ 277.558858] entry_SYSCALL_64_after_hwframe+0x76/0x7e
<4>[ 277.558860] RIP: 0033:0x70109bd1c5a4
<4>[ 277.558863] Code: c7 00 16 00 00 00 b8 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 f3 0f 1e fa 80 3d a5 ea 0e 00 00 74 13 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 54 c3 0f 1f 00 55 48 89 e5 48 83 ec 20 48 89
Oops#2 Part10
<4>[ 277.558864] RSP: 002b:00007ffc70daf408 EFLAGS: 00000202 ORIG_RAX: 0000000000000001
<4>[ 277.558866] RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 000070109bd1c5a4
<4>[ 277.558867] RDX: 000000000000000c RSI: 00007ffc70daf8d0 RDI: 0000000000000007
<4>[ 277.558869] RBP: 000000000000000c R08: 0000000000000073 R09: 0000000000000000
<4>[ 277.558870] R10: 0000000000000000 R11: 0000000000000202 R12: 00007ffc70daf8d0
<4>[ 277.558871] R13: 0000000000000007 R14: 0000000000000006 R15: 00007ffc70daf580
<4>[ 277.558879]
<4>[ 277.558880] irq event stamp: 1577240
<4>[ 277.558881] hardirqs last enabled at (1577239): [] _raw_spin_unlock_irqrestore+0x51/0x80
<4>[ 277.558884] hardirqs last disabled at (1577240): [] _raw_spin_lock_irq+0x6f/0x80
<4>[ 277.558886] softirqs last enabled at (1577234): [] __irq_exit_rcu+0x13f/0x160
<4>[ 277.558889] softirqs last disabled at (1577203): [] __irq_exit_rcu+0x13f/0x160
<4>[ 277.558892] ---[ end trace 0000000000000000 ]---
<7>[ 277.558894] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<4>[ 277.558983] ------------[ cut here ]------------
<4>[ 277.558999] xe 0000:03:00.0: [drm] Tile0: GT0: Failed to invalidate GGTT (-ENODEV)
<4>[ 277.559001] WARNING: drivers/gpu/drm/xe/xe_ggtt.c:576 at ggtt_invalidate_gt_tlb.part.0+0x76/0xb0 [xe], CPU#4: kworker/4:12/5683
<4>[ 277.559069] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper
Oops#2 Part9
<3>[ 277.559112] xe 0000:03:00.0: probe with driver xe failed with error -12
<4>[ 277.559185] drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling hid_generic x86_pkg_temp_thermal intel_powerclamp cmdlinepart spi_nor mei_hdcp mei_pxp asus_nb_wmi asus_wmi mtd coretemp sparse_keymap platform_profile wmi_bmof kvm_intel usbhid binfmt_misc hid kvm irqbypass ghash_clmulni_intel aesni_intel snd_intel_dspcfg r8169 rapl snd_hda_codec video snd_hda_core intel_cstate realtek snd_hwdep snd_pcm snd_timer i2c_i801 mei_me i2c_mux spi_intel_pci snd soundcore i2c_smbus nls_iso8859_1 idma64 spi_intel mei intel_pmc_core pmt_telemetry pmt_discovery pmt_class intel_pmc_ssram_telemetry pinctrl_alderlake wmi intel_vsec acpi_tad acpi_pad dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink autofs4 [last unloaded: snd_hda_intel]
<4>[ 277.559241] CPU: 4 UID: 0 PID: 5683 Comm: kworker/4:12 Tainted: G S U W 7.0.0-rc2-lgci-xe-xe-4667-9565ad1c312903452-debug+ #1 PREEMPT(lazy)
<4>[ 277.559244] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4>[ 277.559245] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4>[ 277.559247] Workqueue: xe-destroy-wq __guc_exec_queue_destroy_async [xe]
<4>[ 277.559321] RIP: 0010:ggtt_invalidate_gt_tlb.part.0+0x81/0xb0 [xe]
<4>[ 277.559381] Code: 48 8b 7f 08 4c 8b 77 50 4d 85 f6 75 03 4c 8b 37 e8 94 93 62 e1 48 89 c6 48 8d 3d 6a cc 3d 00 4d 89 e1 45 89 e8 89 d9 4c 89 f2 <67> 48 0f b9 3a 5b 41 5c 41 5d 41 5e 5d 31 c0 31 d2 31 c9 31 f6 31
Oops#2 Part8
<4>[ 277.559383] RSP: 0018:ffffc9000f027af0 EFLAGS: 00010246
<4>[ 277.559385] RAX: ffffffffa11fb95a RBX: 0000000000000000 RCX: 0000000000000000
<4>[ 277.559386] RDX: ffff888104c6ae90 RSI: ffffffffa11fb95a RDI: ffffffffa1001fe0
<4>[ 277.559388] RBP: ffffc9000f027b10 R08: 0000000000000000 R09: ffffffffffffffed
<4>[ 277.559389] R10: 0000000000000000 R11: 0000000000000000 R12: ffffffffffffffed
<4>[ 277.559390] R13: 0000000000000000 R14: ffff888104c6ae90 R15: 0000000000000000
<4>[ 277.559391] FS: 0000000000000000(0000) GS:ffff8888dae99000(0000) knlGS:0000000000000000
<4>[ 277.559393] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4>[ 277.559394] CR2: 00006184b7187028 CR3: 000000000344c005 CR4: 0000000000f72ef0
<4>[ 277.559395] PKRU: 55555554
<4>[ 277.559396] Call Trace:
<4>[ 277.559397]
<4>[ 277.559401] ggtt_node_remove+0x11a/0x140 [xe]
<4>[ 277.559462] xe_ggtt_node_remove+0x40/0xa0 [xe]
<4>[ 277.559522] xe_ggtt_remove_bo+0x87/0x250 [xe]
<4>[ 277.559582] ? _raw_write_unlock+0x22/0x50
<4>[ 277.559586] ? drm_vma_offset_remove+0x65/0x80
<4>[ 277.559591] xe_ttm_bo_destroy+0xa2/0x2d0 [xe]
<4>[ 277.559645] ? lock_is_held_type+0xa3/0x130
<4>[ 277.559651] ttm_bo_release+0x70/0x310 [ttm]
<3>[ 277.559663] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: GuC RC setup HOST_CONTROL(0) failed (-ENODEV)
<4>[ 277.559657] ? xe_ggtt_might_lock+0x29/0x60 [xe]
<4>[ 277.559716] ? lock_release+0xd0/0x2b0
<4>[ 277.559721] ttm_bo_fini+0x3c/0x70 [ttm]
<4>[ 277.559727] xe_gem_object_free+0x1a/0x30 [xe]
Oops#2 Part7
<4>[ 277.559807] drm_gem_object_free+0x1d/0x40
<4>[ 277.559810] xe_bo_put+0x12a/0x190 [xe]
<4>[ 277.559865] xe_lrc_destroy+0x74/0x90 [xe]
<4>[ 277.559939] __xe_exec_queue_fini+0x6b/0xa0 [xe]
<4>[ 277.559997] xe_exec_queue_fini+0x2b/0x60 [xe]
<7>[ 277.560025] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<4>[ 277.560054] __guc_exec_queue_destroy_async+0x6c/0x1a0 [xe]
<4>[ 277.560123] process_one_work+0x22e/0x740
<4>[ 277.560130] worker_thread+0x1e8/0x3d0
<4>[ 277.560132] ? __pfx_worker_thread+0x10/0x10
<4>[ 277.560135] kthread+0x10d/0x150
<4>[ 277.560138] ? __pfx_kthread+0x10/0x10
<4>[ 277.560141] ret_from_fork+0x3d4/0x480
<4>[ 277.560144] ? __pfx_kthread+0x10/0x10
<4>[ 277.560147] ret_from_fork_asm+0x1a/0x30
<4>[ 277.560154]
<4>[ 277.560155] irq event stamp: 12295
<4>[ 277.560157] hardirqs last enabled at (12301): [] __up_console_sem+0x79/0xa0
<4>[ 277.560160] hardirqs last disabled at (12306): [] __up_console_sem+0x5e/0xa0
<4>[ 277.560162] softirqs last enabled at (12200): [] __irq_exit_rcu+0x13f/0x160
<4>[ 277.560164] softirqs last disabled at (12195): [] __irq_exit_rcu+0x13f/0x160
<4>[ 277.560166] ---[ end trace 0000000000000000 ]---
<7>[ 277.561328] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7>[ 277.648207] xe 0000:03:00.0: [drm:drm_pagemap_cache_fini [drm_gpusvm_helper]] Destroying dpagemap cache.
<7>[ 277.650672] xe 0000:03:00.0: [drm:drm_pagemap_shrinker_fini [drm_gpusvm_helper]] Destroying dpagemap shrinker.