Oops#2 Part15 <7>[ 350.434146] xe 0000:03:00.0: [drm:xe_guc_db_mgr_init [xe]] Tile0: GT0: using 256 doorbells <7>[ 350.435323] xe 0000:03:00.0: [drm:guc_buf_cache_init [xe]] Tile0: GT0: reusable buffer with 2097152 dwords at 0x627000 for xe_guc_buf_cache_init_with_size [xe] <7>[ 350.436210] xe 0000:03:00.0: [drm:xe_migrate_init [xe]] Migrate min chunk size is 0x00010000 <7>[ 350.437234] xe 0000:03:00.0: [drm:xe_guc_capture_steered_list_init [xe]] Tile0: GT0: capture found 120 ext-regs. <7>[ 350.458702] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152) <7>[ 350.469639] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 10ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034 <7>[ 350.469914] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled <7>[ 350.470683] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC rcs0 WA job: 4140 dwords <7>[ 350.470772] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x5588] = 0x04000400 <7>[ 350.470838] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6204] = 0x01400140 <7>[ 350.470897] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6208] = 0x00200020 <7>[ 350.470958] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x62a8] = 0x02400240 <7>[ 350.471018] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x7010] = 0x40004000 <7>[ 350.471077] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x7044] = 0x04200420 Oops#2 Part14 <7>[ 350.471135] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x7300] = 0x10001000 <7>[ 350.471196] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x83a8] = 0x20002000 <7>[ 350.471261] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6210] = ~0x3f18000|0x3f18000 <7>[ 350.473003] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC bcs0 WA job: 27 dwords <7>[ 350.473086] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x22204] = ~0x7e7e|0x606 <7>[ 350.473150] xe 0000:03:00.0: [drm:xe_lrc_emit_hwe_state_instructions [xe]] Tile0: GT0: No non-register state to emit on graphics ver 20.01 <7>[ 350.474668] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC ccs0 WA job: 0 dwords <7>[ 350.474750] xe 0000:03:00.0: [drm:xe_lrc_emit_hwe_state_instructions [xe]] Tile0: GT0: No non-register state to emit on graphics ver 20.01 <5>[ 350.477062] FAULT_INJECTION: forcing a failure. <5>[ 350.477062] name fail_function, interval 0, probability 100, space 1, times 100 <3>[ 350.477069] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: GuC PC query task state failed: -ENOMEM <4>[ 350.477256] ------------[ cut here ]------------ <4>[ 350.477257] xe 0000:03:00.0: [drm] Assertion `ct->g2h_outstanding == 0 || state == XE_GUC_CT_STATE_STOPPED` failed! <4>[ 350.477257] platform: BATTLEMAGE subplatform: 7 <4>[ 350.477257] graphics: Xe2_HPG 20.01 step A0 <4>[ 350.477257] media: Xe2_HPM 13.01 step A1 <4>[ 350.477257] tile: 0 VRAM 12.0 GiB <4>[ 350.477257] GT: 0 type 1 Oops#2 Part13 <4>[ 350.477260] WARNING: drivers/gpu/drm/xe/xe_guc_ct.c:527 at guc_ct_change_state+0x279/0x350 [xe], CPU#4: xe_fault_inject/6547 <4>[ 350.477344] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit overlay intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp cmdlinepart hid_generic coretemp spi_nor asus_nb_wmi mei_pxp mei_hdcp asus_wmi mtd sparse_keymap platform_profile wmi_bmof kvm_intel usbhid kvm irqbypass hid ghash_clmulni_intel aesni_intel snd_intel_dspcfg r8169 snd_hda_codec video rapl snd_hda_core intel_cstate snd_hwdep binfmt_misc realtek snd_pcm snd_timer i2c_i801 idma64 i2c_mux snd mei_me spi_intel_pci i2c_smbus spi_intel soundcore mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry wmi intel_vsec acpi_tad pinctrl_alderlake acpi_pad dm_multipath msr nvme_fabrics fuse efi_pstore <4>[ 350.477403] nfnetlink autofs4 [last unloaded: snd_hda_intel] <4>[ 350.477408] CPU: 4 UID: 0 PID: 6547 Comm: xe_fault_inject Tainted: G S U W 6.19.0-lgci-xe-xe-4574-e1032fc6a7b99e9b2-debug+ #1 PREEMPT(voluntary) <4>[ 350.477411] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN <4>[ 350.477412] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1656 04/18/2024 Oops#2 Part12 <4>[ 350.477413] RIP: 0010:guc_ct_change_state+0x2ed/0x350 [xe] <4>[ 350.477495] Code: 1f 85 eb 51 48 c1 ea 25 44 6b ca 64 44 29 c9 51 48 c7 c1 d0 63 18 a1 52 ff 75 b0 44 8b 4d 94 4c 8b 45 88 48 8b 95 78 ff ff ff <67> 48 0f b9 3a 8b 8b 48 01 00 00 48 83 c4 60 85 c9 75 13 44 89 bb <4>[ 350.477497] RSP: 0018:ffffc9000b34f6c0 EFLAGS: 00010002 <4>[ 350.477499] RAX: ffffffffa11fa8f2 RBX: ffff888151108738 RCX: ffffffffa11863d0 <4>[ 350.477500] RDX: ffff888103fba510 RSI: ffffffffa11fa8f2 RDI: ffffffffa1002ef0 <4>[ 350.477501] RBP: ffffc9000b34f7a8 R08: ffffffffa11fa942 R09: 0000000000000007 <4>[ 350.477503] R10: 0000000000000001 R11: 0000000000000514 R12: ffff888151108740 <4>[ 350.477504] R13: ffff8881511087d0 R14: 0000000000000515 R15: 0000000000000001 <4>[ 350.477505] FS: 0000701fb3d11980(0000) GS:ffff8888daeda000(0000) knlGS:0000000000000000 <4>[ 350.477506] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 350.477508] CR2: 00005c59730be7d0 CR3: 0000000111902004 CR4: 0000000000f72ef0 <4>[ 350.477509] PKRU: 55555554 <4>[ 350.477510] Call Trace: <4>[ 350.477511] <4>[ 350.477520] ? xe_guc_submit_enable+0xa8/0xf0 [xe] <4>[ 350.477609] xe_guc_ct_disable+0x17/0x80 [xe] <4>[ 350.477690] xe_guc_sanitize+0x2a/0x50 [xe] <4>[ 350.477770] xe_uc_load_hw+0x19a/0x2b0 [xe] <4>[ 350.477874] ? xe_migrate_init+0x277/0x2d0 [xe] <4>[ 350.477965] xe_gt_init+0x35d/0xab0 [xe] <4>[ 350.478042] ? trace_hardirqs_on+0x63/0xd0 <4>[ 350.478046] ? _raw_spin_unlock_irqrestore+0x51/0x80 <4>[ 350.478050] ? __devm_add_action+0x70/0xa0 <4>[ 350.478055] ? xe_irq_install+0x11a/0x490 [xe] Oops#2 Part11 <4>[ 350.478145] xe_device_probe+0x3c5/0xc10 [xe] <4>[ 350.478222] ? __drm_dev_dbg+0x7d/0xb0 <4>[ 350.478226] ? __drmm_add_action_or_reset+0x1e/0x50 <4>[ 350.478233] xe_pci_probe+0x396/0x610 [xe] <4>[ 350.478328] local_pci_probe+0x47/0xb0 <4>[ 350.478333] pci_device_probe+0xf3/0x260 <4>[ 350.478338] really_probe+0xf1/0x410 <4>[ 350.478341] __driver_probe_device+0x8c/0x190 <4>[ 350.478344] device_driver_attach+0x57/0xd0 <4>[ 350.478347] bind_store+0x77/0xd0 <4>[ 350.478351] drv_attr_store+0x24/0x50 <4>[ 350.478354] sysfs_kf_write+0x4d/0x80 <4>[ 350.478358] kernfs_fop_write_iter+0x188/0x240 <4>[ 350.478362] vfs_write+0x283/0x540 <4>[ 350.478371] ksys_write+0x6f/0xf0 <4>[ 350.478375] __x64_sys_write+0x19/0x30 <4>[ 350.478377] x64_sys_call+0x79/0x26b0 <4>[ 350.478380] do_syscall_64+0x93/0x1470 <4>[ 350.478383] ? __slab_free+0x15e/0x2c0 <4>[ 350.478387] ? call_rcu+0x34/0x50 <4>[ 350.478390] ? __delete_object+0x60/0xa0 <4>[ 350.478397] ? kmem_cache_free+0x49f/0x5c0 <4>[ 350.478398] ? putname+0x3e/0x80 <4>[ 350.478404] ? putname+0x3e/0x80 <4>[ 350.478406] ? putname+0x3e/0x80 <4>[ 350.478408] ? do_sys_openat2+0x95/0xe0 <4>[ 350.478413] ? __x64_sys_openat+0x54/0xa0 <4>[ 350.478418] ? do_syscall_64+0x1e4/0x1470 <4>[ 350.478419] ? exc_page_fault+0xbb/0x260 <4>[ 350.478423] entry_SYSCALL_64_after_hwframe+0x76/0x7e <4>[ 350.478425] RIP: 0033:0x701fb5f1c5a4 <4>[ 350.478428] Code: c7 00 16 00 00 00 b8 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 f3 0f 1e fa 80 3d a5 ea 0e 00 00 74 13 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 54 c3 0f 1f 00 55 48 89 e5 48 83 ec 20 48 89