Oops#2 Part16
<7>[ 265.675998] xe 0000:03:00.0: [drm:guc_print_params [xe]] Tile0: GT0: GuC param[10] = 0x00000000
<7>[ 265.676057] xe 0000:03:00.0: [drm:guc_print_params [xe]] Tile0: GT0: GuC param[11] = 0x00000000
<7>[ 265.676115] xe 0000:03:00.0: [drm:guc_print_params [xe]] Tile0: GT0: GuC param[12] = 0x00000000
<7>[ 265.676172] xe 0000:03:00.0: [drm:guc_print_params [xe]] Tile0: GT0: GuC param[13] = 0x00000000
<7>[ 265.676253] xe 0000:03:00.0: [drm:xe_guc_id_mgr_init [xe]] Tile0: GT0: using 65535 GuC IDs
<7>[ 265.676333] xe 0000:03:00.0: [drm:xe_guc_db_mgr_init [xe]] Tile0: GT0: using 256 doorbells
<7>[ 265.677478] xe 0000:03:00.0: [drm:guc_buf_cache_init [xe]] Tile0: GT0: reusable buffer with 2097152 dwords at 0x627000 for xe_guc_buf_cache_init_with_size [xe]
<7>[ 265.678317] xe 0000:03:00.0: [drm:xe_migrate_init [xe]] Migrate min chunk size is 0x00010000
<7>[ 265.679316] xe 0000:03:00.0: [drm:xe_guc_capture_steered_list_init [xe]] Tile0: GT0: capture found 120 ext-regs.
<7>[ 265.700292] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<7>[ 265.716230] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 15ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034
<7>[ 265.716502] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled
<7>[ 265.717471] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC rcs0 WA job: 4146 dwords
<7>[ 265.717607] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x5588] = 0x04000400
Oops#2 Part15
<7>[ 265.717729] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6204] = 0x01400140
<7>[ 265.717846] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6208] = 0x00200020
<7>[ 265.717956] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x62a8] = 0x02400240
<7>[ 265.718038] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x7010] = 0x40004000
<7>[ 265.718115] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x7044] = 0x04200420
<7>[ 265.718192] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x7300] = 0x10001000
<7>[ 265.718274] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x83a8] = 0x20002000
<7>[ 265.718363] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6210] = ~0x3f18000|0x3f18000 (MCR)
<7>[ 265.720157] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC bcs0 WA job: 27 dwords
<7>[ 265.720263] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x22204] = ~0x7e7e|0x606
<7>[ 265.720359] xe 0000:03:00.0: [drm:xe_lrc_emit_hwe_state_instructions [xe]] Tile0: GT0: No non-register state to emit on graphics ver 20.01
<7>[ 265.722663] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC ccs0 WA job: 0 dwords
<7>[ 265.722798] xe 0000:03:00.0: [drm:xe_lrc_emit_hwe_state_instructions [xe]] Tile0: GT0: No non-register state to emit on graphics ver 20.01
<5>[ 265.725092] FAULT_INJECTION: forcing a failure.
<5>[ 265.725092] name fail_function, interval 0, probability 100, space 1, times 100
Oops#2 Part14
<3>[ 265.725103] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: GuC PC query task state failed: -ENOMEM
<4>[ 265.725361] ------------[ cut here ]------------
<4>[ 265.725363] xe 0000:03:00.0: [drm] Assertion `ct->g2h_outstanding == 0 || state == XE_GUC_CT_STATE_STOPPED` failed!
<4>[ 265.725363] platform: BATTLEMAGE subplatform: 7
<4>[ 265.725363] graphics: Xe2_HPG 20.01 step A0
<4>[ 265.725363] media: Xe2_HPM 13.01 step A1
<4>[ 265.725363] tile: 0 VRAM 12.0 GiB
<4>[ 265.725363] GT: 0 type 1
<4>[ 265.725367] WARNING: drivers/gpu/drm/xe/xe_guc_ct.c:527 at guc_ct_change_state+0x279/0x350 [xe], CPU#4: xe_fault_inject/4923
<4>[ 265.725484] Modules linked in: pmt_crashlog snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_lb mei_gsc_proxy mei_gsc mtd_intel_dg xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp coretemp cmdlinepart spi_nor mtd hid_generic asus_nb_wmi asus_wmi mei_pxp kvm_intel sparse_keymap mei_hdcp platform_profile wmi_bmof kvm snd_intel_dspcfg binfmt_misc snd_hda_codec irqbypass snd_hda_core ghash_clmulni_intel snd_hwdep aesni_intel usbhid snd_pcm r8169 rapl hid intel_cstate snd_timer i2c_i801 spi_intel_pci i2c_mux snd i2c_smbus spi_intel realtek soundcore nls_iso8859_1 idma64 video intel_pmc_core pmt_telemetry mei_me pmt_discovery pmt_class mei intel_pmc_ssram_telemetry wmi intel_vsec acpi_pad pinctrl_alderlake acpi_tad dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink
Oops#2 Part13
<4>[ 265.725555] autofs4 [last unloaded: snd_hda_intel]
<4>[ 265.725559] CPU: 4 UID: 0 PID: 4923 Comm: xe_fault_inject Tainted: G S U W N 6.19.0-lgci-xe-xe-4576-cc2c646d39200973c-debug+ #1 PREEMPT(voluntary)
<4>[ 265.725563] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN, [N]=TEST
<4>[ 265.725564] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4>[ 265.725565] RIP: 0010:guc_ct_change_state+0x2ed/0x350 [xe]
<4>[ 265.725667] Code: 1f 85 eb 51 48 c1 ea 25 44 6b ca 64 44 29 c9 51 48 c7 c1 f0 43 24 a1 52 ff 75 b0 44 8b 4d 94 4c 8b 45 88 48 8b 95 78 ff ff ff <67> 48 0f b9 3a 8b 8b 48 01 00 00 48 83 c4 60 85 c9 75 13 44 89 bb
Oops#2 Part12
<4>[ 265.725669] RSP: 0018:ffffc90003cb3570 EFLAGS: 00010002
<4>[ 265.725672] RAX: ffffffffa12b8ba1 RBX: ffff888103310738 RCX: ffffffffa12443f0
<4>[ 265.725674] RDX: ffff888103c46790 RSI: ffffffffa12b8ba1 RDI: ffffffffa0c02ef0
<4>[ 265.725675] RBP: ffffc90003cb3658 R08: ffffffffa12b8bf1 R09: 0000000000000007
<4>[ 265.725676] R10: 0000000000000001 R11: 0000000000000514 R12: ffff888103310740
<4>[ 265.725678] R13: ffff8881033107d0 R14: 0000000000000515 R15: 0000000000000001
<4>[ 265.725679] FS: 00007fd37e326980(0000) GS:ffff8888daeda000(0000) knlGS:0000000000000000
<4>[ 265.725681] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4>[ 265.725682] CR2: 000057e031c2a008 CR3: 0000000219400001 CR4: 0000000000f72ef0
<4>[ 265.725684] PKRU: 55555554
<4>[ 265.725685] Call Trace:
<4>[ 265.725686]
<4>[ 265.725696] ? xe_guc_submit_enable+0xa8/0xf0 [xe]
<4>[ 265.725805] xe_guc_ct_disable+0x17/0x80 [xe]
<4>[ 265.725904] xe_guc_sanitize+0x2a/0x50 [xe]
<4>[ 265.726015] xe_uc_load_hw+0x19a/0x2b0 [xe]
<4>[ 265.726152] ? xe_migrate_init+0x277/0x2d0 [xe]
<4>[ 265.726276] xe_gt_init+0x35d/0xab0 [xe]
<4>[ 265.726376] ? trace_hardirqs_on+0x63/0xd0
<4>[ 265.726381] ? _raw_spin_unlock_irqrestore+0x51/0x80
<4>[ 265.726385] ? __devm_add_action+0x70/0xa0
<4>[ 265.726391] ? xe_irq_install+0x11a/0x490 [xe]
<4>[ 265.726508] xe_device_probe+0x3c5/0xc10 [xe]
<4>[ 265.726600] ? __drm_dev_dbg+0x7d/0xb0
<4>[ 265.726605] ? __drmm_add_action_or_reset+0x1e/0x50
<4>[ 265.726613] xe_pci_probe+0x396/0x610 [xe]
<4>[ 265.726729] local_pci_probe+0x47/0xb0
Oops#2 Part11
<4>[ 265.726734] pci_device_probe+0xf3/0x260
<4>[ 265.726740] really_probe+0xf1/0x410
<4>[ 265.726744] __driver_probe_device+0x8c/0x190
<4>[ 265.726747] device_driver_attach+0x57/0xd0
<4>[ 265.726751] bind_store+0x77/0xd0
<4>[ 265.726756] drv_attr_store+0x24/0x50
<4>[ 265.726759] sysfs_kf_write+0x4d/0x80
<4>[ 265.726763] kernfs_fop_write_iter+0x188/0x240
<4>[ 265.726768] vfs_write+0x283/0x540
<4>[ 265.726778] ksys_write+0x6f/0xf0
<4>[ 265.726782] __x64_sys_write+0x19/0x30
<4>[ 265.726785] x64_sys_call+0x79/0x26b0
<4>[ 265.726788] do_syscall_64+0x93/0x1470
<4>[ 265.726791] ? do_syscall_64+0x1e4/0x1470
<4>[ 265.726795] ? fput_close_sync+0x3d/0xa0
<4>[ 265.726798] ? __x64_sys_close+0x3e/0x90
<4>[ 265.726802] ? do_syscall_64+0x1e4/0x1470
<4>[ 265.726807] ? __mutex_unlock_slowpath+0x40/0x340
<4>[ 265.726810] ? lock_release+0xce/0x280
<4>[ 265.726817] ? mutex_unlock+0x12/0x20
<4>[ 265.726819] ? __f_unlock_pos+0x15/0x20
<4>[ 265.726823] ? __x64_sys_getdents64+0x9a/0x130
<4>[ 265.726825] ? __pfx_filldir64+0x10/0x10
<4>[ 265.726832] ? do_syscall_64+0x1e4/0x1470
<4>[ 265.726835] ? __do_sys_newfstat+0x3c/0x70
<4>[ 265.726849] ? do_syscall_64+0x1e4/0x1470
<4>[ 265.726852] ? do_syscall_64+0x1e4/0x1470
<4>[ 265.726855] entry_SYSCALL_64_after_hwframe+0x76/0x7e
<4>[ 265.726858] RIP: 0033:0x7fd38051c5a4
<4>[ 265.726861] Code: c7 00 16 00 00 00 b8 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 f3 0f 1e fa 80 3d a5 ea 0e 00 00 74 13 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 54 c3 0f 1f 00 55 48 89 e5 48 83 ec 20 48 89
<4>[ 265.726862] RSP: 002b:00007ffc09d06b38 EFLAGS: 00000202 ORIG_RAX: 0000000000000001
Oops#2 Part10
<4>[ 265.726865] RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 00007fd38051c5a4
<4>[ 265.726866] RDX: 000000000000000c RSI: 00007ffc09d07000 RDI: 0000000000000007
<4>[ 265.726867] RBP: 000000000000000c R08: 0000000000000073 R09: 0000000000000000
<4>[ 265.726869] R10: 0000000000000000 R11: 0000000000000202 R12: 00007ffc09d07000
<4>[ 265.726870] R13: 0000000000000007 R14: 0000000000000006 R15: 00007ffc09d06cb0
<4>[ 265.726879]
<4>[ 265.726880] irq event stamp: 1256016
<4>[ 265.726882] hardirqs last enabled at (1256015): [] _raw_spin_unlock_irqrestore+0x51/0x80
<4>[ 265.726885] hardirqs last disabled at (1256016): [] _raw_spin_lock_irq+0x6f/0x80
<4>[ 265.726887] softirqs last enabled at (1255508): [] __irq_exit_rcu+0x13f/0x160
<4>[ 265.726891] softirqs last disabled at (1255501): [] __irq_exit_rcu+0x13f/0x160
<4>[ 265.726894] ---[ end trace 0000000000000000 ]---
<7>[ 265.726896] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<3>[ 265.727108] xe 0000:03:00.0: probe with driver xe failed with error -12
<3>[ 265.727733] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: GuC RC setup HOST_CONTROL(0) failed (-ENODEV)
<7>[ 265.727966] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7>[ 265.729064] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7>[ 265.829717] xe 0000:03:00.0: [drm:drm_pagemap_cache_fini [drm_gpusvm_helper]] Destroying dpagemap cache.
Oops#2 Part9
<7>[ 265.831552] xe 0000:03:00.0: [drm:drm_pagemap_shrinker_fini [drm_gpusvm_helper]] Destroying dpagemap shrinker.
<3>[ 267.990848] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=39 recv=0
<1>[ 267.992774] BUG: unable to handle page fault for address: ffffc9000838a188
<1>[ 267.992807] #PF: supervisor write access in kernel mode
<1>[ 267.992835] #PF: error_code(0x0002) - not-present page
<6>[ 267.992862] PGD 100000067 P4D 100000067 PUD 100ad2067 PMD 0
<4>[ 267.992910] Oops: Oops: 0002 [#1] SMP NOPTI
<4>[ 267.992940] CPU: 10 UID: 0 PID: 3978 Comm: kworker/10:5 Tainted: G S U W N 6.19.0-lgci-xe-xe-4576-cc2c646d39200973c-debug+ #1 PREEMPT(voluntary)
<4>[ 267.992990] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN, [N]=TEST
<4>[ 267.993009] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4>[ 267.993033] Workqueue: xe-destroy-wq __guc_exec_queue_destroy_async [xe]
<4>[ 267.993737] RIP: 0010:xe_mmio_write32+0x58/0x280 [xe]
<4>[ 267.994502] Code: 24 66 90 65 8b 05 1c 84 0a e3 48 0f a3 05 c0 9e ad e2 0f 82 ee 00 00 00 41 f7 c5 00 00 00 01 0f 84 88 00 00 00 49 03 5c 24 08 <44> 89 3b 48 8d 65 d8 5b 41 5c 41 5d 41 5e 41 5f 5d 31 c0 31 d2 31
<4>[ 267.994565] RSP: 0018:ffffc90003593830 EFLAGS: 00010086
<4>[ 267.994594] RAX: 0000000000000002 RBX: ffffc9000838a188 RCX: 0000000000000000
<4>[ 267.994622] RDX: 0000000000010001 RSI: 000000000000a188 RDI: ffff8881216d8060
<4>[ 267.994651] RBP: ffffc900035938a8 R08: 0000000000000000 R09: 0000000000000000
<4>[ 267.994679] R10: ffff88827e2f8000 R11: 0000000000000001 R12: ffff8881216d8060
Oops#2 Part8
<4>[ 267.994707] R13: 000000000000a188 R14: ffff88827e2f8000 R15: 0000000000010001
<4>[ 267.994735] FS: 0000000000000000(0000) GS:ffff8888db1da000(0000) knlGS:0000000000000000
<4>[ 267.994769] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4>[ 267.994793] CR2: ffffc9000838a188 CR3: 0000000003448002 CR4: 0000000000f72ef0
<4>[ 267.994822] PKRU: 55555554
<4>[ 267.994838] Call Trace:
<4>[ 267.994853]
<4>[ 267.994883] xe_force_wake_get+0x415/0x950 [xe]
<4>[ 267.995325] ? _raw_spin_unlock_irqrestore+0x27/0x80
<4>[ 267.995335] send_tlb_inval_ggtt+0xfa/0x270 [xe]
<4>[ 267.995471] ? trace_hardirqs_on+0x63/0xd0
<4>[ 267.995476] ? _raw_spin_unlock_irq+0x27/0x70
<4>[ 267.995480] ? xe_tlb_inval_fence_prep+0xbf/0x1a0 [xe]
<4>[ 267.995589] xe_tlb_inval_ggtt+0x73/0x250 [xe]
<4>[ 267.995675] ? find_held_lock+0x31/0x90
<4>[ 267.995679] ? ggtt_node_remove+0xcb/0x140 [xe]
<4>[ 267.995753] ggtt_invalidate_gt_tlb.part.0+0x1f/0xb0 [xe]
<4>[ 267.995825] ggtt_node_remove+0x12c/0x140 [xe]
<4>[ 267.995895] xe_ggtt_node_remove+0x40/0xa0 [xe]
<4>[ 267.995965] xe_ggtt_remove_bo+0x87/0x250 [xe]
<4>[ 267.996034] ? _raw_write_unlock+0x22/0x50
<4>[ 267.996037] ? drm_vma_offset_remove+0x65/0x80
<4>[ 267.996042] xe_ttm_bo_destroy+0xa2/0x2d0 [xe]
<4>[ 267.996111] ? lock_is_held_type+0xa3/0x130
<4>[ 267.996115] ttm_bo_release+0x70/0x330 [ttm]
<4>[ 267.996122] ? xe_ggtt_might_lock+0x29/0x60 [xe]
<4>[ 267.996191] ? lock_release+0xce/0x280
<4>[ 267.996194] ttm_bo_fini+0x3c/0x70 [ttm]
<4>[ 267.996200] xe_gem_object_free+0x1a/0x30 [xe]
<4>[ 267.996271] drm_gem_object_free+0x1d/0x40