Oops#1 Part9
<7>[ 279.258585] xe 0000:03:00.0: [drm:xe_guc_db_mgr_init [xe]] Tile0: GT0: using 256 doorbells
<7>[ 279.259730] xe 0000:03:00.0: [drm:guc_buf_cache_init [xe]] Tile0: GT0: reusable buffer with 2097152 dwords at 0x627000 for xe_guc_buf_cache_init_with_size [xe]
<7>[ 279.260658] xe 0000:03:00.0: [drm:xe_migrate_init [xe]] Migrate min chunk size is 0x00010000
<7>[ 279.261657] xe 0000:03:00.0: [drm:xe_guc_capture_steered_list_init [xe]] Tile0: GT0: capture found 120 ext-regs.
<7>[ 279.283691] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 45056)
<7>[ 279.294495] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 10ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034
<7>[ 279.294792] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled
<7>[ 279.295414] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC rcs0 WA job: 4138 dwords
<7>[ 279.295495] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x5588] = 0x04000400
<7>[ 279.295563] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6204] = 0x01400140
<7>[ 279.295628] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6208] = 0x00200020
<7>[ 279.295694] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x62a8] = 0x02400240
<7>[ 279.295759] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x7010] = 0x40004000
<7>[ 279.295822] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x7300] = 0x10001000
Oops#1 Part8
<7>[ 279.295888] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x83a8] = 0x20002000
<7>[ 279.295958] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x6210] = ~0x3f18000|0x3f18000
<7>[ 279.297608] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC bcs0 WA job: 27 dwords
<7>[ 279.297678] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: REG[0x22204] = ~0x7e7e|0x606
<7>[ 279.297742] xe 0000:03:00.0: [drm:xe_lrc_emit_hwe_state_instructions [xe]] Tile0: GT0: No non-register state to emit on graphics ver 20.01
<7>[ 279.299114] xe 0000:03:00.0: [drm:xe_gt_record_default_lrcs [xe]] Tile0: GT0: LRC ccs0 WA job: 0 dwords
<7>[ 279.299191] xe 0000:03:00.0: [drm:xe_lrc_emit_hwe_state_instructions [xe]] Tile0: GT0: No non-register state to emit on graphics ver 20.01
<5>[ 279.300821] FAULT_INJECTION: forcing a failure.
<5>[ 279.300821] name fail_function, interval 0, probability 100, space 1, times 100
<3>[ 279.300842] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: GuC PC query task state failed: -ENOMEM
<4>[ 279.300936] ------------[ cut here ]------------
<4>[ 279.300937] xe 0000:03:00.0: [drm] Assertion `ct->g2h_outstanding == 0 || state == XE_GUC_CT_STATE_STOPPED` failed!
<4>[ 279.300937] platform: BATTLEMAGE subplatform: 7
<4>[ 279.300937] graphics: Xe2_HPG 20.01 step A0
<4>[ 279.300937] media: Xe2_HPM 13.01 step A1
<4>[ 279.300937] tile: 0 VRAM 12.0 GiB
<4>[ 279.300937] GT: 0 type 1
<4>[ 279.300940] WARNING: drivers/gpu/drm/xe/xe_guc_ct.c:526 at guc_ct_change_state+0x279/0x350 [xe], CPU#11: xe_fault_inject/4832
Oops#1 Part7
<4>[ 279.301011] Modules linked in: pmt_crashlog snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_gsc_proxy mei_lb mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_buddy drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp cmdlinepart coretemp spi_nor hid_generic eeepc_wmi asus_wmi mtd mei_pxp mei_hdcp sparse_keymap platform_profile wmi_bmof kvm_intel snd_hda_intel snd_intel_dspcfg snd_hda_codec binfmt_misc kvm snd_hda_core irqbypass ghash_clmulni_intel snd_hwdep aesni_intel usbhid snd_pcm video r8169 rapl intel_cstate hid snd_timer i2c_i801 snd spi_intel_pci i2c_mux i2c_smbus spi_intel soundcore realtek nls_iso8859_1 idma64 intel_pmc_core pmt_telemetry pmt_discovery mei_me pmt_class mei intel_pmc_ssram_telemetry pinctrl_alderlake wmi intel_vsec acpi_tad acpi_pad dm_multipath msr nvme_fabrics fuse
<4>[ 279.301075] efi_pstore nfnetlink autofs4
<4>[ 279.301080] CPU: 11 UID: 0 PID: 4832 Comm: xe_fault_inject Tainted: G S U W 6.19.0-rc5-lgci-xe-xe-4407-f464bded8836c6cc9+ #1 PREEMPT(voluntary)
<4>[ 279.301083] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4>[ 279.301085] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4>[ 279.301086] RIP: 0010:guc_ct_change_state+0x2ed/0x350 [xe]
<4>[ 279.301154] Code: 1f 85 eb 51 48 c1 ea 25 44 6b ca 64 44 29 c9 51 48 c7 c1 18 2d 18 a1 52 ff 75 b0 44 8b 4d 94 4c 8b 45 88 48 8b 95 78 ff ff ff <67> 48 0f b9 3a 8b 8b 48 01 00 00 48 83 c4 60 85 c9 75 13 44 89 bb
Oops#1 Part6
<4>[ 279.301156] RSP: 0018:ffffc9000648b7f0 EFLAGS: 00010002
<4>[ 279.301159] RAX: ffffffffa11f689d RBX: ffff8881279388a0 RCX: ffffffffa1182d18
<4>[ 279.301160] RDX: ffff888104ba0a90 RSI: ffffffffa11f689d RDI: ffffffffa1002f50
<4>[ 279.301161] RBP: ffffc9000648b8d8 R08: ffffffffa11f68e2 R09: 0000000000000007
<4>[ 279.301162] R10: 0000000000000001 R11: 0000000000000514 R12: ffff8881279388a8
<4>[ 279.301164] R13: ffff888127938938 R14: 0000000000000515 R15: 0000000000000001
<4>[ 279.301165] FS: 00007e15e66d3940(0000) GS:ffff8888db25d000(0000) knlGS:0000000000000000
<4>[ 279.301167] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4>[ 279.301168] CR2: 000000c000573000 CR3: 0000000200e02002 CR4: 0000000000f72ef0
<4>[ 279.301170] PKRU: 55555554
<4>[ 279.301171] Call Trace:
<4>[ 279.301172]
<4>[ 279.301181] ? xe_guc_submit_enable+0xa8/0xf0 [xe]
<4>[ 279.301256] xe_guc_ct_disable+0x17/0x80 [xe]
<4>[ 279.301384] xe_guc_sanitize+0x2a/0x50 [xe]
<4>[ 279.301512] xe_uc_load_hw+0x187/0x2a0 [xe]
<4>[ 279.301647] ? xe_migrate_init+0x277/0x2d0 [xe]
<4>[ 279.301780] xe_gt_init+0x363/0xab0 [xe]
<4>[ 279.301906] ? trace_hardirqs_on+0x63/0xd0
<4>[ 279.301911] ? _raw_spin_unlock_irqrestore+0x51/0x80
<4>[ 279.301916] ? __devm_add_action+0x70/0xa0
<4>[ 279.301922] ? xe_irq_install+0x11a/0x490 [xe]
<4>[ 279.302058] xe_device_probe+0x3cc/0xc10 [xe]
<4>[ 279.302183] ? __drm_dev_dbg+0x7d/0xb0
<4>[ 279.302190] ? __drmm_add_action_or_reset+0x1e/0x50
Oops#1 Part5
<4>[ 279.302199] xe_pci_probe+0x396/0x610 [xe]
<4>[ 279.302336] local_pci_probe+0x47/0xb0
<4>[ 279.302343] pci_device_probe+0xf3/0x260
<4>[ 279.302351] really_probe+0xf1/0x410
<4>[ 279.302357] __driver_probe_device+0x8c/0x190
<4>[ 279.302362] device_driver_attach+0x57/0xd0
<4>[ 279.302367] bind_store+0x77/0xd0
<4>[ 279.302373] drv_attr_store+0x24/0x50
<4>[ 279.302376] sysfs_kf_write+0x4d/0x80
<4>[ 279.302382] kernfs_fop_write_iter+0x188/0x240
<4>[ 279.302388] vfs_write+0x283/0x540
<4>[ 279.302391] ? __delete_object+0x60/0xa0
<4>[ 279.302403] ksys_write+0x6f/0xf0
<4>[ 279.302410] __x64_sys_write+0x19/0x30
<4>[ 279.302413] x64_sys_call+0x79/0x26b0
<4>[ 279.302417] do_syscall_64+0x93/0x1470
<4>[ 279.302420] ? fput_close_sync+0x3d/0xa0
<4>[ 279.302423] ? __x64_sys_close+0x3e/0x90
<4>[ 279.302429] ? do_syscall_64+0x1e4/0x1470
<4>[ 279.302431] ? do_syscall_64+0x1e4/0x1470
<4>[ 279.302434] ? do_syscall_64+0x1e4/0x1470
<4>[ 279.302436] ? do_syscall_64+0x1e4/0x1470
<4>[ 279.302439] ? do_syscall_64+0x1e4/0x1470
<4>[ 279.302441] ? do_syscall_64+0x1e4/0x1470
<4>[ 279.302443] ? exc_page_fault+0xbb/0x250
<4>[ 279.302448] entry_SYSCALL_64_after_hwframe+0x76/0x7e
<4>[ 279.302451] RIP: 0033:0x7e15e891c5a4
<4>[ 279.302455] Code: c7 00 16 00 00 00 b8 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 f3 0f 1e fa 80 3d a5 ea 0e 00 00 74 13 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 54 c3 0f 1f 00 55 48 89 e5 48 83 ec 20 48 89
<4>[ 279.302457] RSP: 002b:00007ffec97ec218 EFLAGS: 00000202 ORIG_RAX: 0000000000000001
<4>[ 279.302460] RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 00007e15e891c5a4
Oops#1 Part4
<4>[ 279.302462] RDX: 000000000000000c RSI: 00007ffec97ed6d0 RDI: 000000000000000b
<4>[ 279.302463] RBP: 000000000000000c R08: 0000000000000073 R09: 0000000000000000
<4>[ 279.302465] R10: 0000000000000000 R11: 0000000000000202 R12: 00007ffec97ed6d0
<4>[ 279.302467] R13: 000000000000000b R14: 0000622734a6835b R15: 00007ffec97ed380
<4>[ 279.302480]
<4>[ 279.302482] irq event stamp: 819000
<4>[ 279.302483] hardirqs last enabled at (818999): [] _raw_spin_unlock_irqrestore+0x51/0x80
<4>[ 279.302486] hardirqs last disabled at (819000): [] _raw_spin_lock_irq+0x6f/0x80
<4>[ 279.302489] softirqs last enabled at (818896): [] __irq_exit_rcu+0x13f/0x160
<4>[ 279.302492] softirqs last disabled at (818891): [] __irq_exit_rcu+0x13f/0x160
<4>[ 279.302495] ---[ end trace 0000000000000000 ]---
<7>[ 279.302498] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<3>[ 279.302724] xe 0000:03:00.0: probe with driver xe failed with error -12
<3>[ 279.303405] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: GuC RC enable mode=0 failed: -ENODEV
<7>[ 279.303768] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7>[ 279.304841] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7>[ 279.399890] xe 0000:03:00.0: [drm:drm_pagemap_cache_fini [drm_gpusvm_helper]] Destroying dpagemap cache.
<7>[ 279.404092] xe 0000:03:00.0: [drm:drm_pagemap_shrinker_fini [drm_gpusvm_helper]] Destroying dpagemap shrinker.
Oops#1 Part3
<3>[ 281.566038] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=38 recv=0
<1>[ 281.567130] BUG: unable to handle page fault for address: ffffc9000c38a188
<1>[ 281.567156] #PF: supervisor write access in kernel mode
<1>[ 281.567169] #PF: error_code(0x0002) - not-present page
<6>[ 281.567180] PGD 100000067 P4D 100000067 PUD 100aba067 PMD 0
<4>[ 281.567200] Oops: Oops: 0002 [#1] SMP NOPTI
<4>[ 281.567216] CPU: 2 UID: 0 PID: 347 Comm: kworker/2:2 Tainted: G S U W 6.19.0-rc5-lgci-xe-xe-4407-f464bded8836c6cc9+ #1 PREEMPT(voluntary)
<4>[ 281.567244] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4>[ 281.567255] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4>[ 281.567273] Workqueue: xe-destroy-wq __guc_exec_queue_destroy_async [xe]
<4>[ 281.567768] RIP: 0010:xe_mmio_write32+0x58/0x280 [xe]
<4>[ 281.568244] Code: 24 66 90 65 8b 05 6c 60 2a e3 48 0f a3 05 10 95 cd e2 0f 82 ee 00 00 00 41 f7 c5 00 00 00 01 0f 84 88 00 00 00 49 03 5c 24 08 <44> 89 3b 48 8d 65 d8 5b 41 5c 41 5d 41 5e 41 5f 5d 31 c0 31 d2 31
<4>[ 281.568280] RSP: 0018:ffffc90001adf830 EFLAGS: 00010086
<4>[ 281.568298] RAX: 0000000000000002 RBX: ffffc9000c38a188 RCX: 0000000000000000
<4>[ 281.568315] RDX: 0000000000010001 RSI: 000000000000a188 RDI: ffff88817aa101c8
<4>[ 281.568333] RBP: ffffc90001adf8a8 R08: 0000000000000000 R09: 0000000000000000
<4>[ 281.568349] R10: ffff88820a428000 R11: 0000000000000001 R12: ffff88817aa101c8
<4>[ 281.568366] R13: 000000000000a188 R14: ffff88820a428000 R15: 0000000000010001
Oops#1 Part2
<4>[ 281.568383] FS: 0000000000000000(0000) GS:ffff8888daddd000(0000) knlGS:0000000000000000
<4>[ 281.568404] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4>[ 281.568419] CR2: ffffc9000c38a188 CR3: 0000000003448001 CR4: 0000000000f72ef0
<4>[ 281.568437] PKRU: 55555554
<4>[ 281.568447] Call Trace:
<4>[ 281.568458]
<4>[ 281.568477] xe_force_wake_get+0x417/0x950 [xe]
<4>[ 281.568877] ? _raw_spin_unlock_irqrestore+0x27/0x80
<4>[ 281.568905] send_tlb_inval_ggtt+0xfa/0x270 [xe]
<4>[ 281.569338] ? trace_hardirqs_on+0x63/0xd0
<4>[ 281.569358] ? _raw_spin_unlock_irq+0x27/0x70
<4>[ 281.569373] ? xe_tlb_inval_fence_prep+0xbf/0x1a0 [xe]
<4>[ 281.569883] xe_tlb_inval_ggtt+0x73/0x250 [xe]
<4>[ 281.570383] ? find_held_lock+0x31/0x90
<4>[ 281.570400] ? ggtt_node_remove+0xc4/0x140 [xe]
<4>[ 281.570803] ggtt_invalidate_gt_tlb.part.0+0x1f/0xb0 [xe]
<4>[ 281.571195] ggtt_node_remove+0x122/0x140 [xe]
<4>[ 281.571586] xe_ggtt_node_remove+0x40/0xa0 [xe]
<4>[ 281.571974] xe_ggtt_remove_bo+0x87/0x250 [xe]
<4>[ 281.572363] ? _raw_write_unlock+0x22/0x50
<4>[ 281.572379] ? drm_vma_offset_remove+0x65/0x80
<4>[ 281.572397] xe_ttm_bo_destroy+0xa2/0x2d0 [xe]
<4>[ 281.572469] ? lock_is_held_type+0xa3/0x130
<4>[ 281.572473] ttm_bo_release+0x70/0x330 [ttm]
<4>[ 281.572480] ? xe_ggtt_might_lock+0x29/0x60 [xe]
<4>[ 281.572547] ? lock_release+0xce/0x280
<4>[ 281.572551] ttm_bo_fini+0x3c/0x70 [ttm]
<4>[ 281.572557] xe_gem_object_free+0x1a/0x30 [xe]
<4>[ 281.572623] drm_gem_object_free+0x1d/0x40
<4>[ 281.572626] xe_bo_put+0x12a/0x190 [xe]
<4>[ 281.572693] xe_lrc_destroy+0x47/0x60 [xe]
Oops#1 Part1
<4>[ 281.572770] xe_exec_queue_fini+0x85/0xd0 [xe]
<4>[ 281.572838] __guc_exec_queue_destroy_async+0x6c/0x170 [xe]
<4>[ 281.572911] process_one_work+0x22e/0x6b0
<4>[ 281.572915] worker_thread+0x1e8/0x3d0
<4>[ 281.572918] ? __pfx_worker_thread+0x10/0x10
<4>[ 281.572920] kthread+0x11f/0x250
<4>[ 281.572924] ? __pfx_kthread+0x10/0x10
<4>[ 281.572927] ret_from_fork+0x344/0x3a0
<4>[ 281.572930] ? __pfx_kthread+0x10/0x10
<4>[ 281.572933] ret_from_fork_asm+0x1a/0x30
<4>[ 281.572939]
<4>[ 281.572940] Modules linked in: pmt_crashlog snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_gsc_proxy mei_lb mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_buddy drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp cmdlinepart coretemp spi_nor hid_generic eeepc_wmi asus_wmi mtd mei_pxp mei_hdcp sparse_keymap platform_profile wmi_bmof kvm_intel snd_hda_intel snd_intel_dspcfg snd_hda_codec binfmt_misc kvm snd_hda_core irqbypass ghash_clmulni_intel snd_hwdep aesni_intel usbhid snd_pcm video r8169 rapl intel_cstate hid snd_timer i2c_i801 snd spi_intel_pci i2c_mux i2c_smbus spi_intel soundcore realtek nls_iso8859_1 idma64 intel_pmc_core pmt_telemetry pmt_discovery mei_me pmt_class mei intel_pmc_ssram_telemetry pinctrl_alderlake wmi intel_vsec acpi_tad acpi_pad dm_multipath msr nvme_fabrics fuse
<4>[ 281.572975] efi_pstore nfnetlink autofs4
<4>[ 281.573002] CR2: ffffc9000c38a188
<4>[ 281.573005] ---[ end trace 0000000000000000 ]---