Result: 24 Warning(s)
i915_display_info15 igt_runner15 results15.json results15-xe-load.json guc_logs15.tar i915_display_info_post_exec15 boot15 dmesg15
| Detail | Value |
|---|---|
| Duration | unknown |
| Hostname |
shard-bmg-2 |
| Igt-Version |
IGT-Version: 2.4-g93abaf017 (x86_64) (Linux: 7.0.0-lgci-xe-xe-4896-773e7de886203adcc-debug+ x86_64) |
| Out |
Using IGT_SRANDOM=1776117810 for randomisation Opened device: /dev/dri/card0 Starting subtest: render-square Starting dynamic subtest: render-linear-256x256 Stack trace: #0 ../lib/igt_core.c:2075 __igt_fail_assert() #1 ../lib/xe/xe_ioctl.c:556 xe_bo_mmap_offset() #2 ../lib/xe/xe_ioctl.c:564 xe_bo_map() #3 ../lib/intel_batchbuffer.c:2518 __xe_bb_exec() #4 ../lib/intel_batchbuffer.c:2772 intel_bb_exec() #5 ../lib/rendercopy_gen9.c:1340 _gen9_render_op() #6 ../lib/rendercopy_gen9.c:1414 xe2_render_copyfunc() #7 ../tests/intel/xe_render_copy.c:349 render.isra.0() #8 ../tests/intel/xe_render_copy.c:721 __igt_unique____real_main681() #9 ../tests/intel/xe_render_copy.c:681 main() #10 [__libc_init_first+0x8a] #11 [__libc_start_main+0x8b] #12 [_start+0x25] Dynamic subtest render-linear-256x256: FAIL (20.840s) Stack trace: #0 ../lib/igt_core.c:2075 __igt_fail_assert() #1 ../lib/intel_chipset.c:155 intel_get_drm_devid() #2 ../lib/intel_blt.c:511 render_supports_tiling() #3 ../tests/intel/xe_render_copy.c:715 __igt_unique____real_main681() #4 ../tests/intel/xe_render_copy.c:681 main() #5 [__libc_init_first+0x8a] #6 [__libc_start_main+0x8b] #7 [_start+0x25] Subtest render-square: FAIL (20.844s) This test caused an abort condition: Kernel badly tainted (0x4244, 0x200) (check dmesg for details): TAINT_WARN: WARN_ON has happened. |
| Err |
Starting subtest: render-square
Starting dynamic subtest: render-linear-256x256
(xe_render_copy:10805) xe/xe_ioctl-CRITICAL: Test assertion failure function xe_bo_mmap_offset, file ../lib/xe/xe_ioctl.c:553:
(xe_render_copy:10805) xe/xe_ioctl-CRITICAL: Failed assertion: igt_ioctl(fd, (((2U|1U) << (((0+8)+8)+14)) | ((('d')) << (0+8)) | (((0x40 + 0x02)) << 0) | ((((sizeof(struct drm_xe_gem_mmap_offset)))) << ((0+8)+8))), &mmo) == 0
(xe_render_copy:10805) xe/xe_ioctl-CRITICAL: Last errno: 125, Operation canceled
(xe_render_copy:10805) xe/xe_ioctl-CRITICAL: error: -1 != 0
Dynamic subtest render-linear-256x256 failed.
**** DEBUG ****
(xe_render_copy:10805) intel_allocator_simple-DEBUG: Using simple allocator
(xe_render_copy:10805) intel_bufops-DEBUG: Test requirement passed: fn
(xe_render_copy:10805) intel_batchbuffer-DEBUG: Run on DRM_XE_ENGINE_CLASS_RENDER
(xe_render_copy:10805) xe/xe_ioctl-CRITICAL: Test assertion failure function xe_bo_mmap_offset, file ../lib/xe/xe_ioctl.c:553:
(xe_render_copy:10805) xe/xe_ioctl-CRITICAL: Failed assertion: igt_ioctl(fd, (((2U|1U) << (((0+8)+8)+14)) | ((('d')) << (0+8)) | (((0x40 + 0x02)) << 0) | ((((sizeof(struct drm_xe_gem_mmap_offset)))) << ((0+8)+8))), &mmo) == 0
(xe_render_copy:10805) xe/xe_ioctl-CRITICAL: Last errno: 125, Operation canceled
(xe_render_copy:10805) xe/xe_ioctl-CRITICAL: error: -1 != 0
(xe_render_copy:10805) igt_core-INFO: Stack trace:
(xe_render_copy:10805) igt_core-INFO: #0 ../lib/igt_core.c:2075 __igt_fail_assert()
(xe_render_copy:10805) igt_core-INFO: #1 ../lib/xe/xe_ioctl.c:556 xe_bo_mmap_offset()
(xe_render_copy:10805) igt_core-INFO: #2 ../lib/xe/xe_ioctl.c:564 xe_bo_map()
(xe_render_copy:10805) igt_core-INFO: #3 ../lib/intel_batchbuffer.c:2518 __xe_bb_exec()
(xe_render_copy:10805) igt_core-INFO: #4 ../lib/intel_batchbuffer.c:2772 intel_bb_exec()
(xe_render_copy:10805) igt_core-INFO: #5 ../lib/rendercopy_gen9.c:1340 _gen9_render_op()
(xe_render_copy:10805) igt_core-INFO: #6 ../lib/rendercopy_gen9.c:1414 xe2_render_copyfunc()
(xe_render_copy:10805) igt_core-INFO: #7 ../tests/intel/xe_render_copy.c:349 render.isra.0()
(xe_render_copy:10805) igt_core-INFO: #8 ../tests/intel/xe_render_copy.c:721 __igt_unique____real_main681()
(xe_render_copy:10805) igt_core-INFO: #9 ../tests/intel/xe_render_copy.c:681 main()
(xe_render_copy:10805) igt_core-INFO: #10 [__libc_init_first+0x8a]
(xe_render_copy:10805) igt_core-INFO: #11 [__libc_start_main+0x8b]
(xe_render_copy:10805) igt_core-INFO: #12 [_start+0x25]
**** END ****
Dynamic subtest render-linear-256x256: FAIL (20.840s)
(xe_render_copy:10805) intel_chipset-CRITICAL: Test assertion failure function intel_get_drm_devid, file ../lib/intel_chipset.c:145:
(xe_render_copy:10805) intel_chipset-CRITICAL: Failed assertion: is_intel_device(fd)
(xe_render_copy:10805) intel_chipset-CRITICAL: Last errno: 125, Operation canceled
Subtest render-square: FAIL (20.844s)
|
| Dmesg |
<6> [610.262755] Console: switching to colour dummy device 80x25
<6> [610.263140] [IGT] xe_render_copy: executing
<6> [610.273477] [IGT] xe_render_copy: starting subtest render-square
<6> [610.273888] [IGT] xe_render_copy: starting dynamic subtest render-linear-256x256
<7> [612.112516] xe 0000:03:00.0: [drm:intel_power_well_enable [xe]] enabling AUX_TC2
<7> [612.216634] xe 0000:03:00.0: [drm:intel_power_well_disable [xe]] disabling AUX_TC2
<3> [612.561948] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=54292 recv=54291
<3> [614.865636] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=54292 recv=54291
<6> [614.985232] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [614.985258] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [614.985268] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [614.985278] nvme 0000:05:00.0: [ 0] RxErr (First)
<7> [615.377999] xe 0000:03:00.0: [drm:xe_hw_engine_snapshot_capture [xe]] Tile0: GT0: Proceeding with manual engine snapshot
<4> [615.378450] xe 0000:03:00.0: [drm] Tile0: GT0: Check job timeout: seqno=61213, lrc_seqno=61213, guc_id=0, not started
<3> [617.169537] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=54293 recv=54291
<3> [619.474048] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=54293 recv=54291
<4> [620.500815] xe 0000:03:00.0: [drm] Tile0: GT0: Check job timeout: seqno=61213, lrc_seqno=61213, guc_id=0, not started
<7> [620.935842] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x2c2c2929
<7> [620.936604] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x2c2c2b2c
<3> [621.778562] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=54294 recv=54291
<3> [624.082944] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=54294 recv=54291
<7> [624.564006] xe 0000:03:00.0: [drm:drm_dp_dpcd_access [drm_display_helper]] AUX USBC1/DDI TC1/PHY F: Too many retries, giving up. First error: -6
<7> [624.570429] xe 0000:03:00.0: [drm:intel_power_well_enable [xe]] enabling AUX_TC2
<7> [624.675214] xe 0000:03:00.0: [drm:intel_power_well_disable [xe]] disabling AUX_TC2
<7> [624.723949] xe 0000:03:00.0: [drm:drm_dp_dpcd_access [drm_display_helper]] AUX USBC4/DDI TC4/PHY I: Too many retries, giving up. First error: -6
<4> [625.619750] xe 0000:03:00.0: [drm] Tile0: GT0: Check job timeout: seqno=61213, lrc_seqno=61213, guc_id=0, not started
<6> [627.981540] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [627.981567] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [627.981577] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [627.981586] nvme 0000:05:00.0: [ 0] RxErr (First)
<7> [630.122587] xe 0000:03:00.0: [drm:drm_dp_dpcd_access [drm_display_helper]] AUX USBC1/DDI TC1/PHY F: Too many retries, giving up. First error: -6
<7> [630.123146] xe 0000:03:00.0: [drm:intel_power_well_enable [xe]] enabling AUX_TC2
<7> [630.228855] xe 0000:03:00.0: [drm:intel_power_well_disable [xe]] disabling AUX_TC2
<7> [630.273546] xe 0000:03:00.0: [drm:drm_dp_dpcd_access [drm_display_helper]] AUX USBC4/DDI TC4/PHY I: Too many retries, giving up. First error: -6
<4> [630.741080] xe 0000:03:00.0: [drm] Tile0: GT0: Schedule disable failed to respond, guc_id=0
<6> [630.929878] xe 0000:03:00.0: [drm] Xe device coredump has been created
<6> [630.929908] xe 0000:03:00.0: [drm] Check your /sys/class/drm/card0/device/devcoredump/data
<6> [630.929911] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [630.930016] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<3> [630.930379] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=54295 recv=54291
<6> [630.939280] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<7> [630.939435] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [630.939934] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [630.940037] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [630.940125] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x00800000
<7> [630.940210] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x11111440
<7> [630.940292] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [630.940370] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [630.940449] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [630.940527] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00000100
<7> [630.940602] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [630.940692] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [630.940801] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7> [630.941985] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<3> [630.952772] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: load failed: status = 0x400000A0, time = 10ms, freq = 2150MHz (req 2133MHz)
<3> [630.964625] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: load failed: status: Reset = 0, BootROM = 0x50, UKernel = 0x00, MIA = 0x00, Auth = 0x01
<3> [630.977400] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: firmware signature verification failed
<3> [630.986193] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: reset failed (-EPROTO)
<3> [630.993338] xe 0000:03:00.0: [drm] *ERROR* CRITICAL: Xe has declared device 0000:03:00.0 as wedged.
IOCTLs and executions are blocked.
For recovery procedure, refer to https://docs.kernel.org/gpu/drm-uapi.html#device-wedging
Please file a _new_ bug report at https://gitlab.freedesktop.org/drm/xe/kernel/issues/new
<7> [631.025186] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [631.025380] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT1: GuC CT communication channel stopped
<3> [631.090167] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT1: GuC mmio request 0x5507: no reply 0x5507
<6> [631.100046] xe 0000:03:00.0: [drm] device wedged, needs recovery
<4> [631.102642] ------------[ cut here ]------------
<4> [631.102656] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [631.102668] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1626 at guc_exec_queue_timedout_job+0x1424/0x2400 [xe], CPU#2: kworker/u64:53/7914
<4> [631.103233] Modules linked in: snd_hda_codec_intelhdmi snd_hda_codec_hdmi pmt_crashlog mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp hid_generic cmdlinepart eeepc_wmi spi_nor coretemp asus_wmi mei_pxp mei_hdcp sparse_keymap mtd platform_profile binfmt_misc wmi_bmof usbhid kvm_intel hid kvm irqbypass ghash_clmulni_intel snd_intel_dspcfg aesni_intel rapl intel_cstate r8169 snd_hda_codec snd_hda_core video realtek snd_hwdep snd_pcm i2c_i801 mei_me idma64 snd_timer i2c_mux spi_intel_pci spi_intel snd i2c_smbus soundcore intel_pmc_core mei pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry wmi intel_vsec pinctrl_alderlake acpi_pad acpi_tad dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink
<4> [631.103765] autofs4 [last unloaded: snd_hda_intel]
<4> [631.103801] CPU: 2 UID: 0 PID: 7914 Comm: kworker/u64:53 Tainted: G S U L 7.0.0-lgci-xe-xe-4896-773e7de886203adcc-debug+ #1 PREEMPT(lazy)
<4> [631.103826] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [L]=SOFTLOCKUP
<4> [631.103837] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [631.103849] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [631.103916] RIP: 0010:guc_exec_queue_timedout_job+0x142d/0x2400 [xe]
<4> [631.104391] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 e6 d8 5e e1 48 89 c6 48 8d 3d 6c 95 39 00 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 c6 ee ff ff 8b 70 08 49
<4> [631.104403] RSP: 0018:ffffc9000914fca0 EFLAGS: 00010246
<4> [631.104420] RAX: ffffffffa1203043 RBX: 0000000000000000 RCX: 0000000000000000
<4> [631.104433] RDX: ffff888103ef0210 RSI: ffffffffa1203043 RDI: ffffffffa1003ee0
<4> [631.104446] RBP: ffffc9000914fdb0 R08: 0000000000000000 R09: 0000000000000000
<4> [631.104458] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [631.104469] R13: ffff888103ef0210 R14: ffff88812795b818 R15: 00000000ffffffc2
<4> [631.104483] FS: 0000000000000000(0000) GS:ffff8888dad97000(0000) knlGS:0000000000000000
<4> [631.104497] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [631.104510] CR2: 00007ea9f8000020 CR3: 000000000344c004 CR4: 0000000000f72ef0
<4> [631.104524] PKRU: 55555554
<4> [631.104536] Call Trace:
<4> [631.104547] <TASK>
<4> [631.104586] ? lock_sync+0x100/0x100
<4> [631.104631] ? lock_release+0xd0/0x2b0
<4> [631.104667] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [631.104702] process_one_work+0x239/0x760
<4> [631.104744] worker_thread+0x200/0x3f0
<4> [631.104762] ? __pfx_worker_thread+0x10/0x10
<4> [631.104776] kthread+0x10d/0x150
<4> [631.104792] ? __pfx_kthread+0x10/0x10
<4> [631.104813] ret_from_fork+0x3d4/0x480
<4> [631.104825] ? __pfx_kthread+0x10/0x10
<4> [631.104845] ret_from_fork_asm+0x1a/0x30
<4> [631.104912] </TASK>
<4> [631.104921] irq event stamp: 1918337
<4> [631.104929] hardirqs last enabled at (1918343): [<ffffffff814aadd9>] __up_console_sem+0x79/0xa0
<4> [631.104945] hardirqs last disabled at (1918348): [<ffffffff814aadbe>] __up_console_sem+0x5e/0xa0
<4> [631.104958] softirqs last enabled at (1917522): [<ffffffff813d1e9f>] __irq_exit_rcu+0x13f/0x160
<4> [631.104972] softirqs last disabled at (1917517): [<ffffffff813d1e9f>] __irq_exit_rcu+0x13f/0x160
<4> [631.104985] ---[ end trace 0000000000000000 ]---
<6> [631.114209] [IGT] xe_render_copy: finished subtest render-linear-256x256, FAIL
<6> [631.118084] [IGT] xe_render_copy: finished subtest render-square, FAIL
<7> [631.124881] xe 0000:03:00.0: [drm:drm_client_dev_restore] fbdev: ret=0
<6> [631.125342] [IGT] xe_render_copy: exiting, ret=98
<6> [631.141676] Console: switching to colour frame buffer device 240x67
|