Results for igt@xe_exec_system_allocator@many-stride-mmap-remap-dontunmap

Result: Dmesg-Fail 29 Warning(s)

i915_display_info17 igt_runner17 results17.json results17-xe-load.json guc_logs17.tar i915_display_info_post_exec17 boot17 dmesg17

DetailValue
Duration 32.72 seconds
Hostname
shard-bmg-2
Igt-Version
IGT-Version: 2.4-g34f2fc606 (x86_64) (Linux: 7.0.0-rc7-lgci-xe-xe-4877-97d8833ffba6bd3d6-debug+ x86_64)
Out
Using IGT_SRANDOM=1775807648 for randomisation
Opened device: /dev/dri/card0
Starting subtest: many-stride-mmap-remap-dontunmap
Stack trace:
  #0 ../lib/igt_core.c:2075 __igt_fail_assert()
  #1 [xe_wait_ufence+0x57]
  #2 ../tests/intel/xe_exec_system_allocator.c:1757 test_exec()
  #3 ../tests/intel/xe_exec_system_allocator.c:2562 __igt_unique____real_main2349()
  #4 ../tests/intel/xe_exec_system_allocator.c:2349 main()
  #5 [__libc_init_first+0x8a]
  #6 [__libc_start_main+0x8b]
  #7 [_start+0x25]
Subtest many-stride-mmap-remap-dontunmap: FAIL (32.717s)
Err
Starting subtest: many-stride-mmap-remap-dontunmap
(xe_exec_system_allocator:4233) xe/xe_ioctl-CRITICAL: Test assertion failure function xe_wait_ufence, file ../lib/xe/xe_ioctl.c:763:
(xe_exec_system_allocator:4233) xe/xe_ioctl-CRITICAL: Failed assertion: __xe_wait_ufence(fd, addr, value, exec_queue, &timeout) == 0
(xe_exec_system_allocator:4233) xe/xe_ioctl-CRITICAL: Last errno: 62, Timer expired
(xe_exec_system_allocator:4233) xe/xe_ioctl-CRITICAL: error: -62 != 0
Subtest many-stride-mmap-remap-dontunmap failed.
**** DEBUG ****
(xe_exec_system_allocator:4233) xe/xe_ioctl-CRITICAL: Test assertion failure function xe_wait_ufence, file ../lib/xe/xe_ioctl.c:763:
(xe_exec_system_allocator:4233) xe/xe_ioctl-CRITICAL: Failed assertion: __xe_wait_ufence(fd, addr, value, exec_queue, &timeout) == 0
(xe_exec_system_allocator:4233) xe/xe_ioctl-CRITICAL: Last errno: 62, Timer expired
(xe_exec_system_allocator:4233) xe/xe_ioctl-CRITICAL: error: -62 != 0
(xe_exec_system_allocator:4233) igt_core-INFO: Stack trace:
(xe_exec_system_allocator:4233) igt_core-INFO:   #0 ../lib/igt_core.c:2075 __igt_fail_assert()
(xe_exec_system_allocator:4233) igt_core-INFO:   #1 [xe_wait_ufence+0x57]
(xe_exec_system_allocator:4233) igt_core-INFO:   #2 ../tests/intel/xe_exec_system_allocator.c:1757 test_exec()
(xe_exec_system_allocator:4233) igt_core-INFO:   #3 ../tests/intel/xe_exec_system_allocator.c:2562 __igt_unique____real_main2349()
(xe_exec_system_allocator:4233) igt_core-INFO:   #4 ../tests/intel/xe_exec_system_allocator.c:2349 main()
(xe_exec_system_allocator:4233) igt_core-INFO:   #5 [__libc_init_first+0x8a]
(xe_exec_system_allocator:4233) igt_core-INFO:   #6 [__libc_start_main+0x8b]
(xe_exec_system_allocator:4233) igt_core-INFO:   #7 [_start+0x25]
****  END  ****
Subtest many-stride-mmap-remap-dontunmap: FAIL (32.717s)
Dmesg

<6> [225.263318] Console: switching to colour dummy device 80x25
<6> [225.263592] [IGT] xe_exec_system_allocator: executing
<6> [225.375673] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [225.375698] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [225.375709] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [225.375718] nvme 0000:05:00.0: [ 0] RxErr (First)
<7> [227.106423] xe 0000:03:00.0: [drm:intel_power_well_enable [xe]] enabling AUX_TC2
<7> [227.210441] xe 0000:03:00.0: [drm:intel_power_well_disable [xe]] disabling AUX_TC2
<3> [227.619937] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=382 recv=381
<7> [228.325150] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x2b2b2829
<7> [228.325299] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x2b2b2a2b
<3> [229.923315] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=382 recv=381
<3> [232.226747] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=383 recv=381
<6> [232.345348] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [232.345374] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [232.345384] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [232.345393] nvme 0000:05:00.0: [ 0] RxErr (First)
<3> [234.530872] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=383 recv=381
<6> [234.650059] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [234.650086] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [234.650096] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [234.650105] nvme 0000:05:00.0: [ 0] RxErr (First)
<6> [234.758259] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [234.758285] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [234.758295] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [234.758305] nvme 0000:05:00.0: [ 0] RxErr (First)
<3> [236.834927] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=384 recv=381
<3> [239.138953] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=384 recv=381
<7> [239.147829] xe 0000:03:00.0: [drm:drm_pagemap_dev_unhold_work [drm_gpusvm_helper]] Releasing reference on provider device and module.
<6> [239.155046] [IGT] xe_exec_system_allocator: starting subtest many-stride-mmap-remap-dontunmap
<3> [241.443170] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=385 recv=381
<6> [241.563239] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [241.563265] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [241.563275] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [241.563284] nvme 0000:05:00.0: [ 0] RxErr (First)
<7> [243.297400] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x2c2c292a
<7> [243.297543] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x2c2c2b2c
<3> [243.747145] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=385 recv=381
<3> [246.051014] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=386 recv=381
<3> [248.354881] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=386 recv=381
<3> [250.658758] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=387 recv=381
<3> [252.962554] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=387 recv=381
<3> [255.266425] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=388 recv=381
<3> [257.570270] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=388 recv=381
<7> [258.362104] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x2d2d292a
<7> [258.362403] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x2d2c2c2d
<6> [258.469554] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [258.469583] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [258.469594] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [258.469603] nvme 0000:05:00.0: [ 0] RxErr (First)
<3> [259.874229] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=389 recv=381
<3> [262.178107] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=389 recv=381
<3> [264.481978] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=390 recv=381
<3> [266.785915] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=390 recv=381
<6> [271.873011] [IGT] xe_exec_system_allocator: finished subtest many-stride-mmap-remap-dontunmap, FAIL
<6> [271.873708] [IGT] xe_exec_system_allocator: exiting, ret=98
<6> [271.874174] Console: switching to colour frame buffer device 240x67
<7> [273.331186] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x2d2d2a2b
<7> [273.331339] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x2d2d2c2d
<4> [276.961532] xe 0000:03:00.0: [drm] Tile0: GT0: Schedule disable failed to respond, guc_id=2
<7> [277.152448] xe 0000:03:00.0: [drm:xe_hw_engine_snapshot_capture [xe]] Tile0: GT0: Proceeding with manual engine snapshot
<6> [277.152704] xe 0000:03:00.0: [drm] Xe device coredump has been created
<6> [277.152725] xe 0000:03:00.0: [drm] Check your /sys/class/drm/card0/device/devcoredump/data
<6> [277.152726] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [277.152804] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<3> [277.152945] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=391 recv=381
<6> [277.161828] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<7> [277.161932] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [277.162383] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [277.162474] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [277.162559] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x00800000
<7> [277.162639] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x11111440
<7> [277.162716] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [277.162791] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [277.162868] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [277.162946] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00000100
<7> [277.163018] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [277.163102] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [277.163346] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7> [277.164351] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<3> [277.174532] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: load failed: status = 0x400000A0, time = 9ms, freq = 2150MHz (req 2133MHz)
<3> [277.186589] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: load failed: status: Reset = 0, BootROM = 0x50, UKernel = 0x00, MIA = 0x00, Auth = 0x01
<3> [277.199379] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: firmware signature verification failed
<3> [277.208211] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: reset failed (-EPROTO)
<3> [277.215355] xe 0000:03:00.0: [drm] *ERROR* CRITICAL: Xe has declared device 0000:03:00.0 as wedged.
IOCTLs and executions are blocked.
For recovery procedure, refer to https://docs.kernel.org/gpu/drm-uapi.html#device-wedging
Please file a _new_ bug report at https://gitlab.freedesktop.org/drm/xe/kernel/issues/new
<7> [277.247255] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [277.247434] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT1: GuC CT communication channel stopped
<3> [277.313904] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT1: GuC mmio request 0x5507: no reply 0x5507
<6> [277.322935] xe 0000:03:00.0: [drm] device wedged, needs recovery
<7> [277.323731] xe 0000:03:00.0: [drm:drm_pagemap_dev_unhold_work [drm_gpusvm_helper]] Releasing reference on provider device and module.
<7> [277.334809] xe 0000:03:00.0: [drm:drm_client_dev_restore] fbdev: ret=0
Created at 2026-04-10 08:21:43