Results for igt@xe_vm@large-userptr-misaligned-binds-2097152

Result: Dmesg-Fail 42 Warning(s)


Detail       Value
Duration     66.83 seconds
Hostname     shard-bmg-2
Igt-Version  IGT-Version: 2.4-g5b279a8b7 (x86_64) (Linux: 7.0.0-lgci-xe-xe-pw-164972v2-debug+ x86_64)
Out
Using IGT_SRANDOM=1776695187 for randomisation
Opened device: /dev/dri/card0
Starting subtest: large-userptr-misaligned-binds-2097152
Stack trace:
  #0 ../lib/igt_core.c:2075 __igt_fail_assert()
  #1 [syncobj_wait+0xab]
  #2 ../tests/intel/xe_vm.c:1465 test_large_binds.constprop.0()
  #3 ../tests/intel/xe_vm.c:2976 __igt_unique____real_main2699()
  #4 ../tests/intel/xe_vm.c:2699 main()
  #5 [__libc_init_first+0x8a]
  #6 [__libc_start_main+0x8b]
  #7 [_start+0x25]
Subtest large-userptr-misaligned-binds-2097152: FAIL (66.832s)
Err
Starting subtest: large-userptr-misaligned-binds-2097152
(xe_vm:9466) igt_syncobj-CRITICAL: Test assertion failure function syncobj_wait, file ../lib/igt_syncobj.c:240:
(xe_vm:9466) igt_syncobj-CRITICAL: Failed assertion: ret == 0
(xe_vm:9466) igt_syncobj-CRITICAL: error: -125 != 0
Subtest large-userptr-misaligned-binds-2097152 failed.
**** DEBUG ****
(xe_vm:9466) igt_syncobj-CRITICAL: Test assertion failure function syncobj_wait, file ../lib/igt_syncobj.c:240:
(xe_vm:9466) igt_syncobj-CRITICAL: Failed assertion: ret == 0
(xe_vm:9466) igt_syncobj-CRITICAL: error: -125 != 0
(xe_vm:9466) igt_core-INFO: Stack trace:
(xe_vm:9466) igt_core-INFO:   #0 ../lib/igt_core.c:2075 __igt_fail_assert()
(xe_vm:9466) igt_core-INFO:   #1 [syncobj_wait+0xab]
(xe_vm:9466) igt_core-INFO:   #2 ../tests/intel/xe_vm.c:1465 test_large_binds.constprop.0()
(xe_vm:9466) igt_core-INFO:   #3 ../tests/intel/xe_vm.c:2976 __igt_unique____real_main2699()
(xe_vm:9466) igt_core-INFO:   #4 ../tests/intel/xe_vm.c:2699 main()
(xe_vm:9466) igt_core-INFO:   #5 [__libc_init_first+0x8a]
(xe_vm:9466) igt_core-INFO:   #6 [__libc_start_main+0x8b]
(xe_vm:9466) igt_core-INFO:   #7 [_start+0x25]
****  END  ****
Subtest large-userptr-misaligned-binds-2097152: FAIL (66.832s)
Dmesg

<6> [412.588222] Console: switching to colour dummy device 80x25
<6> [412.588525] [IGT] xe_vm: executing
<6> [412.600673] [IGT] xe_vm: starting subtest large-userptr-misaligned-binds-2097152
<7> [414.413113] xe 0000:03:00.0: [drm:intel_power_well_enable [xe]] enabling AUX_TC2
<7> [414.517175] xe 0000:03:00.0: [drm:intel_power_well_disable [xe]] disabling AUX_TC2
<3> [414.862346] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=59947 recv=59946
<3> [417.165759] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=59947 recv=59946
<6> [417.286874] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [417.286900] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [417.286983] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [417.287000] nvme 0000:05:00.0: [ 0] RxErr (First)
<6> [417.395052] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [417.395079] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [417.395088] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [417.395098] nvme 0000:05:00.0: [ 0] RxErr (First)
<3> [419.469187] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=59948 recv=59946
<6> [419.584689] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [419.584722] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [419.584734] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [419.584743] nvme 0000:05:00.0: [ 0] RxErr (First)
<3> [421.773180] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=59948 recv=59946
<3> [424.076930] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=59949 recv=59946
<3> [426.381060] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=59949 recv=59946
<6> [428.172395] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [428.172421] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [428.172431] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [428.172440] nvme 0000:05:00.0: [ 0] RxErr (First)
<3> [428.685215] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=59950 recv=59946
<3> [430.989199] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=59950 recv=59946
<3> [433.293203] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=59951 recv=59946
<6> [433.411729] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [433.411755] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [433.411765] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [433.411775] nvme 0000:05:00.0: [ 0] RxErr (First)
<3> [435.597227] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=59951 recv=59946
<3> [437.902543] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=59952 recv=59946
<3> [440.205274] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=59952 recv=59946
<3> [442.509218] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=59953 recv=59946
<6> [444.501709] xe 0000:03:00.0: [drm] PL2 disabled for channel 0, val 0x00000000
<7> [444.504935] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x2d2d2b2b
<7> [444.505095] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x2d2d2d2e
<3> [444.813220] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=59953 recv=59946
<3> [447.117256] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=59954 recv=59946
<3> [449.421267] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=59954 recv=59946
<6> [449.540018] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [449.540044] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [449.540054] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [449.540064] nvme 0000:05:00.0: [ 0] RxErr (First)
<3> [451.725224] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=59955 recv=59946
<3> [454.029256] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=59955 recv=59946
<6> [454.148319] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [454.148344] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [454.148354] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [454.148364] nvme 0000:05:00.0: [ 0] RxErr (First)
<6> [454.257205] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [454.257231] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [454.257241] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [454.257251] nvme 0000:05:00.0: [ 0] RxErr (First)
<3> [456.333295] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=59956 recv=59946
<3> [458.637294] xe 0000:03:00.0: [drm] *ERROR* TLB invalidation fence timeout, seqno=59956 recv=59946
<6> [459.478940] xe 0000:03:00.0: [drm] PL2 disabled for channel 0, val 0x00000000
<7> [459.481925] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x2e2e2b2b
<7> [459.482075] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x2e2d2d2e
<4> [463.885347] xe 0000:03:00.0: [drm] Tile0: GT0: Check job timeout: seqno=4294967169, lrc_seqno=4294967169, guc_id=5, not started
<4> [463.885420] xe 0000:03:00.0: [drm] Tile0: GT0: Check job timeout: seqno=4294967169, lrc_seqno=4294967169, guc_id=4, not started
<4> [463.885473] xe 0000:03:00.0: [drm] Tile0: GT0: Check job timeout: seqno=4294967169, lrc_seqno=4294967169, guc_id=3, not started
<4> [463.885522] xe 0000:03:00.0: [drm] Tile0: GT0: Check job timeout: seqno=4294967169, lrc_seqno=4294967169, guc_id=2, not started
<6> [464.503196] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [464.503222] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [464.503232] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [464.503241] nvme 0000:05:00.0: [ 0] RxErr (First)
<6> [464.612188] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [464.612215] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [464.612224] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [464.612234] nvme 0000:05:00.0: [ 0] RxErr (First)
<4> [469.005277] xe 0000:03:00.0: [drm] Tile0: GT0: Check job timeout: seqno=4294967169, lrc_seqno=4294967169, guc_id=4, not started
<4> [469.005380] xe 0000:03:00.0: [drm] Tile0: GT0: Check job timeout: seqno=4294967169, lrc_seqno=4294967169, guc_id=2, not started
<4> [469.005435] xe 0000:03:00.0: [drm] Tile0: GT0: Check job timeout: seqno=4294967169, lrc_seqno=4294967169, guc_id=3, not started
<4> [469.005487] xe 0000:03:00.0: [drm] Tile0: GT0: Check job timeout: seqno=4294967169, lrc_seqno=4294967169, guc_id=5, not started
<4> [474.125364] xe 0000:03:00.0: [drm] Tile0: GT0: Check job timeout: seqno=4294967169, lrc_seqno=4294967169, guc_id=5, not started
<6> [474.235809] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [474.235835] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [474.235845] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [474.235854] nvme 0000:05:00.0: [ 0] RxErr (First)
<6> [474.480257] xe 0000:03:00.0: [drm] PL2 disabled for channel 0, val 0x00000000
<7> [474.483331] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x2e2e2b2c
<7> [474.483473] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x2e2e2d2e
<4> [479.245375] xe 0000:03:00.0: [drm] Tile0: GT0: Schedule disable failed to respond, guc_id=5
<7> [479.245406] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<6> [479.245887] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [479.246534] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<4> [479.246627] xe 0000:03:00.0: [drm] Tile0: GT0: Check job timeout: seqno=4294967169, lrc_seqno=4294967169, guc_id=2, not started
<7> [479.246653] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<6> [479.246978] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<4> [479.247464] xe 0000:03:00.0: [drm] Tile0: GT0: Check job timeout: seqno=4294967169, lrc_seqno=4294967169, guc_id=3, not started
<7> [479.247475] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<6> [479.247773] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<4> [479.248180] xe 0000:03:00.0: [drm] Tile0: GT0: Check job timeout: seqno=4294967169, lrc_seqno=4294967169, guc_id=4, not started
<7> [479.248189] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<6> [479.248486] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [479.248972] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<7> [479.249674] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [479.250491] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [479.250575] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [479.250661] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x00800000
<7> [479.250743] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x11111440
<7> [479.250823] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [479.250902] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [479.250980] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [479.251060] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00000100
<7> [479.251153] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [479.251240] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [479.251348] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7> [479.252503] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<3> [479.263148] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: load failed: status = 0x400000A0, time = 9ms, freq = 2150MHz (req 2133MHz)
<3> [479.274876] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: load failed: status: Reset = 0, BootROM = 0x50, UKernel = 0x00, MIA = 0x00, Auth = 0x01
<3> [479.287643] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: firmware signature verification failed
<3> [479.296287] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT0: reset failed (-EPROTO)
<3> [479.303425] xe 0000:03:00.0: [drm] *ERROR* CRITICAL: Xe has declared device 0000:03:00.0 as wedged.
IOCTLs and executions are blocked.
For recovery procedure, refer to https://docs.kernel.org/gpu/drm-uapi.html#device-wedging
Please file a _new_ bug report at https://gitlab.freedesktop.org/drm/xe/kernel/issues/new
<7> [479.335225] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [479.335328] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT1: GuC CT communication channel stopped
<3> [479.397781] xe 0000:03:00.0: [drm] *ERROR* Tile0: GT1: GuC mmio request 0x5507: no reply 0x5507
<6> [479.406600] xe 0000:03:00.0: [drm] device wedged, needs recovery
<6> [479.433298] [IGT] xe_vm: finished subtest large-userptr-misaligned-binds-2097152, FAIL
<7> [479.450211] xe 0000:03:00.0: [drm:drm_client_dev_restore] fbdev: ret=0
<6> [479.451255] [IGT] xe_vm: exiting, ret=98
<6> [479.470355] Console: switching to colour frame buffer device 240x67
Created at 2026-04-20 15:00:37