Machine description: shard-adlp-6
Result: 5 Warning(s)
Detail | Value |
---|---|
Duration | unknown |
Hostname | shard-adlp-6 |
Igt-Version | IGT-Version: 1.30-g7187a77fa (x86_64) (Linux: 6.14.0-rc2-xe+ x86_64) |
Out |
Using IGT_SRANDOM=1739649952 for randomisation
Opened device: /dev/dri/card0
Starting subtest: cm-cat-error
Stack trace:
  #0 ../lib/igt_core.c:2051 __igt_fail_assert()
  #1 ../tests/intel/xe_exec_reset.c:570 test_compute_mode()
  #2 ../tests/intel/xe_exec_reset.c:806 __igt_unique____real_main758()
  #3 ../tests/intel/xe_exec_reset.c:758 main()
  #4 [__libc_init_first+0x8a]
  #5 [__libc_start_main+0x8b]
  #6 [_start+0x25]
Subtest cm-cat-error: FAIL (1.014s)
This test caused an abort condition: Lockdep not active
/proc/lockdep_stats contents:
lock-classes: 2246 [max: 8192]
direct dependencies: 25756 [max: 524288]
indirect dependencies: 168855
all direct dependencies: 519631
dependency chains: 38203 [max: 524288]
dependency chain hlocks used: 163117 [max: 2621440]
dependency chain hlocks lost: 0
in-hardirq chains: 366
in-softirq chains: 812
in-process chains: 37025
stack-trace entries: 264943 [max: 524288]
number of stack traces: 12335
number of stack hash chains: 8626
combined max dependencies: 2457550054
hardirq-safe locks: 112
hardirq-unsafe locks: 1317
softirq-safe locks: 246
softirq-unsafe locks: 1226
irq-safe locks: 263
irq-unsafe locks: 1317
hardirq-read-safe locks: 4
hardirq-read-unsafe locks: 447
softirq-read-safe locks: 10
softirq-read-unsafe locks: 443
irq-read-safe locks: 10
irq-read-unsafe locks: 447
uncategorized locks: 351
unused locks: 1
max locking depth: 18
max bfs queue depth: 376
max lock class index: 2245
debug_locks: 0
zapped classes: 2
zapped lock chains: 165
large chain blocks: 1 |
Err |
Starting subtest: cm-cat-error
(xe_exec_reset:3377) CRITICAL: Test assertion failure function test_compute_mode, file ../tests/intel/xe_exec_reset.c:570:
(xe_exec_reset:3377) CRITICAL: Failed assertion: err == 0
(xe_exec_reset:3377) CRITICAL: Last errno: 5, Input/output error
(xe_exec_reset:3377) CRITICAL: error: -5 != 0
Subtest cm-cat-error failed.
**** DEBUG ****
(xe_exec_reset:3377) CRITICAL: Test assertion failure function test_compute_mode, file ../tests/intel/xe_exec_reset.c:570:
(xe_exec_reset:3377) CRITICAL: Failed assertion: err == 0
(xe_exec_reset:3377) CRITICAL: Last errno: 5, Input/output error
(xe_exec_reset:3377) CRITICAL: error: -5 != 0
(xe_exec_reset:3377) igt_core-INFO: Stack trace:
(xe_exec_reset:3377) igt_core-INFO: #0 ../lib/igt_core.c:2051 __igt_fail_assert()
(xe_exec_reset:3377) igt_core-INFO: #1 ../tests/intel/xe_exec_reset.c:570 test_compute_mode()
(xe_exec_reset:3377) igt_core-INFO: #2 ../tests/intel/xe_exec_reset.c:806 __igt_unique____real_main758()
(xe_exec_reset:3377) igt_core-INFO: #3 ../tests/intel/xe_exec_reset.c:758 main()
(xe_exec_reset:3377) igt_core-INFO: #4 [__libc_init_first+0x8a]
(xe_exec_reset:3377) igt_core-INFO: #5 [__libc_start_main+0x8b]
(xe_exec_reset:3377) igt_core-INFO: #6 [_start+0x25]
**** END ****
Subtest cm-cat-error: FAIL (1.014s) |
Dmesg
|
<6> [285.215856] Console: switching to colour dummy device 80x25
<6> [285.216292] [IGT] xe_exec_reset: executing
<6> [285.219554] [IGT] xe_exec_reset: starting subtest cm-cat-error
<7> [285.221766] xe 0000:00:02.0: [drm:xe_guc_exec_queue_memory_cat_error_handler [xe]] GT0: Engine memory cat error: engine_class=rcs, logical_mask: 0x1, guc_id=2
<3> [286.222821] xe 0000:00:02.0: [drm] *ERROR* GT0: GuC engine reset request failed on 0:0 because 0x00000000
<6> [286.223520] xe 0000:00:02.0: [drm] GT0: trying reset from xe_guc_exec_queue_reset_failure_handler [xe]
<6> [286.223853] xe 0000:00:02.0: [drm] GT0: reset queued
<7> [286.224266] xe 0000:00:02.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<6> [286.224989] xe 0000:00:02.0: [drm] GT0: reset started
<4> [286.225661]
<4> [286.225675] ======================================================
<4> [286.225686] WARNING: possible circular locking dependency detected
<4> [286.225696] 6.14.0-rc2-xe+ #1 Not tainted
<4> [286.225706] ------------------------------------------------------
<4> [286.225715] kworker/u80:18/2469 is trying to acquire lock:
<4> [286.225726] ffffffff834c9500 (fs_reclaim){+.+.}-{0:0}, at: __kmalloc_cache_noprof+0x58/0x490
<4> [286.225760]
but task is already holding lock:
<4> [286.225770] ffff88814295a158 (&guc->submission_state.lock){+.+.}-{3:3}, at: xe_guc_submit_stop+0x6c/0x590 [xe]
<4> [286.226100]
which lock already depends on the new lock.
<4> [286.226115]
the existing dependency chain (in reverse order) is:
<4> [286.226128]
-> #1 (&guc->submission_state.lock){+.+.}-{3:3}:
<4> [286.226150] __mutex_lock+0xdc/0xe60
<4> [286.226168] mutex_lock_nested+0x1b/0x30
<4> [286.226183] xe_guc_submit_init+0xf0/0x130 [xe]
<4> [286.226497] xe_guc_init_post_hwconfig+0x352/0x11c0 [xe]
<4> [286.226792] xe_uc_init_post_hwconfig+0x3c/0x70 [xe]
<4> [286.227186] xe_gt_init+0x3df/0x910 [xe]
<4> [286.227471] xe_device_probe+0x5d1/0x820 [xe]
<4> [286.227748] xe_pci_probe+0x35b/0x5f0 [xe]
<4> [286.228084] local_pci_probe+0x44/0xb0
<4> [286.228101] pci_device_probe+0xf4/0x270
<4> [286.228117] really_probe+0xee/0x3c0
<4> [286.228131] __driver_probe_device+0x8c/0x180
<4> [286.228147] driver_probe_device+0x24/0xd0
<4> [286.228161] __driver_attach+0x10f/0x220
<4> [286.228175] bus_for_each_dev+0x8d/0xf0
<4> [286.228187] driver_attach+0x1e/0x30
<4> [286.228199] bus_add_driver+0x151/0x290
<4> [286.228212] driver_register+0x5e/0x130
<4> [286.228227] __pci_register_driver+0x7d/0x90
<4> [286.228242] xe_register_pci_driver+0x23/0x30 [xe]
<4> [286.228569] soundcore_open+0x83/0x210 [soundcore]
<4> [286.228585] do_one_initcall+0x76/0x400
<4> [286.228603] do_init_module+0x97/0x2a0
<4> [286.228622] load_module+0x2c23/0x2f60
<4> [286.228634] init_module_from_file+0x97/0xe0
<4> [286.228647] idempotent_init_module+0x134/0x350
<4> [286.228660] __x64_sys_finit_module+0x77/0x100
<4> [286.228674] x64_sys_call+0x1f37/0x2650
<4> [286.228688] do_syscall_64+0x91/0x180
<4> [286.228702] entry_SYSCALL_64_after_hwframe+0x76/0x7e
<4> [286.228720]
-> #0 (fs_reclaim){+.+.}-{0:0}:
<4> [286.228740] __lock_acquire+0x1637/0x2810
<4> [286.228757] lock_acquire+0xc9/0x300
<4> [286.228772] fs_reclaim_acquire+0xc5/0x100
<4> [286.228787] __kmalloc_cache_noprof+0x58/0x490
<4> [286.228804] xe_drm_client_add_blame+0x68/0x330 [xe]
<4> [286.229084] xe_guc_submit_stop+0x21e/0x590 [xe]
<4> [286.229392] xe_guc_stop+0x21/0x30 [xe]
<4> [286.229534] xe_uc_stop+0x2a/0x40 [xe]
<4> [286.229607] gt_reset_worker+0x13e/0x1e0 [xe]
<4> [286.229662] process_one_work+0x21c/0x740
<4> [286.229665] worker_thread+0x1db/0x3c0
<4> [286.229668] kthread+0x10d/0x270
<4> [286.229670] ret_from_fork+0x44/0x70
<4> [286.229674] ret_from_fork_asm+0x1a/0x30
<4> [286.229677]
other info that might help us debug this:
<4> [286.229679] Possible unsafe locking scenario:
<4> [286.229682]        CPU0                    CPU1
<4> [286.229684]        ----                    ----
<4> [286.229686]   lock(&guc->submission_state.lock);
<4> [286.229688]                                lock(fs_reclaim);
<4> [286.229691]                                lock(&guc->submission_state.lock);
<4> [286.229695]   lock(fs_reclaim);
<4> [286.229697]
*** DEADLOCK ***
<4> [286.229699] 3 locks held by kworker/u80:18/2469:
<4> [286.229701] #0: ffff88814294a948 ((wq_completion)gt-ordered-wq){+.+.}-{0:0}, at: process_one_work+0x444/0x740
<4> [286.229708] #1: ffffc90002c4fe20 ((work_completion)(&gt->reset.worker)){+.+.}-{0:0}, at: process_one_work+0x1da/0x740
<4> [286.229714] #2: ffff88814295a158 (&guc->submission_state.lock){+.+.}-{3:3}, at: xe_guc_submit_stop+0x6c/0x590 [xe]
<4> [286.229774]
stack backtrace:
<4> [286.229777] CPU: 2 UID: 0 PID: 2469 Comm: kworker/u80:18 Not tainted 6.14.0-rc2-xe+ #1
<4> [286.229778] Hardware name: Intel Corporation Alder Lake Client Platform/AlderLake-P DDR5 RVP, BIOS RPLPFWI1.R00.4035.A00.2301200723 01/20/2023
<4> [286.229779] Workqueue: gt-ordered-wq gt_reset_worker [xe]
<4> [286.229832] Call Trace:
<4> [286.229832] <TASK>
<4> [286.229833] dump_stack_lvl+0x91/0xf0
<4> [286.229836] dump_stack+0x10/0x20
<4> [286.229837] print_circular_bug+0x285/0x360
<4> [286.229839] check_noncircular+0x150/0x170
<4> [286.229842] __lock_acquire+0x1637/0x2810
<4> [286.229844] lock_acquire+0xc9/0x300
<4> [286.229846] ? __kmalloc_cache_noprof+0x58/0x490
<4> [286.229847] ? __lock_acquire+0x1166/0x2810
<4> [286.229849] ? __flush_work+0x4a5/0x5f0
<4> [286.229851] ? xe_drm_client_add_blame+0x68/0x330 [xe]
<4> [286.229902] fs_reclaim_acquire+0xc5/0x100
<4> [286.229903] ? __kmalloc_cache_noprof+0x58/0x490
<4> [286.229905] __kmalloc_cache_noprof+0x58/0x490
<4> [286.229907] xe_drm_client_add_blame+0x68/0x330 [xe]
<4> [286.229964] ? xe_drm_client_add_blame+0x68/0x330 [xe]
<4> [286.230033] ? xe_lrc_read_ctx_reg+0x41/0x80 [xe]
<4> [286.230110] xe_guc_submit_stop+0x21e/0x590 [xe]
<4> [286.230167] ? trace_hardirqs_on+0x1e/0xe0
<4> [286.230169] ? enable_work+0x8c/0x110
<4> [286.230172] xe_guc_stop+0x21/0x30 [xe]
<4> [286.230226] xe_uc_stop+0x2a/0x40 [xe]
<4> [286.230296] gt_reset_worker+0x13e/0x1e0 [xe]
<4> [286.230349] process_one_work+0x21c/0x740
<4> [286.230351] worker_thread+0x1db/0x3c0
<4> [286.230352] ? __pfx_worker_thread+0x10/0x10
<4> [286.230354] kthread+0x10d/0x270
<4> [286.230355] ? __pfx_kthread+0x10/0x10
<4> [286.230356] ret_from_fork+0x44/0x70
<4> [286.230357] ? __pfx_kthread+0x10/0x10
<4> [286.230357] ret_from_fork_asm+0x1a/0x30
<4> [286.230360] </TASK>
<7> [286.230460] xe 0000:00:02.0: [drm:xe_reg_sr_apply_mmio [xe]] GT0: Applying GT save-restore MMIOs
<6> [286.230541] xe 0000:00:02.0: [drm] exec queue reset detected
<7> [286.230547] xe 0000:00:02.0: [drm:xe_reg_sr_apply_mmio [xe]] GT0: REG[0x9424] = 0xfffffffc
<7> [286.230623] xe 0000:00:02.0: [drm:xe_reg_sr_apply_mmio [xe]] GT0: REG[0x9550] = 0x000003ff
<7> [286.230690] xe 0000:00:02.0: [drm:xe_wopcm_init [xe]] WOPCM: 2048K
<7> [286.230770] xe 0000:00:02.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [592K, 1420K)
<7> [286.232145] xe 0000:00:02.0: [drm:xe_guc_ads_populate [xe]] GT0: ADS capture alloc size changed from 36864 to 32768
<7> [286.232745] xe 0000:00:02.0: [drm:__xe_guc_upload [xe]] GT0: load still in progress, timeouts = 0, freq = 1250MHz (req 1300MHz), status = 0x00000072 [0x39/00]
<7> [286.233079] xe 0000:00:02.0: [drm:__xe_guc_upload [xe]] GT0: load still in progress, timeouts = 0, freq = 1250MHz (req 1300MHz), status = 0x00000074 [0x3A/00]
<7> [286.233287] xe 0000:00:02.0: [drm:__xe_guc_upload [xe]] GT0: load still in progress, timeouts = 0, freq = 1250MHz (req 1300MHz), status = 0x800005EC [0x76/05]
<6> [286.234197] [IGT] xe_exec_reset: finished subtest cm-cat-error, FAIL
<6> [286.234500] [IGT] xe_exec_reset: exiting, ret=98
<6> [286.234805] Console: switching to colour frame buffer device 240x67
<7> [286.239992] xe 0000:00:02.0: [drm:intel_power_well_enable [xe]] enabling DC_off
<7> [286.240238] xe 0000:00:02.0: [drm:gen9_set_dc_state.part.0 [xe]] Setting DC state from 02 to 00
<7> [286.242296] xe 0000:00:02.0: [drm:drm_client_dev_restore] intel-fbdev: ret=0
<7> [286.252944] xe 0000:00:02.0: [drm:__xe_guc_upload [xe]] GT0: init took 20ms, freq = 1250MHz (req = 1300MHz), before = 1250MHz, status = 0x8002F0EC, timeouts = 0
<7> [286.253246] xe 0000:00:02.0: [drm:xe_guc_ct_enable [xe]] GT0: GuC CT communication channel enabled
|