Results for igt@xe_intel_bb@intel-bb-blit-y

Machine description: bat-adlp-vf

Result: Abort 6 Warning(s)

i915_display_info0 igt_runner0 results0.json results0-xe-load.json boot0 dmesg0

DetailValue
Duration unknown
Igt-Version
IGT-Version: 1.30-gf0b668833 (x86_64) (Linux: 6.14.0-rc4-xe+ x86_64)
Out
Using IGT_SRANDOM=1740691737 for randomisation
Opened device: /dev/dri/card1
Starting subtest: intel-bb-blit-y
Subtest intel-bb-blit-y: SUCCESS (0.171s)

This test caused an abort condition: Lockdep not active

/proc/lockdep_stats contents:
 lock-classes:                         2190 [max: 8192]
 direct dependencies:                 23908 [max: 524288]
 indirect dependencies:              155476
 all direct dependencies:            434708
 dependency chains:                   34089 [max: 524288]
 dependency chain hlocks used:       139876 [max: 2621440]
 dependency chain hlocks lost:            0
 in-hardirq chains:                     274
 in-softirq chains:                     705
 in-process chains:                   33110
 stack-trace entries:                243456 [max: 524288]
 number of stack traces:              11542
 number of stack hash chains:          8325
 combined max dependencies:      2133533354
 hardirq-safe locks:                     87
 hardirq-unsafe locks:                 1300
 softirq-safe locks:                    217
 softirq-unsafe locks:                 1199
 irq-safe locks:                        229
 irq-unsafe locks:                     1300
 hardirq-read-safe locks:                 3
 hardirq-read-unsafe locks:             427
 softirq-read-safe locks:                 8
 softirq-read-unsafe locks:             422
 irq-read-safe locks:                     8
 irq-read-unsafe locks:                 427
 uncategorized locks:                   364
 unused locks:                            1
 max locking depth:                      19
 max bfs queue depth:                   354
 max lock class index:                 2189
 debug_locks:                             0

 zapped classes:                          3
 zapped lock chains:                    169
 large chain blocks:                      1
Err
Starting subtest: intel-bb-blit-y
Subtest intel-bb-blit-y: SUCCESS (0.171s)
Dmesg

<6> [222.914116] [IGT] xe_intel_bb: executing
<6> [222.922784] [IGT] xe_intel_bb: starting subtest intel-bb-blit-y
<7> [223.029289] xe 0000:00:02.1: [drm:xe_guc_exec_queue_memory_cat_error_handler [xe]] GT0: Engine memory cat error: engine_class=bcs, logical_mask: 0x1, guc_id=1
<4> [223.029710]
<4> [223.029716] ======================================================
<4> [223.029720] WARNING: possible circular locking dependency detected
<4> [223.029723] 6.14.0-rc4-xe+ #1 Tainted: G U
<4> [223.029727] ------------------------------------------------------
<4> [223.029730] kworker/u64:2/119 is trying to acquire lock:
<4> [223.029734] ffffffff834c95c0 (fs_reclaim){+.+.}-{0:0}, at: __kmalloc_cache_noprof+0x58/0x490
<4> [223.029745]
but task is already holding lock:
<4> [223.029749] ffff88812dcf1430 (&ct->lock){+.+.}-{3:3}, at: receive_g2h+0x47/0x100 [xe]
<4> [223.029843]
which lock already depends on the new lock.
<4> [223.029847]
the existing dependency chain (in reverse order) is:
<4> [223.029850]
-> #1 (&ct->lock){+.+.}-{3:3}:
<4> [223.029856] xe_guc_ct_init+0x2b4/0x4c0 [xe]
<4> [223.029943] xe_guc_init+0xe9/0x360 [xe]
<4> [223.030027] xe_uc_init+0x1e/0x1f0 [xe]
<4> [223.030137] xe_gt_init_hwconfig+0x4c/0xb0 [xe]
<4> [223.030219] xe_device_probe+0x3b8/0x7f0 [xe]
<4> [223.030298] xe_pci_probe+0x372/0x5f0 [xe]
<4> [223.030394] local_pci_probe+0x44/0xb0
<4> [223.030399] pci_device_probe+0xf4/0x270
<4> [223.030404] really_probe+0xee/0x3c0
<4> [223.030408] __driver_probe_device+0x8c/0x180
<4> [223.030413] driver_probe_device+0x24/0xd0
<4> [223.030417] __driver_attach+0x10f/0x220
<4> [223.030421] bus_for_each_dev+0x8d/0xf0
<4> [223.030425] driver_attach+0x1e/0x30
<4> [223.030428] bus_add_driver+0x151/0x290
<4> [223.030432] driver_register+0x5e/0x130
<4> [223.030436] __pci_register_driver+0x7d/0x90
<4> [223.030441] xe_register_pci_driver+0x23/0x30 [xe]
<4> [223.030536] 0xffffffffa0a350f3
<4> [223.030548] do_one_initcall+0x76/0x400
<4> [223.030552] do_init_module+0x97/0x2a0
<4> [223.030556] load_module+0x2c23/0x2f60
<4> [223.030560] init_module_from_file+0x97/0xe0
<4> [223.030564] idempotent_init_module+0x134/0x350
<4> [223.030568] __x64_sys_finit_module+0x77/0x100
<4> [223.030571] x64_sys_call+0x1f37/0x2650
<4> [223.030576] do_syscall_64+0x91/0x180
<4> [223.030580] entry_SYSCALL_64_after_hwframe+0x76/0x7e
<4> [223.030586]
-> #0 (fs_reclaim){+.+.}-{0:0}:
<4> [223.030591] __lock_acquire+0x1637/0x2810
<4> [223.030597] lock_acquire+0xc9/0x300
<4> [223.030601] fs_reclaim_acquire+0xc5/0x100
<4> [223.030605] __kmalloc_cache_noprof+0x58/0x490
<4> [223.030610] xe_vm_add_ban_entry+0x64/0x2a0 [xe]
<4> [223.030718] xe_guc_exec_queue_memory_cat_error_handler+0x19d/0x230 [xe]
<4> [223.030808] dequeue_one_g2h+0x349/0x900 [xe]
<4> [223.030895] receive_g2h+0x4f/0x100 [xe]
<4> [223.030981] g2h_worker_func+0x15/0x20 [xe]
<4> [223.031067] process_one_work+0x21c/0x740
<4> [223.031071] worker_thread+0x1db/0x3c0
<4> [223.031075] kthread+0x10d/0x270
<4> [223.031079] ret_from_fork+0x44/0x70
<4> [223.031084] ret_from_fork_asm+0x1a/0x30
<4> [223.031088]
other info that might help us debug this:
<4> [223.031092] Possible unsafe locking scenario:
<4> [223.031095] CPU0 CPU1
<4> [223.031098] ---- ----
<4> [223.031101] lock(&ct->lock);
<4> [223.031104] lock(fs_reclaim);
<4> [223.031108] lock(&ct->lock);
<4> [223.031112] lock(fs_reclaim);
<4> [223.031115]
*** DEADLOCK ***
<4> [223.031119] 3 locks held by kworker/u64:2/119:
<4> [223.031122] #0: ffff88814b8ecd48 ((wq_completion)xe-g2h-wq#2){+.+.}-{0:0}, at: process_one_work+0x444/0x740
<4> [223.031132] #1: ffffc900005afe20 ((work_completion)(&ct->g2h_worker)){+.+.}-{0:0}, at: process_one_work+0x1da/0x740
<4> [223.031141] #2: ffff88812dcf1430 (&ct->lock){+.+.}-{3:3}, at: receive_g2h+0x47/0x100 [xe]
<4> [223.031230]
stack backtrace:
<4> [223.031234] CPU: 15 UID: 0 PID: 119 Comm: kworker/u64:2 Tainted: G U 6.14.0-rc4-xe+ #1
<4> [223.031236] Tainted: [U]=USER
<4> [223.031237] Hardware name: Intel Corporation Alder Lake Client Platform/AlderLake-P DDR5 RVP, BIOS RPLPFWI1.R00.4035.A00.2301200723 01/20/2023
<4> [223.031238] Workqueue: xe-g2h-wq g2h_worker_func [xe]
<4> [223.031323] Call Trace:
<4> [223.031324] <TASK>
<4> [223.031325] dump_stack_lvl+0x91/0xf0
<4> [223.031328] dump_stack+0x10/0x20
<4> [223.031331] print_circular_bug+0x285/0x360
<4> [223.031333] check_noncircular+0x150/0x170
<4> [223.031337] __lock_acquire+0x1637/0x2810
<4> [223.031339] ? try_to_wake_up+0x447/0xbc0
<4> [223.031343] lock_acquire+0xc9/0x300
<4> [223.031345] ? __kmalloc_cache_noprof+0x58/0x490
<4> [223.031348] ? lock_release+0xd4/0x2b0
<4> [223.031350] ? xe_vm_add_ban_entry+0x64/0x2a0 [xe]
<4> [223.031455] fs_reclaim_acquire+0xc5/0x100
<4> [223.031457] ? __kmalloc_cache_noprof+0x58/0x490
<4> [223.031459] __kmalloc_cache_noprof+0x58/0x490
<4> [223.031461] ? find_held_lock+0x31/0x90
<4> [223.031463] xe_vm_add_ban_entry+0x64/0x2a0 [xe]
<4> [223.031567] ? xe_vm_add_ban_entry+0x64/0x2a0 [xe]
<4> [223.031671] ? _raw_spin_unlock+0x22/0x50
<4> [223.031673] ? drm_sched_tdr_queue_imm+0x36/0x50 [gpu_sched]
<4> [223.031678] xe_guc_exec_queue_memory_cat_error_handler+0x19d/0x230 [xe]
<4> [223.031765] dequeue_one_g2h+0x349/0x900 [xe]
<4> [223.031848] ? receive_g2h+0x47/0x100 [xe]
<4> [223.031934] receive_g2h+0x4f/0x100 [xe]
<4> [223.032018] g2h_worker_func+0x15/0x20 [xe]
<4> [223.032101] process_one_work+0x21c/0x740
<4> [223.032105] worker_thread+0x1db/0x3c0
<4> [223.032107] ? __pfx_worker_thread+0x10/0x10
<4> [223.032109] kthread+0x10d/0x270
<4> [223.032111] ? __pfx_kthread+0x10/0x10
<4> [223.032112] ret_from_fork+0x44/0x70
<4> [223.032114] ? __pfx_kthread+0x10/0x10
<4> [223.032115] ret_from_fork_asm+0x1a/0x30
<4> [223.032119] </TASK>
<7> [223.032226] xe 0000:00:02.1: [drm:xe_guc_exec_queue_memory_cat_error_handler [xe]] GT0: Engine memory cat error: engine_class=bcs, logical_mask: 0x1, guc_id=1
<6> [223.032725] xe 0000:00:02.1: [drm] GT0: Engine reset: engine_class=bcs, logical_mask: 0x1, guc_id=1
<5> [223.032741] xe 0000:00:02.1: [drm] GT0: Timedout job: seqno=4294967169, lrc_seqno=4294967169, guc_id=1, flags=0x4 in no process [-1]
<6> [223.032826] xe 0000:00:02.1: [drm] Xe device coredump has been created
<6> [223.032832] xe 0000:00:02.1: [drm] Check your /sys/class/drm/card1/device/devcoredump/data
<4> [223.032838] ------------[ cut here ]------------
<4> [223.032841] xe 0000:00:02.1: [drm] GT0: VM job timed out on non-killed execqueue
<4> [223.032870] WARNING: CPU: 0 PID: 484 at drivers/gpu/drm/xe/xe_guc_submit.c:1183 guc_exec_queue_timedout_job+0x44f/0xe40 [xe]
<4> [223.032988] Modules linked in: snd_hda_codec_hdmi snd_hda_intel snd_intel_dspcfg snd_hda_codec snd_hda_core snd_hwdep snd_pcm snd_timer snd soundcore xe drm_gpuvm drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_buddy drm_kunit_helpers drm_kms_helper i2c_algo_bit kunit hid_sensor_custom hid_sensor_hub hid_generic cdc_mbim cdc_wdm cdc_ncm intel_ishtp_hid cdc_ether usbnet hid overlay intel_uncore_frequency intel_uncore_frequency_common x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm polyval_clmulni cmdlinepart polyval_generic ghash_clmulni_intel sha256_ssse3 sha1_ssse3 spi_nor aesni_intel crypto_simd mtd cryptd processor_thermal_device_pci r8152 processor_thermal_device rapl mei_pxp mei_hdcp intel_rapl_msr wmi_bmof mii intel_cstate spi_pxa2xx_platform processor_thermal_wt_hint dw_dmac i2c_i801 processor_thermal_rfim spi_intel_pci dw_dmac_core spi_pxa2xx_core i2c_mux processor_thermal_rapl spi_intel i2c_smbus e1000e intel_ish_ipc binfmt_misc intel_rapl_common
<4> [223.033025] idma64 processor_thermal_wt_req intel_ishtp mei_me thunderbolt igen6_edac processor_thermal_power_floor mei processor_thermal_mbox int340x_thermal_zone video intel_skl_int3472_tps68470 tps68470_regulator nls_iso8859_1 clk_tps68470 intel_pmc_core wmi pmt_telemetry pmt_class intel_skl_int3472_discrete int3400_thermal intel_hid acpi_pad intel_skl_int3472_common acpi_tad acpi_thermal_rel sparse_keymap pinctrl_tigerlake intel_vsec dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink ip_tables x_tables autofs4
<4> [223.033066] CPU: 0 UID: 0 PID: 484 Comm: kworker/u64:4 Tainted: G U 6.14.0-rc4-xe+ #1
<4> [223.033070] Tainted: [U]=USER
<4> [223.033072] Hardware name: Intel Corporation Alder Lake Client Platform/AlderLake-P DDR5 RVP, BIOS RPLPFWI1.R00.4035.A00.2301200723 01/20/2023
<4> [223.033076] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [223.033084] RIP: 0010:guc_exec_queue_timedout_job+0x44f/0xe40 [xe]
<4> [223.033135] Code: 0f b6 68 1c 48 89 95 78 ff ff ff e8 8b c0 57 e1 48 8b 95 78 ff ff ff 48 c7 c7 a0 8d ec a0 48 89 c6 41 0f b6 cd e8 81 be 81 e0 <0f> 0b 80 7d 88 00 0f 85 ef 02 00 00 49 8b 56 58 f6 c2 01 0f 85 cb
<4> [223.033140] RSP: 0018:ffffc9000159fcc0 EFLAGS: 00010246
<4> [223.033143] RAX: 0000000000000000 RBX: ffff888151223f80 RCX: 0000000000000000
<4> [223.033145] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
<4> [223.033147] RBP: ffffc9000159fdb0 R08: 0000000000000000 R09: 0000000000000000
<4> [223.033150] R10: 0000000000000000 R11: 0000000000000000 R12: ffff888147afd000
<4> [223.033152] R13: 0000000000000000 R14: ffff88815009e400 R15: ffff88812dcf0028
<4> [223.033154] FS: 0000000000000000(0000) GS:ffff88849f000000(0000) knlGS:0000000000000000
<4> [223.033157] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [223.033159] CR2: 000070c8d681b430 CR3: 0000000003248006 CR4: 0000000000f72ef0
<4> [223.033162] PKRU: 55555554
<4> [223.033163] Call Trace:
<4> [223.033165] <TASK>
<4> [223.033167] ? show_regs+0x6c/0x80
<4> [223.033172] ? __warn+0x93/0x1c0
<4> [223.033176] ? guc_exec_queue_timedout_job+0x44f/0xe40 [xe]
<4> [223.033226] ? report_bug+0x182/0x1b0
<4> [223.033231] ? handle_bug+0x6e/0xb0
<4> [223.033234] ? exc_invalid_op+0x18/0x80
<4> [223.033237] ? asm_exc_invalid_op+0x1b/0x20
<4> [223.033242] ? guc_exec_queue_timedout_job+0x44f/0xe40 [xe]
<4> [223.033289] ? asm_sysvec_apic_timer_interrupt+0x1b/0x20
<4> [223.033293] ? __pfx_autoremove_wake_function+0x10/0x10
<4> [223.033298] drm_sched_job_timedout+0x91/0x130 [gpu_sched]
<4> [223.033302] process_one_work+0x21c/0x740
<4> [223.033306] worker_thread+0x1db/0x3c0
<4> [223.033309] ? __pfx_worker_thread+0x10/0x10
<4> [223.033312] kthread+0x10d/0x270
<4> [223.033315] ? __pfx_kthread+0x10/0x10
<4> [223.033317] ret_from_fork+0x44/0x70
<4> [223.033320] ? __pfx_kthread+0x10/0x10
<4> [223.033322] ret_from_fork_asm+0x1a/0x30
<4> [223.033327] </TASK>
<4> [223.033328] irq event stamp: 113184
<4> [223.033330] hardirqs last enabled at (113183): [<ffffffff827e49a7>] _raw_spin_unlock_irq+0x27/0x70
<4> [223.033335] hardirqs last disabled at (113184): [<ffffffff827d7c1d>] __schedule+0xfbd/0x1c60
<4> [223.033338] softirqs last enabled at (110298): [<ffffffff813d4fbf>] __irq_exit_rcu+0x13f/0x160
<4> [223.033342] softirqs last disabled at (110291): [<ffffffff813d4fbf>] __irq_exit_rcu+0x13f/0x160
<4> [223.033345] ---[ end trace 0000000000000000 ]---
<6> [223.033347] xe 0000:00:02.1: [drm] GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [223.033397] xe 0000:00:02.1: [drm] GT0: reset queued
<6> [223.033402] xe 0000:00:02.1: [drm] GT0: reset started
<7> [223.033960] xe 0000:00:02.1: [drm:xe_gt_sriov_vf_bootstrap [xe]] GT0: VF: using GuC interface version 0.1.17.0
<7> [223.034533] xe 0000:00:02.1: [drm:xe_gt_sriov_vf_bootstrap [xe]] GT0: VF: using GuC interface version 0.1.17.0
<7> [223.035075] xe 0000:00:02.1: [drm:xe_guc_ct_enable [xe]] GT0: GuC CT communication channel enabled
<7> [223.035469] xe 0000:00:02.0: [drm:xe_gt_sriov_pf_service_process_request [xe]] GT0: PF: VF1 negotiated ABI version 1.0
<7> [223.035782] xe 0000:00:02.1: [drm:xe_gt_sriov_vf_connect [xe]] GT0: VF: using VF/PF ABI 1.0
<6> [223.035929] xe 0000:00:02.1: [drm] GT0: reset done
<7> [223.036321] xe 0000:00:02.1: [drm:guc_exec_queue_timedout_job [xe]] GT0: Check job timeout: seqno=4294967169, lrc_seqno=4294967169, guc_id=1, running_time_ms=1, timeout_ms=5000, diff=0x0000008c
<4> [223.039965] xe 0000:00:02.1: [drm] GT0: Check job timeout: seqno=4294967170, lrc_seqno=4294967170, guc_id=1, not started
<6> [223.094360] [IGT] xe_intel_bb: finished subtest intel-bb-blit-y, SUCCESS
<6> [223.094527] [IGT] xe_intel_bb: exiting, ret=0
Created at 2025-02-27 21:42:31