Results for igt@xe_exec_balancer@twice-parallel-rebind

Machine description: bat-adlp-vf

Result: Abort 5 Warning(s)

Detail Value
Duration unknown
Igt-Version
IGT-Version: 2.0-g8b20280be (x86_64) (Linux: 6.14.0-rc7-xe+ x86_64)
Out
Using IGT_SRANDOM=1742513332 for randomisation
Opened device: /dev/dri/card1
Starting subtest: twice-parallel-rebind
Subtest twice-parallel-rebind: SUCCESS (0.011s)

This test caused an abort condition: Kernel badly tainted (0x240, 0x200) (check dmesg for details):
	TAINT_WARN: WARN_ON has happened.
Err
Starting subtest: twice-parallel-rebind
Subtest twice-parallel-rebind: SUCCESS (0.011s)
Dmesg

<6> [160.031387] [IGT] xe_exec_balancer: executing
<6> [160.040149] [IGT] xe_exec_balancer: starting subtest twice-parallel-rebind
<7> [160.043175] xe 0000:00:02.1: [drm:xe_guc_exec_queue_memory_cat_error_handler [xe]] GT0: Engine memory cat error: engine_class=bcs, logical_mask: 0x1, guc_id=1
<6> [160.043998] xe 0000:00:02.1: [drm] GT0: Engine reset: engine_class=bcs, logical_mask: 0x1, guc_id=1
<5> [160.044007] xe 0000:00:02.1: [drm] GT0: Timedout job: seqno=4294967169, lrc_seqno=4294967169, guc_id=1, flags=0x4 in no process [-1]
<6> [160.044535] xe 0000:00:02.1: [drm] Xe device coredump has been created
<6> [160.044555] xe 0000:00:02.1: [drm] Check your /sys/class/drm/card1/device/devcoredump/data
<4> [160.044557] ------------[ cut here ]------------
<4> [160.044558] xe 0000:00:02.1: [drm] GT0: VM job timed out on non-killed execqueue
<4> [160.044574] WARNING: CPU: 5 PID: 1020 at drivers/gpu/drm/xe/xe_guc_submit.c:1191 guc_exec_queue_timedout_job+0x454/0xe00 [xe]
<4> [160.044660] Modules linked in: snd_hda_codec_hdmi xe drm_gpuvm drm_gpusvm drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_client_lib drm_exec drm_buddy drm_display_helper cec rc_core drm_kunit_helpers drm_kms_helper i2c_algo_bit kunit overlay hid_sensor_custom hid_sensor_hub hid_generic cdc_mbim cdc_wdm cdc_ncm cdc_ether intel_ishtp_hid usbnet hid intel_uncore_frequency intel_uncore_frequency_common x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm polyval_clmulni polyval_generic ghash_clmulni_intel snd_hda_intel sha256_ssse3 snd_intel_dspcfg sha1_ssse3 processor_thermal_device_pci aesni_intel cmdlinepart processor_thermal_device snd_hda_codec crypto_simd processor_thermal_wt_hint cryptd snd_hda_core processor_thermal_rfim spi_nor snd_hwdep rapl r8152 processor_thermal_rapl mei_pxp mei_hdcp intel_rapl_msr mtd wmi_bmof binfmt_misc mii snd_pcm intel_cstate spi_pxa2xx_platform dw_dmac dw_dmac_core intel_rapl_common i2c_i801 snd_timer spi_pxa2xx_core processor_thermal_wt_req snd i2c_mux mei_me e1000e
<4> [160.044725] spi_intel_pci processor_thermal_power_floor intel_ish_ipc i2c_smbus idma64 soundcore spi_intel processor_thermal_mbox thunderbolt mei intel_ishtp int340x_thermal_zone igen6_edac nls_iso8859_1 video intel_skl_int3472_tps68470 tps68470_regulator clk_tps68470 intel_pmc_core wmi pmt_telemetry pmt_class intel_skl_int3472_discrete intel_hid int3400_thermal sparse_keymap acpi_pad intel_vsec pinctrl_tigerlake acpi_tad intel_skl_int3472_common acpi_thermal_rel dm_multipath msr nvme_fabrics fuse efi_pstore nfnetlink ip_tables x_tables autofs4
<4> [160.044761] CPU: 5 UID: 0 PID: 1020 Comm: kworker/u64:9 Tainted: G U 6.14.0-rc7-xe+ #1
<4> [160.044763] Tainted: [U]=USER
<4> [160.044765] Hardware name: Intel Corporation Alder Lake Client Platform/AlderLake-P DDR5 RVP, BIOS RPLPFWI1.R00.4035.A00.2301200723 01/20/2023
<4> [160.044766] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [160.044772] RIP: 0010:guc_exec_queue_timedout_job+0x454/0xe00 [xe]
<4> [160.044835] Code: 0f b6 68 1c 48 89 95 78 ff ff ff e8 76 1d 45 e1 48 8b 95 78 ff ff ff 48 c7 c7 08 60 fd a0 48 89 c6 41 0f b6 cd e8 6c 64 71 e0 <0f> 0b 80 7d 88 00 0f 85 b8 03 00 00 49 8b 56 58 f6 c2 01 0f 85 94
<4> [160.044837] RSP: 0018:ffffc900023b7cc0 EFLAGS: 00010246
<4> [160.044839] RAX: 0000000000000000 RBX: ffff888142b5ba00 RCX: 0000000000000000
<4> [160.044841] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
<4> [160.044842] RBP: ffffc900023b7db0 R08: 0000000000000000 R09: 0000000000000000
<4> [160.044843] R10: 0000000000000000 R11: 0000000000000000 R12: ffff888154e33800
<4> [160.044844] R13: 0000000000000000 R14: ffff888151364c00 R15: ffff888147d80028
<4> [160.044845] FS: 0000000000000000(0000) GS:ffff88849f280000(0000) knlGS:0000000000000000
<4> [160.044847] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [160.044848] CR2: 00005c616366ffc8 CR3: 0000000003248001 CR4: 0000000000f72ef0
<4> [160.044850] PKRU: 55555554
<4> [160.044851] Call Trace:
<4> [160.044852] <TASK>
<4> [160.044854] ? show_regs+0x6c/0x80
<4> [160.044859] ? __warn+0x93/0x1c0
<4> [160.044863] ? guc_exec_queue_timedout_job+0x454/0xe00 [xe]
<4> [160.044924] ? report_bug+0x182/0x1b0
<4> [160.044929] ? handle_bug+0x6e/0xb0
<4> [160.044931] ? exc_invalid_op+0x18/0x80
<4> [160.044933] ? asm_exc_invalid_op+0x1b/0x20
<4> [160.044940] ? guc_exec_queue_timedout_job+0x454/0xe00 [xe]
<4> [160.044998] ? lock_acquire+0xc9/0x300
<4> [160.045002] ? find_held_lock+0x31/0x90
<4> [160.045005] ? __pfx_autoremove_wake_function+0x10/0x10
<4> [160.045010] drm_sched_job_timedout+0x91/0x130 [gpu_sched]
<4> [160.045015] process_one_work+0x21c/0x740
<4> [160.045021] worker_thread+0x1db/0x3c0
<4> [160.045024] ? __pfx_worker_thread+0x10/0x10
<4> [160.045026] kthread+0x10d/0x270
<4> [160.045029] ? __pfx_kthread+0x10/0x10
<4> [160.045032] ret_from_fork+0x44/0x70
<4> [160.045034] ? __pfx_kthread+0x10/0x10
<4> [160.045036] ret_from_fork_asm+0x1a/0x30
<4> [160.045043] </TASK>
<4> [160.045045] irq event stamp: 318093
<4> [160.045046] hardirqs last enabled at (318099): [<ffffffff814a2419>] __up_console_sem+0x79/0xa0
<4> [160.045049] hardirqs last disabled at (318104): [<ffffffff814a23fe>] __up_console_sem+0x5e/0xa0
<4> [160.045051] softirqs last enabled at (314932): [<ffffffff813d344f>] __irq_exit_rcu+0x13f/0x160
<4> [160.045053] softirqs last disabled at (314927): [<ffffffff813d344f>] __irq_exit_rcu+0x13f/0x160
<4> [160.045054] ---[ end trace 0000000000000000 ]---
<6> [160.045056] xe 0000:00:02.1: [drm] GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [160.045116] xe 0000:00:02.1: [drm] GT0: reset queued
<6> [160.045354] xe 0000:00:02.1: [drm] GT0: reset started
<7> [160.046547] xe 0000:00:02.1: [drm:xe_gt_sriov_vf_bootstrap [xe]] GT0: VF: using GuC interface version 0.1.17.0
<7> [160.047109] xe 0000:00:02.1: [drm:xe_gt_sriov_vf_bootstrap [xe]] GT0: VF: using GuC interface version 0.1.17.0
<7> [160.047599] xe 0000:00:02.1: [drm:xe_guc_ct_enable [xe]] GT0: GuC CT communication channel enabled
<7> [160.047965] xe 0000:00:02.0: [drm:xe_gt_sriov_pf_service_process_request [xe]] GT0: PF: VF1 negotiated ABI version 1.0
<7> [160.048261] xe 0000:00:02.1: [drm:xe_gt_sriov_vf_connect [xe]] GT0: VF: using VF/PF ABI 1.0
<6> [160.048634] xe 0000:00:02.1: [drm] GT0: reset done
<7> [160.049028] xe 0000:00:02.1: [drm:guc_exec_queue_timedout_job [xe]] GT0: Check job timeout: seqno=4294967169, lrc_seqno=4294967169, guc_id=1, running_time_ms=1, timeout_ms=5000, diff=0x00000075
<4> [160.049824] xe 0000:00:02.1: [drm] GT0: Check job timeout: seqno=4294967170, lrc_seqno=4294967170, guc_id=1, not started
<6> [160.051177] [IGT] xe_exec_balancer: finished subtest twice-parallel-rebind, SUCCESS
<6> [160.051352] [IGT] xe_exec_balancer: exiting, ret=0
Created at 2025-03-20 23:40:16
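
The abort message in the Out section reports the taint values (0x240, 0x200). As a reading aid, the sketch below decodes such a mask into the kernel's taint letters. The bit table follows the kernel's documented taint flags. The helper names `decode_taint` and `taint_letters` are hypothetical and not part of IGT. Treating the first value as the full taint mask and the second as the subset the runner considers fatal is an assumption about the runner's message format.

```python
# Minimal sketch: decode a Linux kernel taint bitmask into its flag letters.
# Bit positions and letters follow the kernel's documented taint flags;
# the function names here are illustrative only.
TAINT_FLAGS = {
    0: ("P", "proprietary module was loaded"),
    1: ("F", "module was force loaded"),
    2: ("S", "kernel running on an out of specification system"),
    3: ("R", "module was force unloaded"),
    4: ("M", "processor reported a Machine Check Exception"),
    5: ("B", "bad page referenced or unexpected page flags"),
    6: ("U", "taint requested by userspace application"),
    7: ("D", "kernel died recently (OOPS or BUG)"),
    8: ("A", "ACPI table overridden by user"),
    9: ("W", "kernel issued warning"),
    10: ("C", "staging driver was loaded"),
    11: ("I", "workaround for bug in platform firmware applied"),
    12: ("O", "externally-built module was loaded"),
    13: ("E", "unsigned module was loaded"),
    14: ("L", "soft lockup occurred"),
    15: ("K", "kernel has been live patched"),
    16: ("X", "auxiliary taint, defined for and used by distros"),
    17: ("T", "kernel was built with the struct randomization plugin"),
    18: ("N", "an in-kernel test has been run"),
}


def decode_taint(mask: int) -> list[tuple[str, str]]:
    """Return (letter, description) for each set bit with a documented flag."""
    return [TAINT_FLAGS[bit] for bit in sorted(TAINT_FLAGS) if mask & (1 << bit)]


def taint_letters(mask: int) -> str:
    """Return the taint letters for a mask, lowest bit first."""
    return "".join(letter for letter, _ in decode_taint(mask))


if __name__ == "__main__":
    # Values taken from the abort message above.
    for mask in (0x240, 0x200):
        print(hex(mask))
        for letter, description in decode_taint(mask):
            print(f"  {letter}: {description}")
```

Decoded this way, 0x240 is U plus W: the U flag matches the "Tainted: [U]=USER" line in dmesg, and W (0x200) matches the TAINT_WARN entry in the abort message, set by the WARNING at xe_guc_submit.c:1191.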