Result: 64 Warning(s)
i915_display_info24 igt_runner24 results24.json results24-xe-load.json guc_logs24.tar i915_display_info_post_exec24 boot24 dmesg24
| Detail | Value |
|---|---|
| Duration | unknown |
| Hostname |
shard-bmg-2 |
| Igt-Version |
IGT-Version: 2.3-g2020b0bf9 (x86_64) (Linux: 6.19.0-lgci-xe-xe-4576-cc2c646d39200973c-debug+ x86_64) |
| Out |
Using IGT_SRANDOM=1771529969 for randomisation Opened device: /dev/dri/card0 Starting subtest: compute-preempt-many-vram-evict Starting dynamic subtest: engine-DRM_XE_ENGINE_CLASS_COMPUTE Stack trace: Stack trace: Stack trace: Stack trace: Stack trace: Stack trace: Stack trace: #0 ../lib/igt_core.c:2075 __igt_fail_assert() #0 ../lib/igt_core.c:2075 __igt_fail_assert() #0 ../lib/igt_core.c:2075 __igt_fail_assert() #0 ../lib/igt_core.c:2075 __igt_fail_assert() #0 ../lib/igt_core.c:2075 __igt_fail_assert() #0 ../lib/igt_core.c:2075 __igt_fail_assert() #0 ../lib/igt_core.c:2075 __igt_fail_assert() #1 [xe_wait_ufence+0x57] #1 [xe_wait_ufence+0x57] #1 [xe_wait_ufence+0x57] #1 [xe_wait_ufence+0x57] #1 [xe_wait_ufence+0x57] #1 [xe_wait_ufence+0x57] #1 [xe_wait_ufence+0x57] #2 ../lib/intel_compute.c:418 bo_execenv_sync() #2 ../lib/intel_compute.c:418 bo_execenv_sync() #3 ../lib/intel_compute.c:2527 xe2lpg_compute_preempt_exec() #3 ../lib/intel_compute.c:2527 xe2lpg_compute_preempt_exec() #4 ../lib/intel_compute.c:2804 run_intel_compute_kernel_preempt() #4 ../lib/intel_compute.c:2804 run_intel_compute_kernel_preempt() #5 ../tests/intel/xe_compute_preempt.c:58 test_compute_preempt() #6 ../tests/intel/xe_compute_preempt.c:189 __igt_unique____real_main65() #7 ../tests/intel/xe_compute_preempt.c:65 main() #5 ../tests/intel/xe_compute_preempt.c:58 test_compute_preempt() #2 ../lib/intel_compute.c:418 bo_execenv_sync() #6 ../tests/intel/xe_compute_preempt.c:189 __igt_unique____real_main65() #7 ../tests/intel/xe_compute_preempt.c:65 main() #2 ../lib/intel_compute.c:418 bo_execenv_sync() #2 ../lib/intel_compute.c:418 bo_execenv_sync() #2 ../lib/intel_compute.c:418 bo_execenv_sync() #3 ../lib/intel_compute.c:2527 xe2lpg_compute_preempt_exec() #3 ../lib/intel_compute.c:2527 xe2lpg_compute_preempt_exec() #3 ../lib/intel_compute.c:2527 xe2lpg_compute_preempt_exec() #4 ../lib/intel_compute.c:2804 run_intel_compute_kernel_preempt() #2 ../lib/intel_compute.c:418 bo_execenv_sync() #3 ../lib/intel_compute.c:2527 xe2lpg_compute_preempt_exec() #4 ../lib/intel_compute.c:2804 run_intel_compute_kernel_preempt() #4 ../lib/intel_compute.c:2804 run_intel_compute_kernel_preempt() #4 ../lib/intel_compute.c:2804 run_intel_compute_kernel_preempt() #3 ../lib/intel_compute.c:2527 xe2lpg_compute_preempt_exec() #4 ../lib/intel_compute.c:2804 run_intel_compute_kernel_preempt() #5 ../tests/intel/xe_compute_preempt.c:58 test_compute_preempt() #5 ../tests/intel/xe_compute_preempt.c:58 test_compute_preempt() #6 ../tests/intel/xe_compute_preempt.c:189 __igt_unique____real_main65() #6 ../tests/intel/xe_compute_preempt.c:189 __igt_unique____real_main65() #7 ../tests/intel/xe_compute_preempt.c:65 main() #7 ../tests/intel/xe_compute_preempt.c:65 main() #5 ../tests/intel/xe_compute_preempt.c:58 test_compute_preempt() #6 ../tests/intel/xe_compute_preempt.c:189 __igt_unique____real_main65() #7 ../tests/intel/xe_compute_preempt.c:65 main() #5 ../tests/intel/xe_compute_preempt.c:58 test_compute_preempt() #6 ../tests/intel/xe_compute_preempt.c:189 __igt_unique____real_main65() #7 ../tests/intel/xe_compute_preempt.c:65 main() #5 ../tests/intel/xe_compute_preempt.c:58 test_compute_preempt() #6 ../tests/intel/xe_compute_preempt.c:189 __igt_unique____real_main65() #7 ../tests/intel/xe_compute_preempt.c:65 main() #8 [__libc_init_first+0x8a] #8 [__libc_init_first+0x8a] #8 [__libc_init_first+0x8a] #8 [__libc_init_first+0x8a] #8 [__libc_init_first+0x8a] #8 [__libc_init_first+0x8a] #8 [__libc_init_first+0x8a] #9 [__libc_start_main+0x8b] #9 [__libc_start_main+0x8b] #9 [__libc_start_main+0x8b] #9 [__libc_start_main+0x8b] #10 [_start+0x25] #10 [_start+0x25] #10 [_start+0x25] #10 [_start+0x25] #9 [__libc_start_main+0x8b] #9 [__libc_start_main+0x8b] #9 [__libc_start_main+0x8b] #10 [_start+0x25] #10 [_start+0x25] #10 [_start+0x25] Dynamic subtest engine-DRM_XE_ENGINE_CLASS_COMPUTE: FAIL (17.152s) Subtest compute-preempt-many-vram-evict: FAIL (17.154s) This test caused an abort condition: Kernel badly tainted (0x244, 0x200) (check dmesg for details): TAINT_WARN: WARN_ON has happened. |
| Err |
Starting subtest: compute-preempt-many-vram-evict Starting dynamic subtest: engine-DRM_XE_ENGINE_CLASS_COMPUTE (xe_compute_preempt:7532) xe/xe_ioctl-CRITICAL: Test assertion failure function xe_wait_ufence, file ../lib/xe/xe_ioctl.c:712: (xe_compute_preempt:7532) xe/xe_ioctl-CRITICAL: Failed assertion: __xe_wait_ufence(fd, addr, value, exec_queue, &timeout) == 0 (xe_compute_preempt:7532) xe/xe_ioctl-CRITICAL: Last errno: 5, Input/output error (xe_compute_preempt:7532) xe/xe_ioctl-CRITICAL: error: -5 != 0 (xe_compute_preempt:7537) xe/xe_ioctl-CRITICAL: Test assertion failure function xe_wait_ufence, file ../lib/xe/xe_ioctl.c:712: (xe_compute_preempt:7537) xe/xe_ioctl-CRITICAL: Failed assertion: __xe_wait_ufence(fd, addr, value, exec_queue, &timeout) == 0 (xe_compute_preempt:7537) xe/xe_ioctl-CRITICAL: Last errno: 5, Input/output error (xe_compute_preempt:7537) xe/xe_ioctl-CRITICAL: error: -5 != 0 (xe_compute_preempt:7527) xe/xe_ioctl-CRITICAL: Test assertion failure function xe_wait_ufence, file ../lib/xe/xe_ioctl.c:712: (xe_compute_preempt:7527) xe/xe_ioctl-CRITICAL: Failed assertion: __xe_wait_ufence(fd, addr, value, exec_queue, &timeout) == 0 (xe_compute_preempt:7527) xe/xe_ioctl-CRITICAL: Last errno: 5, Input/output error (xe_compute_preempt:7527) xe/xe_ioctl-CRITICAL: error: -5 != 0 (xe_compute_preempt:7572) xe/xe_ioctl-CRITICAL: Test assertion failure function xe_wait_ufence, file ../lib/xe/xe_ioctl.c:712: (xe_compute_preempt:7557) xe/xe_ioctl-CRITICAL: Test assertion failure function xe_wait_ufence, file ../lib/xe/xe_ioctl.c:712: (xe_compute_preempt:7557) xe/xe_ioctl-CRITICAL: Failed assertion: __xe_wait_ufence(fd, addr, value, exec_queue, &timeout) == 0 (xe_compute_preempt:7572) xe/xe_ioctl-CRITICAL: Failed assertion: __xe_wait_ufence(fd, addr, value, exec_queue, &timeout) == 0 (xe_compute_preempt:7557) xe/xe_ioctl-CRITICAL: Last errno: 5, Input/output error (xe_compute_preempt:7557) xe/xe_ioctl-CRITICAL: error: -5 != 0 (xe_compute_preempt:7572) xe/xe_ioctl-CRITICAL: Last errno: 5, Input/output error (xe_compute_preempt:7572) xe/xe_ioctl-CRITICAL: error: -5 != 0 (xe_compute_preempt:7547) xe/xe_ioctl-CRITICAL: Test assertion failure function xe_wait_ufence, file ../lib/xe/xe_ioctl.c:712: (xe_compute_preempt:7547) xe/xe_ioctl-CRITICAL: Failed assertion: __xe_wait_ufence(fd, addr, value, exec_queue, &timeout) == 0 (xe_compute_preempt:7567) xe/xe_ioctl-CRITICAL: Test assertion failure function xe_wait_ufence, file ../lib/xe/xe_ioctl.c:712: (xe_compute_preempt:7567) xe/xe_ioctl-CRITICAL: Failed assertion: __xe_wait_ufence(fd, addr, value, exec_queue, &timeout) == 0 (xe_compute_preempt:7547) xe/xe_ioctl-CRITICAL: Last errno: 5, Input/output error (xe_compute_preempt:7547) xe/xe_ioctl-CRITICAL: error: -5 != 0 (xe_compute_preempt:7567) xe/xe_ioctl-CRITICAL: Last errno: 5, Input/output error (xe_compute_preempt:7567) xe/xe_ioctl-CRITICAL: error: -5 != 0 Dynamic subtest engine-DRM_XE_ENGINE_CLASS_COMPUTE failed. Dynamic subtest engine-DRM_XE_ENGINE_CLASS_COMPUTE failed. **** DEBUG **** **** DEBUG **** (xe_compute_preempt:7526) DEBUG: VRAM: 12216, child count: 73 (xe_compute_preempt:7526) DEBUG: VRAM: 12216, child count: 73 (xe_compute_preempt:7537) intel_compute-DEBUG: [i: 0 name: instr state base] data: 0x75d586939000, addr: 83e0000, size: 1000 (xe_compute_preempt:7567) intel_compute-DEBUG: [i: 0 name: instr state base] data: 0x75d586939000, addr: 83e0000, size: 1000 (xe_compute_preempt:7537) intel_compute-DEBUG: [i: 1 name: dynamic state base] data: 0x75d584435000, addr: 300000, size: 100000 (xe_compute_preempt:7567) intel_compute-DEBUG: [i: 1 name: dynamic state base] data: 0x75d584435000, addr: 300000, size: 100000 (xe_compute_preempt:7537) intel_compute-DEBUG: [i: 2 name: surface state base] data: 0x75d584425000, addr: 200000, size: 10000 (xe_compute_preempt:7567) intel_compute-DEBUG: [i: 2 name: surface state base] data: 0x75d584425000, addr: 200000, size: 10000 (xe_compute_preempt:7537) intel_compute-DEBUG: [i: 3 name: indirect object base] data: 0x75d584415000, addr: 63d0000, size: 10000 (xe_compute_preempt:7567) intel_compute-DEBUG: [i: 3 name: indirect object base] data: 0x75d584415000, addr: 63d0000, size: 10000 (xe_compute_preempt:7537) intel_compute-DEBUG: [i: 4 name: addr input] data: 0x75d584405000, addr: 40000000, size: 10000 (xe_compute_preempt:7567) intel_compute-DEBUG: [i: 4 name: addr input] data: 0x75d584405000, addr: 40000000, size: 10000 Dynamic subtest engine-DRM_XE_ENGINE_CLASS_COMPUTE failed. (xe_compute_preempt:7537) intel_compute-DEBUG: [i: 5 name: addr output] data: 0x75d5843f5000, addr: 80000000, size: 10000 (xe_compute_preempt:7567) intel_compute-DEBUG: [i: 5 name: addr output] data: 0x75d5843f5000, addr: 80000000, size: 10000 **** DEBUG **** (xe_compute_preempt:7537) intel_compute-DEBUG: [i: 6 name: general state base] data: 0x75d5842f5000, addr: 6000000, size: 100000 (xe_compute_preempt:7567) intel_compute-DEBUG: [i: 6 name: general state base] data: 0x75d5842f5000, addr: 6000000, size: 100000 (xe_compute_preempt:7526) DEBUG: VRAM: 12216, child count: 73 (xe_compute_preempt:7537) intel_compute-DEBUG: [i: 7 name: binding table] data: 0x75d5842e5000, addr: 210000, size: 10000 (xe_compute_preempt:7567) intel_compute-DEBUG: [i: 7 name: binding table] data: 0x75d5842e5000, addr: 210000, size: 10000 (xe_compute_preempt:7572) intel_compute-DEBUG: [i: 0 name: instr state base] data: 0x75d586939000, addr: 83e0000, size: 1000 (xe_compute_preempt:7537) intel_compute-DEBUG: [i: 8 name: batch] data: 0x75d5842d5000, addr: 100000, size: 10000 (xe_compute_preempt:7567) intel_compute-DEBUG: [i: 8 name: batch] data: 0x75d5842d5000, addr: 100000, size: 10000 (xe_compute_preempt:7572) intel_compute-DEBUG: [i: 1 name: dynamic state base] data: 0x75d584435000, addr: 300000, size: 100000 (xe_compute_preempt:7537) intel_compute-DEBUG: [i: 9 name: state context data base] data: 0x75d57ded5000, addr: 9000000, size: 6400000 (xe_compute_preempt:7567) intel_compute-DEBUG: [i: 9 name: state context data base] data: 0x75d57ded5000, addr: 9000000, size: 6400000 (xe_compute_preempt:7572) intel_compute-DEBUG: [i: 2 name: surface state base] data: 0x75d584425000, addr: 200000, size: 10000 (xe_compute_preempt:7537) intel_compute-DEBUG: [i: 10 name: sip kernel] data: 0x75d57decd000, addr: 107ff0000, size: 8000 (xe_compute_preempt:7567) intel_compute-DEBUG: [i: 10 name: sip kernel] data: 0x75d57decd000, addr: 107ff0000, size: 8000 (xe_compute_preempt:7572) intel_compute-DEBUG: [i: 3 name: indirect object base] data: 0x75d584415000, addr: 63d0000, size: 10000 (xe_compute_preempt:7537) intel_compute-DEBUG: [i: 0 name: instr state base] data: 0x75d586812000, addr: 83e0000, size: 1000 (xe_compute_preempt:7567) intel_compute-DEBUG: [i: 0 name: instr state base] data: 0x75d586812000, addr: 83e0000, size: 1000 (xe_compute_preempt:7537) intel_compute-DEBUG: [i: 1 name: dynamic state base] data: 0x75d57ddcd000, addr: 300000, size: 100000 (xe_compute_preempt:7567) intel_compute-DEBUG: [i: 1 name: dynamic state base] data: 0x75d57ddcd000, addr: 300000, size: 100000 (xe_compute_preempt:7572) intel_compute-DEBUG: [i: 4 name: addr input] data: 0x75d584405000, addr: 40000000, size: 10000 (xe_compute_preempt:7537) intel_compute-DEBUG: [i: 2 name: surface state base] data: 0x75d57ddbd000, addr: 200000, size: 10000 (xe_compute_preempt:7567) intel_compute-DEBUG: [i: 2 name: surface state base] data: 0x75d57ddbd000, addr: 200000, size: 10000 (xe_compute_preempt:7572) intel_compute-DEBUG: [i: 5 name: addr output] data: 0x75d5843f5000, addr: 80000000, size: 10000 (xe_compute_preempt:7537) intel_compute-DEBUG: [i: 3 name: indirect object base] data: 0x75d57ddad000, addr: 63d0000, size: 10000 (xe_compute_preempt:7567) intel_compute-DEBUG: [i: 3 name: indirect object base] data: 0x75d57ddad000, addr: 63d0000, size: 10000 (xe_compute_preempt:7537) intel_compute-DEBUG: [i: 4 name: addr input] data: 0x75d57dd9d000, addr: 40000000, size: 10000 (xe_compute_preempt:7567) intel_compute-DEBUG: [i: 4 name: addr input] data: 0x75d57dd9d000, addr: 40000000, size: 10000 (xe_compute_preempt:7572) intel_compute-DEBUG: [i: 6 name: general state base] data: 0x75d5842f5000, addr: 6000000, size: 100000 (xe_compute_preempt:7537) intel_compute-DEBUG: [i: 5 name: addr output] data: 0x75d57dd8d000, addr: 80000000, size: 10000 (xe_compute_preempt:7567) intel_compute-DEBUG: [i: 5 name: addr output] data: 0x75d57dd8d000, addr: 80000000, size: 10000 (xe_compute_preempt:7572) intel_compute-DEBUG: [i: 7 name: binding table] data: 0x75d5842e5000, addr: 210000, size: 10000 (xe_compute_preempt:7537) intel_compute-DEBUG: [i: 6 name: general state base] data: 0x75d57dc8d000, addr: 6000000, size: 100000 (xe_compute_preempt:7567) intel_compute-DEBUG: [i: 6 name: general state base] data: 0x75d57dc8d000, addr: 6000000, size: 100000 (xe_compute_preempt:7537) intel_compute-DEBUG: [i: 7 name: binding table] data: 0x75d57dc7d000, addr: 210000, size: 10000 (xe_compute_preempt:7567) intel_compute-DEBUG: [i: 7 name: binding table] data: 0x75d57dc7d000, addr: 210000, size: 10000 (xe_compute_preempt:7572) intel_compute-DEBUG: [i: 8 name: batch] data: 0x75d5842d5000, addr: 100000, size: 10000 (xe_compute_preempt:7537) intel_compute-DEBUG: [i: 8 name: batch] data: 0x75d57dc6d000, addr: 100000, size: 10000 (xe_compute_preempt:7567) intel_compute-DEBUG: [i: 8 name: batch] data: 0x75d57dc6d000, addr: 100000, size: 10000 (xe_compute_preempt:7572) intel_compute-DEBUG: [i: 9 name: state context data base] data: 0x75d57ded5000, addr: 9000000, size: 6400000 (xe_compute_preempt:7567) intel_compute-DEBUG: [i: 9 name: state context data base] data: 0x75d57786d000, addr: 9000000, size: 6400000 (xe_compute_preempt:7537) intel_compute-DEBUG: [i: 9 name: state context data base] data: 0x75d57786d000, addr: 9000000, size: 6400000 (xe_compute_preempt:7572) intel_compute-DEBUG: [i: 10 name: sip kernel] data: 0x75d57decd000, addr: 107ff0000, size: 8000 (xe_compute_preempt:7567) intel_compute-DEBUG: [i: 10 name: sip kernel] data: 0x75d577865000, addr: 107ff0000, size: 8000 (xe_compute_preempt:7572) intel_compute-DEBUG: [i: 0 name: instr state base] data: 0x75d586812000, addr: 83e0000, size: 1000 (xe_compute_preempt:7537) intel_compute-DEBUG: [i: 10 name: sip kernel] data: 0x75d577865000, addr: 107ff0000, size: 8000 (xe_compute_preempt:7567) intel_compute-DEBUG: general state base: 6000000 (xe_compute_preempt:7572) intel_compute-DEBUG: [i: 1 name: dynamic state base] data: 0x75d57ddcd000, addr: 300000, size: 100000 (xe_compute_preempt:7567) intel_compute-DEBUG: surface state base: 200000 (xe_compute_preempt:7537) intel_compute-DEBUG: general state base: 6000000 (xe_compute_preempt:7572) intel_compute-DEBUG: [i: 2 name: surface state base] data: 0x75d57ddbd000, addr: 200000, size: 10000 (xe_compute_preempt:7567) intel_compute-DEBUG: dynamic state base: 300000 (xe_compute_preempt:7572) intel_compute-DEBUG: [i: 3 name: indirect object base] data: 0x75d57ddad000, addr: 63d0000, size: 10000 (xe_compute_preempt:7567) intel_compute-DEBUG: instruct base addr: 8000000 (xe_compute_preempt:7537) intel_compute-DEBUG: surface state base: 200000 (xe_compute_preempt:7572) intel_compute-DEBUG: [i: 4 name: addr input] data: 0x75d57dd9d000, addr: 40000000, size: 10000 (xe_compute_preempt:7567) intel_compute-DEBUG: bindless base addr: 200000 (xe_compute_preempt:7537) intel_compute-DEBUG: dynamic state base: 300000 (xe_compute_preempt:7572) intel_compute-DEBUG: [i: 5 name: addr output] data: 0x75d57dd8d000, addr: 80000000, size: 10000 (xe_compute_preempt:7567) intel_compute-DEBUG: state context data base addr: 9000000 (xe_compute_preempt:7537) intel_compute-DEBUG: instruct base addr: 8000000 Dynamic subtest engine-DRM_XE_ENGINE_CLASS_COMPUTE failed. (xe_compute_preempt:7572) intel_compute-DEBUG: [i: 6 name: general state base] data: 0x75d57dc8d000, addr: 6000000, size: 100000 (xe_compute_preempt:7567) intel_compute-DEBUG: offset indirect addr: 3d0000 (xe_compute_preempt:7537) intel_compute-DEBUG: bindless base addr: 200000 **** DEBUG **** (xe_compute_preempt:7572) intel_compute-DEBUG: [i: 7 name: binding table] data: 0x75d57dc7d000, addr: 210000, size: 10000 (xe_compute_preempt:7567) intel_compute-DEBUG: kernel start pointer: 3e0000 (xe_compute_preempt:7537) intel_compute-DEBUG: state context data base addr: 9000000 (xe_compute_preempt:7526) DEBUG: VRAM: 12216, child count: 73 (xe_compute_preempt:7567) intel_compute-DEBUG: sip start pointer: ffff0000 (xe_compute_preempt:7537) intel_compute-DEBUG: offset indirect addr: 3d0000 (xe_compute_preempt:7572) intel_compute-DEBUG: [i: 8 name: batch] data: 0x75d57dc6d000, addr: 100000, size: 10000 (xe_compute_preempt:7532) intel_compute-DEBUG: [i: 0 name: instr state base] data: 0x75d586939000, addr: 83e0000, size: 1000 (xe_compute_preempt:7567) intel_compute-DEBUG: general state base: 6000000 (xe_compute_preempt:7537) intel_compute-DEBUG: kernel start pointer: 3e0000 (xe_compute_preempt:7532) intel_compute-DEBUG: [i: 1 name: dynamic state base] data: 0x75d584435000, addr: 300000, size: 100000 (xe_compute_preempt:7567) intel_compute-DEBUG: surface state base: 200000 (xe_compute_preempt:7572) intel_compute-DEBUG: [i: 9 name: state context data base] data: 0x75d57786d000, addr: 9000000, size: 6400000 (xe_compute_preempt:7537) intel_compute-DEBUG: sip start pointer: ffff0000 (xe_compute_preempt:7532) intel_compute-DEBUG: [i: 2 name: surface state base] data: 0x75d584425000, addr: 200000, size: 10000 (xe_compute_preempt:7567) intel_compute-DEBUG: dynamic state base: 300000 (xe_compute_preempt:7572) intel_compute-DEBUG: [i: 10 name: sip kernel] data: 0x75d577865000, addr: 107ff0000, size: 8000 (xe_compute_preempt:7537) intel_compute-DEBUG: general state base: 6000000 (xe_compute_preempt:7532) intel_compute-DEBUG: [i: 3 name: indirect object base] data: 0x75d584415000, addr: 63d0000, size: 10000 (xe_compute_preempt:7567) intel_compute-DEBUG: instruct base addr: 8000000 (xe_compute_preempt:7572) intel_compute-DEBUG: general state base: 6000000 (xe_compute_preempt:7537) intel_compute-DEBUG: surface state base: 200000 (xe_compute_preempt:7532) intel_compute-DEBUG: [i: 4 name: addr input] data: 0x75d584405000, addr: 40000000, size: 10000 (xe_compute_preempt:7567) intel_compute-DEBUG: bindless base addr: 200000 (xe_compute_preempt:7572) intel_compute-DEBUG: surface state base: 200000 (xe_compute_preempt:7537) intel_compute-DEBUG: dynamic state base: 300000 (xe_compute_preempt:7532) intel_compute-DEBUG: [i: 5 name: addr output] data: 0x75d5843f5000, addr: 80000000, size: 10000 (xe_compute_preempt:7567) intel_compute-DEBUG: state context data base addr: 9000000 (xe_compute_preempt:7572) intel_compute-DEBUG: dynamic state base: 300000 (xe_compute_preempt:7537) intel_compute-DEBUG: instruct base addr: 8000000 (xe_compute_preempt:7532) intel_compute-DEBUG: [i: 6 name: general state base] data: 0x75d5842f5000, addr: 6000000, size: 100000 (xe_compute_preempt:7567) intel_compute-DEBUG: offset indirect addr: 3d0000 (xe_compute_preempt:7572) intel_compute-DEBUG: instruct base addr: 8000000 (xe_compute_preempt:7537) intel_compute-DEBUG: bindless base addr: 200000 (xe_compute_preempt:7532) intel_compute-DEBUG: [i: 7 name: binding table] data: 0x75d5842e5000, addr: 210000, size: 10000 (xe_compute_preempt:7567) intel_compute-DEBUG: kernel start pointer: 3e0000 (xe_compute_preempt:7572) intel_compute-DEBUG: bindless base addr: 200000 (xe_compute_preempt:7537) intel_compute-DEBUG: state context data base addr: 9000000 (xe_compute_preempt:7532) intel_compute-DEBUG: [i: 8 name: batch] data: 0x75d5842d5000, addr: 100000, size: 10000 (xe_compute_preempt:7567) intel_compute-DEBUG: sip start pointer: ffff0000 (xe_compute_preempt:7537) intel_compute-DEBUG: offset indirect addr: 3d0000 (xe_compute_preempt:7572) intel_compute-DEBUG: state context data base addr: 9000000 (xe_compute_preempt:7567) xe/xe_ioctl-CRITICAL: Test assertion failure function xe_wait_ufence, file ../lib/xe/xe_ioctl.c:712: (xe_compute_preempt:7532) intel_compute-DEBUG: [i: 9 name: state context data base] data: 0x75d57ded5000, addr: 9000000, size: 6400000 (xe_compute_preempt:7537) intel_compute-DEBUG: kernel start pointer: 3e0000 (xe_compute_preempt:7572) intel_compute-DEBUG: offset indirect addr: 3d0000 (xe_compute_preempt:7567) xe/xe_ioctl-CRITICAL: Failed assertion: __xe_wait_ufence(fd, addr, value, exec_queue, &timeout) == 0 (xe_compute_preempt:7537) intel_compute-DEBUG: sip start pointer: ffff0000 (xe_compute_preempt:7572) intel_compute-DEBUG: kernel start pointer: 3e0000 (xe_compute_preempt:7532) intel_compute-DEBUG: [i: 10 name: sip kernel] data: 0x75d57decd000, addr: 107ff0000, size: 8000 (xe_compute_preempt:7567) xe/xe_ioctl-CRITICAL: Last errno: 5, Input/output error (xe_compute_preempt:7537) xe/xe_ioctl-CRITICAL: Test assertion failure function xe_wait_ufence, file ../lib/xe/xe_ioctl.c:712: (xe_compute_preempt:7572) intel_compute-DEBUG: sip start pointer: ffff0000 (xe_compute_preempt:7567) xe/xe_ioctl-CRITICAL: error: -5 != 0 (xe_compute_preempt:7537) xe/xe_ioctl-CRITICAL: Failed assertion: __xe_wait_ufence(fd, addr, value, exec_queue, &timeout) == 0 (xe_compute_preempt:7532) intel_compute-DEBUG: [i: 0 name: instr state base] data: 0x75d586812000, addr: 83e0000, size: 1000 (xe_compute_preempt:7572) intel_compute-DEBUG: general state base: 6000000 (xe_compute_preempt:7567) igt_core-INFO: Stack trace: (xe_compute_preempt:7537) xe/xe_ioctl-CRITICAL: Last errno: 5, Input/output error (xe_compute_preempt:7532) intel_compute-DEBUG: [i: 1 name: dynamic state base] data: 0x75d57ddcd000, addr: 300000, size: 100000 (xe_compute_preempt:7572) intel_compute-DEBUG: surface state base: 200000 (xe_compute_preempt:7567) igt_core-INFO: #0 ../lib/igt_core.c:2075 __igt_fail_assert() (xe_compute_preempt:7537) xe/xe_ioctl-CRITICAL: error: -5 != 0 (xe_compute_preempt:7532) intel_compute-DEBUG: [i: 2 name: surface state base] data: 0x75d57ddbd000, addr: 200000, size: 10000 (xe_compute_preempt:7572) intel_compute-DEBUG: dynamic state base: 300000 (xe_compute_preempt:7567) igt_core-INFO: #1 [xe_wait_ufence+0x57] (xe_compute_preempt:7537) igt_core-INFO: Stack trace: (xe_compute_preempt:7532) intel_compute-DEBUG: [i: 3 name: indirect object base] data: 0x75d57ddad000, addr: 63d0000, size: 10000 (xe_compute_preempt:7572) intel_compute-DEBUG: instruct base addr: 8000000 (xe_compute_preempt:7567) igt_core-INFO: #2 ../lib/intel_compute.c:418 bo_execenv_sync() (xe_compute_preempt:7537) igt_core-INFO: #0 ../lib/igt_core.c:2075 __igt_fail_assert() (xe_compute_preempt:7532) intel_compute-DEBUG: [i: 4 name: addr input] data: 0x75d57dd9d000, addr: 40000000, size: 10000 (xe_compute_preempt:7567) igt_core-INFO: #3 ../lib/intel_compute.c:2527 xe2lpg_compute_preempt_exec() (xe_compute_preempt:7537) igt_core-INFO: #1 [xe_wait_ufence+0x57] (xe_compute_preempt:7572) intel_compute-DEBUG: bindless base addr: 200000 (xe_compute_preempt:7532) intel_compute-DEBUG: [i: 5 name: addr output] data: 0x75d57dd8d000, addr: 80000000, size: 10000 (xe_compute_preempt:7567) igt_core-INFO: #4 ../lib/intel_compute.c:2804 run_intel_compute_kernel_preempt() (xe_compute_preempt:7537) igt_core-INFO: #2 ../lib/intel_compute.c:418 bo_execenv_sync() (xe_compute_preempt:7572) intel_compute-DEBUG: state context data base addr: 9000000 (xe_compute_preempt:7532) intel_compute-DEBUG: [i: 6 name: general state base] data: 0x75d57dc8d000, addr: 6000000, size: 100000 (xe_compute_preempt:7567) igt_core-INFO: #5 ../tests/intel/xe_compute_preempt.c:58 test_compute_preempt() (xe_compute_preempt:7537) igt_core-INFO: #3 ../lib/intel_compute.c:2527 xe2lpg_compute_preempt_exec() (xe_compute_preempt:7572) intel_compute-DEBUG: offset indirect addr: 3d0000 (xe_compute_preempt:7567) igt_core-INFO: #6 ../tests/intel/xe_compute_preempt.c:189 __igt_unique____real_main65() (xe_compute_preempt:7532) intel_compute-DEBUG: [i: 7 name: binding table] data: 0x75d57dc7d000, addr: 210000, size: 10000 (xe_compute_preempt:7537) igt_core-INFO: #4 ../lib/intel_compute.c:2804 run_intel_compute_kernel_preempt() (xe_compute_preempt:7572) intel_compute-DEBUG: kernel start pointer: 3e0000 (xe_compute_preempt:7567) igt_core-INFO: #7 ../tests/intel/xe_compute_preempt.c:65 main() (xe_compute_preempt:7532) intel_compute-DEBUG: [i: 8 name: batch] data: 0x75d57dc6d000, addr: 100000, size: 10000 (xe_compute_preempt:7537) igt_core-INFO: #5 ../tests/intel/xe_compute_preempt.c:58 test_compute_preempt() (xe_compute_preempt:7572) intel_compute-DEBUG: sip start pointer: ffff0000 (xe_compute_preempt:7567) igt_core-INFO: #8 [__libc_init_first+0x8a] (xe_compute_preempt:7532) intel_compute-DEBUG: [i: 9 name: state context data base] data: 0x75d57786d000, addr: 9000000, size: 6400000 (xe_compute_preempt:7537) igt_core-INFO: #6 ../tests/intel/xe_compute_preempt.c:189 __igt_unique____real_main65() (xe_compute_preempt:7572) xe/xe_ioctl-CRITICAL: Test assertion failure function xe_wait_ufence, file ../lib/xe/xe_ioctl.c:712: (xe_compute_preempt:7567) igt_core-INFO: #9 [__libc_start_main+0x8b] (xe_compute_preempt:7537) igt_core-INFO: #7 ../tests/intel/xe_compute_preempt.c:65 main() (xe_compute_preempt:7572) xe/xe_ioctl-CRITICAL: Failed assertion: __xe_wait_ufence(fd, addr, value, exec_queue, &timeout) == 0 (xe_compute_preempt:7567) igt_core-INFO: #10 [_start+0x25] (xe_compute_preempt:7532) intel_compute-DEBUG: [i: 10 name: sip kernel] data: 0x75d577865000, addr: 107ff0000, size: 8000 (xe_compute_preempt:7537) igt_core-INFO: #8 [__libc_init_first+0x8a] (xe_compute_preempt:7572) xe/xe_ioctl-CRITICAL: Last errno: 5, Input/output error (xe_compute_preempt:7532) intel_compute-DEBUG: general state base: 6000000 **** END **** (xe_compute_preempt:7537) igt_core-INFO: #9 [__libc_start_main+0x8b] (xe_compute_preempt:7572) xe/xe_ioctl-CRITICAL: error: -5 != 0 (xe_compute_preempt:7537) igt_core-INFO: #10 [_start+0x25] (xe_compute_preempt:7532) intel_compute-DEBUG: surface state base: 200000 (xe_compute_preempt:7572) igt_core-INFO: Stack trace: **** END **** (xe_compute_preempt:7532) intel_compute-DEBUG: dynamic state base: 300000 (xe_compute_preempt:7572) igt_core-INFO: #0 ../lib/igt_core.c:2075 __igt_fail_assert() (xe_compute_preempt:7532) intel_compute-DEBUG: instruct base addr: 8000000 (xe_compute_preempt:7572) igt_core-INFO: #1 [xe_wait_ufence+0x57] (xe_compute_preempt:7572) igt_core-INFO: #2 ../lib/intel_compute.c:418 bo_execenv_sync() (xe_compute_preempt:7572) igt_core-INFO: #3 ../lib/intel_compute.c:2527 xe2lpg_compute_preempt_exec() (xe_compute_preempt:7532) intel_compute-DEBUG: bindless base addr: 200000 (xe_compute_preempt:7572) igt_core-INFO: #4 ../lib/intel_compute.c:2804 run_intel_compute_kernel_preempt() (xe_compute_preempt:7532) intel_compute-DEBUG: state context data base addr: 9000000 Dynamic subtest engine-DRM_XE_ENGINE_CLASS_COMPUTE failed. (xe_compute_preempt:7532) intel_compute-DEBUG: offset indirect addr: 3d0000 Dynamic subtest engine-DRM_XE_ENGINE_CLASS_COMPUTE failed. Dynamic subtest engine-DRM_XE_ENGINE_CLASS_COMPUTE failed. (xe_compute_preempt:7572) igt_core-INFO: #5 ../tests/intel/xe_compute_preempt.c:58 test_compute_preempt() Dynamic subtest engine-DRM_XE_ENGINE_CLASS_COMPUTE failed. **** DEBUG **** (xe_compute_preempt:7526) DEBUG: VRAM: 12216, child count: 73 **** END **** Dynamic subtest engine-DRM_XE_ENGINE_CLASS_COMPUTE: FAIL (17.152s) Subtest compute-preempt-many-vram-evict: FAIL (17.154s) |
| Dmesg |
<6> [306.136544] Console: switching to colour dummy device 80x25
<6> [306.136794] [IGT] xe_compute_preempt: executing
<7> [306.198919] xe 0000:03:00.0: [drm:drm_pagemap_shrinker_scan [drm_gpusvm_helper]] Shrinking dpagemap ffff8881697408d0.
<6> [306.215886] [IGT] xe_compute_preempt: starting subtest compute-preempt-many-vram-evict
<6> [306.216804] [IGT] xe_compute_preempt: starting dynamic subtest engine-DRM_XE_ENGINE_CLASS_COMPUTE
<3> [306.605150] DMAR: DRHD: handling fault status reg 3
<3> [306.605542] DMAR: [DMA Read NO_PASID] Request device [03:00.0] fault addr 0x6ddc8000 [fault reason 0x06] PTE Read access is not set
<4> [311.972123] ------------[ cut here ]------------
<4> [311.972137] xe 0000:03:00.0: [drm] Tile0: GT0: Unexpected engine class:instance 3:8 for context utilization
<4> [311.972147] WARNING: drivers/gpu/drm/xe/xe_lrc.c:2504 at xe_lrc_timestamp+0x191/0x440 [xe], CPU#10: kworker/u64:45/4935
<4> [311.972620] Modules linked in: pmt_crashlog snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp hid_generic eeepc_wmi spi_nor coretemp asus_wmi mei_pxp mtd mei_hdcp sparse_keymap platform_profile wmi_bmof kvm_intel snd_hda_intel usbhid snd_intel_dspcfg kvm hid snd_hda_codec irqbypass ghash_clmulni_intel snd_hda_core aesni_intel snd_hwdep video rapl r8169 snd_pcm binfmt_misc intel_cstate i2c_i801 snd_timer realtek i2c_mux spi_intel_pci snd spi_intel soundcore i2c_smbus mei_me idma64 mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry pinctrl_alderlake intel_vsec wmi acpi_pad acpi_tad dm_multipath msr nvme_fabrics fuse
<4> [311.972942] efi_pstore nfnetlink autofs4
<4> [311.972963] CPU: 10 UID: 0 PID: 4935 Comm: kworker/u64:45 Tainted: G S U 6.19.0-lgci-xe-xe-4576-cc2c646d39200973c-debug+ #1 PREEMPT(voluntary)
<4> [311.972977] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER
<4> [311.972983] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [311.972989] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [311.973061] RIP: 0010:xe_lrc_timestamp+0x1a2/0x440 [xe]
<4> [311.973473] Code: 26 44 0f b6 70 08 48 89 55 c8 e8 79 74 58 e1 48 89 c6 48 8d 3d 6f f7 37 00 41 54 48 8b 55 c8 41 0f b6 ce 45 89 e9 45 0f b6 c7 <67> 48 0f b9 3a 58 eb 92 41 0f b6 bf 90 0f 00 00 e8 59 c8 03 00 45
<4> [311.973482] RSP: 0018:ffffc9000fce7bf8 EFLAGS: 00010246
<4> [311.973495] RAX: ffffffffa11faba1 RBX: ffff8881146ffc80 RCX: 0000000000000000
<4> [311.973504] RDX: ffff888104f22410 RSI: ffffffffa11faba1 RDI: ffffffffa1004da0
<4> [311.973511] RBP: ffffc9000fce7c88 R08: 0000000000000000 R09: 0000000000000003
<4> [311.973518] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000008
<4> [311.973526] R13: 0000000000000003 R14: 0000000000000000 R15: 0000000000000000
<4> [311.973533] FS: 0000000000000000(0000) GS:ffff8888db1da000(0000) knlGS:0000000000000000
<4> [311.973541] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [311.973550] CR2: 000075d57dc5d000 CR3: 0000000003448004 CR4: 0000000000f72ef0
<4> [311.973559] PKRU: 55555554
<4> [311.973566] Call Trace:
<4> [311.973572] <TASK>
<4> [311.973585] ? enable_work+0x9d/0x100
<4> [311.973611] ? xe_lrc_start_seqno+0x2c/0x70 [xe]
<4> [311.974005] guc_exec_queue_timedout_job+0xf51/0x23e0 [xe]
<4> [311.974465] ? drm_sched_job_timedout+0x80/0x1a0 [gpu_sched]
<4> [311.974492] ? lock_release+0xce/0x280
<4> [311.974521] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [311.974549] process_one_work+0x22e/0x6b0
<4> [311.974579] worker_thread+0x1e8/0x3d0
<4> [311.974595] ? __pfx_worker_thread+0x10/0x10
<4> [311.974609] kthread+0x11f/0x250
<4> [311.974629] ? __pfx_kthread+0x10/0x10
<4> [311.974647] ret_from_fork+0x344/0x3a0
<4> [311.974660] ? __pfx_kthread+0x10/0x10
<4> [311.974677] ret_from_fork_asm+0x1a/0x30
<4> [311.974712] </TASK>
<4> [311.974719] irq event stamp: 218387
<4> [311.974726] hardirqs last enabled at (218393): [<ffffffff8149e939>] __up_console_sem+0x79/0xa0
<4> [311.974743] hardirqs last disabled at (218398): [<ffffffff8149e91e>] __up_console_sem+0x5e/0xa0
<4> [311.974756] softirqs last enabled at (218202): [<ffffffff813c968f>] __irq_exit_rcu+0x13f/0x160
<4> [311.974770] softirqs last disabled at (218197): [<ffffffff813c968f>] __irq_exit_rcu+0x13f/0x160
<4> [311.974783] ---[ end trace 0000000000000000 ]---
<7> [311.974795] xe 0000:03:00.0: [drm:guc_exec_queue_timedout_job [xe]] Tile0: GT0: Check job timeout: seqno=92812, lrc_seqno=92812, guc_id=0, running_time_ms=109842, timeout_ms=5000, diff=0x77b81d23
<6> [312.616336] xe 0000:03:00.0: [drm] Tile0: GT0: Engine reset: engine_class=bcs, logical_mask: 0x2, guc_id=0, state=0x209
<5> [312.616489] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=92812, lrc_seqno=92812, guc_id=0, flags=0x73 in no process [-1]
<7> [312.616514] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [312.616897] ------------[ cut here ]------------
<4> [312.616903] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [312.616910] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1612 at guc_exec_queue_timedout_job+0x13f5/0x23e0 [xe], CPU#10: kworker/u64:45/4935
<4> [312.617392] Modules linked in: pmt_crashlog snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp hid_generic eeepc_wmi spi_nor coretemp asus_wmi mei_pxp mtd mei_hdcp sparse_keymap platform_profile wmi_bmof kvm_intel snd_hda_intel usbhid snd_intel_dspcfg kvm hid snd_hda_codec irqbypass ghash_clmulni_intel snd_hda_core aesni_intel snd_hwdep video rapl r8169 snd_pcm binfmt_misc intel_cstate i2c_i801 snd_timer realtek i2c_mux spi_intel_pci snd spi_intel soundcore i2c_smbus mei_me idma64 mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry pinctrl_alderlake intel_vsec wmi acpi_pad acpi_tad dm_multipath msr nvme_fabrics fuse
<4> [312.617711] efi_pstore nfnetlink autofs4
<4> [312.617736] CPU: 10 UID: 0 PID: 4935 Comm: kworker/u64:45 Tainted: G S U W 6.19.0-lgci-xe-xe-4576-cc2c646d39200973c-debug+ #1 PREEMPT(voluntary)
<4> [312.617752] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [312.617760] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [312.617769] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [312.617798] RIP: 0010:guc_exec_queue_timedout_job+0x13fe/0x23e0 [xe]
<4> [312.618223] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 c5 47 5a e1 48 89 c6 48 8d 3d 8b ba 39 00 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 f5 ee ff ff 8b 70 08 49
<4> [312.618233] RSP: 0018:ffffc9000fce7c98 EFLAGS: 00010246
<4> [312.618245] RAX: ffffffffa11faba1 RBX: 0000000000000000 RCX: 0000000000000000
<4> [312.618254] RDX: ffff888104f22410 RSI: ffffffffa11faba1 RDI: ffffffffa1003d70
<4> [312.618262] RBP: ffffc9000fce7da8 R08: 0000000000000000 R09: 0000000000000000
<4> [312.618270] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [312.618277] R13: ffff888104f22410 R14: ffff8881042b8000 R15: 00000000ffffffc2
<4> [312.618285] FS: 0000000000000000(0000) GS:ffff8888db1da000(0000) knlGS:0000000000000000
<4> [312.618294] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [312.618303] CR2: 000075d57dc5d000 CR3: 0000000003448004 CR4: 0000000000f72ef0
<4> [312.618312] PKRU: 55555554
<4> [312.618319] Call Trace:
<4> [312.618326] <TASK>
<4> [312.618360] ? __pfx_autoremove_wake_function+0x10/0x10
<4> [312.618394] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [312.618423] process_one_work+0x22e/0x6b0
<4> [312.618455] worker_thread+0x1e8/0x3d0
<4> [312.618470] ? __pfx_worker_thread+0x10/0x10
<4> [312.618483] kthread+0x11f/0x250
<4> [312.618503] ? __pfx_kthread+0x10/0x10
<4> [312.618521] ret_from_fork+0x344/0x3a0
<4> [312.618534] ? __pfx_kthread+0x10/0x10
<4> [312.618551] ret_from_fork_asm+0x1a/0x30
<4> [312.618586] </TASK>
<4> [312.618594] irq event stamp: 219237
<4> [312.618601] hardirqs last enabled at (219243): [<ffffffff8149e939>] __up_console_sem+0x79/0xa0
<4> [312.618617] hardirqs last disabled at (219248): [<ffffffff8149e91e>] __up_console_sem+0x5e/0xa0
<4> [312.618630] softirqs last enabled at (219082): [<ffffffff813c968f>] __irq_exit_rcu+0x13f/0x160
<4> [312.618645] softirqs last disabled at (219077): [<ffffffff813c968f>] __irq_exit_rcu+0x13f/0x160
<4> [312.618658] ---[ end trace 0000000000000000 ]---
<6> [312.618670] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [312.619069] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<6> [312.619540] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<7> [312.619578] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [312.621779] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [312.621866] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [312.621959] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x00800000
<7> [312.622067] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x55555440
<7> [312.622152] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [312.622231] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [312.622312] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [312.622388] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [312.622475] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [312.622583] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7> [312.623587] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<7> [312.634435] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 10ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034
<7> [312.634923] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled
<7> [312.636940] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: flag:0x1
<7> [312.637222] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: mocs entries: 16
<7> [312.637450] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[0] 0x4000 0xc
<7> [312.637686] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[1] 0x4004 0x10c
<7> [312.637924] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[2] 0x4008 0x130
<7> [312.638188] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[3] 0x400c 0x13c
<7> [312.638426] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[4] 0x4010 0x100
<7> [312.638662] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[5] 0x4014 0x100
<7> [312.638901] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[6] 0x4018 0x100
<7> [312.639156] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[7] 0x401c 0x100
<7> [312.639379] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[8] 0x4020 0x100
<7> [312.639595] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[9] 0x4024 0x100
<7> [312.639808] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[10] 0x4028 0x100
<7> [312.640033] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[11] 0x402c 0x100
<7> [312.640234] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[12] 0x4030 0x100
<7> [312.640436] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[13] 0x4034 0x100
<7> [312.640639] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[14] 0x4038 0x100
<7> [312.640847] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[15] 0x403c 0x100
<7> [312.641051] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying rcs0 save-restore MMIOs
<7> [312.641273] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2050] = 0x10001000
<7> [312.641484] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20c4] = 0x3f7e0306
<7> [312.641685] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20d4] = 0xc080c080
<7> [312.641883] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d0] = 0x00006210
<7> [312.642091] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d4] = 0x000062a8
<7> [312.642278] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d8] = 0x1000dafc
<7> [312.642454] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24dc] = 0x1000db01
<7> [312.642632] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24e0] = 0x0000db1c
<7> [312.642811] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe194] = 0x00400040
<7> [312.642986] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe48c] = 0x02000200
<7> [312.643184] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe49c] = 0x40004000
<7> [312.643358] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4c4] = 0x10401040
<7> [312.643534] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4f0] = 0x00020002
<7> [312.643709] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe530] = 0x00000400
<7> [312.643870] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7c8] = 0x04002000
<7> [312.644033] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00009000
<7> [312.644194] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x14800] = 0x00020002
<7> [312.644354] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs0 save-restore MMIOs
<7> [312.644500] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22050] = 0x10001000
<7> [312.644645] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220c4] = 0x3f7e0306
<7> [312.644791] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220d4] = 0xc080c080
<7> [312.644936] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs8 save-restore MMIOs
<7> [312.645096] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee050] = 0x10001000
<7> [312.645237] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0c4] = 0x3f7e0306
<7> [312.645368] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0d4] = 0xc080c080
<7> [312.645505] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs0 save-restore MMIOs
<7> [312.645639] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a050] = 0x10001000
<7> [312.645768] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0c4] = 0x3f7e0308
<7> [312.645891] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0d4] = 0xc080c080
<7> [312.646018] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d0] = 0x1000dafc
<7> [312.646140] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d4] = 0x1000db01
<7> [312.646264] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d8] = 0x0000db1c
<7> [312.646391] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs1 save-restore MMIOs
<7> [312.646518] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c050] = 0x10001000
<7> [312.646633] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0c4] = 0x3f7e0308
<7> [312.646748] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0d4] = 0xc080c080
<7> [312.646864] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d0] = 0x1000dafc
<7> [312.646973] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d4] = 0x1000db01
<7> [312.647091] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d8] = 0x0000db1c
<7> [312.647203] xe 0000:03:00.0: [drm:xe_gt_apply_ccs_mode [xe]] Tile0: GT0: CCS_MODE=fff0fc0 config:00400000, num_engines:1, num_slices:2
<6> [312.649004] xe 0000:03:00.0: [drm] Tile0: GT0: reset done
<5> [312.649867] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=4294967169, lrc_seqno=4294967169, guc_id=4, flags=0x0 in xe_compute_pree [7526]
<7> [312.649870] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<7> [312.650376] xe 0000:03:00.0: [drm:xe_gt_sriov_pf_config_restart [xe]] PF: Tile0: GT0: pushed 0 skip 24 of 24 VFs configurations
<7> [312.650496] xe 0000:03:00.0: [drm:pf_worker_restart_func [xe]] PF: Tile0: GT0: restart completed
<5> [312.650609] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=4294967169, lrc_seqno=4294967169, guc_id=8, flags=0x0 in xe_compute_pree [7526]
<7> [312.650611] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<5> [312.651131] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=4294967169, lrc_seqno=4294967169, guc_id=12, flags=0x0 in xe_compute_pree [7526]
<7> [312.651134] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<5> [312.651225] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=4294967169, lrc_seqno=4294967169, guc_id=20, flags=0x0 in xe_compute_pree [7526]
<7> [312.651227] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<5> [312.651498] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=4294967169, lrc_seqno=4294967169, guc_id=28, flags=0x0 in xe_compute_pree [7526]
<7> [312.651500] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<5> [312.651568] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=4294967169, lrc_seqno=4294967169, guc_id=36, flags=0x0 in xe_compute_pree [7526]
<7> [312.651570] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<5> [312.651817] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=4294967169, lrc_seqno=4294967169, guc_id=40, flags=0x0 in xe_compute_pree [7526]
<7> [312.651819] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<5> [318.116336] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=92812, lrc_seqno=92812, guc_id=0, flags=0x73 in no process [-1]
<7> [318.116362] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [318.116735] ------------[ cut here ]------------
<4> [318.116741] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [318.116748] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1612 at guc_exec_queue_timedout_job+0x13f5/0x23e0 [xe], CPU#10: kworker/u64:45/4935
<4> [318.117220] Modules linked in: pmt_crashlog snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp hid_generic eeepc_wmi spi_nor coretemp asus_wmi mei_pxp mtd mei_hdcp sparse_keymap platform_profile wmi_bmof kvm_intel snd_hda_intel usbhid snd_intel_dspcfg kvm hid snd_hda_codec irqbypass ghash_clmulni_intel snd_hda_core aesni_intel snd_hwdep video rapl r8169 snd_pcm binfmt_misc intel_cstate i2c_i801 snd_timer realtek i2c_mux spi_intel_pci snd spi_intel soundcore i2c_smbus mei_me idma64 mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry pinctrl_alderlake intel_vsec wmi acpi_pad acpi_tad dm_multipath msr nvme_fabrics fuse
<4> [318.117540] efi_pstore nfnetlink autofs4
<4> [318.117564] CPU: 10 UID: 0 PID: 4935 Comm: kworker/u64:45 Tainted: G S U W 6.19.0-lgci-xe-xe-4576-cc2c646d39200973c-debug+ #1 PREEMPT(voluntary)
<4> [318.117581] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [318.117588] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [318.117596] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [318.117624] RIP: 0010:guc_exec_queue_timedout_job+0x13fe/0x23e0 [xe]
<4> [318.118041] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 c5 47 5a e1 48 89 c6 48 8d 3d 8b ba 39 00 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 f5 ee ff ff 8b 70 08 49
<4> [318.118052] RSP: 0018:ffffc9000fce7c98 EFLAGS: 00010246
<4> [318.118065] RAX: ffffffffa11faba1 RBX: 0000000000000000 RCX: 0000000000000000
<4> [318.118073] RDX: ffff888104f22410 RSI: ffffffffa11faba1 RDI: ffffffffa1003d70
<4> [318.118082] RBP: ffffc9000fce7da8 R08: 0000000000000000 R09: 0000000000000000
<4> [318.118090] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [318.118097] R13: ffff888104f22410 R14: ffff8881350c5018 R15: 00000000ffffffc2
<4> [318.118106] FS: 0000000000000000(0000) GS:ffff8888db1da000(0000) knlGS:0000000000000000
<4> [318.118114] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [318.118122] CR2: 000075d57dc5d000 CR3: 0000000003448004 CR4: 0000000000f72ef0
<4> [318.118132] PKRU: 55555554
<4> [318.118139] Call Trace:
<4> [318.118146] <TASK>
<4> [318.118173] ? drm_sched_job_timedout+0x80/0x1a0 [gpu_sched]
<4> [318.118200] ? lock_release+0xce/0x280
<4> [318.118231] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [318.118259] process_one_work+0x22e/0x6b0
<4> [318.118291] worker_thread+0x1e8/0x3d0
<4> [318.118306] ? __pfx_worker_thread+0x10/0x10
<4> [318.118319] kthread+0x11f/0x250
<4> [318.118339] ? __pfx_kthread+0x10/0x10
<4> [318.118358] ret_from_fork+0x344/0x3a0
<4> [318.118370] ? __pfx_kthread+0x10/0x10
<4> [318.118386] ret_from_fork_asm+0x1a/0x30
<4> [318.118420] </TASK>
<4> [318.118428] irq event stamp: 228877
<4> [318.118435] hardirqs last enabled at (228883): [<ffffffff8149e939>] __up_console_sem+0x79/0xa0
<4> [318.118451] hardirqs last disabled at (228888): [<ffffffff8149e91e>] __up_console_sem+0x5e/0xa0
<4> [318.118464] softirqs last enabled at (228716): [<ffffffff813c968f>] __irq_exit_rcu+0x13f/0x160
<4> [318.118479] softirqs last disabled at (228711): [<ffffffff813c968f>] __irq_exit_rcu+0x13f/0x160
<4> [318.118493] ---[ end trace 0000000000000000 ]---
<6> [318.118504] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [318.118900] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<6> [318.118938] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<7> [318.119378] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [318.120835] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [318.120945] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [318.121033] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x00800000
<7> [318.121114] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x55555440
<7> [318.121195] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [318.121271] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [318.121350] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [318.121428] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [318.121520] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [318.121631] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7> [318.122641] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<7> [318.133346] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 10ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034
<7> [318.133795] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled
<7> [318.135647] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: flag:0x1
<7> [318.135859] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: mocs entries: 16
<7> [318.136116] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[0] 0x4000 0xc
<7> [318.136330] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[1] 0x4004 0x10c
<7> [318.136541] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[2] 0x4008 0x130
<7> [318.136745] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[3] 0x400c 0x13c
<7> [318.136965] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[4] 0x4010 0x100
<7> [318.137157] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[5] 0x4014 0x100
<7> [318.137350] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[6] 0x4018 0x100
<7> [318.137543] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[7] 0x401c 0x100
<7> [318.137722] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[8] 0x4020 0x100
<7> [318.137908] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[9] 0x4024 0x100
<7> [318.138094] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[10] 0x4028 0x100
<7> [318.138277] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[11] 0x402c 0x100
<7> [318.138457] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[12] 0x4030 0x100
<7> [318.138630] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[13] 0x4034 0x100
<7> [318.138800] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[14] 0x4038 0x100
<7> [318.138987] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[15] 0x403c 0x100
<7> [318.139150] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying rcs0 save-restore MMIOs
<7> [318.139335] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2050] = 0x10001000
<7> [318.139512] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20c4] = 0x3f7e0306
<7> [318.139682] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20d4] = 0xc080c080
<7> [318.139849] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d0] = 0x00006210
<7> [318.140034] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d4] = 0x000062a8
<7> [318.140190] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d8] = 0x1000dafc
<7> [318.140339] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24dc] = 0x1000db01
<7> [318.140491] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24e0] = 0x0000db1c
<7> [318.140643] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe194] = 0x00400040
<7> [318.140790] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe48c] = 0x02000200
<7> [318.140948] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe49c] = 0x40004000
<7> [318.141090] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4c4] = 0x10401040
<7> [318.141229] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4f0] = 0x00020002
<7> [318.141375] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe530] = 0x00000400
<7> [318.141520] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7c8] = 0x04002000
<7> [318.141659] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00009000
<7> [318.141794] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x14800] = 0x00020002
<7> [318.141935] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs0 save-restore MMIOs
<7> [318.142064] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22050] = 0x10001000
<7> [318.142192] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220c4] = 0x3f7e0306
<7> [318.142312] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220d4] = 0xc080c080
<7> [318.142437] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs8 save-restore MMIOs
<7> [318.142559] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee050] = 0x10001000
<7> [318.142681] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0c4] = 0x3f7e0306
<7> [318.142799] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0d4] = 0xc080c080
<7> [318.142930] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs0 save-restore MMIOs
<7> [318.143046] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a050] = 0x10001000
<7> [318.143164] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0c4] = 0x3f7e0308
<7> [318.143281] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0d4] = 0xc080c080
<7> [318.143398] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d0] = 0x1000dafc
<7> [318.143511] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d4] = 0x1000db01
<7> [318.143618] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d8] = 0x0000db1c
<7> [318.143727] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs1 save-restore MMIOs
<7> [318.143835] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c050] = 0x10001000
<7> [318.143950] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0c4] = 0x3f7e0308
<7> [318.144053] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0d4] = 0xc080c080
<7> [318.144159] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d0] = 0x1000dafc
<7> [318.144261] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d4] = 0x1000db01
<7> [318.144363] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d8] = 0x0000db1c
<7> [318.144469] xe 0000:03:00.0: [drm:xe_gt_apply_ccs_mode [xe]] Tile0: GT0: CCS_MODE=fff0fc0 config:00400000, num_engines:1, num_slices:2
<6> [318.146390] xe 0000:03:00.0: [drm] Tile0: GT0: reset done
<7> [318.147380] xe 0000:03:00.0: [drm:xe_gt_sriov_pf_config_restart [xe]] PF: Tile0: GT0: pushed 0 skip 24 of 24 VFs configurations
<7> [318.147494] xe 0000:03:00.0: [drm:pf_worker_restart_func [xe]] PF: Tile0: GT0: restart completed
<7> [319.841664] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x2b2c2a2b
<7> [319.841814] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x2c2b2b2c
<6> [319.947910] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [319.947934] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [319.947941] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [319.947950] nvme 0000:05:00.0: [ 0] RxErr (First)
<5> [323.234987] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=92812, lrc_seqno=92812, guc_id=0, flags=0x73 in no process [-1]
<7> [323.235013] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [323.235395] ------------[ cut here ]------------
<4> [323.235402] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [323.235409] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1612 at guc_exec_queue_timedout_job+0x13f5/0x23e0 [xe], CPU#8: kworker/u64:45/4935
<4> [323.235873] Modules linked in: pmt_crashlog snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp hid_generic eeepc_wmi spi_nor coretemp asus_wmi mei_pxp mtd mei_hdcp sparse_keymap platform_profile wmi_bmof kvm_intel snd_hda_intel usbhid snd_intel_dspcfg kvm hid snd_hda_codec irqbypass ghash_clmulni_intel snd_hda_core aesni_intel snd_hwdep video rapl r8169 snd_pcm binfmt_misc intel_cstate i2c_i801 snd_timer realtek i2c_mux spi_intel_pci snd spi_intel soundcore i2c_smbus mei_me idma64 mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry pinctrl_alderlake intel_vsec wmi acpi_pad acpi_tad dm_multipath msr nvme_fabrics fuse
<4> [323.236463] efi_pstore nfnetlink autofs4
<4> [323.236503] CPU: 8 UID: 0 PID: 4935 Comm: kworker/u64:45 Tainted: G S U W 6.19.0-lgci-xe-xe-4576-cc2c646d39200973c-debug+ #1 PREEMPT(voluntary)
<4> [323.236527] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [323.236538] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [323.236550] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [323.236594] RIP: 0010:guc_exec_queue_timedout_job+0x13fe/0x23e0 [xe]
<4> [323.237282] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 c5 47 5a e1 48 89 c6 48 8d 3d 8b ba 39 00 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 f5 ee ff ff 8b 70 08 49
<4> [323.237297] RSP: 0018:ffffc9000fce7c98 EFLAGS: 00010246
<4> [323.237318] RAX: ffffffffa11faba1 RBX: 0000000000000000 RCX: 0000000000000000
<4> [323.237330] RDX: ffff888104f22410 RSI: ffffffffa11faba1 RDI: ffffffffa1003d70
<4> [323.237342] RBP: ffffc9000fce7da8 R08: 0000000000000000 R09: 0000000000000000
<4> [323.237353] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [323.237363] R13: ffff888104f22410 R14: ffff8881350c5018 R15: 00000000ffffffc2
<4> [323.237376] FS: 0000000000000000(0000) GS:ffff8888db0da000(0000) knlGS:0000000000000000
<4> [323.237389] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [323.237400] CR2: 000075d5842c5000 CR3: 0000000121ff6003 CR4: 0000000000f72ef0
<4> [323.237413] PKRU: 55555554
<4> [323.237423] Call Trace:
<4> [323.237432] <TASK>
<4> [323.237484] ? drm_sched_job_timedout+0x80/0x1a0 [gpu_sched]
<4> [323.237529] ? lock_release+0xce/0x280
<4> [323.237578] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [323.237626] process_one_work+0x22e/0x6b0
<4> [323.237682] worker_thread+0x1e8/0x3d0
<4> [323.237706] ? __pfx_worker_thread+0x10/0x10
<4> [323.237726] kthread+0x11f/0x250
<4> [323.237778] ? __pfx_kthread+0x10/0x10
<4> [323.237807] ret_from_fork+0x344/0x3a0
<4> [323.237825] ? __pfx_kthread+0x10/0x10
<4> [323.237849] ret_from_fork_asm+0x1a/0x30
<4> [323.237914] </TASK>
<4> [323.237924] irq event stamp: 239737
<4> [323.237934] hardirqs last enabled at (239743): [<ffffffff8149e939>] __up_console_sem+0x79/0xa0
<4> [323.237956] hardirqs last disabled at (239748): [<ffffffff8149e91e>] __up_console_sem+0x5e/0xa0
<4> [323.237973] softirqs last enabled at (239574): [<ffffffff813c968f>] __irq_exit_rcu+0x13f/0x160
<4> [323.237992] softirqs last disabled at (239567): [<ffffffff813c968f>] __irq_exit_rcu+0x13f/0x160
<4> [323.238010] ---[ end trace 0000000000000000 ]---
<5> [323.244367] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=92882, lrc_seqno=92882, guc_id=0, flags=0x73 in no process [-1]
<7> [323.244372] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [323.244443] ------------[ cut here ]------------
<4> [323.244444] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [323.244446] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1612 at guc_exec_queue_timedout_job+0x13f5/0x23e0 [xe], CPU#5: kworker/u64:39/4929
<4> [323.244519] Modules linked in: pmt_crashlog snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp hid_generic eeepc_wmi spi_nor coretemp asus_wmi mei_pxp mtd mei_hdcp sparse_keymap platform_profile wmi_bmof kvm_intel snd_hda_intel usbhid snd_intel_dspcfg kvm hid snd_hda_codec irqbypass ghash_clmulni_intel snd_hda_core aesni_intel snd_hwdep video rapl r8169 snd_pcm binfmt_misc intel_cstate i2c_i801 snd_timer realtek i2c_mux spi_intel_pci snd spi_intel soundcore i2c_smbus mei_me idma64 mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry pinctrl_alderlake intel_vsec wmi acpi_pad acpi_tad dm_multipath msr nvme_fabrics fuse
<4> [323.244580] efi_pstore nfnetlink autofs4
<4> [323.244586] CPU: 5 UID: 0 PID: 4929 Comm: kworker/u64:39 Tainted: G S U W 6.19.0-lgci-xe-xe-4576-cc2c646d39200973c-debug+ #1 PREEMPT(voluntary)
<4> [323.244589] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [323.244590] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [323.244591] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [323.244596] RIP: 0010:guc_exec_queue_timedout_job+0x13fe/0x23e0 [xe]
<4> [323.244667] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 c5 47 5a e1 48 89 c6 48 8d 3d 8b ba 39 00 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 f5 ee ff ff 8b 70 08 49
<4> [323.244668] RSP: 0018:ffffc9000fcb7c98 EFLAGS: 00010246
<4> [323.244671] RAX: ffffffffa11faba1 RBX: 0000000000000000 RCX: 0000000000000000
<4> [323.244672] RDX: ffff888104f22410 RSI: ffffffffa11faba1 RDI: ffffffffa1003d70
<4> [323.244673] RBP: ffffc9000fcb7da8 R08: 0000000000000000 R09: 0000000000000000
<4> [323.244674] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [323.244675] R13: ffff888104f22410 R14: ffff8881350c5018 R15: 00000000ffffffc2
<4> [323.244677] FS: 0000000000000000(0000) GS:ffff8888daf5a000(0000) knlGS:0000000000000000
<4> [323.244678] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [323.244679] CR2: 000000c000447000 CR3: 00000001327ae001 CR4: 0000000000f72ef0
<4> [323.244681] PKRU: 55555554
<4> [323.244682] Call Trace:
<4> [323.244683] <TASK>
<4> [323.244688] ? drm_sched_job_timedout+0x80/0x1a0 [gpu_sched]
<4> [323.244693] ? lock_release+0xce/0x280
<4> [323.244699] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [323.244704] process_one_work+0x22e/0x6b0
<4> [323.244710] worker_thread+0x1e8/0x3d0
<4> [323.244713] ? __pfx_worker_thread+0x10/0x10
<4> [323.244715] kthread+0x11f/0x250
<4> [323.244719] ? __pfx_kthread+0x10/0x10
<4> [323.244722] ret_from_fork+0x344/0x3a0
<4> [323.244725] ? __pfx_kthread+0x10/0x10
<4> [323.244727] ret_from_fork_asm+0x1a/0x30
<4> [323.244734] </TASK>
<4> [323.244735] irq event stamp: 668703
<4> [323.244736] hardirqs last enabled at (668709): [<ffffffff8149e939>] __up_console_sem+0x79/0xa0
<4> [323.244740] hardirqs last disabled at (668714): [<ffffffff8149e91e>] __up_console_sem+0x5e/0xa0
<4> [323.244742] softirqs last enabled at (666556): [<ffffffff813c968f>] __irq_exit_rcu+0x13f/0x160
<4> [323.244744] softirqs last disabled at (666551): [<ffffffff813c968f>] __irq_exit_rcu+0x13f/0x160
<4> [323.244747] ---[ end trace 0000000000000000 ]---
<6> [323.244756] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [323.244828] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<6> [323.244835] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<7> [323.244892] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [323.245535] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [323.245627] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [323.245725] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x00800000
<7> [323.245825] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x55555440
<7> [323.245910] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [323.245993] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [323.246081] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [323.246159] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [323.246249] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [323.246373] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7> [323.247521] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<7> [323.257758] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 9ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034
<7> [323.258025] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled
<7> [323.259442] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: flag:0x1
<7> [323.259518] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: mocs entries: 16
<7> [323.259589] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[0] 0x4000 0xc
<7> [323.259670] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[1] 0x4004 0x10c
<7> [323.259779] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[2] 0x4008 0x130
<7> [323.259859] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[3] 0x400c 0x13c
<7> [323.259939] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[4] 0x4010 0x100
<7> [323.260026] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[5] 0x4014 0x100
<7> [323.260116] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[6] 0x4018 0x100
<7> [323.260202] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[7] 0x401c 0x100
<7> [323.260274] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[8] 0x4020 0x100
<7> [323.260355] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[9] 0x4024 0x100
<7> [323.260425] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[10] 0x4028 0x100
<7> [323.260508] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[11] 0x402c 0x100
<7> [323.260583] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[12] 0x4030 0x100
<7> [323.260661] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[13] 0x4034 0x100
<7> [323.260731] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[14] 0x4038 0x100
<7> [323.260816] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[15] 0x403c 0x100
<7> [323.260902] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying rcs0 save-restore MMIOs
<7> [323.260981] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2050] = 0x10001000
<7> [323.261059] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20c4] = 0x3f7e0306
<7> [323.261133] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20d4] = 0xc080c080
<7> [323.261208] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d0] = 0x00006210
<7> [323.261283] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d4] = 0x000062a8
<7> [323.261357] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d8] = 0x1000dafc
<7> [323.261431] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24dc] = 0x1000db01
<7> [323.261503] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24e0] = 0x0000db1c
<7> [323.261581] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe194] = 0x00400040
<7> [323.261672] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe48c] = 0x02000200
<7> [323.261754] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe49c] = 0x40004000
<7> [323.261840] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4c4] = 0x10401040
<7> [323.261914] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4f0] = 0x00020002
<7> [323.262004] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe530] = 0x00000400
<7> [323.262090] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7c8] = 0x04002000
<7> [323.262181] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00009000
<7> [323.262264] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x14800] = 0x00020002
<7> [323.262340] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs0 save-restore MMIOs
<7> [323.262411] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22050] = 0x10001000
<7> [323.262484] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220c4] = 0x3f7e0306
<7> [323.262554] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220d4] = 0xc080c080
<7> [323.262628] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs8 save-restore MMIOs
<7> [323.262701] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee050] = 0x10001000
<7> [323.262783] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0c4] = 0x3f7e0306
<7> [323.262854] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0d4] = 0xc080c080
<7> [323.262930] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs0 save-restore MMIOs
<7> [323.263002] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a050] = 0x10001000
<7> [323.263073] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0c4] = 0x3f7e0308
<7> [323.263145] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0d4] = 0xc080c080
<7> [323.263217] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d0] = 0x1000dafc
<7> [323.263289] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d4] = 0x1000db01
<7> [323.263360] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d8] = 0x0000db1c
<7> [323.263435] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs1 save-restore MMIOs
<7> [323.263511] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c050] = 0x10001000
<7> [323.263586] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0c4] = 0x3f7e0308
<7> [323.263659] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0d4] = 0xc080c080
<7> [323.263734] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d0] = 0x1000dafc
<7> [323.263820] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d4] = 0x1000db01
<7> [323.263894] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d8] = 0x0000db1c
<7> [323.263968] xe 0000:03:00.0: [drm:xe_gt_apply_ccs_mode [xe]] Tile0: GT0: CCS_MODE=fff0fc0 config:00400000, num_engines:1, num_slices:2
<6> [323.265382] xe 0000:03:00.0: [drm] Tile0: GT0: reset done
<5> [323.265387] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=92882, lrc_seqno=92882, guc_id=0, flags=0x73 in no process [-1]
<7> [323.265389] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [323.265451] ------------[ cut here ]------------
<4> [323.265452] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [323.265454] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1612 at guc_exec_queue_timedout_job+0x13f5/0x23e0 [xe], CPU#5: kworker/u64:39/4929
<4> [323.265524] Modules linked in: pmt_crashlog snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp hid_generic eeepc_wmi spi_nor coretemp asus_wmi mei_pxp mtd mei_hdcp sparse_keymap platform_profile wmi_bmof kvm_intel snd_hda_intel usbhid snd_intel_dspcfg kvm hid snd_hda_codec irqbypass ghash_clmulni_intel snd_hda_core aesni_intel snd_hwdep video rapl r8169 snd_pcm binfmt_misc intel_cstate i2c_i801 snd_timer realtek i2c_mux spi_intel_pci snd spi_intel soundcore i2c_smbus mei_me idma64 mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry pinctrl_alderlake intel_vsec wmi acpi_pad acpi_tad dm_multipath msr nvme_fabrics fuse
<4> [323.265582] efi_pstore nfnetlink autofs4
<4> [323.265586] CPU: 5 UID: 0 PID: 4929 Comm: kworker/u64:39 Tainted: G S U W 6.19.0-lgci-xe-xe-4576-cc2c646d39200973c-debug+ #1 PREEMPT(voluntary)
<4> [323.265589] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [323.265590] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [323.265591] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [323.265596] RIP: 0010:guc_exec_queue_timedout_job+0x13fe/0x23e0 [xe]
<4> [323.265664] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 c5 47 5a e1 48 89 c6 48 8d 3d 8b ba 39 00 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 f5 ee ff ff 8b 70 08 49
<4> [323.265665] RSP: 0018:ffffc9000fcb7c98 EFLAGS: 00010246
<4> [323.265667] RAX: ffffffffa11faba1 RBX: 0000000000000000 RCX: 0000000000000000
<4> [323.265669] RDX: ffff888104f22410 RSI: ffffffffa11faba1 RDI: ffffffffa1003d70
<4> [323.265670] RBP: ffffc9000fcb7da8 R08: 0000000000000000 R09: 0000000000000000
<4> [323.265671] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [323.265672] R13: ffff888104f22410 R14: ffff8881350c5018 R15: 00000000ffffffc2
<4> [323.265673] FS: 0000000000000000(0000) GS:ffff8888daf5a000(0000) knlGS:0000000000000000
<4> [323.265675] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [323.265676] CR2: 000000c000447000 CR3: 00000001327ae001 CR4: 0000000000f72ef0
<4> [323.265677] PKRU: 55555554
<4> [323.265678] Call Trace:
<4> [323.265679] <TASK>
<4> [323.265684] ? drm_sched_job_timedout+0x80/0x1a0 [gpu_sched]
<4> [323.265689] ? lock_release+0xce/0x280
<4> [323.265694] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [323.265700] process_one_work+0x22e/0x6b0
<4> [323.265705] worker_thread+0x1e8/0x3d0
<4> [323.265708] ? __pfx_worker_thread+0x10/0x10
<4> [323.265710] kthread+0x11f/0x250
<4> [323.265714] ? __pfx_kthread+0x10/0x10
<4> [323.265717] ret_from_fork+0x344/0x3a0
<4> [323.265719] ? __pfx_kthread+0x10/0x10
<4> [323.265722] ret_from_fork_asm+0x1a/0x30
<4> [323.265728] </TASK>
<4> [323.265729] irq event stamp: 679411
<4> [323.265730] hardirqs last enabled at (679417): [<ffffffff8149e939>] __up_console_sem+0x79/0xa0
<4> [323.265734] hardirqs last disabled at (679422): [<ffffffff8149e91e>] __up_console_sem+0x5e/0xa0
<4> [323.265736] softirqs last enabled at (675942): [<ffffffff813c968f>] __irq_exit_rcu+0x13f/0x160
<4> [323.265738] softirqs last disabled at (675933): [<ffffffff813c968f>] __irq_exit_rcu+0x13f/0x160
<4> [323.265741] ---[ end trace 0000000000000000 ]---
<6> [323.265742] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [323.265830] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<6> [323.266806] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<7> [323.266933] xe 0000:03:00.0: [drm:xe_gt_sriov_pf_config_restart [xe]] PF: Tile0: GT0: pushed 0 skip 24 of 24 VFs configurations
<7> [323.267041] xe 0000:03:00.0: [drm:pf_worker_restart_func [xe]] PF: Tile0: GT0: restart completed
<7> [323.267426] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [323.268068] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [323.268158] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [323.268248] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x00800000
<7> [323.268342] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x55555440
<7> [323.268424] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [323.268510] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [323.268594] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [323.268674] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [323.268774] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [323.268898] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7> [323.270041] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<7> [323.280756] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 9ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034
<7> [323.281014] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled
<7> [323.282294] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: flag:0x1
<7> [323.282368] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: mocs entries: 16
<7> [323.282440] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[0] 0x4000 0xc
<7> [323.282512] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[1] 0x4004 0x10c
<7> [323.282585] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[2] 0x4008 0x130
<7> [323.282657] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[3] 0x400c 0x13c
<7> [323.282730] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[4] 0x4010 0x100
<7> [323.282814] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[5] 0x4014 0x100
<7> [323.282886] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[6] 0x4018 0x100
<7> [323.282958] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[7] 0x401c 0x100
<7> [323.283030] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[8] 0x4020 0x100
<7> [323.283101] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[9] 0x4024 0x100
<7> [323.283171] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[10] 0x4028 0x100
<7> [323.283241] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[11] 0x402c 0x100
<7> [323.283311] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[12] 0x4030 0x100
<7> [323.283381] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[13] 0x4034 0x100
<7> [323.283450] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[14] 0x4038 0x100
<7> [323.283520] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[15] 0x403c 0x100
<7> [323.283591] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying rcs0 save-restore MMIOs
<7> [323.283670] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2050] = 0x10001000
<7> [323.283751] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20c4] = 0x3f7e0306
<7> [323.283832] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20d4] = 0xc080c080
<7> [323.283910] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d0] = 0x00006210
<7> [323.283985] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d4] = 0x000062a8
<7> [323.284060] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d8] = 0x1000dafc
<7> [323.284158] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24dc] = 0x1000db01
<7> [323.284250] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24e0] = 0x0000db1c
<7> [323.284330] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe194] = 0x00400040
<7> [323.284407] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe48c] = 0x02000200
<7> [323.284483] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe49c] = 0x40004000
<7> [323.284559] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4c4] = 0x10401040
<7> [323.284632] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4f0] = 0x00020002
<7> [323.284714] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe530] = 0x00000400
<7> [323.284813] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7c8] = 0x04002000
<7> [323.284895] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00009000
<7> [323.284972] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x14800] = 0x00020002
<7> [323.285050] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs0 save-restore MMIOs
<7> [323.285123] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22050] = 0x10001000
<7> [323.285196] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220c4] = 0x3f7e0306
<7> [323.285268] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220d4] = 0xc080c080
<7> [323.285342] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs8 save-restore MMIOs
<7> [323.285414] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee050] = 0x10001000
<7> [323.285487] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0c4] = 0x3f7e0306
<7> [323.285559] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0d4] = 0xc080c080
<7> [323.285634] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs0 save-restore MMIOs
<7> [323.285707] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a050] = 0x10001000
<7> [323.285794] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0c4] = 0x3f7e0308
<7> [323.285870] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0d4] = 0xc080c080
<7> [323.285965] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d0] = 0x1000dafc
<7> [323.286038] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d4] = 0x1000db01
<7> [323.286112] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d8] = 0x0000db1c
<7> [323.286188] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs1 save-restore MMIOs
<7> [323.286261] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c050] = 0x10001000
<7> [323.286334] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0c4] = 0x3f7e0308
<7> [323.286406] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0d4] = 0xc080c080
<7> [323.286482] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d0] = 0x1000dafc
<7> [323.286556] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d4] = 0x1000db01
<7> [323.286628] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d8] = 0x0000db1c
<7> [323.286703] xe 0000:03:00.0: [drm:xe_gt_apply_ccs_mode [xe]] Tile0: GT0: CCS_MODE=fff0fc0 config:00400000, num_engines:1, num_slices:2
<6> [323.288074] xe 0000:03:00.0: [drm] Tile0: GT0: reset done
<5> [323.288936] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=92882, lrc_seqno=92882, guc_id=0, flags=0x73 in no process [-1]
<7> [323.288939] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [323.289007] ------------[ cut here ]------------
<4> [323.289009] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [323.289011] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1612 at guc_exec_queue_timedout_job+0x13f5/0x23e0 [xe], CPU#5: kworker/u64:39/4929
<4> [323.289089] Modules linked in: pmt_crashlog snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp hid_generic eeepc_wmi spi_nor coretemp asus_wmi mei_pxp mtd mei_hdcp sparse_keymap platform_profile wmi_bmof kvm_intel snd_hda_intel usbhid snd_intel_dspcfg kvm hid snd_hda_codec irqbypass ghash_clmulni_intel snd_hda_core aesni_intel snd_hwdep video rapl r8169 snd_pcm binfmt_misc intel_cstate i2c_i801 snd_timer realtek i2c_mux spi_intel_pci snd spi_intel soundcore i2c_smbus mei_me idma64 mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry pinctrl_alderlake intel_vsec wmi acpi_pad acpi_tad dm_multipath msr nvme_fabrics fuse
<4> [323.289151] efi_pstore nfnetlink autofs4
<4> [323.289155] CPU: 5 UID: 0 PID: 4929 Comm: kworker/u64:39 Tainted: G S U W 6.19.0-lgci-xe-xe-4576-cc2c646d39200973c-debug+ #1 PREEMPT(voluntary)
<4> [323.289158] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [323.289159] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [323.289160] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [323.289166] RIP: 0010:guc_exec_queue_timedout_job+0x13fe/0x23e0 [xe]
<4> [323.289234] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 c5 47 5a e1 48 89 c6 48 8d 3d 8b ba 39 00 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 f5 ee ff ff 8b 70 08 49
<4> [323.289236] RSP: 0018:ffffc9000fcb7c98 EFLAGS: 00010246
<4> [323.289238] RAX: ffffffffa11faba1 RBX: 0000000000000000 RCX: 0000000000000000
<4> [323.289240] RDX: ffff888104f22410 RSI: ffffffffa11faba1 RDI: ffffffffa1003d70
<4> [323.289241] RBP: ffffc9000fcb7da8 R08: 0000000000000000 R09: 0000000000000000
<4> [323.289242] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [323.289243] R13: ffff888104f22410 R14: ffff8881350c5018 R15: 00000000ffffffc2
<4> [323.289244] FS: 0000000000000000(0000) GS:ffff8888daf5a000(0000) knlGS:0000000000000000
<4> [323.289246] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [323.289247] CR2: 000000c000447000 CR3: 00000001327ae001 CR4: 0000000000f72ef0
<4> [323.289248] PKRU: 55555554
<4> [323.289249] Call Trace:
<4> [323.289250] <TASK>
<4> [323.289255] ? drm_sched_job_timedout+0x80/0x1a0 [gpu_sched]
<4> [323.289260] ? lock_release+0xce/0x280
<4> [323.289267] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [323.289272] process_one_work+0x22e/0x6b0
<4> [323.289278] worker_thread+0x1e8/0x3d0
<4> [323.289281] ? __pfx_worker_thread+0x10/0x10
<4> [323.289283] kthread+0x11f/0x250
<4> [323.289287] ? __pfx_kthread+0x10/0x10
<4> [323.289290] ret_from_fork+0x344/0x3a0
<4> [323.289293] ? __pfx_kthread+0x10/0x10
<4> [323.289295] ret_from_fork_asm+0x1a/0x30
<4> [323.289302] </TASK>
<4> [323.289303] irq event stamp: 690105
<4> [323.289304] hardirqs last enabled at (690111): [<ffffffff8149e939>] __up_console_sem+0x79/0xa0
<4> [323.289307] hardirqs last disabled at (690116): [<ffffffff8149e91e>] __up_console_sem+0x5e/0xa0
<4> [323.289310] softirqs last enabled at (686852): [<ffffffff813c968f>] __irq_exit_rcu+0x13f/0x160
<4> [323.289312] softirqs last disabled at (686845): [<ffffffff813c968f>] __irq_exit_rcu+0x13f/0x160
<4> [323.289315] ---[ end trace 0000000000000000 ]---
<6> [323.291619] xe 0000:03:00.0: [drm] exec queue reset detected
<6> [323.291659] xe 0000:03:00.0: [drm] exec queue reset detected
<6> [323.291838] xe 0000:03:00.0: [drm] exec queue reset detected
<6> [323.291867] xe 0000:03:00.0: [drm] exec queue reset detected
<6> [323.292429] xe 0000:03:00.0: [drm] exec queue reset detected
<6> [323.292456] xe 0000:03:00.0: [drm] exec queue reset detected
<6> [323.292509] xe 0000:03:00.0: [drm] exec queue reset detected
<6> [323.292557] xe 0000:03:00.0: [drm] exec queue reset detected
<6> [323.292582] xe 0000:03:00.0: [drm] exec queue reset detected
<6> [323.292606] xe 0000:03:00.0: [drm] exec queue reset detected
<6> [323.292633] xe 0000:03:00.0: [drm] exec queue reset detected
<6> [323.292665] xe 0000:03:00.0: [drm] exec queue reset detected
<6> [323.292682] xe 0000:03:00.0: [drm] exec queue reset detected
<6> [323.292708] xe 0000:03:00.0: [drm] exec queue reset detected
<7> [323.294701] xe 0000:03:00.0: [drm:xe_gt_sriov_pf_config_restart [xe]] PF: Tile0: GT0: pushed 0 skip 24 of 24 VFs configurations
<7> [323.294866] xe 0000:03:00.0: [drm:pf_worker_restart_func [xe]] PF: Tile0: GT0: restart completed
<5> [323.296346] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=92952, lrc_seqno=92952, guc_id=0, flags=0x73 in no process [-1]
<7> [323.296352] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [323.296449] ------------[ cut here ]------------
<4> [323.296451] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [323.296453] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1612 at guc_exec_queue_timedout_job+0x13f5/0x23e0 [xe], CPU#5: kworker/u64:39/4929
<4> [323.296552] Modules linked in: pmt_crashlog snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp hid_generic eeepc_wmi spi_nor coretemp asus_wmi mei_pxp mtd mei_hdcp sparse_keymap platform_profile wmi_bmof kvm_intel snd_hda_intel usbhid snd_intel_dspcfg kvm hid snd_hda_codec irqbypass ghash_clmulni_intel snd_hda_core aesni_intel snd_hwdep video rapl r8169 snd_pcm binfmt_misc intel_cstate i2c_i801 snd_timer realtek i2c_mux spi_intel_pci snd spi_intel soundcore i2c_smbus mei_me idma64 mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry pinctrl_alderlake intel_vsec wmi acpi_pad acpi_tad dm_multipath msr nvme_fabrics fuse
<4> [323.296616] efi_pstore nfnetlink autofs4
<4> [323.296620] CPU: 5 UID: 0 PID: 4929 Comm: kworker/u64:39 Tainted: G S U W 6.19.0-lgci-xe-xe-4576-cc2c646d39200973c-debug+ #1 PREEMPT(voluntary)
<4> [323.296623] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [323.296625] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [323.296626] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [323.296631] RIP: 0010:guc_exec_queue_timedout_job+0x13fe/0x23e0 [xe]
<4> [323.296721] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 c5 47 5a e1 48 89 c6 48 8d 3d 8b ba 39 00 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 f5 ee ff ff 8b 70 08 49
<4> [323.296723] RSP: 0018:ffffc9000fcb7c98 EFLAGS: 00010246
<4> [323.296725] RAX: ffffffffa11faba1 RBX: 0000000000000000 RCX: 0000000000000000
<4> [323.296727] RDX: ffff888104f22410 RSI: ffffffffa11faba1 RDI: ffffffffa1003d70
<4> [323.296728] RBP: ffffc9000fcb7da8 R08: 0000000000000000 R09: 0000000000000000
<4> [323.296729] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [323.296730] R13: ffff888104f22410 R14: ffff8881350c5018 R15: 00000000ffffffc2
<4> [323.296731] FS: 0000000000000000(0000) GS:ffff8888daf5a000(0000) knlGS:0000000000000000
<4> [323.296733] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [323.296734] CR2: 00005797f42cbcd8 CR3: 00000001f02ae003 CR4: 0000000000f72ef0
<4> [323.296735] PKRU: 55555554
<4> [323.296736] Call Trace:
<4> [323.296737] <TASK>
<4> [323.296743] ? drm_sched_job_timedout+0x80/0x1a0 [gpu_sched]
<4> [323.296760] ? lock_release+0xce/0x280
<4> [323.296768] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [323.296773] process_one_work+0x22e/0x6b0
<4> [323.296784] worker_thread+0x1e8/0x3d0
<4> [323.296789] ? __pfx_worker_thread+0x10/0x10
<4> [323.296793] kthread+0x11f/0x250
<4> [323.296798] ? __pfx_kthread+0x10/0x10
<4> [323.296804] ret_from_fork+0x344/0x3a0
<4> [323.296807] ? __pfx_kthread+0x10/0x10
<4> [323.296812] ret_from_fork_asm+0x1a/0x30
<4> [323.296818] </TASK>
<4> [323.296819] irq event stamp: 691403
<4> [323.296820] hardirqs last enabled at (691409): [<ffffffff8149e939>] __up_console_sem+0x79/0xa0
<4> [323.296824] hardirqs last disabled at (691414): [<ffffffff8149e91e>] __up_console_sem+0x5e/0xa0
<4> [323.296826] softirqs last enabled at (691332): [<ffffffff813c968f>] __irq_exit_rcu+0x13f/0x160
<4> [323.296829] softirqs last disabled at (691321): [<ffffffff813c968f>] __irq_exit_rcu+0x13f/0x160
<4> [323.296831] ---[ end trace 0000000000000000 ]---
<6> [323.296834] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [323.296905] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<6> [323.296913] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<7> [323.296929] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [323.297705] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [323.297812] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [323.297911] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x00800000
<7> [323.298023] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x55555440
<7> [323.298107] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [323.298200] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [323.298312] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [323.298401] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [323.298508] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [323.298636] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7> [323.299866] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<7> [323.309762] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 9ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034
<7> [323.310096] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled
<7> [323.311611] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: flag:0x1
<7> [323.311696] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: mocs entries: 16
<7> [323.311798] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[0] 0x4000 0xc
<7> [323.311882] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[1] 0x4004 0x10c
<7> [323.311967] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[2] 0x4008 0x130
<7> [323.312059] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[3] 0x400c 0x13c
<7> [323.312139] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[4] 0x4010 0x100
<7> [323.312223] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[5] 0x4014 0x100
<7> [323.312307] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[6] 0x4018 0x100
<7> [323.312390] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[7] 0x401c 0x100
<7> [323.312484] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[8] 0x4020 0x100
<7> [323.312565] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[9] 0x4024 0x100
<7> [323.312650] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[10] 0x4028 0x100
<7> [323.312742] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[11] 0x402c 0x100
<7> [323.312861] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[12] 0x4030 0x100
<7> [323.312942] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[13] 0x4034 0x100
<7> [323.313026] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[14] 0x4038 0x100
<7> [323.313108] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[15] 0x403c 0x100
<7> [323.313193] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying rcs0 save-restore MMIOs
<7> [323.313273] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2050] = 0x10001000
<7> [323.313349] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20c4] = 0x3f7e0306
<7> [323.313426] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20d4] = 0xc080c080
<7> [323.313503] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d0] = 0x00006210
<7> [323.313577] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d4] = 0x000062a8
<7> [323.313650] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d8] = 0x1000dafc
<7> [323.313720] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24dc] = 0x1000db01
<7> [323.313802] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24e0] = 0x0000db1c
<7> [323.313877] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe194] = 0x00400040
<7> [323.313953] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe48c] = 0x02000200
<7> [323.314031] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe49c] = 0x40004000
<7> [323.314125] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4c4] = 0x10401040
<7> [323.314205] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4f0] = 0x00020002
<7> [323.314293] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe530] = 0x00000400
<7> [323.314414] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7c8] = 0x04002000
<7> [323.314501] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00009000
<7> [323.314583] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x14800] = 0x00020002
<7> [323.314658] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs0 save-restore MMIOs
<7> [323.314731] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22050] = 0x10001000
<7> [323.314816] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220c4] = 0x3f7e0306
<7> [323.314888] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220d4] = 0xc080c080
<7> [323.314963] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs8 save-restore MMIOs
<7> [323.315037] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee050] = 0x10001000
<7> [323.315110] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0c4] = 0x3f7e0306
<7> [323.315182] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0d4] = 0xc080c080
<7> [323.315258] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs0 save-restore MMIOs
<7> [323.315330] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a050] = 0x10001000
<7> [323.315402] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0c4] = 0x3f7e0308
<7> [323.315473] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0d4] = 0xc080c080
<7> [323.315547] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d0] = 0x1000dafc
<7> [323.315618] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d4] = 0x1000db01
<7> [323.315690] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d8] = 0x0000db1c
<7> [323.315777] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs1 save-restore MMIOs
<7> [323.315869] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c050] = 0x10001000
<7> [323.315943] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0c4] = 0x3f7e0308
<7> [323.316018] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0d4] = 0xc080c080
<7> [323.316094] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d0] = 0x1000dafc
<7> [323.316166] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d4] = 0x1000db01
<7> [323.316239] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d8] = 0x0000db1c
<7> [323.316315] xe 0000:03:00.0: [drm:xe_gt_apply_ccs_mode [xe]] Tile0: GT0: CCS_MODE=fff0fc0 config:00400000, num_engines:1, num_slices:2
<6> [323.317679] xe 0000:03:00.0: [drm] Tile0: GT0: reset done
<7> [323.317908] xe 0000:03:00.0: [drm:xe_sync_entry_parse [xe]] Ioctl argument check failed at drivers/gpu/drm/xe/xe_sync.c:200: IS_ERR(sync->ufence)
<7> [323.317941] xe 0000:03:00.0: [drm:xe_sync_entry_parse [xe]] Ioctl argument check failed at drivers/gpu/drm/xe/xe_sync.c:200: IS_ERR(sync->ufence)
<7> [323.317940] xe 0000:03:00.0: [drm:xe_sync_entry_parse [xe]] Ioctl argument check failed at drivers/gpu/drm/xe/xe_sync.c:200: IS_ERR(sync->ufence)
<7> [323.317975] xe 0000:03:00.0: [drm:xe_sync_entry_parse [xe]] Ioctl argument check failed at drivers/gpu/drm/xe/xe_sync.c:200: IS_ERR(sync->ufence)
<7> [323.318458] xe 0000:03:00.0: [drm:xe_sync_entry_parse [xe]] Ioctl argument check failed at drivers/gpu/drm/xe/xe_sync.c:200: IS_ERR(sync->ufence)
<7> [323.318782] xe 0000:03:00.0: [drm:xe_sync_entry_parse [xe]] Ioctl argument check failed at drivers/gpu/drm/xe/xe_sync.c:200: IS_ERR(sync->ufence)
<7> [323.319007] xe 0000:03:00.0: [drm:xe_sync_entry_parse [xe]] Ioctl argument check failed at drivers/gpu/drm/xe/xe_sync.c:200: IS_ERR(sync->ufence)
<7> [323.319394] xe 0000:03:00.0: [drm:xe_gt_sriov_pf_config_restart [xe]] PF: Tile0: GT0: pushed 0 skip 24 of 24 VFs configurations
<7> [323.319429] xe 0000:03:00.0: [drm:xe_sync_entry_parse [xe]] Ioctl argument check failed at drivers/gpu/drm/xe/xe_sync.c:200: IS_ERR(sync->ufence)
<7> [323.319529] xe 0000:03:00.0: [drm:pf_worker_restart_func [xe]] PF: Tile0: GT0: restart completed
<5> [323.319656] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=92952, lrc_seqno=92952, guc_id=0, flags=0x73 in no process [-1]
<7> [323.319660] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [323.319736] ------------[ cut here ]------------
<4> [323.319738] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [323.319740] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1612 at guc_exec_queue_timedout_job+0x13f5/0x23e0 [xe], CPU#5: kworker/u64:39/4929
<4> [323.319874] Modules linked in: pmt_crashlog snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp hid_generic eeepc_wmi spi_nor coretemp asus_wmi mei_pxp mtd mei_hdcp sparse_keymap platform_profile wmi_bmof kvm_intel snd_hda_intel usbhid snd_intel_dspcfg kvm hid snd_hda_codec irqbypass ghash_clmulni_intel snd_hda_core aesni_intel snd_hwdep video rapl r8169 snd_pcm binfmt_misc intel_cstate i2c_i801 snd_timer realtek i2c_mux spi_intel_pci snd spi_intel soundcore i2c_smbus mei_me idma64 mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry pinctrl_alderlake intel_vsec wmi acpi_pad acpi_tad dm_multipath msr nvme_fabrics fuse
<4> [323.319975] efi_pstore nfnetlink autofs4
<4> [323.319983] CPU: 5 UID: 0 PID: 4929 Comm: kworker/u64:39 Tainted: G S U W 6.19.0-lgci-xe-xe-4576-cc2c646d39200973c-debug+ #1 PREEMPT(voluntary)
<4> [323.319987] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [323.319988] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [323.319990] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [323.319998] RIP: 0010:guc_exec_queue_timedout_job+0x13fe/0x23e0 [xe]
<4> [323.320101] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 c5 47 5a e1 48 89 c6 48 8d 3d 8b ba 39 00 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 f5 ee ff ff 8b 70 08 49
<4> [323.320104] RSP: 0018:ffffc9000fcb7c98 EFLAGS: 00010246
<4> [323.320108] RAX: ffffffffa11faba1 RBX: 0000000000000000 RCX: 0000000000000000
<4> [323.320110] RDX: ffff888104f22410 RSI: ffffffffa11faba1 RDI: ffffffffa1003d70
<4> [323.320113] RBP: ffffc9000fcb7da8 R08: 0000000000000000 R09: 0000000000000000
<4> [323.320115] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [323.320117] R13: ffff888104f22410 R14: ffff8881350c5018 R15: 00000000ffffffc2
<4> [323.320119] FS: 0000000000000000(0000) GS:ffff8888daf5a000(0000) knlGS:0000000000000000
<4> [323.320121] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [323.320123] CR2: 00005797f42cbcd8 CR3: 00000001f02ae003 CR4: 0000000000f72ef0
<4> [323.320126] PKRU: 55555554
<4> [323.320128] Call Trace:
<4> [323.320130] <TASK>
<4> [323.320140] ? drm_sched_job_timedout+0x80/0x1a0 [gpu_sched]
<4> [323.320148] ? lock_release+0xce/0x280
<4> [323.320158] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [323.320167] process_one_work+0x22e/0x6b0
<4> [323.320178] worker_thread+0x1e8/0x3d0
<4> [323.320182] ? __pfx_worker_thread+0x10/0x10
<4> [323.320186] kthread+0x11f/0x250
<4> [323.320192] ? __pfx_kthread+0x10/0x10
<4> [323.320197] ret_from_fork+0x344/0x3a0
<4> [323.320201] ? __pfx_kthread+0x10/0x10
<4> [323.320205] ret_from_fork_asm+0x1a/0x30
<4> [323.320217] </TASK>
<4> [323.320219] irq event stamp: 702165
<4> [323.320221] hardirqs last enabled at (702171): [<ffffffff8149e939>] __up_console_sem+0x79/0xa0
<4> [323.320226] hardirqs last disabled at (702176): [<ffffffff8149e91e>] __up_console_sem+0x5e/0xa0
<4> [323.320229] softirqs last enabled at (701422): [<ffffffff813c968f>] __irq_exit_rcu+0x13f/0x160
<4> [323.320232] softirqs last disabled at (701417): [<ffffffff813c968f>] __irq_exit_rcu+0x13f/0x160
<4> [323.320236] ---[ end trace 0000000000000000 ]---
<6> [323.320239] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [323.320346] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<5> [323.320364] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=4294967169, lrc_seqno=4294967169, guc_id=64, flags=0x0 in xe_compute_pree [7526]
<7> [323.320368] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<5> [323.320852] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=4294967169, lrc_seqno=4294967169, guc_id=93, flags=0x0 in xe_compute_pree [7526]
<7> [323.320855] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<5> [323.321142] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=4294967169, lrc_seqno=4294967169, guc_id=100, flags=0x0 in xe_compute_pree [7526]
<7> [323.321145] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<6> [323.321423] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<7> [323.321477] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [323.323859] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [323.323993] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [323.324113] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x00800000
<7> [323.324221] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x55555440
<7> [323.324326] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [323.324432] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [323.324535] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [323.324636] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [323.324759] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [323.324896] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7> [323.325785] xe 0000:03:00.0: [drm:xe_sync_entry_parse [xe]] Ioctl argument check failed at drivers/gpu/drm/xe/xe_sync.c:200: IS_ERR(sync->ufence)
<7> [323.326044] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<7> [323.336771] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 9ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034
<7> [323.337112] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled
<7> [323.338725] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: flag:0x1
<7> [323.338940] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: mocs entries: 16
<7> [323.339046] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[0] 0x4000 0xc
<7> [323.339179] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[1] 0x4004 0x10c
<7> [323.339274] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[2] 0x4008 0x130
<7> [323.339372] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[3] 0x400c 0x13c
<7> [323.339466] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[4] 0x4010 0x100
<7> [323.339585] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[5] 0x4014 0x100
<7> [323.339673] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[6] 0x4018 0x100
<7> [323.339805] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[7] 0x401c 0x100
<7> [323.339910] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[8] 0x4020 0x100
<7> [323.340016] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[9] 0x4024 0x100
<7> [323.340169] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[10] 0x4028 0x100
<7> [323.340268] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[11] 0x402c 0x100
<7> [323.340395] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[12] 0x4030 0x100
<7> [323.340515] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[13] 0x4034 0x100
<7> [323.340614] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[14] 0x4038 0x100
<7> [323.340740] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[15] 0x403c 0x100
<7> [323.340909] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying rcs0 save-restore MMIOs
<7> [323.341039] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2050] = 0x10001000
<7> [323.341150] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20c4] = 0x3f7e0306
<7> [323.341259] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20d4] = 0xc080c080
<7> [323.341375] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d0] = 0x00006210
<7> [323.341489] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d4] = 0x000062a8
<7> [323.341601] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d8] = 0x1000dafc
<7> [323.341714] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24dc] = 0x1000db01
<7> [323.341889] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24e0] = 0x0000db1c
<7> [323.342001] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe194] = 0x00400040
<7> [323.342171] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe48c] = 0x02000200
<7> [323.342282] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe49c] = 0x40004000
<7> [323.342393] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4c4] = 0x10401040
<7> [323.342503] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4f0] = 0x00020002
<7> [323.342619] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe530] = 0x00000400
<7> [323.342737] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7c8] = 0x04002000
<7> [323.342920] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00009000
<7> [323.343010] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x14800] = 0x00020002
<7> [323.343099] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs0 save-restore MMIOs
<7> [323.343184] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22050] = 0x10001000
<7> [323.343273] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220c4] = 0x3f7e0306
<7> [323.343353] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220d4] = 0xc080c080
<7> [323.343440] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs8 save-restore MMIOs
<7> [323.343528] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee050] = 0x10001000
<7> [323.343615] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0c4] = 0x3f7e0306
<7> [323.343785] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0d4] = 0xc080c080
<7> [323.343875] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs0 save-restore MMIOs
<7> [323.343957] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a050] = 0x10001000
<7> [323.344090] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0c4] = 0x3f7e0308
<7> [323.344177] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0d4] = 0xc080c080
<7> [323.344262] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d0] = 0x1000dafc
<7> [323.344344] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d4] = 0x1000db01
<7> [323.344426] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d8] = 0x0000db1c
<7> [323.344515] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs1 save-restore MMIOs
<7> [323.344684] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c050] = 0x10001000
<7> [323.344841] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0c4] = 0x3f7e0308
<7> [323.344925] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0d4] = 0xc080c080
<7> [323.345010] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d0] = 0x1000dafc
<7> [323.345092] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d4] = 0x1000db01
<7> [323.345172] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d8] = 0x0000db1c
<7> [323.345257] xe 0000:03:00.0: [drm:xe_gt_apply_ccs_mode [xe]] Tile0: GT0: CCS_MODE=fff0fc0 config:00400000, num_engines:1, num_slices:2
<6> [323.356579] xe 0000:03:00.0: [drm] Tile0: GT0: reset done
<5> [323.358180] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=92952, lrc_seqno=92952, guc_id=0, flags=0x73 in no process [-1]
<7> [323.358185] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [323.358271] ------------[ cut here ]------------
<4> [323.358273] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [323.358275] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1612 at guc_exec_queue_timedout_job+0x13f5/0x23e0 [xe], CPU#5: kworker/u64:39/4929
<4> [323.358355] Modules linked in: pmt_crashlog snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp hid_generic eeepc_wmi spi_nor coretemp asus_wmi mei_pxp mtd mei_hdcp sparse_keymap platform_profile wmi_bmof kvm_intel snd_hda_intel usbhid snd_intel_dspcfg kvm hid snd_hda_codec irqbypass ghash_clmulni_intel snd_hda_core aesni_intel snd_hwdep video rapl r8169 snd_pcm binfmt_misc intel_cstate i2c_i801 snd_timer realtek i2c_mux spi_intel_pci snd spi_intel soundcore i2c_smbus mei_me idma64 mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry pinctrl_alderlake intel_vsec wmi acpi_pad acpi_tad dm_multipath msr nvme_fabrics fuse
<4> [323.358441] efi_pstore nfnetlink autofs4
<4> [323.358448] CPU: 5 UID: 0 PID: 4929 Comm: kworker/u64:39 Tainted: G S U W 6.19.0-lgci-xe-xe-4576-cc2c646d39200973c-debug+ #1 PREEMPT(voluntary)
<4> [323.358452] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [323.358454] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [323.358456] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [323.358462] RIP: 0010:guc_exec_queue_timedout_job+0x13fe/0x23e0 [xe]
<4> [323.358540] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 c5 47 5a e1 48 89 c6 48 8d 3d 8b ba 39 00 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 f5 ee ff ff 8b 70 08 49
<4> [323.358543] RSP: 0018:ffffc9000fcb7c98 EFLAGS: 00010246
<4> [323.358546] RAX: ffffffffa11faba1 RBX: 0000000000000000 RCX: 0000000000000000
<4> [323.358548] RDX: ffff888104f22410 RSI: ffffffffa11faba1 RDI: ffffffffa1003d70
<4> [323.358550] RBP: ffffc9000fcb7da8 R08: 0000000000000000 R09: 0000000000000000
<4> [323.358551] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [323.358553] R13: ffff888104f22410 R14: ffff8881350c5018 R15: 00000000ffffffc2
<4> [323.358554] FS: 0000000000000000(0000) GS:ffff8888daf5a000(0000) knlGS:0000000000000000
<4> [323.358556] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [323.358558] CR2: 00005797f42cbcd8 CR3: 0000000125ff5004 CR4: 0000000000f72ef0
<4> [323.358559] PKRU: 55555554
<4> [323.358561] Call Trace:
<4> [323.358562] <TASK>
<4> [323.358569] ? drm_sched_job_timedout+0x80/0x1a0 [gpu_sched]
<4> [323.358576] ? lock_release+0xce/0x280
<4> [323.358584] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [323.358591] process_one_work+0x22e/0x6b0
<4> [323.358599] worker_thread+0x1e8/0x3d0
<4> [323.358603] ? __pfx_worker_thread+0x10/0x10
<4> [323.358606] kthread+0x11f/0x250
<4> [323.358610] ? __pfx_kthread+0x10/0x10
<4> [323.358614] ret_from_fork+0x344/0x3a0
<4> [323.358618] ? __pfx_kthread+0x10/0x10
<4> [323.358621] ret_from_fork_asm+0x1a/0x30
<4> [323.358630] </TASK>
<4> [323.358632] irq event stamp: 714369
<4> [323.358633] hardirqs last enabled at (714375): [<ffffffff8149e939>] __up_console_sem+0x79/0xa0
<4> [323.358637] hardirqs last disabled at (714380): [<ffffffff8149e91e>] __up_console_sem+0x5e/0xa0
<4> [323.358640] softirqs last enabled at (710576): [<ffffffff813c968f>] __irq_exit_rcu+0x13f/0x160
<4> [323.358643] softirqs last disabled at (710543): [<ffffffff813c968f>] __irq_exit_rcu+0x13f/0x160
<4> [323.358646] ---[ end trace 0000000000000000 ]---
<7> [323.365981] xe 0000:03:00.0: [drm:user_fence_worker [xe]] mmget_not_zero() failed, ufence wasn't signaled
<7> [323.366173] xe 0000:03:00.0: [drm:xe_gt_sriov_pf_config_restart [xe]] PF: Tile0: GT0: pushed 0 skip 24 of 24 VFs configurations
<7> [323.366283] xe 0000:03:00.0: [drm:pf_worker_restart_func [xe]] PF: Tile0: GT0: restart completed
<5> [323.366484] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=93022, lrc_seqno=93022, guc_id=0, flags=0x73 in no process [-1]
<7> [323.366487] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [323.366546] ------------[ cut here ]------------
<4> [323.366547] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [323.366548] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1612 at guc_exec_queue_timedout_job+0x13f5/0x23e0 [xe], CPU#5: kworker/u64:39/4929
<4> [323.366618] Modules linked in: pmt_crashlog snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp hid_generic eeepc_wmi spi_nor coretemp asus_wmi mei_pxp mtd mei_hdcp sparse_keymap platform_profile wmi_bmof kvm_intel snd_hda_intel usbhid snd_intel_dspcfg kvm hid snd_hda_codec irqbypass ghash_clmulni_intel snd_hda_core aesni_intel snd_hwdep video rapl r8169 snd_pcm binfmt_misc intel_cstate i2c_i801 snd_timer realtek i2c_mux spi_intel_pci snd spi_intel soundcore i2c_smbus mei_me idma64 mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry pinctrl_alderlake intel_vsec wmi acpi_pad acpi_tad dm_multipath msr nvme_fabrics fuse
<4> [323.366681] efi_pstore nfnetlink autofs4
<4> [323.366686] CPU: 5 UID: 0 PID: 4929 Comm: kworker/u64:39 Tainted: G S U W 6.19.0-lgci-xe-xe-4576-cc2c646d39200973c-debug+ #1 PREEMPT(voluntary)
<4> [323.366689] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [323.366690] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [323.366691] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [323.366696] RIP: 0010:guc_exec_queue_timedout_job+0x13fe/0x23e0 [xe]
<4> [323.366776] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 c5 47 5a e1 48 89 c6 48 8d 3d 8b ba 39 00 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 f5 ee ff ff 8b 70 08 49
<4> [323.366778] RSP: 0018:ffffc9000fcb7c98 EFLAGS: 00010246
<4> [323.366780] RAX: ffffffffa11faba1 RBX: 0000000000000000 RCX: 0000000000000000
<4> [323.366781] RDX: ffff888104f22410 RSI: ffffffffa11faba1 RDI: ffffffffa1003d70
<4> [323.366782] RBP: ffffc9000fcb7da8 R08: 0000000000000000 R09: 0000000000000000
<4> [323.366783] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [323.366784] R13: ffff888104f22410 R14: ffff8881350c5018 R15: 00000000ffffffc2
<4> [323.366786] FS: 0000000000000000(0000) GS:ffff8888daf5a000(0000) knlGS:0000000000000000
<4> [323.366787] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [323.366789] CR2: 00005797f42cbcd8 CR3: 0000000125ff5004 CR4: 0000000000f72ef0
<4> [323.366790] PKRU: 55555554
<4> [323.366791] Call Trace:
<4> [323.366793] <TASK>
<4> [323.366798] ? drm_sched_job_timedout+0x80/0x1a0 [gpu_sched]
<4> [323.366803] ? lock_release+0xce/0x280
<4> [323.366810] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [323.366815] process_one_work+0x22e/0x6b0
<4> [323.366822] worker_thread+0x1e8/0x3d0
<4> [323.366824] ? __pfx_worker_thread+0x10/0x10
<4> [323.366827] kthread+0x11f/0x250
<4> [323.366830] ? __pfx_kthread+0x10/0x10
<4> [323.366834] ret_from_fork+0x344/0x3a0
<4> [323.366836] ? __pfx_kthread+0x10/0x10
<4> [323.366839] ret_from_fork_asm+0x1a/0x30
<4> [323.366846] </TASK>
<4> [323.366847] irq event stamp: 718133
<4> [323.366848] hardirqs last enabled at (718139): [<ffffffff8149e939>] __up_console_sem+0x79/0xa0
<4> [323.366851] hardirqs last disabled at (718144): [<ffffffff8149e91e>] __up_console_sem+0x5e/0xa0
<4> [323.366853] softirqs last enabled at (717972): [<ffffffff813c968f>] __irq_exit_rcu+0x13f/0x160
<4> [323.366856] softirqs last disabled at (717963): [<ffffffff813c968f>] __irq_exit_rcu+0x13f/0x160
<4> [323.366859] ---[ end trace 0000000000000000 ]---
<6> [323.366860] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [323.366929] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<6> [323.366936] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<7> [323.367348] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [323.367956] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [323.368044] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [323.368132] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x00800000
<7> [323.368216] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x55555440
<7> [323.368298] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [323.368382] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [323.368466] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [323.368546] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [323.368638] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [323.368757] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<6> [323.369792] [IGT] xe_compute_preempt: finished subtest engine-DRM_XE_ENGINE_CLASS_COMPUTE, FAIL
<7> [323.369813] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<6> [323.370467] [IGT] xe_compute_preempt: finished subtest compute-preempt-many-vram-evict, FAIL
<7> [323.379951] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 9ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034
<7> [323.380214] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled
<7> [323.381442] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: flag:0x1
<7> [323.381525] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: mocs entries: 16
<7> [323.381598] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[0] 0x4000 0xc
<7> [323.381671] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[1] 0x4004 0x10c
<7> [323.381743] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[2] 0x4008 0x130
<7> [323.382056] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[3] 0x400c 0x13c
<7> [323.382131] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[4] 0x4010 0x100
<7> [323.382205] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[5] 0x4014 0x100
<7> [323.382275] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[6] 0x4018 0x100
<7> [323.382347] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[7] 0x401c 0x100
<7> [323.382418] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[8] 0x4020 0x100
<7> [323.382488] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[9] 0x4024 0x100
<7> [323.382560] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[10] 0x4028 0x100
<7> [323.382629] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[11] 0x402c 0x100
<7> [323.382701] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[12] 0x4030 0x100
<7> [323.382786] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[13] 0x4034 0x100
<7> [323.382874] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[14] 0x4038 0x100
<7> [323.382948] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[15] 0x403c 0x100
<7> [323.383021] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying rcs0 save-restore MMIOs
<7> [323.383105] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2050] = 0x10001000
<7> [323.383183] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20c4] = 0x3f7e0306
<7> [323.383260] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20d4] = 0xc080c080
<7> [323.383341] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d0] = 0x00006210
<7> [323.383420] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d4] = 0x000062a8
<7> [323.383496] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d8] = 0x1000dafc
<7> [323.383572] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24dc] = 0x1000db01
<7> [323.383647] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24e0] = 0x0000db1c
<7> [323.383726] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe194] = 0x00400040
<7> [323.383815] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe48c] = 0x02000200
<7> [323.383890] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe49c] = 0x40004000
<7> [323.383965] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4c4] = 0x10401040
<7> [323.384040] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4f0] = 0x00020002
<7> [323.384121] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe530] = 0x00000400
<7> [323.384203] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7c8] = 0x04002000
<7> [323.384287] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00009000
<7> [323.384369] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x14800] = 0x00020002
<7> [323.384447] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs0 save-restore MMIOs
<7> [323.384522] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22050] = 0x10001000
<7> [323.384595] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220c4] = 0x3f7e0306
<7> [323.384670] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220d4] = 0xc080c080
<7> [323.384754] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs8 save-restore MMIOs
<7> [323.384847] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee050] = 0x10001000
<7> [323.384942] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0c4] = 0x3f7e0306
<7> [323.385035] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0d4] = 0xc080c080
<7> [323.385143] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs0 save-restore MMIOs
<7> [323.385255] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a050] = 0x10001000
<7> [323.385363] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0c4] = 0x3f7e0308
<7> [323.385470] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0d4] = 0xc080c080
<7> [323.385582] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d0] = 0x1000dafc
<7> [323.385688] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d4] = 0x1000db01
<7> [323.385799] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d8] = 0x0000db1c
<7> [323.385889] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs1 save-restore MMIOs
<7> [323.385977] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c050] = 0x10001000
<7> [323.386069] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0c4] = 0x3f7e0308
<7> [323.386183] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0d4] = 0xc080c080
<7> [323.386300] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d0] = 0x1000dafc
<7> [323.386415] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d4] = 0x1000db01
<7> [323.386528] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d8] = 0x0000db1c
<7> [323.386641] xe 0000:03:00.0: [drm:xe_gt_apply_ccs_mode [xe]] Tile0: GT0: CCS_MODE=fff0fc0 config:00400000, num_engines:1, num_slices:2
<6> [323.387660] xe 0000:03:00.0: [drm] Tile0: GT0: reset done
<5> [323.388641] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=93022, lrc_seqno=93022, guc_id=0, flags=0x73 in no process [-1]
<7> [323.388646] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [323.388920] ------------[ cut here ]------------
<4> [323.388922] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [323.388923] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1612 at guc_exec_queue_timedout_job+0x13f5/0x23e0 [xe], CPU#7: kworker/u64:25/4915
<4> [323.389068] Modules linked in: pmt_crashlog snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp hid_generic eeepc_wmi spi_nor coretemp asus_wmi mei_pxp mtd mei_hdcp sparse_keymap platform_profile wmi_bmof kvm_intel snd_hda_intel usbhid snd_intel_dspcfg kvm hid snd_hda_codec irqbypass ghash_clmulni_intel snd_hda_core aesni_intel snd_hwdep video rapl r8169 snd_pcm binfmt_misc intel_cstate i2c_i801 snd_timer realtek i2c_mux spi_intel_pci snd spi_intel soundcore i2c_smbus mei_me idma64 mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry pinctrl_alderlake intel_vsec wmi acpi_pad acpi_tad dm_multipath msr nvme_fabrics fuse
<4> [323.389164] efi_pstore nfnetlink autofs4
<4> [323.389170] CPU: 7 UID: 0 PID: 4915 Comm: kworker/u64:25 Tainted: G S U W 6.19.0-lgci-xe-xe-4576-cc2c646d39200973c-debug+ #1 PREEMPT(voluntary)
<4> [323.389173] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [323.389175] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [323.389177] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [323.389184] RIP: 0010:guc_exec_queue_timedout_job+0x13fe/0x23e0 [xe]
<4> [323.389267] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 c5 47 5a e1 48 89 c6 48 8d 3d 8b ba 39 00 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 f5 ee ff ff 8b 70 08 49
<4> [323.389269] RSP: 0018:ffffc9000fc57c98 EFLAGS: 00010246
<4> [323.389272] RAX: ffffffffa11faba1 RBX: 0000000000000000 RCX: 0000000000000000
<4> [323.389273] RDX: ffff888104f22410 RSI: ffffffffa11faba1 RDI: ffffffffa1003d70
<4> [323.389275] RBP: ffffc9000fc57da8 R08: 0000000000000000 R09: 0000000000000000
<4> [323.389276] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [323.389277] R13: ffff888104f22410 R14: ffff8881350c5018 R15: 00000000ffffffc2
<4> [323.389279] FS: 0000000000000000(0000) GS:ffff8888db05a000(0000) knlGS:0000000000000000
<4> [323.389281] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [323.389282] CR2: 000075d58454e160 CR3: 0000000121ff6004 CR4: 0000000000f72ef0
<4> [323.389284] PKRU: 55555554
<4> [323.389285] Call Trace:
<4> [323.389287] <TASK>
<4> [323.389294] ? drm_sched_job_timedout+0x80/0x1a0 [gpu_sched]
<4> [323.389300] ? lock_release+0xce/0x280
<4> [323.389307] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [323.389313] process_one_work+0x22e/0x6b0
<4> [323.389321] worker_thread+0x1e8/0x3d0
<4> [323.389324] ? __pfx_worker_thread+0x10/0x10
<4> [323.389327] kthread+0x11f/0x250
<4> [323.389332] ? __pfx_kthread+0x10/0x10
<4> [323.389337] ret_from_fork+0x344/0x3a0
<4> [323.389340] ? __pfx_kthread+0x10/0x10
<4> [323.389345] ret_from_fork_asm+0x1a/0x30
<4> [323.389356] </TASK>
<4> [323.389358] irq event stamp: 460095
<4> [323.389360] hardirqs last enabled at (460101): [<ffffffff8149e939>] __up_console_sem+0x79/0xa0
<4> [323.389364] hardirqs last disabled at (460106): [<ffffffff8149e91e>] __up_console_sem+0x5e/0xa0
<4> [323.389367] softirqs last enabled at (459334): [<ffffffff813c968f>] __irq_exit_rcu+0x13f/0x160
<4> [323.389371] softirqs last disabled at (459325): [<ffffffff813c968f>] __irq_exit_rcu+0x13f/0x160
<4> [323.389375] ---[ end trace 0000000000000000 ]---
<6> [323.389377] xe 0000:03:00.0: [drm] Tile0: GT0: trying reset from guc_exec_queue_timedout_job [xe]
<6> [323.389450] xe 0000:03:00.0: [drm] Tile0: GT0: reset queued
<7> [323.389674] xe 0000:03:00.0: [drm:xe_gt_sriov_pf_config_restart [xe]] PF: Tile0: GT0: pushed 0 skip 24 of 24 VFs configurations
<7> [323.389793] xe 0000:03:00.0: [drm:pf_worker_restart_func [xe]] PF: Tile0: GT0: restart completed
<6> [323.389963] xe 0000:03:00.0: [drm] Tile0: GT0: reset started
<7> [323.390209] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel stopped
<7> [323.391668] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying GT save-restore MMIOs
<7> [323.391785] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x4148] = 0x00000000
<7> [323.391895] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x8828] = 0x00800000
<7> [323.392112] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb0c8] = 0x55555440
<7> [323.392219] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb104] = 0x08104440
<7> [323.392315] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb108] = 0x30200000
<7> [323.392420] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xb158] = 0x0000007f
<7> [323.392520] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
<7> [323.392639] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [6144K, 832K)
<7> [323.392758] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel disabled
<7> [323.393970] xe 0000:03:00.0: [drm:xe_guc_ads_populate [xe]] Tile0: GT0: Updated ADS capture size 20480 (was 49152)
<7> [323.403756] xe 0000:03:00.0: [drm:guc_wait_ucode [xe]] Tile0: GT0: GuC load: init took 9ms, freq = 2150MHz (req = 2133MHz), before = 2150MHz, status = 0x8002F034
<7> [323.404024] xe 0000:03:00.0: [drm:guc_ct_change_state [xe]] Tile0: GT0: GuC CT communication channel enabled
<7> [323.405375] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: flag:0x1
<7> [323.405452] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: mocs entries: 16
<7> [323.405519] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[0] 0x4000 0xc
<7> [323.405586] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[1] 0x4004 0x10c
<7> [323.405651] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[2] 0x4008 0x130
<7> [323.405716] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[3] 0x400c 0x13c
<7> [323.405794] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[4] 0x4010 0x100
<7> [323.405878] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[5] 0x4014 0x100
<7> [323.405948] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[6] 0x4018 0x100
<7> [323.406019] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[7] 0x401c 0x100
<7> [323.406089] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[8] 0x4020 0x100
<7> [323.406161] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[9] 0x4024 0x100
<7> [323.406230] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[10] 0x4028 0x100
<7> [323.406302] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[11] 0x402c 0x100
<7> [323.406375] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[12] 0x4030 0x100
<7> [323.406446] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[13] 0x4034 0x100
<7> [323.406517] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[14] 0x4038 0x100
<7> [323.406587] xe 0000:03:00.0: [drm:xe_mocs_init [xe]] Tile0: GT0: GLOB_MOCS[15] 0x403c 0x100
<7> [323.406658] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying rcs0 save-restore MMIOs
<7> [323.406737] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x2050] = 0x10001000
<7> [323.406852] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20c4] = 0x3f7e0306
<7> [323.406947] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x20d4] = 0xc080c080
<7> [323.407024] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d0] = 0x00006210
<7> [323.407098] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d4] = 0x000062a8
<7> [323.407175] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24d8] = 0x1000dafc
<7> [323.407251] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24dc] = 0x1000db01
<7> [323.407324] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x24e0] = 0x0000db1c
<7> [323.407399] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe194] = 0x00400040
<7> [323.407476] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe48c] = 0x02000200
<7> [323.407551] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe49c] = 0x40004000
<7> [323.407625] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4c4] = 0x10401040
<7> [323.407699] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe4f0] = 0x00020002
<7> [323.407808] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe530] = 0x00000400
<7> [323.407915] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7c8] = 0x04002000
<7> [323.407997] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0xe7cc] = 0x00009000
<7> [323.408074] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x14800] = 0x00020002
<7> [323.408150] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs0 save-restore MMIOs
<7> [323.408223] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x22050] = 0x10001000
<7> [323.408318] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220c4] = 0x3f7e0306
<7> [323.408419] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x220d4] = 0xc080c080
<7> [323.408525] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying bcs8 save-restore MMIOs
<7> [323.408630] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee050] = 0x10001000
<7> [323.408727] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0c4] = 0x3f7e0306
<7> [323.408865] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x3ee0d4] = 0xc080c080
<7> [323.408986] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs0 save-restore MMIOs
<7> [323.409083] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a050] = 0x10001000
<7> [323.409159] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0c4] = 0x3f7e0308
<7> [323.409235] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a0d4] = 0xc080c080
<7> [323.409312] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d0] = 0x1000dafc
<7> [323.409384] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d4] = 0x1000db01
<7> [323.409455] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1a4d8] = 0x0000db1c
<7> [323.409530] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: Applying ccs1 save-restore MMIOs
<7> [323.409603] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c050] = 0x10001000
<7> [323.409676] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0c4] = 0x3f7e0308
<7> [323.409759] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c0d4] = 0xc080c080
<7> [323.409832] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d0] = 0x1000dafc
<7> [323.409906] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d4] = 0x1000db01
<7> [323.409980] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Tile0: GT0: REG[0x1c4d8] = 0x0000db1c
<7> [323.410058] xe 0000:03:00.0: [drm:xe_gt_apply_ccs_mode [xe]] Tile0: GT0: CCS_MODE=fff0fc0 config:00400000, num_engines:1, num_slices:2
<6> [323.410689] xe 0000:03:00.0: [drm] Tile0: GT0: reset done
<5> [323.410772] xe 0000:03:00.0: [drm] Tile0: GT0: Timedout job: seqno=93022, lrc_seqno=93022, guc_id=0, flags=0x73 in no process [-1]
<7> [323.410781] xe 0000:03:00.0: [drm:xe_devcoredump [xe]] Multiple hangs are occurring, but only the first snapshot was taken
<4> [323.411164] ------------[ cut here ]------------
<4> [323.411167] xe 0000:03:00.0: [drm] Tile0: GT0: Kernel-submitted job timed out
<4> [323.411170] WARNING: drivers/gpu/drm/xe/xe_guc_submit.c:1612 at guc_exec_queue_timedout_job+0x13f5/0x23e0 [xe], CPU#6: kworker/u64:25/4915
<4> [323.411501] Modules linked in: pmt_crashlog snd_hda_codec_intelhdmi snd_hda_codec_hdmi mei_lb mei_gsc_proxy mtd_intel_dg mei_gsc xe drm_gpuvm drm_gpusvm_helper drm_buddy drm_ttm_helper ttm gpu_sched drm_suballoc_helper drm_exec drm_display_helper cec rc_core drm_kunit_helpers i2c_algo_bit kunit intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal cmdlinepart intel_powerclamp hid_generic eeepc_wmi spi_nor coretemp asus_wmi mei_pxp mtd mei_hdcp sparse_keymap platform_profile wmi_bmof kvm_intel snd_hda_intel usbhid snd_intel_dspcfg kvm hid snd_hda_codec irqbypass ghash_clmulni_intel snd_hda_core aesni_intel snd_hwdep video rapl r8169 snd_pcm binfmt_misc intel_cstate i2c_i801 snd_timer realtek i2c_mux spi_intel_pci snd spi_intel soundcore i2c_smbus mei_me idma64 mei intel_pmc_core pmt_telemetry nls_iso8859_1 pmt_discovery pmt_class intel_pmc_ssram_telemetry pinctrl_alderlake intel_vsec wmi acpi_pad acpi_tad dm_multipath msr nvme_fabrics fuse
<4> [323.411593] efi_pstore nfnetlink autofs4
<4> [323.411598] CPU: 6 UID: 0 PID: 4915 Comm: kworker/u64:25 Tainted: G S U W 6.19.0-lgci-xe-xe-4576-cc2c646d39200973c-debug+ #1 PREEMPT(voluntary)
<4> [323.411601] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER, [W]=WARN
<4> [323.411602] Hardware name: ASUS System Product Name/PRIME Z790-P WIFI, BIOS 1645 03/15/2024
<4> [323.411604] Workqueue: gt-ordered-wq drm_sched_job_timedout [gpu_sched]
<4> [323.411610] RIP: 0010:guc_exec_queue_timedout_job+0x13fe/0x23e0 [xe]
<4> [323.411690] Code: 74 04 48 8b 7f 08 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 c5 47 5a e1 48 89 c6 48 8d 3d 8b ba 39 00 41 89 d8 44 89 e1 4c 89 ea <67> 48 0f b9 3a 48 8b 45 90 48 8b 40 60 e9 f5 ee ff ff 8b 70 08 49
<4> [323.411693] RSP: 0018:ffffc9000fc57c98 EFLAGS: 00010246
<4> [323.411696] RAX: ffffffffa11faba1 RBX: 0000000000000000 RCX: 0000000000000000
<4> [323.411698] RDX: ffff888104f22410 RSI: ffffffffa11faba1 RDI: ffffffffa1003d70
<4> [323.411700] RBP: ffffc9000fc57da8 R08: 0000000000000000 R09: 0000000000000000
<4> [323.411702] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [323.411704] R13: ffff888104f22410 R14: ffff8881350c5018 R15: 00000000ffffffc2
<4> [323.411706] FS: 0000000000000000(0000) GS:ffff8888dafda000(0000) knlGS:0000000000000000
<4> [323.411708] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [323.411710] CR2: 000075d57f7d5000 CR3: 0000000125ff5001 CR4: 0000000000f72ef0
<4> [323.411712] PKRU: 55555554
<4> [323.411713] Call Trace:
<4> [323.411715] <TASK>
<4> [323.411723] ? drm_sched_job_timedout+0x80/0x1a0 [gpu_sched]
<4> [323.411732] ? lock_release+0xce/0x280
<4> [323.411742] drm_sched_job_timedout+0x94/0x1a0 [gpu_sched]
<4> [323.411757] process_one_work+0x22e/0x6b0
<7> [323.411690] xe 0000:03:00.0: [drm:xe_gt_sriov_pf_config_restart [xe]] PF: Tile0: GT0: pushed 0 skip 24 of 24 VFs configurations
<4> [323.411769] worker_thread+0x1e8/0x3d0
<4> [323.411773] ? __pfx_worker_thread+0x10/0x10
<4> [323.411776] kthread+0x11f/0x250
<4> [323.411781] ? __pfx_kthread+0x10/0x10
<4> [323.411785] ret_from_fork+0x344/0x3a0
<4> [323.411788] ? __pfx_kthread+0x10/0x10
<4> [323.411792] ret_from_fork_asm+0x1a/0x30
<4> [323.411803] </TASK>
<4> [323.411804] irq event stamp: 468845
<4> [323.411806] hardirqs last enabled at (468851): [<ffffffff8149e939>] __up_console_sem+0x79/0xa0
<4> [323.411810] hardirqs last disabled at (468856): [<ffffffff8149e91e>] __up_console_sem+0x5e/0xa0
<4> [323.411813] softirqs last enabled at (466970): [<ffffffff813c968f>] __irq_exit_rcu+0x13f/0x160
<4> [323.411816] softirqs last disabled at (466945): [<ffffffff813c968f>] __irq_exit_rcu+0x13f/0x160
<4> [323.411819] ---[ end trace 0000000000000000 ]---
<7> [323.412040] xe 0000:03:00.0: [drm:pf_worker_restart_func [xe]] PF: Tile0: GT0: restart completed
<7> [323.412266] xe 0000:03:00.0: [drm:user_fence_worker [xe]] mmget_not_zero() failed, ufence wasn't signaled
<7> [323.412377] xe 0000:03:00.0: [drm:user_fence_worker [xe]] mmget_not_zero() failed, ufence wasn't signaled
<7> [323.509542] xe 0000:03:00.0: [drm:drm_client_dev_restore] fbdev: ret=0
<6> [323.510661] [IGT] xe_compute_preempt: exiting, ret=98
<6> [323.526820] Console: switching to colour frame buffer device 240x67
|