Result: 4 Warning(s)
| Detail | Value |
|---|---|
| Duration | unknown |
| Hostname | shard-bmg-2 |
| Igt-Version | IGT-Version: 2.4-ga3dfe1836 (x86_64) (Linux: 7.1.0-rc2-lgci-xe-xe-4990-835de80ce9b34b618-debug+ x86_64) |
| Out | Using IGT_SRANDOM=1778048205 for randomisation Opened device: /dev/dri/card0 Starting subtest: partial-atomic-middle-remap-no-cpu-fault Stack trace: #0 ../lib/igt_core.c:2075 __igt_fail_assert() #1 [xe_wait_ufence+0x57] #2 ../tests/intel/xe_exec_system_allocator.c:886 __igt_unique____real_main2384() #3 ../tests/intel/xe_exec_system_allocator.c:2384 main() #4 [__libc_init_first+0x8a] #5 [__libc_start_main+0x8b] #6 [_start+0x25] Subtest partial-atomic-middle-remap-no-cpu-fault: FAIL (5.302s) runner: This test was killed due to a kernel taint (0x40244). This test caused an abort condition: Kernel badly tainted (0x40244, 0x200) (check dmesg for details): TAINT_WARN: WARN_ON has happened. |
| Err | Starting subtest: partial-atomic-middle-remap-no-cpu-fault (xe_exec_system_allocator:3807) xe/xe_ioctl-CRITICAL: Test assertion failure function xe_wait_ufence, file ../lib/xe/xe_ioctl.c:763: (xe_exec_system_allocator:3807) xe/xe_ioctl-CRITICAL: Failed assertion: __xe_wait_ufence(fd, addr, value, exec_queue, &timeout) == 0 (xe_exec_system_allocator:3807) xe/xe_ioctl-CRITICAL: Last errno: 62, Timer expired (xe_exec_system_allocator:3807) xe/xe_ioctl-CRITICAL: error: -62 != 0 Subtest partial-atomic-middle-remap-no-cpu-fault failed. **** DEBUG **** (xe_exec_system_allocator:3807) xe/xe_ioctl-CRITICAL: Test assertion failure function xe_wait_ufence, file ../lib/xe/xe_ioctl.c:763: (xe_exec_system_allocator:3807) xe/xe_ioctl-CRITICAL: Failed assertion: __xe_wait_ufence(fd, addr, value, exec_queue, &timeout) == 0 (xe_exec_system_allocator:3807) xe/xe_ioctl-CRITICAL: Last errno: 62, Timer expired (xe_exec_system_allocator:3807) xe/xe_ioctl-CRITICAL: error: -62 != 0 (xe_exec_system_allocator:3807) igt_core-INFO: Stack trace: (xe_exec_system_allocator:3807) igt_core-INFO: #0 ../lib/igt_core.c:2075 __igt_fail_assert() (xe_exec_system_allocator:3807) igt_core-INFO: #1 [xe_wait_ufence+0x57] (xe_exec_system_allocator:3807) igt_core-INFO: #2 ../tests/intel/xe_exec_system_allocator.c:886 __igt_unique____real_main2384() (xe_exec_system_allocator:3807) igt_core-INFO: #3 ../tests/intel/xe_exec_system_allocator.c:2384 main() (xe_exec_system_allocator:3807) igt_core-INFO: #4 [__libc_init_first+0x8a] (xe_exec_system_allocator:3807) igt_core-INFO: #5 [__libc_start_main+0x8b] (xe_exec_system_allocator:3807) igt_core-INFO: #6 [_start+0x25] **** END **** Subtest partial-atomic-middle-remap-no-cpu-fault: FAIL (5.302s) Received signal SIGQUIT. Stack trace: #0 [fatal_sig_handler+0x17b] #1 [__sigaction+0x50] #2 [__close+0x14] #3 [__igt_unique____real_main2384+0x38f3] #4 [main+0x2d] #5 [__libc_init_first+0x8a] #6 [__libc_start_main+0x8b] #7 [_start+0x25] |
| Dmesg |
<6> [138.665734] Console: switching to colour dummy device 80x25
<6> [138.666020] [IGT] xe_exec_system_allocator: executing
<7> [138.676316] xe 0000:03:00.0: [drm:drm_pagemap_dev_unhold_work [drm_gpusvm_helper]] Releasing reference on provider device and module.
<6> [138.677287] [IGT] xe_exec_system_allocator: starting subtest partial-atomic-middle-remap-no-cpu-fault
<7> [138.683179] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 512; Own pages: 0.
<7> [138.685164] xe 0000:03:00.0: [drm:xe_svm_garbage_collector [xe]] Skipping madvise reset for vma.
<7> [138.686868] xe 0000:03:00.0: [drm:drm_pagemap_migrate_to_devmem [drm_gpusvm_helper]] Total pages 16; Own pages: 0.
<7> [139.479853] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x2e2e2b2c
<7> [139.480015] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x2d2d2d2e
<7> [139.576158] xe 0000:03:00.0: [drm:intel_power_well_enable [xe]] enabling AUX_TC2
<7> [139.680103] xe 0000:03:00.0: [drm:intel_power_well_disable [xe]] disabling AUX_TC2
<7> [139.688153] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [140.689098] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [141.690426] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [142.691104] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [143.693480] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<6> [143.979915] [IGT] xe_exec_system_allocator: finished subtest partial-atomic-middle-remap-no-cpu-fault, FAIL
<7> [144.694492] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [145.695427] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [146.696530] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [147.697594] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [148.697951] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [149.699384] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [150.699972] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [151.700932] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [152.702164] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [153.702528] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [154.480024] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x30302c2d
<7> [154.480189] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x2f2f2f30
<7> [154.704096] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [155.705842] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [156.706541] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [157.707170] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [158.709134] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [159.711218] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [160.712502] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [161.712753] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [162.714001] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [163.714678] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [164.715550] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [165.717194] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [166.717860] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [167.718493] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [168.719927] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [169.481507] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x30312d2e
<7> [169.481673] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x30303030
<7> [169.720366] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [170.721518] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [171.722196] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<6> [171.828390] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [171.828396] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [171.828398] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [171.828400] nvme 0000:05:00.0: [ 0] RxErr (First)
<7> [172.723172] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [173.724434] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<6> [173.831523] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [173.831529] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [173.831531] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [173.831538] nvme 0000:05:00.0: [ 0] RxErr (First)
<7> [174.725374] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [175.728446] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [176.728799] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [177.730216] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<6> [177.836253] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [177.836261] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [177.836264] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [177.836268] nvme 0000:05:00.0: [ 0] RxErr (First)
<7> [178.731194] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [179.732682] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [180.733980] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [181.734647] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [182.735684] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [183.736821] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [184.478282] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x31322e2e
<7> [184.478446] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x31303131
<7> [184.737921] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [185.738677] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [186.739653] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [187.740382] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [188.741835] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [189.742863] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [190.743352] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [191.746800] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [192.747662] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [193.748822] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [194.749667] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [195.750534] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<6> [195.856848] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [195.856854] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [195.856855] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [195.856857] nvme 0000:05:00.0: [ 0] RxErr (First)
<7> [196.751251] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [197.752490] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [198.753157] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [199.476786] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x32322e2f
<7> [199.476975] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x31313131
<7> [199.754427] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<6> [199.861707] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [199.861776] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [199.861780] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [199.861784] nvme 0000:05:00.0: [ 0] RxErr (First)
<7> [200.755189] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [201.755901] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<6> [201.862742] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [201.862751] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [201.862754] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [201.862757] nvme 0000:05:00.0: [ 0] RxErr (First)
<7> [202.756997] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [203.757809] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [204.759363] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [205.759684] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [206.760835] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<6> [206.867939] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [206.867946] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [206.867948] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [206.867951] nvme 0000:05:00.0: [ 0] RxErr (First)
<7> [207.762179] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<6> [207.868791] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [207.868797] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [207.868799] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [207.868801] nvme 0000:05:00.0: [ 0] RxErr (First)
<7> [208.762654] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [209.763704] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [210.765252] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [211.765937] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [212.767239] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<6> [212.874235] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [212.874245] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [212.874249] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [212.874253] nvme 0000:05:00.0: [ 0] RxErr (First)
<7> [213.768570] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [214.484680] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x32322f30
<7> [214.484841] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x31323232
<7> [214.769608] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [215.771027] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [216.771455] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [217.772891] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [218.773967] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [219.775963] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [220.776443] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [221.777579] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [222.778659] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [223.780379] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [224.781516] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [225.782237] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [226.783514] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [227.784638] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [228.785622] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [229.485295] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x33333030
<7> [229.485456] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x32323233
<7> [229.786584] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [230.787528] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [231.788225] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [232.789654] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [233.791004] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [234.792059] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [235.793629] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [236.794444] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [237.795417] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [238.796355] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [239.797158] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [240.798400] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [241.799723] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [242.801369] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [243.801944] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [244.481129] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x33333030
<7> [244.481292] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x32323233
<7> [244.803030] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [245.804116] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [246.805211] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [247.807962] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<3> [248.307933] INFO: task xe_exec_system_:3807 blocked for more than 61 seconds.
<3> [248.315234] Tainted: G S U W N 7.1.0-rc2-lgci-xe-xe-4990-835de80ce9b34b618-debug+ #1
<3> [248.324455] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
<6> [248.332707] task:xe_exec_system_ state:D stack:0 pid:3807 tgid:3807 ppid:2233 task_flags:0x400100 flags:0x00080002
<6> [248.332715] Call Trace:
<6> [248.332717] <TASK>
<6> [248.332722] __schedule+0x5eb/0x1f70
<6> [248.332727] ? lock_acquire+0xc4/0x300
<6> [248.332731] ? schedule+0x10e/0x180
<6> [248.332733] ? lock_release+0xd0/0x2b0
<6> [248.332737] schedule+0x3a/0x180
<6> [248.332739] schedule_preempt_disabled+0x15/0x30
<6> [248.332741] rwsem_down_write_slowpath+0x30d/0x9c0
<6> [248.332744] ? lock_acquire+0xc4/0x300
<6> [248.332750] down_write+0xe5/0xf0
<6> [248.332753] xe_vm_close_and_put+0x70/0x1000 [xe]
<6> [248.332898] ? xa_find+0xe4/0x210
<6> [248.332904] xe_file_close+0x10a/0x1a0 [xe]
<6> [248.332976] drm_file_free+0x23d/0x2d0
<6> [248.332981] drm_close_helper.isra.0+0x6d/0x80
<6> [248.332984] drm_release_noglobal+0x20/0xa0
<6> [248.332987] __fput+0x10a/0x300
<6> [248.332991] fput_close_sync+0x3d/0xa0
<6> [248.332994] __x64_sys_close+0x3e/0x90
<6> [248.332998] x64_sys_call+0x1b7c/0x26e0
<6> [248.333001] do_syscall_64+0x103/0x1410
<6> [248.333004] ? do_syscall_64+0xb8/0x1410
<6> [248.333005] ? exc_page_fault+0xbd/0x2b0
<6> [248.333009] entry_SYSCALL_64_after_hwframe+0x76/0x7e
<6> [248.333012] RIP: 0033:0x72d182116724
<6> [248.333014] RSP: 002b:00007fff331430d8 EFLAGS: 00000202 ORIG_RAX: 0000000000000003
<6> [248.333017] RAX: ffffffffffffffda RBX: 0000000000000005 RCX: 000072d182116724
<6> [248.333019] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000005
<6> [248.333021] RBP: 00007fff33146a40 R08: 0000000000000000 R09: 0000000000000000
<6> [248.333022] R10: 000072d1823a5c50 R11: 0000000000000202 R12: 000000000000e20b
<6> [248.333024] R13: 00007fff33143f00 R14: 00005a2a421beab8 R15: 000072d1825da000
<6> [248.333030] </TASK>
<3> [248.333049] INFO: task xe_exec_system_:3807 <writer> blocked on an rw-semaphore likely owned by task kworker/u65:3:2248 <writer>
<6> [248.344616] task:kworker/u65:3 state:R running task stack:0 pid:2248 tgid:2248 ppid:2 task_flags:0x4208060 flags:0x00080000
<6> [248.344624] Workqueue: xe_page_fault_work_queue xe_pagefault_queue_work [xe]
<6> [248.344725] Call Trace:
<6> [248.344726] <TASK>
<6> [248.344729] ? __pfx_stack_trace_consume_entry+0x10/0x10
<6> [248.344733] ? __xe_svm_handle_pagefault+0x950/0xef0 [xe]
<6> [248.344846] ? kmemleak_alloc+0x4a/0xa0
<6> [248.344852] ? xe_bb_new+0x66/0x120 [xe]
<6> [248.344917] ? _raw_spin_unlock_irqrestore+0x51/0x80
<6> [248.344919] ? dma_fence_default_wait+0x1f6/0x2d0
<6> [248.344923] ? trace_hardirqs_on+0x22/0xf0
<6> [248.344928] ? dma_fence_default_wait+0x7c/0x2d0
<6> [248.344934] ? rmap_walk_anon+0xf2/0x230
<6> [248.344939] ? __drm_pagemap_migrate_to_ram+0x140/0x3b0 [drm_gpusvm_helper]
<6> [248.344945] ? vma_alloc_folio_noprof+0x63/0xe0
<6> [248.344949] ? drm_pagemap_migrate_populate_ram_pfn+0xf8/0x360 [drm_gpusvm_helper]
<6> [248.344955] ? xe_svm_copy_to_ram+0x16/0x30 [xe]
<6> [248.345054] ? __drm_pagemap_migrate_to_ram+0x2d9/0x3b0 [drm_gpusvm_helper]
<6> [248.345063] ? drm_pagemap_migrate_to_ram+0x39/0x60 [drm_gpusvm_helper]
<6> [248.345067] ? rcu_read_unlock+0x26/0x80
<6> [248.345069] ? drm_pagemap_migrate_to_ram+0x39/0x60 [drm_gpusvm_helper]
<6> [248.345072] ? do_swap_page+0x1450/0x17d0
<6> [248.345076] ? __lock_acquire+0x43e/0x2790
<6> [248.345080] ? __pte_offset_map+0x46/0x250
<6> [248.345082] ? __pte_offset_map+0x19c/0x250
<6> [248.345086] ? __handle_mm_fault+0xa0c/0x1000
<6> [248.345094] ? handle_mm_fault+0x12c/0x300
<6> [248.345097] ? hmm_vma_walk_pmd+0x526/0xe30
<6> [248.345102] ? hmm_vma_fault.isra.0+0x67/0xd0
<6> [248.345105] ? hmm_vma_walk_pmd+0x59a/0xe30
<6> [248.345108] ? hmm_vma_walk_pmd+0x526/0xe30
<6> [248.345112] ? walk_pgd_range+0x57f/0xd70
<6> [248.345120] ? __walk_page_range+0x8e/0x290
<6> [248.345124] ? walk_page_range_mm_unsafe+0x19e/0x270
<6> [248.345126] ? lock_acquire+0xc4/0x300
<6> [248.345131] ? walk_page_range+0x2a/0x40
<6> [248.345133] ? hmm_range_fault+0x5b/0xc0
<6> [248.345137] ? drm_gpusvm_range_evict+0x102/0x1b0 [drm_gpusvm_helper]
<6> [248.345143] ? __xe_svm_handle_pagefault+0x950/0xef0 [xe]
<6> [248.345247] ? __lock_acquire+0x43e/0x2790
<6> [248.345253] ? lock_is_held_type+0xa3/0x130
<6> [248.345257] ? lock_acquire+0xc4/0x300
<6> [248.345260] ? xe_pagefault_queue_work+0x148/0x520 [xe]
<6> [248.345352] ? xe_svm_handle_pagefault+0x3d/0xb0 [xe]
<6> [248.345451] ? xe_pagefault_queue_work+0x1a9/0x520 [xe]
<6> [248.345541] ? process_one_work+0x239/0x740
<6> [248.345548] ? worker_thread+0x200/0x3f0
<6> [248.345551] ? __pfx_worker_thread+0x10/0x10
<6> [248.345554] ? kthread+0x10d/0x150
<6> [248.345556] ? __pfx_kthread+0x10/0x10
<6> [248.345559] ? ret_from_fork+0x3bd/0x470
<6> [248.345562] ? __pfx_kthread+0x10/0x10
<6> [248.345565] ? ret_from_fork_asm+0x1a/0x30
<6> [248.345572] </TASK>
<4> [248.345575]
Showing all locks held in the system:
<4> [248.345581] 1 lock held by khungtaskd/117:
<4> [248.345583] #0: ffffffff835c40a0 (rcu_read_lock){....}-{1:2}, at: debug_show_all_locks+0x37/0x220
<4> [248.345597] 1 lock held by in:imklog/849:
<4> [248.345598] #0: ffff888135058130 (&f->f_pos_lock){+.+.}-{3:3}, at: fdget_pos+0x81/0xd0
<4> [248.345605] 3 locks held by kworker/u64:8/986:
<4> [248.345608] 4 locks held by dmesg/2116:
<4> [248.345610] 4 locks held by kworker/u65:3/2248:
<4> [248.345611] #0: ffff8881583b1140 ((wq_completion)xe_page_fault_work_queue){+.+.}-{0:0}, at: process_one_work+0x4c4/0x740
<4> [248.345617] #1: ffffc900034bfe30 ((work_completion)(&pf_queue->worker)){+.+.}-{0:0}, at: process_one_work+0x1f9/0x740
<4> [248.345622] #2: ffff888153691690 (&vm->lock){++++}-{3:3}, at: xe_pagefault_queue_work+0x148/0x520 [xe]
<4> [248.345712] #3: ffff888118ab6578 (&mm->mmap_lock){++++}-{3:3}, at: drm_gpusvm_range_evict+0xf7/0x1b0 [drm_gpusvm_helper]
<4> [248.345734] 2 locks held by xe_exec_system_/3807:
<4> [248.345735] #0: ffffffff8384bff8 (drm_unplug_srcu){.+.+}-{0:0}, at: drm_dev_enter+0x54/0x100
<4> [248.345741] #1: ffff888153691690 (&vm->lock){++++}-{3:3}, at: xe_vm_close_and_put+0x70/0x1000 [xe]
<4> [248.345850]
<4> [248.345852] =============================================
<7> [248.808904] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [249.810084] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [250.811244] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [251.812208] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [252.812758] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [253.813782] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [254.815235] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [255.816386] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [256.817144] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [257.818666] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [258.820031] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [259.475977] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x34343031
<7> [259.476282] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x33333333
<7> [259.820777] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [260.821927] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<6> [260.928689] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [260.928697] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [260.928701] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [260.928704] nvme 0000:05:00.0: [ 0] RxErr (First)
<7> [261.823233] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [262.823999] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [263.825063] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<6> [263.931075] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [263.931081] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [263.931083] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [263.931086] nvme 0000:05:00.0: [ 0] RxErr (First)
<7> [264.825680] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [265.827226] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [266.827977] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [267.829186] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [268.829675] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [269.830599] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [270.831692] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [271.832857] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [272.833888] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [273.834743] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [274.475868] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x34343031
<7> [274.476019] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x33333334
<7> [274.836001] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [275.836910] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [276.837986] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [277.838561] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [278.839894] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [279.840555] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [280.841950] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [281.842563] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [282.844153] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [283.845098] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [284.845909] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [285.847117] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [286.847732] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [287.848570] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<6> [287.955471] pcieport 0000:00:06.0: AER: Multiple Correctable error message received from 0000:05:00.0
<4> [287.955477] nvme 0000:05:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
<4> [287.955479] nvme 0000:05:00.0: device [15b7:5017] error status/mask=00000001/0000e000
<4> [287.955481] nvme 0000:05:00.0: [ 0] RxErr (First)
<7> [288.849807] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [289.475026] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x33343131
<7> [289.475197] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x33333334
<7> [289.850875] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [290.851837] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [291.852966] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [292.853948] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [293.854992] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [294.856044] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [295.856834] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [296.857948] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [297.859153] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [298.860257] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [299.861621] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [300.863037] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [301.863858] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [302.864770] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [303.865924] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [304.477685] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 0 val 0x34353132
<7> [304.477861] xe 0000:03:00.0: [drm:xe_hwmon_read [xe]] thermal data for group 1 val 0x33343434
<7> [304.866805] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [305.868293] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [306.871378] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [307.872446] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [308.873472] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<3> [309.747936] INFO: task xe_exec_system_:3807 blocked for more than 122 seconds.
<3> [309.755167] Tainted: G S U W N 7.1.0-rc2-lgci-xe-xe-4990-835de80ce9b34b618-debug+ #1
<3> [309.764387] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
<6> [309.772212] task:xe_exec_system_ state:D stack:0 pid:3807 tgid:3807 ppid:2233 task_flags:0x400100 flags:0x00080002
<6> [309.772216] Call Trace:
<6> [309.772218] <TASK>
<6> [309.772220] __schedule+0x5eb/0x1f70
<6> [309.772241] ? lock_acquire+0xc4/0x300
<6> [309.772246] ? schedule+0x10e/0x180
<6> [309.772248] ? lock_release+0xd0/0x2b0
<6> [309.772252] schedule+0x3a/0x180
<6> [309.772255] schedule_preempt_disabled+0x15/0x30
<6> [309.772257] rwsem_down_write_slowpath+0x30d/0x9c0
<6> [309.772259] ? lock_acquire+0xc4/0x300
<6> [309.772265] down_write+0xe5/0xf0
<6> [309.772268] xe_vm_close_and_put+0x70/0x1000 [xe]
<6> [309.772387] ? xa_find+0xe4/0x210
<6> [309.772393] xe_file_close+0x10a/0x1a0 [xe]
<6> [309.772458] drm_file_free+0x23d/0x2d0
<6> [309.772462] drm_close_helper.isra.0+0x6d/0x80
<6> [309.772464] drm_release_noglobal+0x20/0xa0
<6> [309.772467] __fput+0x10a/0x300
<6> [309.772471] fput_close_sync+0x3d/0xa0
<6> [309.772473] __x64_sys_close+0x3e/0x90
<6> [309.772476] x64_sys_call+0x1b7c/0x26e0
<6> [309.772479] do_syscall_64+0x103/0x1410
<6> [309.772481] ? do_syscall_64+0xb8/0x1410
<6> [309.772482] ? exc_page_fault+0xbd/0x2b0
<6> [309.772485] entry_SYSCALL_64_after_hwframe+0x76/0x7e
<6> [309.772487] RIP: 0033:0x72d182116724
<6> [309.772489] RSP: 002b:00007fff331430d8 EFLAGS: 00000202 ORIG_RAX: 0000000000000003
<6> [309.772492] RAX: ffffffffffffffda RBX: 0000000000000005 RCX: 000072d182116724
<6> [309.772493] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000005
<6> [309.772494] RBP: 00007fff33146a40 R08: 0000000000000000 R09: 0000000000000000
<6> [309.772495] R10: 000072d1823a5c50 R11: 0000000000000202 R12: 000000000000e20b
<6> [309.772496] R13: 00007fff33143f00 R14: 00005a2a421beab8 R15: 000072d1825da000
<6> [309.772502] </TASK>
<3> [309.772516] INFO: task xe_exec_system_:3807 <writer> blocked on an rw-semaphore likely owned by task kworker/u65:3:2248 <writer>
<6> [309.784070] task:kworker/u65:3 state:R running task stack:0 pid:2248 tgid:2248 ppid:2 task_flags:0x4208060 flags:0x00080000
<6> [309.784091] Workqueue: xe_page_fault_work_queue xe_pagefault_queue_work [xe]
<6> [309.784275] Call Trace:
<6> [309.784277] <TASK>
<6> [309.784282] ? reacquire_held_locks+0xe3/0x210
<6> [309.784291] ? kernel_text_address+0x6e/0x150
<6> [309.784303] ? kmemleak_alloc+0x4a/0xa0
<6> [309.784312] ? __xe_sa_bo_new+0x38/0x50 [xe]
<6> [309.784448] ? _raw_spin_unlock_irqrestore+0x51/0x80
<6> [309.784451] ? dma_fence_default_wait+0x1f6/0x2d0
<6> [309.784456] ? trace_hardirqs_on+0x22/0xf0
<6> [309.784464] ? dma_fence_default_wait+0x1fe/0x2d0
<6> [309.784468] ? dma_fence_default_wait+0x125/0x2d0
<6> [309.784472] ? __pfx_dma_fence_default_wait_cb+0x10/0x10
<6> [309.784479] ? dma_fence_wait_timeout+0x192/0x560
<6> [309.784486] ? xe_svm_copy+0x545/0x9b0 [xe]
<6> [309.784629] ? dma_iova_link+0xf3/0x350
<6> [309.784635] ? dma_iova_try_alloc+0xb0/0x130
<6> [309.784643] ? drm_pagemap_migrate_populate_ram_pfn+0xb1/0x360 [drm_gpusvm_helper]
<6> [309.784654] ? xe_svm_copy_to_ram+0x16/0x30 [xe]
<6> [309.784793] ? __drm_pagemap_migrate_to_ram+0x37a/0x3b0 [drm_gpusvm_helper]
<6> [309.784816] ? drm_pagemap_migrate_to_ram+0x39/0x60 [drm_gpusvm_helper]
<6> [309.784821] ? rcu_read_unlock+0x26/0x80
<6> [309.784825] ? drm_pagemap_migrate_to_ram+0x39/0x60 [drm_gpusvm_helper]
<6> [309.784831] ? do_swap_page+0x1450/0x17d0
<6> [309.784836] ? __lock_acquire+0x43e/0x2790
<6> [309.784839] ? __pte_offset_map+0x46/0x250
<6> [309.784841] ? __pte_offset_map+0x19c/0x250
<6> [309.784845] ? __handle_mm_fault+0xa0c/0x1000
<6> [309.784853] ? handle_mm_fault+0x12c/0x300
<6> [309.784857] ? hmm_vma_fault.isra.0+0x67/0xd0
<6> [309.784861] ? hmm_vma_walk_pmd+0x59a/0xe30
<6> [309.784864] ? hmm_vma_walk_pmd+0x526/0xe30
<6> [309.784868] ? walk_pgd_range+0x57f/0xd70
<6> [309.784879] ? __walk_page_range+0x8e/0x290
<6> [309.784887] ? walk_page_range_mm_unsafe+0x19e/0x270
<6> [309.784890] ? lock_acquire+0xc4/0x300
<6> [309.784900] ? walk_page_range+0x2a/0x40
<6> [309.784903] ? hmm_range_fault+0x5b/0xc0
<6> [309.784909] ? drm_gpusvm_range_evict+0x102/0x1b0 [drm_gpusvm_helper]
<6> [309.784922] ? __xe_svm_handle_pagefault+0x950/0xef0 [xe]
<6> [309.785034] ? __lock_acquire+0x43e/0x2790
<6> [309.785040] ? lock_is_held_type+0xa3/0x130
<6> [309.785045] ? lock_acquire+0xc4/0x300
<6> [309.785047] ? xe_pagefault_queue_work+0x148/0x520 [xe]
<6> [309.785139] ? xe_svm_handle_pagefault+0x3d/0xb0 [xe]
<6> [309.785242] ? xe_pagefault_queue_work+0x1a9/0x520 [xe]
<6> [309.785331] ? process_one_work+0x239/0x740
<6> [309.785338] ? worker_thread+0x200/0x3f0
<6> [309.785341] ? __pfx_worker_thread+0x10/0x10
<6> [309.785343] ? kthread+0x10d/0x150
<6> [309.785346] ? __pfx_kthread+0x10/0x10
<6> [309.785349] ? ret_from_fork+0x3bd/0x470
<6> [309.785352] ? __pfx_kthread+0x10/0x10
<6> [309.785355] ? ret_from_fork_asm+0x1a/0x30
<6> [309.785362] </TASK>
<4> [309.785364]
Showing all locks held in the system:
<4> [309.785370] 1 lock held by khungtaskd/117:
<4> [309.785372] #0: ffffffff835c40a0 (rcu_read_lock){....}-{1:2}, at: debug_show_all_locks+0x37/0x220
<4> [309.785385] 1 lock held by in:imklog/849:
<4> [309.785387] #0: ffff888135058130 (&f->f_pos_lock){+.+.}-{3:3}, at: fdget_pos+0x81/0xd0
<4> [309.785395] 3 locks held by dmesg/2116:
<4> [309.785397] 4 locks held by kworker/u65:3/2248:
<4> [309.785399] #0: ffff8881583b1140 ((wq_completion)xe_page_fault_work_queue){+.+.}-{0:0}, at: process_one_work+0x4c4/0x740
<4> [309.785405] #1: ffffc900034bfe30 ((work_completion)(&pf_queue->worker)){+.+.}-{0:0}, at: process_one_work+0x1f9/0x740
<4> [309.785411] #2: ffff888153691690 (&vm->lock){++++}-{3:3}, at: xe_pagefault_queue_work+0x148/0x520 [xe]
<4> [309.785500] #3: ffff888118ab6578 (&mm->mmap_lock){++++}-{3:3}, at: drm_gpusvm_range_evict+0xf7/0x1b0 [drm_gpusvm_helper]
<4> [309.785516] 2 locks held by xe_exec_system_/3807:
<4> [309.785517] #0: ffffffff8384bff8 (drm_unplug_srcu){.+.+}-{0:0}, at: drm_dev_enter+0x54/0x100
<4> [309.785524] #1: ffff888153691690 (&vm->lock){++++}-{3:3}, at: xe_vm_close_and_put+0x70/0x1000 [xe]
<4> [309.785626]
<4> [309.785627] =============================================
<7> [309.786658] xe 0000:03:00.0: [drm:__xe_svm_handle_pagefault [xe]] Get pages failed, falling back to retrying, asid=366, gpusvm=ffff888153691190, errno=-EOPNOTSUPP
<7> [309.794767] xe 0000:03:00.0: [drm:drm_client_dev_restore] fbdev: ret=0
<6> [309.811441] Console: switching to colour frame buffer device 240x67
<7> [309.836024] xe 0000:03:00.0: [drm:drm_pagemap_dev_unhold_work [drm_gpusvm_helper]] Releasing reference on provider device and module.
|