************************ crashinfo ************************* /exports/testreports/42123/testresults/sanity-pfl-zfs-centos7_x86_64-centos7_x86_64/oleg254-client-timeout-core (3.10.0-7.9-debug) +==========================+ | *** Crashinfo v1.3.7 *** | +==========================+ +++WARNING+++ PARTIAL DUMP with size(vmcore) < 25% size(RAM) KERNEL: /tmp/crash-anaysis.5SSRK/vmlinux [TAINTED] DUMPFILE: /exports/testreports/42123/testresults/sanity-pfl-zfs-centos7_x86_64-centos7_x86_64/oleg254-client-timeout-core [PARTIAL DUMP] CPUS: 4 DATE: Fri Apr 19 03:12:01 EDT 2024 UPTIME: 01:03:02 LOAD AVERAGE: 0.00, 0.01, 0.05 TASKS: 150 NODENAME: oleg254-client.virtnet RELEASE: 3.10.0-7.9-debug VERSION: #1 SMP Sat Mar 26 23:28:42 EDT 2022 MACHINE: x86_64 (2399 Mhz) MEMORY: 4 GB PANIC: "" +--------------------------+ >------------------------| Per-cpu Stacks ('bt -a') |------------------------< +--------------------------+ -- CPU#0 -- PID=0 CPU=0 CMD=swapper/0 #-1 native_safe_halt+0xb, 449 bytes of data #0 default_idle+0x1e #1 default_enter_idle+0x45 #2 cpuidle_enter_state+0x40 #3 cpuidle_idle_call+0xd8 #4 arch_cpu_idle+0xe #5 cpu_startup_entry+0x14a #6 rest_init+0x8e #7 start_kernel+0x456 #8 x86_64_start_reservations+0x2a #9 x86_64_start_kernel+0x152 #10 start_cpu+0x5 -- CPU#1 -- PID=0 CPU=1 CMD=swapper/1 #-1 native_safe_halt+0xb, 449 bytes of data #0 default_idle+0x1e #1 default_enter_idle+0x45 #2 cpuidle_enter_state+0x40 #3 cpuidle_idle_call+0xd8 #4 arch_cpu_idle+0xe #5 cpu_startup_entry+0x14a #6 start_secondary+0x1eb #7 start_cpu+0x5 -- CPU#2 -- PID=0 CPU=2 CMD=swapper/2 #-1 native_safe_halt+0xb, 449 bytes of data #0 default_idle+0x1e #1 default_enter_idle+0x45 #2 cpuidle_enter_state+0x40 #3 cpuidle_idle_call+0xd8 #4 arch_cpu_idle+0xe #5 cpu_startup_entry+0x14a #6 start_secondary+0x1eb #7 start_cpu+0x5 -- CPU#3 -- PID=0 CPU=3 CMD=swapper/3 #-1 native_safe_halt+0xb, 449 bytes of data #0 default_idle+0x1e #1 default_enter_idle+0x45 #2 cpuidle_enter_state+0x40 #3 cpuidle_idle_call+0xd8 #4 arch_cpu_idle+0xe #5 cpu_startup_entry+0x14a #6 start_secondary+0x1eb #7 start_cpu+0x5 +--------------------------------+ >---------------------| How This Dump Has Been Created |---------------------< +--------------------------------+ Cannot identify the specific condition that triggered vmcore +---------------+ >------------------------------| Tasks Summary |------------------------------< +---------------+ Number of Threads That Ran Recently ----------------------------------- last second 14 last 5s 27 last 60s 33 ----- Total Numbers of Threads per State ------ TASK_INTERRUPTIBLE 146 TASK_RUNNING 1 +++WARNING+++ There are 3 threads running in their own namespaces Use 'taskinfo --ns' to get more details +-----------------------+ >--------------------------| 5 Most Recent Threads |--------------------------< +-----------------------+ PID CMD Age ARGS ----- -------------- ------ ---------------------------- 4667 ldlm_bl_01 0 ms (no user stack) 46 kworker/2:1 0 ms (no user stack) 9 rcu_sched 0 ms (no user stack) 50 kworker/0:1 19 ms (no user stack) 1849 lnet_discovery 19 ms (no user stack) +------------------------+ >-------------------------| Memory Usage (kmem -i) |-------------------------< +------------------------+ PAGES TOTAL PERCENTAGE TOTAL MEM 955079 3.6 GB ---- FREE 860777 3.3 GB 90% of TOTAL MEM USED 94302 368.4 MB 9% of TOTAL MEM SHARED 9815 38.3 MB 1% of TOTAL MEM BUFFERS 5221 20.4 MB 0% of TOTAL MEM CACHED 47633 186.1 MB 4% of TOTAL MEM SLAB 10646 41.6 MB 1% of TOTAL MEM TOTAL HUGE 0 0 ---- HUGE FREE 0 0 0% of TOTAL HUGE TOTAL SWAP 262143 1024 MB ---- SWAP USED 0 0 0% of TOTAL SWAP SWAP FREE 262143 1024 MB 100% of TOTAL SWAP COMMIT LIMIT 739682 2.8 GB ---- COMMITTED 61413 239.9 MB 8% of TOTAL LIMIT +-------------------------------+ >----------------------| Scheduler Runqueues (per CPU) |----------------------< +-------------------------------+ ---+ CPU=0 ---- | CURRENT TASK , CMD=swapper/0 ---+ CPU=1 ---- | CURRENT TASK , CMD=swapper/1 ---+ CPU=2 ---- | CURRENT TASK , CMD=swapper/2 ---+ CPU=3 ---- | CURRENT TASK , CMD=swapper/3 +------------------------+ >-------------------------| Network Status Summary |-------------------------< +------------------------+ TCP Connection Info ------------------- ESTABLISHED 6 LISTEN 3 NAGLE disabled (TCP_NODELAY): 5 user_data set (NFS etc.): 4 UDP Connection Info ------------------- 2 UDP sockets, 0 in ESTABLISHED Unix Connection Info ------------------------ ESTABLISHED 26 CLOSE 18 LISTEN 8 Raw sockets info -------------------- ESTABLISHED 1 Interfaces Info --------------- How long ago (in seconds) interfaces transmitted/received? Name RX TX ---- ---------- --------- lo n/a 3772.6 eth0 n/a 0.4 RSS_TOTAL=74664 pages, %mem= 1.3 +------------+ >-------------------------------| Mounted FS |-------------------------------< +------------+ MOUNT SUPERBLK TYPE DEVNAME DIRNAME ffff880138cca000 ffff880139940800 rootfs rootfs / ffff88013709e380 ffff8801370aa000 sysfs sysfs /sys ffff88013709e540 ffff880139944000 proc proc /proc ffff88012aa5a1c0 ffff880137678000 devtmpfs devtmpfs /dev ffff88012aa5a380 ffff8800b7e73800 securityfs securityfs /sys/kernel/security ffff88012aa5a540 ffff8800b7e74000 tmpfs tmpfs /dev/shm ffff88012aa5a700 ffff88012b2a0000 devpts devpts /dev/pts ffff88012aa5a8c0 ffff8800b7e74800 tmpfs tmpfs /run ffff88012aa5aa80 ffff8800b7e75000 tmpfs tmpfs /sys/fs/cgroup ffff88012aa5ac40 ffff8800b7e75800 cgroup cgroup /sys/fs/cgroup/systemd ffff88012aa5ae00 ffff8800b7e76000 pstore pstore /sys/fs/pstore ffff880137668fc0 ffff88012aa1e800 cgroup cgroup /sys/fs/cgroup/memory ffff880137669180 ffff88012aa1f000 cgroup cgroup /sys/fs/cgroup/freezer ffff880137669340 ffff88012aa1f800 cgroup cgroup /sys/fs/cgroup/pids ffff880137669500 ffff88012ab10000 cgroup cgroup /sys/fs/cgroup/hugetlb ffff8801376696c0 ffff88012ab10800 cgroup cgroup /sys/fs/cgroup/blkio ffff880137669880 ffff88012ab11000 cgroup cgroup /sys/fs/cgroup/net_cls,net_prio ffff880137669a40 ffff88012ab11800 cgroup cgroup /sys/fs/cgroup/cpu,cpuacct ffff880137669c00 ffff88012ab12000 cgroup cgroup /sys/fs/cgroup/perf_event ffff880137669dc0 ffff88012ab12800 cgroup cgroup /sys/fs/cgroup/cpuset ffff88012aafc000 ffff88012ab13000 cgroup cgroup /sys/fs/cgroup/devices ffff88012aafc380 ffff88012ab16800 configfs configfs /sys/kernel/config ffff880138ccac40 ffff88012a4ed000 ext4 /dev/nbd0 / ffff88012aafc700 ffff88012aba6800 rpc_pipefs rpc_pipefs /var/lib/nfs/rpc_pipefs ffff88013709ee00 ffff8800b59e9800 autofs systemd-1 /proc/sys/fs/binfmt_misc ffff8800b62b2540 ffff880139947800 debugfs debugfs /sys/kernel/debug ffff88012aafc8c0 ffff88012a4e9800 hugetlbfs hugetlbfs /dev/hugepages ffff88012aafca80 ffff88012b2a1800 mqueue mqueue /dev/mqueue ffff88013709efc0 ffff8800b59ee000 binfmt_misc binfmt_misc /proc/sys/fs/binfmt_misc/ ffff88013709f180 ffff8800b59ef000 ramfs none /mnt ffff88012aafcc40 ffff88012a4e8000 tmpfs none /var/lib/stateless/writable ffff8800b62b28c0 ffff8800b59fe800 squashfs /dev/vda /home/green/git/lustre-release ffff88013709f340 ffff88012a4e8000 tmpfs none /var/cache/man ffff88012aafce00 ffff88012a4e8000 tmpfs none /var/log ffff88012aafcfc0 ffff88012a4e8000 tmpfs none /var/lib/dbus ffff88012aafd180 ffff88012a4e8000 tmpfs none /tmp ffff88012aafd340 ffff88012a4e8000 tmpfs none /var/lib/dhclient ffff880138ccae00 ffff88012a4e8000 tmpfs none /var/tmp ffff88013709f500 ffff88012a4e8000 tmpfs none /var/lib/NetworkManager ffff8800b62b2a80 ffff88012a4e8000 tmpfs none /var/lib/systemd/random-seed ffff88013709f6c0 ffff88012a4e8000 tmpfs none /var/spool ffff88013709f880 ffff88012a4e8000 tmpfs none /var/lib/nfs ffff88013709fa40 ffff88012a4e8000 tmpfs none /var/lib/gssproxy ffff8800b62b2c40 ffff88012a4e8000 tmpfs none /var/lib/logrotate ffff8800b62b2e00 ffff88012a4e8000 tmpfs none /etc ffff88013709fc00 ffff88012a4e8000 tmpfs none /var/lib/rsyslog ffff8800b62b2fc0 ffff88012a4e8000 tmpfs none /var/lib/dhclient/var/lib/dhclient ffff88012aafd500 ffff8800b5891000 nfs4 192.168.200.253:/exports/state/oleg254-client.virtnet /var/lib/stateless/state ffff8800b62b3340 ffff8800b5891000 nfs4 192.168.200.253:/exports/state/oleg254-client.virtnet /boot ffff88012aafd6c0 ffff8800b5891000 nfs4 192.168.200.253:/exports/state/oleg254-client.virtnet /etc/etc/kdump.conf ffff88012aafd880 ffff88012aba6800 rpc_pipefs sunrpc /var/lib/nfs/var/lib/nfs/rpc_pipefs ffff8800b6e96a80 ffff8801370af800 nfs4 192.168.200.253://exports/testreports/42123/testresults/sanity-pfl-zfs-centos7_x86_64-centos7_x86_64 /tmp/tmp/testlogs ffff8800b584ec40 ffff88012ab17000 tmpfs tmpfs /run/user/0 ffff88012aa5b880 ffff8800b59fe800 squashfs /dev/vda /usr/sbin/mount.lustre ffff88012aa5ba40 ffff8800aa0ec000 lustre 192.168.202.154@tcp:/lustre /mnt/lustre +-------------------------------+ >----------------------| Last 40 lines of dmesg buffer |----------------------< +-------------------------------+ [ 61.791097] libcfs: loading out-of-tree module taints kernel. [ 61.793948] libcfs: module verification failed: signature and/or required key missing - tainting kernel [ 61.819535] LNet: HW NUMA nodes: 1, HW CPU cores: 4, npartitions: 2 [ 61.839159] alg: No test for adler32 (adler32-zlib) [ 62.989435] Lustre: Lustre: Build Version: 2.15.4_18_g9f02020 [ 63.283076] LNet: Added LNI 192.168.202.54@tcp [8/256/0/180] [ 63.288027] LNet: Accept secure, port 988 [ 64.874439] Key type lgssc registered [ 65.375918] Lustre: Echo OBD driver; http://www.lustre.org/ [ 133.719022] Lustre: Mounted lustre-client [ 136.632518] Lustre: DEBUG MARKER: Using TIMEOUT=20 [ 145.874474] Lustre: DEBUG MARKER: oleg254-client.virtnet: executing check_logdir /tmp/testlogs/ [ 147.397597] Lustre: DEBUG MARKER: oleg254-client.virtnet: executing yml_node [ 148.895280] Lustre: DEBUG MARKER: Client: 2.15.4.18 [ 149.728133] Lustre: DEBUG MARKER: MDS: 2.15.4.18 [ 150.937111] Lustre: DEBUG MARKER: OSS: 2.15.4.18 [ 151.644766] Lustre: DEBUG MARKER: -----============= acceptance-small: sanity-pfl ============----- Fri Apr 19 02:11:29 EDT 2024 [ 154.491144] Lustre: DEBUG MARKER: excepting tests: 24a [ 157.262103] Lustre: DEBUG MARKER: oleg254-client.virtnet: executing check_config_client /mnt/lustre [ 158.746665] Lustre: lustre-OST0000-osc-ffff8800aa0ec000: disconnect after 23s idle [ 165.608031] Lustre: DEBUG MARKER: Using TIMEOUT=20 [ 170.300389] Lustre: DEBUG MARKER: == sanity-pfl test 0a: Create full components file, no reused OSTs ========================================================== 02:11:47 (1713507107) [ 171.293300] Lustre: DEBUG MARKER: sanity-pfl test_0a: @@@@@@ FAIL: 1 == 1 [ 179.414328] Lustre: DEBUG MARKER: == sanity-pfl test 0b: Verify comp stripe count limits === 02:11:56 (1713507116) [ 196.064567] random: crng init done [ 781.564583] Lustre: 9410:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713507117/real 1713507117] req@ffff88013730e880 x1796742317090688/t0(0) o101->lustre-MDT0000-mdc-ffff8800aa0ec000@192.168.202.154@tcp:12/10 lens 760/66376 e 24 to 1 dl 1713507719 ref 2 fl Rpc:XPQr/0/ffffffff rc 0/-1 job:'lfs.0' [ 781.572544] Lustre: lustre-MDT0000-mdc-ffff8800aa0ec000: Connection to lustre-MDT0000 (at 192.168.202.154@tcp) was lost; in progress operations using this service will wait for recovery to complete [ 781.580246] Lustre: lustre-MDT0000-mdc-ffff8800aa0ec000: Connection restored to (at 192.168.202.154@tcp) [ 1382.582524] Lustre: 9410:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713507719/real 1713507719] req@ffff88013730e880 x1796742317090688/t0(0) o101->lustre-MDT0000-mdc-ffff8800aa0ec000@192.168.202.154@tcp:12/10 lens 760/66376 e 24 to 1 dl 1713508320 ref 2 fl Rpc:XPQr/2/ffffffff rc 0/-1 job:'lfs.0' [ 1382.590822] Lustre: lustre-MDT0000-mdc-ffff8800aa0ec000: Connection to lustre-MDT0000 (at 192.168.202.154@tcp) was lost; in progress operations using this service will wait for recovery to complete [ 1382.601604] Lustre: lustre-MDT0000-mdc-ffff8800aa0ec000: Connection restored to (at 192.168.202.154@tcp) [ 1983.604513] Lustre: 9410:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713508320/real 1713508320] req@ffff88013730e880 x1796742317090688/t0(0) o101->lustre-MDT0000-mdc-ffff8800aa0ec000@192.168.202.154@tcp:12/10 lens 760/66376 e 24 to 1 dl 1713508921 ref 2 fl Rpc:XPQr/2/ffffffff rc 0/-1 job:'lfs.0' [ 1983.613730] Lustre: lustre-MDT0000-mdc-ffff8800aa0ec000: Connection to lustre-MDT0000 (at 192.168.202.154@tcp) was lost; in progress operations using this service will wait for recovery to complete [ 1983.622615] Lustre: lustre-MDT0000-mdc-ffff8800aa0ec000: Connection restored to (at 192.168.202.154@tcp) [ 2584.624487] Lustre: 9410:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713508921/real 1713508921] req@ffff88013730e880 x1796742317090688/t0(0) o101->lustre-MDT0000-mdc-ffff8800aa0ec000@192.168.202.154@tcp:12/10 lens 760/66376 e 24 to 1 dl 1713509522 ref 2 fl Rpc:XPQr/2/ffffffff rc 0/-1 job:'lfs.0' [ 2584.630564] Lustre: lustre-MDT0000-mdc-ffff8800aa0ec000: Connection to lustre-MDT0000 (at 192.168.202.154@tcp) was lost; in progress operations using this service will wait for recovery to complete [ 2584.639856] Lustre: lustre-MDT0000-mdc-ffff8800aa0ec000: Connection restored to (at 192.168.202.154@tcp) [ 3185.642421] Lustre: 9410:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713509522/real 1713509522] req@ffff88013730e880 x1796742317090688/t0(0) o101->lustre-MDT0000-mdc-ffff8800aa0ec000@192.168.202.154@tcp:12/10 lens 760/66376 e 24 to 1 dl 1713510123 ref 2 fl Rpc:XPQr/2/ffffffff rc 0/-1 job:'lfs.0' [ 3185.648652] Lustre: lustre-MDT0000-mdc-ffff8800aa0ec000: Connection to lustre-MDT0000 (at 192.168.202.154@tcp) was lost; in progress operations using this service will wait for recovery to complete [ 3185.655752] Lustre: lustre-MDT0000-mdc-ffff8800aa0ec000: Connection restored to (at 192.168.202.154@tcp) ****************************************************************************** ************************ A Summary Of Problems Found ************************* ****************************************************************************** -------------------- A list of all +++WARNING+++ messages -------------------- PARTIAL DUMP with size(vmcore) < 25% size(RAM) There are 3 threads running in their own namespaces Use 'taskinfo --ns' to get more details ------------------------------------------------------------------------------ ** Execution took 10.61s (real) 5.47s (CPU), Child processes: 5.12s