************************ crashinfo ************************* /exports/testreports/42104/testresults/sanityn-ldiskfs-DNE-centos7_x86_64-centos7_x86_64-retry1/oleg351-client-timeout-core (3.10.0-7.9-debug) +==========================+ | *** Crashinfo v1.3.7 *** | +==========================+ +++WARNING+++ PARTIAL DUMP with size(vmcore) < 25% size(RAM) KERNEL: /tmp/crash-anaysis.wZDvM/vmlinux [TAINTED] DUMPFILE: /exports/testreports/42104/testresults/sanityn-ldiskfs-DNE-centos7_x86_64-centos7_x86_64-retry1/oleg351-client-timeout-core [PARTIAL DUMP] CPUS: 4 DATE: Thu Apr 18 10:17:16 EDT 2024 UPTIME: 01:48:38 LOAD AVERAGE: 0.18, 0.90, 3.31 TASKS: 141 NODENAME: oleg351-client.virtnet RELEASE: 3.10.0-7.9-debug VERSION: #1 SMP Sat Mar 26 23:28:42 EDT 2022 MACHINE: x86_64 (2399 Mhz) MEMORY: 4 GB PANIC: "" +--------------------------+ >------------------------| Per-cpu Stacks ('bt -a') |------------------------< +--------------------------+ -- CPU#0 -- PID=0 CPU=0 CMD=swapper/0 #-1 native_safe_halt+0xb, 449 bytes of data #0 default_idle+0x1e #1 default_enter_idle+0x45 #2 cpuidle_enter_state+0x40 #3 cpuidle_idle_call+0xd8 #4 arch_cpu_idle+0xe #5 cpu_startup_entry+0x14a #6 rest_init+0x8e #7 start_kernel+0x456 #8 x86_64_start_reservations+0x2a #9 x86_64_start_kernel+0x152 #10 start_cpu+0x5 -- CPU#1 -- PID=0 CPU=1 CMD=swapper/1 #-1 native_safe_halt+0xb, 449 bytes of data #0 default_idle+0x1e #1 default_enter_idle+0x45 #2 cpuidle_enter_state+0x40 #3 cpuidle_idle_call+0xd8 #4 arch_cpu_idle+0xe #5 cpu_startup_entry+0x14a #6 start_secondary+0x1eb #7 start_cpu+0x5 -- CPU#2 -- PID=0 CPU=2 CMD=swapper/2 #-1 native_safe_halt+0xb, 449 bytes of data #0 default_idle+0x1e #1 default_enter_idle+0x45 #2 cpuidle_enter_state+0x40 #3 cpuidle_idle_call+0xd8 #4 arch_cpu_idle+0xe #5 cpu_startup_entry+0x14a #6 start_secondary+0x1eb #7 start_cpu+0x5 -- CPU#3 -- PID=0 CPU=3 CMD=swapper/3 #-1 native_safe_halt+0xb, 449 bytes of data #0 default_idle+0x1e #1 default_enter_idle+0x45 #2 cpuidle_enter_state+0x40 #3 cpuidle_idle_call+0xd8 #4 arch_cpu_idle+0xe #5 cpu_startup_entry+0x14a #6 start_secondary+0x1eb #7 start_cpu+0x5 +--------------------------------+ >---------------------| How This Dump Has Been Created |---------------------< +--------------------------------+ Cannot identify the specific condition that triggered vmcore +---------------+ >------------------------------| Tasks Summary |------------------------------< +---------------+ Number of Threads That Ran Recently ----------------------------------- last second 26 last 5s 34 last 60s 53 ----- Total Numbers of Threads per State ------ TASK_INTERRUPTIBLE 137 TASK_RUNNING 1 +++WARNING+++ There are 3 threads running in their own namespaces Use 'taskinfo --ns' to get more details +-----------------------+ >--------------------------| 5 Most Recent Threads |--------------------------< +-----------------------+ PID CMD Age ARGS ----- -------------- ------ ---------------------------- 9 rcu_sched 0 ms (no user stack) 11143 kworker/2:2 0 ms (no user stack) 11 rcuos/0 0 ms (no user stack) 11299 pdsh 18 ms pdsh -S -Rssh -w oleg351-server (PATH=$PATH:/home/green/git/lustre-release/lustre/utils:/home/green/git/lustre-release/lustre/tests:/sbin:/usr/sbin; cd /root; LUSTRE="/home/green/git/lustre-release/lustre" bash -c "umount -d -f /mnt/lustre-mds2") 11297 pdsh 18 ms pdsh -S -Rssh -w oleg351-server (PATH=$PATH:/home/green/git/lustre-release/lustre/utils:/home/green/git/lustre-release/lustre/tests:/sbin:/usr/sbin; cd /root; LUSTRE="/home/green/git/lustre-release/lustre" bash -c "umount -d -f /mnt/lustre-mds2") +------------------------+ >-------------------------| Memory Usage (kmem -i) |-------------------------< +------------------------+ PAGES TOTAL PERCENTAGE TOTAL MEM 955079 3.6 GB ---- FREE 873637 3.3 GB 91% of TOTAL MEM USED 81442 318.1 MB 8% of TOTAL MEM SHARED 7205 28.1 MB 0% of TOTAL MEM BUFFERS 2754 10.8 MB 0% of TOTAL MEM CACHED 33241 129.8 MB 3% of TOTAL MEM SLAB 11718 45.8 MB 1% of TOTAL MEM TOTAL HUGE 0 0 ---- HUGE FREE 0 0 0% of TOTAL HUGE TOTAL SWAP 262143 1024 MB ---- SWAP USED 0 0 0% of TOTAL SWAP SWAP FREE 262143 1024 MB 100% of TOTAL SWAP COMMIT LIMIT 739682 2.8 GB ---- COMMITTED 66337 259.1 MB 8% of TOTAL LIMIT +-------------------------------+ >----------------------| Scheduler Runqueues (per CPU) |----------------------< +-------------------------------+ ---+ CPU=0 ---- | CURRENT TASK , CMD=swapper/0 ---+ CPU=1 ---- | CURRENT TASK , CMD=swapper/1 ---+ CPU=2 ---- | CURRENT TASK , CMD=swapper/2 ---+ CPU=3 ---- | CURRENT TASK , CMD=swapper/3 +------------------------+ >-------------------------| Network Status Summary |-------------------------< +------------------------+ TCP Connection Info ------------------- ESTABLISHED 7 TIME_WAIT 10 LISTEN 3 NAGLE disabled (TCP_NODELAY): 6 user_data set (NFS etc.): 4 UDP Connection Info ------------------- 2 UDP sockets, 0 in ESTABLISHED Unix Connection Info ------------------------ ESTABLISHED 30 CLOSE 18 LISTEN 8 Raw sockets info -------------------- ESTABLISHED 1 Interfaces Info --------------- How long ago (in seconds) interfaces transmitted/received? Name RX TX ---- ---------- --------- lo n/a 6516.6 eth0 n/a 0.0 RSS_TOTAL=70488 pages, %mem= 1.1 +------------+ >-------------------------------| Mounted FS |-------------------------------< +------------+ MOUNT SUPERBLK TYPE DEVNAME DIRNAME ffff880138cca000 ffff880139940800 rootfs rootfs / ffff880137668380 ffff8800b6cf1800 sysfs sysfs /sys ffff880137668540 ffff880139944000 proc proc /proc ffff880137668700 ffff880137678000 devtmpfs devtmpfs /dev ffff8801376688c0 ffff88012b3eb800 securityfs securityfs /sys/kernel/security ffff880137668a80 ffff8800b6cf2000 tmpfs tmpfs /dev/shm ffff880137668c40 ffff88012b2b0800 devpts devpts /dev/pts ffff880137668e00 ffff8800b6cf2800 tmpfs tmpfs /run ffff880137668fc0 ffff8800b6cf3000 tmpfs tmpfs /sys/fs/cgroup ffff880137669180 ffff8800b6cf3800 cgroup cgroup /sys/fs/cgroup/systemd ffff880137669340 ffff8800b6cf4000 pstore pstore /sys/fs/pstore ffff880137669500 ffff8800b6cf6000 cgroup cgroup /sys/fs/cgroup/devices ffff8801376696c0 ffff8800b6cf5800 cgroup cgroup /sys/fs/cgroup/cpu,cpuacct ffff880137669880 ffff8800b6cf5000 cgroup cgroup /sys/fs/cgroup/freezer ffff880137669a40 ffff8800b6cf4800 cgroup cgroup /sys/fs/cgroup/perf_event ffff880137669c00 ffff8800b6cf6800 cgroup cgroup /sys/fs/cgroup/blkio ffff880137669dc0 ffff8800b6cf7000 cgroup cgroup /sys/fs/cgroup/hugetlb ffff88012aa4a000 ffff8800b6cf7800 cgroup cgroup /sys/fs/cgroup/memory ffff88012aa4a1c0 ffff88012aae8000 cgroup cgroup /sys/fs/cgroup/pids ffff88012aa4a380 ffff88012aae8800 cgroup cgroup /sys/fs/cgroup/net_cls,net_prio ffff88012aa4a540 ffff88012aae9000 cgroup cgroup /sys/fs/cgroup/cpuset ffff8801377f7180 ffff88012b3ec800 configfs configfs /sys/kernel/config ffff8801377f7340 ffff8800b60af000 ext4 /dev/nbd0 / ffff8800b6d10540 ffff880137304800 rpc_pipefs rpc_pipefs /var/lib/nfs/rpc_pipefs ffff880138ccbdc0 ffff88012a65c000 autofs systemd-1 /proc/sys/fs/binfmt_misc ffff8800b6128000 ffff88012a65d000 hugetlbfs hugetlbfs /dev/hugepages ffff8800b6d10c40 ffff880139947800 debugfs debugfs /sys/kernel/debug ffff8800b6d10e00 ffff88012b2b2000 mqueue mqueue /dev/mqueue ffff88012aa4a700 ffff88012aa41000 binfmt_misc binfmt_misc /proc/sys/fs/binfmt_misc/ ffff8800b6d11180 ffff8800b60b7000 ramfs none /mnt ffff8800b61281c0 ffff88012b3ed000 tmpfs none /var/lib/stateless/writable ffff88012aa4a8c0 ffff8800b5c4b800 squashfs /dev/vda /home/green/git/lustre-release ffff88012aa4aa80 ffff88012b3ed000 tmpfs none /var/cache/man ffff8801377f7500 ffff88012b3ed000 tmpfs none /var/log ffff8800b6d11340 ffff88012b3ed000 tmpfs none /var/lib/dbus ffff8800b6d11500 ffff88012b3ed000 tmpfs none /tmp ffff88012aa4ac40 ffff88012b3ed000 tmpfs none /var/lib/dhclient ffff8800b6d116c0 ffff88012b3ed000 tmpfs none /var/tmp ffff8800b6d11880 ffff88012b3ed000 tmpfs none /var/lib/NetworkManager ffff8800b6d11a40 ffff88012b3ed000 tmpfs none /var/lib/systemd/random-seed ffff8800b6d11c00 ffff88012b3ed000 tmpfs none /var/spool ffff88012aa4ae00 ffff88012b3ed000 tmpfs none /var/lib/nfs ffff8800b6d11dc0 ffff88012b3ed000 tmpfs none /var/lib/gssproxy ffff8800b6d10a80 ffff88012b3ed000 tmpfs none /var/lib/logrotate ffff8800b6d108c0 ffff88012b3ed000 tmpfs none /etc ffff8800b6128380 ffff88012b3ed000 tmpfs none /var/lib/rsyslog ffff88012aa4afc0 ffff88012b3ed000 tmpfs none /var/lib/dhclient/var/lib/dhclient ffff8800b6d10700 ffff88012aaef800 nfs4 192.168.200.253:/exports/state/oleg351-client.virtnet /var/lib/stateless/state ffff8800b6d10000 ffff88012aaef800 nfs4 192.168.200.253:/exports/state/oleg351-client.virtnet /boot ffff88012aa4b180 ffff88012aaef800 nfs4 192.168.200.253:/exports/state/oleg351-client.virtnet /etc/etc/kdump.conf ffff8800b5fa4000 ffff880137304800 rpc_pipefs sunrpc /var/lib/nfs/var/lib/nfs/rpc_pipefs ffff88012ec1a8c0 ffff88012aaea800 nfs4 192.168.200.253://exports/testreports/42104/testresults/sanityn-ldiskfs-DNE-centos7_x86_64-centos7_x86_64-retry1 /tmp/tmp/testlogs ffff88012ec1aa80 ffff88012a613800 tmpfs tmpfs /run/user/0 ffff88012ec1b180 ffff8800b5c4b800 squashfs /dev/vda /usr/sbin/mount.lustre +-------------------------------+ >----------------------| Last 40 lines of dmesg buffer |----------------------< +-------------------------------+ [ 6275.804264] Lustre: Echo OBD driver; http://www.lustre.org/ [ 6279.092681] Lustre: DEBUG MARKER: Iteration 50 [ 6279.203404] LustreError: 4017:0:(llite_lib.c:1369:ll_fill_super()) cfs_race id 1417 sleeping [ 6279.203468] LustreError: 4018:0:(llite_lib.c:1369:ll_fill_super()) cfs_fail_race id 1417 waking [ 6279.212958] LustreError: 4017:0:(llite_lib.c:1369:ll_fill_super()) cfs_fail_race id 1417 awake: rc=5000 [ 6280.265988] Lustre: Mounted lustre-client [ 6280.267741] Lustre: Skipped 1 previous similar message [ 6280.479956] Lustre: Unmounted lustre-client [ 6281.227836] Key type lgssc unregistered [ 6281.313606] LNet: 4317:0:(lib-ptl.c:966:lnet_clear_lazy_portal()) Active lazy portal 0 on exit [ 6282.317505] LNet: Removed LNI 192.168.203.51@tcp [ 6282.729525] alg: No test for adler32 (adler32-zlib) [ 6283.481055] libcfs: HW NUMA nodes: 1, HW CPU cores: 4, npartitions: 2 [ 6283.585697] Lustre: Lustre: Build Version: 2.15.62_23_g6990a41 [ 6283.657570] LNet: Added LNI 192.168.203.51@tcp [8/256/0/180] [ 6283.659490] LNet: Accept secure, port 988 [ 6285.212077] Key type lgssc registered [ 6285.596189] Lustre: Echo OBD driver; http://www.lustre.org/ [ 6289.402393] Lustre: Mounted lustre-client [ 6294.391404] Lustre: DEBUG MARKER: == sanityn test 110: do not grant another lock on resend ========================================================== 10:13:30 (1713449610) [ 6310.845047] Lustre: 5738:0:(client.c:2340:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713449611/real 1713449611] req@ffff88012d5daa00 x1796682127775232/t0(0) o36->lustre-MDT0000-mdc-ffff88012b3ee000@192.168.203.151@tcp:12/10 lens 496/440 e 0 to 1 dl 1713449627 ref 2 fl Rpc:XQr/200/ffffffff rc 0/-1 job:'ln.0' uid:0 gid:0 [ 6310.858319] Lustre: lustre-MDT0000-mdc-ffff88012b3ee000: Connection to lustre-MDT0000 (at 192.168.203.151@tcp) was lost; in progress operations using this service will wait for recovery to complete [ 6310.875927] Lustre: lustre-MDT0000-mdc-ffff88012b3ee000: Connection restored to (at 192.168.203.151@tcp) [ 6326.880045] Lustre: 5738:0:(client.c:2340:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713449627/real 1713449627] req@ffff88012d5daa00 x1796682127775232/t0(0) o36->lustre-MDT0000-mdc-ffff88012b3ee000@192.168.203.151@tcp:12/10 lens 496/440 e 0 to 1 dl 1713449643 ref 2 fl Rpc:XQr/202/ffffffff rc 0/-1 job:'ln.0' uid:0 gid:0 [ 6326.893347] Lustre: lustre-MDT0000-mdc-ffff88012b3ee000: Connection to lustre-MDT0000 (at 192.168.203.151@tcp) was lost; in progress operations using this service will wait for recovery to complete [ 6326.909043] Lustre: lustre-MDT0000-mdc-ffff88012b3ee000: Connection restored to (at 192.168.203.151@tcp) [ 6342.913024] Lustre: 5738:0:(client.c:2340:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713449643/real 1713449643] req@ffff88012d5daa00 x1796682127775232/t0(0) o36->lustre-MDT0000-mdc-ffff88012b3ee000@192.168.203.151@tcp:12/10 lens 496/440 e 0 to 1 dl 1713449659 ref 2 fl Rpc:XQr/202/ffffffff rc 0/-1 job:'ln.0' uid:0 gid:0 [ 6342.926314] Lustre: lustre-MDT0000-mdc-ffff88012b3ee000: Connection to lustre-MDT0000 (at 192.168.203.151@tcp) was lost; in progress operations using this service will wait for recovery to complete [ 6342.941674] Lustre: lustre-MDT0000-mdc-ffff88012b3ee000: Connection restored to (at 192.168.203.151@tcp) [ 6358.946047] Lustre: 5738:0:(client.c:2340:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713449659/real 1713449659] req@ffff88012d5daa00 x1796682127775232/t0(0) o36->lustre-MDT0000-mdc-ffff88012b3ee000@192.168.203.151@tcp:12/10 lens 496/440 e 0 to 1 dl 1713449675 ref 2 fl Rpc:XQr/202/ffffffff rc 0/-1 job:'ln.0' uid:0 gid:0 [ 6358.965866] Lustre: lustre-MDT0000-mdc-ffff88012b3ee000: Connection to lustre-MDT0000 (at 192.168.203.151@tcp) was lost; in progress operations using this service will wait for recovery to complete [ 6358.984524] Lustre: lustre-MDT0000-mdc-ffff88012b3ee000: Connection restored to (at 192.168.203.151@tcp) [ 6361.761250] Lustre: DEBUG MARKER: == sanityn test 111: A racy rename/link an open file should not cause fs corruption ========================================================== 10:14:38 (1713449678) [ 6369.881618] Lustre: DEBUG MARKER: == sanityn test 112: update max-inherit in default LMV === 10:14:46 (1713449686) [ 6376.279388] Lustre: DEBUG MARKER: == sanityn test 113: check servers of specified fs ======= 10:14:52 (1713449692) [ 6380.867156] Lustre: DEBUG MARKER: == sanityn test 114: implicit default LMV inherit ======== 10:14:57 (1713449697) [ 6390.996009] Lustre: DEBUG MARKER: == sanityn test 115: ldiskfs doesn't check direntry for uniqueness ========================================================== 10:15:07 (1713449707) [ 6407.796715] Lustre: DEBUG MARKER: cleanup: ====================================================== [ 6408.496836] Lustre: DEBUG MARKER: == sanityn test complete, duration 6292 sec ============== 10:15:24 (1713449724) [ 6509.872963] Lustre: Unmounted lustre-client ****************************************************************************** ************************ A Summary Of Problems Found ************************* ****************************************************************************** -------------------- A list of all +++WARNING+++ messages -------------------- PARTIAL DUMP with size(vmcore) < 25% size(RAM) There are 3 threads running in their own namespaces Use 'taskinfo --ns' to get more details ------------------------------------------------------------------------------ ** Execution took 10.22s (real) 5.22s (CPU), Child processes: 4.98s