************************ crashinfo ************************* /exports/testreports/42053/testresults/racer-ldiskfs-DNE-centos7_x86_64-centos7_x86_64/oleg236-client-timeout-core (3.10.0-7.9-debug) +==========================+ | *** Crashinfo v1.3.7 *** | +==========================+ +++WARNING+++ PARTIAL DUMP with size(vmcore) < 25% size(RAM) KERNEL: /tmp/crash-anaysis.ItPjh/vmlinux [TAINTED] DUMPFILE: /exports/testreports/42053/testresults/racer-ldiskfs-DNE-centos7_x86_64-centos7_x86_64/oleg236-client-timeout-core [PARTIAL DUMP] CPUS: 4 DATE: Tue Apr 16 15:51:03 EDT 2024 UPTIME: 00:11:56 LOAD AVERAGE: 50.24, 126.92, 96.67 TASKS: 321 NODENAME: oleg236-client.virtnet RELEASE: 3.10.0-7.9-debug VERSION: #1 SMP Sat Mar 26 23:28:42 EDT 2022 MACHINE: x86_64 (2399 Mhz) MEMORY: 4 GB PANIC: "" +--------------------------+ >------------------------| Per-cpu Stacks ('bt -a') |------------------------< +--------------------------+ -- CPU#0 -- PID=0 CPU=0 CMD=swapper/0 #-1 native_safe_halt+0xb, 449 bytes of data #0 default_idle+0x1e #1 default_enter_idle+0x45 #2 cpuidle_enter_state+0x40 #3 cpuidle_idle_call+0xd8 #4 arch_cpu_idle+0xe #5 cpu_startup_entry+0x14a #6 rest_init+0x8e #7 start_kernel+0x456 #8 x86_64_start_reservations+0x2a #9 x86_64_start_kernel+0x152 #10 start_cpu+0x5 -- CPU#1 -- PID=0 CPU=1 CMD=swapper/1 #-1 native_safe_halt+0xb, 449 bytes of data #0 default_idle+0x1e #1 default_enter_idle+0x45 #2 cpuidle_enter_state+0x40 #3 cpuidle_idle_call+0xd8 #4 arch_cpu_idle+0xe #5 cpu_startup_entry+0x14a #6 start_secondary+0x1eb #7 start_cpu+0x5 -- CPU#2 -- PID=0 CPU=2 CMD=swapper/2 #-1 native_safe_halt+0xb, 449 bytes of data #0 default_idle+0x1e #1 default_enter_idle+0x45 #2 cpuidle_enter_state+0x40 #3 cpuidle_idle_call+0xd8 #4 arch_cpu_idle+0xe #5 cpu_startup_entry+0x14a #6 start_secondary+0x1eb #7 start_cpu+0x5 -- CPU#3 -- PID=0 CPU=3 CMD=swapper/3 #-1 native_safe_halt+0xb, 449 bytes of data #0 default_idle+0x1e #1 default_enter_idle+0x45 #2 cpuidle_enter_state+0x40 #3 cpuidle_idle_call+0xd8 #4 arch_cpu_idle+0xe #5 cpu_startup_entry+0x14a #6 start_secondary+0x1eb #7 start_cpu+0x5 +--------------------------------+ >---------------------| How This Dump Has Been Created |---------------------< +--------------------------------+ Cannot identify the specific condition that triggered vmcore +++WARNING+++ High Load Averages: 50.24, 126.92, 96.67 +---------------+ >------------------------------| Tasks Summary |------------------------------< +---------------+ Number of Threads That Ran Recently ----------------------------------- last second 129 last 5s 135 last 60s 198 ----- Total Numbers of Threads per State ------ TASK_INTERRUPTIBLE 269 TASK_RUNNING 1 TASK_UNINTERRUPTIBLE 48 +++WARNING+++ There are 3 threads running in their own namespaces Use 'taskinfo --ns' to get more details +-----------------------+ >--------------------------| 5 Most Recent Threads |--------------------------< +-----------------------+ PID CMD Age ARGS ----- -------------- ------ ---------------------------- 9 rcu_sched 0 ms (no user stack) 20 rcuos/1 0 ms (no user stack) 27 rcuos/2 0 ms (no user stack) 1955 ptlrpcd_00_00 0 ms (no user stack) 1948 socknal_sd00_01 2 ms (no user stack) +------------------------+ >-------------------------| Memory Usage (kmem -i) |-------------------------< +------------------------+ PAGES TOTAL PERCENTAGE TOTAL MEM 955079 3.6 GB ---- FREE 658229 2.5 GB 68% of TOTAL MEM USED 296850 1.1 GB 31% of TOTAL MEM SHARED 12914 50.4 MB 1% of TOTAL MEM BUFFERS 5147 20.1 MB 0% of TOTAL MEM CACHED 48959 191.2 MB 5% of TOTAL MEM SLAB 176089 687.8 MB 18% of TOTAL MEM TOTAL HUGE 0 0 ---- HUGE FREE 0 0 0% of TOTAL HUGE TOTAL SWAP 262143 1024 MB ---- SWAP USED 0 0 0% of TOTAL SWAP SWAP FREE 262143 1024 MB 100% of TOTAL SWAP COMMIT LIMIT 739682 2.8 GB ---- COMMITTED 90268 352.6 MB 12% of TOTAL LIMIT +-------------------------------+ >----------------------| Scheduler Runqueues (per CPU) |----------------------< +-------------------------------+ ---+ CPU=0 ---- | CURRENT TASK , CMD=swapper/0 ---+ CPU=1 ---- | CURRENT TASK , CMD=swapper/1 ---+ CPU=2 ---- | CURRENT TASK , CMD=swapper/2 ---+ CPU=3 ---- | CURRENT TASK , CMD=swapper/3 +------------------------+ >-------------------------| Network Status Summary |-------------------------< +------------------------+ TCP Connection Info ------------------- ESTABLISHED 6 LISTEN 3 NAGLE disabled (TCP_NODELAY): 5 user_data set (NFS etc.): 4 UDP Connection Info ------------------- 2 UDP sockets, 0 in ESTABLISHED Unix Connection Info ------------------------ ESTABLISHED 26 CLOSE 18 LISTEN 8 Raw sockets info -------------------- ESTABLISHED 1 Interfaces Info --------------- How long ago (in seconds) interfaces transmitted/received? Name RX TX ---- ---------- --------- lo n/a 714.7 eth0 n/a 0.0 RSS_TOTAL=297764 pages, %mem= 2.4 +------------+ >-------------------------------| Mounted FS |-------------------------------< +------------+ MOUNT SUPERBLK TYPE DEVNAME DIRNAME ffff880138cca000 ffff880139940800 rootfs rootfs / ffff880137668380 ffff880137679000 sysfs sysfs /sys ffff880137668540 ffff880139944000 proc proc /proc ffff880137668700 ffff880137678000 devtmpfs devtmpfs /dev ffff8801376688c0 ffff8800b6d00800 securityfs securityfs /sys/kernel/security ffff880137668a80 ffff880137679800 tmpfs tmpfs /dev/shm ffff880137668c40 ffff88013771f000 devpts devpts /dev/pts ffff880137668e00 ffff88013767a000 tmpfs tmpfs /run ffff880137668fc0 ffff88013767a800 tmpfs tmpfs /sys/fs/cgroup ffff880137669180 ffff88013767b000 cgroup cgroup /sys/fs/cgroup/systemd ffff880137669340 ffff88013767b800 pstore pstore /sys/fs/pstore ffff880137669500 ffff88013767d800 cgroup cgroup /sys/fs/cgroup/net_cls,net_prio ffff8801376696c0 ffff88013767d000 cgroup cgroup /sys/fs/cgroup/pids ffff880137669880 ffff88013767c800 cgroup cgroup /sys/fs/cgroup/devices ffff880137669a40 ffff88013767c000 cgroup cgroup /sys/fs/cgroup/memory ffff880137669c00 ffff88013767e000 cgroup cgroup /sys/fs/cgroup/cpu,cpuacct ffff880137669dc0 ffff88013767e800 cgroup cgroup /sys/fs/cgroup/cpuset ffff88012aae6000 ffff88013767f000 cgroup cgroup /sys/fs/cgroup/hugetlb ffff88012aae61c0 ffff88013767f800 cgroup cgroup /sys/fs/cgroup/perf_event ffff88012aae6380 ffff88012ab00000 cgroup cgroup /sys/fs/cgroup/blkio ffff88012aae6540 ffff88012ab00800 cgroup cgroup /sys/fs/cgroup/freezer ffff880138ccb6c0 ffff88012abbd000 configfs configfs /sys/kernel/config ffff880138ccb880 ffff8800b5822000 ext4 /dev/nbd0 / ffff8800b6ee6380 ffff88013731e000 rpc_pipefs rpc_pipefs /var/lib/nfs/rpc_pipefs ffff88012aae76c0 ffff88012ab04000 autofs systemd-1 /proc/sys/fs/binfmt_misc ffff88012aae7880 ffff88012ab01800 hugetlbfs hugetlbfs /dev/hugepages ffff8800b6ee6000 ffff880139947800 debugfs debugfs /sys/kernel/debug ffff88012aae7a40 ffff88012b288800 mqueue mqueue /dev/mqueue ffff8800b6ee6540 ffff88013731c000 binfmt_misc binfmt_misc /proc/sys/fs/binfmt_misc/ ffff8800b6ee6700 ffff8800b64f8800 ramfs none /mnt ffff8801377f6fc0 ffff8800b67d7000 tmpfs none /var/lib/stateless/writable ffff8800b6ee68c0 ffff8800b64f8000 squashfs /dev/vda /home/green/git/lustre-release ffff880138ccb340 ffff8800b67d7000 tmpfs none /var/cache/man ffff8800b6ee6a80 ffff8800b67d7000 tmpfs none /var/log ffff8800b6ee6c40 ffff8800b67d7000 tmpfs none /var/lib/dbus ffff88012aae7c00 ffff8800b67d7000 tmpfs none /tmp ffff88012aae7dc0 ffff8800b67d7000 tmpfs none /var/lib/dhclient ffff880138ccba40 ffff8800b67d7000 tmpfs none /var/tmp ffff880138ccbc00 ffff8800b67d7000 tmpfs none /var/lib/NetworkManager ffff8800b6ee6e00 ffff8800b67d7000 tmpfs none /var/lib/systemd/random-seed ffff8801377f7180 ffff8800b67d7000 tmpfs none /var/spool ffff88013737a000 ffff8800b67d7000 tmpfs none /var/lib/nfs ffff8801377f7340 ffff8800b67d7000 tmpfs none /var/lib/gssproxy ffff8800b6ee6fc0 ffff8800b67d7000 tmpfs none /var/lib/logrotate ffff8800b6ee7180 ffff8800b67d7000 tmpfs none /etc ffff8800b6ee7340 ffff8800b67d7000 tmpfs none /var/lib/rsyslog ffff8800b6ee7500 ffff8800b67d7000 tmpfs none /var/lib/dhclient/var/lib/dhclient ffff8800b6ee76c0 ffff8800b64fa800 nfs4 192.168.200.253:/exports/state/oleg236-client.virtnet /var/lib/stateless/state ffff8800b6ee7c00 ffff8800b64fa800 nfs4 192.168.200.253:/exports/state/oleg236-client.virtnet /boot ffff8800b6ee7dc0 ffff8800b64fa800 nfs4 192.168.200.253:/exports/state/oleg236-client.virtnet /etc/etc/kdump.conf ffff88013737a1c0 ffff88013731e000 rpc_pipefs sunrpc /var/lib/nfs/var/lib/nfs/rpc_pipefs ffff8800b00a3180 ffff8800b5802000 nfs4 192.168.200.253://exports/testreports/42053/testresults/racer-ldiskfs-DNE-centos7_x86_64-centos7_x86_64 /tmp/tmp/testlogs ffff88012aae6a80 ffff8800b6519800 tmpfs tmpfs /run/user/0 ffff8800b00a3880 ffff8800b64f8000 squashfs /dev/vda /usr/sbin/mount.lustre ffff8800b01b4380 ffff88012bf70800 lustre 192.168.202.136@tcp:/lustre /mnt/lustre ffff8800b01b4700 ffff88013681e800 lustre 192.168.202.136@tcp:/lustre /mnt/lustre2 +-------------------------------+ >----------------------| Last 40 lines of dmesg buffer |----------------------< +-------------------------------+ [ 366.161750] LustreError: 32491:0:(lustre_lmv.h:178:lmv_stripe_object_dump()) dump LMV: refs 215092432 magic=0x1 count=1 index=1 hash=crush:0x2000003 max_inherit=0 max_inherit_rr=0 version=2 migrate_offset=0 migrate_hash=invalid:0 pool= [ 366.167567] LustreError: 32491:0:(lustre_lmv.h:185:lmv_stripe_object_dump()) stripe[0] [0x240000400:0x1a7:0x0] [ 366.170219] LustreError: 32491:0:(lustre_lmv.h:185:lmv_stripe_object_dump()) Skipped 2 previous similar messages [ 366.173269] LustreError: 32491:0:(lustre_lmv.h:178:lmv_stripe_object_dump()) dump LMV: refs 215092432 magic=0x1 count=2 index=1 hash=crush:0x82000003 max_inherit=0 max_inherit_rr=0 version=1 migrate_offset=1 migrate_hash=fnv_1a_64:2 pool= [ 370.118748] LustreError: 4238:0:(file.c:264:ll_close_inode_openhandle()) lustre-clilmv-ffff88012bf70800: inode [0x200000402:0x6104:0x0] mdc close failed: rc = -116 [ 370.124801] LustreError: 4238:0:(file.c:264:ll_close_inode_openhandle()) Skipped 194 previous similar messages [ 371.754891] LustreError: 5505:0:(lcommon_cl.c:196:cl_file_inode_init()) lustre: failed to initialize cl_object [0x240000403:0x480b:0x0]: rc = -5 [ 371.761148] LustreError: 5505:0:(lcommon_cl.c:196:cl_file_inode_init()) Skipped 886 previous similar messages [ 371.763804] LustreError: 5505:0:(llite_lib.c:3691:ll_prep_inode()) new_inode -fatal: rc -5 [ 371.766160] LustreError: 5505:0:(llite_lib.c:3691:ll_prep_inode()) Skipped 886 previous similar messages [ 374.577620] LustreError: 6760:0:(llite_lib.c:1868:ll_update_lsm_md()) lustre: [0x240000403:0x53da:0x0] dir layout mismatch: [ 374.581701] LustreError: 6760:0:(lustre_lmv.h:178:lmv_stripe_object_dump()) dump LMV: refs 215092432 magic=0x1 count=1 index=1 hash=crush:0x2000003 max_inherit=0 max_inherit_rr=0 version=2 migrate_offset=0 migrate_hash=invalid:0 pool= [ 374.588005] LustreError: 6760:0:(lustre_lmv.h:185:lmv_stripe_object_dump()) stripe[0] [0x240000400:0x1b5:0x0] [ 374.590222] LustreError: 6760:0:(lustre_lmv.h:185:lmv_stripe_object_dump()) Skipped 2 previous similar messages [ 374.592491] LustreError: 6760:0:(lustre_lmv.h:178:lmv_stripe_object_dump()) dump LMV: refs 215092432 magic=0x1 count=2 index=1 hash=crush:0x82000003 max_inherit=0 max_inherit_rr=0 version=1 migrate_offset=1 migrate_hash=fnv_1a_64:2 pool= [ 389.504813] Lustre: dir [0x240000403:0x5714:0x0] stripe 2 readdir failed: -2, directory is partially accessed! [ 389.509575] Lustre: Skipped 161 previous similar messages [ 398.905045] LustreError: 26632:0:(llite_lib.c:1868:ll_update_lsm_md()) lustre: [0x200000403:0x73cc:0x0] dir layout mismatch: [ 398.911348] LustreError: 26632:0:(lustre_lmv.h:178:lmv_stripe_object_dump()) dump LMV: refs 215092432 magic=0x4 count=1 index=0 hash=crush:0x2000003 max_inherit=0 max_inherit_rr=0 version=2 migrate_offset=0 migrate_hash=invalid:0 pool= [ 398.921594] LustreError: 26632:0:(lustre_lmv.h:185:lmv_stripe_object_dump()) stripe[0] [0x200000400:0x1e5:0x0] [ 398.924682] LustreError: 26632:0:(lustre_lmv.h:185:lmv_stripe_object_dump()) Skipped 2 previous similar messages [ 398.927423] LustreError: 26632:0:(lustre_lmv.h:178:lmv_stripe_object_dump()) dump LMV: refs 215092432 magic=0x1 count=3 index=0 hash=crush:0x82000003 max_inherit=0 max_inherit_rr=0 version=1 migrate_offset=1 migrate_hash=crush:2000003 pool= [ 402.681995] 16[31113]: segfault at 8 ip 00007fbacb9127e8 sp 00007ffe6849e700 error 4 in ld-2.17.so[7fbacb907000+22000] [ 410.732446] 12[5233]: segfault at 0 ip 0000000000401870 sp 00007ffe1452d428 error 6 in 12[400000+6000] [ 456.869832] LustreError: 11-0: lustre-MDT0001-mdc-ffff88012bf70800: operation ldlm_enqueue to node 192.168.202.136@tcp failed: rc = -107 [ 456.870487] Lustre: lustre-MDT0001-mdc-ffff88012bf70800: Connection to lustre-MDT0001 (at 192.168.202.136@tcp) was lost; in progress operations using this service will wait for recovery to complete [ 456.871878] LustreError: 167-0: lustre-MDT0001-mdc-ffff88012bf70800: This client was evicted by lustre-MDT0001; in progress operations using this service will fail. [ 456.888445] LustreError: Skipped 1 previous similar message [ 456.893177] LustreError: 25906:0:(file.c:5550:ll_inode_revalidate_fini()) lustre: revalidate FID [0x240000402:0x50d8:0x0] error: rc = -108 [ 456.961584] Lustre: lustre-MDT0001-mdc-ffff88012bf70800: Connection restored to (at 192.168.202.136@tcp) [ 493.633377] Lustre: DEBUG MARKER: == racer test 2: racer rename: oleg236-client.virtnet DURATION=300 ========================================================== 15:47:19 (1713296839) [ 495.070620] LustreError: 10674:0:(lov_object.c:1360:lov_layout_change()) lustre-clilov-ffff88013681e800: cannot apply new layout on [0x240000402:0x5b7c:0x0] : rc = -5 [ 495.075531] LustreError: 10674:0:(lov_object.c:1360:lov_layout_change()) Skipped 112 previous similar messages [ 495.416967] LustreError: 49:0:(statahead.c:792:ll_statahead_interpret_work()) lustre: getattr callback for 5 [0x240000403:0x5875:0x0]: rc = -5 [ 495.421647] LustreError: 49:0:(statahead.c:792:ll_statahead_interpret_work()) Skipped 89 previous similar messages [ 551.853609] Lustre: 1508:0:(client.c:2340:ptlrpc_expire_one_request()) @@@ Request sent has failed due to network error: [sent 1713296897/real 1713296897] req@ffff8800a645a300 x1796521597841664/t0(0) o101->lustre-MDT0001-mdc-ffff88013681e800@192.168.202.136@tcp:12/10 lens 4616/70288 e 0 to 1 dl 1713297045 ref 2 fl Rpc:ReXPQU/200/ffffffff rc 0/-1 job:'rm.0' uid:0 gid:0 [ 551.865632] Lustre: lustre-MDT0001-mdc-ffff88013681e800: Connection to lustre-MDT0001 (at 192.168.202.136@tcp) was lost; in progress operations using this service will wait for recovery to complete [ 551.890617] Lustre: lustre-MDT0001-mdc-ffff88013681e800: Connection restored to (at 192.168.202.136@tcp) [ 677.362621] Lustre: dir [0x240000403:0x72f8:0x0] stripe 1 readdir failed: -2, directory is partially accessed! [ 677.364966] Lustre: Skipped 13 previous similar messages ****************************************************************************** ************************ A Summary Of Problems Found ************************* ****************************************************************************** -------------------- A list of all +++WARNING+++ messages -------------------- PARTIAL DUMP with size(vmcore) < 25% size(RAM) High Load Averages: 50.24, 126.92, 96.67 There are 3 threads running in their own namespaces Use 'taskinfo --ns' to get more details ------------------------------------------------------------------------------ ** Execution took 11.04s (real) 5.90s (CPU), Child processes: 5.10s