************************ crashinfo ************************* /exports/testreports/42123/testresults/sanity-pfl-special1-ldiskfs-centos7_x86_64-centos7_x86_64/oleg341-client-timeout-core (3.10.0-7.9-debug) +==========================+ | *** Crashinfo v1.3.7 *** | +==========================+ +++WARNING+++ PARTIAL DUMP with size(vmcore) < 25% size(RAM) KERNEL: /tmp/crash-anaysis.iGrzl/vmlinux [TAINTED] DUMPFILE: /exports/testreports/42123/testresults/sanity-pfl-special1-ldiskfs-centos7_x86_64-centos7_x86_64/oleg341-client-timeout-core [PARTIAL DUMP] CPUS: 4 DATE: Fri Apr 19 03:12:19 EDT 2024 UPTIME: 01:03:21 LOAD AVERAGE: 0.00, 0.01, 0.05 TASKS: 151 NODENAME: oleg341-client.virtnet RELEASE: 3.10.0-7.9-debug VERSION: #1 SMP Sat Mar 26 23:28:42 EDT 2022 MACHINE: x86_64 (2399 Mhz) MEMORY: 4 GB PANIC: "" +--------------------------+ >------------------------| Per-cpu Stacks ('bt -a') |------------------------< +--------------------------+ -- CPU#0 -- PID=0 CPU=0 CMD=swapper/0 #-1 native_safe_halt+0xb, 449 bytes of data #0 default_idle+0x1e #1 default_enter_idle+0x45 #2 cpuidle_enter_state+0x40 #3 cpuidle_idle_call+0xd8 #4 arch_cpu_idle+0xe #5 cpu_startup_entry+0x14a #6 rest_init+0x8e #7 start_kernel+0x456 #8 x86_64_start_reservations+0x2a #9 x86_64_start_kernel+0x152 #10 start_cpu+0x5 -- CPU#1 -- PID=0 CPU=1 CMD=swapper/1 #-1 native_safe_halt+0xb, 449 bytes of data #0 default_idle+0x1e #1 default_enter_idle+0x45 #2 cpuidle_enter_state+0x40 #3 cpuidle_idle_call+0xd8 #4 arch_cpu_idle+0xe #5 cpu_startup_entry+0x14a #6 start_secondary+0x1eb #7 start_cpu+0x5 -- CPU#2 -- PID=0 CPU=2 CMD=swapper/2 #-1 native_safe_halt+0xb, 449 bytes of data #0 default_idle+0x1e #1 default_enter_idle+0x45 #2 cpuidle_enter_state+0x40 #3 cpuidle_idle_call+0xd8 #4 arch_cpu_idle+0xe #5 cpu_startup_entry+0x14a #6 start_secondary+0x1eb #7 start_cpu+0x5 -- CPU#3 -- PID=0 CPU=3 CMD=swapper/3 #-1 native_safe_halt+0xb, 449 bytes of data #0 default_idle+0x1e #1 default_enter_idle+0x45 #2 cpuidle_enter_state+0x40 #3 cpuidle_idle_call+0xd8 #4 arch_cpu_idle+0xe #5 cpu_startup_entry+0x14a #6 start_secondary+0x1eb #7 start_cpu+0x5 +--------------------------------+ >---------------------| How This Dump Has Been Created |---------------------< +--------------------------------+ Cannot identify the specific condition that triggered vmcore +---------------+ >------------------------------| Tasks Summary |------------------------------< +---------------+ Number of Threads That Ran Recently ----------------------------------- last second 11 last 5s 28 last 60s 35 ----- Total Numbers of Threads per State ------ TASK_INTERRUPTIBLE 147 TASK_RUNNING 1 +++WARNING+++ There are 3 threads running in their own namespaces Use 'taskinfo --ns' to get more details +-----------------------+ >--------------------------| 5 Most Recent Threads |--------------------------< +-----------------------+ PID CMD Age ARGS ----- -------------- ------ ---------------------------- 8705 dd 0 ms dd if=/dev/zero of=/mnt/lustre/d1c.sanity-pfl/f1c.sanity-pfl bs=1k count=1 seek=20000 4593 ldlm_bl_01 0 ms (no user stack) 46 kworker/2:1 0 ms (no user stack) 34 rcuos/3 7 ms (no user stack) 9 rcu_sched 7 ms (no user stack) +------------------------+ >-------------------------| Memory Usage (kmem -i) |-------------------------< +------------------------+ PAGES TOTAL PERCENTAGE TOTAL MEM 955079 3.6 GB ---- FREE 862815 3.3 GB 90% of TOTAL MEM USED 92264 360.4 MB 9% of TOTAL MEM SHARED 9763 38.1 MB 1% of TOTAL MEM BUFFERS 5221 20.4 MB 0% of TOTAL MEM CACHED 46144 180.2 MB 4% of TOTAL MEM SLAB 10659 41.6 MB 1% of TOTAL MEM TOTAL HUGE 0 0 ---- HUGE FREE 0 0 0% of TOTAL HUGE TOTAL SWAP 262143 1024 MB ---- SWAP USED 0 0 0% of TOTAL SWAP SWAP FREE 262143 1024 MB 100% of TOTAL SWAP COMMIT LIMIT 739682 2.8 GB ---- COMMITTED 61428 240 MB 8% of TOTAL LIMIT +-------------------------------+ >----------------------| Scheduler Runqueues (per CPU) |----------------------< +-------------------------------+ ---+ CPU=0 ---- | CURRENT TASK , CMD=swapper/0 ---+ CPU=1 ---- | CURRENT TASK , CMD=swapper/1 ---+ CPU=2 ---- | CURRENT TASK , CMD=swapper/2 ---+ CPU=3 ---- | CURRENT TASK , CMD=swapper/3 +------------------------+ >-------------------------| Network Status Summary |-------------------------< +------------------------+ TCP Connection Info ------------------- ESTABLISHED 6 LISTEN 3 NAGLE disabled (TCP_NODELAY): 4 user_data set (NFS etc.): 4 UDP Connection Info ------------------- 2 UDP sockets, 0 in ESTABLISHED Unix Connection Info ------------------------ ESTABLISHED 26 CLOSE 18 LISTEN 8 Raw sockets info -------------------- ESTABLISHED 1 Interfaces Info --------------- How long ago (in seconds) interfaces transmitted/received? Name RX TX ---- ---------- --------- lo n/a 3796.2 eth0 n/a 0.9 RSS_TOTAL=71248 pages, %mem= 1.2 +------------+ >-------------------------------| Mounted FS |-------------------------------< +------------+ MOUNT SUPERBLK TYPE DEVNAME DIRNAME ffff880138cca000 ffff880139940800 rootfs rootfs / ffff8801370141c0 ffff8800b6cf8800 sysfs sysfs /sys ffff880137014380 ffff880139944000 proc proc /proc ffff880137014540 ffff880137678000 devtmpfs devtmpfs /dev ffff880137014700 ffff88012aa1c800 securityfs securityfs /sys/kernel/security ffff8801370148c0 ffff8800b6cf9000 tmpfs tmpfs /dev/shm ffff880137014a80 ffff880137678800 devpts devpts /dev/pts ffff880137014c40 ffff8800b6cf9800 tmpfs tmpfs /run ffff880137014e00 ffff8800b6cfa000 tmpfs tmpfs /sys/fs/cgroup ffff880137014fc0 ffff8800b6cfa800 cgroup cgroup /sys/fs/cgroup/systemd ffff880137015180 ffff8800b6cfb000 pstore pstore /sys/fs/pstore ffff880137015340 ffff8800b6cfd000 cgroup cgroup /sys/fs/cgroup/net_cls,net_prio ffff880137015500 ffff8800b6cfc800 cgroup cgroup /sys/fs/cgroup/devices ffff8801370156c0 ffff8800b6cfc000 cgroup cgroup /sys/fs/cgroup/cpuset ffff880137015880 ffff8800b6cfb800 cgroup cgroup /sys/fs/cgroup/blkio ffff880137015a40 ffff8800b6cfd800 cgroup cgroup /sys/fs/cgroup/memory ffff880137015c00 ffff8800b6cfe000 cgroup cgroup /sys/fs/cgroup/cpu,cpuacct ffff880137015dc0 ffff8800b6cfe800 cgroup cgroup /sys/fs/cgroup/perf_event ffff88012aa66000 ffff8800b6cff000 cgroup cgroup /sys/fs/cgroup/hugetlb ffff88012aa661c0 ffff8800b6cff800 cgroup cgroup /sys/fs/cgroup/pids ffff88012aa66380 ffff88012aae0000 cgroup cgroup /sys/fs/cgroup/freezer ffff880137668e00 ffff880000025800 configfs configfs /sys/kernel/config ffff8801370b6700 ffff8800b6d06000 ext4 /dev/nbd0 / ffff88012aa668c0 ffff88012aae2000 rpc_pipefs rpc_pipefs /var/lib/nfs/rpc_pipefs ffff880137669500 ffff8800b580d000 autofs systemd-1 /proc/sys/fs/binfmt_misc ffff8801370b68c0 ffff8800b5884800 hugetlbfs hugetlbfs /dev/hugepages ffff8801370b6a80 ffff880139947800 debugfs debugfs /sys/kernel/debug ffff8801370b6c40 ffff88013767a000 mqueue mqueue /dev/mqueue ffff88012aa66a80 ffff880000022800 binfmt_misc binfmt_misc /proc/sys/fs/binfmt_misc/ ffff88012aa66e00 ffff880000025000 ramfs none /mnt ffff8801370b6e00 ffff8800b5881000 tmpfs none /var/lib/stateless/writable ffff8801370b6fc0 ffff8800b5880800 squashfs /dev/vda /home/green/git/lustre-release ffff8801370b7180 ffff8800b5881000 tmpfs none /var/cache/man ffff8801370b7340 ffff8800b5881000 tmpfs none /var/log ffff88012aa66fc0 ffff8800b5881000 tmpfs none /var/lib/dbus ffff8801370b7500 ffff8800b5881000 tmpfs none /tmp ffff8801370b76c0 ffff8800b5881000 tmpfs none /var/lib/dhclient ffff88012aa67180 ffff8800b5881000 tmpfs none /var/tmp ffff88012e20e000 ffff8800b5881000 tmpfs none /var/lib/NetworkManager ffff8801376696c0 ffff8800b5881000 tmpfs none /var/lib/systemd/random-seed ffff88012aa67340 ffff8800b5881000 tmpfs none /var/spool ffff88012e20e1c0 ffff8800b5881000 tmpfs none /var/lib/nfs ffff880137669880 ffff8800b5881000 tmpfs none /var/lib/gssproxy ffff88012aa67500 ffff8800b5881000 tmpfs none /var/lib/logrotate ffff88012aa676c0 ffff8800b5881000 tmpfs none /etc ffff88012aa67880 ffff8800b5881000 tmpfs none /var/lib/rsyslog ffff88012aa67a40 ffff8800b5881000 tmpfs none /var/lib/dhclient/var/lib/dhclient ffff88012e20e380 ffff8800b5887000 nfs4 192.168.200.253:/exports/state/oleg341-client.virtnet /var/lib/stateless/state ffff880137669a40 ffff8800b5887000 nfs4 192.168.200.253:/exports/state/oleg341-client.virtnet /boot ffff88012e20e700 ffff8800b5887000 nfs4 192.168.200.253:/exports/state/oleg341-client.virtnet /etc/etc/kdump.conf ffff88012aa67c00 ffff88012aae2000 rpc_pipefs sunrpc /var/lib/nfs/var/lib/nfs/rpc_pipefs ffff8800b00c9340 ffff8800b6df8000 nfs4 192.168.200.253://exports/testreports/42123/testresults/sanity-pfl-special1-ldiskfs-centos7_x86_64-centos7_x86_64 /tmp/tmp/testlogs ffff8800b00ce540 ffff88012a5a0800 tmpfs tmpfs /run/user/0 ffff8800b00cec40 ffff8800b5880800 squashfs /dev/vda /usr/sbin/mount.lustre ffff88012e20e8c0 ffff88012aae7800 lustre 192.168.203.141@tcp:/lustre /mnt/lustre +-------------------------------+ >----------------------| Last 40 lines of dmesg buffer |----------------------< +-------------------------------+ [ 67.140101] libcfs: loading out-of-tree module taints kernel. [ 67.148998] libcfs: module verification failed: signature and/or required key missing - tainting kernel [ 67.173583] LNet: HW NUMA nodes: 1, HW CPU cores: 4, npartitions: 2 [ 67.180703] alg: No test for adler32 (adler32-zlib) [ 68.284426] Lustre: Lustre: Build Version: 2.15.4_18_g9f02020 [ 68.520785] LNet: Added LNI 192.168.203.41@tcp [8/256/0/180] [ 68.524472] LNet: Accept secure, port 988 [ 70.107356] Key type lgssc registered [ 70.781393] Lustre: Echo OBD driver; http://www.lustre.org/ [ 148.487316] Lustre: Mounted lustre-client [ 151.416649] Lustre: DEBUG MARKER: Using TIMEOUT=20 [ 163.957189] Lustre: DEBUG MARKER: oleg341-client.virtnet: executing check_logdir /tmp/testlogs/ [ 165.993660] Lustre: DEBUG MARKER: oleg341-client.virtnet: executing yml_node [ 169.359339] Lustre: DEBUG MARKER: Client: 2.15.4.18 [ 170.892080] Lustre: DEBUG MARKER: MDS: 2.15.4.18 [ 172.260551] Lustre: DEBUG MARKER: OSS: 2.15.4.18 [ 173.054546] Lustre: DEBUG MARKER: -----============= acceptance-small: sanity-pfl ============----- Fri Apr 19 02:11:50 EDT 2024 [ 173.530704] Lustre: lustre-OST0000-osc-ffff88012aae7800: disconnect after 23s idle [ 175.609208] Lustre: DEBUG MARKER: excepting tests: [ 178.626597] Lustre: DEBUG MARKER: oleg341-client.virtnet: executing check_config_client /mnt/lustre [ 188.634173] Lustre: DEBUG MARKER: Using TIMEOUT=20 [ 195.140515] Lustre: DEBUG MARKER: == sanity-pfl test 1c: Test overstriping w/max stripe count ========================================================== 02:12:12 (1713507132) [ 201.078328] random: crng init done [ 218.650945] Lustre: lustre-OST0001-osc-ffff88012aae7800: disconnect after 24s idle [ 218.658937] Lustre: Skipped 1 previous similar message [ 797.904358] Lustre: 8705:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713507133/real 1713507133] req@ffff8800b0045f00 x1796742322331776/t0(0) o101->lustre-MDT0000-mdc-ffff88012aae7800@192.168.203.141@tcp:12/10 lens 376/904 e 24 to 1 dl 1713507735 ref 2 fl Rpc:XQr/0/ffffffff rc 0/-1 job:'dd.0' [ 797.914067] Lustre: lustre-MDT0000-mdc-ffff88012aae7800: Connection to lustre-MDT0000 (at 192.168.203.141@tcp) was lost; in progress operations using this service will wait for recovery to complete [ 797.923616] Lustre: lustre-MDT0000-mdc-ffff88012aae7800: Connection restored to (at 192.168.203.141@tcp) [ 1398.926457] Lustre: 8705:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713507735/real 1713507735] req@ffff8800b0045f00 x1796742322331776/t0(0) o101->lustre-MDT0000-mdc-ffff88012aae7800@192.168.203.141@tcp:12/10 lens 376/904 e 24 to 1 dl 1713508336 ref 2 fl Rpc:XQr/2/ffffffff rc 0/-1 job:'dd.0' [ 1398.940853] Lustre: lustre-MDT0000-mdc-ffff88012aae7800: Connection to lustre-MDT0000 (at 192.168.203.141@tcp) was lost; in progress operations using this service will wait for recovery to complete [ 1398.955826] Lustre: lustre-MDT0000-mdc-ffff88012aae7800: Connection restored to (at 192.168.203.141@tcp) [ 1999.959360] Lustre: 8705:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713508336/real 1713508336] req@ffff8800b0045f00 x1796742322331776/t0(0) o101->lustre-MDT0000-mdc-ffff88012aae7800@192.168.203.141@tcp:12/10 lens 376/904 e 24 to 1 dl 1713508937 ref 2 fl Rpc:XQr/2/ffffffff rc 0/-1 job:'dd.0' [ 1999.965090] Lustre: lustre-MDT0000-mdc-ffff88012aae7800: Connection to lustre-MDT0000 (at 192.168.203.141@tcp) was lost; in progress operations using this service will wait for recovery to complete [ 1999.972459] Lustre: lustre-MDT0000-mdc-ffff88012aae7800: Connection restored to (at 192.168.203.141@tcp) [ 2600.974501] Lustre: 8705:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713508937/real 1713508937] req@ffff8800b0045f00 x1796742322331776/t0(0) o101->lustre-MDT0000-mdc-ffff88012aae7800@192.168.203.141@tcp:12/10 lens 376/904 e 24 to 1 dl 1713509538 ref 2 fl Rpc:XQr/2/ffffffff rc 0/-1 job:'dd.0' [ 2600.987832] Lustre: lustre-MDT0000-mdc-ffff88012aae7800: Connection to lustre-MDT0000 (at 192.168.203.141@tcp) was lost; in progress operations using this service will wait for recovery to complete [ 2601.005103] Lustre: lustre-MDT0000-mdc-ffff88012aae7800: Connection restored to (at 192.168.203.141@tcp) [ 3202.009378] Lustre: 8705:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713509538/real 1713509538] req@ffff8800b0045f00 x1796742322331776/t0(0) o101->lustre-MDT0000-mdc-ffff88012aae7800@192.168.203.141@tcp:12/10 lens 376/904 e 24 to 1 dl 1713510139 ref 2 fl Rpc:XQr/2/ffffffff rc 0/-1 job:'dd.0' [ 3202.025841] Lustre: lustre-MDT0000-mdc-ffff88012aae7800: Connection to lustre-MDT0000 (at 192.168.203.141@tcp) was lost; in progress operations using this service will wait for recovery to complete [ 3202.043503] Lustre: lustre-MDT0000-mdc-ffff88012aae7800: Connection restored to (at 192.168.203.141@tcp) ****************************************************************************** ************************ A Summary Of Problems Found ************************* ****************************************************************************** -------------------- A list of all +++WARNING+++ messages -------------------- PARTIAL DUMP with size(vmcore) < 25% size(RAM) There are 3 threads running in their own namespaces Use 'taskinfo --ns' to get more details ------------------------------------------------------------------------------ ** Execution took 10.75s (real) 5.55s (CPU), Child processes: 5.17s