oleg406-server: Warning: Permanently added 'oleg406-server' (ECDSA) to the list of known hosts.
pdsh@oleg406-client: oleg406-server: ssh exited with exit code 2
pdsh@oleg406-client: oleg406-server: ssh exited with exit code 2
pdsh@oleg406-client: oleg406-server: ssh exited with exit code 2
Started at Thu Apr 18 00:04:58 EDT 2024
Lustre is not mounted, trying to do setup ...
Stopping clients: oleg406-client.virtnet /mnt/lustre (opts:-f)
Stopping clients: oleg406-client.virtnet /mnt/lustre2 (opts:-f)
pdsh@oleg406-client: oleg406-server: ssh exited with exit code 1
pdsh@oleg406-client: oleg406-server: ssh exited with exit code 1
pdsh@oleg406-client: oleg406-server: ssh exited with exit code 1
pdsh@oleg406-client: oleg406-server: ssh exited with exit code 1
pdsh@oleg406-client: oleg406-server: ssh exited with exit code 2
oleg406-server: oleg406-server.virtnet: executing set_hostid
Loading modules from /home/green/git/lustre-release/lustre
detected 4 online CPUs by sysfs
Force libcfs to create 2 CPU partitions
../libcfs/libcfs/libcfs options: 'cpu_npartitions=2'
ptlrpc/ptlrpc options: 'lbug_on_grant_miscount=1'
quota/lquota options: 'hash_lqs_cur_bits=3'
loading modules on: 'oleg406-server'
oleg406-server: oleg406-server.virtnet: executing load_modules_local
oleg406-server: Loading modules from /home/green/git/lustre-release/lustre
oleg406-server: detected 4 online CPUs by sysfs
oleg406-server: Force libcfs to create 2 CPU partitions
oleg406-server: ptlrpc/ptlrpc options: 'lbug_on_grant_miscount=1'
oleg406-server: quota/lquota options: 'hash_lqs_cur_bits=3'
Formatting mgs, mds, osts
Format mds1: /dev/vdc
pdsh@oleg406-client: oleg406-server: ssh exited with exit code 1
Format mds2: /dev/vdd
pdsh@oleg406-client: oleg406-server: ssh exited with exit code 1
Format ost1: /dev/vde
pdsh@oleg406-client: oleg406-server: ssh exited with exit code 1
Format ost2: /dev/vdf
pdsh@oleg406-client: oleg406-server: ssh exited with exit code 1
Checking servers environments
Checking clients oleg406-client.virtnet environments
Loading modules from /home/green/git/lustre-release/lustre
detected 4 online CPUs by sysfs
Force libcfs to create 2 CPU partitions
libkmod: kmod_module_get_holders: could not open '/sys/module/acpi_cpufreq/holders': No such file or directory
loading modules on: 'oleg406-server'
oleg406-server: oleg406-server.virtnet: executing load_modules_local
oleg406-server: Loading modules from /home/green/git/lustre-release/lustre
oleg406-server: detected 4 online CPUs by sysfs
oleg406-server: Force libcfs to create 2 CPU partitions
Setup mgs, mdt, osts
pdsh@oleg406-client: oleg406-server: ssh exited with exit code 1
Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1
oleg406-server: oleg406-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
pdsh@oleg406-client: oleg406-server: ssh exited with exit code 1
Commit the device label on /dev/vdc
Started lustre-MDT0000
pdsh@oleg406-client: oleg406-server: ssh exited with exit code 1
Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2
oleg406-server: oleg406-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
pdsh@oleg406-client: oleg406-server: ssh exited with exit code 1
Commit the device label on /dev/vdd
Started lustre-MDT0001
pdsh@oleg406-client: oleg406-server: ssh exited with exit code 1
Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1
oleg406-server: mount.lustre: increased '/sys/devices/virtual/block/dm-2/queue/max_sectors_kb' from 512 to 16384
oleg406-server: mount.lustre: increased '/sys/devices/pci0000:00/0000:00:09.0/virtio5/block/vde/queue/max_sectors_kb' from 512 to 16384
seq.cli-lustre-OST0000-super.width=65536
oleg406-server: oleg406-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
pdsh@oleg406-client: oleg406-server: ssh exited with exit code 1
Commit the device label on /dev/vde
Started lustre-OST0000
pdsh@oleg406-client: oleg406-server: ssh exited with exit code 1
Starting ost2: -o localrecov /dev/mapper/ost2_flakey /mnt/lustre-ost2
oleg406-server: mount.lustre: increased '/sys/devices/virtual/block/dm-3/queue/max_sectors_kb' from 512 to 16384
oleg406-server: mount.lustre: increased '/sys/devices/pci0000:00/0000:00:0a.0/virtio6/block/vdf/queue/max_sectors_kb' from 512 to 16384
seq.cli-lustre-OST0001-super.width=65536
oleg406-server: oleg406-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
pdsh@oleg406-client: oleg406-server: ssh exited with exit code 1
Commit the device label on /dev/vdf
Started lustre-OST0001
Starting client: oleg406-client.virtnet: -o user_xattr,flock oleg406-server@tcp:/lustre /mnt/lustre
Starting client oleg406-client.virtnet: -o user_xattr,flock oleg406-server@tcp:/lustre /mnt/lustre
Started clients oleg406-client.virtnet:
192.168.204.106@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project)
oleg406-client: Warning: Permanently added 'oleg406-client.virtnet' (ECDSA) to the list of known hosts.
Using TIMEOUT=20
osc.lustre-OST0000-osc-ffff88012abbd000.idle_timeout=debug
osc.lustre-OST0001-osc-ffff88012abbd000.idle_timeout=debug
setting jobstats to procname_uid
Setting lustre.sys.jobid_var from disable to procname_uid
Waiting 90s for 'procname_uid'
Updated after 2s: want 'procname_uid' got 'procname_uid'
disable quota as required
oleg406-client: oleg406-client.virtnet: executing check_logdir /tmp/testlogs/
oleg406-server: oleg406-server.virtnet: executing check_logdir /tmp/testlogs/
Logging to local directory: /tmp/testlogs/
oleg406-client: oleg406-client.virtnet: executing yml_node
oleg406-server: oleg406-server.virtnet: executing yml_node
Client: 2.15.62.23
client: CentOS Linux release 7.9.2009 (Core)
CLIENT_OS_ID="centos"
CLIENT_OS_ID_LIKE="rhel fedora"
CLIENT_OS_VERSION_ID="7"
MDS: 2.15.62.23
mds1: CentOS Linux release 7.9.2009 (Core)
MDS1_OS_ID="centos"
MDS1_OS_ID_LIKE="rhel fedora"
MDS1_OS_VERSION_ID="7"
OSS: 2.15.62.23
ost1: CentOS Linux release 7.9.2009 (Core)
OST1_OS_ID="centos"
OST1_OS_ID_LIKE="rhel fedora"
OST1_OS_VERSION_ID="7"
running: replay-vbr
run_suite replay-vbr /home/green/git/lustre-release/lustre/tests/replay-vbr.sh
replay-vbr returned 0
Failing mds1 on oleg406-server
Stopping /mnt/lustre-mds1 (opts:) on oleg406-server
00:40:05 (1713415205) shut down
Failover mds1 to oleg406-server
mount facets: mds1
pdsh@oleg406-client: oleg406-server: ssh exited with exit code 1
Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1
oleg406-server: oleg406-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
pdsh@oleg406-client: oleg406-server: ssh exited with exit code 1
Started lustre-MDT0000
00:40:19 (1713415219) targets are mounted
00:40:19 (1713415219) facet_failover done
oleg406-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid
mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec
Stopping clients: oleg406-client.virtnet /mnt/lustre (opts:)
Stopping client oleg406-client.virtnet /mnt/lustre opts:
Stopping clients: oleg406-client.virtnet /mnt/lustre2 (opts:)
Stopping /mnt/lustre-mds1 (opts:-f) on oleg406-server
Stopping /mnt/lustre-mds2 (opts:-f) on oleg406-server
Stopping /mnt/lustre-ost1 (opts:-f) on oleg406-server
Stopping /mnt/lustre-ost2 (opts:-f) on oleg406-server
unloading modules on: 'oleg406-server'
oleg406-server: oleg406-server.virtnet: executing unload_modules_local
modules unloaded.
Finished at Thu Apr 18 00:41:04 EDT 2024 in 2169s
/home/green/git/lustre-release/lustre/tests/auster: completed with rc 0