== replay-vbr test 12a: lost data due to missed REMOTE client during replay ========================================================== 07:20:26 (1713439226)
Starting client: oleg413-client.virtnet: -o user_xattr,flock oleg413-server@tcp:/lustre /mnt/lustre2
mdt.lustre-MDT0000.commit_on_sharing=0
UUID                   1K-blocks        Used   Available Use% Mounted on
lustre-MDT0000_UUID      1414116        1924     1285764   1% /mnt/lustre[MDT:0]
lustre-MDT0001_UUID      1414116        1704     1285984   1% /mnt/lustre[MDT:1]
lustre-OST0000_UUID      3833116        1524     3605496   1% /mnt/lustre[OST:0]
lustre-OST0001_UUID      3833116        1524     3605496   1% /mnt/lustre[OST:1]
filesystem_summary:      7666232        3048     7210992   1% /mnt/lustre
total: 25 open/close in 0.08 seconds: 296.81 ops/second
total: 25 open/close in 0.08 seconds: 295.17 ops/second
Stopping client oleg413-client.virtnet /mnt/lustre2 (opts:)
Failing mds1 on oleg413-server
Stopping /mnt/lustre-mds1 (opts:) on oleg413-server
07:20:30 (1713439230) shut down
Failover mds1 to oleg413-server
mount facets: mds1
Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1
oleg413-server: oleg413-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
pdsh@oleg413-client: oleg413-server: ssh exited with exit code 1
Started lustre-MDT0000
07:20:46 (1713439246) targets are mounted
07:20:46 (1713439246) facet_failover done
 - unlinked 0 (time 1713439316 ; total 0 ; last 0)
total: 25 unlinks in 0 seconds: inf unlinks/second
 - unlinked 0 (time 1713439316 ; total 0 ; last 0)
total: 25 unlinks in 1 seconds: 25.000000 unlinks/second
Can't lstat /mnt/lustre/d12a.replay-vbr/f12a.replay-vbr: No such file or directory