-----============= acceptance-small: replay-single ============----- Tue Apr 16 10:49:30 EDT 2024 excepting tests: 110f 131b 59 36 === replay-single: start setup 10:49:33 (1713278973) === oleg438-client.virtnet: executing check_config_client /mnt/lustre oleg438-client.virtnet: Checking config lustre mounted on /mnt/lustre Checking servers environments Checking clients oleg438-client.virtnet environments Using TIMEOUT=20 osc.lustre-OST0000-osc-ffff8800abf96800.idle_timeout=debug osc.lustre-OST0001-osc-ffff8800abf96800.idle_timeout=debug disable quota as required oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all === replay-single: finish setup 10:49:41 (1713278981) === debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 0a: empty replay =================== 10:49:42 (1713278982) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 10:49:45 (1713278985) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 10:49:57 (1713278997) targets are mounted 10:49:57 (1713278997) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 0a (20s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == 
replay-single test 0b: ensure object created after recover exists. (3284) ========================================================== 10:50:04 (1713279004) Failing ost1 on oleg438-server Stopping /mnt/lustre-ost1 (opts:) on oleg438-server 10:50:05 (1713279005) shut down Failover ost1 to oleg438-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-OST0000 10:50:18 (1713279018) targets are mounted 10:50:18 (1713279018) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec total: 20 open/close in 0.07 seconds: 289.33 ops/second - unlinked 0 (time 1713279021 ; total 0 ; last 0) total: 20 unlinks in 0 seconds: inf unlinks/second PASS 0b (18s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 0c: check replay-barrier =========== 10:50:24 (1713279024) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 10:50:27 (1713279027) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started 
lustre-MDT0000 10:50:39 (1713279039) targets are mounted 10:50:39 (1713279039) facet_failover done Starting client: oleg438-client.virtnet: -o user_xattr,flock oleg438-server@tcp:/lustre /mnt/lustre rm: cannot remove '/mnt/lustre/f0c.replay-single': No such file or directory PASS 0c (78s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 0d: expired recovery with no clients ========================================================== 10:51:44 (1713279104) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 10:51:46 (1713279106) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 10:51:59 (1713279119) targets are mounted 10:51:59 (1713279119) facet_failover done Starting client: oleg438-client.virtnet: -o user_xattr,flock oleg438-server@tcp:/lustre /mnt/lustre PASS 0d (77s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 1: simple create =================== 10:53:03 (1713279183) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 
10:53:05 (1713279185) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 10:53:18 (1713279198) targets are mounted 10:53:18 (1713279198) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f1.replay-single has type file OK PASS 1 (19s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 2a: touch ========================== 10:53:24 (1713279204) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 10:53:26 (1713279206) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 10:53:38 (1713279218) targets are mounted 10:53:38 (1713279218) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2a.replay-single has type file OK PASS 2a (19s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y 
debug_raw_pointers=Y == replay-single test 2b: touch ========================== 10:53:44 (1713279224) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 10:53:47 (1713279227) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 10:53:59 (1713279239) targets are mounted 10:53:59 (1713279239) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2b.replay-single has type file OK PASS 2b (20s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 2c: setstripe replay =============== 10:54:06 (1713279246) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 10:54:08 (1713279248) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all 
pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 10:54:21 (1713279261) targets are mounted 10:54:21 (1713279261) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2c.replay-single has type file OK PASS 2c (20s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 2d: setdirstripe replay ============ 10:54:27 (1713279267) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 10:54:30 (1713279270) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 10:54:42 (1713279282) targets are mounted 10:54:42 (1713279282) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d2d.replay-single has type dir OK PASS 2d (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 2e: O_CREAT|O_EXCL create replay === 10:54:54 (1713279294) fail_loc=0x8000013b UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] 
lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 10:54:58 (1713279298) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 Succeed in opening file "/mnt/lustre/f2e.replay-single"(flags=O_CREAT) oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 10:55:10 (1713279310) targets are mounted 10:55:10 (1713279310) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2e.replay-single has type file OK PASS 2e (21s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 3a: replay failed open(O_DIRECTORY) ========================================================== 10:55:16 (1713279316) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Error in opening file "/mnt/lustre/f3a.replay-single"(flags=O_DIRECTORY) 20: Not a directory Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 10:55:19 (1713279319) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit 
code 1 Started lustre-MDT0000 10:55:32 (1713279332) targets are mounted 10:55:32 (1713279332) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f3a.replay-single has type file OK PASS 3a (20s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 3b: replay failed open -ENOMEM ===== 10:55:38 (1713279338) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3200 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre fail_loc=0x80000114 touch: cannot touch '/mnt/lustre/f3b.replay-single': Cannot allocate memory fail_loc=0 Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 10:55:40 (1713279340) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 10:55:53 (1713279353) targets are mounted 10:55:53 (1713279353) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f3b.replay-single: No such file or directory PASS 3b (19s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 3c: replay failed open -ENOMEM ===== 10:55:59 (1713279359) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] 
lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre fail_loc=0x80000128 touch: cannot touch '/mnt/lustre/f3c.replay-single': Cannot allocate memory fail_loc=0 Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 10:56:02 (1713279362) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 10:56:16 (1713279376) targets are mounted 10:56:16 (1713279376) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f3c.replay-single: No such file or directory PASS 3c (22s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 4a: |x| 10 open(O_CREAT)s ========== 10:56:23 (1713279383) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 10:56:27 (1713279387) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 
10:56:40 (1713279400) targets are mounted 10:56:40 (1713279400) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 4a (21s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 4b: |x| rm 10 files ================ 10:56:46 (1713279406) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3761152 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3761152 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7522304 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 10:56:49 (1713279409) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 10:57:03 (1713279423) targets are mounted 10:57:03 (1713279423) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f4b.replay-single-*: No such file or directory PASS 4b (22s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 5: |x| 220 open(O_CREAT) =========== 10:57:10 (1713279430) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing 
mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 10:57:13 (1713279433) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 10:57:26 (1713279446) targets are mounted 10:57:26 (1713279446) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 5 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 6a: mkdir + contained create ======= 10:57:37 (1713279457) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 4096 3765248 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 4096 3765248 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 8192 7530496 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 10:57:39 (1713279459) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 10:57:52 (1713279472) targets are mounted 10:57:52 (1713279472) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d6a.replay-single has type dir OK /mnt/lustre/d6a.replay-single/f6a.replay-single has type file 
OK PASS 6a (21s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 6b: |X| rmdir ====================== 10:58:00 (1713279480) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 10:58:03 (1713279483) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 10:58:16 (1713279496) targets are mounted 10:58:16 (1713279496) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/d6b.replay-single: No such file or directory PASS 6b (20s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 7: mkdir |X| contained create ====== 10:58:22 (1713279502) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 10:58:25 (1713279505) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: 
executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 10:58:37 (1713279517) targets are mounted 10:58:37 (1713279517) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d7.replay-single has type dir OK /mnt/lustre/d7.replay-single/f7.replay-single has type file OK PASS 7 (20s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 8: creat open |X| close ============ 10:58:44 (1713279524) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre multiop /mnt/lustre/f8.replay-single vmo_c TMPPIPE=/tmp/multiop_open_wait_pipe.6888 Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 10:58:47 (1713279527) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 10:59:00 (1713279540) targets are mounted 10:59:00 (1713279540) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f8.replay-single /mnt/lustre/f8.replay-single has type file OK PASS 8 (20s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == 
replay-single test 9: |X| create (same inum/gen) ====== 10:59:06 (1713279546) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 10:59:08 (1713279548) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 10:59:21 (1713279561) targets are mounted 10:59:21 (1713279561) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec old_inum == 144115305935798546, new_inum == 144115305935798546 old_inum and new_inum match PASS 9 (19s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 10: create |X| rename unlink ======= 10:59:27 (1713279567) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 10:59:30 (1713279570) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super 
lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 10:59:42 (1713279582) targets are mounted 10:59:42 (1713279582) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f10.replay-single: No such file or directory PASS 10 (19s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 11: create open write rename |X| create-old-name read ========================================================== 10:59:48 (1713279588) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre new old Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 10:59:51 (1713279591) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:00:03 (1713279603) targets are mounted 11:00:03 (1713279603) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec new old PASS 11 (19s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 12: open, unlink |X| close ========= 11:00:09 (1713279609) multiop /mnt/lustre/f12.replay-single vo_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.6888 UUID 1K-blocks Used 
Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:00:11 (1713279611) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:00:24 (1713279624) targets are mounted 11:00:24 (1713279624) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 12 (19s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 13: open chmod 0 |x| write close === 11:00:29 (1713279629) multiop /mnt/lustre/f13.replay-single vO_wc TMPPIPE=/tmp/multiop_open_wait_pipe.6888 /mnt/lustre/f13.replay-single has perms 00 OK UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:00:32 (1713279632) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited 
with exit code 1 Started lustre-MDT0000 11:00:44 (1713279644) targets are mounted 11:00:44 (1713279644) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f13.replay-single has perms 00 OK /mnt/lustre/f13.replay-single has size 1 OK PASS 13 (20s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 14: open(O_CREAT), unlink |X| close ========================================================== 11:00:52 (1713279652) multiop /mnt/lustre/f14.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.6888 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:00:55 (1713279655) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:01:08 (1713279668) targets are mounted 11:01:08 (1713279668) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 14 (20s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 15: open(O_CREAT), unlink |X| touch new, close ========================================================== 11:01:14 (1713279674) multiop 
/mnt/lustre/f15.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.6888 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:01:16 (1713279676) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:01:29 (1713279689) targets are mounted 11:01:29 (1713279689) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 15 (19s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 16: |X| open(O_CREAT), unlink, touch new, unlink new ========================================================== 11:01:35 (1713279695) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:01:37 (1713279697) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all 
pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:01:49 (1713279709) targets are mounted 11:01:49 (1713279709) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 16 (19s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 17: |X| open(O_CREAT), |replay| close ========================================================== 11:01:55 (1713279715) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre multiop /mnt/lustre/f17.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.6888 Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:01:58 (1713279718) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:02:10 (1713279730) targets are mounted 11:02:10 (1713279730) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f17.replay-single has type file OK PASS 17 (20s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 18: open(O_CREAT), unlink, touch new, close, touch, unlink ========================================================== 11:02:16 (1713279736) UUID 
1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3200 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre multiop /mnt/lustre/f18.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.6888 pid: 21657 will close Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:02:19 (1713279739) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:02:32 (1713279752) targets are mounted 11:02:32 (1713279752) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 18 (21s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 19: mcreate, open, write, rename === 11:02:39 (1713279759) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre old Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:02:42 (1713279762) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited 
with exit code 1 Started lustre-MDT0000 11:02:55 (1713279775) targets are mounted 11:02:55 (1713279775) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec old PASS 19 (21s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 20a: |X| open(O_CREAT), unlink, replay, close (test mds_cleanup_orphans) ========================================================== 11:03:03 (1713279783) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre multiop /mnt/lustre/f20a.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.6888 Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:03:06 (1713279786) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:03:19 (1713279799) targets are mounted 11:03:19 (1713279799) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 20a (22s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 20b: write, unlink, eviction, replay (test mds_cleanup_orphans) ========================================================== 11:03:27 (1713279807) /mnt/lustre/f20b.replay-single lmm_stripe_count: 1 
lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 0 obdidx objid objid group 0 1090 0x442 0x240000400 10000+0 records in 10000+0 records out 40960000 bytes (41 MB) copied, 1.22792 s, 33.4 MB/s Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:03:32 (1713279812) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:03:45 (1713279825) targets are mounted 11:03:45 (1713279825) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec affected facets: mds1 oleg438-server: oleg438-server.virtnet: executing _wait_recovery_complete *.lustre-MDT0000.recovery_status 1475 oleg438-server: *.lustre-MDT0000.recovery_status status: COMPLETE sleep 5 for ZFS zfs Waiting for MDT destroys to complete before 6144, after 6144 PASS 20b (32s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 20c: check that client eviction does not affect file content ========================================================== 11:04:01 (1713279841) multiop /mnt/lustre/f20c.replay-single vOw_c TMPPIPE=/tmp/multiop_open_wait_pipe.6888 -rw-r--r-- 1 root root 1 Apr 16 11:04 /mnt/lustre/f20c.replay-single pdsh@oleg438-client: oleg438-client: ssh exited with exit code 5 PASS 20c (5s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 21: |X| open(O_CREAT), unlink touch new, replay, close (test mds_cleanup_orphans) ========================================================== 11:04:08 (1713279848) UUID 
1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre multiop /mnt/lustre/f21.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.6888 Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:04:12 (1713279852) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:04:25 (1713279865) targets are mounted 11:04:25 (1713279865) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 21 (23s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 22: open(O_CREAT), |X| unlink, replay, close (test mds_cleanup_orphans) ========================================================== 11:04:33 (1713279873) multiop /mnt/lustre/f22.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.6888 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3764224 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7530496 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:04:37 (1713279877) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: 
oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:04:50 (1713279890) targets are mounted 11:04:50 (1713279890) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 22 (23s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 23: open(O_CREAT), |X| unlink touch new, replay, close (test mds_cleanup_orphans) ========================================================== 11:04:58 (1713279898) multiop /mnt/lustre/f23.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.6888 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:05:01 (1713279901) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:05:14 (1713279914) targets are mounted 11:05:14 (1713279914) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 23 (22s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 24: open(O_CREAT), replay, unlink, 
close (test mds_cleanup_orphans) ========================================================== 11:05:22 (1713279922) multiop /mnt/lustre/f24.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.6888 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:05:25 (1713279925) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:05:39 (1713279939) targets are mounted 11:05:39 (1713279939) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 24 (22s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 25: open(O_CREAT), unlink, replay, close (test mds_cleanup_orphans) ========================================================== 11:05:47 (1713279947) multiop /mnt/lustre/f25.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.6888 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:05:50 (1713279950) shut down Failover mds1 to oleg438-server 
mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:06:03 (1713279963) targets are mounted 11:06:03 (1713279963) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 25 (22s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 26: |X| open(O_CREAT), unlink two, close one, replay, close one (test mds_cleanup_orphans) ========================================================== 11:06:11 (1713279971) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre multiop /mnt/lustre/f26.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.6888 multiop /mnt/lustre/f26.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.6888 Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:06:15 (1713279975) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:06:28 (1713279988) targets are mounted 11:06:28 (1713279988) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid 
mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 26 (23s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 27: |X| open(O_CREAT), unlink two, replay, close two (test mds_cleanup_orphans) ========================================================== 11:06:36 (1713279996) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre multiop /mnt/lustre/f27.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.6888 multiop /mnt/lustre/f27.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.6888 Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:06:40 (1713280000) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:06:54 (1713280014) targets are mounted 11:06:54 (1713280014) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 27 (23s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 28: open(O_CREAT), |X| unlink two, close one, replay, close one (test mds_cleanup_orphans) ========================================================== 11:07:01 (1713280021) multiop /mnt/lustre/f28.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.6888 multiop /mnt/lustre/f28.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.6888 
UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:07:04 (1713280024) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:07:18 (1713280038) targets are mounted 11:07:18 (1713280038) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 28 (23s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 29: open(O_CREAT), |X| unlink two, replay, close two (test mds_cleanup_orphans) ========================================================== 11:07:26 (1713280046) multiop /mnt/lustre/f29.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.6888 multiop /mnt/lustre/f29.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.6888 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:07:29 (1713280049) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: 
oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:07:43 (1713280063) targets are mounted 11:07:43 (1713280063) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 29 (23s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 30: open(O_CREAT) two, unlink two, replay, close two (test mds_cleanup_orphans) ========================================================== 11:07:51 (1713280071) multiop /mnt/lustre/f30.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.6888 multiop /mnt/lustre/f30.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.6888 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:07:54 (1713280074) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:08:08 (1713280088) targets are mounted 11:08:08 (1713280088) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 30 (23s) debug_raw_pointers=0 debug_raw_pointers=0 
debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 31: open(O_CREAT) two, unlink one, |X| unlink one, close two (test mds_cleanup_orphans) ========================================================== 11:08:16 (1713280096) multiop /mnt/lustre/f31.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.6888 multiop /mnt/lustre/f31.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.6888 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:08:19 (1713280099) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:08:33 (1713280113) targets are mounted 11:08:33 (1713280113) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 31 (22s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 32: close() notices client eviction; close() after client eviction ========================================================== 11:08:41 (1713280121) multiop /mnt/lustre/f32.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.6888 multiop /mnt/lustre/f32.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.6888 pdsh@oleg438-client: oleg438-client: ssh exited with exit code 5 PASS 32 (5s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y 
debug_raw_pointers=Y == replay-single test 33a: fid seq shouldn't be reused after abort recovery ========================================================== 11:08:48 (1713280128) total: 10 open/close in 0.07 seconds: 139.45 ops/second Replay barrier on lustre-MDT0000 Stopping /mnt/lustre-mds1 (opts:) on oleg438-server Failover mds1 to oleg438-server Starting mds1: -o localrecov -o abort_recovery lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 total: 10 open/close in 0.04 seconds: 263.90 ops/second PASS 33a (11s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 33b: test fid seq allocation ======= 11:09:01 (1713280141) fail_loc=0x1311 total: 10 open/close in 0.07 seconds: 146.45 ops/second Replay barrier on lustre-MDT0000 Stopping /mnt/lustre-mds1 (opts:) on oleg438-server Failover mds1 to oleg438-server Starting mds1: -o localrecov -o abort_recovery lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 oleg438-client: error: invalid path '/mnt/lustre': Input/output error pdsh@oleg438-client: oleg438-client: ssh exited with exit code 5 first stat failed: 5 total: 10 open/close in 0.08 seconds: 130.41 ops/second PASS 33b (11s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 34: abort recovery before client does replay (test mds_cleanup_orphans) ========================================================== 11:09:15 (1713280155) multiop /mnt/lustre/f34.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.6888 UUID 1K-blocks Used Available Use% Mounted 
on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Stopping /mnt/lustre-mds1 (opts:) on oleg438-server Failover mds1 to oleg438-server Starting mds1: -o localrecov -o abort_recovery lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 PASS 34 (11s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 35: test recovery from llog for unlink op ========================================================== 11:09:28 (1713280168) fail_loc=0x80000119 Stopping /mnt/lustre-mds1 (opts:) on oleg438-server Failover mds1 to oleg438-server Starting mds1: -o localrecov -o abort_recovery lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all rm: cannot remove '/mnt/lustre/f35.replay-single': Input/output error pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 Can't lstat /mnt/lustre/f35.replay-single: No such file or directory PASS 35 (12s) debug_raw_pointers=0 debug_raw_pointers=0 SKIP: replay-single test_36 skipping ALWAYS excluded test 36 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 37: abort recovery before client does replay (test mds_cleanup_orphans for directories) ========================================================== 11:09:43 (1713280183) multiop /mnt/lustre/d37.replay-single/f37.replay-single vdD_c TMPPIPE=/tmp/multiop_open_wait_pipe.6888 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3456 2205184 1% /mnt/lustre[MDT:0] 
lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Stopping /mnt/lustre-mds1 (opts:) on oleg438-server Failover mds1 to oleg438-server Starting mds1: -o localrecov -o abort_recovery lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 PASS 37 (12s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 38: test recovery from unlink llog (test llog_gen_rec) ========================================================== 11:09:58 (1713280198) total: 800 open/close in 4.52 seconds: 177.07 ops/second - unlinked 0 (time 1713280205 ; total 0 ; last 0) total: 400 unlinks in 1 seconds: 400.000000 unlinks/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3712 2204928 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:10:09 (1713280209) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:10:22 (1713280222) targets are mounted 11:10:22 (1713280222) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 
- unlinked 0 (time 1713280226 ; total 0 ; last 0) total: 400 unlinks in 2 seconds: 200.000000 unlinks/second Can't lstat /mnt/lustre/f38.replay-single-*: No such file or directory PASS 38 (35s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 39: test recovery from unlink llog (test llog_gen_rec) ========================================================== 11:10:35 (1713280235) total: 800 open/close in 4.38 seconds: 182.53 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3840 2204800 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre - unlinked 0 (time 1713280243 ; total 0 ; last 0) total: 400 unlinks in 1 seconds: 400.000000 unlinks/second Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:10:46 (1713280246) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:11:00 (1713280260) targets are mounted 11:11:00 (1713280260) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec - unlinked 0 (time 1713280265 ; total 0 ; last 0) total: 400 unlinks in 2 seconds: 200.000000 unlinks/second Can't lstat /mnt/lustre/f39.replay-single-*: No such file or directory PASS 39 (36s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 41: read from a valid osc while other oscs are invalid 
========================================================== 11:11:14 (1713280274) 1+0 records in 1+0 records out 4096 bytes (4.1 kB) copied, 0.00294317 s, 1.4 MB/s 1+0 records in 1+0 records out 4096 bytes (4.1 kB) copied, 0.00654093 s, 626 kB/s PASS 41 (3s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 42: recovery after ost failure ===== 11:11:19 (1713280279) total: 800 open/close in 4.47 seconds: 178.96 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3712 2204928 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre - unlinked 0 (time 1713280288 ; total 0 ; last 0) total: 400 unlinks in 1 seconds: 400.000000 unlinks/second debug=-1 Failing ost1 on oleg438-server Stopping /mnt/lustre-ost1 (opts:) on oleg438-server 11:11:31 (1713280291) shut down Failover ost1 to oleg438-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-OST0000 11:11:45 (1713280305) targets are mounted 11:11:45 (1713280305) facet_failover done wait for MDS to timeout and recover debug=super ioctl neterror warning dlmtrace error emerg ha rpctrace vfstrace config console lfsck debug=super ioctl neterror warning dlmtrace error emerg ha rpctrace vfstrace config console lfsck - unlinked 0 (time 1713280347 ; total 0 ; last 0) total: 400 unlinks in 1 seconds: 400.000000 unlinks/second Can't lstat /mnt/lustre/f42.replay-single-*: No such file or directory PASS 42 (71s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 
43: mds osc import failure during recovery; don't LBUG ========================================================== 11:12:33 (1713280353) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3712 2204928 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre fail_loc=0x80000204 Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:12:37 (1713280357) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:12:50 (1713280370) targets are mounted 11:12:50 (1713280370) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 43 (33s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 44a: race in target handle connect ========================================================== 11:13:08 (1713280388) at_max=40 1 of 10 (1713280390) service : cur 5 worst 5 (at 1713280371, 19s ago) 4 0 0 0 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3456 2205056 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre 2 of 10 (1713280395) service : cur 5 worst 5 (at 1713280371, 25s ago) 5 0 0 0 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3456 2205056 1% 
/mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre 3 of 10 (1713280401) service : cur 5 worst 5 (at 1713280371, 31s ago) 5 0 0 0 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3456 2205056 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre 4 of 10 (1713280407) service : cur 5 worst 5 (at 1713280371, 36s ago) 5 0 0 0 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3456 2205056 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre 5 of 10 (1713280412) service : cur 5 worst 5 (at 1713280371, 42s ago) 5 0 0 0 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3456 2205056 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre 6 of 10 (1713280418) service : cur 5 worst 5 (at 1713280371, 48s ago) 5 0 0 0 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3456 2205056 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre 7 of 10 (1713280424) service : cur 5 worst 5 (at 1713280371, 53s ago) 5 0 0 0 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3456 2205056 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 
3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre 8 of 10 (1713280429) service : cur 5 worst 5 (at 1713280371, 59s ago) 5 0 0 0 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3456 2205056 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre 9 of 10 (1713280435) service : cur 5 worst 5 (at 1713280371, 65s ago) 5 0 0 0 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3456 2205056 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre 10 of 10 (1713280441) service : cur 5 worst 5 (at 1713280371, 70s ago) 5 0 0 0 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3456 2205056 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre fail_loc=0 at_max=600 PASS 44a (61s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 44b: race in target handle connect ========================================================== 11:14:11 (1713280451) 1 of 10 (1713280452) service : cur 5 worst 5 (at 1713280371, 81s ago) 5 0 0 0 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.204.138@tcp:/lustre 7542784 6144 7532544 1% /mnt/lustre 2 of 10 (1713280492) service : cur 40 worst 40 (at 1713280492, 1s ago) 40 0 0 0 fail_loc=0x80000704 error: recover: Connection timed out df: '/mnt/lustre': Operation 
already in progress 3 of 10 (1713280513) service : cur 40 worst 40 (at 1713280492, 22s ago) 40 0 0 0 fail_loc=0x80000704 error: recover: Connection timed out df: '/mnt/lustre': Operation already in progress 4 of 10 (1713280534) service : cur 40 worst 40 (at 1713280492, 42s ago) 40 40 0 0 fail_loc=0x80000704 error: recover: Connection timed out df: '/mnt/lustre': Operation already in progress 5 of 10 (1713280554) service : cur 40 worst 40 (at 1713280492, 63s ago) 40 40 0 0 fail_loc=0x80000704 error: recover: Connection timed out df: '/mnt/lustre': Operation already in progress 6 of 10 (1713280575) service : cur 40 worst 40 (at 1713280492, 83s ago) 40 40 0 0 fail_loc=0x80000704 error: recover: Connection timed out df: '/mnt/lustre': Operation already in progress 7 of 10 (1713280596) service : cur 40 worst 40 (at 1713280492, 104s ago) 40 40 0 0 fail_loc=0x80000704 error: recover: Connection timed out df: '/mnt/lustre': Operation already in progress 8 of 10 (1713280616) service : cur 40 worst 40 (at 1713280492, 125s ago) 40 40 0 0 fail_loc=0x80000704 error: recover: Connection timed out df: '/mnt/lustre': Operation already in progress 9 of 10 (1713280637) service : cur 40 worst 40 (at 1713280492, 145s ago) 40 40 0 0 fail_loc=0x80000704 error: recover: Connection timed out df: '/mnt/lustre': Operation already in progress 10 of 10 (1713280658) service : cur 40 worst 40 (at 1713280492, 166s ago) 40 40 0 0 fail_loc=0x80000704 error: recover: Connection timed out df: '/mnt/lustre': Operation already in progress PASS 44b (229s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 44c: race in target handle connect ========================================================== 11:18:02 (1713280682) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3456 2205056 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% 
/mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre total: 100 create in 0.45 seconds: 223.14 ops/second fail_loc=0x80000712 Stopping /mnt/lustre-mds1 (opts:) on oleg438-server Failover mds1 to oleg438-server Starting mds1: -o localrecov -o abort_recovery lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 oleg438-client: error: invalid path '/mnt/lustre': Input/output error pdsh@oleg438-client: oleg438-client: ssh exited with exit code 5 first stat failed: 5 unlink(/mnt/lustre/f44c.replay-single-0) error: No such file or directory total: 0 unlinks in 0 seconds: -nan unlinks/second Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:18:15 (1713280695) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:18:29 (1713280709) targets are mounted 11:18:29 (1713280709) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec unlink(/mnt/lustre/f44c.replay-single-0) error: No such file or directory total: 0 unlinks in 0 seconds: -nan unlinks/second PASS 44c (32s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 45: Handle failed close ============ 11:18:37 (1713280717) multiop /mnt/lustre/f45.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.6888 /mnt/lustre/f45.replay-single has type file OK PASS 45 (4s) 
debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 46: Don't leak file handle after open resend (3325) ========================================================== 11:18:42 (1713280722) fail_loc=0x122 fail_loc=0 Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:19:00 (1713280740) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:19:14 (1713280754) targets are mounted 11:19:14 (1713280754) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec lfs path2fid: cannot get fid for 'f46.replay-single': No such file or directory PASS 46 (38s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 47: MDS->OSC failure during precreate cleanup (2824) ========================================================== 11:19:22 (1713280762) total: 20 open/close in 0.15 seconds: 136.89 ops/second Failing ost1 on oleg438-server Stopping /mnt/lustre-ost1 (opts:) on oleg438-server 11:19:24 (1713280764) shut down Failover ost1 to oleg438-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-OST0000 11:19:38 (1713280778) targets are mounted 11:19:38 (1713280778) facet_failover done oleg438-client.virtnet: executing 
wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec fail_loc=0x80000204 total: 20 open/close in 0.13 seconds: 159.66 ops/second - unlinked 0 (time 1713280844 ; total 0 ; last 0) total: 20 unlinks in 0 seconds: inf unlinks/second PASS 47 (83s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 48: MDS->OSC failure during precreate cleanup (2824) ========================================================== 11:20:48 (1713280848) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3584 2205056 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre total: 20 open/close in 0.12 seconds: 170.68 ops/second Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:20:51 (1713280851) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:21:05 (1713280865) targets are mounted 11:21:05 (1713280865) facet_failover done fail_loc=0x80000216 total: 20 open/close in 0.14 seconds: 143.46 ops/second - unlinked 0 (time 1713280927 ; total 0 ; last 0) total: 40 unlinks in 0 seconds: inf unlinks/second PASS 48 (80s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 50: Double OSC recovery, don't LASSERT (3812) ========================================================== 11:22:11 (1713280931) PASS 50 (8s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y 
== replay-single test 52: time out lock replay (3764) ==== 11:22:21 (1713280941) multiop /mnt/lustre/f52.replay-single vs_s TMPPIPE=/tmp/multiop_open_wait_pipe.6888 fail_loc=0x80000157 Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:22:24 (1713280944) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:22:37 (1713280957) targets are mounted 11:22:37 (1713280957) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0x0 fail_loc=0x0 PASS 52 (39s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53a: |X| close request while two MDC requests in flight ========================================================== 11:23:02 (1713280982) fail_loc=0x80000115 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:23:07 (1713280987) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:23:21 (1713281001) targets are mounted 11:23:21 (1713281001) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53a.replay-single-1/f 
has type file OK /mnt/lustre/d53a.replay-single-2/f has type file OK PASS 53a (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53b: |X| open request while two MDC requests in flight ========================================================== 11:23:29 (1713281009) multiop /mnt/lustre/d53b.replay-single-1/f vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.6888 fail_loc=0x80000107 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:23:34 (1713281014) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:23:46 (1713281026) targets are mounted 11:23:46 (1713281026) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53b.replay-single-1/f has type file OK /mnt/lustre/d53b.replay-single-2/f has type file OK PASS 53b (22s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53c: |X| open request and close request while two MDC requests in flight ========================================================== 11:23:53 (1713281033) fail_loc=0x80000107 fail_loc=0x80000115 Replay barrier on lustre-MDT0000 Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:23:57 (1713281037) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl 
super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:24:09 (1713281049) targets are mounted 11:24:09 (1713281049) facet_failover done fail_loc=0 /mnt/lustre/d53c.replay-single-1/f has type file OK /mnt/lustre/d53c.replay-single-2/f has type file OK PASS 53c (20s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53d: close reply while two MDC requests in flight ========================================================== 11:24:15 (1713281055) fail_loc=0x8000013b fail_loc=0 Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:24:18 (1713281058) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:24:31 (1713281071) targets are mounted 11:24:31 (1713281071) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53d.replay-single-1/f has type file OK /mnt/lustre/d53d.replay-single-2/f has type file OK PASS 53d (21s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53e: |X| open reply while two MDC requests in flight ========================================================== 11:24:38 (1713281078) fail_loc=0x119 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:24:43 (1713281083) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing 
set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:24:56 (1713281096) targets are mounted 11:24:56 (1713281096) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53e.replay-single-1/f has type file OK /mnt/lustre/d53e.replay-single-2/f has type file OK PASS 53e (23s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53f: |X| open reply and close reply while two MDC requests in flight ========================================================== 11:25:04 (1713281104) fail_loc=0x119 fail_loc=0x8000013b Replay barrier on lustre-MDT0000 Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:25:09 (1713281109) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:25:22 (1713281122) targets are mounted 11:25:22 (1713281122) facet_failover done fail_loc=0 /mnt/lustre/d53f.replay-single-1/f has type file OK /mnt/lustre/d53f.replay-single-2/f has type file OK PASS 53f (23s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53g: |X| drop open reply and close request while close and open are both in flight ========================================================== 11:25:29 (1713281129) fail_loc=0x119 fail_loc=0x80000115 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:25:34 (1713281134) 
shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:25:48 (1713281148) targets are mounted 11:25:48 (1713281148) facet_failover done /mnt/lustre/d53g.replay-single-1/f has type file OK /mnt/lustre/d53g.replay-single-2/f has type file OK PASS 53g (30s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53h: open request and close reply while two MDC requests in flight ========================================================== 11:26:02 (1713281162) fail_loc=0x80000107 fail_loc=0x8000013b Replay barrier on lustre-MDT0000 Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:26:08 (1713281168) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:26:22 (1713281182) targets are mounted 11:26:22 (1713281182) facet_failover done fail_loc=0 /mnt/lustre/d53h.replay-single-1/f has type file OK /mnt/lustre/d53h.replay-single-2/f has type file OK PASS 53h (31s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 55: let MDS_CHECK_RESENT return the original return code instead of 0 ========================================================== 11:26:35 (1713281195) fail_loc=0x8000012b fail_loc=0x0 rm: cannot remove '/mnt/lustre/f55.replay-single': No such file or directory touch: cannot touch '/mnt/lustre/f55.replay-single': No such file or directory PASS 55 
(18s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 56: don't replay a symlink open request (3440) ========================================================== 11:26:54 (1713281214) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3584 2205056 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:26:57 (1713281217) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:27:11 (1713281231) targets are mounted 11:27:11 (1713281231) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 56 (32s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 57: test recovery from llog for setattr op ========================================================== 11:27:28 (1713281248) fail_loc=0x8000012c UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3584 2205056 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:27:32 (1713281252) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov 
lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:27:46 (1713281266) targets are mounted 11:27:46 (1713281266) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec affected facets: mds1 oleg438-server: oleg438-server.virtnet: executing _wait_recovery_complete *.lustre-MDT0000.recovery_status 1475 oleg438-server: *.lustre-MDT0000.recovery_status status: COMPLETE Waiting for orphan cleanup... osp.lustre-OST0000-osc-MDT0000.old_sync_processed osp.lustre-OST0001-osc-MDT0000.old_sync_processed wait 40 secs maximumly for oleg438-server mds-ost sync done. /mnt/lustre/f57.replay-single has type file OK fail_loc=0x0 PASS 57 (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 58a: test recovery from llog for setattr op (test llog_gen_rec) ========================================================== 11:27:57 (1713281277) fail_loc=0x8000012c total: 2500 open/close in 8.62 seconds: 289.86 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 4480 2204032 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:28:11 (1713281291) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: 
oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:28:24 (1713281304) targets are mounted 11:28:24 (1713281304) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0x0 - unlinked 0 (time 1713281323 ; total 0 ; last 0) total: 2500 unlinks in 5 seconds: 500.000000 unlinks/second PASS 58a (53s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 58b: test replay of setxattr op ==== 11:28:52 (1713281332) Starting client: oleg438-client.virtnet: -o user_xattr,flock oleg438-server@tcp:/lustre /mnt/lustre2 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3840 2204800 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:28:56 (1713281336) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:29:09 (1713281349) targets are mounted 11:29:09 (1713281349) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Stopping client oleg438-client.virtnet /mnt/lustre2 (opts:) oleg438-client.virtnet: executing wait_import_state_mount FULL mgc.*.mgs_server_uuid mgc.*.mgs_server_uuid in FULL state after 0 sec PASS 58b (25s) debug_raw_pointers=0 debug_raw_pointers=0 
debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 58c: resend/reconstruct setxattr op ========================================================== 11:29:19 (1713281359) Starting client: oleg438-client.virtnet: -o user_xattr,flock oleg438-server@tcp:/lustre /mnt/lustre2 fail_val=0 fail_loc=0x123 fail_loc=0 fail_loc=0x119 fail_loc=0 Stopping client oleg438-client.virtnet /mnt/lustre2 (opts:) PASS 58c (41s) debug_raw_pointers=0 debug_raw_pointers=0 SKIP: replay-single test_59 skipping ALWAYS excluded test 59 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 60: test llog post recovery init vs llog unlink ========================================================== 11:30:03 (1713281403) total: 200 open/close in 0.48 seconds: 413.78 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3712 2204928 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre - unlinked 0 (time 1713281406 ; total 0 ; last 0) total: 100 unlinks in 1 seconds: 100.000000 unlinks/second Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:30:08 (1713281408) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:30:22 (1713281422) targets are mounted 11:30:22 (1713281422) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec - unlinked 0 (time 1713281426 ; total 0 ; last 0) total: 100 unlinks in 0 seconds: inf unlinks/second PASS 60 
(25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 61a: test race llog recovery vs llog cleanup ========================================================== 11:30:30 (1713281430) total: 800 open/close in 2.25 seconds: 355.58 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3840 2204800 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre - unlinked 0 (time 1713281436 ; total 0 ; last 0) total: 800 unlinks in 2 seconds: 400.000000 unlinks/second fail_val=0 fail_loc=0x80000221 Failing ost1 on oleg438-server Stopping /mnt/lustre-ost1 (opts:) on oleg438-server 11:30:40 (1713281440) shut down Failover ost1 to oleg438-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-OST0000 11:30:54 (1713281454) targets are mounted 11:30:54 (1713281454) facet_failover done Failing ost1 on oleg438-server Stopping /mnt/lustre-ost1 (opts:) on oleg438-server 11:31:05 (1713281465) shut down Failover ost1 to oleg438-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-OST0000 11:31:19 (1713281479) targets are mounted 11:31:19 (1713281479) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) 
osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec fail_val=0 fail_loc=0x0 Can't lstat /mnt/lustre/d61a.replay-single/f61a.replay-single-*: No such file or directory PASS 61a (85s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 61b: test race mds llog sync vs llog cleanup ========================================================== 11:31:57 (1713281517) fail_loc=0x8000013a Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:31:59 (1713281519) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:32:13 (1713281533) targets are mounted 11:32:13 (1713281533) facet_failover done Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:32:25 (1713281545) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:32:38 (1713281558) targets are mounted 11:32:38 (1713281558) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1+0 records in 1+0 records out 4096 bytes (4.1 kB) copied, 0.00505065 s, 811 kB/s PASS 61b (47s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 61c: test race mds llog sync vs llog 
cleanup ========================================================== 11:32:46 (1713281566) fail_val=0 fail_loc=0x80000222 Failing ost1 on oleg438-server Stopping /mnt/lustre-ost1 (opts:) on oleg438-server 11:32:59 (1713281579) shut down Failover ost1 to oleg438-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-OST0000 11:33:13 (1713281593) targets are mounted 11:33:13 (1713281593) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec fail_val=0 fail_loc=0x0 PASS 61c (32s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 61d: error in llog_setup should cleanup the llog context correctly ========================================================== 11:33:21 (1713281601) Stopping /mnt/lustre-mds1 (opts:) on oleg438-server fail_loc=0x80000605 Starting mgs: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: mount.lustre: mount lustre-mdt1/mdt1 at /mnt/lustre-mds1 failed: Operation not supported pdsh@oleg438-client: oleg438-server: ssh exited with exit code 95 Start of lustre-mdt1/mdt1 on mgs failed 95 fail_loc=0 Starting mgs: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 PASS 61d (10s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 62: don't mis-drop resent replay === 11:33:33 
(1713281613) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3712 2204800 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre total: 25 open/close in 0.13 seconds: 190.38 ops/second fail_loc=0x80000707 Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:33:37 (1713281617) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:33:51 (1713281631) targets are mounted 11:33:51 (1713281631) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0 - unlinked 0 (time 1713281652 ; total 0 ; last 0) total: 25 unlinks in 0 seconds: inf unlinks/second PASS 62 (41s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 65a: AT: verify early replies ====== 11:34:16 (1713281656) at_history=8 at_history=8 debug=other fail_val=11000 fail_loc=0x8000050a 00000100:00001000:3.0:1713281686.494935:0:7143:0:(client.c:537:ptlrpc_at_recv_early_reply()) @@@ Early reply #1, new deadline in 35s (25s) req@ffff8800a8829180 x1796503159606592/t0(0) o101->lustre-MDT0000-mdc-ffff8800ab2d3800@192.168.204.138@tcp:12/10 lens 664/66320 e 1 to 0 dl 1713281721 ref 2 fl Rpc:PQr/200/ffffffff rc 0/-1 job:'createmany.0' uid:0 gid:0 portal 12 : cur 36 worst 40 (at 1713280492, 1200s ago) 36 5 5 5 portal 23 : cur 5 worst 10 (at 1713279219, 2473s ago) 10 5 5 10 portal 30 : cur 5 worst 5 (at 1713279183, 2509s ago) 
5 0 0 0 portal 17 : cur 5 worst 5 (at 1713279247, 2445s ago) 5 5 5 5 portal 13 : cur 10 worst 10 (at 1713279583, 2109s ago) 10 10 0 0 portal 12 : cur 11 worst 40 (at 1713280492, 1209s ago) 36 5 5 5 portal 23 : cur 5 worst 10 (at 1713279219, 2482s ago) 10 5 5 10 portal 30 : cur 5 worst 5 (at 1713279183, 2518s ago) 5 0 0 0 portal 17 : cur 5 worst 5 (at 1713279247, 2454s ago) 5 5 5 5 portal 13 : cur 10 worst 10 (at 1713279583, 2118s ago) 10 10 0 0 PASS 65a (46s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 65b: AT: verify early replies on packed reply / bulk ========================================================== 11:35:05 (1713281705) at_history=8 at_history=8 debug=other trace fail_val=11 fail_loc=0x224 fail_loc=0 00000100:00001000:3.0:1713281735.039114:0:1914:0:(client.c:537:ptlrpc_at_recv_early_reply()) @@@ Early reply #1, new deadline in 35s (25s) req@ffff8800a8ad9c00 x1796503159616064/t0(0) o4->lustre-OST0000-osc-ffff8800ab2d3800@192.168.204.138@tcp:6/4 lens 4584/448 e 1 to 0 dl 1713281770 ref 2 fl Rpc:Qr/200/ffffffff rc 0/-1 job:'multiop.0' uid:0 gid:0 debug=super ioctl neterror warning dlmtrace error emerg ha rpctrace vfstrace config console lfsck debug=super ioctl neterror warning dlmtrace error emerg ha rpctrace vfstrace config console lfsck portal 28 : cur 5 worst 5 (at 1713279179, 2561s ago) 5 5 5 5 portal 7 : cur 5 worst 5 (at 1713279180, 2560s ago) 5 5 5 5 portal 17 : cur 5 worst 5 (at 1713279224, 2516s ago) 5 0 5 5 portal 6 : cur 36 worst 36 (at 1713281740, 0s ago) 36 5 0 0 PASS 65b (38s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 66a: AT: verify MDT service time adjusts with no early replies ========================================================== 11:35:44 (1713281744) at_history=8 at_history=8 portal 12 : cur 5 worst 40 (at 1713280492, 1275s ago) 5 36 5 5 fail_val=5000 fail_loc=0x8000050a portal 12 : cur 5 worst 40 (at 
1713280492, 1281s ago) 5 36 5 5 fail_val=10000 fail_loc=0x8000050a portal 12 : cur 36 worst 40 (at 1713280492, 1292s ago) 36 36 5 5 fail_loc=0 portal 12 : cur 5 worst 40 (at 1713280492, 1301s ago) 36 36 5 5 Current MDT timeout 5, worst 40 PASS 66a (50s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 66b: AT: verify net latency adjusts ========================================================== 11:36:36 (1713281796) at_history=8 at_history=8 fail_val=10 fail_loc=0x50c fail_loc=0 network timeout orig 5, cur 10, worst 10 PASS 66b (84s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 67a: AT: verify slow request processing doesn't induce reconnects ========================================================== 11:38:02 (1713281882) at_history=8 at_history=8 fail_val=400 fail_loc=0x50a fail_loc=0 0 osc reconnect attempts on gradual slow PASS 67a (75s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 67b: AT: verify instant slowdown doesn't induce reconnects ========================================================== 11:39:19 (1713281959) at_history=8 at_history=8 Creating to objid 5441 on ost lustre-OST0000... 
fail_val=20000 fail_loc=0x80000223 total: 17 open/close in 0.09 seconds: 197.21 ops/second Connected clients: oleg438-client.virtnet oleg438-client.virtnet service : cur 5 worst 5 (at 1713278952, 3031s ago) 1 0 1 0 phase 2 1 osc reconnect attempts on instant slow fail_loc=0x80000223 fail_loc=0 Connected clients: oleg438-client.virtnet oleg438-client.virtnet service : cur 5 worst 5 (at 1713278952, 3033s ago) 1 1 0 1 0 osc reconnect attempts on 2nd slow PASS 67b (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 68: AT: verify slowing locks ======= 11:39:48 (1713281988) at_history=8 at_history=8 /home/green/git/lustre-release/lustre/tests/replay-single.sh: line 1968: $ldlm_enqueue_min: ambiguous redirect fail_val=19 fail_loc=0x80000312 fail_val=25 fail_loc=0x80000312 fail_loc=0 /home/green/git/lustre-release/lustre/tests/replay-single.sh: line 1983: $ldlm_enqueue_min: ambiguous redirect PASS 68 (70s) debug_raw_pointers=0 debug_raw_pointers=0 Cleaning up AT ... 
debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70a: check multi client t-f ======== 11:41:01 (1713282061) SKIP: replay-single test_70a Need two or more clients, have 1 SKIP 70a (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70b: dbench 1mdts recovery; 1 clients ========================================================== 11:41:04 (1713282064) Starting client oleg438-client.virtnet: -o user_xattr,flock oleg438-server@tcp:/lustre /mnt/lustre Started clients oleg438-client.virtnet: 192.168.204.138@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) + MISSING_DBENCH_OK= + PATH=/opt/iozone/bin:/opt/iozone/bin:/home/green/git/lustre-release/lustre/tests/mpi:/home/green/git/lustre-release/lustre/tests/racer:/home/green/git/lustre-release/lustre/../lustre-iokit/sgpdd-survey:/home/green/git/lustre-release/lustre/tests:/home/green/git/lustre-release/lustre/utils/gss:/home/green/git/lustre-release/lustre/utils:/opt/iozone/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin::/home/green/git/lustre-release/lustre/utils:/home/green/git/lustre-release/lustre/tests:/sbin:/usr/sbin:/home/green/git/lustre-release/lustre/utils:/home/green/git/lustre-release/lustre/tests/: + DBENCH_LIB= + TESTSUITE=replay-single + TESTNAME=test_70b + MOUNT=/mnt/lustre ++ hostname + DIR=/mnt/lustre/d70b.replay-single/oleg438-client.virtnet + LCTL=/home/green/git/lustre-release/lustre/utils/lctl + rundbench 1 -t 300 dbench: no process found Started rundbench load pid=14203 ... 
UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3968 2204672 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3678208 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3717120 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7395328 1% /mnt/lustre test_70b fail mds1 1 times Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:41:11 (1713282071) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:41:25 (1713282085) targets are mounted 11:41:25 (1713282085) facet_failover done oleg438-client.virtnet: looking for dbench program oleg438-client.virtnet: /usr/bin/dbench oleg438-client.virtnet: creating output directory /mnt/lustre/d70b.replay-single/oleg438-client.virtnet oleg438-client.virtnet: mkdir: created directory '/mnt/lustre/d70b.replay-single' oleg438-client.virtnet: mkdir: created directory '/mnt/lustre/d70b.replay-single/oleg438-client.virtnet' oleg438-client.virtnet: found dbench client file /usr/share/dbench/client.txt oleg438-client.virtnet: '/usr/share/dbench/client.txt' -> 'client.txt' oleg438-client.virtnet: running 'dbench 1 -t 300' on /mnt/lustre/d70b.replay-single/oleg438-client.virtnet at Tue Apr 16 11:41:05 EDT 2024 oleg438-client.virtnet: waiting for dbench pid 14227 oleg438-client.virtnet: dbench version 4.00 - Copyright Andrew Tridgell 1999-2004 oleg438-client.virtnet: oleg438-client.virtnet: Running for 300 seconds with load 'client.txt' and minimum warmup 60 secs oleg438-client.virtnet: failed to create barrier semaphore oleg438-client.virtnet: 0 of 1 processes prepared for launch 0 sec oleg438-client.virtnet: 1 of 1 processes prepared for launch 0 sec oleg438-client.virtnet: 
releasing clients oleg438-client.virtnet: 1 381 13.56 MB/sec warmup 1 sec latency 40.370 ms oleg438-client.virtnet: 1 721 10.65 MB/sec warmup 2 sec latency 98.598 ms oleg438-client.virtnet: 1 940 7.26 MB/sec warmup 3 sec latency 46.531 ms oleg438-client.virtnet: 1 1273 5.79 MB/sec warmup 4 sec latency 44.714 ms oleg438-client.virtnet: 1 1660 5.24 MB/sec warmup 5 sec latency 68.743 ms oleg438-client.virtnet: 1 1841 4.39 MB/sec warmup 6 sec latency 531.298 ms oleg438-client.virtnet: 1 1841 3.76 MB/sec warmup 7 sec latency 1531.572 ms oleg438-client.virtnet: 1 1841 3.29 MB/sec warmup 8 sec latency 2531.836 ms oleg438-client.virtnet: 1 1841 2.93 MB/sec warmup 9 sec latency 3532.122 ms oleg438-client.virtnet: 1 1841 2.64 MB/sec warmup 10 sec latency 4532.365 ms oleg438-client.virtnet: 1 1841 2.40 MB/sec warmup 11 sec latency 5532.610 ms oleg438-client.virtnet: 1 1841 2.20 MB/sec warmup 12 sec latency 6532.825 ms oleg438-client.virtnet: 1 1841 2.03 MB/sec warmup 13 sec latency 7533.059 ms oleg438-client.virtnet: 1 1841 1.88 MB/sec warmup 14 sec latency 8533.278 ms oleg438-client.virtnet: 1 1841 1.76 MB/sec warmup 15 sec latency 9533.446 ms oleg438-client.virtnet: 1 1841 1.65 MB/sec warmup 16 sec latency 10533.652 ms oleg438-client.virtnet: 1 1841 1.55 MB/sec warmup 17 sec latency 11533.904 ms oleg438-client.virtnet: 1 1841 1.46 MB/sec warmup 18 sec latency 12534.196 ms oleg438-client.virtnet: 1 1841 1.39 MB/sec warmup 19 sec latency 13534.471 ms oleg438-client.virtnet: 1 1841 1.32 MB/sec warmup 20 sec latency 14534.747 ms oleg438-client.virtnet: 1 1841 1.25 MB/sec warmup 21 sec latency 15535.027 ms oleg438-client.virtnet: 1 1841 1.20 MB/sec warmup 22 sec latency 16535.271 ms oleg438-client.virtnet: 1 1841 1.15 MB/sec warmup 23 sec latency 17535.516 ms oleg438-client.virtnet: 1 1841 1.10 MB/sec warmup 24 sec latency 18535.743 ms oleg438-client.virtnet: 1 1841 1.05 MB/sec warmup 25 sec latency 19535.994 ms oleg438-client.virtnet: 1 1841 1.01 MB/sec warmup 26 sec latency 
20536.237 ms oleg438-client.virtnet: 1 1841 0.98 MB/sec warmup 27 sec latency 21536.456 ms oleg438-client.virtnet: 1 1841 0.94 MB/sec warmup 28 sec latency 22536.734 ms oleg438-client.virtnet: 1 1841 0.91 MB/sec warmup 29 sec latency 23537.027 ms oleg438-client.virtnet: 1 1841 0.88 MB/sec warmup 30 sec latency 24537.305 ms oleg438-client.virtnet: 1 1841 0.85 MB/sec warmup 31 sec latency 25537.526 ms oleg438-client.virtnet: 1 1841 0.82 MB/sec warmup 32 sec oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3968 2204544 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 40960 3720192 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 14336 3732480 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 55296 7452672 1% /mnt/lustre test_70b fail mds1 2 times Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:41:46 (1713282106) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:41:59 (1713282119) targets are mounted 11:41:59 (1713282119) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3968 2204672 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 40960 3701760 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 14336 3729408 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 55296 7431168 1% /mnt/lustre test_70b fail mds1 3 times 
Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:42:11 (1713282131) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all latency 26537.794 ms oleg438-client.virtnet: 1 1841 0.80 MB/sec warmup 33 sec latency 27538.032 ms oleg438-client.virtnet: 1 2020 0.78 MB/sec warmup 34 sec latency 28229.177 ms oleg438-client.virtnet: 1 2484 0.82 MB/sec warmup 35 sec latency 49.544 ms oleg438-client.virtnet: 1 3348 0.98 MB/sec warmup 36 sec latency 33.465 ms oleg438-client.virtnet: 1 3975 1.08 MB/sec warmup 37 sec latency 12.966 ms oleg438-client.virtnet: 1 4297 1.06 MB/sec warmup 38 sec latency 44.615 ms oleg438-client.virtnet: 1 4605 1.08 MB/sec warmup 39 sec latency 62.826 ms oleg438-client.virtnet: 1 5031 1.13 MB/sec warmup 40 sec latency 16.139 ms oleg438-client.virtnet: 1 5251 1.11 MB/sec warmup 41 sec latency 458.449 ms oleg438-client.virtnet: 1 5251 1.08 MB/sec warmup 42 sec latency 1458.706 ms oleg438-client.virtnet: 1 5251 1.05 MB/sec warmup 43 sec latency 2458.979 ms oleg438-client.virtnet: 1 5251 1.03 MB/sec warmup 44 sec latency 3459.236 ms oleg438-client.virtnet: 1 5251 1.01 MB/sec warmup 45 sec latency 4459.506 ms oleg438-client.virtnet: 1 5251 0.99 MB/sec warmup 46 sec latency 5459.760 ms oleg438-client.virtnet: 1 5251 0.96 MB/sec warmup 47 sec latency 6459.986 ms oleg438-client.virtnet: 1 5251 0.94 MB/sec warmup 48 sec latency 7460.268 ms oleg438-client.virtnet: 1 5251 0.93 MB/sec warmup 49 sec latency 8460.437 ms oleg438-client.virtnet: 1 5251 0.91 MB/sec warmup 50 sec latency 9460.642 ms oleg438-client.virtnet: 1 5251 0.89 MB/sec warmup 51 sec latency 10460.816 ms oleg438-client.virtnet: 1 5251 0.87 MB/sec warmup 52 sec latency 11461.017 ms oleg438-client.virtnet: 1 5251 0.86 MB/sec warmup 53 sec latency 12461.187 ms 
oleg438-client.virtnet: 1 5251 0.84 MB/sec warmup 54 sec latency 13461.294 ms oleg438-client.virtnet: 1 5251 0.82 MB/sec warmup 55 sec latency 14461.391 ms oleg438-client.virtnet: 1 5251 0.81 MB/sec warmup 56 sec latency 15461.507 ms oleg438-client.virtnet: 1 5251 0.80 MB/sec warmup 57 sec latency 16461.626 ms oleg438-client.virtnet: 1 5251 0.78 MB/sec warmup 58 sec latency 17461.877 ms oleg438-client.virtnet: 1 5278 0.77 MB/sec warmup 59 sec latency 18317.479 ms oleg438-client.virtnet: 1 6851 7.46 MB/sec execute 1 sec latency 24.807 ms oleg438-client.virtnet: 1 7529 6.09 MB/sec execute 2 sec latency 11.575 ms oleg438-client.virtnet: 1 8015 4.30 MB/sec execute 3 sec latency 31.747 ms oleg438-client.virtnet: 1 8614 4.32 MB/sec execute 4 sec latency 26.703 ms oleg438-client.virtnet: 1 9115 3.55 MB/sec execute 5 sec latency 120.579 ms oleg438-client.virtnet: 1 9115 2.96 MB/sec execute 6 sec latency 1120.730 ms oleg438-client.virtnet: 1 9115 2.53 MB/sec execute 7 sec latency 2120.909 ms oleg438-client.virtnet: 1 9115 2.22 MB/sec execute 8 sec latency 3121.030 ms oleg438-client.virtnet: 1 9115 1.97 MB/sec execute 9 sec latency 4121.194 ms oleg438-client.virtnet: 1 9115 1.77 MB/sec execute 10 sec latency 5121.370 ms oleg438-client.virtnet: 1 9115 1.61 MB/sec execute 11 sec latency 6121.560 ms oleg438-client.virtnet: 1 9115 1.48 MB/sec execute 12 sec latency 7121.760 ms oleg438-client.virtnet: 1 9115 1.36 MB/sec execute 13 sec latency 8121.891 ms oleg438-client.virtnet: 1 9115 1.27 MB/sec execute 14 sec latency 9122.051 ms oleg438-client.virtnet: 1 9115 1.18 MB/sec execute 15 sec latency 10122.329 ms oleg438-client.virtnet: 1 9115 1.11 MB/sec execute 16 sec latency 11122.526 ms oleg438-client.virtnet: 1 9115 1.04 MB/sec execute 17 sec latency 12122.666 ms oleg438-client.vipdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:42:23 (1713282143) targets are mounted 11:42:23 (1713282143) facet_failover done oleg438-client.virtnet: 
executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3968 2204672 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 40960 3704832 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 14336 3725312 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 55296 7430144 1% /mnt/lustre test_70b fail mds1 4 times Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:42:36 (1713282156) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:42:49 (1713282169) targets are mounted 11:42:49 (1713282169) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3968 2204672 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 40960 3710976 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 14336 3730432 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 55296 7441408 1% /mnt/lustre test_70b fail mds1 5 times Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:43:01 (1713282181) shut down rtnet: 1 9115 0.99 MB/sec execute 18 sec latency 13122.774 ms oleg438-client.virtnet: 1 9115 0.93 MB/sec execute 19 sec latency 14122.905 ms oleg438-client.virtnet: 1 9115 0.89 MB/sec execute 20 sec latency 15123.167 ms oleg438-client.virtnet: 1 9115 0.84 MB/sec execute 21 sec latency 16123.359 ms oleg438-client.virtnet: 1 9115 0.81 MB/sec execute 22 sec latency 
17123.556 ms oleg438-client.virtnet: 1 9115 0.77 MB/sec execute 23 sec latency 18123.728 ms oleg438-client.virtnet: 1 9148 0.74 MB/sec execute 24 sec latency 18981.227 ms oleg438-client.virtnet: 1 9606 0.80 MB/sec execute 25 sec latency 38.947 ms oleg438-client.virtnet: 1 10396 1.02 MB/sec execute 26 sec latency 43.765 ms oleg438-client.virtnet: 1 10913 1.15 MB/sec execute 27 sec latency 23.451 ms oleg438-client.virtnet: 1 11277 1.12 MB/sec execute 28 sec latency 15.664 ms oleg438-client.virtnet: 1 11572 1.11 MB/sec execute 29 sec latency 47.468 ms oleg438-client.virtnet: 1 12103 1.21 MB/sec execute 30 sec latency 29.443 ms oleg438-client.virtnet: 1 12260 1.18 MB/sec execute 31 sec latency 572.532 ms oleg438-client.virtnet: 1 12260 1.14 MB/sec execute 32 sec latency 1572.667 ms oleg438-client.virtnet: 1 12260 1.11 MB/sec execute 33 sec latency 2572.886 ms oleg438-client.virtnet: 1 12260 1.07 MB/sec execute 34 sec latency 3573.117 ms oleg438-client.virtnet: 1 12260 1.04 MB/sec execute 35 sec latency 4573.282 ms oleg438-client.virtnet: 1 12260 1.01 MB/sec execute 36 sec latency 5573.461 ms oleg438-client.virtnet: 1 12260 0.99 MB/sec execute 37 sec latency 6573.641 ms oleg438-client.virtnet: 1 12260 0.96 MB/sec execute 38 sec latency 7573.754 ms oleg438-client.virtnet: 1 12260 0.94 MB/sec execute 39 sec latency 8573.903 ms oleg438-client.virtnet: 1 12260 0.91 MB/sec execute 40 sec latency 9574.092 ms oleg438-client.virtnet: 1 12260 0.89 MB/sec execute 41 sec latency 10574.220 ms oleg438-client.virtnet: 1 12260 0.87 MB/sec execute 42 sec latency 11574.362 ms oleg438-client.virtnet: 1 12260 0.85 MB/sec execute 43 sec latency 12574.486 ms oleg438-client.virtnet: 1 12260 0.83 MB/sec execute 44 sec latency 13574.689 ms oleg438-client.virtnet: 1 12260 0.81 MB/sec execute 45 sec latency 14574.868 ms oleg438-client.virtnet: 1 12260 0.79 MB/sec execute 46 sec latency 15575.097 ms oleg438-client.virtnet: 1 12260 0.78 MB/sec execute 47 sec latency 16575.336 ms 
oleg438-client.virtnet: 1 12260 0.76 MB/sec execute 48 sec latency 17575.556 ms oleg438-client.virtnet: 1 12287 0.75 MB/sec execute 49 sec latency 18478.609 ms oleg438-client.virtnet: 1 12711 0.74 MB/sec execute 50 sec latency 37.114 ms oleg438-client.virtnet: 1 13150 0.77 MB/sec execute 51 sec latency 41.770 ms oleg438-client.virtnet: 1 13927 0.88 MB/sec execute 52 sec latency 46.816 ms oleg438-client.virtnet: 1 14459 0.95 MB/sec execute 53 sec latency 24.244 ms oleg438-client.virtnet: 1 14759 0.93 MB/sec execute 54 sec latency 14.131 ms oleg438-client.virtnet: 1 15079 0.93 MB/sec execute 55 sec latency 60.000 ms oleg438-client.virtnet: 1 15188 0.92 MB/sec execute 56 sec latency 524.802 ms oleg438-client.virtnet: 1 15188 0.90 MB/sec execute 57 sec latency 1524.977 ms oleg438-client.virtnet: 1 15188 0.89 MB/sec execute 58 sec latency 2525.119 ms oleg438-client.virtnet: 1 15188 0.87 MB/sec execute 59 sec latency 3525.304 ms oleg438-client.virtnet: 1 15188 0.86 MB/sec execute 60 sec latency 4525.524 ms oleg438-client.virtnet: 1 15188 0.84 MB/sec execute 61 sec latency 5525.707 ms oleg438-client.virtnet: 1 Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:43:15 (1713282195) targets are mounted 11:43:15 (1713282195) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3968 2204544 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 41984 3706880 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 13312 3742720 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 55296 7449600 
1% /mnt/lustre test_70b fail mds1 6 times Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:43:26 (1713282206) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:43:38 (1713282218) targets are mounted 11:43:38 (1713282218) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3968 2204672 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 41984 3706880 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 13312 3740672 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 55296 7447552 1% /mnt/lustre test_70b fail mds1 7 times Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 15188 0.83 MB/sec execute 62 sec latency 6525.908 ms oleg438-client.virtnet: 1 15188 0.82 MB/sec execute 63 sec latency 7526.041 ms oleg438-client.virtnet: 1 15188 0.80 MB/sec execute 64 sec latency 8526.204 ms oleg438-client.virtnet: 1 15188 0.79 MB/sec execute 65 sec latency 9526.501 ms oleg438-client.virtnet: 1 15188 0.78 MB/sec execute 66 sec latency 10526.726 ms oleg438-client.virtnet: 1 15188 0.77 MB/sec execute 67 sec latency 11526.858 ms oleg438-client.virtnet: 1 15188 0.76 MB/sec execute 68 sec latency 12526.996 ms oleg438-client.virtnet: 1 15188 0.74 MB/sec execute 69 sec latency 13527.169 ms oleg438-client.virtnet: 1 15188 0.73 MB/sec execute 70 sec latency 14527.334 ms oleg438-client.virtnet: 1 15188 0.72 MB/sec execute 71 sec latency 15527.553 ms oleg438-client.virtnet: 1 15188 0.71 MB/sec execute 72 sec latency 
16527.741 ms oleg438-client.virtnet: 1 15188 0.70 MB/sec execute 73 sec latency 17527.873 ms oleg438-client.virtnet: 1 15188 0.69 MB/sec execute 74 sec latency 18527.969 ms oleg438-client.virtnet: 1 15670 0.74 MB/sec execute 75 sec latency 18590.514 ms oleg438-client.virtnet: 1 16056 0.73 MB/sec execute 76 sec latency 56.791 ms oleg438-client.virtnet: 1 16549 0.74 MB/sec execute 77 sec latency 41.514 ms oleg438-client.virtnet: 1 17463 0.83 MB/sec execute 78 sec latency 29.001 ms oleg438-client.virtnet: 1 18141 0.88 MB/sec execute 79 sec latency 12.902 ms oleg438-client.virtnet: 1 18641 0.88 MB/sec execute 80 sec latency 35.176 ms oleg438-client.virtnet: 1 18657 0.87 MB/sec execute 81 sec latency 966.205 ms oleg438-client.virtnet: 1 18657 0.86 MB/sec execute 82 sec latency 1966.374 ms oleg438-client.virtnet: 1 18657 0.85 MB/sec execute 83 sec latency 2966.569 ms oleg438-client.virtnet: 1 18657 0.84 MB/sec execute 84 sec latency 3966.734 ms oleg438-client.virtnet: 1 18657 0.83 MB/sec execute 85 sec latency 4966.878 ms oleg438-client.virtnet: 1 18657 0.82 MB/sec execute 86 sec latency 5967.048 ms oleg438-client.virtnet: 1 18657 0.81 MB/sec execute 87 sec latency 6967.224 ms oleg438-client.virtnet: 1 18657 0.80 MB/sec execute 88 sec latency 7967.429 ms oleg438-client.virtnet: 1 18657 0.79 MB/sec execute 89 sec latency 8967.591 ms oleg438-client.virtnet: 1 18657 0.78 MB/sec execute 90 sec latency 9967.713 ms oleg438-client.virtnet: 1 18657 0.77 MB/sec execute 91 sec latency 10967.886 ms oleg438-client.virtnet: 1 18657 0.76 MB/sec execute 92 sec latency 11968.018 ms oleg438-client.virtnet: 1 18657 0.76 MB/sec execute 93 sec latency 12968.161 ms oleg438-client.virtnet: 1 18657 0.75 MB/sec execute 94 sec latency 13968.275 ms oleg438-client.virtnet: 1 18657 0.74 MB/sec execute 95 sec latency 14968.413 ms oleg438-client.virtnet: 1 18657 0.73 MB/sec execute 96 sec latency 15968.553 ms oleg438-client.virtnet: 1 18657 0.72 MB/sec execute 97 sec latency 16968.691 ms 
oleg438-client.virtnet: 1 18657 0.72 MB/sec execute 98 sec latency 17968.808 ms oleg438-client.virtnet: 1 18657 0.71 MB/sec execute 99 sec latency 18968.927 ms oleg438-client.virtnet: 1 19156 0.75 MB/sec execute 100 sec latency 19108.136 ms oleg438-client.virtnet: 1 19545 0.74 MB/sec execute 101 sec latency 51.384 ms oleg438-client.virtnet: 1 20080 0.75 MB/sec execute 102 sec latency 36.989 ms oleg438-client.virtnet: 1 20702 0.80 MB/sec execute 103 sec latency 28.558 ms oleg438-client.virtnet: 1 21409 0.84 MB/sec execute 104 sec latency 24.772 ms oleg438-client.virtnet: 1 21779 0.84 MB/sec execute 105 sec latency 13.403 ms oleg438-client.virtnet: 1 22011:43:51 (1713282231) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:44:04 (1713282244) targets are mounted 11:44:04 (1713282244) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3840 2204672 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 40960 3709952 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 13312 3743744 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 54272 7453696 1% /mnt/lustre test_70b fail mds1 8 times Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:44:18 (1713282258) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: 
oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:44:31 (1713282271) targets are mounted 11:44:31 (1713282271) facet_failover done 11 0.84 MB/sec execute 106 sec latency 458.348 ms oleg438-client.virtnet: 1 22011 0.83 MB/sec execute 107 sec latency 1458.500 ms oleg438-client.virtnet: 1 22011 0.82 MB/sec execute 108 sec latency 2458.650 ms oleg438-client.virtnet: 1 22011 0.82 MB/sec execute 109 sec latency 3458.775 ms oleg438-client.virtnet: 1 22011 0.81 MB/sec execute 110 sec latency 4458.930 ms oleg438-client.virtnet: 1 22011 0.80 MB/sec execute 111 sec latency 5459.095 ms oleg438-client.virtnet: 1 22011 0.79 MB/sec execute 112 sec latency 6459.235 ms oleg438-client.virtnet: 1 22011 0.79 MB/sec execute 113 sec latency 7459.367 ms oleg438-client.virtnet: 1 22011 0.78 MB/sec execute 114 sec latency 8459.551 ms oleg438-client.virtnet: 1 22011 0.77 MB/sec execute 115 sec latency 9459.709 ms oleg438-client.virtnet: 1 22011 0.77 MB/sec execute 116 sec latency 10459.902 ms oleg438-client.virtnet: 1 22011 0.76 MB/sec execute 117 sec latency 11460.042 ms oleg438-client.virtnet: 1 22011 0.75 MB/sec execute 118 sec latency 12460.192 ms oleg438-client.virtnet: 1 22011 0.75 MB/sec execute 119 sec latency 13460.326 ms oleg438-client.virtnet: 1 22319 0.75 MB/sec execute 120 sec latency 13509.177 ms oleg438-client.virtnet: 1 23017 0.78 MB/sec execute 121 sec latency 32.416 ms oleg438-client.virtnet: 1 23527 0.77 MB/sec execute 122 sec latency 49.780 ms oleg438-client.virtnet: 1 24225 0.82 MB/sec execute 123 sec latency 31.816 ms oleg438-client.virtnet: 1 25004 0.86 MB/sec execute 124 sec latency 23.655 ms oleg438-client.virtnet: 1 25536 0.86 MB/sec execute 125 sec latency 11.835 ms oleg438-client.virtnet: 1 25879 0.86 MB/sec execute 126 sec latency 38.733 ms oleg438-client.virtnet: 1 25879 0.86 MB/sec execute 127 sec latency 1028.860 ms oleg438-client.virtnet: 1 25879 0.85 MB/sec execute 128 sec latency 2028.975 ms oleg438-client.virtnet: 1 25879 0.84 MB/sec 
execute 129 sec latency 3029.094 ms oleg438-client.virtnet: 1 25879 0.84 MB/sec execute 130 sec latency 4029.215 ms oleg438-client.virtnet: 1 25879 0.83 MB/sec execute 131 sec latency 5029.334 ms oleg438-client.virtnet: 1 25879 0.82 MB/sec execute 132 sec latency 6029.443 ms oleg438-client.virtnet: 1 25879 0.82 MB/sec execute 133 sec latency 7029.658 ms oleg438-client.virtnet: 1 25879 0.81 MB/sec execute 134 sec latency 8029.859 ms oleg438-client.virtnet: 1 25879 0.81 MB/sec execute 135 sec latency 9029.998 ms oleg438-client.virtnet: 1 25879 0.80 MB/sec execute 136 sec latency 10030.173 ms oleg438-client.virtnet: 1 25879 0.79 MB/sec execute 137 sec latency 11030.377 ms oleg438-client.virtnet: 1 25879 0.79 MB/sec execute 138 sec latency 12030.556 ms oleg438-client.virtnet: 1 25879 0.78 MB/sec execute 139 sec latency 13030.671 ms oleg438-client.virtnet: 1 25879 0.78 MB/sec execute 140 sec latency 14030.851 ms oleg438-client.virtnet: 1 25879 0.77 MB/sec execute 141 sec latency 15030.997 ms oleg438-client.virtnet: 1 25879 0.77 MB/sec execute 142 sec latency 16031.176 ms oleg438-client.virtnet: 1 25879 0.76 MB/sec execute 143 sec latency 17031.314 ms oleg438-client.virtnet: 1 25879 0.76 MB/sec execute 144 sec latency 18031.423 ms oleg438-client.virtnet: 1 25879 0.75 MB/sec execute 145 sec latency 19031.547 ms oleg438-client.virtnet: 1 25879 0.75 MB/sec execute 146 sec latency 20031.667 ms oleg438-client.virtnet: 1 25879 0.74 MB/sec execute 147 sec latency 21031.785 ms oleg438-client.virtnet: 1 25879 0.74 MB/sec execute 148 sec latency 22031.907 ms oleg438-client.virtnet: 1 25879 0.73 MB/sec execute 149 sec latency 23032.051 ms oleg438-client.virtnet: 1 2oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3968 2204672 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 40960 
3710976 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 14336 3730432 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 55296 7441408 1% /mnt/lustre test_70b fail mds1 9 times Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:44:47 (1713282287) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:44:59 (1713282299) targets are mounted 11:44:59 (1713282299) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3968 2204672 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 40960 3724288 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 13312 3752960 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 54272 7477248 1% /mnt/lustre test_70b fail mds1 10 times Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:45:19 (1713282319) shut down 5879 0.73 MB/sec execute 150 sec latency 24032.206 ms oleg438-client.virtnet: 1 25879 0.72 MB/sec execute 151 sec latency 25032.329 ms oleg438-client.virtnet: 1 25879 0.72 MB/sec execute 152 sec latency 26032.459 ms oleg438-client.virtnet: 1 25879 0.71 MB/sec execute 153 sec latency 27032.589 ms oleg438-client.virtnet: 1 25879 0.71 MB/sec execute 154 sec latency 28032.777 ms oleg438-client.virtnet: 1 26381 0.73 MB/sec execute 155 sec latency 28166.068 ms oleg438-client.virtnet: 1 26823 0.73 MB/sec execute 156 sec latency 39.585 ms oleg438-client.virtnet: 1 27254 0.73 MB/sec execute 157 sec latency 50.504 ms oleg438-client.virtnet: 1 27793 0.76 MB/sec execute 
158 sec latency 51.058 ms oleg438-client.virtnet: 1 28488 0.79 MB/sec execute 159 sec latency 42.562 ms oleg438-client.virtnet: 1 28813 0.79 MB/sec execute 160 sec latency 14.658 ms oleg438-client.virtnet: 1 29161 0.79 MB/sec execute 161 sec latency 74.378 ms oleg438-client.virtnet: 1 29161 0.79 MB/sec execute 162 sec latency 1074.528 ms oleg438-client.virtnet: 1 29161 0.78 MB/sec execute 163 sec latency 2074.680 ms oleg438-client.virtnet: 1 29161 0.78 MB/sec execute 164 sec latency 3074.893 ms oleg438-client.virtnet: 1 29161 0.77 MB/sec execute 165 sec latency 4075.119 ms oleg438-client.virtnet: 1 29161 0.77 MB/sec execute 166 sec latency 5075.342 ms oleg438-client.virtnet: 1 29161 0.76 MB/sec execute 167 sec latency 6075.530 ms oleg438-client.virtnet: 1 29161 0.76 MB/sec execute 168 sec latency 7075.673 ms oleg438-client.virtnet: 1 29161 0.75 MB/sec execute 169 sec latency 8075.802 ms oleg438-client.virtnet: 1 29161 0.75 MB/sec execute 170 sec latency 9075.986 ms oleg438-client.virtnet: 1 29161 0.74 MB/sec execute 171 sec latency 10076.156 ms oleg438-client.virtnet: 1 29161 0.74 MB/sec execute 172 sec latency 11076.371 ms oleg438-client.virtnet: 1 29161 0.74 MB/sec execute 173 sec latency 12076.532 ms oleg438-client.virtnet: 1 29161 0.73 MB/sec execute 174 sec latency 13076.654 ms oleg438-client.virtnet: 1 29161 0.73 MB/sec execute 175 sec latency 14076.776 ms oleg438-client.virtnet: 1 29161 0.72 MB/sec execute 176 sec latency 15076.918 ms oleg438-client.virtnet: 1 29161 0.72 MB/sec execute 177 sec latency 16077.109 ms oleg438-client.virtnet: 1 29161 0.72 MB/sec execute 178 sec latency 17077.373 ms oleg438-client.virtnet: 1 29161 0.71 MB/sec execute 179 sec latency 18077.529 ms oleg438-client.virtnet: 1 29315 0.71 MB/sec execute 180 sec latency 18409.676 ms oleg438-client.virtnet: 1 29646 0.71 MB/sec execute 181 sec latency 48.295 ms oleg438-client.virtnet: 1 30039 0.73 MB/sec execute 182 sec latency 38.175 ms oleg438-client.virtnet: 1 30354 0.72 MB/sec execute 
183 sec latency 41.261 ms oleg438-client.virtnet: 1 30778 0.73 MB/sec execute 184 sec latency 56.035 ms oleg438-client.virtnet: 1 31185 0.75 MB/sec execute 185 sec latency 55.705 ms oleg438-client.virtnet: 1 31935 0.78 MB/sec execute 186 sec latency 37.960 ms oleg438-client.virtnet: 1 32135 0.77 MB/sec execute 187 sec latency 487.460 ms oleg438-client.virtnet: 1 32135 0.77 MB/sec execute 188 sec latency 1487.609 ms oleg438-client.virtnet: 1 32135 0.77 MB/sec execute 189 sec latency 2487.740 ms oleg438-client.virtnet: 1 32135 0.76 MB/sec execute 190 sec latency 3487.865 ms oleg438-client.virtnet: 1 32135 0.76 MB/sec execute 191 sec latency 4488.057 ms oleg438-client.virtnet: 1 32135 0.75 MB/sec execute 192 sec latency 5488.324 ms oleg438-client.virtnet: 1 32135 0.75 MB/sec execute 193 sec latency 6488.537 ms oleg438-client.virtnet: 1 32135 0.75Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:45:32 (1713282332) targets are mounted 11:45:32 (1713282332) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3968 2204672 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 40960 3687424 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 14336 3747840 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 55296 7435264 1% /mnt/lustre test_70b fail mds1 11 times Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:45:47 (1713282347) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 
/mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:46:00 (1713282360) targets are mounted 11:46:00 (1713282360) facet_failover done MB/sec execute 194 sec latency 7488.753 ms oleg438-client.virtnet: 1 32135 0.74 MB/sec execute 195 sec latency 8488.929 ms oleg438-client.virtnet: 1 32135 0.74 MB/sec execute 196 sec latency 9489.091 ms oleg438-client.virtnet: 1 32135 0.74 MB/sec execute 197 sec latency 10489.279 ms oleg438-client.virtnet: 1 32135 0.73 MB/sec execute 198 sec latency 11489.489 ms oleg438-client.virtnet: 1 32135 0.73 MB/sec execute 199 sec latency 12489.717 ms oleg438-client.virtnet: 1 32135 0.72 MB/sec execute 200 sec latency 13489.911 ms oleg438-client.virtnet: 1 32135 0.72 MB/sec execute 201 sec latency 14490.093 ms oleg438-client.virtnet: 1 32135 0.72 MB/sec execute 202 sec latency 15490.271 ms oleg438-client.virtnet: 1 32135 0.71 MB/sec execute 203 sec latency 16490.487 ms oleg438-client.virtnet: 1 32135 0.71 MB/sec execute 204 sec latency 17490.693 ms oleg438-client.virtnet: 1 32135 0.71 MB/sec execute 205 sec latency 18490.885 ms oleg438-client.virtnet: 1 32135 0.70 MB/sec execute 206 sec latency 19491.089 ms oleg438-client.virtnet: 1 32135 0.70 MB/sec execute 207 sec latency 20491.211 ms oleg438-client.virtnet: 1 32135 0.70 MB/sec execute 208 sec latency 21491.394 ms oleg438-client.virtnet: 1 32135 0.69 MB/sec execute 209 sec latency 22491.569 ms oleg438-client.virtnet: 1 32135 0.69 MB/sec execute 210 sec latency 23491.759 ms oleg438-client.virtnet: 1 32135 0.69 MB/sec execute 211 sec latency 24491.984 ms oleg438-client.virtnet: 1 32135 0.68 MB/sec execute 212 sec latency 25492.176 ms oleg438-client.virtnet: 1 32135 0.68 MB/sec execute 213 sec latency 26492.346 ms oleg438-client.virtnet: 1 32135 0.68 MB/sec execute 214 sec latency 27492.527 ms 
oleg438-client.virtnet: 1 32338 0.68 MB/sec execute 215 sec latency 27754.386 ms oleg438-client.virtnet: 1 32585 0.68 MB/sec execute 216 sec latency 18.375 ms oleg438-client.virtnet: 1 32863 0.68 MB/sec execute 217 sec latency 74.832 ms oleg438-client.virtnet: 1 33452 0.69 MB/sec execute 218 sec latency 21.790 ms oleg438-client.virtnet: 1 33813 0.69 MB/sec execute 219 sec latency 41.970 ms oleg438-client.virtnet: 1 34207 0.69 MB/sec execute 220 sec latency 39.927 ms oleg438-client.virtnet: 1 34541 0.70 MB/sec execute 221 sec latency 50.736 ms oleg438-client.virtnet: 1 34885 0.71 MB/sec execute 222 sec latency 459.815 ms oleg438-client.virtnet: 1 34885 0.71 MB/sec execute 223 sec latency 1459.973 ms oleg438-client.virtnet: 1 34885 0.71 MB/sec execute 224 sec latency 2460.182 ms oleg438-client.virtnet: 1 34885 0.70 MB/sec execute 225 sec latency 3460.412 ms oleg438-client.virtnet: 1 34885 0.70 MB/sec execute 226 sec latency 4460.617 ms oleg438-client.virtnet: 1 34885 0.70 MB/sec execute 227 sec latency 5460.807 ms oleg438-client.virtnet: 1 34885 0.70 MB/sec execute 228 sec latency 6461.000 ms oleg438-client.virtnet: 1 34885 0.69 MB/sec execute 229 sec latency 7461.233 ms oleg438-client.virtnet: 1 34885 0.69 MB/sec execute 230 sec latency 8461.427 ms oleg438-client.virtnet: 1 34885 0.69 MB/sec execute 231 sec latency 9461.632 ms oleg438-client.virtnet: 1 34885 0.68 MB/sec execute 232 sec latency 10461.814 ms oleg438-client.virtnet: 1 34885 0.68 MB/sec execute 233 sec latency 11461.934 ms oleg438-client.virtnet: 1 34885 0.68 MB/sec execute 234 sec latency 12462.118 ms oleg438-client.virtnet: 1 34885 0.67 MB/sec execute 235 sec latency 13462.324 ms oleg438-client.virtnet: 1 34885 0.67 MB/sec execute 236 sec latency 14462.535 ms oleg438-client.virtnet: 1 34885 0.67 MB/sec execute 237 sec latency 15462.713 ms oleg438-client.virtnet: 1 34885 oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid 
mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 0.67 MB/sec execute 238 sec latency 16462.878 ms oleg438-client.virtnet: 1 34885 0.66 MB/sec execute 239 sec latency 17463.072 ms oleg438-client.virtnet: 1 35307 0.67 MB/sec execute 240 sec latency 17680.526 ms oleg438-client.virtnet: 1 35727 0.68 MB/sec execute 241 sec latency 17.314 ms oleg438-client.virtnet: 1 35973 0.68 MB/sec execute 242 sec latency 15.865 ms oleg438-client.virtnet: 1 36299 0.68 MB/sec execute 243 sec latency 63.349 ms oleg438-client.virtnet: 1 36643 0.69 MB/sec execute 244 sec latency 45.509 ms oleg438-client.virtnet: 1 37019 0.69 MB/sec execute 245 sec latency 39.898 ms oleg438-client.virtnet: 1 37354 0.69 MB/sec execute 246 sec latency 43.354 ms oleg438-client.virtnet: 1 38073 0.70 MB/sec execute 247 sec latency 30.739 ms oleg438-client.virtnet: 1 38996 0.74 MB/sec execute 248 sec latency 26.784 ms oleg438-client.virtnet: 1 39324 0.74 MB/sec execute 249 sec latency 15.648 ms oleg438-client.virtnet: 1 39560 0.74 MB/sec execute 250 sec latency 15.870 ms oleg438-client.virtnet: 1 39843 0.74 MB/sec execute 251 sec latency 74.768 ms oleg438-client.virtnet: 1 40127 0.74 MB/sec execute 252 sec latency 47.103 ms oleg438-client.virtnet: 1 40709 0.75 MB/sec execute 253 sec latency 26.540 ms oleg438-client.virtnet: 1 41123 0.75 MB/sec execute 254 sec latency 45.093 ms oleg438-client.virtnet: 1 41476 0.75 MB/sec execute 255 sec latency 58.347 ms oleg438-client.virtnet: 1 41985 0.77 MB/sec execute 256 sec latency 45.140 ms oleg438-client.virtnet: 1 42590 0.78 MB/sec execute 257 sec latency 60.233 ms oleg438-client.virtnet: 1 42939 0.79 MB/sec execute 258 sec latency 14.758 ms oleg438-client.virtnet: 1 43262 0.79 MB/sec execute 259 sec latency 16.005 ms oleg438-client.virtnet: 1 43459 0.78 MB/sec execute 260 sec latency 60.729 ms oleg438-client.virtnet: 1 43759 0.79 MB/sec execute 261 sec latency 46.165 ms oleg438-client.virtnet: 1 44173 0.80 MB/sec execute 262 sec latency 41.662 ms 
oleg438-client.virtnet: 1 44872 0.80 MB/sec execute 263 sec latency 24.986 ms oleg438-client.virtnet: 1 45498 0.82 MB/sec execute 264 sec latency 42.757 ms oleg438-client.virtnet: 1 46049 0.83 MB/sec execute 265 sec latency 42.113 ms oleg438-client.virtnet: 1 46419 0.84 MB/sec execute 266 sec latency 15.712 ms oleg438-client.virtnet: 1 46776 0.83 MB/sec execute 267 sec latency 12.294 ms oleg438-client.virtnet: 1 47031 0.83 MB/sec execute 268 sec latency 61.054 ms oleg438-client.virtnet: 1 47334 0.83 MB/sec execute 269 sec latency 43.531 ms oleg438-client.virtnet: 1 47725 0.84 MB/sec execute 270 sec latency 43.319 ms oleg438-client.virtnet: 1 48037 0.84 MB/sec execute 271 sec latency 38.288 ms oleg438-client.virtnet: 1 48493 0.84 MB/sec execute 272 sec latency 39.029 ms oleg438-client.virtnet: 1 49143 0.86 MB/sec execute 273 sec latency 48.583 ms oleg438-client.virtnet: 1 49839 0.88 MB/sec execute 274 sec latency 17.969 ms oleg438-client.virtnet: 1 50114 0.88 MB/sec execute 275 sec latency 15.543 ms oleg438-client.virtnet: 1 50408 0.88 MB/sec execute 276 sec latency 66.769 ms oleg438-client.virtnet: 1 50657 0.88 MB/sec execute 277 sec latency 46.263 ms oleg438-client.virtnet: 1 51272 0.89 MB/sec execute 278 sec latency 47.486 ms oleg438-client.virtnet: 1 51581 0.89 MB/sec execute 279 sec latency 39.966 ms oleg438-client.virtnet: 1 52030 0.89 MB/sec execute 280 sec latency 42.653 ms oleg438-client.virtnet: 1 52354 0.89 MB/sec execute 281 sec latency 46.888 ms oleg438-client.virtnet: 1 52962 0.91 MB/sec execute 282 sec latency 54.163 ms oleg438-client.virtnet: 1 53394 0.92 MB/sec execute 283 sec latency 23.325 ms oleg438-client.virtnet: 1 53662 0.92 MB/sec execute 284 sec latency 15.693 ms oleg438-client.virtnet: 1 53956 0.92 MB/sec execute 285 sec latency 65.689 ms oleg438-client.virtnet: 1 54160 0.91 MB/sec execute 286 sec latency 50.047 ms oleg438-client.virtnet: 1 54488 0.92 MB/sec execute 287 sec latency 42.178 ms oleg438-client.virtnet: 1 54876 0.92 MB/sec 
execute 288 sec latency 45.149 ms oleg438-client.virtnet: 1 55215 0.92 MB/sec execute 289 sec latency 36.209 ms oleg438-client.virtnet: 1 55616 0.92 MB/sec execute 290 sec latency 42.850 ms oleg438-client.virtnet: 1 56109 0.93 MB/sec execute 291 sec latency 53.778 ms oleg438-client.virtnet: 1 56599 0.94 MB/sec execute 292 sec latency 52.644 ms oleg438-client.virtnet: 1 57028 0.95 MB/sec execute 293 sec latency 15.706 ms oleg438-client.virtnet: 1 57263 0.95 MB/sec execute 294 sec latency 15.473 ms oleg438-client.virtnet: 1 57530 0.95 MB/sec execute 295 sec latency 70.023 ms oleg438-client.virtnet: 1 57770 0.95 MB/sec execute 296 sec latency 49.749 ms oleg438-client.virtnet: 1 58193 0.96 MB/sec execute 297 sec latency 41.187 ms oleg438-client.virtnet: 1 58492 0.96 MB/sec execute 298 sec latency 41.684 ms oleg438-client.virtnet: 1 58841 0.96 MB/sec execute 299 sec latency 40.399 ms oleg438-client.virtnet: 1 cleanup 300 sec oleg438-client.virtnet: 0 cleanup 300 sec oleg438-client.virtnet: oleg438-client.virtnet: Operation Count AvgLat MaxLat oleg438-client.virtnet: ---------------------------------------- oleg438-client.virtnet: NTCreateX 9220 18.594 28166.054 oleg438-client.virtnet: Close 6775 4.044 17680.505 oleg438-client.virtnet: Rename 393 9.282 16.709 oleg438-client.virtnet: Unlink 1863 2.629 9.897 oleg438-client.virtnet: Qpathinfo 8356 1.772 18.359 oleg438-client.virtnet: Qfileinfo 1460 0.161 2.424 oleg438-client.virtnet: Qfsinfo 1532 0.145 8.278 oleg438-client.virtnet: Sfileinfo 758 22.413 13509.162 oleg438-client.virtnet: Find 3232 0.526 28.340 oleg438-client.virtnet: WriteX 4558 0.959 5.174 oleg438-client.virtnet: ReadX 14448 0.020 2.528 oleg438-client.virtnet: LockX 30 1.410 2.028 oleg438-client.virtnet: UnlockX 30 1.471 2.054 oleg438-client.virtnet: Flush 647 82.784 18590.502 oleg438-client.virtnet: oleg438-client.virtnet: Throughput 0.955004 MB/sec 1 clients 1 procs max_latency=28166.068 ms oleg438-client.virtnet: stopping dbench on 
/mnt/lustre/d70b.replay-single/oleg438-client.virtnet at Tue Apr 16 11:47:06 EDT 2024 with return code 0 oleg438-client.virtnet: clean dbench files on /mnt/lustre/d70b.replay-single/oleg438-client.virtnet oleg438-client.virtnet: /mnt/lustre/d70b.replay-single/oleg438-client.virtnet /mnt/lustre/d70b.replay-single/oleg438-client.virtnet oleg438-client.virtnet: removed directory: 'clients/client0/~dmtmp/PM' oleg438-client.virtnet: removed directory: 'clients/client0/~dmtmp/ACCESS' oleg438-client.virtnet: removed directory: 'clients/client0/~dmtmp/COREL' oleg438-client.virtnet: removed directory: 'clients/client0/~dmtmp/WORDPRO' oleg438-client.virtnet: removed directory: 'clients/client0/~dmtmp/SEED' oleg438-client.virtnet: removed directory: 'clients/client0/~dmtmp/PARADOX' oleg438-client.virtnet: removed directory: 'clients/client0/~dmtmp/WORD' oleg438-client.virtnet: removed directory: 'clients/client0/~dmtmp/EXCEL' oleg438-client.virtnet: removed directory: 'clients/client0/~dmtmp/PWRPNT' oleg438-client.virtnet: removed directory: 'clients/client0/~dmtmp' oleg438-client.virtnet: removed directory: 'clients/client0' oleg438-client.virtnet: removed directory: 'clients' oleg438-client.virtnet: removed 'client.txt' oleg438-client.virtnet: /mnt/lustre/d70b.replay-single/oleg438-client.virtnet oleg438-client.virtnet: dbench successfully finished PASS 70b (364s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70c: tar 1mdts recovery ============ 11:47:10 (1713282430) Starting client oleg438-client.virtnet: -o user_xattr,flock oleg438-server@tcp:/lustre /mnt/lustre Started clients oleg438-client.virtnet: 192.168.204.138@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Started tar 26405 tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets tar: Removing leading `/' from member names tar: 
Removing leading `/' from hard link targets tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 12032 2196608 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 18432 3195904 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 22528 3196928 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 40960 6392832 1% /mnt/lustre test_70c fail mds1 1 times Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:49:24 (1713282564) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:49:38 (1713282578) targets are mounted 11:49:38 (1713282578) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 11904 2196608 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 16384 3258368 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 24576 3250176 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 40960 6508544 1% /mnt/lustre test_70c fail mds1 2 times Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:52:00 (1713282720) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace 
dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:52:14 (1713282734) targets are mounted 11:52:14 (1713282734) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 70c (359s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70d: mkdir/rmdir striped dir 1mdts recovery ========================================================== 11:53:11 (1713282791) SKIP: replay-single test_70d needs >= 2 MDTs SKIP 70d (2s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70e: rename cross-MDT with random fails ========================================================== 11:53:15 (1713282795) SKIP: replay-single test_70e needs >= 2 MDTs SKIP 70e (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70f: OSS O_DIRECT recovery with 1 clients ========================================================== 11:53:18 (1713282798) mount clients oleg438-client.virtnet ... Starting client oleg438-client.virtnet: -o user_xattr,flock oleg438-server@tcp:/lustre /mnt/lustre Started clients oleg438-client.virtnet: 192.168.204.138@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... 
ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... 
ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3968 2204544 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 9216 3756032 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 7168 3762176 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 16384 7518208 1% /mnt/lustre ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... 
test_70f failing OST 1 times Failing ost1 on oleg438-server Stopping /mnt/lustre-ost1 (opts:) on oleg438-server 11:53:26 (1713282806) shut down Failover ost1 to oleg438-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... Started lustre-OST0000 11:53:40 (1713282820) targets are mounted 11:53:40 (1713282820) facet_failover done ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... 
ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... 
ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... 
ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... 
ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... 
ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3968 2204544 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 9216 3760128 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 7168 3753984 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 16384 7514112 1% /mnt/lustre Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... test_70f failing OST 2 times Failing ost1 on oleg438-server Stopping /mnt/lustre-ost1 (opts:) on oleg438-server 11:53:53 (1713282833) shut down Failover ost1 to oleg438-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... 
ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Started lustre-OST0000 11:54:07 (1713282847) targets are mounted 11:54:07 (1713282847) facet_failover done Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... 
ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... 
ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... 
ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... 
ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... 
UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3968 2204544 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 9216 3749888 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 7168 3757056 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 16384 7506944 1% /mnt/lustre ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... test_70f failing OST 3 times Failing ost1 on oleg438-server Stopping /mnt/lustre-ost1 (opts:) on oleg438-server 11:54:21 (1713282861) shut down Failover ost1 to oleg438-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... 
ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... Started lustre-OST0000 11:54:35 (1713282875) targets are mounted 11:54:35 (1713282875) facet_failover done ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... 
oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... 
ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... 
ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... 
ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3968 2204544 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 7168 3750912 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 6144 3755008 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 13312 7505920 1% /mnt/lustre ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... 
ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... test_70f failing OST 4 times Failing ost1 on oleg438-server Stopping /mnt/lustre-ost1 (opts:) on oleg438-server 11:54:48 (1713282888) shut down Failover ost1 to oleg438-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Started lustre-OST0000 11:55:02 (1713282902) targets are mounted 11:55:02 (1713282902) facet_failover done Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... 
ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... 
ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... 
ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... 
ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... 
UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3968 2204544 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 9216 3752960 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 7168 3757056 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 16384 7510016 1% /mnt/lustre ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... test_70f failing OST 5 times Failing ost1 on oleg438-server Stopping /mnt/lustre-ost1 (opts:) on oleg438-server 11:55:16 (1713282916) shut down Failover ost1 to oleg438-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... 
Started lustre-OST0000 11:55:30 (1713282930) targets are mounted 11:55:30 (1713282930) facet_failover done ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... 
osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... 
ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... ldlm.namespaces.MGC192.168.204.138@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800ab2d3800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800ab2d3800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg438-client.virtnet' ... 
PASS 70f (139s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 71a: mkdir/rmdir striped dir with 2 mdts recovery ========================================================== 11:55:40 (1713282940) SKIP: replay-single test_71a needs >= 2 MDTs SKIP 71a (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 73a: open(O_CREAT), unlink, replay, reconnect before open replay, close ========================================================== 11:55:43 (1713282943) multiop /mnt/lustre/f73a.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.6888 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3968 2204544 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre fail_loc=0x80000302 Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:55:47 (1713282947) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:56:00 (1713282960) targets are mounted 11:56:00 (1713282960) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 73a (37s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 73b: open(O_CREAT), unlink, replay, reconnect at open_replay reply, close ========================================================== 11:56:23 (1713282983) multiop /mnt/lustre/f73b.replay-single 
vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.6888 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3840 2204800 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre fail_loc=0x80000157 Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:56:26 (1713282986) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:56:40 (1713283000) targets are mounted 11:56:40 (1713283000) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 73b (37s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 74: Ensure applications don't fail waiting for OST recovery ========================================================== 11:57:03 (1713283023) Stopping clients: oleg438-client.virtnet /mnt/lustre (opts:) Stopping client oleg438-client.virtnet /mnt/lustre opts: Stopping /mnt/lustre-ost1 (opts:) on oleg438-server Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:57:06 (1713283026) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:57:20 (1713283040) targets are 
mounted 11:57:20 (1713283040) facet_failover done Starting client oleg438-client.virtnet: -o user_xattr,flock oleg438-server@tcp:/lustre /mnt/lustre Started clients oleg438-client.virtnet: 192.168.204.138@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-OST0000 PASS 74 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80a: DNE: create remote dir, drop update rep from MDT0, fail MDT0 ========================================================== 11:57:30 (1713283050) SKIP: replay-single test_80a needs >= 2 MDTs SKIP 80a (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80b: DNE: create remote dir, drop update rep from MDT0, fail MDT1 ========================================================== 11:57:34 (1713283054) SKIP: replay-single test_80b needs >= 2 MDTs SKIP 80b (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80c: DNE: create remote dir, drop update rep from MDT1, fail MDT[0,1] ========================================================== 11:57:37 (1713283057) SKIP: replay-single test_80c needs >= 2 MDTs SKIP 80c (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80d: DNE: create remote dir, drop update rep from MDT1, fail 2 MDTs ========================================================== 11:57:40 (1713283060) SKIP: replay-single test_80d needs >= 2 MDTs SKIP 80d (1s) debug_raw_pointers=0 debug_raw_pointers=0 
debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80e: DNE: create remote dir, drop MDT1 rep, fail MDT0 ========================================================== 11:57:43 (1713283063) SKIP: replay-single test_80e needs >= 2 MDTs SKIP 80e (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80f: DNE: create remote dir, drop MDT1 rep, fail MDT1 ========================================================== 11:57:47 (1713283067) SKIP: replay-single test_80f needs >= 2 MDTs SKIP 80f (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80g: DNE: create remote dir, drop MDT1 rep, fail MDT0, then MDT1 ========================================================== 11:57:50 (1713283070) SKIP: replay-single test_80g needs >= 2 MDTs SKIP 80g (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80h: DNE: create remote dir, drop MDT1 rep, fail 2 MDTs ========================================================== 11:57:53 (1713283073) SKIP: replay-single test_80h needs >= 2 MDTs SKIP 80h (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81a: DNE: unlink remote dir, drop MDT0 update rep, fail MDT1 ========================================================== 11:57:56 (1713283076) SKIP: replay-single test_81a needs >= 2 MDTs SKIP 81a (2s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81b: DNE: unlink remote dir, drop MDT0 update reply, fail MDT0 ========================================================== 11:58:00 (1713283080) SKIP: replay-single test_81b needs >= 2 MDTs SKIP 81b (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81c: DNE: unlink remote dir, drop MDT0 update reply, fail MDT0,MDT1 
========================================================== 11:58:03 (1713283083) SKIP: replay-single test_81c needs >= 2 MDTs SKIP 81c (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81d: DNE: unlink remote dir, drop MDT0 update reply, fail 2 MDTs ========================================================== 11:58:06 (1713283086) SKIP: replay-single test_81d needs >= 2 MDTs SKIP 81d (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81e: DNE: unlink remote dir, drop MDT1 req reply, fail MDT0 ========================================================== 11:58:10 (1713283090) SKIP: replay-single test_81e needs >= 2 MDTs SKIP 81e (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81f: DNE: unlink remote dir, drop MDT1 req reply, fail MDT1 ========================================================== 11:58:13 (1713283093) SKIP: replay-single test_81f needs >= 2 MDTs SKIP 81f (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81g: DNE: unlink remote dir, drop req reply, fail M0, then M1 ========================================================== 11:58:16 (1713283096) SKIP: replay-single test_81g needs >= 2 MDTs SKIP 81g (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81h: DNE: unlink remote dir, drop request reply, fail 2 MDTs ========================================================== 11:58:20 (1713283100) SKIP: replay-single test_81h needs >= 2 MDTs SKIP 81h (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 84a: stale open during export disconnect ========================================================== 11:58:23 (1713283103) fail_loc=0x80000144 total: 1 open/close in 0.01 seconds: 159.26 ops/second 
pdsh@oleg438-client: oleg438-client: ssh exited with exit code 5 PASS 84a (5s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 85a: check the cancellation of unused locks during recovery(IBITS) ========================================================== 11:58:30 (1713283110) before recovery: unused locks count = 201 Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 11:58:34 (1713283114) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 11:58:48 (1713283128) targets are mounted 11:58:48 (1713283128) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec after recovery: unused locks count = 101 PASS 85a (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 85b: check the cancellation of unused locks during recovery(EXTENT) ========================================================== 11:58:56 (1713283136) before recovery: unused locks count = 100 Failing ost1 on oleg438-server Stopping /mnt/lustre-ost1 (opts:) on oleg438-server 11:59:02 (1713283142) shut down Failover ost1 to oleg438-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-OST0000 11:59:15 (1713283155) targets are mounted 11:59:15 
(1713283155) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec after recovery: unused locks count = 0 PASS 85b (23s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 86: umount server after clear nid_stats should not hit LBUG ========================================================== 11:59:21 (1713283161) Stopping clients: oleg438-client.virtnet /mnt/lustre (opts:) Stopping client oleg438-client.virtnet /mnt/lustre opts: mdt.lustre-MDT0000.exports.clear=0 Stopping /mnt/lustre-mds1 (opts:) on oleg438-server Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 Starting client oleg438-client.virtnet: -o user_xattr,flock oleg438-server@tcp:/lustre /mnt/lustre Started clients oleg438-client.virtnet: 192.168.204.138@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) PASS 86 (8s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 87a: write replay ================== 11:59:31 (1713283171) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3840 2204800 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 4096 3765248 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 4096 3765248 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 8192 7530496 1% /mnt/lustre 8+0 records in 8+0 records out 8388608 bytes (8.4 MB) copied, 0.152487 s, 55.0 MB/s Failing ost1 on oleg438-server Stopping /mnt/lustre-ost1 (opts:) on oleg438-server 11:59:35 (1713283175) shut down Failover ost1 
to oleg438-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-OST0000 11:59:49 (1713283189) targets are mounted 11:59:49 (1713283189) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec 8+0 records in 8+0 records out 8388608 bytes (8.4 MB) copied, 0.064972 s, 129 MB/s PASS 87a (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 87b: write replay with changed data (checksum resend) ========================================================== 11:59:57 (1713283197) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3840 2204800 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 12288 3757056 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 4096 3765248 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 16384 7522304 1% /mnt/lustre 8+0 records in 8+0 records out 8388608 bytes (8.4 MB) copied, 0.121764 s, 68.9 MB/s 8+0 records in 8+0 records out 8 bytes (8 B) copied, 0.00292045 s, 2.7 kB/s Failing ost1 on oleg438-server Stopping /mnt/lustre-ost1 (opts:) on oleg438-server 12:00:02 (1713283202) shut down Failover ost1 to oleg438-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-OST0000 12:00:16 (1713283216) targets are mounted 12:00:16 (1713283216) 
facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec 0+1 records in 0+1 records out 72 bytes (72 B) copied, 0.00547399 s, 13.2 kB/s PASS 87b (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 88: MDS should not assign same objid to different files ========================================================== 12:00:23 (1713283223) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3840 2204800 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 13312 3756032 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 4096 3765248 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 17408 7521280 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3840 2204800 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 13312 3756032 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 4096 3765248 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 17408 7521280 1% /mnt/lustre before test: last_id = 11041, next_id = 11012 Creating to objid 11041 on ost lustre-OST0000... 
total: 31 open/close in 0.15 seconds: 201.11 ops/second total: 8 open/close in 0.04 seconds: 207.41 ops/second before recovery: last_id = 11137, next_id = 11050 Stopping /mnt/lustre-mds1 (opts:) on oleg438-server Stopping /mnt/lustre-ost1 (opts:) on oleg438-server Failover mds1 to oleg438-server Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 Failover ost1 to oleg438-server Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-OST0000 after recovery: last_id = 11081, next_id = 11050 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0442006 s, 11.9 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0428011 s, 12.2 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0405071 s, 12.9 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0371105 s, 14.1 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0434148 s, 12.1 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0396444 s, 13.2 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0410396 s, 12.8 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0374697 s, 14.0 MB/s -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11012 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11013 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11014 -rw-r--r-- 1 root root 0 Apr 16 12:00 
/mnt/lustre/d88.replay-single/f-11015 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11016 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11017 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11018 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11019 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11020 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11021 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11022 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11023 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11024 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11025 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11026 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11027 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11028 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11029 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11030 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11031 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11032 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11033 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11034 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11035 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11036 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11037 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11038 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11039 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11040 -rw-r--r-- 1 root root 0 Apr 16 12:00 
/mnt/lustre/d88.replay-single/f-11041 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11042 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11043 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11044 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11045 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11046 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11047 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11048 -rw-r--r-- 1 root root 0 Apr 16 12:00 /mnt/lustre/d88.replay-single/f-11049 -rw-r--r-- 1 root root 524288 Apr 16 12:01 /mnt/lustre/d88.replay-single/f-11053 -rw-r--r-- 1 root root 524288 Apr 16 12:01 /mnt/lustre/d88.replay-single/f-11054 -rw-r--r-- 1 root root 524288 Apr 16 12:01 /mnt/lustre/d88.replay-single/f-11055 -rw-r--r-- 1 root root 524288 Apr 16 12:01 /mnt/lustre/d88.replay-single/f-11056 -rw-r--r-- 1 root root 524288 Apr 16 12:01 /mnt/lustre/d88.replay-single/f-11057 -rw-r--r-- 1 root root 524288 Apr 16 12:01 /mnt/lustre/d88.replay-single/f-11058 -rw-r--r-- 1 root root 524288 Apr 16 12:01 /mnt/lustre/d88.replay-single/f-11059 -rw-r--r-- 1 root root 524288 Apr 16 12:01 /mnt/lustre/d88.replay-single/f-11060 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0374201 s, 14.0 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0367735 s, 14.3 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0360916 s, 14.5 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.035991 s, 14.6 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0365627 s, 14.3 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0364985 s, 14.4 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0401226 s, 13.1 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0412189 s, 12.7 MB/s 
PASS 88 (54s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 89: no disk space leak on late ost connection ========================================================== 12:01:19 (1713283279) Waiting for orphan cleanup... osp.lustre-OST0000-osc-MDT0000.old_sync_processed osp.lustre-OST0001-osc-MDT0000.old_sync_processed wait 40 secs maximumly for oleg438-server mds-ost sync done. sleep 5 for ZFS zfs Waiting for MDT destroys to complete 10+0 records in 10+0 records out 41943040 bytes (42 MB) copied, 0.269764 s, 155 MB/s Stopping /mnt/lustre-ost1 (opts:) on oleg438-server Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 12:01:32 (1713283292) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 12:01:45 (1713283305) targets are mounted 12:01:45 (1713283305) facet_failover done Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-OST0000 Starting client: oleg438-client.virtnet: -o user_xattr,flock oleg438-server@tcp:/lustre /mnt/lustre osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 68 sec Waiting for orphan cleanup... osp.lustre-OST0000-osc-MDT0000.old_sync_processed osp.lustre-OST0001-osc-MDT0000.old_sync_processed wait 40 secs maximumly for oleg438-server mds-ost sync done. 
sleep 5 for ZFS zfs Waiting for MDT destroys to complete free_before: 7517184 free_after: 7517184 PASS 89 (113s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 90: lfs find identifies the missing striped file segments ========================================================== 12:03:14 (1713283394) Create the files Fail ost1 lustre-OST0000_UUID, display the list of affected files Stopping /mnt/lustre-ost1 (opts:) on oleg438-server General Query: lfs find /mnt/lustre/d90.replay-single /mnt/lustre/d90.replay-single /mnt/lustre/d90.replay-single/all /mnt/lustre/d90.replay-single/f1 /mnt/lustre/d90.replay-single/f0 Querying files on shutdown ost1: lfs find --obd lustre-OST0000_UUID /mnt/lustre/d90.replay-single/all /mnt/lustre/d90.replay-single/f0 Check getstripe: /home/green/git/lustre-release/lustre/utils/lfs getstripe -r --obd lustre-OST0000_UUID /mnt/lustre/d90.replay-single/all lmm_stripe_count: 2 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 1 obdidx objid objid group 0 11082 0x2b4a 0x240000400 * /mnt/lustre/d90.replay-single/f0 lmm_stripe_count: 1 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 0 obdidx objid objid group 0 11083 0x2b4b 0x240000400 * /mnt/lustre/d90.replay-single/all /mnt/lustre/d90.replay-single/f0 Failover ost1 to oleg438-server Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-OST0000 PASS 90 (20s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 93a: replay + reconnect ============ 12:03:37 (1713283417) 1+0 records in 1+0 records out 1024 bytes (1.0 kB) copied, 0.00275292 s, 372 kB/s fail_val=40 
fail_loc=0x715 Failing ost1 on oleg438-server Stopping /mnt/lustre-ost1 (opts:) on oleg438-server 12:03:39 (1713283419) shut down Failover ost1 to oleg438-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-OST0000 12:03:53 (1713283433) targets are mounted 12:03:53 (1713283433) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec PASS 93a (60s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 93b: replay + reconnect on mds ===== 12:04:39 (1713283479) total: 20 open/close in 0.06 seconds: 308.79 ops/second fail_val=80 fail_loc=0x715 Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 12:04:41 (1713283481) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 12:04:54 (1713283494) targets are mounted 12:04:54 (1713283494) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 93b (99s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 100a: DNE: create striped dir, drop update rep from MDT1, fail MDT1 
========================================================== 12:06:20 (1713283580) SKIP: replay-single test_100a needs >= 2 MDTs SKIP 100a (0s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 100b: DNE: create striped dir, fail MDT0 ========================================================== 12:06:22 (1713283582) SKIP: replay-single test_100b needs >= 2 MDTs SKIP 100b (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 100c: DNE: create striped dir, abort_recov_mdt mds2 ========================================================== 12:06:24 (1713283584) SKIP: replay-single test_100c needs >= 2 MDTs SKIP 100c (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 100d: DNE: cancel update logs upon recovery abort ========================================================== 12:06:27 (1713283587) SKIP: replay-single test_100d needs > 1 MDTs SKIP 100d (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 100e: DNE: create striped dir on MDT0 and MDT1, fail MDT0, MDT1 ========================================================== 12:06:30 (1713283590) SKIP: replay-single test_100e needs >= 2 MDTs SKIP 100e (0s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 101: Shouldn't reassign precreated objs to other files after recovery ========================================================== 12:06:32 (1713283592) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3968 2204672 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 21504 3747840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 4096 3765248 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 25600 7513088 1% /mnt/lustre Stopping /mnt/lustre-mds1 (opts:) on oleg438-server Failover mds1 to oleg438-server Starting mds1: -o 
localrecov -o abort_recovery lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 pdsh@oleg438-client: oleg438-client: ssh exited with exit code 5 first stat failed: 5 PASS 101 (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 102a: check resend (request lost) with multiple modify RPCs in flight ========================================================== 12:07:00 (1713283620) creating 7 files ... fail_loc=0x159 launch 7 chmod in parallel (12:07:01) ... fail_loc=0 done (12:07:17) /mnt/lustre/d102a.replay-single/file-1 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-2 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-3 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-4 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-5 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-6 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-7 has perms 0600 OK PASS 102a (19s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 102b: check resend (reply lost) with multiple modify RPCs in flight ========================================================== 12:07:21 (1713283641) creating 7 files ... fail_loc=0x15a launch 7 chmod in parallel (12:07:22) ... 
fail_loc=0 done (12:07:38) /mnt/lustre/d102b.replay-single/file-1 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-2 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-3 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-4 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-5 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-6 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-7 has perms 0600 OK PASS 102b (18s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 102c: check replay w/o reconstruction with multiple mod RPCs in flight ========================================================== 12:07:41 (1713283661) creating 7 files ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 4096 2204544 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 21504 3333120 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 4096 3350528 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 25600 6683648 1% /mnt/lustre fail_loc=0x15a launch 7 chmod in parallel (12:07:44) ... 
fail_loc=0 Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 12:07:46 (1713283666) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 12:08:00 (1713283680) targets are mounted 12:08:00 (1713283680) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec done (12:08:04) /mnt/lustre/d102c.replay-single/file-1 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-2 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-3 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-4 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-5 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-6 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-7 has perms 0600 OK PASS 102c (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 102d: check replay & reconstruction with multiple mod RPCs in flight ========================================================== 12:08:08 (1713283688) creating 7 files ... fail_loc=0x15a launch 7 chmod in parallel (12:08:09) ... 
fail_loc=0 Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 12:08:12 (1713283692) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 12:08:26 (1713283706) targets are mounted 12:08:26 (1713283706) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec done (12:08:30) /mnt/lustre/d102d.replay-single/file-1 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-2 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-3 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-4 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-5 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-6 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-7 has perms 0600 OK PASS 102d (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 103: Check otr_next_id overflow ==== 12:08:34 (1713283714) fail_loc=0x80000162 total: 30 open/close in 0.16 seconds: 193.31 ops/second Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 12:08:37 (1713283717) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 12:08:50 (1713283730) targets are mounted 12:08:50 (1713283730) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount 
(FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 103 (22s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110a: DNE: create striped dir, fail MDT1 ========================================================== 12:08:58 (1713283738) SKIP: replay-single test_110a needs >= 2 MDTs SKIP 110a (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110b: DNE: create striped dir, fail MDT1 and client ========================================================== 12:09:02 (1713283742) SKIP: replay-single test_110b needs >= 2 MDTs SKIP 110b (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110c: DNE: create striped dir, fail MDT2 ========================================================== 12:09:05 (1713283745) SKIP: replay-single test_110c needs >= 2 MDTs SKIP 110c (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110d: DNE: create striped dir, fail MDT2 and client ========================================================== 12:09:08 (1713283748) SKIP: replay-single test_110d needs >= 2 MDTs SKIP 110d (2s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110e: DNE: create striped dir, uncommit on MDT2, fail client/MDT1/MDT2 ========================================================== 12:09:12 (1713283752) SKIP: replay-single test_110e needs >= 2 MDTs SKIP 110e (1s) debug_raw_pointers=0 debug_raw_pointers=0 SKIP: replay-single test_110f skipping excluded test 110f debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110g: DNE: create striped dir, uncommit on MDT1, fail client/MDT1/MDT2 ========================================================== 12:09:16 (1713283756) SKIP: replay-single test_110g needs >= 2 MDTs SKIP 
110g (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111a: DNE: unlink striped dir, fail MDT1 ========================================================== 12:09:19 (1713283759) SKIP: replay-single test_111a needs >= 2 MDTs SKIP 111a (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111b: DNE: unlink striped dir, fail MDT2 ========================================================== 12:09:23 (1713283763) SKIP: replay-single test_111b needs >= 2 MDTs SKIP 111b (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111c: DNE: unlink striped dir, uncommit on MDT1, fail client/MDT1/MDT2 ========================================================== 12:09:26 (1713283766) SKIP: replay-single test_111c needs >= 2 MDTs SKIP 111c (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111d: DNE: unlink striped dir, uncommit on MDT2, fail client/MDT1/MDT2 ========================================================== 12:09:29 (1713283769) SKIP: replay-single test_111d needs >= 2 MDTs SKIP 111d (2s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111e: DNE: unlink striped dir, uncommit on MDT2, fail MDT1/MDT2 ========================================================== 12:09:33 (1713283773) SKIP: replay-single test_111e needs >= 2 MDTs SKIP 111e (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111f: DNE: unlink striped dir, uncommit on MDT1, fail MDT1/MDT2 ========================================================== 12:09:36 (1713283776) SKIP: replay-single test_111f needs >= 2 MDTs SKIP 111f (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111g: DNE: unlink striped dir, fail 
MDT1/MDT2 ========================================================== 12:09:40 (1713283780) SKIP: replay-single test_111g needs >= 2 MDTs SKIP 111g (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112a: DNE: cross MDT rename, fail MDT1 ========================================================== 12:09:43 (1713283783) SKIP: replay-single test_112a needs >= 4 MDTs SKIP 112a (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112b: DNE: cross MDT rename, fail MDT2 ========================================================== 12:09:46 (1713283786) SKIP: replay-single test_112b needs >= 4 MDTs SKIP 112b (2s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112c: DNE: cross MDT rename, fail MDT3 ========================================================== 12:09:50 (1713283790) SKIP: replay-single test_112c needs >= 4 MDTs SKIP 112c (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112d: DNE: cross MDT rename, fail MDT4 ========================================================== 12:09:53 (1713283793) SKIP: replay-single test_112d needs >= 4 MDTs SKIP 112d (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112e: DNE: cross MDT rename, fail MDT1 and MDT2 ========================================================== 12:09:57 (1713283797) SKIP: replay-single test_112e needs >= 4 MDTs SKIP 112e (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112f: DNE: cross MDT rename, fail MDT1 and MDT3 ========================================================== 12:10:00 (1713283800) SKIP: replay-single test_112f needs >= 4 MDTs SKIP 112f (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 
112g: DNE: cross MDT rename, fail MDT1 and MDT4 ========================================================== 12:10:04 (1713283804) SKIP: replay-single test_112g needs >= 4 MDTs SKIP 112g (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112h: DNE: cross MDT rename, fail MDT2 and MDT3 ========================================================== 12:10:07 (1713283807) SKIP: replay-single test_112h needs >= 4 MDTs SKIP 112h (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112i: DNE: cross MDT rename, fail MDT2 and MDT4 ========================================================== 12:10:11 (1713283811) SKIP: replay-single test_112i needs >= 4 MDTs SKIP 112i (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112j: DNE: cross MDT rename, fail MDT3 and MDT4 ========================================================== 12:10:14 (1713283814) SKIP: replay-single test_112j needs >= 4 MDTs SKIP 112j (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112k: DNE: cross MDT rename, fail MDT1,MDT2,MDT3 ========================================================== 12:10:17 (1713283817) SKIP: replay-single test_112k needs >= 4 MDTs SKIP 112k (2s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112l: DNE: cross MDT rename, fail MDT1,MDT2,MDT4 ========================================================== 12:10:21 (1713283821) SKIP: replay-single test_112l needs >= 4 MDTs SKIP 112l (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112m: DNE: cross MDT rename, fail MDT1,MDT3,MDT4 ========================================================== 12:10:24 (1713283824) SKIP: replay-single test_112m needs >= 4 MDTs SKIP 112m (2s) debug_raw_pointers=0 
debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112n: DNE: cross MDT rename, fail MDT2,MDT3,MDT4 ========================================================== 12:10:28 (1713283828) SKIP: replay-single test_112n needs >= 4 MDTs SKIP 112n (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 115: failover for create/unlink striped directory ========================================================== 12:10:31 (1713283831) SKIP: replay-single test_115 needs >= 2 MDTs SKIP 115 (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 116a: large update log master MDT recovery ========================================================== 12:10:35 (1713283835) SKIP: replay-single test_116a needs >= 2 MDTs SKIP 116a (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 116b: large update log slave MDT recovery ========================================================== 12:10:38 (1713283838) SKIP: replay-single test_116b needs >= 2 MDTs SKIP 116b (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 117: DNE: cross MDT unlink, fail MDT1 and MDT2 ========================================================== 12:10:41 (1713283841) SKIP: replay-single test_117 needs >= 4 MDTs SKIP 117 (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 118: invalidate osp update will not cause update log corruption ========================================================== 12:10:45 (1713283845) SKIP: replay-single test_118 needs >= 2 MDTs SKIP 118 (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 119: timeout of normal replay does not cause DNE replay fails ========================================================== 12:10:48 
(1713283848) SKIP: replay-single test_119 needs >= 2 MDTs SKIP 119 (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 120: DNE fail abort should stop both normal and DNE replay ========================================================== 12:10:51 (1713283851) SKIP: replay-single test_120 needs >= 2 MDTs SKIP 120 (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 121: lock replay timed out and race ========================================================== 12:10:55 (1713283855) multiop /mnt/lustre/f121.replay-single vs_s TMPPIPE=/tmp/multiop_open_wait_pipe.6888 Stopping /mnt/lustre-mds1 (opts:) on oleg438-server Failover mds1 to oleg438-server fail_loc=0x721 fail_val=0 at_max=0 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 fail_loc=0x0 at_max=600 PASS 121 (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 130a: DoM file create (setstripe) replay ========================================================== 12:11:25 (1713283885) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 4096 2204544 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 21504 3333120 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 4096 3765248 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 25600 7098368 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 12:11:28 (1713283888) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl 
super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 12:11:42 (1713283902) targets are mounted 12:11:42 (1713283902) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 130a (23s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 130b: DoM file create (inherited) replay ========================================================== 12:11:50 (1713283910) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3968 2204544 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 21504 3333120 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 4096 3765248 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 25600 7098368 1% /mnt/lustre Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 12:11:54 (1713283914) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 12:12:07 (1713283927) targets are mounted 12:12:07 (1713283927) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 130b (23s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 131a: DoM file write lock replay === 12:12:16 (1713283936) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3968 2204544 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 21504 3333120 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 
4096 3765248 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 25600 7098368 1% /mnt/lustre 1+0 records in 1+0 records out 8 bytes (8 B) copied, 0.00225099 s, 3.6 kB/s Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 12:12:19 (1713283939) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 12:12:33 (1713283953) targets are mounted 12:12:33 (1713283953) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 131a (23s) debug_raw_pointers=0 debug_raw_pointers=0 SKIP: replay-single test_131b skipping excluded test 131b debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 132a: PFL new component instantiate replay ========================================================== 12:12:42 (1713283962) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3968 2204544 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 21504 3333120 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 4096 3765248 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 25600 7098368 1% /mnt/lustre 1+0 records in 1+0 records out 1048576 bytes (1.0 MB) copied, 0.0298582 s, 35.1 MB/s /mnt/lustre/f132a.replay-single lcm_layout_gen: 3 lcm_mirror_count: 1 lcm_entry_count: 2 lcme_id: 1 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 0 lcme_extent.e_end: 1048576 lmm_stripe_count: 1 lmm_stripe_size: 1048576 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 0 lmm_objects: - 0: { l_ost_idx: 0, l_fid: [0x240000400:0x306a:0x0] } lcme_id: 2 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 1048576 
lcme_extent.e_end: EOF lmm_stripe_count: 2 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 1 lmm_objects: - 0: { l_ost_idx: 1, l_fid: [0x280000400:0x2fc2:0x0] } - 1: { l_ost_idx: 0, l_fid: [0x240000400:0x306b:0x0] } Failing mds1 on oleg438-server Stopping /mnt/lustre-mds1 (opts:) on oleg438-server 12:12:46 (1713283966) shut down Failover mds1 to oleg438-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1 Started lustre-MDT0000 12:12:59 (1713283979) targets are mounted 12:12:59 (1713283979) facet_failover done oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f132a.replay-single lcm_layout_gen: 1 lcm_mirror_count: 1 lcm_entry_count: 2 lcme_id: 1 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 0 lcme_extent.e_end: 1048576 lmm_stripe_count: 1 lmm_stripe_size: 1048576 lmm_pattern: raid0 lmm_layout_gen: 65535 lmm_stripe_offset: 0 lmm_objects: - 0: { l_ost_idx: 0, l_fid: [0x240000400:0x306a:0x0] } lcme_id: 2 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 1048576 lcme_extent.e_end: EOF lmm_stripe_count: 2 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 65535 lmm_stripe_offset: 1 lmm_objects: - 0: { l_ost_idx: 1, l_fid: [0x280000400:0x2fc2:0x0] } - 1: { l_ost_idx: 0, l_fid: [0x240000400:0x306b:0x0] } PASS 132a (23s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 133: check resend of ongoing requests for lwp during failover ========================================================== 12:13:07 (1713283987) SKIP: replay-single test_133 needs >= 2 MDTs SKIP 133 (2s) debug_raw_pointers=0 
debug_raw_pointers=0
debug_raw_pointers=Y
debug_raw_pointers=Y
== replay-single test 134: replay creation of a file created in a pool ========================================================== 12:13:11 (1713283991)
Creating new pool
oleg438-server: Pool lustre.pool_134 created
Adding targets to pool
oleg438-server: OST lustre-OST0001_UUID added to pool lustre.pool_134
UUID                 1K-blocks   Used  Available  Use%  Mounted on
lustre-MDT0000_UUID    2210688   4096    2204544    1%  /mnt/lustre[MDT:0]
lustre-OST0000_UUID    3771392  21504    3747840    1%  /mnt/lustre[OST:0]
lustre-OST0001_UUID    3771392   5120    3764224    1%  /mnt/lustre[OST:1]
filesystem_summary:    7542784  26624    7512064    1%  /mnt/lustre
Failing mds1 on oleg438-server
Stopping /mnt/lustre-mds1 (opts:) on oleg438-server
12:13:20 (1713284000) shut down
Failover mds1 to oleg438-server
mount facets: mds1
Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1
oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1
Started lustre-MDT0000
12:13:34 (1713284014) targets are mounted
12:13:34 (1713284014) facet_failover done
oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid
mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec
Destroy the created pools: pool_134
lustre.pool_134
oleg438-server: OST lustre-OST0001_UUID removed from pool lustre.pool_134
oleg438-server: Pool lustre.pool_134 destroyed
PASS 134 (35s)
debug_raw_pointers=0
debug_raw_pointers=0
debug_raw_pointers=Y
debug_raw_pointers=Y
== replay-single test 135: Server failure in lock replay phase ========================================================== 12:13:48 (1713284028)
Failing ost1 on oleg438-server
Stopping /mnt/lustre-ost1 (opts:) on oleg438-server
12:13:50 (1713284030) shut down
Failover ost1 to oleg438-server
mount facets: ost1
Starting ost1: -o localrecov
lustre-ost1/ost1 /mnt/lustre-ost1
seq.cli-lustre-OST0000-super.width=65536
oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1
Started lustre-OST0000
12:14:04 (1713284044) targets are mounted
12:14:04 (1713284044) facet_failover done
oleg438-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid
osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec
UUID                 1K-blocks   Used  Available  Use%  Mounted on
lustre-MDT0000_UUID    2210688   4096    2204544    1%  /mnt/lustre[MDT:0]
lustre-OST0000_UUID    3771392  21504    3747840    1%  /mnt/lustre[OST:0]
lustre-OST0001_UUID    3771392   5120    3764224    1%  /mnt/lustre[OST:1]
filesystem_summary:    7542784  26624    7512064    1%  /mnt/lustre
ldlm.cancel_unused_locks_before_replay=0
Stopping /mnt/lustre-ost1 (opts:) on oleg438-server
Failover ost1 to oleg438-server
oleg438-server: oleg438-server.virtnet: executing load_module ../libcfs/libcfs/libcfs
fail_loc=0x32d
fail_val=20
Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1
seq.cli-lustre-OST0000-super.width=65536
oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1
Started lustre-OST0000
oleg438-client.virtnet: executing wait_import_state_mount REPLAY_LOCKS osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid
osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in REPLAY_LOCKS state after 0 sec
Stopping /mnt/lustre-ost1 (opts:) on oleg438-server
Failover ost1 to oleg438-server
oleg438-server: oleg438-server.virtnet: executing load_module ../libcfs/libcfs/libcfs
fail_loc=0
Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1
seq.cli-lustre-OST0000-super.width=65536
End of sync
oleg438-server: oleg438-server.virtnet: executing
set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1
Started lustre-OST0000
Stopping /mnt/lustre-ost1 (opts:-f) on oleg438-server
Stopping /mnt/lustre-ost2 (opts:-f) on oleg438-server
Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1
seq.cli-lustre-OST0000-super.width=65536
oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1
Started lustre-OST0000
Starting ost2: -o localrecov lustre-ost2/ost2 /mnt/lustre-ost2
seq.cli-lustre-OST0001-super.width=65536
oleg438-server: oleg438-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
pdsh@oleg438-client: oleg438-server: ssh exited with exit code 1
Started lustre-OST0001
pdsh@oleg438-client: oleg438-client: ssh exited with exit code 5
ldlm.cancel_unused_locks_before_replay=1
PASS 135 (93s)
debug_raw_pointers=0
debug_raw_pointers=0
debug_raw_pointers=Y
debug_raw_pointers=Y
== replay-single test 136: MDS to disconnect all OSPs first, then cleanup ldlm ========================================================== 12:15:23 (1713284123)
SKIP: replay-single test_136 needs > 2 MDTs
SKIP 136 (1s)
debug_raw_pointers=0
debug_raw_pointers=0
debug_raw_pointers=Y
debug_raw_pointers=Y
== replay-single test 200: Dropping one OBD_PING should not cause disconnect ========================================================== 12:15:27 (1713284127)
SKIP: replay-single test_200 Need remote client
SKIP 200 (1s)
debug_raw_pointers=0
debug_raw_pointers=0
== replay-single test complete, duration 5158 sec ======== 12:15:29 (1713284129)
=== replay-single: start cleanup 12:15:29 (1713284129) ===
=== replay-single: finish cleanup 12:15:32 (1713284132) ===