-----============= acceptance-small: replay-dual ============----- Thu Apr 18 03:23:15 EDT 2024 excepting tests: 14b 21b 21b skipping tests SLOW=no: 21b === replay-dual: start setup 03:23:19 (1713424999) === Starting client oleg259-client.virtnet: -o user_xattr,flock oleg259-server@tcp:/lustre /mnt/lustre2 Started clients oleg259-client.virtnet: 192.168.202.159@tcp:/lustre on /mnt/lustre2 type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) oleg259-client.virtnet: executing check_config_client /mnt/lustre oleg259-client.virtnet: Checking config lustre mounted on /mnt/lustre Checking servers environments Checking clients oleg259-client.virtnet environments Using TIMEOUT=20 osc.lustre-OST0000-osc-ffff8800ab6cf000.idle_timeout=debug osc.lustre-OST0000-osc-ffff8800b425e800.idle_timeout=debug osc.lustre-OST0001-osc-ffff8800ab6cf000.idle_timeout=debug osc.lustre-OST0001-osc-ffff8800b425e800.idle_timeout=debug disable quota as required oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all === replay-dual: finish setup 03:23:27 (1713425007) === debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 0a: expired recovery with lost client ========================================================== 03:23:28 (1713425008) Check file is LU482_FAILED=/tmp/replay-dual.lu482.VEC3Jz UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre total: 50 open/close in 0.37 seconds: 135.15 ops/second fail_loc=0x80000514 Failing mds1 on oleg259-server Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 03:23:31 (1713425011) shut down Failover mds1 to oleg259-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:23:43 (1713425023) targets are mounted 03:23:43 (1713425023) facet_failover done Starting client: oleg259-client.virtnet: -o user_xattr,flock oleg259-server@tcp:/lustre /mnt/lustre2 - unlinked 0 (time 1713425106 ; total 0 ; last 0) total: 50 unlinks in 0 seconds: inf unlinks/second PASS 0a (99s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 0b: lost client during waiting for next transno ========================================================== 03:25:09 (1713425109) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg259-server Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 03:25:11 (1713425111) shut down Failover mds1 to oleg259-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:25:24 (1713425124) targets are mounted 03:25:24 (1713425124) facet_failover done Starting client: oleg259-client.virtnet: -o user_xattr,flock oleg259-server@tcp:/lustre /mnt/lustre Starting client: oleg259-client.virtnet: -o user_xattr,flock oleg259-server@tcp:/lustre /mnt/lustre2 PASS 0b (86s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 1: |X| simple create ================= 03:26:37 (1713425197) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg259-server Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 03:26:39 (1713425199) shut down Failover mds1 to oleg259-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:26:52 (1713425212) targets are mounted 03:26:52 (1713425212) facet_failover done oleg259-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 1 (20s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 2: |X| mkdir adir ==================== 03:26:59 (1713425219) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg259-server Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 03:27:01 (1713425221) shut down Failover mds1 to oleg259-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:27:14 (1713425234) targets are mounted 03:27:14 (1713425234) facet_failover done oleg259-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 2 (19s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 3: |X| mkdir adir, mkdir adir/bdir === 03:27:20 (1713425240) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg259-server Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 03:27:22 (1713425242) shut down Failover mds1 to oleg259-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:27:35 (1713425255) targets are mounted 03:27:35 (1713425255) facet_failover done oleg259-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 3 (22s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 4: |X| mkdir adir (-EEXIST), mkdir adir/bdir ========================================================== 03:27:43 (1713425263) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre mkdir: cannot create directory '/mnt/lustre/adir': File exists Failing mds1 on oleg259-server Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 03:27:46 (1713425266) shut down Failover mds1 to oleg259-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:27:58 (1713425278) targets are mounted 03:27:58 (1713425278) facet_failover done oleg259-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 4 (22s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 5: open, unlink |X| close ============ 03:28:06 (1713425286) multiop /mnt/lustre2/a vo_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.6912 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg259-server Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 03:28:09 (1713425289) shut down Failover mds1 to oleg259-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:28:22 (1713425302) targets are mounted 03:28:22 (1713425302) facet_failover done oleg259-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 5 (22s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 6: open1, open2, unlink |X| close1 [fail mds1] close2 ========================================================== 03:28:29 (1713425309) multiop /mnt/lustre2/a vo_c TMPPIPE=/tmp/multiop_open_wait_pipe.6912 multiop /mnt/lustre/a vo_c TMPPIPE=/tmp/multiop_open_wait_pipe.6912 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg259-server Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 03:28:32 (1713425312) shut down Failover mds1 to oleg259-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:28:44 (1713425324) targets are mounted 03:28:44 (1713425324) facet_failover done oleg259-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 6 (20s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 8: replay of resent request ========== 03:28:50 (1713425330) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre fail_loc=0x119 fail_loc=0 Failing mds1 on oleg259-server Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 03:29:09 (1713425349) shut down Failover mds1 to oleg259-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:29:22 (1713425362) targets are mounted 03:29:22 (1713425362) facet_failover done oleg259-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 8 (36s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 9: resending a replayed create ======= 03:29:28 (1713425368) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre fail_loc=0x80000119 Failing mds1 on oleg259-server Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 03:29:31 (1713425371) shut down Failover mds1 to oleg259-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:29:43 (1713425383) targets are mounted 03:29:43 (1713425383) facet_failover done oleg259-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0 PASS 9 (34s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 10: resending a replayed unlink ====== 03:30:04 (1713425404) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3200 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre fail_loc=0x80000119 Failing mds1 on oleg259-server Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 03:30:06 (1713425406) shut down Failover mds1 to oleg259-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:30:19 (1713425419) targets are mounted 03:30:19 (1713425419) facet_failover done oleg259-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0 PASS 10 (34s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 11: both clients timeout during replay ========================================================== 03:30:40 (1713425440) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre fail_loc=0x0119 Failing mds1 on oleg259-server Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 03:30:43 (1713425443) shut down Failover mds1 to oleg259-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:30:55 (1713425455) targets are mounted 03:30:55 (1713425455) facet_failover done oleg259-client.virtnet: executing wait_import_state_mount FULL mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 15 sec fail_loc=0 PASS 11 (32s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 12: open resend timeout ============== 03:31:14 (1713425474) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre multiop /mnt/lustre/f12.replay-dual vmo_c TMPPIPE=/tmp/multiop_open_wait_pipe.6912 fail_loc=0x80000302 Failing mds1 on oleg259-server Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 03:31:17 (1713425477) shut down Failover mds1 to oleg259-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:31:29 (1713425489) targets are mounted 03:31:29 (1713425489) facet_failover done fail_loc=0 /mnt/lustre/f12.replay-dual /mnt/lustre/f12.replay-dual has type file OK PASS 12 (32s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 13: close resend timeout ============= 03:31:48 (1713425508) multiop /mnt/lustre/f13.replay-dual vmo_c TMPPIPE=/tmp/multiop_open_wait_pipe.6912 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre fail_loc=0x80000115 Failing mds1 on oleg259-server Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 03:31:50 (1713425510) shut down Failover mds1 to oleg259-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:32:03 (1713425523) targets are mounted 03:32:03 (1713425523) facet_failover done fail_loc=0 /mnt/lustre/f13.replay-dual /mnt/lustre/f13.replay-dual has type file OK PASS 13 (17s) debug_raw_pointers=0 debug_raw_pointers=0 SKIP: replay-dual test_14b skipping ALWAYS excluded test 14b debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 15a: timeout waiting for lost client during replay, 1 client completes ========================================================== 03:32:07 (1713425527) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre total: 25 open/close in 0.17 seconds: 145.06 ops/second total: 1 open/close in 0.01 seconds: 156.67 ops/second Failing mds1 on oleg259-server Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 03:32:10 (1713425530) shut down Failover mds1 to oleg259-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:32:22 (1713425542) targets are mounted 03:32:22 (1713425542) facet_failover done oleg259-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec - unlinked 0 (time 1713425621 ; total 0 ; last 0) total: 25 unlinks in 0 seconds: inf unlinks/second Starting client: oleg259-client.virtnet: -o user_xattr,flock oleg259-server@tcp:/lustre /mnt/lustre2 PASS 15a (95s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 15c: remove multiple OST orphans ===== 03:33:43 (1713425623) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg259-server Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 03:34:10 (1713425650) shut down Failover mds1 to oleg259-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:34:23 (1713425663) targets are mounted 03:34:23 (1713425663) facet_failover done oleg259-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Starting client: oleg259-client.virtnet: -o user_xattr,flock oleg259-server@tcp:/lustre /mnt/lustre2 PASS 15c (115s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 16: fail MDS during recovery (3571) == 03:35:40 (1713425740) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre total: 25 open/close in 0.26 seconds: 97.91 ops/second total: 1 open/close in 0.01 seconds: 125.09 ops/second Failing mds1 on oleg259-server Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 03:35:43 (1713425743) shut down Failover mds1 to oleg259-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:35:56 (1713425756) targets are mounted 03:35:56 (1713425756) facet_failover done Failing mds1 on oleg259-server Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 03:36:18 (1713425778) shut down Failover mds1 to oleg259-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:36:32 (1713425792) targets are mounted 03:36:32 (1713425792) facet_failover done oleg259-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec - unlinked 0 (time 1713425869 ; total 0 ; last 0) total: 25 unlinks in 0 seconds: inf unlinks/second Starting client: oleg259-client.virtnet: -o user_xattr,flock oleg259-server@tcp:/lustre /mnt/lustre2 PASS 16 (130s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 17: fail OST during recovery (3571) == 03:37:52 (1713425872) total: 25 open/close in 0.31 seconds: 81.47 ops/second total: 1 open/close in 0.01 seconds: 72.16 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing ost1 on oleg259-server Stopping /mnt/lustre-ost1 (opts:) on oleg259-server 03:37:55 (1713425875) shut down Failover ost1 to oleg259-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-OST0000 03:38:08 (1713425888) targets are mounted 03:38:08 (1713425888) facet_failover done Failing ost1 on oleg259-server Stopping /mnt/lustre-ost1 (opts:) on oleg259-server 03:38:29 (1713425909) shut down Failover ost1 to oleg259-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-OST0000 03:38:42 (1713425922) targets are mounted 03:38:42 (1713425922) facet_failover done oleg259-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec - unlinked 0 (time 1713425999 ; total 0 ; last 0) total: 25 unlinks in 0 seconds: inf unlinks/second Starting client: oleg259-client.virtnet: -o user_xattr,flock oleg259-server@tcp:/lustre /mnt/lustre2 PASS 17 (128s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 18: ldlm_handle_enqueue succeeds on evicted export (3822) ========================================================== 03:40:02 (1713426002) debug=+dlmtrace using seed 2728130329 running for 500 iterations total: 500 stats in 0 seconds: inf stats/second fail_loc=0x8000030b ldlm.namespaces.MGC192.168.202.159@tcp.early_lock_cancel=0 ldlm.namespaces.lustre-MDT0000-mdc-ffff8800b67d1800.early_lock_cancel=0 ldlm.namespaces.lustre-MDT0000-mdc-ffff8800b6fdf800.early_lock_cancel=0 ldlm.namespaces.lustre-OST0000-osc-ffff8800b67d1800.early_lock_cancel=0 ldlm.namespaces.lustre-OST0000-osc-ffff8800b6fdf800.early_lock_cancel=0 ldlm.namespaces.lustre-OST0001-osc-ffff8800b67d1800.early_lock_cancel=0 ldlm.namespaces.lustre-OST0001-osc-ffff8800b6fdf800.early_lock_cancel=0 fail_loc=0x80000305 Error in opening file "/mnt/lustre2/d18.replay-dual/f18.replay-dual"(flags=O_RDONLY) 2: No such file or directory ldlm.namespaces.MGC192.168.202.159@tcp.early_lock_cancel=1 ldlm.namespaces.lustre-MDT0000-mdc-ffff8800b67d1800.early_lock_cancel=1 ldlm.namespaces.lustre-MDT0000-mdc-ffff8800b6fdf800.early_lock_cancel=1 ldlm.namespaces.lustre-OST0000-osc-ffff8800b67d1800.early_lock_cancel=1 ldlm.namespaces.lustre-OST0000-osc-ffff8800b6fdf800.early_lock_cancel=1 ldlm.namespaces.lustre-OST0001-osc-ffff8800b67d1800.early_lock_cancel=1 ldlm.namespaces.lustre-OST0001-osc-ffff8800b6fdf800.early_lock_cancel=1 fail_loc=0 fail_loc=0 PASS 18 (46s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 19: resend of open request =========== 03:40:50 (1713426050) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre fail_loc=0x157 - open/close 0 (time 1713426138.84 total 86.04 last 0.00) total: 1 open/close in 86.04 seconds: 0.01 ops/second fail_loc=0 Failing mds1 on oleg259-server Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 03:42:20 (1713426140) shut down Failover mds1 to oleg259-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:42:34 (1713426154) targets are mounted 03:42:34 (1713426154) facet_failover done oleg259-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 19 (109s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 20: recovery time is not increasing == 03:42:40 (1713426160) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg259-server Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 03:42:43 (1713426163) shut down Failover mds1 to oleg259-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:42:57 (1713426177) targets are mounted 03:42:57 (1713426177) facet_failover done oleg259-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Starting client: oleg259-client.virtnet: -o user_xattr,flock oleg259-server@tcp:/lustre /mnt/lustre2 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg259-server Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 03:45:23 (1713426323) shut down Failover mds1 to oleg259-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:45:36 (1713426336) targets are mounted 03:45:36 (1713426336) facet_failover done oleg259-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Starting client: oleg259-client.virtnet: -o user_xattr,flock oleg259-server@tcp:/lustre /mnt/lustre2 PASS 20 (318s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 21a: commit on sharing =============== 03:47:59 (1713426479) mdt.lustre-MDT0000.commit_on_sharing=1 Replay barrier on lustre-MDT0000 Failing mds1 on oleg259-server Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 03:48:02 (1713426482) shut down Failover mds1 to oleg259-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:48:15 (1713426495) targets are mounted 03:48:15 (1713426495) facet_failover done Starting client: oleg259-client.virtnet: -o user_xattr,flock oleg259-server@tcp:/lustre /mnt/lustre2 mdt.lustre-MDT0000.commit_on_sharing=0 PASS 21a (157s) debug_raw_pointers=0 debug_raw_pointers=0 SKIP: replay-dual test_21b skipping SLOW test 21b debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 22a: c1 lfs mkdir -i 1 dir1, M1 drop reply & fail, c2 mkdir dir1/dir ========================================================== 03:50:38 (1713426638) SKIP: replay-dual test_22a needs >= 2 MDTs SKIP 22a (0s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 22b: c1 lfs mkdir -i 1 d1, M1 drop reply & fail M0/M1, c2 mkdir d1/dir ========================================================== 03:50:40 (1713426640) SKIP: replay-dual test_22b needs >= 2 MDTs SKIP 22b (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 22c: c1 lfs mkdir -i 1 d1, M1 drop update & fail M1, c2 mkdir d1/dir ========================================================== 03:50:42 (1713426642) SKIP: replay-dual test_22c needs >= 2 MDTs SKIP 22c (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 22d: c1 lfs mkdir -i 1 d1, M1 drop update & fail M0/M1,c2 mkdir d1/dir ========================================================== 03:50:44 (1713426644) SKIP: replay-dual test_22d needs >= 2 MDTs SKIP 22d (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 23a: c1 rmdir d1, M1 drop reply and fail, client2 mkdir d1 ========================================================== 03:50:46 (1713426646) SKIP: replay-dual test_23a needs >= 2 MDTs SKIP 23a (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 23b: c1 rmdir d1, M1 drop reply and fail M0/M1, c2 mkdir d1 ========================================================== 03:50:49 (1713426649) SKIP: replay-dual test_23b needs >= 2 MDTs SKIP 23b (0s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 23c: c1 rmdir d1, M0 drop update reply and fail M0, c2 mkdir d1 ========================================================== 03:50:51 (1713426651) SKIP: replay-dual test_23c needs >= 2 MDTs SKIP 23c (0s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 23d: c1 rmdir d1, M0 drop update reply and fail M0/M1, c2 mkdir d1 ========================================================== 03:50:53 (1713426653) SKIP: replay-dual test_23d needs >= 2 MDTs SKIP 23d (0s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 24: reconstruct on non-existing object ========================================================== 03:50:55 (1713426655) fail_loc=0x119 fail_loc=0 truncate: cannot truncate '/mnt/lustre/f24.replay-dual' to length 100: No such file or directory PASS 24 (87s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 25: replay|resend ==================== 03:52:23 (1713426743) 1+0 records in 1+0 records out 512 bytes (512 B) copied, 0.00335874 s, 152 kB/s fail_loc=0x304 fail_loc=0x80000325 Failing ost1 on oleg259-server Stopping /mnt/lustre-ost1 (opts:) on oleg259-server 03:52:26 (1713426746) shut down Failover ost1 to oleg259-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-OST0000 03:52:39 (1713426759) targets are mounted 03:52:39 (1713426759) facet_failover done oleg259-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec /home/green/git/lustre-release/lustre/tests/test-framework.sh: line 6515: 15806 Terminated LUSTRE="/home/green/git/lustre-release/lustre" bash -c "multiop /mnt/lustre2/f25.replay-dual Ow512" fail_loc=0 PASS 25 (21s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 26: dbench and tar with mds failover ========================================================== 03:52:46 (1713426766) Starting client oleg259-client.virtnet: -o user_xattr,flock oleg259-server@tcp:/lustre /mnt/lustre Started clients oleg259-client.virtnet: 192.168.202.159@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Started tar loop with pid 17411 Started dbench loop with 17412 looking for dbench program /usr/bin/dbench found dbench client file /usr/share/dbench/client.txt '/usr/share/dbench/client.txt' -> 'client.txt' running 'dbench 1 -D /mnt/lustre2/d26.replay-dual/run_dbench -t 100' on /mnt/lustre at Thu Apr 18 03:52:47 EDT 2024 waiting for dbench pid 17443 dbench version 4.00 - Copyright Andrew Tridgell 1999-2004 Running for 100 seconds with load 'client.txt' and minimum warmup 20 secs failed to create barrier semaphore 0 of 1 processes prepared for launch 0 sec 1 of 1 processes prepared for launch 0 sec releasing clients 1 267 9.99 MB/sec warmup 1 sec latency 18.692 ms UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3456 2205184 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3681280 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3698688 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7379968 1% /mnt/lustre 1 511 8.86 MB/sec warmup 2 sec latency 20.628 ms 1 801 7.16 MB/sec warmup 3 sec latency 51.026 ms test_26 fail mds1 1 times 1 1128 5.75 MB/sec warmup 4 sec latency 39.209 ms Failing mds1 on oleg259-server Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 1 1467 5.21 MB/sec warmup 5 sec latency 180.210 ms 03:52:53 (1713426773) shut down 1 1467 4.34 MB/sec warmup 6 sec latency 1180.385 ms 1 1467 3.72 MB/sec warmup 7 sec latency 2180.534 ms 1 1467 3.25 MB/sec warmup 8 sec latency 3180.723 ms 1 1467 2.89 MB/sec warmup 9 sec latency 4180.899 ms 1 1467 2.60 MB/sec warmup 10 sec latency 5181.112 ms 1 1467 2.37 MB/sec warmup 11 sec latency 6181.311 ms 1 1467 2.17 MB/sec warmup 12 sec latency 7181.492 ms 1 1467 2.00 MB/sec warmup 13 sec latency 8181.733 ms 1 1467 1.86 MB/sec warmup 14 sec latency 9181.928 ms 1 1467 1.74 MB/sec warmup 15 sec latency 10182.140 ms Failover mds1 to oleg259-server mount facets: mds1 1 1467 1.63 MB/sec warmup 16 sec latency 11182.274 ms Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 1 1467 1.53 MB/sec warmup 17 sec latency 12182.398 ms oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:53:05 (1713426785) targets are mounted 03:53:05 (1713426785) facet_failover done 1 1467 1.45 MB/sec warmup 18 sec latency 13182.542 ms 1 1467 1.37 MB/sec warmup 19 sec latency 14182.681 ms 1 1467 0.00 MB/sec execute 1 sec latency 16183.007 ms 1 1467 0.00 MB/sec execute 2 sec latency 17183.224 ms 1 1467 0.00 MB/sec execute 3 sec latency 18183.444 ms 1 1467 0.00 MB/sec execute 4 sec latency 19183.584 ms 1 1493 0.00 MB/sec execute 5 sec latency 20065.208 ms oleg259-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1 1775 0.04 MB/sec execute 6 sec latency 37.208 ms 1 2108 0.09 MB/sec execute 7 sec latency 41.136 ms 1 2402 0.24 MB/sec execute 8 sec latency 42.601 ms 1 2896 0.75 MB/sec execute 9 sec latency 45.329 ms UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 4864 2203648 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 43008 3553280 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 13312 3591168 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 56320 7144448 1% /mnt/lustre 1 3352 0.93 MB/sec execute 10 sec latency 38.559 ms 1 3767 1.25 MB/sec execute 11 sec latency 14.407 ms 1 3992 1.17 MB/sec execute 12 sec latency 15.605 ms test_26 fail mds1 2 times Failing mds1 on oleg259-server 1 4245 1.10 MB/sec execute 13 sec latency 105.805 ms Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 03:53:21 (1713426801) shut down 1 4305 1.03 MB/sec execute 14 sec latency 680.490 ms 1 4305 0.96 MB/sec execute 15 sec latency 1680.672 ms 1 4305 0.90 MB/sec execute 16 sec latency 2680.854 ms 1 4305 0.85 MB/sec execute 17 sec latency 3681.115 ms 1 4305 0.80 MB/sec execute 18 sec latency 4681.288 ms 1 4305 0.76 MB/sec execute 19 sec latency 5681.486 ms 1 4305 0.72 MB/sec execute 20 sec latency 6681.718 ms 1 4305 0.69 MB/sec execute 21 sec latency 7681.924 ms 1 4305 0.65 MB/sec execute 22 sec latency 8682.099 ms 1 4305 0.63 MB/sec execute 23 sec latency 9682.273 ms 1 4305 0.60 MB/sec execute 24 sec latency 10682.467 ms Failover mds1 to oleg259-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 1 4305 0.58 MB/sec execute 25 sec latency 11682.664 ms oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all 1 4305 0.55 MB/sec execute 26 sec latency 12682.874 ms pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:53:34 (1713426814) targets are mounted 03:53:34 (1713426814) facet_failover done 1 4305 0.53 MB/sec execute 27 sec latency 13683.058 ms 1 4305 0.51 MB/sec execute 28 sec latency 14683.308 ms 1 4305 0.50 MB/sec execute 29 sec latency 15683.557 ms 1 4305 0.48 MB/sec execute 30 sec latency 16683.785 ms 1 4305 0.46 MB/sec execute 31 sec latency 17683.953 ms 1 4305 0.45 MB/sec execute 32 sec latency 18684.133 ms 1 4305 0.44 MB/sec execute 33 sec latency 19684.329 ms 1 4305 0.42 MB/sec execute 34 sec latency 20684.467 ms 1 4330 0.41 MB/sec execute 35 sec latency 21584.207 ms oleg259-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1 4557 0.42 MB/sec execute 36 sec latency 39.365 ms 1 4969 0.52 MB/sec execute 37 sec latency 14.800 ms 1 5227 0.51 MB/sec execute 38 sec latency 57.536 ms 1 5549 0.50 MB/sec execute 39 sec latency 41.183 ms UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 6400 2202240 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 48128 3472384 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 17408 3526656 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 65536 6999040 1% /mnt/lustre 1 5916 0.52 MB/sec execute 40 sec latency 61.858 ms 1 6323 0.61 MB/sec execute 41 sec latency 48.247 ms 1 6831 0.68 MB/sec execute 42 sec latency 43.675 ms test_26 fail mds1 3 times Failing mds1 on oleg259-server Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 1 7242 0.74 MB/sec execute 43 sec latency 24.931 ms 03:53:51 (1713426831) shut down 1 7269 0.72 MB/sec execute 44 sec latency 904.054 ms 1 7269 0.71 MB/sec execute 45 sec latency 1904.235 ms 1 7269 0.69 MB/sec execute 46 sec latency 2904.422 ms 1 7269 0.68 MB/sec execute 47 sec latency 3904.604 ms 1 7269 0.66 MB/sec execute 48 sec latency 4904.802 ms 1 7269 0.65 MB/sec execute 49 sec latency 5905.055 ms 1 7269 0.64 MB/sec execute 50 sec latency 6905.224 ms 1 7269 0.62 MB/sec execute 51 sec latency 7905.409 ms 1 7269 0.61 MB/sec execute 52 sec latency 8905.600 ms 1 7269 0.60 MB/sec execute 53 sec latency 9905.793 ms Failover mds1 to oleg259-server mount facets: mds1 1 7269 0.59 MB/sec execute 54 sec latency 10906.003 ms Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 1 7269 0.58 MB/sec execute 55 sec latency 11906.160 ms oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all 1 7269 0.57 MB/sec execute 56 sec latency 12906.362 ms pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:54:04 (1713426844) targets are mounted 03:54:04 (1713426844) facet_failover done 1 7269 0.56 MB/sec execute 57 sec latency 13906.564 ms 1 7269 0.55 MB/sec execute 58 sec latency 14906.871 ms 1 7269 0.54 MB/sec execute 59 sec latency 15907.082 ms 1 7269 0.53 MB/sec execute 60 sec latency 16907.259 ms 1 7269 0.52 MB/sec execute 61 sec latency 17907.450 ms 1 7269 0.51 MB/sec execute 62 sec latency 18907.661 ms 1 7269 0.51 MB/sec execute 63 sec latency 19907.854 ms 1 7269 0.50 MB/sec execute 64 sec latency 20908.072 ms 1 7269 0.49 MB/sec execute 65 sec latency 21908.255 ms oleg259-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1 7476 0.50 MB/sec execute 66 sec latency 21999.212 ms 1 7674 0.50 MB/sec execute 67 sec latency 23.577 ms 1 7882 0.49 MB/sec execute 68 sec latency 92.792 ms 1 8094 0.49 MB/sec execute 69 sec latency 56.616 ms UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 8064 2200576 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 49152 3437568 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 18432 3437568 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 67584 6875136 1% /mnt/lustre 1 8489 0.55 MB/sec execute 70 sec latency 18.885 ms 1 8740 0.54 MB/sec execute 71 sec latency 43.500 ms 1 9049 0.54 MB/sec execute 72 sec latency 43.679 ms test_26 fail mds1 4 times 1 9432 0.55 MB/sec execute 73 sec latency 14.953 ms Failing mds1 on oleg259-server Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 1 9544 0.54 MB/sec execute 74 sec latency 469.891 ms 03:54:21 (1713426861) shut down 1 9544 0.54 MB/sec execute 75 sec latency 1470.064 ms 1 9544 0.53 MB/sec execute 76 sec latency 2470.254 ms 1 9544 0.52 MB/sec execute 77 sec latency 3470.453 ms 1 9544 0.52 MB/sec execute 78 sec latency 4470.729 ms 1 9544 0.51 MB/sec execute 79 sec latency 5470.940 ms 1 9544 0.50 MB/sec execute 80 sec latency 6471.144 ms 1 9544 0.50 MB/sec execute 81 sec latency 7471.340 ms 1 9544 0.49 MB/sec execute 82 sec latency 8471.551 ms 1 9544 0.49 MB/sec execute 83 sec latency 9471.733 ms 1 9544 0.48 MB/sec execute 84 sec latency 10471.917 ms Failover mds1 to oleg259-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 1 9544 0.47 MB/sec execute 85 sec latency 11472.037 ms 1 9544 0.47 MB/sec execute 86 sec latency 12472.207 ms oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:54:34 (1713426874) targets are mounted 03:54:34 (1713426874) facet_failover done 1 9544 0.46 MB/sec execute 87 sec latency 13472.409 ms 1 9544 0.46 MB/sec execute 88 sec latency 14472.533 ms 1 9544 0.45 MB/sec execute 89 sec latency 15472.722 ms 1 9544 0.45 MB/sec execute 90 sec latency 16472.913 ms 1 9544 0.44 MB/sec execute 91 sec latency 17473.134 ms 1 9544 0.44 MB/sec execute 92 sec latency 18473.325 ms 1 9544 0.43 MB/sec execute 93 sec latency 19473.527 ms 1 9544 0.43 MB/sec execute 94 sec latency 20473.673 ms 1 9561 0.42 MB/sec execute 95 sec latency 21402.501 ms oleg259-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1 10031 0.48 MB/sec execute 96 sec latency 41.977 ms 1 10607 0.52 MB/sec execute 97 sec latency 29.826 ms 1 10902 0.53 MB/sec execute 98 sec latency 15.433 ms 1 11134 0.53 MB/sec execute 99 sec latency 14.848 ms UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 9216 2199424 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 51200 3381248 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 20480 3414016 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 71680 6795264 2% /mnt/lustre 1 cleanup 100 sec 0 cleanup 101 sec Operation Count AvgLat MaxLat ---------------------------------------- NTCreateX 1724 33.700 21999.198 Close 1249 34.959 21584.184 Rename 69 11.539 17.364 Unlink 355 4.663 17.459 Qpathinfo 1593 2.423 26.971 Qfileinfo 274 0.451 2.216 Qfsinfo 282 0.523 3.736 Sfileinfo 120 6.292 9.959 Find 603 1.348 13.206 WriteX 837 2.010 5.654 ReadX 2685 0.139 29.173 LockX 6 1.518 1.845 UnlockX 6 1.574 1.855 Flush 111 28.024 105.789 Throughput 0.52789 MB/sec 1 clients 1 procs max_latency=21999.212 ms stopping dbench on /mnt/lustre at Thu Apr 18 03:54:48 EDT 2024 with return code 0 clean dbench files on /mnt/lustre /mnt/lustre /mnt/lustre removed 'client.txt' /mnt/lustre dbench successfully finished looking for dbench program /usr/bin/dbench found dbench client file /usr/share/dbench/client.txt '/usr/share/dbench/client.txt' -> 'client.txt' running 'dbench 1 -D /mnt/lustre2/d26.replay-dual/run_dbench -t 100' on /mnt/lustre at Thu Apr 18 03:54:49 EDT 2024 waiting for dbench pid 21627 dbench version 4.00 - Copyright Andrew Tridgell 1999-2004 Running for 100 seconds with load 'client.txt' and minimum warmup 20 secs 0 of 1 processes prepared for launch 0 sec 1 of 1 processes prepared for launch 0 sec releasing clients test_26 fail mds1 5 times Failing mds1 on oleg259-server 1 261 9.86 MB/sec warmup 1 sec latency 20.759 ms Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 03:54:51 (1713426891) shut down 1 330 6.00 MB/sec warmup 2 sec latency 714.806 ms 1 330 4.00 MB/sec warmup 3 sec latency 1715.031 ms 1 330 3.00 MB/sec warmup 4 sec latency 2715.236 ms 1 330 2.40 MB/sec warmup 5 sec latency 3715.414 ms 1 330 2.00 MB/sec warmup 6 sec latency 4715.606 ms 1 330 1.71 MB/sec warmup 7 sec latency 5715.794 ms 1 330 1.50 MB/sec warmup 8 sec latency 6715.965 ms 1 330 1.33 MB/sec warmup 9 sec latency 7716.199 ms 1 330 1.20 MB/sec warmup 10 sec latency 8716.411 ms 1 330 1.09 MB/sec warmup 11 sec latency 9716.615 ms 1 330 1.00 MB/sec warmup 12 sec latency 10716.782 ms Failover mds1 to oleg259-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 1 330 0.92 MB/sec warmup 13 sec latency 11716.959 ms oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all 1 330 0.86 MB/sec warmup 14 sec latency 12717.131 ms pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:55:04 (1713426904) targets are mounted 03:55:04 (1713426904) facet_failover done 1 330 0.80 MB/sec warmup 15 sec latency 13717.322 ms 1 330 0.75 MB/sec warmup 16 sec latency 14717.544 ms 1 330 0.71 MB/sec warmup 17 sec latency 15717.734 ms 1 330 0.67 MB/sec warmup 18 sec latency 16717.936 ms 1 330 0.63 MB/sec warmup 19 sec latency 17718.153 ms 1 330 0.00 MB/sec execute 1 sec latency 19718.484 ms 1 330 0.00 MB/sec execute 2 sec latency 20718.633 ms 1 365 0.35 MB/sec execute 3 sec latency 21532.278 ms oleg259-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1 664 2.29 MB/sec execute 4 sec latency 18.187 ms 1 873 1.93 MB/sec execute 5 sec latency 33.221 ms 1 1133 1.83 MB/sec execute 6 sec latency 37.726 ms 1 1486 2.01 MB/sec execute 7 sec latency 59.572 ms UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 10880 2197760 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 28672 3382272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 43008 3399680 2% /mnt/lustre[OST:1] filesystem_summary: 7542784 71680 6781952 2% /mnt/lustre 1 1791 1.79 MB/sec execute 8 sec latency 47.712 ms 1 2139 1.63 MB/sec execute 9 sec latency 32.265 ms 1 2440 1.61 MB/sec execute 10 sec latency 50.910 ms test_26 fail mds1 6 times Failing mds1 on oleg259-server Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 1 2922 1.95 MB/sec execute 11 sec latency 39.714 ms 03:55:21 (1713426921) shut down 1 2975 1.79 MB/sec execute 12 sec latency 838.114 ms 1 2975 1.65 MB/sec execute 13 sec latency 1838.334 ms 1 2975 1.54 MB/sec execute 14 sec latency 2838.544 ms 1 2975 1.43 MB/sec execute 15 sec latency 3838.739 ms 1 2975 1.34 MB/sec execute 16 sec latency 4838.921 ms 1 2975 1.26 MB/sec execute 17 sec latency 5839.097 ms 1 2975 1.19 MB/sec execute 18 sec latency 6839.272 ms 1 2975 1.13 MB/sec execute 19 sec latency 7839.503 ms 1 2975 1.08 MB/sec execute 20 sec latency 8839.682 ms 1 2975 1.02 MB/sec execute 21 sec latency 9839.878 ms Failover mds1 to oleg259-server mount facets: mds1 1 2975 0.98 MB/sec execute 22 sec latency 10840.048 ms Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 1 2975 0.93 MB/sec execute 23 sec latency 11840.258 ms oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all 1 2975 0.90 MB/sec execute 24 sec latency 12840.498 ms pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:55:33 (1713426933) targets are mounted 03:55:33 (1713426933) facet_failover done 1 2975 0.86 MB/sec execute 25 sec latency 13840.676 ms 1 2975 0.83 MB/sec execute 26 sec latency 14840.882 ms 1 2975 0.80 MB/sec execute 27 sec latency 15841.018 ms 1 2975 0.77 MB/sec execute 28 sec latency 16841.157 ms oleg259-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1 3366 0.81 MB/sec execute 29 sec latency 16965.424 ms 1 3724 0.89 MB/sec execute 30 sec latency 16.081 ms 1 3970 0.91 MB/sec execute 31 sec latency 18.320 ms 1 4233 0.88 MB/sec execute 32 sec latency 23.627 ms UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 10624 2198016 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 33792 3550208 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 47104 3536896 2% /mnt/lustre[OST:1] filesystem_summary: 7542784 80896 7087104 2% /mnt/lustre 1 4372 0.87 MB/sec execute 33 sec latency 139.562 ms 1 4632 0.88 MB/sec execute 34 sec latency 45.817 ms 1 5031 0.95 MB/sec execute 35 sec latency 14.783 ms test_26 fail mds1 7 times 1 5292 0.93 MB/sec execute 36 sec latency 47.085 ms Failing mds1 on oleg259-server Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 1 5449 0.90 MB/sec execute 37 sec latency 489.394 ms 03:55:46 (1713426946) shut down 1 5449 0.88 MB/sec execute 38 sec latency 1489.608 ms 1 5449 0.86 MB/sec execute 39 sec latency 2489.799 ms 1 5449 0.84 MB/sec execute 40 sec latency 3489.984 ms 1 5449 0.82 MB/sec execute 41 sec latency 4490.161 ms 1 5449 0.80 MB/sec execute 42 sec latency 5490.338 ms 1 5449 0.78 MB/sec execute 43 sec latency 6490.542 ms 1 5449 0.76 MB/sec execute 44 sec latency 7490.753 ms 1 5449 0.74 MB/sec execute 45 sec latency 8490.988 ms 1 5449 0.73 MB/sec execute 46 sec latency 9491.172 ms 1 5449 0.71 MB/sec execute 47 sec latency 10491.348 ms Failover mds1 to oleg259-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 1 5449 0.70 MB/sec execute 48 sec latency 11491.596 ms 1 5449 0.68 MB/sec execute 49 sec latency 12491.799 ms oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:55:59 (1713426959) targets are mounted 03:55:59 (1713426959) facet_failover done 1 5449 0.67 MB/sec execute 50 sec latency 13491.992 ms 1 5449 0.66 MB/sec execute 51 sec latency 14492.205 ms 1 5449 0.64 MB/sec execute 52 sec latency 15492.360 ms 1 5470 0.63 MB/sec execute 53 sec latency 16333.266 ms oleg259-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1 5820 0.63 MB/sec execute 54 sec latency 13.877 ms 1 6118 0.65 MB/sec execute 55 sec latency 78.852 ms 1 6568 0.73 MB/sec execute 56 sec latency 41.480 ms 1 7163 0.80 MB/sec execute 57 sec latency 37.621 ms UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 8576 2200064 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 36864 3700736 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 48128 3708928 2% /mnt/lustre[OST:1] filesystem_summary: 7542784 84992 7409664 2% /mnt/lustre 1 7407 0.81 MB/sec execute 58 sec latency 17.776 ms 1 7623 0.80 MB/sec execute 59 sec latency 21.343 ms 1 7860 0.79 MB/sec execute 60 sec latency 81.305 ms test_26 fail mds1 8 times Failing mds1 on oleg259-server 1 8091 0.79 MB/sec execute 61 sec latency 41.336 ms Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 03:56:11 (1713426971) shut down 1 8190 0.79 MB/sec execute 62 sec latency 667.879 ms 1 8190 0.78 MB/sec execute 63 sec latency 1668.018 ms 1 8190 0.77 MB/sec execute 64 sec latency 2668.183 ms 1 8190 0.76 MB/sec execute 65 sec latency 3668.383 ms 1 8190 0.74 MB/sec execute 66 sec latency 4668.591 ms 1 8190 0.73 MB/sec execute 67 sec latency 5668.790 ms 1 8190 0.72 MB/sec execute 68 sec latency 6668.986 ms 1 8190 0.71 MB/sec execute 69 sec latency 7669.194 ms 1 8190 0.70 MB/sec execute 70 sec latency 8669.399 ms 1 8190 0.69 MB/sec execute 71 sec latency 9669.599 ms 1 8190 0.68 MB/sec execute 72 sec latency 10669.818 ms Failover mds1 to oleg259-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 1 8190 0.67 MB/sec execute 73 sec latency 11670.002 ms oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all 1 8190 0.66 MB/sec execute 74 sec latency 12670.143 ms pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:56:24 (1713426984) targets are mounted 03:56:24 (1713426984) facet_failover done 1 8190 0.66 MB/sec execute 75 sec latency 13670.301 ms 1 8190 0.65 MB/sec execute 76 sec latency 14670.512 ms 1 8190 0.64 MB/sec execute 77 sec latency 15670.661 ms 1 8230 0.63 MB/sec execute 78 sec latency 16517.197 ms oleg259-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1 8577 0.66 MB/sec execute 79 sec latency 17.976 ms 1 8829 0.66 MB/sec execute 80 sec latency 72.539 ms 1 9147 0.65 MB/sec execute 81 sec latency 54.851 ms 1 9498 0.66 MB/sec execute 82 sec latency 67.487 ms 1 9881 0.70 MB/sec execute 83 sec latency 56.547 ms 1 10438 0.73 MB/sec execute 84 sec latency 37.855 ms 1 10827 0.77 MB/sec execute 85 sec latency 14.126 ms 1 11084 0.77 MB/sec execute 86 sec latency 14.622 ms 1 11330 0.76 MB/sec execute 87 sec latency 120.379 ms dbench killed by signal 15 stopping dbench on /mnt/lustre at Thu Apr 18 03:56:37 EDT 2024 with return code 0 21627 pts/0 S+ 0:00 dbench -c client.txt 1 -D /mnt/lustre2/d26.replay-dual/run_dbench -t 100 killed dbench main pid 21627 clean dbench files on /mnt/lustre /mnt/lustre /mnt/lustre removed 'client.txt' /mnt/lustre dbench successfully finished PASS 26 (233s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 28: lock replay should be ordered: waiting after granted ========================================================== 03:56:40 (1713427000) 1+0 records in 1+0 records out 4096 bytes (4.1 kB) copied, 0.00269082 s, 1.5 MB/s fail_loc=0x80000324 fail_loc=0x32a Failing ost1 on oleg259-server Stopping /mnt/lustre-ost1 (opts:) on oleg259-server 03:56:50 (1713427010) shut down Failover ost1 to oleg259-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-OST0000 03:57:03 (1713427023) targets are mounted 03:57:03 (1713427023) facet_failover done 1+0 records in 1+0 records out 4096 bytes (4.1 kB) copied, 0.00713744 s, 574 kB/s oleg259-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec PASS 28 (30s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 29: replay vs update with the same xid ========================================================== 03:57:11 (1713427031) SKIP: replay-dual test_29 needs >= 2 MDTs SKIP 29 (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 30: layout lock replay is not blocked on IO ========================================================== 03:57:13 (1713427033) 10+0 records in 10+0 records out 40960 bytes (41 kB) copied, 0.00912722 s, 4.5 MB/s 10+0 records in 10+0 records out 40960 bytes (41 kB) copied, 0.00946889 s, 4.3 MB/s fail_loc=0x32e fail_val=4 Failing mds1 on oleg259-server Stopping /mnt/lustre-mds1 (opts:) on oleg259-server 03:57:16 (1713427036) shut down Failover mds1 to oleg259-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-MDT0000 03:57:29 (1713427049) targets are mounted 03:57:29 (1713427049) facet_failover done 160+0 records in 160+0 records out 81920 bytes (82 kB) copied, 21.111 s, 3.9 kB/s oleg259-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 30 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 31: deadlock on file_remove_privs and occupied mod rpc slots ========================================================== 03:57:40 (1713427060) Failing ost1 on oleg259-server Stopping /mnt/lustre-ost1 (opts:) on oleg259-server 03:57:41 (1713427061) shut down Creating to objid 3233 on ost lustre-OST0000... total: 32 open/close in 0.17 seconds: 189.84 ops/second at_max=0 fail_loc=0x80001420 file /mnt/lustre2/d31.replay-dual/mdtdir/f31.replay-dual is ready Failover ost1 to oleg259-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg259-server: oleg259-server.virtnet: executing set_default_debug -1 all pdsh@oleg259-client: oleg259-server: ssh exited with exit code 1 Started lustre-OST0000 03:57:54 (1713427074) targets are mounted 03:57:54 (1713427074) facet_failover done oleg259-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in IDLE FULL state after 0 sec pids: 30236 30237 30242 30243 30244 30245 30246 30247 30248 at_max=600 PASS 31 (19s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 32: gap in update llog shouldn't break recovery ========================================================== 03:58:01 (1713427081) SKIP: replay-dual test_32 needs >= 2 MDTs SKIP 32 (0s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 33: Check for OBD_INCOMPAT_MULTI_RPCS in last_rcvd after abort_recovery ========================================================== 03:58:03 (1713427083) SKIP: replay-dual test_33 ldiskfs only test SKIP 33 (1s) debug_raw_pointers=0 debug_raw_pointers=0 == replay-dual test complete, duration 2090 sec ========== 03:58:05 (1713427085) === replay-dual: start cleanup 03:58:05 (1713427085) === Stopping clients: oleg259-client.virtnet /mnt/lustre2 (opts:) Stopping client oleg259-client.virtnet /mnt/lustre2 opts: === replay-dual: finish cleanup 03:58:06 (1713427086) ===