-----============= acceptance-small: replay-dual ============----- Wed Apr 17 10:29:33 EDT 2024 excepting tests: 14b 21b 21b skipping tests SLOW=no: 21b === replay-dual: start setup 10:29:37 (1713364177) === Starting client oleg443-client.virtnet: -o user_xattr,flock oleg443-server@tcp:/lustre /mnt/lustre2 Started clients oleg443-client.virtnet: 192.168.204.143@tcp:/lustre on /mnt/lustre2 type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) oleg443-client.virtnet: executing check_config_client /mnt/lustre oleg443-client.virtnet: Checking config lustre mounted on /mnt/lustre Checking servers environments Checking clients oleg443-client.virtnet environments Using TIMEOUT=20 osc.lustre-OST0000-osc-ffff8800aa67b000.idle_timeout=debug osc.lustre-OST0000-osc-ffff88012a66c000.idle_timeout=debug osc.lustre-OST0001-osc-ffff8800aa67b000.idle_timeout=debug osc.lustre-OST0001-osc-ffff88012a66c000.idle_timeout=debug disable quota as required oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all === replay-dual: finish setup 10:29:43 (1713364183) === debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 0a: expired recovery with lost client ========================================================== 10:29:45 (1713364185) Check file is LU482_FAILED=/tmp/replay-dual.lu482.RW4t88 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre total: 50 open/close in 0.70 seconds: 71.43 ops/second fail_loc=0x80000514 Failing mds1 on oleg443-server Stopping /mnt/lustre-mds1 (opts:) on oleg443-server 10:29:49 (1713364189) shut down Failover mds1 to oleg443-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-MDT0000 10:30:01 (1713364201) targets are mounted 10:30:01 (1713364201) facet_failover done Starting client: oleg443-client.virtnet: -o user_xattr,flock oleg443-server@tcp:/lustre /mnt/lustre2 - unlinked 0 (time 1713364283 ; total 1 ; last 1) total: 50 unlinks in 1 seconds: 50.000000 unlinks/second PASS 0a (99s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 0b: lost client during waiting for next transno ========================================================== 10:31:25 (1713364285) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg443-server Stopping /mnt/lustre-mds1 (opts:) on oleg443-server 10:31:28 (1713364288) shut down Failover mds1 to oleg443-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-MDT0000 10:31:41 (1713364301) targets are mounted 10:31:41 (1713364301) facet_failover done Starting client: oleg443-client.virtnet: -o user_xattr,flock oleg443-server@tcp:/lustre /mnt/lustre Starting client: oleg443-client.virtnet: -o user_xattr,flock oleg443-server@tcp:/lustre /mnt/lustre2 PASS 0b (87s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 1: |X| simple create ================= 10:32:54 (1713364374) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg443-server Stopping /mnt/lustre-mds1 (opts:) on oleg443-server 10:32:56 (1713364376) shut down Failover mds1 to oleg443-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-MDT0000 10:33:09 (1713364389) targets are mounted 10:33:09 (1713364389) facet_failover done oleg443-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 1 (20s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 2: |X| mkdir adir ==================== 10:33:16 (1713364396) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg443-server Stopping /mnt/lustre-mds1 (opts:) on oleg443-server 10:33:18 (1713364398) shut down Failover mds1 to oleg443-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-MDT0000 10:33:31 (1713364411) targets are mounted 10:33:31 (1713364411) facet_failover done oleg443-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 2 (19s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 3: |X| mkdir adir, mkdir adir/bdir === 10:33:37 (1713364417) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg443-server Stopping /mnt/lustre-mds1 (opts:) on oleg443-server 10:33:39 (1713364419) shut down Failover mds1 to oleg443-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-MDT0000 10:33:52 (1713364432) targets are mounted 10:33:52 (1713364432) facet_failover done oleg443-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 3 (21s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 4: |X| mkdir adir (-EEXIST), mkdir adir/bdir ========================================================== 10:34:00 (1713364440) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre mkdir: cannot create directory '/mnt/lustre/adir': File exists Failing mds1 on oleg443-server Stopping /mnt/lustre-mds1 (opts:) on oleg443-server 10:34:02 (1713364442) shut down Failover mds1 to oleg443-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-MDT0000 10:34:16 (1713364456) targets are mounted 10:34:16 (1713364456) facet_failover done oleg443-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 4 (22s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 5: open, unlink |X| close ============ 10:34:25 (1713364465) multiop /mnt/lustre2/a vo_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.6881 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg443-server Stopping /mnt/lustre-mds1 (opts:) on oleg443-server 10:34:28 (1713364468) shut down Failover mds1 to oleg443-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-MDT0000 10:34:41 (1713364481) targets are mounted 10:34:41 (1713364481) facet_failover done oleg443-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 5 (21s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 6: open1, open2, unlink |X| close1 [fail mds1] close2 ========================================================== 10:34:47 (1713364487) multiop /mnt/lustre2/a vo_c TMPPIPE=/tmp/multiop_open_wait_pipe.6881 multiop /mnt/lustre/a vo_c TMPPIPE=/tmp/multiop_open_wait_pipe.6881 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg443-server Stopping /mnt/lustre-mds1 (opts:) on oleg443-server 10:34:50 (1713364490) shut down Failover mds1 to oleg443-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-MDT0000 10:35:03 (1713364503) targets are mounted 10:35:03 (1713364503) facet_failover done oleg443-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 6 (21s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 8: replay of resent request ========== 10:35:09 (1713364509) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre fail_loc=0x119 fail_loc=0 Failing mds1 on oleg443-server Stopping /mnt/lustre-mds1 (opts:) on oleg443-server 10:35:28 (1713364528) shut down Failover mds1 to oleg443-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-MDT0000 10:35:41 (1713364541) targets are mounted 10:35:41 (1713364541) facet_failover done oleg443-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 8 (36s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 9: resending a replayed create ======= 10:35:47 (1713364547) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre fail_loc=0x80000119 Failing mds1 on oleg443-server Stopping /mnt/lustre-mds1 (opts:) on oleg443-server 10:35:50 (1713364550) shut down Failover mds1 to oleg443-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-MDT0000 10:36:03 (1713364563) targets are mounted 10:36:03 (1713364563) facet_failover done oleg443-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0 PASS 9 (38s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 10: resending a replayed unlink ====== 10:36:27 (1713364587) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3200 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre fail_loc=0x80000119 Failing mds1 on oleg443-server Stopping /mnt/lustre-mds1 (opts:) on oleg443-server 10:36:30 (1713364590) shut down Failover mds1 to oleg443-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-MDT0000 10:36:43 (1713364603) targets are mounted 10:36:43 (1713364603) facet_failover done oleg443-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0 PASS 10 (42s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 11: both clients timeout during replay ========================================================== 10:37:11 (1713364631) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre fail_loc=0x0119 Failing mds1 on oleg443-server Stopping /mnt/lustre-mds1 (opts:) on oleg443-server 10:37:15 (1713364635) shut down Failover mds1 to oleg443-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-MDT0000 10:37:28 (1713364648) targets are mounted 10:37:28 (1713364648) facet_failover done oleg443-client.virtnet: executing wait_import_state_mount FULL mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0 PASS 11 (35s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 12: open resend timeout ============== 10:37:48 (1713364668) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre multiop /mnt/lustre/f12.replay-dual vmo_c TMPPIPE=/tmp/multiop_open_wait_pipe.6881 fail_loc=0x80000302 Failing mds1 on oleg443-server Stopping /mnt/lustre-mds1 (opts:) on oleg443-server 10:37:52 (1713364672) shut down Failover mds1 to oleg443-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-MDT0000 10:38:05 (1713364685) targets are mounted 10:38:05 (1713364685) facet_failover done fail_loc=0 /mnt/lustre/f12.replay-dual /mnt/lustre/f12.replay-dual has type file OK PASS 12 (33s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 13: close resend timeout ============= 10:38:23 (1713364703) multiop /mnt/lustre/f13.replay-dual vmo_c TMPPIPE=/tmp/multiop_open_wait_pipe.6881 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre fail_loc=0x80000115 Failing mds1 on oleg443-server Stopping /mnt/lustre-mds1 (opts:) on oleg443-server 10:38:26 (1713364706) shut down Failover mds1 to oleg443-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-MDT0000 10:38:39 (1713364719) targets are mounted 10:38:39 (1713364719) facet_failover done fail_loc=0 /mnt/lustre/f13.replay-dual /mnt/lustre/f13.replay-dual has type file OK PASS 13 (18s) debug_raw_pointers=0 debug_raw_pointers=0 SKIP: replay-dual test_14b skipping ALWAYS excluded test 14b debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 15a: timeout waiting for lost client during replay, 1 client completes ========================================================== 10:38:43 (1713364723) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre total: 25 open/close in 0.23 seconds: 110.06 ops/second total: 1 open/close in 0.01 seconds: 112.32 ops/second Failing mds1 on oleg443-server Stopping /mnt/lustre-mds1 (opts:) on oleg443-server 10:38:46 (1713364726) shut down Failover mds1 to oleg443-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-MDT0000 10:38:59 (1713364739) targets are mounted 10:38:59 (1713364739) facet_failover done oleg443-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec - unlinked 0 (time 1713364812 ; total 0 ; last 0) total: 25 unlinks in 0 seconds: inf unlinks/second Starting client: oleg443-client.virtnet: -o user_xattr,flock oleg443-server@tcp:/lustre /mnt/lustre2 PASS 15a (90s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 15c: remove multiple OST orphans ===== 10:40:14 (1713364814) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg443-server Stopping /mnt/lustre-mds1 (opts:) on oleg443-server 10:40:40 (1713364840) shut down Failover mds1 to oleg443-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-MDT0000 10:40:52 (1713364852) targets are mounted 10:40:52 (1713364852) facet_failover done oleg443-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Starting client: oleg443-client.virtnet: -o user_xattr,flock oleg443-server@tcp:/lustre /mnt/lustre2 PASS 15c (110s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 16: fail MDS during recovery (3571) == 10:42:06 (1713364926) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre total: 25 open/close in 0.17 seconds: 148.09 ops/second total: 1 open/close in 0.01 seconds: 167.71 ops/second Failing mds1 on oleg443-server Stopping /mnt/lustre-mds1 (opts:) on oleg443-server 10:42:09 (1713364929) shut down Failover mds1 to oleg443-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-MDT0000 10:42:21 (1713364941) targets are mounted 10:42:21 (1713364941) facet_failover done Failing mds1 on oleg443-server Stopping /mnt/lustre-mds1 (opts:) on oleg443-server 10:42:42 (1713364962) shut down Failover mds1 to oleg443-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-MDT0000 10:42:55 (1713364975) targets are mounted 10:42:55 (1713364975) facet_failover done oleg443-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec - unlinked 0 (time 1713365054 ; total 1 ; last 1) total: 25 unlinks in 1 seconds: 25.000000 unlinks/second Starting client: oleg443-client.virtnet: -o user_xattr,flock oleg443-server@tcp:/lustre /mnt/lustre2 PASS 16 (129s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 17: fail OST during recovery (3571) == 10:44:16 (1713365056) total: 25 open/close in 0.18 seconds: 135.34 ops/second total: 1 open/close in 0.01 seconds: 100.73 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing ost1 on oleg443-server Stopping /mnt/lustre-ost1 (opts:) on oleg443-server 10:44:19 (1713365059) shut down Failover ost1 to oleg443-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-OST0000 10:44:32 (1713365072) targets are mounted 10:44:32 (1713365072) facet_failover done Failing ost1 on oleg443-server Stopping /mnt/lustre-ost1 (opts:) on oleg443-server 10:44:53 (1713365093) shut down Failover ost1 to oleg443-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-OST0000 10:45:06 (1713365106) targets are mounted 10:45:06 (1713365106) facet_failover done oleg443-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec - unlinked 0 (time 1713365183 ; total 0 ; last 0) total: 25 unlinks in 0 seconds: inf unlinks/second Starting client: oleg443-client.virtnet: -o user_xattr,flock oleg443-server@tcp:/lustre /mnt/lustre2 PASS 17 (129s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 18: ldlm_handle_enqueue succeeds on evicted export (3822) ========================================================== 10:46:26 (1713365186) debug=+dlmtrace fail_loc=0x8000030b using seed 3560560230 running for 500 iterations total: 500 stats in 0 seconds: inf stats/second ldlm.namespaces.MGC192.168.204.143@tcp.early_lock_cancel=0 ldlm.namespaces.lustre-MDT0000-mdc-ffff8800a8bd4800.early_lock_cancel=0 ldlm.namespaces.lustre-MDT0000-mdc-ffff8800b6cc7000.early_lock_cancel=0 ldlm.namespaces.lustre-OST0000-osc-ffff8800a8bd4800.early_lock_cancel=0 ldlm.namespaces.lustre-OST0000-osc-ffff8800b6cc7000.early_lock_cancel=0 ldlm.namespaces.lustre-OST0001-osc-ffff8800a8bd4800.early_lock_cancel=0 ldlm.namespaces.lustre-OST0001-osc-ffff8800b6cc7000.early_lock_cancel=0 fail_loc=0x80000305 Error in opening file "/mnt/lustre2/d18.replay-dual/f18.replay-dual"(flags=O_RDONLY) 2: No such file or directory ldlm.namespaces.MGC192.168.204.143@tcp.early_lock_cancel=1 ldlm.namespaces.lustre-MDT0000-mdc-ffff8800a8bd4800.early_lock_cancel=1 ldlm.namespaces.lustre-MDT0000-mdc-ffff8800b6cc7000.early_lock_cancel=1 ldlm.namespaces.lustre-OST0000-osc-ffff8800a8bd4800.early_lock_cancel=1 ldlm.namespaces.lustre-OST0000-osc-ffff8800b6cc7000.early_lock_cancel=1 ldlm.namespaces.lustre-OST0001-osc-ffff8800a8bd4800.early_lock_cancel=1 ldlm.namespaces.lustre-OST0001-osc-ffff8800b6cc7000.early_lock_cancel=1 fail_loc=0 fail_loc=0 PASS 18 (46s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 19: resend of open request =========== 10:47:13 (1713365233) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre fail_loc=0x157 - open/close 0 (time 1713365321.46 total 86.02 last 0.00) total: 1 open/close in 86.02 seconds: 0.01 ops/second fail_loc=0 Failing mds1 on oleg443-server Stopping /mnt/lustre-mds1 (opts:) on oleg443-server 10:48:42 (1713365322) shut down Failover mds1 to oleg443-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-MDT0000 10:48:55 (1713365335) targets are mounted 10:48:55 (1713365335) facet_failover done oleg443-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 19 (106s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 20: recovery time is not increasing == 10:49:01 (1713365341) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg443-server Stopping /mnt/lustre-mds1 (opts:) on oleg443-server 10:49:03 (1713365343) shut down Failover mds1 to oleg443-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-MDT0000 10:49:16 (1713365356) targets are mounted 10:49:16 (1713365356) facet_failover done oleg443-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Starting client: oleg443-client.virtnet: -o user_xattr,flock oleg443-server@tcp:/lustre /mnt/lustre2 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg443-server Stopping /mnt/lustre-mds1 (opts:) on oleg443-server 10:51:40 (1713365500) shut down Failover mds1 to oleg443-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-MDT0000 10:51:52 (1713365512) targets are mounted 10:51:52 (1713365512) facet_failover done oleg443-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Starting client: oleg443-client.virtnet: -o user_xattr,flock oleg443-server@tcp:/lustre /mnt/lustre2 PASS 20 (314s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 21a: commit on sharing =============== 10:54:17 (1713365657) mdt.lustre-MDT0000.commit_on_sharing=1 Replay barrier on lustre-MDT0000 Failing mds1 on oleg443-server Stopping /mnt/lustre-mds1 (opts:) on oleg443-server 10:54:20 (1713365660) shut down Failover mds1 to oleg443-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-MDT0000 10:54:33 (1713365673) targets are mounted 10:54:33 (1713365673) facet_failover done Starting client: oleg443-client.virtnet: -o user_xattr,flock oleg443-server@tcp:/lustre /mnt/lustre2 mdt.lustre-MDT0000.commit_on_sharing=0 PASS 21a (157s) debug_raw_pointers=0 debug_raw_pointers=0 SKIP: replay-dual test_21b skipping SLOW test 21b debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 22a: c1 lfs mkdir -i 1 dir1, M1 drop reply & fail, c2 mkdir dir1/dir ========================================================== 10:56:56 (1713365816) SKIP: replay-dual test_22a needs >= 2 MDTs SKIP 22a (0s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 22b: c1 lfs mkdir -i 1 d1, M1 drop reply & fail M0/M1, c2 mkdir d1/dir ========================================================== 10:56:58 (1713365818) SKIP: replay-dual test_22b needs >= 2 MDTs SKIP 22b (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 22c: c1 lfs mkdir -i 1 d1, M1 drop update & fail M1, c2 mkdir d1/dir ========================================================== 10:57:00 (1713365820) SKIP: replay-dual test_22c needs >= 2 MDTs SKIP 22c (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 22d: c1 lfs mkdir -i 1 d1, M1 drop update & fail M0/M1,c2 mkdir d1/dir ========================================================== 10:57:03 (1713365823) SKIP: replay-dual test_22d needs >= 2 MDTs SKIP 22d (0s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 23a: c1 rmdir d1, M1 drop reply and fail, client2 mkdir d1 ========================================================== 10:57:05 (1713365825) SKIP: replay-dual test_23a needs >= 2 MDTs SKIP 23a (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 23b: c1 rmdir d1, M1 drop reply and fail M0/M1, c2 mkdir d1 ========================================================== 10:57:07 (1713365827) SKIP: replay-dual test_23b needs >= 2 MDTs SKIP 23b (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 23c: c1 rmdir d1, M0 drop update reply and fail M0, c2 mkdir d1 ========================================================== 10:57:10 (1713365830) SKIP: replay-dual test_23c needs >= 2 MDTs SKIP 23c (0s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 23d: c1 rmdir d1, M0 drop update reply and fail M0/M1, c2 mkdir d1 ========================================================== 10:57:12 (1713365832) SKIP: replay-dual test_23d needs >= 2 MDTs SKIP 23d (0s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 24: reconstruct on non-existing object ========================================================== 10:57:14 (1713365834) fail_loc=0x119 fail_loc=0 truncate: cannot truncate '/mnt/lustre/f24.replay-dual' to length 100: No such file or directory PASS 24 (87s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 25: replay|resend ==================== 10:58:43 (1713365923) 1+0 records in 1+0 records out 512 bytes (512 B) copied, 0.00419283 s, 122 kB/s fail_loc=0x304 fail_loc=0x80000325 Failing ost1 on oleg443-server Stopping /mnt/lustre-ost1 (opts:) on oleg443-server 10:58:46 (1713365926) shut down Failover ost1 to oleg443-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-OST0000 10:59:00 (1713365940) targets are mounted 10:59:00 (1713365940) facet_failover done oleg443-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec /home/green/git/lustre-release/lustre/tests/test-framework.sh: line 6528: 15677 Terminated LUSTRE="/home/green/git/lustre-release/lustre" bash -c "multiop /mnt/lustre2/f25.replay-dual Ow512" fail_loc=0 PASS 25 (21s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 26: dbench and tar with mds failover ========================================================== 10:59:06 (1713365946) Starting client oleg443-client.virtnet: -o user_xattr,flock oleg443-server@tcp:/lustre /mnt/lustre Started clients oleg443-client.virtnet: 192.168.204.143@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Started tar loop with pid 17282 Started dbench loop with 17283 looking for dbench program /usr/bin/dbench found dbench client file /usr/share/dbench/client.txt '/usr/share/dbench/client.txt' -> 'client.txt' running 'dbench 1 -D /mnt/lustre2/d26.replay-dual/run_dbench -t 100' on /mnt/lustre at Wed Apr 17 10:59:07 EDT 2024 waiting for dbench pid 17314 dbench version 4.00 - Copyright Andrew Tridgell 1999-2004 Running for 100 seconds with load 'client.txt' and minimum warmup 20 secs failed to create barrier semaphore 0 of 1 processes prepared for launch 0 sec 1 of 1 processes prepared for launch 0 sec releasing clients 1 247 9.40 MB/sec warmup 1 sec latency 27.900 ms UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3584 2205056 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3641344 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3716096 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7357440 1% /mnt/lustre 1 503 8.73 MB/sec warmup 2 sec latency 20.637 ms 1 789 7.15 MB/sec warmup 3 sec latency 54.005 ms 1 1038 5.72 MB/sec warmup 4 sec latency 37.553 ms test_26 fail mds1 1 times Failing mds1 on oleg443-server Stopping /mnt/lustre-mds1 (opts:) on oleg443-server 1 1410 5.20 MB/sec warmup 5 sec latency 97.045 ms 1 1410 4.34 MB/sec warmup 6 sec latency 1097.261 ms 1 1410 3.72 MB/sec warmup 7 sec latency 2097.511 ms 1 1410 3.25 MB/sec warmup 8 sec latency 3097.748 ms 1 1410 2.89 MB/sec warmup 9 sec latency 4097.974 ms 1 1410 2.60 MB/sec warmup 10 sec latency 5098.247 ms 1 1410 2.36 MB/sec warmup 11 sec latency 6098.561 ms 10:59:19 (1713365959) shut down 1 1410 2.17 MB/sec warmup 12 sec latency 7098.908 ms 1 1410 2.00 MB/sec warmup 13 sec latency 8099.213 ms 1 1410 1.86 MB/sec warmup 14 sec latency 9099.485 ms 1 1410 1.73 MB/sec warmup 15 sec latency 10099.748 ms 1 1410 1.63 MB/sec warmup 16 sec latency 11100.093 ms 1 1410 1.53 MB/sec warmup 17 sec latency 12100.369 ms 1 1410 1.44 MB/sec warmup 18 sec latency 13100.621 ms 1 1410 1.37 MB/sec warmup 19 sec latency 14100.937 ms 1 1410 0.00 MB/sec execute 1 sec latency 16101.447 ms Failover mds1 to oleg443-server mount facets: mds1 1 1410 0.00 MB/sec execute 2 sec latency 17101.753 ms Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 1 1410 0.00 MB/sec execute 3 sec latency 18102.012 ms 1 1410 0.00 MB/sec execute 4 sec latency 19102.206 ms oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-MDT0000 10:59:32 (1713365972) targets are mounted 10:59:32 (1713365972) facet_failover done 1 1410 0.00 MB/sec execute 5 sec latency 20102.403 ms 1 1410 0.00 MB/sec execute 6 sec latency 21102.634 ms 1 1410 0.00 MB/sec execute 7 sec latency 22102.915 ms 1 1410 0.00 MB/sec execute 8 sec latency 23103.137 ms 1 1410 0.00 MB/sec execute 9 sec latency 24103.305 ms 1 1410 0.00 MB/sec execute 10 sec latency 25103.520 ms 1 1410 0.00 MB/sec execute 11 sec latency 26103.815 ms 1 1410 0.00 MB/sec execute 12 sec latency 27104.003 ms 1 1410 0.00 MB/sec execute 13 sec latency 28104.144 ms oleg443-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid 1 1492 0.00 MB/sec execute 14 sec latency 28778.889 ms mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1 1804 0.02 MB/sec execute 15 sec latency 50.633 ms 1 2162 0.04 MB/sec execute 16 sec latency 34.706 ms 1 2453 0.12 MB/sec execute 17 sec latency 41.585 ms 1 2969 0.42 MB/sec execute 18 sec latency 36.767 ms UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 4864 2203648 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 41984 3536896 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 18432 3596288 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 60416 7133184 1% /mnt/lustre 1 3516 0.65 MB/sec execute 19 sec latency 35.565 ms 1 3836 0.69 MB/sec execute 20 sec latency 13.284 ms 1 4076 0.67 MB/sec execute 21 sec latency 13.715 ms test_26 fail mds1 2 times Failing mds1 on oleg443-server Stopping /mnt/lustre-mds1 (opts:) on oleg443-server 1 4305 0.66 MB/sec execute 22 sec latency 113.842 ms 1 4305 0.63 MB/sec execute 23 sec latency 1114.154 ms 1 4305 0.60 MB/sec execute 24 sec latency 2114.411 ms 10:59:52 (1713365992) shut down 1 4305 0.58 MB/sec execute 25 sec latency 3114.644 ms 1 4305 0.55 MB/sec execute 26 sec latency 4114.984 ms 1 4305 0.53 MB/sec execute 27 sec latency 5115.287 ms 1 4305 0.51 MB/sec execute 28 sec latency 6115.456 ms 1 4305 0.50 MB/sec execute 29 sec latency 7115.588 ms 1 4305 0.48 MB/sec execute 30 sec latency 8115.769 ms 1 4305 0.47 MB/sec execute 31 sec latency 9115.962 ms 1 4305 0.45 MB/sec execute 32 sec latency 10116.231 ms 1 4305 0.44 MB/sec execute 33 sec latency 11116.449 ms 1 4305 0.42 MB/sec execute 34 sec latency 12116.681 ms Failover mds1 to oleg443-server mount facets: mds1 1 4305 0.41 MB/sec execute 35 sec latency 13116.955 ms Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 1 4305 0.40 MB/sec execute 36 sec latency 14117.114 ms oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 1 4305 0.39 MB/sec execute 37 sec latency 15117.292 ms Started lustre-MDT0000 11:00:05 (1713366005) targets are mounted 11:00:05 (1713366005) facet_failover done 1 4305 0.38 MB/sec execute 38 sec latency 16117.462 ms 1 4305 0.37 MB/sec execute 39 sec latency 17117.716 ms 1 4305 0.36 MB/sec execute 40 sec latency 18117.988 ms 1 4305 0.35 MB/sec execute 41 sec latency 19118.211 ms 1 4305 0.34 MB/sec execute 42 sec latency 20118.371 ms 1 4305 0.34 MB/sec execute 43 sec latency 21118.491 ms 1 4348 0.33 MB/sec execute 44 sec latency 21931.838 ms oleg443-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1 4588 0.35 MB/sec execute 45 sec latency 45.134 ms 1 4992 0.42 MB/sec execute 46 sec latency 14.792 ms 1 5278 0.41 MB/sec execute 47 sec latency 50.579 ms 1 5625 0.41 MB/sec execute 48 sec latency 38.474 ms UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 6400 2202240 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 44032 3478528 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 19456 3483648 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 63488 6962176 1% /mnt/lustre 1 5977 0.43 MB/sec execute 49 sec latency 48.134 ms 1 6484 0.53 MB/sec execute 50 sec latency 38.728 ms 1 7066 0.62 MB/sec execute 51 sec latency 31.882 ms test_26 fail mds1 3 times Failing mds1 on oleg443-server Stopping /mnt/lustre-mds1 (opts:) on oleg443-server 1 7355 0.63 MB/sec execute 52 sec latency 45.824 ms 11:00:20 (1713366020) shut down 1 7355 0.62 MB/sec execute 53 sec latency 1046.010 ms 1 7355 0.61 MB/sec execute 54 sec latency 2046.179 ms 1 7355 0.60 MB/sec execute 55 sec latency 3046.394 ms 1 7355 0.59 MB/sec execute 56 sec latency 4046.590 ms 1 7355 0.58 MB/sec execute 57 sec latency 5046.819 ms 1 7355 0.57 MB/sec execute 58 sec latency 6047.022 ms 1 7355 0.56 MB/sec execute 59 sec latency 7047.190 ms 1 7355 0.55 MB/sec execute 60 sec latency 8047.362 ms 1 7355 0.54 MB/sec execute 61 sec latency 9047.547 ms 1 7355 0.53 MB/sec execute 62 sec latency 10047.757 ms Failover mds1 to oleg443-server mount facets: mds1 1 7355 0.52 MB/sec execute 63 sec latency 11047.952 ms Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 1 7355 0.51 MB/sec execute 64 sec latency 12048.182 ms 1 7355 0.51 MB/sec execute 65 sec latency 13048.405 ms oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-MDT0000 11:00:33 (1713366033) targets are mounted 11:00:33 (1713366033) facet_failover done 1 7355 0.50 MB/sec execute 66 sec latency 14048.616 ms 1 7355 0.49 MB/sec execute 67 sec latency 15048.858 ms 1 7355 0.48 MB/sec execute 68 sec latency 16049.097 ms 1 7355 0.48 MB/sec execute 69 sec latency 17049.360 ms 1 7355 0.47 MB/sec execute 70 sec latency 18049.574 ms 1 7355 0.46 MB/sec execute 71 sec latency 19049.882 ms 1 7355 0.46 MB/sec execute 72 sec latency 20050.075 ms 1 7355 0.45 MB/sec execute 73 sec latency 21050.202 ms 1 7356 0.44 MB/sec execute 74 sec latency 22037.574 ms oleg443-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1 7584 0.44 MB/sec execute 75 sec latency 29.890 ms 1 7821 0.44 MB/sec execute 76 sec latency 91.588 ms 1 8046 0.44 MB/sec execute 77 sec latency 51.946 ms 1 8352 0.45 MB/sec execute 78 sec latency 31.751 ms UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 7936 2200704 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 46080 3388416 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 21504 3399680 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 67584 6788096 1% /mnt/lustre 1 8705 0.49 MB/sec execute 79 sec latency 51.871 ms 1 9016 0.48 MB/sec execute 80 sec latency 42.009 ms 1 9449 0.50 MB/sec execute 81 sec latency 11.762 ms test_26 fail mds1 4 times Failing mds1 on oleg443-server Stopping /mnt/lustre-mds1 (opts:) on oleg443-server 1 9823 0.54 MB/sec execute 82 sec latency 47.121 ms 11:00:50 (1713366050) shut down 1 9864 0.53 MB/sec execute 83 sec latency 951.311 ms 1 9864 0.53 MB/sec execute 84 sec latency 1951.546 ms 1 9864 0.52 MB/sec execute 85 sec latency 2951.681 ms 1 9864 0.51 MB/sec execute 86 sec latency 3951.854 ms 1 9864 0.51 MB/sec execute 87 sec latency 4952.049 ms 1 9864 0.50 MB/sec execute 88 sec latency 5952.286 ms 1 9864 0.50 MB/sec execute 89 sec latency 6952.454 ms 1 9864 0.49 MB/sec execute 90 sec latency 7952.682 ms 1 9864 0.49 MB/sec execute 91 sec latency 8952.936 ms 1 9864 0.48 MB/sec execute 92 sec latency 9953.116 ms Failover mds1 to oleg443-server mount facets: mds1 1 9864 0.47 MB/sec execute 93 sec latency 10953.291 ms Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 1 9864 0.47 MB/sec execute 94 sec latency 11953.575 ms 1 9864 0.46 MB/sec execute 95 sec latency 12953.796 ms oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-MDT0000 11:01:03 (1713366063) targets are mounted 11:01:03 (1713366063) facet_failover done 1 9864 0.46 MB/sec execute 96 sec latency 13953.980 ms 1 9864 0.46 MB/sec execute 97 sec latency 14954.214 ms 1 9864 0.45 MB/sec execute 98 sec latency 15954.351 ms 1 9864 0.45 MB/sec execute 99 sec latency 16954.516 ms oleg443-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1 cleanup 100 sec 0 cleanup 101 sec Operation Count AvgLat MaxLat ---------------------------------------- NTCreateX 1516 38.284 28778.877 Close 1135 20.905 21931.805 Rename 64 11.312 16.793 Unlink 280 83.028 22037.557 Qpathinfo 1433 2.348 32.062 Qfileinfo 263 0.427 2.400 Qfsinfo 230 0.460 2.469 Sfileinfo 116 6.100 8.798 Find 518 1.184 11.862 WriteX 766 1.867 7.617 ReadX 2493 0.125 33.172 LockX 6 1.506 1.765 UnlockX 6 1.565 1.957 Flush 101 26.259 93.445 Throughput 0.446188 MB/sec 1 clients 1 procs max_latency=28778.889 ms stopping dbench on /mnt/lustre at Wed Apr 17 11:01:08 EDT 2024 with return code 0 clean dbench files on /mnt/lustre /mnt/lustre /mnt/lustre removed 'client.txt' /mnt/lustre dbench successfully finished looking for dbench program /usr/bin/dbench found dbench client file /usr/share/dbench/client.txt '/usr/share/dbench/client.txt' -> 'client.txt' running 'dbench 1 -D /mnt/lustre2/d26.replay-dual/run_dbench -t 100' on /mnt/lustre at Wed Apr 17 11:01:09 EDT 2024 waiting for dbench pid 21441 dbench version 4.00 - Copyright Andrew Tridgell 1999-2004 Running for 100 seconds with load 'client.txt' and minimum warmup 20 secs 0 of 1 processes prepared for launch 0 sec 1 of 1 processes prepared for launch 0 sec releasing clients 1 235 8.95 MB/sec warmup 1 sec latency 34.202 ms UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 9216 2199424 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 47104 3379200 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 23552 3410944 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 70656 6790144 2% /mnt/lustre 1 485 8.47 MB/sec warmup 2 sec latency 22.421 ms 1 776 7.12 MB/sec warmup 3 sec latency 62.986 ms 1 1029 5.72 MB/sec warmup 4 sec latency 31.911 ms test_26 fail mds1 5 times Failing mds1 on oleg443-server 1 1393 5.20 MB/sec warmup 5 sec latency 25.855 ms Stopping /mnt/lustre-mds1 (opts:) on oleg443-server 1 1490 4.34 MB/sec warmup 6 sec latency 555.199 ms 11:01:16 (1713366076) shut down 1 1490 3.72 MB/sec warmup 7 sec latency 1555.365 ms 1 1490 3.26 MB/sec warmup 8 sec latency 2555.565 ms 1 1490 2.89 MB/sec warmup 9 sec latency 3555.795 ms 1 1490 2.60 MB/sec warmup 10 sec latency 4556.048 ms 1 1490 2.37 MB/sec warmup 11 sec latency 5556.230 ms 1 1490 2.17 MB/sec warmup 12 sec latency 6556.380 ms 1 1490 2.00 MB/sec warmup 13 sec latency 7556.556 ms 1 1490 1.86 MB/sec warmup 14 sec latency 8556.763 ms 1 1490 1.74 MB/sec warmup 15 sec latency 9556.946 ms 1 1490 1.63 MB/sec warmup 16 sec latency 10557.153 ms Failover mds1 to oleg443-server mount facets: mds1 1 1490 1.53 MB/sec warmup 17 sec latency 11557.367 ms Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 1 1490 1.45 MB/sec warmup 18 sec latency 12557.496 ms oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all 1 1490 1.37 MB/sec warmup 19 sec latency 13557.674 ms pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-MDT0000 11:01:29 (1713366089) targets are mounted 11:01:29 (1713366089) facet_failover done 1 1490 0.00 MB/sec execute 1 sec latency 15558.056 ms 1 1490 0.00 MB/sec execute 2 sec latency 16558.211 ms 1 1490 0.00 MB/sec execute 3 sec latency 17558.404 ms 1 1490 0.00 MB/sec execute 4 sec latency 18558.559 ms 1 1490 0.00 MB/sec execute 5 sec latency 19558.703 ms 1 1490 0.00 MB/sec execute 6 sec latency 20558.860 ms 1 1490 0.00 MB/sec execute 7 sec latency 21559.021 ms 1 1490 0.00 MB/sec execute 8 sec latency 22559.182 ms 1 1490 0.00 MB/sec execute 9 sec latency 23559.336 ms 1 1490 0.00 MB/sec execute 10 sec latency 24559.511 ms 1 1490 0.00 MB/sec execute 11 sec latency 25559.663 ms 1 1490 0.00 MB/sec execute 12 sec latency 26559.806 ms oleg443-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1 1803 0.02 MB/sec execute 13 sec latency 26632.504 ms 1 2156 0.05 MB/sec execute 14 sec latency 31.617 ms 1 2376 0.13 MB/sec execute 15 sec latency 69.648 ms 1 2609 0.19 MB/sec execute 16 sec latency 46.498 ms UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 10880 2197760 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 44032 3531776 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 29696 3531776 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 73728 7063552 2% /mnt/lustre 1 2987 0.44 MB/sec execute 17 sec latency 79.628 ms 1 3378 0.52 MB/sec execute 18 sec latency 47.830 ms 1 3696 0.67 MB/sec execute 19 sec latency 17.389 ms test_26 fail mds1 6 times 1 3867 0.69 MB/sec execute 20 sec latency 27.066 ms Failing mds1 on oleg443-server Stopping /mnt/lustre-mds1 (opts:) on oleg443-server 1 4002 0.67 MB/sec execute 21 sec latency 297.668 ms 11:01:51 (1713366111) shut down 1 4002 0.64 MB/sec execute 22 sec latency 1297.876 ms 1 4002 0.61 MB/sec execute 23 sec latency 2298.179 ms 1 4002 0.58 MB/sec execute 24 sec latency 3298.383 ms 1 4002 0.56 MB/sec execute 25 sec latency 4298.573 ms 1 4002 0.54 MB/sec execute 26 sec latency 5298.732 ms 1 4002 0.52 MB/sec execute 27 sec latency 6298.904 ms 1 4002 0.50 MB/sec execute 28 sec latency 7299.086 ms 1 4002 0.48 MB/sec execute 29 sec latency 8299.250 ms 1 4002 0.47 MB/sec execute 30 sec latency 9299.391 ms 1 4002 0.45 MB/sec execute 31 sec latency 10299.563 ms Failover mds1 to oleg443-server mount facets: mds1 1 4002 0.44 MB/sec execute 32 sec latency 11299.837 ms Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 1 4002 0.42 MB/sec execute 33 sec latency 12299.970 ms oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 1 4002 0.41 MB/sec execute 34 sec latency 13300.060 ms Started lustre-MDT0000 11:02:04 (1713366124) targets are mounted 11:02:04 (1713366124) facet_failover done 1 4002 0.40 MB/sec execute 35 sec latency 14300.257 ms 1 4002 0.39 MB/sec execute 36 sec latency 15300.453 ms 1 4002 0.38 MB/sec execute 37 sec latency 16300.695 ms 1 4002 0.37 MB/sec execute 38 sec latency 17300.892 ms 1 4002 0.36 MB/sec execute 39 sec latency 18301.104 ms 1 4002 0.35 MB/sec execute 40 sec latency 19301.371 ms 1 4002 0.34 MB/sec execute 41 sec latency 20301.726 ms 1 4002 0.33 MB/sec execute 42 sec latency 21301.891 ms oleg443-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1 4165 0.33 MB/sec execute 43 sec latency 21617.224 ms 1 4421 0.33 MB/sec execute 44 sec latency 76.913 ms 1 4554 0.33 MB/sec execute 45 sec latency 59.210 ms 1 4839 0.35 MB/sec execute 46 sec latency 22.075 ms UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 11136 2197504 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 54272 3568640 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 36864 3590144 2% /mnt/lustre[OST:1] filesystem_summary: 7542784 91136 7158784 2% /mnt/lustre 1 5141 0.41 MB/sec execute 47 sec latency 79.136 ms 1 5414 0.40 MB/sec execute 48 sec latency 63.171 ms 1 5760 0.40 MB/sec execute 49 sec latency 33.264 ms test_26 fail mds1 7 times 1 6037 0.44 MB/sec execute 50 sec latency 48.187 ms Failing mds1 on oleg443-server Stopping /mnt/lustre-mds1 (opts:) on oleg443-server 1 6325 0.49 MB/sec execute 51 sec latency 366.759 ms 11:02:21 (1713366141) shut down 1 6325 0.48 MB/sec execute 52 sec latency 1366.992 ms 1 6325 0.47 MB/sec execute 53 sec latency 2367.124 ms 1 6325 0.46 MB/sec execute 54 sec latency 3367.308 ms 1 6325 0.46 MB/sec execute 55 sec latency 4367.487 ms 1 6325 0.45 MB/sec execute 56 sec latency 5367.765 ms 1 6325 0.44 MB/sec execute 57 sec latency 6367.976 ms 1 6325 0.43 MB/sec execute 58 sec latency 7368.143 ms 1 6325 0.42 MB/sec execute 59 sec latency 8368.370 ms 1 6325 0.42 MB/sec execute 60 sec latency 9368.556 ms 1 6325 0.41 MB/sec execute 61 sec latency 10368.773 ms Failover mds1 to oleg443-server mount facets: mds1 1 6325 0.40 MB/sec execute 62 sec latency 11368.908 ms Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 1 6325 0.40 MB/sec execute 63 sec latency 12369.037 ms oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all 1 6325 0.39 MB/sec execute 64 sec latency 13369.249 ms pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-MDT0000 11:02:34 (1713366154) targets are mounted 11:02:34 (1713366154) facet_failover done 1 6325 0.39 MB/sec execute 65 sec latency 14369.424 ms 1 6325 0.38 MB/sec execute 66 sec latency 15369.617 ms 1 6325 0.37 MB/sec execute 67 sec latency 16369.918 ms 1 6325 0.37 MB/sec execute 68 sec latency 17370.172 ms 1 6325 0.36 MB/sec execute 69 sec latency 18370.364 ms 1 6325 0.36 MB/sec execute 70 sec latency 19370.550 ms 1 6325 0.35 MB/sec execute 71 sec latency 20370.750 ms 1 6431 0.35 MB/sec execute 72 sec latency 21251.165 ms oleg443-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1 6877 0.39 MB/sec execute 73 sec latency 51.233 ms 1 7185 0.43 MB/sec execute 74 sec latency 21.599 ms 1 7364 0.44 MB/sec execute 75 sec latency 27.076 ms 1 7536 0.44 MB/sec execute 76 sec latency 24.332 ms 1 7780 0.43 MB/sec execute 77 sec latency 107.605 ms 1 7919 0.43 MB/sec execute 78 sec latency 84.634 ms 1 8184 0.44 MB/sec execute 79 sec latency 73.864 ms 1 8583 0.48 MB/sec execute 80 sec latency 15.464 ms 1 8821 0.47 MB/sec execute 81 sec latency 86.820 ms 1 9033 0.47 MB/sec execute 82 sec latency 60.205 ms 1 9446 0.48 MB/sec execute 83 sec latency 12.559 ms 1 9685 0.49 MB/sec execute 84 sec latency 77.123 ms 1 10123 0.54 MB/sec execute 85 sec latency 64.718 ms 1 10725 0.59 MB/sec execute 86 sec latency 49.341 ms 1 11013 0.60 MB/sec execute 87 sec latency 12.830 ms 1 11248 0.59 MB/sec execute 88 sec latency 18.323 ms 1 11464 0.59 MB/sec execute 89 sec latency 99.541 ms 1 11756 0.60 MB/sec execute 90 sec latency 52.959 ms 1 12119 0.63 MB/sec execute 91 sec latency 17.171 ms 1 12351 0.63 MB/sec execute 92 sec latency 83.619 ms 1 12632 0.62 MB/sec execute 93 sec latency 67.167 ms dbench killed by signal 15 stopping dbench on /mnt/lustre at Wed Apr 17 11:03:03 EDT 2024 with return code 0 21441 pts/0 S+ 0:00 dbench -c client.txt 1 -D /mnt/lustre2/d26.replay-dual/run_dbench -t 100 killed dbench main pid 21441 clean dbench files on /mnt/lustre /mnt/lustre /mnt/lustre removed 'client.txt' /mnt/lustre dbench successfully finished PASS 26 (239s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 28: lock replay should be ordered: waiting after granted ========================================================== 11:03:07 (1713366187) 1+0 records in 1+0 records out 4096 bytes (4.1 kB) copied, 0.00302378 s, 1.4 MB/s fail_loc=0x80000324 fail_loc=0x32a Failing ost1 on oleg443-server Stopping /mnt/lustre-ost1 (opts:) on oleg443-server 11:03:13 (1713366193) shut down Failover ost1 to oleg443-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 1+0 records in 1+0 records out 4096 bytes (4.1 kB) copied, 0.0028579 s, 1.4 MB/s oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-OST0000 11:03:27 (1713366207) targets are mounted 11:03:27 (1713366207) facet_failover done oleg443-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec PASS 28 (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 29: replay vs update with the same xid ========================================================== 11:03:36 (1713366216) SKIP: replay-dual test_29 needs >= 2 MDTs SKIP 29 (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 30: layout lock replay is not blocked on IO ========================================================== 11:03:39 (1713366219) 10+0 records in 10+0 records out 40960 bytes (41 kB) copied, 0.00861799 s, 4.8 MB/s 10+0 records in 10+0 records out 40960 bytes (41 kB) copied, 0.0114748 s, 3.6 MB/s fail_loc=0x32e fail_val=4 Failing mds1 on oleg443-server Stopping /mnt/lustre-mds1 (opts:) on oleg443-server 11:03:42 (1713366222) shut down Failover mds1 to oleg443-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-MDT0000 11:03:55 (1713366235) targets are mounted 11:03:55 (1713366235) facet_failover done 160+0 records in 160+0 records out 81920 bytes (82 kB) copied, 20.1527 s, 4.1 kB/s oleg443-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 30 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 31: deadlock on file_remove_privs and occupied mod rpc slots ========================================================== 11:04:06 (1713366246) Failing ost1 on oleg443-server Stopping /mnt/lustre-ost1 (opts:) on oleg443-server 11:04:08 (1713366248) shut down Creating to objid 3201 on ost lustre-OST0000... total: 32 open/close in 0.29 seconds: 110.20 ops/second at_max=0 fail_loc=0x80001420 file /mnt/lustre2/d31.replay-dual/mdtdir/f31.replay-dual is not ready, wait 0.5 second... file /mnt/lustre2/d31.replay-dual/mdtdir/f31.replay-dual is ready Failover ost1 to oleg443-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg443-server: oleg443-server.virtnet: executing set_default_debug -1 all pdsh@oleg443-client: oleg443-server: ssh exited with exit code 1 Started lustre-OST0000 11:04:21 (1713366261) targets are mounted 11:04:21 (1713366261) facet_failover done oleg443-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL IDLE state after 0 sec pids: 29114 29117 29122 29123 29124 29125 29126 29127 29128 at_max=600 PASS 31 (21s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 32: gap in update llog shouldn't break recovery ========================================================== 11:04:29 (1713366269) SKIP: replay-dual test_32 needs >= 2 MDTs SKIP 32 (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 33: Check for OBD_INCOMPAT_MULTI_RPCS in last_rcvd after abort_recovery ========================================================== 11:04:32 (1713366272) SKIP: replay-dual test_33 ldiskfs only test SKIP 33 (1s) debug_raw_pointers=0 debug_raw_pointers=0 == replay-dual test complete, duration 2101 sec ========== 11:04:34 (1713366274) === replay-dual: start cleanup 11:04:34 (1713366274) === Stopping clients: oleg443-client.virtnet /mnt/lustre2 (opts:) Stopping client oleg443-client.virtnet /mnt/lustre2 opts: === replay-dual: finish cleanup 11:04:35 (1713366275) ===