-----============= acceptance-small: replay-single ============----- Wed Apr 17 10:29:58 EDT 2024 excepting tests: 110f 131b 59 36 === replay-single: start setup 10:30:02 (1713364202) === oleg234-client.virtnet: executing check_config_client /mnt/lustre oleg234-client.virtnet: Checking config lustre mounted on /mnt/lustre Checking servers environments Checking clients oleg234-client.virtnet environments Using TIMEOUT=20 osc.lustre-OST0000-osc-ffff8800abb89000.idle_timeout=debug osc.lustre-OST0001-osc-ffff8800abb89000.idle_timeout=debug disable quota as required oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all osd-ldiskfs.track_declares_assert=1 === replay-single: finish setup 10:30:09 (1713364209) === debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 0a: empty replay =================== 10:30:10 (1713364210) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1772 1285916 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1612 1286076 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3048 7210992 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:30:13 (1713364213) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:30:27 (1713364227) targets are mounted 10:30:27 (1713364227) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 
PASS 0a (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 0b: ensure object created after recover exists. (3284) ========================================================== 10:30:35 (1713364235) Failing ost1 on oleg234-server Stopping /mnt/lustre-ost1 (opts:) on oleg234-server 10:30:37 (1713364237) shut down Failover ost1 to oleg234-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-OST0000 10:30:50 (1713364250) targets are mounted 10:30:50 (1713364250) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec total: 20 open/close in 0.06 seconds: 314.20 ops/second - unlinked 0 (time 1713364253 ; total 0 ; last 0) total: 20 unlinks in 1 seconds: 20.000000 unlinks/second PASS 0b (20s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 0c: check replay-barrier =========== 10:30:56 (1713364256) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1768 1285920 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1612 1286076 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:31:00 (1713364260) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: 
oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:31:13 (1713364273) targets are mounted 10:31:13 (1713364273) facet_failover done Starting client: oleg234-client.virtnet: -o user_xattr,flock oleg234-server@tcp:/lustre /mnt/lustre rm: cannot remove '/mnt/lustre/f0c.replay-single': No such file or directory PASS 0c (90s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 0d: expired recovery with no clients ========================================================== 10:32:27 (1713364347) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1768 1285920 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1612 1286076 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:32:31 (1713364351) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:32:45 (1713364365) targets are mounted 10:32:45 (1713364365) facet_failover done Starting client: oleg234-client.virtnet: -o user_xattr,flock oleg234-server@tcp:/lustre /mnt/lustre PASS 0d (90s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 1: simple create =================== 10:33:59 (1713364439) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1768 1285920 1% /mnt/lustre[MDT:0] 
lustre-MDT0001_UUID 1414116 1612 1286076 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:34:03 (1713364443) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:34:18 (1713364458) targets are mounted 10:34:18 (1713364458) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f1.replay-single has type file OK PASS 1 (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 2a: touch ========================== 10:34:28 (1713364468) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1768 1285920 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1612 1286076 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:34:31 (1713364471) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started 
lustre-MDT0000 10:34:45 (1713364485) targets are mounted 10:34:45 (1713364485) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2a.replay-single has type file OK PASS 2a (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 2b: touch ========================== 10:34:54 (1713364494) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1768 1285920 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1612 1286076 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:34:58 (1713364498) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:35:11 (1713364511) targets are mounted 10:35:11 (1713364511) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2b.replay-single has type file OK PASS 2b (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 2c: setstripe replay =============== 10:35:20 (1713364520) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1768 1285920 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1612 1286076 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 
3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:35:23 (1713364523) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:35:37 (1713364537) targets are mounted 10:35:37 (1713364537) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2c.replay-single has type file OK PASS 2c (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 2d: setdirstripe replay ============ 10:35:46 (1713364546) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1768 1285920 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1612 1286076 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:35:50 (1713364550) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:36:03 (1713364563) targets are mounted 10:36:03 (1713364563) 
facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d2d.replay-single has type dir OK PASS 2d (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 2e: O_CREAT|O_EXCL create replay === 10:36:12 (1713364572) fail_loc=0x8000013b UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1852 1285836 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:36:17 (1713364577) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:36:31 (1713364591) targets are mounted 10:36:31 (1713364591) facet_failover done Succeed in opening file "/mnt/lustre/f2e.replay-single"(flags=O_CREAT) oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2e.replay-single has type file OK PASS 2e (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 3a: replay failed open(O_DIRECTORY) ========================================================== 10:36:40 (1713364600) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1852 1285836 1% /mnt/lustre[MDT:0] 
lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Error in opening file "/mnt/lustre/f3a.replay-single"(flags=O_DIRECTORY) 20: Not a directory Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:36:43 (1713364603) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:36:57 (1713364617) targets are mounted 10:36:57 (1713364617) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f3a.replay-single has type file OK PASS 3a (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 3b: replay failed open -ENOMEM ===== 10:37:06 (1713364626) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1852 1285836 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre fail_loc=0x80000114 touch: cannot touch '/mnt/lustre/f3b.replay-single': Cannot allocate memory fail_loc=0 Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:37:10 (1713364630) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: 
oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:37:24 (1713364644) targets are mounted 10:37:24 (1713364644) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f3b.replay-single: No such file or directory PASS 3b (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 3c: replay failed open -ENOMEM ===== 10:37:33 (1713364653) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1852 1285836 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre fail_loc=0x80000128 touch: cannot touch '/mnt/lustre/f3c.replay-single': Cannot allocate memory fail_loc=0 Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:37:37 (1713364657) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:37:50 (1713364670) targets are mounted 10:37:51 (1713364671) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f3c.replay-single: No such file or directory PASS 3c (25s) 
debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 4a: |x| 10 open(O_CREAT)s ========== 10:37:59 (1713364679) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1852 1285836 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:38:03 (1713364683) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:38:17 (1713364697) targets are mounted 10:38:17 (1713364697) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 4a (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 4b: |x| rm 10 files ================ 10:38:27 (1713364707) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1852 1285836 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:38:30 (1713364710) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey 
/mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:38:44 (1713364724) targets are mounted 10:38:44 (1713364724) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f4b.replay-single-*: No such file or directory PASS 4b (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 5: |x| 220 open(O_CREAT) =========== 10:38:53 (1713364733) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1852 1285836 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:38:57 (1713364737) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:39:11 (1713364751) targets are mounted 10:39:11 (1713364751) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 5 (32s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 6a: mkdir + contained create ======= 10:39:27 
(1713364767) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1884 1285804 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:39:30 (1713364770) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:39:44 (1713364784) targets are mounted 10:39:44 (1713364784) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d6a.replay-single has type dir OK /mnt/lustre/d6a.replay-single/f6a.replay-single has type file OK PASS 6a (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 6b: |X| rmdir ====================== 10:39:55 (1713364795) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1880 1285808 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:39:58 (1713364798) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: 
oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:40:12 (1713364812) targets are mounted 10:40:12 (1713364812) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/d6b.replay-single: No such file or directory PASS 6b (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 7: mkdir |X| contained create ====== 10:40:20 (1713364820) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:40:24 (1713364824) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:40:37 (1713364837) targets are mounted 10:40:37 (1713364837) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d7.replay-single has type dir OK /mnt/lustre/d7.replay-single/f7.replay-single has type file OK PASS 7 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y 
== replay-single test 8: creat open |X| close ============ 10:40:46 (1713364846) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre multiop /mnt/lustre/f8.replay-single vmo_c TMPPIPE=/tmp/multiop_open_wait_pipe.7528 Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:40:49 (1713364849) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:41:03 (1713364863) targets are mounted 10:41:03 (1713364863) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f8.replay-single /mnt/lustre/f8.replay-single has type file OK PASS 8 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 9: |X| create (same inum/gen) ====== 10:41:12 (1713364872) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:41:15 (1713364875) shut down Failover mds1 to oleg234-server 
mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:41:29 (1713364889) targets are mounted 10:41:29 (1713364889) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec old_inum == 144115305935798546, new_inum == 144115305935798546 old_inum and new_inum match PASS 9 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 10: create |X| rename unlink ======= 10:41:38 (1713364898) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:41:41 (1713364901) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:41:55 (1713364915) targets are mounted 10:41:55 (1713364915) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f10.replay-single: No such file or directory PASS 10 
(24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 11: create open write rename |X| create-old-name read ========================================================== 10:42:04 (1713364924) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre new old Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:42:08 (1713364928) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:42:22 (1713364942) targets are mounted 10:42:22 (1713364942) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec new old PASS 11 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 12: open, unlink |X| close ========= 10:42:31 (1713364951) multiop /mnt/lustre/f12.replay-single vo_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7528 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre Failing mds1 on oleg234-server Stopping 
/mnt/lustre-mds1 (opts:) on oleg234-server 10:42:34 (1713364954) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:42:48 (1713364968) targets are mounted 10:42:48 (1713364968) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 12 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 13: open chmod 0 |x| write close === 10:42:57 (1713364977) multiop /mnt/lustre/f13.replay-single vO_wc TMPPIPE=/tmp/multiop_open_wait_pipe.7528 /mnt/lustre/f13.replay-single has perms 00 OK UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:43:00 (1713364980) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:43:14 (1713364994) targets are mounted 10:43:14 (1713364994) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) 
mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f13.replay-single has perms 00 OK /mnt/lustre/f13.replay-single has size 1 OK PASS 13 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 14: open(O_CREAT), unlink |X| close ========================================================== 10:43:23 (1713365003) multiop /mnt/lustre/f14.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7528 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:43:27 (1713365007) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:43:40 (1713365020) targets are mounted 10:43:40 (1713365020) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 14 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 15: open(O_CREAT), unlink |X| touch new, close ========================================================== 10:43:49 (1713365029) multiop /mnt/lustre/f15.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7528 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 
1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:43:52 (1713365032) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:44:06 (1713365046) targets are mounted 10:44:06 (1713365046) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 15 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 16: |X| open(O_CREAT), unlink, touch new, unlink new ========================================================== 10:44:15 (1713365055) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:44:18 (1713365058) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all 
pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:44:32 (1713365072) targets are mounted 10:44:32 (1713365072) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 16 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 17: |X| open(O_CREAT), |replay| close ========================================================== 10:44:41 (1713365081) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre multiop /mnt/lustre/f17.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7528 Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:44:44 (1713365084) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:44:58 (1713365098) targets are mounted 10:44:58 (1713365098) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f17.replay-single has type file OK PASS 17 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 18: open(O_CREAT), unlink, touch new, close, touch, unlink 
========================================================== 10:45:06 (1713365106) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre multiop /mnt/lustre/f18.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7528 pid: 24937 will close Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:45:10 (1713365110) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:45:23 (1713365123) targets are mounted 10:45:23 (1713365123) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 18 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 19: mcreate, open, write, rename === 10:45:32 (1713365132) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre old Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:45:35 (1713365135) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov 
/dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:45:49 (1713365149) targets are mounted 10:45:49 (1713365149) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec old PASS 19 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 20a: |X| open(O_CREAT), unlink, replay, close (test mds_cleanup_orphans) ========================================================== 10:45:58 (1713365158) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3116 7210924 1% /mnt/lustre multiop /mnt/lustre/f20a.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7528 Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:46:01 (1713365161) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:46:15 (1713365175) targets are mounted 10:46:15 (1713365175) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 20a (24s) 
debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 20b: write, unlink, eviction, replay (test mds_cleanup_orphans) ========================================================== 10:46:23 (1713365183) /mnt/lustre/f20b.replay-single lmm_stripe_count: 1 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 0 obdidx objid objid group 0 1090 0x442 0x280000401 dd: error writing '/mnt/lustre/f20b.replay-single': Cannot send after transport endpoint shutdown 9254+0 records in 9253+0 records out 37900288 bytes (38 MB) copied, 1.49913 s, 25.3 MB/s Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:46:27 (1713365187) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:46:40 (1713365200) targets are mounted 10:46:40 (1713365200) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec affected facets: mds1 oleg234-server: oleg234-server.virtnet: executing _wait_recovery_complete *.lustre-MDT0000.recovery_status 1475 oleg234-server: *.lustre-MDT0000.recovery_status status: COMPLETE Waiting for MDT destroys to complete before 3116, after 3116 PASS 20b (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 20c: check that client eviction does not affect file content ========================================================== 10:46:50 (1713365210) multiop /mnt/lustre/f20c.replay-single vOw_c TMPPIPE=/tmp/multiop_open_wait_pipe.7528 -rw-r--r-- 1 root root 1 Apr 17 10:46 
/mnt/lustre/f20c.replay-single PASS 20c (3s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 21: |X| open(O_CREAT), unlink touch new, replay, close (test mds_cleanup_orphans) ========================================================== 10:46:54 (1713365214) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605408 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3116 7210880 1% /mnt/lustre multiop /mnt/lustre/f21.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7528 Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:46:58 (1713365218) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:47:11 (1713365231) targets are mounted 10:47:11 (1713365231) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 21 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 22: open(O_CREAT), |X| unlink, replay, close (test mds_cleanup_orphans) ========================================================== 10:47:20 (1713365240) multiop /mnt/lustre/f22.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7528 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% 
/mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605424 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3116 7210896 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:47:23 (1713365243) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:47:37 (1713365257) targets are mounted 10:47:37 (1713365257) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 22 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 23: open(O_CREAT), |X| unlink touch new, replay, close (test mds_cleanup_orphans) ========================================================== 10:47:46 (1713365266) multiop /mnt/lustre/f23.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7528 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210920 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:47:50 (1713365270) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config 
ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:48:03 (1713365283) targets are mounted 10:48:03 (1713365283) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 23 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 24: open(O_CREAT), replay, unlink, close (test mds_cleanup_orphans) ========================================================== 10:48:12 (1713365292) multiop /mnt/lustre/f24.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7528 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210920 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:48:16 (1713365296) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:48:29 (1713365309) targets are mounted 10:48:29 (1713365309) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 24 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 25: open(O_CREAT), unlink, replay, close (test mds_cleanup_orphans) 
========================================================== 10:48:38 (1713365318) multiop /mnt/lustre/f25.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7528 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210920 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:48:41 (1713365321) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:48:55 (1713365335) targets are mounted 10:48:55 (1713365335) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 25 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 26: |X| open(O_CREAT), unlink two, close one, replay, close one (test mds_cleanup_orphans) ========================================================== 10:49:04 (1713365344) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210920 1% /mnt/lustre multiop /mnt/lustre/f26.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7528 multiop 
/mnt/lustre/f26.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7528 Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:49:07 (1713365347) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:49:21 (1713365361) targets are mounted 10:49:21 (1713365361) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 26 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 27: |X| open(O_CREAT), unlink two, replay, close two (test mds_cleanup_orphans) ========================================================== 10:49:30 (1713365370) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210920 1% /mnt/lustre multiop /mnt/lustre/f27.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7528 multiop /mnt/lustre/f27.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7528 Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:49:33 (1713365373) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all 
pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:49:47 (1713365387) targets are mounted 10:49:47 (1713365387) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 27 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 28: open(O_CREAT), |X| unlink two, close one, replay, close one (test mds_cleanup_orphans) ========================================================== 10:49:55 (1713365395) multiop /mnt/lustre/f28.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7528 multiop /mnt/lustre/f28.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7528 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210920 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:49:59 (1713365399) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:50:12 (1713365412) targets are mounted 10:50:12 (1713365412) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 28 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == 
replay-single test 29: open(O_CREAT), |X| unlink two, replay, close two (test mds_cleanup_orphans) ========================================================== 10:50:21 (1713365421) multiop /mnt/lustre/f29.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7528 multiop /mnt/lustre/f29.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7528 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210920 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:50:24 (1713365424) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:50:38 (1713365438) targets are mounted 10:50:38 (1713365438) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 29 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 30: open(O_CREAT) two, unlink two, replay, close two (test mds_cleanup_orphans) ========================================================== 10:50:47 (1713365447) multiop /mnt/lustre/f30.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7528 multiop /mnt/lustre/f30.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7528 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] 
lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210920 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:50:50 (1713365450) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:51:04 (1713365464) targets are mounted 10:51:04 (1713365464) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 30 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 31: open(O_CREAT) two, unlink one, |X| unlink one, close two (test mds_cleanup_orphans) ========================================================== 10:51:12 (1713365472) multiop /mnt/lustre/f31.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7528 multiop /mnt/lustre/f31.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7528 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210920 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:51:16 (1713365476) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov 
/dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:51:30 (1713365490) targets are mounted 10:51:30 (1713365490) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 31 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 32: close() notices client eviction; close() after client eviction ========================================================== 10:51:39 (1713365499) multiop /mnt/lustre/f32.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7528 multiop /mnt/lustre/f32.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7528 pdsh@oleg234-client: oleg234-client: ssh exited with exit code 5 PASS 32 (4s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 33a: fid seq shouldn't be reused after abort recovery ========================================================== 10:51:44 (1713365504) total: 10 open/close in 0.04 seconds: 255.44 ops/second Replay barrier on lustre-MDT0000 Stopping /mnt/lustre-mds1 (opts:) on oleg234-server Failover mds1 to oleg234-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 total: 10 open/close in 0.04 seconds: 269.33 ops/second PASS 33a (17s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 33b: test fid seq allocation ======= 10:52:03 
(1713365523) fail_loc=0x1311 total: 10 open/close in 0.04 seconds: 258.95 ops/second Replay barrier on lustre-MDT0000 Stopping /mnt/lustre-mds1 (opts:) on oleg234-server Failover mds1 to oleg234-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 total: 10 open/close in 0.05 seconds: 191.33 ops/second PASS 33b (17s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 34: abort recovery before client does replay (test mds_cleanup_orphans) ========================================================== 10:52:22 (1713365542) multiop /mnt/lustre/f34.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7528 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1908 1285780 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1720 1285968 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210920 1% /mnt/lustre Stopping /mnt/lustre-mds1 (opts:) on oleg234-server Failover mds1 to oleg234-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 PASS 34 (19s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 35: test recovery from llog for unlink op ========================================================== 10:52:42 (1713365562) fail_loc=0x80000119 Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 
Failover mds1 to oleg234-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 rm: cannot remove '/mnt/lustre/f35.replay-single': Input/output error oleg234-client: error: invalid path '/mnt/lustre': Input/output error pdsh@oleg234-client: oleg234-client: ssh exited with exit code 5 first stat failed: 5 Can't lstat /mnt/lustre/f35.replay-single: No such file or directory PASS 35 (18s) debug_raw_pointers=0 debug_raw_pointers=0 SKIP: replay-single test_36 skipping ALWAYS excluded test 36 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 37: abort recovery before client does replay (test mds_cleanup_orphans for directories) ========================================================== 10:53:01 (1713365581) multiop /mnt/lustre/d37.replay-single/f37.replay-single vdD_c TMPPIPE=/tmp/multiop_open_wait_pipe.7528 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1980 1285708 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1784 1285904 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210920 1% /mnt/lustre Stopping /mnt/lustre-mds1 (opts:) on oleg234-server Failover mds1 to oleg234-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 oleg234-client: error: invalid path '/mnt/lustre': Input/output error pdsh@oleg234-client: oleg234-client: ssh exited with exit code 5 first stat failed: 5 
PASS 37 (19s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 38: test recovery from unlink llog (test llog_gen_rec) ========================================================== 10:53:22 (1713365602) total: 800 open/close in 2.01 seconds: 398.13 ops/second - unlinked 0 (time 1713365605 ; total 0 ; last 0) total: 400 unlinks in 1 seconds: 400.000000 unlinks/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2108 1285580 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210920 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:53:29 (1713365609) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:53:44 (1713365624) targets are mounted 10:53:44 (1713365624) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec - unlinked 0 (time 1713365630 ; total 0 ; last 0) total: 400 unlinks in 1 seconds: 400.000000 unlinks/second Can't lstat /mnt/lustre/f38.replay-single-*: No such file or directory PASS 38 (33s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 39: test recovery from unlink llog (test llog_gen_rec) ========================================================== 10:53:57 (1713365637) total: 800 open/close in 2.24 seconds: 357.08 ops/second UUID 
1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2112 1285576 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210920 1% /mnt/lustre - unlinked 0 (time 1713365643 ; total 0 ; last 0) total: 400 unlinks in 1 seconds: 400.000000 unlinks/second Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:54:06 (1713365646) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:54:21 (1713365661) targets are mounted 10:54:21 (1713365661) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec - unlinked 0 (time 1713365668 ; total 0 ; last 0) total: 400 unlinks in 2 seconds: 200.000000 unlinks/second Can't lstat /mnt/lustre/f39.replay-single-*: No such file or directory PASS 39 (37s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 41: read from a valid osc while other oscs are invalid ========================================================== 10:54:36 (1713365676) 1+0 records in 1+0 records out 4096 bytes (4.1 kB) copied, 0.00272038 s, 1.5 MB/s 1+0 records in 1+0 records out 4096 bytes (4.1 kB) copied, 0.00407569 s, 1.0 MB/s PASS 41 (3s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 42: recovery after ost failure ===== 10:54:41 (1713365681) total: 800 open/close in 3.48 seconds: 
229.67 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2140 1285548 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre - unlinked 0 (time 1713365690 ; total 0 ; last 0) total: 400 unlinks in 1 seconds: 400.000000 unlinks/second debug=-1 Failing ost1 on oleg234-server Stopping /mnt/lustre-ost1 (opts:) on oleg234-server 10:54:54 (1713365694) shut down Failover ost1 to oleg234-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-OST0000 10:55:09 (1713365709) targets are mounted 10:55:09 (1713365709) facet_failover done wait for MDS to timeout and recover debug=super ioctl neterror warning dlmtrace error emerg ha rpctrace vfstrace config console lfsck debug=super ioctl neterror warning dlmtrace error emerg ha rpctrace vfstrace config console lfsck - unlinked 0 (time 1713365750 ; total 0 ; last 0) total: 400 unlinks in 2 seconds: 200.000000 unlinks/second Can't lstat /mnt/lustre/f42.replay-single-*: No such file or directory PASS 42 (72s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 43: mds osc import failure during recovery; don't LBUG ========================================================== 10:55:56 (1713365756) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2196 1285492 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 
3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre fail_loc=0x80000204 Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 10:56:00 (1713365760) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 10:56:14 (1713365774) targets are mounted 10:56:14 (1713365774) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 43 (35s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 44a: race in target handle connect ========================================================== 10:56:32 (1713365792) at_max=40 1 of 10 (1713365794) service : cur 5 worst 5 (at 1713364161, 1634s ago) 4 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2092 1285596 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 2 of 10 (1713365799) service : cur 5 worst 5 (at 1713364161, 1639s ago) 5 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2092 1285596 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 3 of 
10 (1713365805) service : cur 5 worst 5 (at 1713364161, 1645s ago) 5 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2092 1285596 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 4 of 10 (1713365810) service : cur 5 worst 5 (at 1713364161, 1650s ago) 5 5 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2092 1285596 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 5 of 10 (1713365816) service : cur 5 worst 5 (at 1713364161, 1656s ago) 5 5 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2092 1285596 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 6 of 10 (1713365822) service : cur 5 worst 5 (at 1713364161, 1661s ago) 5 5 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2092 1285596 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 7 of 10 (1713365827) service : cur 5 worst 5 (at 1713364161, 1667s ago) 5 5 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2092 1285596 
1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 8 of 10 (1713365833) service : cur 5 worst 5 (at 1713364161, 1673s ago) 5 5 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2092 1285596 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 9 of 10 (1713365838) service : cur 5 worst 5 (at 1713364161, 1678s ago) 5 5 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2092 1285596 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 10 of 10 (1713365844) service : cur 5 worst 5 (at 1713364161, 1684s ago) 5 5 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2092 1285596 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre fail_loc=0 at_max=600 PASS 44a (59s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 44b: race in target handle connect ========================================================== 10:57:32 (1713365852) 1 of 10 (1713365853) service : cur 5 worst 5 (at 1713364161, 1693s ago) 5 5 4 4 fail_loc=0x80000704 error: 
recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.202.134@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 2 of 10 (1713365893) service : cur 40 worst 40 (at 1713365894, 0s ago) 40 5 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.202.134@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 3 of 10 (1713365914) service : cur 40 worst 40 (at 1713365894, 20s ago) 40 5 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.202.134@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 4 of 10 (1713365934) service : cur 40 worst 40 (at 1713365894, 41s ago) 40 5 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.202.134@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 5 of 10 (1713365955) service : cur 40 worst 40 (at 1713365894, 61s ago) 40 5 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.202.134@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 6 of 10 (1713365975) service : cur 40 worst 40 (at 1713365894, 82s ago) 40 40 5 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.202.134@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 7 of 10 (1713365996) service : cur 40 worst 40 (at 1713365894, 103s ago) 40 40 5 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.202.134@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 8 of 10 (1713366016) service : cur 40 worst 40 (at 1713365894, 123s ago) 40 40 5 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.202.134@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 9 of 10 (1713366037) service : cur 40 worst 40 (at 1713365894, 144s 
ago) 40 40 5 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.202.134@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 10 of 10 (1713366058) service : cur 40 worst 40 (at 1713365894, 164s ago) 40 40 5 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.202.134@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre PASS 44b (228s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 44c: race in target handle connect ========================================================== 11:01:22 (1713366082) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2092 1285596 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre total: 100 create in 0.25 seconds: 405.39 ops/second fail_loc=0x80000712 Stopping /mnt/lustre-mds1 (opts:) on oleg234-server Failover mds1 to oleg234-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 unlink(/mnt/lustre/f44c.replay-single-0) error: No such file or directory total: 0 unlinks in 0 seconds: -nan unlinks/second Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:01:40 (1713366100) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck 
all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:01:54 (1713366114) targets are mounted 11:01:54 (1713366114) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec unlink(/mnt/lustre/f44c.replay-single-0) error: No such file or directory total: 0 unlinks in 0 seconds: -nan unlinks/second PASS 44c (39s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 45: Handle failed close ============ 11:02:02 (1713366122) multiop /mnt/lustre/f45.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7528 /mnt/lustre/f45.replay-single has type file OK PASS 45 (3s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 46: Don't leak file handle after open resend (3325) ========================================================== 11:02:06 (1713366126) fail_loc=0x122 fail_loc=0 Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:02:24 (1713366144) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:02:37 (1713366157) targets are mounted 11:02:37 (1713366157) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec lfs path2fid: cannot get fid for 'f46.replay-single': No such file or directory PASS 46 (39s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 47: 
MDS->OSC failure during precreate cleanup (2824) ========================================================== 11:02:46 (1713366166) total: 20 open/close in 0.07 seconds: 281.45 ops/second Failing ost1 on oleg234-server Stopping /mnt/lustre-ost1 (opts:) on oleg234-server 11:02:48 (1713366168) shut down Failover ost1 to oleg234-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-OST0000 11:03:02 (1713366182) targets are mounted 11:03:02 (1713366182) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec fail_loc=0x80000204 total: 20 open/close in 0.06 seconds: 338.36 ops/second - unlinked 0 (time 1713366247 ; total 0 ; last 0) total: 20 unlinks in 0 seconds: inf unlinks/second PASS 47 (82s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 48: MDS->OSC failure during precreate cleanup (2824) ========================================================== 11:04:10 (1713366250) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2124 1285564 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1848 1285840 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre total: 20 open/close in 0.07 seconds: 298.09 ops/second Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:04:13 (1713366253) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o 
localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:04:27 (1713366267) targets are mounted 11:04:27 (1713366267) facet_failover done fail_loc=0x80000216 total: 20 open/close in 0.06 seconds: 310.93 ops/second - unlinked 0 (time 1713366330 ; total 0 ; last 0) total: 40 unlinks in 1 seconds: 40.000000 unlinks/second PASS 48 (82s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 50: Double OSC recovery, don't LASSERT (3812) ========================================================== 11:05:33 (1713366333) PASS 50 (7s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 52: time out lock replay (3764) ==== 11:05:42 (1713366342) multiop /mnt/lustre/f52.replay-single vs_s TMPPIPE=/tmp/multiop_open_wait_pipe.7528 fail_loc=0x80000157 Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:05:43 (1713366343) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:05:56 (1713366356) targets are mounted 11:05:56 (1713366356) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0x0 fail_loc=0x0 PASS 52 (77s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53a: |X| close request while two MDC requests in flight 
========================================================== 11:07:00 (1713366420) fail_loc=0x80000115 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:07:05 (1713366425) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:07:20 (1713366440) targets are mounted 11:07:20 (1713366440) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53a.replay-single-1/f has type file OK /mnt/lustre/d53a.replay-single-2/f has type file OK PASS 53a (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53b: |X| open request while two MDC requests in flight ========================================================== 11:07:28 (1713366448) multiop /mnt/lustre/d53b.replay-single-1/f vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7528 fail_loc=0x80000107 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:07:33 (1713366453) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:07:47 (1713366467) targets are mounted 11:07:47 (1713366467) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) 
mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53b.replay-single-1/f has type file OK /mnt/lustre/d53b.replay-single-2/f has type file OK PASS 53b (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53c: |X| open request and close request while two MDC requests in flight ========================================================== 11:07:56 (1713366476) fail_loc=0x80000107 fail_loc=0x80000115 Replay barrier on lustre-MDT0000 Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:08:01 (1713366481) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:08:15 (1713366495) targets are mounted 11:08:15 (1713366495) facet_failover done fail_loc=0 /mnt/lustre/d53c.replay-single-1/f has type file OK /mnt/lustre/d53c.replay-single-2/f has type file OK PASS 53c (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53d: close reply while two MDC requests in flight ========================================================== 11:08:24 (1713366504) fail_loc=0x8000013b fail_loc=0 Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:08:27 (1713366507) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:08:40 (1713366520) targets are 
mounted 11:08:40 (1713366520) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53d.replay-single-1/f has type file OK /mnt/lustre/d53d.replay-single-2/f has type file OK PASS 53d (23s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53e: |X| open reply while two MDC requests in flight ========================================================== 11:08:49 (1713366529) fail_loc=0x119 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:08:54 (1713366534) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:09:08 (1713366548) targets are mounted 11:09:08 (1713366548) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53e.replay-single-1/f has type file OK /mnt/lustre/d53e.replay-single-2/f has type file OK PASS 53e (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53f: |X| open reply and close reply while two MDC requests in flight ========================================================== 11:09:16 (1713366556) fail_loc=0x119 fail_loc=0x8000013b Replay barrier on lustre-MDT0000 Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:09:21 (1713366561) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o 
localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:09:35 (1713366575) targets are mounted 11:09:35 (1713366575) facet_failover done fail_loc=0 /mnt/lustre/d53f.replay-single-1/f has type file OK /mnt/lustre/d53f.replay-single-2/f has type file OK PASS 53f (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53g: |X| drop open reply and close request while close and open are both in flight ========================================================== 11:09:43 (1713366583) fail_loc=0x119 fail_loc=0x80000115 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:09:48 (1713366588) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:10:02 (1713366602) targets are mounted 11:10:02 (1713366602) facet_failover done /mnt/lustre/d53g.replay-single-1/f has type file OK /mnt/lustre/d53g.replay-single-2/f has type file OK PASS 53g (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53h: open request and close reply while two MDC requests in flight ========================================================== 11:10:11 (1713366611) fail_loc=0x80000107 fail_loc=0x8000013b Replay barrier on lustre-MDT0000 Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:10:16 (1713366616) shut down Failover mds1 to oleg234-server mount facets: mds1 
Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:10:30 (1713366630) targets are mounted 11:10:30 (1713366630) facet_failover done fail_loc=0 /mnt/lustre/d53h.replay-single-1/f has type file OK /mnt/lustre/d53h.replay-single-2/f has type file OK PASS 53h (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 55: let MDS_CHECK_RESENT return the original return code instead of 0 ========================================================== 11:10:38 (1713366638) fail_loc=0x8000012b fail_loc=0x0 rm: touch: cannot remove '/mnt/lustre/f55.replay-single'cannot touch '/mnt/lustre/f55.replay-single': No such file or directory : No such file or directory PASS 55 (62s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 56: don't replay a symlink open request (3440) ========================================================== 11:11:42 (1713366702) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2132 1285556 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1848 1285840 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:11:45 (1713366705) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 
Started lustre-MDT0000 11:11:59 (1713366719) targets are mounted 11:11:59 (1713366719) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 56 (34s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 57: test recovery from llog for setattr op ========================================================== 11:12:17 (1713366737) fail_loc=0x8000012c UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2132 1285556 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1848 1285840 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:12:21 (1713366741) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:12:34 (1713366754) targets are mounted 11:12:34 (1713366754) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec affected facets: mds1 oleg234-server: oleg234-server.virtnet: executing _wait_recovery_complete *.lustre-MDT0000.recovery_status 1475 oleg234-server: *.lustre-MDT0000.recovery_status status: COMPLETE Waiting for orphan cleanup... 
osp.lustre-OST0000-osc-MDT0000.old_sync_processed osp.lustre-OST0000-osc-MDT0001.old_sync_processed osp.lustre-OST0001-osc-MDT0000.old_sync_processed osp.lustre-OST0001-osc-MDT0001.old_sync_processed wait 40 secs maximumly for oleg234-server mds-ost sync done. /mnt/lustre/f57.replay-single has type file OK fail_loc=0x0 PASS 57 (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 58a: test recovery from llog for setattr op (test llog_gen_rec) ========================================================== 11:12:46 (1713366766) fail_loc=0x8000012c total: 2500 open/close in 4.39 seconds: 569.85 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2420 1285268 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1848 1285840 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:12:55 (1713366775) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:13:08 (1713366788) targets are mounted 11:13:08 (1713366788) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0x0 - unlinked 0 (time 1713366804 ; total 0 ; last 0) total: 2500 unlinks in 3 seconds: 833.333313 unlinks/second PASS 58a (42s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 58b: test 
replay of setxattr op ==== 11:13:30 (1713366810) Starting client: oleg234-client.virtnet: -o user_xattr,flock oleg234-server@tcp:/lustre /mnt/lustre2 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2412 1285276 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1848 1285840 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:13:34 (1713366814) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:13:50 (1713366830) targets are mounted 11:13:50 (1713366830) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Stopping client oleg234-client.virtnet /mnt/lustre2 (opts:) oleg234-client.virtnet: executing wait_import_state_mount FULL mgc.*.mgs_server_uuid mgc.*.mgs_server_uuid in FULL state after 0 sec PASS 58b (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 58c: resend/reconstruct setxattr op ========================================================== 11:14:00 (1713366840) Starting client: oleg234-client.virtnet: -o user_xattr,flock oleg234-server@tcp:/lustre /mnt/lustre2 fail_val=0 fail_loc=0x123 fail_loc=0 fail_loc=0x119 fail_loc=0 Stopping client oleg234-client.virtnet /mnt/lustre2 (opts:) PASS 58c (128s) debug_raw_pointers=0 debug_raw_pointers=0 SKIP: replay-single test_59 skipping ALWAYS excluded test 59 
debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 60: test llog post recovery init vs llog unlink ========================================================== 11:16:10 (1713366970) total: 200 open/close in 0.37 seconds: 537.70 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2268 1285420 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1848 1285840 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre - unlinked 0 (time 1713366973 ; total 0 ; last 0) total: 100 unlinks in 0 seconds: inf unlinks/second Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:16:14 (1713366974) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:16:28 (1713366988) targets are mounted 11:16:28 (1713366988) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec - unlinked 0 (time 1713366994 ; total 0 ; last 0) total: 100 unlinks in 0 seconds: inf unlinks/second PASS 60 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 61a: test race llog recovery vs llog cleanup ========================================================== 11:16:37 (1713366997) total: 800 open/close in 1.56 seconds: 513.46 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2352 1285336 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2024 1285664 1% 
/mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre - unlinked 0 (time 1713367002 ; total 0 ; last 0) total: 800 unlinks in 1 seconds: 800.000000 unlinks/second fail_val=0 fail_loc=0x80000221 Failing ost1 on oleg234-server Stopping /mnt/lustre-ost1 (opts:) on oleg234-server 11:16:44 (1713367004) shut down Failover ost1 to oleg234-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-OST0000 11:16:58 (1713367018) targets are mounted 11:16:58 (1713367018) facet_failover done Failing ost1 on oleg234-server Stopping /mnt/lustre-ost1 (opts:) on oleg234-server 11:17:09 (1713367029) shut down Failover ost1 to oleg234-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-OST0000 11:17:23 (1713367043) targets are mounted 11:17:23 (1713367043) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec fail_val=0 fail_loc=0x0 Can't lstat /mnt/lustre/d61a.replay-single/f61a.replay-single-*: No such file or directory PASS 61a (80s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 61b: test race mds llog sync vs llog cleanup 
========================================================== 11:17:59 (1713367079) fail_loc=0x8000013a Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:18:01 (1713367081) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:18:13 (1713367093) targets are mounted 11:18:13 (1713367093) facet_failover done Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:18:24 (1713367104) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:18:37 (1713367117) targets are mounted 11:18:37 (1713367117) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1+0 records in 1+0 records out 4096 bytes (4.1 kB) copied, 0.0023434 s, 1.7 MB/s PASS 61b (45s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 61c: test race mds llog sync vs llog cleanup ========================================================== 11:18:46 (1713367126) fail_val=0 fail_loc=0x80000222 Failing ost1 on oleg234-server Stopping /mnt/lustre-ost1 (opts:) on oleg234-server 11:18:57 (1713367137) shut down Failover ost1 to oleg234-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 
seq.cli-lustre-OST0000-super.width=65536 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-OST0000 11:19:10 (1713367150) targets are mounted 11:19:10 (1713367150) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec fail_val=0 fail_loc=0x0 PASS 61c (29s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 61d: error in llog_setup should cleanup the llog context correctly ========================================================== 11:19:17 (1713367157) Stopping /mnt/lustre-mds1 (opts:) on oleg234-server fail_loc=0x80000605 Starting mgs: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: mount.lustre: mount /dev/mapper/mds1_flakey at /mnt/lustre-mds1 failed: Operation not supported pdsh@oleg234-client: oleg234-server: ssh exited with exit code 95 Start of /dev/mapper/mds1_flakey on mgs failed 95 fail_loc=0 Starting mgs: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 PASS 61d (10s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 62: don't mis-drop resent replay === 11:19:29 (1713367169) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2344 1285344 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2004 1285684 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] 
filesystem_summary: 7666232 3128 7210912 1% /mnt/lustre total: 25 open/close in 0.07 seconds: 344.47 ops/second fail_loc=0x80000707 Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:19:32 (1713367172) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:19:46 (1713367186) targets are mounted 11:19:46 (1713367186) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0 - unlinked 0 (time 1713367249 ; total 0 ; last 0) total: 25 unlinks in 0 seconds: inf unlinks/second PASS 62 (82s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 65a: AT: verify early replies ====== 11:20:52 (1713367252) at_history=8 at_history=8 debug=other fail_val=11000 fail_loc=0x8000050a 00000100:00001000:2.0:1713367281.991646:0:14499:0:(client.c:537:ptlrpc_at_recv_early_reply()) @@@ Early reply #1, new deadline in 35s (25s) req@ffff8800a8dd5f80 x1796592505682432/t0(0) o101->lustre-MDT0000-mdc-ffff8800aab95000@192.168.202.134@tcp:12/10 lens 664/66320 e 1 to 0 dl 1713367316 ref 2 fl Rpc:PQr/200/ffffffff rc 0/-1 job:'createmany.0' uid:0 gid:0 portal 12 : cur 36 worst 40 (at 1713365893, 1394s ago) 36 40 40 40 portal 29 : cur 5 worst 5 (at 1713364435, 2852s ago) 5 0 0 0 portal 23 : cur 5 worst 5 (at 1713364435, 2852s ago) 5 5 5 5 portal 30 : cur 5 worst 5 (at 1713364440, 2847s ago) 5 0 0 0 portal 17 : cur 5 worst 5 (at 1713364522, 2765s ago) 5 5 5 0 portal 13 : cur 5 worst 5 (at 1713364900, 2387s ago) 5 5 0 0 portal 12 : cur 11 worst 40 (at 1713365893, 1403s 
ago) 36 40 40 40 portal 29 : cur 5 worst 5 (at 1713364435, 2861s ago) 5 0 0 0 portal 23 : cur 5 worst 5 (at 1713364435, 2861s ago) 5 5 5 5 portal 30 : cur 5 worst 5 (at 1713364440, 2856s ago) 5 0 0 0 portal 17 : cur 5 worst 5 (at 1713364522, 2774s ago) 5 5 5 0 portal 13 : cur 5 worst 5 (at 1713364900, 2396s ago) 5 5 0 0 PASS 65a (45s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 65b: AT: verify early replies on packed reply / bulk ========================================================== 11:21:38 (1713367298) at_history=8 at_history=8 debug=other trace fail_val=11 fail_loc=0x224 fail_loc=0 00000100:00001000:3.0:1713367327.879314:0:1955:0:(client.c:537:ptlrpc_at_recv_early_reply()) @@@ Early reply #1, new deadline in 35s (25s) req@ffff8800aa84a300 x1796592505692352/t0(0) o4->lustre-OST0000-osc-ffff8800aab95000@192.168.202.134@tcp:6/4 lens 4584/448 e 1 to 0 dl 1713367362 ref 2 fl Rpc:Qr/200/ffffffff rc 0/-1 job:'multiop.0' uid:0 gid:0 debug=super ioctl neterror warning dlmtrace error emerg ha rpctrace vfstrace config console lfsck debug=super ioctl neterror warning dlmtrace error emerg ha rpctrace vfstrace config console lfsck portal 28 : cur 5 worst 5 (at 1713364435, 2898s ago) 5 5 5 5 portal 7 : cur 5 worst 5 (at 1713364436, 2897s ago) 5 5 5 5 portal 17 : cur 5 worst 5 (at 1713364495, 2838s ago) 5 0 0 5 portal 6 : cur 36 worst 36 (at 1713367332, 1s ago) 36 5 0 0 PASS 65b (36s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 66a: AT: verify MDT service time adjusts with no early replies ========================================================== 11:22:16 (1713367336) at_history=8 at_history=8 portal 12 : cur 5 worst 40 (at 1713365893, 1465s ago) 36 40 40 40 fail_val=5000 fail_loc=0x8000050a portal 12 : cur 5 worst 40 (at 1713365893, 1471s ago) 36 40 40 40 fail_val=10000 fail_loc=0x8000050a portal 12 : cur 36 worst 40 (at 1713365893, 1481s ago) 36 40 
40 40 fail_loc=0 portal 12 : cur 5 worst 40 (at 1713365893, 1491s ago) 36 40 40 40 Current MDT timeout 5, worst 40 PASS 66a (49s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 66b: AT: verify net latency adjusts ========================================================== 11:23:07 (1713367387) at_history=8 at_history=8 fail_val=10 fail_loc=0x50c fail_loc=0 network timeout orig 5, cur 10, worst 10 PASS 66b (83s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 67a: AT: verify slow request processing doesn't induce reconnects ========================================================== 11:24:32 (1713367472) at_history=8 at_history=8 fail_val=400 fail_loc=0x50a fail_loc=0 0 osc reconnect attempts on gradual slow PASS 67a (74s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 67b: AT: verify instant slowdown doesn't induce reconnects ========================================================== 11:25:48 (1713367548) at_history=8 at_history=8 Creating to objid 5057 on ost lustre-OST0000... 
fail_val=20000 fail_loc=0x80000223 total: 17 open/close in 0.09 seconds: 197.45 ops/second Connected clients: oleg234-client.virtnet oleg234-client.virtnet service : cur 5 worst 5 (at 1713364171, 3402s ago) 1 1 0 1 phase 2 1 osc reconnect attempts on instant slow fail_loc=0x80000223 fail_loc=0 Connected clients: oleg234-client.virtnet oleg234-client.virtnet service : cur 5 worst 5 (at 1713364171, 3404s ago) 1 1 1 0 0 osc reconnect attempts on 2nd slow PASS 67b (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 68: AT: verify slowing locks ======= 11:26:19 (1713367579) at_history=8 at_history=8 /home/green/git/lustre-release/lustre/tests/replay-single.sh: line 1968: $ldlm_enqueue_min: ambiguous redirect fail_val=19 fail_loc=0x80000312 fail_val=25 fail_loc=0x80000312 fail_loc=0 /home/green/git/lustre-release/lustre/tests/replay-single.sh: line 1983: $ldlm_enqueue_min: ambiguous redirect PASS 68 (69s) debug_raw_pointers=0 debug_raw_pointers=0 Cleaning up AT ... 
debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70a: check multi client t-f ======== 11:27:30 (1713367650) SKIP: replay-single test_70a Need two or more clients, have 1 SKIP 70a (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70b: dbench 2mdts recovery; 1 clients ========================================================== 11:27:34 (1713367654) Starting client oleg234-client.virtnet: -o user_xattr,flock oleg234-server@tcp:/lustre /mnt/lustre Started clients oleg234-client.virtnet: 192.168.202.134@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) striped dir -i0 -c2 -H crush /mnt/lustre/d70b.replay-single + MISSING_DBENCH_OK= + PATH=/opt/iozone/bin:/opt/iozone/bin:/home/green/git/lustre-release/lustre/tests/mpi:/home/green/git/lustre-release/lustre/tests/racer:/home/green/git/lustre-release/lustre/../lustre-iokit/sgpdd-survey:/home/green/git/lustre-release/lustre/tests:/home/green/git/lustre-release/lustre/utils/gss:/home/green/git/lustre-release/lustre/utils:/opt/iozone/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin::/home/green/git/lustre-release/lustre/utils:/home/green/git/lustre-release/lustre/tests:/sbin:/usr/sbin:/home/green/git/lustre-release/lustre/utils:/home/green/git/lustre-release/lustre/tests/: + DBENCH_LIB= + TESTSUITE=replay-single + TESTNAME=test_70b + MOUNT=/mnt/lustre ++ hostname + DIR=/mnt/lustre/d70b.replay-single/oleg234-client.virtnet + LCTL=/home/green/git/lustre-release/lustre/utils/lctl + rundbench 1 -t 300 dbench: no process found Started rundbench load pid=21611 ... 
UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2468 1285220 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2160 1285528 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 26164 3556136 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 5388 3591052 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 31552 7147188 1% /mnt/lustre test_70b fail mds1 1 times Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:27:42 (1713367662) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:27:58 (1713367678) targets are mounted 11:27:58 (1713367678) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2428 1285260 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2128 1285560 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 27452 3568880 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 5552 3594232 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 33004 7163112 1% /mnt/lustre oleg234-client.virtnet: looking for dbench program oleg234-client.virtnet: /usr/bin/dbench oleg234-client.virtnet: creating output directory /mnt/lustre/d70b.replay-single/oleg234-client.virtnet oleg234-client.virtnet: mkdir: created directory '/mnt/lustre/d70b.replay-single/oleg234-client.virtnet' oleg234-client.virtnet: found dbench client file /usr/share/dbench/client.txt oleg234-client.virtnet: '/usr/share/dbench/client.txt' -> 'client.txt' oleg234-client.virtnet: running 'dbench 1 -t 300' on 
/mnt/lustre/d70b.replay-single/oleg234-client.virtnet at Wed Apr 17 11:27:35 EDT 2024 oleg234-client.virtnet: waiting for dbench pid 21635 oleg234-client.virtnet: dbench version 4.00 - Copyright Andrew Tridgell 1999-2004 oleg234-client.virtnet: oleg234-client.virtnet: Running for 300 seconds with load 'client.txt' and minimum warmup 60 secs oleg234-client.virtnet: failed to create barrier semaphore oleg234-client.virtnet: 0 of 1 processes prepared for launch 0 sec oleg234-client.virtnet: 1 of 1 processes prepared for launch 0 sec oleg234-client.virtnet: releasing clients oleg234-client.virtnet: 1 356 12.78 MB/sec warmup 1 sec latency 17.770 ms oleg234-client.virtnet: 1 778 10.68 MB/sec warmup 2 sec latency 15.535 ms oleg234-client.virtnet: 1 1177 7.67 MB/sec warmup 3 sec latency 15.205 ms oleg234-client.virtnet: 1 1603 6.54 MB/sec warmup 4 sec latency 15.811 ms oleg234-client.virtnet: 1 1993 5.30 MB/sec warmup 5 sec latency 16.927 ms oleg234-client.virtnet: 1 2399 4.66 MB/sec warmup 6 sec latency 14.558 ms oleg234-client.virtnet: 1 2969 4.79 MB/sec warmup 7 sec latency 15.010 ms oleg234-client.virtnet: 1 3885 4.97 MB/sec warmup 8 sec latency 11.546 ms oleg234-client.virtnet: 1 4083 4.45 MB/sec warmup 9 sec latency 618.388 ms oleg234-client.virtnet: 1 4083 4.01 MB/sec warmup 10 sec latency 1618.563 ms oleg234-client.virtnet: 1 4083 3.64 MB/sec warmup 11 sec latency 2618.755 ms oleg234-client.virtnet: 1 4083 3.34 MB/sec warmup 12 sec latency 3618.954 ms oleg234-client.virtnet: 1 4083 3.08 MB/sec warmup 13 sec latency 4619.192 ms oleg234-client.virtnet: 1 4083 2.86 MB/sec warmup 14 sec latency 5619.392 ms oleg234-client.virtnet: 1 4083 2.67 MB/sec warmup 15 sec latency 6619.602 ms oleg234-client.virtnet: 1 4083 2.51 MB/sec warmup 16 sec latency 7619.818 ms oleg234-client.virtnet: 1 4083 2.36 MB/sec warmup 17 sec latency 8619.985 ms oleg234-client.virtnet: 1 4083 2.23 MB/sec warmup 18 sec latency 9620.165 ms oleg234-client.virtnet: 1 4083 2.11 MB/sec warmup 19 sec 
latency 10620.399 ms oleg234-client.virtnet: 1 4083 2.00 MB/sec warmup 20 sec latency 11620.637 ms oleg234-client.virtnet: 1 4083 1.91 MB/sec warmup 21 sec latency 12620.858 ms oleg234-client.virtnet: 1 4083 1.82 MB/sec warmup 22 sec latency 13621.081 ms oleg234-client.virtnet: 1 4083 1.74 MB/sec warmup 23 sec latency 14621.279 ms oleg234-client.virtnet: 1 4083 1.67 MB/sec warmup 24 sec latency 15621.483 ms oleg234-client.virtnet: 1 4083 1.60 MB/sec warmup 25 sec latency 16621.679 ms oleg234-client.virtnet: 1 4083 1.54 MB/sec warmup 26 sec latency 17621.894 ms oleg234-client.virtnet: 1 4421 1.51 MB/sec warmup 27 sec latency 17820.008 ms oleg234-client.virtnet: 1 4768 1.51 MB/sec warmup 28 sec latency 14.651 ms oleg234-client.virtnet: 1 5174 1.56 MB/sec warmup 29 sec latency 14.717 ms oleg234-client.virtnet: 1 5537 1.52 MB/sec warmup 30 sec latency 16.381 ms oleg234-client.virtnet: 1 5951 1.52 MB/sec warmup 31 sec latency 15.967 ms oleg234-client.virtnet: 1 6477 1.64 MB/sec warmup 32 sec latency 15.522 ms oleg234-client.virtnet: 1 6833 1.65 MB/sec warmup 33 sec latency 465.test_70b fail mds2 2 times Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 11:28:11 (1713367691) shut down Failover mds2 to oleg234-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 11:28:26 (1713367706) targets are mounted 11:28:26 (1713367706) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2388 1285300 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2052 1285636 1% 
/mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 37828 3569136 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 11884 3592244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 49712 7161380 1% /mnt/lustre test_70b fail mds1 3 times Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:28:38 (1713367718) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:28:53 (1713367733) targets are mounted 11:28:53 (1713367733) facet_failover done 647 ms oleg234-client.virtnet: 1 7324 1.73 MB/sec warmup 34 sec latency 15.577 ms oleg234-client.virtnet: 1 7559 1.69 MB/sec warmup 35 sec latency 23.068 ms oleg234-client.virtnet: 1 7593 1.64 MB/sec warmup 36 sec latency 866.864 ms oleg234-client.virtnet: 1 7593 1.60 MB/sec warmup 37 sec latency 1867.008 ms oleg234-client.virtnet: 1 7593 1.56 MB/sec warmup 38 sec latency 2867.200 ms oleg234-client.virtnet: 1 7593 1.52 MB/sec warmup 39 sec latency 3867.475 ms oleg234-client.virtnet: 1 7593 1.48 MB/sec warmup 40 sec latency 4867.735 ms oleg234-client.virtnet: 1 7593 1.44 MB/sec warmup 41 sec latency 5867.965 ms oleg234-client.virtnet: 1 7593 1.41 MB/sec warmup 42 sec latency 6868.187 ms oleg234-client.virtnet: 1 7593 1.38 MB/sec warmup 43 sec latency 7868.408 ms oleg234-client.virtnet: 1 7593 1.34 MB/sec warmup 44 sec latency 8868.649 ms oleg234-client.virtnet: 1 7593 1.31 MB/sec warmup 45 sec latency 9868.860 ms oleg234-client.virtnet: 1 7593 1.29 MB/sec warmup 46 sec latency 10869.024 ms oleg234-client.virtnet: 1 7593 1.26 MB/sec warmup 47 sec latency 11869.233 ms oleg234-client.virtnet: 1 7593 1.23 MB/sec warmup 48 sec latency 12869.464 ms oleg234-client.virtnet: 1 7593 1.21 MB/sec warmup 49 sec latency 
13869.685 ms oleg234-client.virtnet: 1 7593 1.18 MB/sec warmup 50 sec latency 14869.895 ms oleg234-client.virtnet: 1 7593 1.16 MB/sec warmup 51 sec latency 15870.117 ms oleg234-client.virtnet: 1 7593 1.14 MB/sec warmup 52 sec latency 16870.333 ms oleg234-client.virtnet: 1 7593 1.12 MB/sec warmup 53 sec latency 17870.565 ms oleg234-client.virtnet: 1 7593 1.10 MB/sec warmup 54 sec latency 18870.767 ms oleg234-client.virtnet: 1 7674 1.08 MB/sec warmup 55 sec latency 19649.878 ms oleg234-client.virtnet: 1 8059 1.07 MB/sec warmup 56 sec latency 13.513 ms oleg234-client.virtnet: 1 8530 1.13 MB/sec warmup 57 sec latency 12.796 ms oleg234-client.virtnet: 1 8929 1.11 MB/sec warmup 58 sec latency 12.875 ms oleg234-client.virtnet: 1 9446 1.12 MB/sec warmup 59 sec latency 13.639 ms oleg234-client.virtnet: 1 10667 5.61 MB/sec execute 1 sec latency 14.573 ms oleg234-client.virtnet: 1 10974 3.47 MB/sec execute 2 sec latency 15.740 ms oleg234-client.virtnet: 1 11524 2.62 MB/sec execute 3 sec latency 14.122 ms oleg234-client.virtnet: 1 11902 2.34 MB/sec execute 4 sec latency 15.731 ms oleg234-client.virtnet: 1 12314 2.50 MB/sec execute 5 sec latency 15.558 ms oleg234-client.virtnet: 1 12680 2.14 MB/sec execute 6 sec latency 13.082 ms oleg234-client.virtnet: 1 13144 2.15 MB/sec execute 7 sec latency 15.119 ms oleg234-client.virtnet: 1 14269 3.11 MB/sec execute 8 sec latency 13.985 ms oleg234-client.virtnet: 1 14566 2.93 MB/sec execute 9 sec latency 15.689 ms oleg234-client.virtnet: 1 14938 2.67 MB/sec execute 10 sec latency 17.621 ms oleg234-client.virtnet: 1 15382 2.58 MB/sec execute 11 sec latency 15.680 ms oleg234-client.virtnet: 1 15786 2.63 MB/sec execute 12 sec latency 15.207 ms oleg234-client.virtnet: 1 16358 2.47 MB/sec execute 13 sec latency 16.626 ms oleg234-client.virtnet: 1 17200 2.78 MB/sec execute 14 sec latency 15.211 ms oleg234-client.virtnet: 1 17861 2.94 MB/sec execute 15 sec latency 14.885 ms oleg234-client.virtnet: 1 18144 2.84 MB/sec execute 16 sec latency 
15.066 ms oleg234-client.virtnet: 1 18496 2.70 MB/sec execute 17 sec latency 13.879 ms oleg234-client.virtnet: 1 18838 2.64 MB/sec execute 18 sec latency 15.444 ms oleg234-client.virtnet: 1 19243 2.66 MB/sec executoleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2388 1285300 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2084 1285604 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 37616 3568396 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 12020 3594008 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 49636 7162404 1% /mnt/lustre test_70b fail mds2 4 times Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 11:29:06 (1713367746) shut down Failover mds2 to oleg234-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 11:29:22 (1713367762) targets are mounted 11:29:22 (1713367762) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2388 1285300 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2052 1285636 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 37604 3566660 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 12020 3594836 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 49624 7161496 1% /mnt/lustre test_70b fail mds1 5 times Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:29:35 (1713367775) shut down e 
19 sec latency 15.790 ms oleg234-client.virtnet: 1 19556 2.54 MB/sec execute 20 sec latency 16.589 ms oleg234-client.virtnet: 1 19947 2.44 MB/sec execute 21 sec latency 10.826 ms oleg234-client.virtnet: 1 20291 2.42 MB/sec execute 22 sec latency 14.991 ms oleg234-client.virtnet: 1 20989 2.56 MB/sec execute 23 sec latency 15.353 ms oleg234-client.virtnet: 1 21477 2.64 MB/sec execute 24 sec latency 14.083 ms oleg234-client.virtnet: 1 21748 2.59 MB/sec execute 25 sec latency 15.514 ms oleg234-client.virtnet: 1 22083 2.50 MB/sec execute 26 sec latency 20.654 ms oleg234-client.virtnet: 1 22432 2.47 MB/sec execute 27 sec latency 16.782 ms oleg234-client.virtnet: 1 22812 2.49 MB/sec execute 28 sec latency 64.630 ms oleg234-client.virtnet: 1 23100 2.41 MB/sec execute 29 sec latency 179.389 ms oleg234-client.virtnet: 1 23479 2.35 MB/sec execute 30 sec latency 11.402 ms oleg234-client.virtnet: 1 23767 2.32 MB/sec execute 31 sec latency 222.356 ms oleg234-client.virtnet: 1 23767 2.24 MB/sec execute 32 sec latency 1222.695 ms oleg234-client.virtnet: 1 23767 2.18 MB/sec execute 33 sec latency 2222.938 ms oleg234-client.virtnet: 1 23767 2.11 MB/sec execute 34 sec latency 3223.125 ms oleg234-client.virtnet: 1 23767 2.05 MB/sec execute 35 sec latency 4223.365 ms oleg234-client.virtnet: 1 23767 1.99 MB/sec execute 36 sec latency 5223.656 ms oleg234-client.virtnet: 1 23767 1.94 MB/sec execute 37 sec latency 6223.906 ms oleg234-client.virtnet: 1 23767 1.89 MB/sec execute 38 sec latency 7224.140 ms oleg234-client.virtnet: 1 23767 1.84 MB/sec execute 39 sec latency 8224.407 ms oleg234-client.virtnet: 1 23767 1.80 MB/sec execute 40 sec latency 9224.669 ms oleg234-client.virtnet: 1 23767 1.75 MB/sec execute 41 sec latency 10224.947 ms oleg234-client.virtnet: 1 23767 1.71 MB/sec execute 42 sec latency 11225.174 ms oleg234-client.virtnet: 1 23767 1.67 MB/sec execute 43 sec latency 12225.356 ms oleg234-client.virtnet: 1 23767 1.63 MB/sec execute 44 sec latency 13225.553 ms 
oleg234-client.virtnet: 1 23767 1.60 MB/sec execute 45 sec latency 14225.740 ms oleg234-client.virtnet: 1 23767 1.56 MB/sec execute 46 sec latency 15225.958 ms oleg234-client.virtnet: 1 23767 1.53 MB/sec execute 47 sec latency 16226.205 ms oleg234-client.virtnet: 1 23767 1.50 MB/sec execute 48 sec latency 17226.399 ms oleg234-client.virtnet: 1 23767 1.47 MB/sec execute 49 sec latency 18226.552 ms oleg234-client.virtnet: 1 23767 1.44 MB/sec execute 50 sec latency 19226.729 ms oleg234-client.virtnet: 1 24204 1.49 MB/sec execute 51 sec latency 19569.779 ms oleg234-client.virtnet: 1 24862 1.58 MB/sec execute 52 sec latency 14.844 ms oleg234-client.virtnet: 1 25168 1.57 MB/sec execute 53 sec latency 15.767 ms oleg234-client.virtnet: 1 25453 1.55 MB/sec execute 54 sec latency 16.263 ms oleg234-client.virtnet: 1 25785 1.54 MB/sec execute 55 sec latency 16.840 ms oleg234-client.virtnet: 1 26120 1.53 MB/sec execute 56 sec latency 15.425 ms oleg234-client.virtnet: 1 26513 1.56 MB/sec execute 57 sec latency 14.734 ms oleg234-client.virtnet: 1 26865 1.54 MB/sec execute 58 sec latency 15.924 ms oleg234-client.virtnet: 1 27304 1.54 MB/sec execute 59 sec latency 12.590 ms oleg234-client.virtnet: 1 28040 1.62 MB/sec execute 60 sec latency 19.087 ms oleg234-client.virtnet: 1 28538 1.66 MB/sec execute 61 sec latency 15.528 ms oleg234-client.virtnet: 1 28810 1.66 MB/sec execute 62 sec latency 15.888 ms oleg234-client.virtnet: 1 29139 1.64 MB/sec execute 63 sec latency 18.960 ms Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:29:50 (1713367790) targets are mounted 11:29:50 (1713367790) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE)
mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2388 1285300 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2076 1285612 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 37760 3568116 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 11884 3594144 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 49644 7162260 1% /mnt/lustre test_70b fail mds2 6 times Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 11:30:03 (1713367803) shut down Failover mds2 to oleg234-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 11:30:18 (1713367818) targets are mounted 11:30:18 (1713367818) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid oleg234-client.virtnet: 1 29454 1.64 MB/sec execute 64 sec latency 15.694 ms oleg234-client.virtnet: 1 29894 1.66 MB/sec execute 65 sec latency 15.511 ms oleg234-client.virtnet: 1 30210 1.64 MB/sec execute 66 sec latency 17.925 ms oleg234-client.virtnet: 1 30620 1.62 MB/sec execute 67 sec latency 10.894 ms oleg234-client.virtnet: 1 31011 1.63 MB/sec execute 68 sec latency 15.606 ms oleg234-client.virtnet: 1 31767 1.70 MB/sec execute 69 sec latency 15.265 ms oleg234-client.virtnet: 1 32198 1.74 MB/sec execute 70 sec latency 15.706 ms oleg234-client.virtnet: 1 32470 1.72 MB/sec execute 71 sec latency 15.487 ms oleg234-client.virtnet: 1 32838 1.70 MB/sec execute 72 sec latency 18.708 ms oleg234-client.virtnet: 1 33173 1.70 MB/sec execute 73 sec latency 15.544 ms oleg234-client.virtnet: 1 33582 1.72 MB/sec execute 74 sec latency 15.313 ms
oleg234-client.virtnet: 1 33942 1.70 MB/sec execute 75 sec latency 15.568 ms oleg234-client.virtnet: 1 34355 1.70 MB/sec execute 76 sec latency 15.296 ms oleg234-client.virtnet: 1 34939 1.75 MB/sec execute 77 sec latency 15.449 ms oleg234-client.virtnet: 1 35570 1.79 MB/sec execute 78 sec latency 15.602 ms oleg234-client.virtnet: 1 35873 1.79 MB/sec execute 79 sec latency 16.295 ms oleg234-client.virtnet: 1 36229 1.77 MB/sec execute 80 sec latency 15.602 ms oleg234-client.virtnet: 1 36567 1.77 MB/sec execute 81 sec latency 15.459 ms oleg234-client.virtnet: 1 36976 1.78 MB/sec execute 82 sec latency 16.376 ms oleg234-client.virtnet: 1 37311 1.77 MB/sec execute 83 sec latency 16.737 ms oleg234-client.virtnet: 1 37707 1.75 MB/sec execute 84 sec latency 10.666 ms oleg234-client.virtnet: 1 37956 1.74 MB/sec execute 85 sec latency 348.882 ms oleg234-client.virtnet: 1 38514 1.79 MB/sec execute 86 sec latency 14.944 ms oleg234-client.virtnet: 1 39183 1.83 MB/sec execute 87 sec latency 16.366 ms oleg234-client.virtnet: 1 39275 1.82 MB/sec execute 88 sec latency 743.391 ms oleg234-client.virtnet: 1 39275 1.80 MB/sec execute 89 sec latency 1743.657 ms oleg234-client.virtnet: 1 39275 1.78 MB/sec execute 90 sec latency 2743.910 ms oleg234-client.virtnet: 1 39275 1.76 MB/sec execute 91 sec latency 3744.138 ms oleg234-client.virtnet: 1 39275 1.74 MB/sec execute 92 sec latency 4744.418 ms oleg234-client.virtnet: 1 39275 1.72 MB/sec execute 93 sec latency 5744.673 ms oleg234-client.virtnet: 1 39275 1.70 MB/sec execute 94 sec latency 6744.890 ms oleg234-client.virtnet: 1 39275 1.68 MB/sec execute 95 sec latency 7745.100 ms oleg234-client.virtnet: 1 39275 1.67 MB/sec execute 96 sec latency 8745.417 ms oleg234-client.virtnet: 1 39275 1.65 MB/sec execute 97 sec latency 9745.674 ms oleg234-client.virtnet: 1 39275 1.63 MB/sec execute 98 sec latency 10745.889 ms oleg234-client.virtnet: 1 39275 1.62 MB/sec execute 99 sec latency 11746.119 ms oleg234-client.virtnet: 1 39275 1.60 MB/sec 
execute 100 sec latency 12746.353 ms oleg234-client.virtnet: 1 39275 1.58 MB/sec execute 101 sec latency 13746.595 ms oleg234-client.virtnet: 1 39275 1.57 MB/sec execute 102 sec latency 14747.029 ms oleg234-client.virtnet: 1 39275 1.55 MB/sec execute 103 sec latency 15747.290 ms oleg234-client.virtnet: 1 39275 1.54 MB/sec execute 104 sec latency 16747.525 ms oleg234-client.virtnet: 1 39275 1.52 MB/sec execute 105 sec latency 17747.760 ms oleg234-client.virtnet: 1 39275 1.51 MB/sec execute 106 sec latency 18747.977 ms oleg234-client.virtnet: 1 39275 1.49 MB/sec execute 107 sec latency 19748.151 ms mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2388 1285300 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2052 1285636 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 37592 3569372 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 11888 3594416 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 49480 7163788 1% /mnt/lustre test_70b fail mds1 7 times Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:30:32 (1713367832) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:30:47 (1713367847) targets are mounted 11:30:47 (1713367847) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2388 1285300 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2076 1285612 1% /mnt/lustre[MDT:1]
lustre-OST0000_UUID 3833116 37596 3568800 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 11892 3594268 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 49488 7163068 1% /mnt/lustre test_70b fail mds2 8 times Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 11:31:00 (1713367860) shut down oleg234-client.virtnet: 1 39501 1.48 MB/sec execute 108 sec latency 19782.748 ms oleg234-client.virtnet: 1 39790 1.47 MB/sec execute 109 sec latency 17.601 ms oleg234-client.virtnet: 1 40148 1.47 MB/sec execute 110 sec latency 15.685 ms oleg234-client.virtnet: 1 40551 1.49 MB/sec execute 111 sec latency 16.054 ms oleg234-client.virtnet: 1 40853 1.48 MB/sec execute 112 sec latency 16.939 ms oleg234-client.virtnet: 1 41241 1.47 MB/sec execute 113 sec latency 11.076 ms oleg234-client.virtnet: 1 41632 1.48 MB/sec execute 114 sec latency 15.481 ms oleg234-client.virtnet: 1 42315 1.52 MB/sec execute 115 sec latency 15.837 ms oleg234-client.virtnet: 1 42795 1.54 MB/sec execute 116 sec latency 12.917 ms oleg234-client.virtnet: 1 43045 1.53 MB/sec execute 117 sec latency 15.707 ms oleg234-client.virtnet: 1 43387 1.52 MB/sec execute 118 sec latency 19.121 ms oleg234-client.virtnet: 1 43729 1.52 MB/sec execute 119 sec latency 15.381 ms oleg234-client.virtnet: 1 44108 1.54 MB/sec execute 120 sec latency 16.223 ms oleg234-client.virtnet: 1 44424 1.53 MB/sec execute 121 sec latency 17.345 ms oleg234-client.virtnet: 1 44805 1.52 MB/sec execute 122 sec latency 11.738 ms oleg234-client.virtnet: 1 45200 1.52 MB/sec execute 123 sec latency 15.689 ms oleg234-client.virtnet: 1 45943 1.56 MB/sec execute 124 sec latency 15.383 ms oleg234-client.virtnet: 1 46362 1.59 MB/sec execute 125 sec latency 16.118 ms oleg234-client.virtnet: 1 46602 1.58 MB/sec execute 126 sec latency 16.211 ms oleg234-client.virtnet: 1 46945 1.57 MB/sec execute 127 sec latency 19.504 ms oleg234-client.virtnet: 1 47304 1.57 MB/sec execute 128 sec latency 15.910 ms oleg234-client.virtnet: 1 47710 1.58 MB/sec execute 129 sec latency
14.876 ms oleg234-client.virtnet: 1 48017 1.57 MB/sec execute 130 sec latency 16.470 ms oleg234-client.virtnet: 1 48481 1.57 MB/sec execute 131 sec latency 11.183 ms oleg234-client.virtnet: 1 48917 1.59 MB/sec execute 132 sec latency 15.357 ms oleg234-client.virtnet: 1 49540 1.60 MB/sec execute 133 sec latency 15.801 ms oleg234-client.virtnet: 1 49947 1.62 MB/sec execute 134 sec latency 16.656 ms oleg234-client.virtnet: 1 50214 1.61 MB/sec execute 135 sec latency 15.198 ms oleg234-client.virtnet: 1 50587 1.61 MB/sec execute 136 sec latency 16.556 ms oleg234-client.virtnet: 1 50939 1.60 MB/sec execute 137 sec latency 18.889 ms oleg234-client.virtnet: 1 51342 1.61 MB/sec execute 138 sec latency 15.514 ms oleg234-client.virtnet: 1 51696 1.61 MB/sec execute 139 sec latency 17.737 ms oleg234-client.virtnet: 1 52108 1.61 MB/sec execute 140 sec latency 11.958 ms oleg234-client.virtnet: 1 52699 1.63 MB/sec execute 141 sec latency 14.762 ms oleg234-client.virtnet: 1 53386 1.66 MB/sec execute 142 sec latency 13.869 ms oleg234-client.virtnet: 1 53763 1.66 MB/sec execute 143 sec latency 12.143 ms oleg234-client.virtnet: 1 54215 1.65 MB/sec execute 144 sec latency 17.090 ms oleg234-client.virtnet: 1 54274 1.65 MB/sec execute 145 sec latency 926.355 ms oleg234-client.virtnet: 1 54274 1.63 MB/sec execute 146 sec latency 1926.583 ms oleg234-client.virtnet: 1 54274 1.62 MB/sec execute 147 sec latency 2926.868 ms oleg234-client.virtnet: 1 54274 1.61 MB/sec execute 148 sec latency 3927.139 ms oleg234-client.virtnet: 1 54274 1.60 MB/sec execute 149 sec latency 4927.378 ms oleg234-client.virtnet: 1 54274 1.59 MB/sec execute 150 sec latency 5927.610 ms oleg234-client.virtnet: 1 54274 1.58 MB/sec execute 151 sec latency 6927.856 ms oleg234-client.virtnet: 1 54274 1.57 MB/sec execute 152 sec latency 7928.137 ms Failover mds2 to oleg234-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server:
oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 11:31:15 (1713367875) targets are mounted 11:31:15 (1713367875) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2388 1285300 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2052 1285636 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 37592 3569144 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 11888 3594992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 49480 7164136 1% /mnt/lustre test_70b fail mds1 9 times Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:31:28 (1713367888) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:31:43 (1713367903) targets are mounted 11:31:43 (1713367903) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2388 1285300 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2076 1285612 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 37604 3566704 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 11972 3593980 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 49576 7160684 1% /mnt/lustre oleg234-client.virtnet: 1 54274 1.56 MB/sec execute 153 sec latency 8928.365 ms
oleg234-client.virtnet: 1 54274 1.55 MB/sec execute 154 sec latency 9928.585 ms oleg234-client.virtnet: 1 54274 1.54 MB/sec execute 155 sec latency 10928.776 ms oleg234-client.virtnet: 1 54274 1.53 MB/sec execute 156 sec latency 11929.001 ms oleg234-client.virtnet: 1 54274 1.52 MB/sec execute 157 sec latency 12929.183 ms oleg234-client.virtnet: 1 54274 1.51 MB/sec execute 158 sec latency 13929.371 ms oleg234-client.virtnet: 1 54274 1.50 MB/sec execute 159 sec latency 14929.614 ms oleg234-client.virtnet: 1 54274 1.49 MB/sec execute 160 sec latency 15929.849 ms oleg234-client.virtnet: 1 54274 1.48 MB/sec execute 161 sec latency 16930.056 ms oleg234-client.virtnet: 1 54274 1.47 MB/sec execute 162 sec latency 17930.319 ms oleg234-client.virtnet: 1 54274 1.46 MB/sec execute 163 sec latency 18930.538 ms oleg234-client.virtnet: 1 54383 1.46 MB/sec execute 164 sec latency 19641.352 ms oleg234-client.virtnet: 1 54781 1.47 MB/sec execute 165 sec latency 17.390 ms oleg234-client.virtnet: 1 55084 1.46 MB/sec execute 166 sec latency 17.406 ms oleg234-client.virtnet: 1 55539 1.46 MB/sec execute 167 sec latency 11.236 ms oleg234-client.virtnet: 1 56115 1.48 MB/sec execute 168 sec latency 15.664 ms oleg234-client.virtnet: 1 56792 1.50 MB/sec execute 169 sec latency 13.273 ms oleg234-client.virtnet: 1 57110 1.50 MB/sec execute 170 sec latency 12.738 ms oleg234-client.virtnet: 1 57405 1.50 MB/sec execute 171 sec latency 16.973 ms oleg234-client.virtnet: 1 57726 1.49 MB/sec execute 172 sec latency 22.717 ms oleg234-client.virtnet: 1 58167 1.51 MB/sec execute 173 sec latency 14.997 ms oleg234-client.virtnet: 1 58484 1.50 MB/sec execute 174 sec latency 16.810 ms oleg234-client.virtnet: 1 58848 1.49 MB/sec execute 175 sec latency 11.387 ms oleg234-client.virtnet: 1 59230 1.49 MB/sec execute 176 sec latency 14.925 ms oleg234-client.virtnet: 1 59788 1.52 MB/sec execute 177 sec latency 15.633 ms oleg234-client.virtnet: 1 60413 1.54 MB/sec execute 178 sec latency 15.273 ms 
oleg234-client.virtnet: 1 60700 1.54 MB/sec execute 179 sec latency 15.780 ms oleg234-client.virtnet: 1 61035 1.53 MB/sec execute 180 sec latency 22.729 ms oleg234-client.virtnet: 1 61326 1.52 MB/sec execute 181 sec latency 16.931 ms oleg234-client.virtnet: 1 61764 1.54 MB/sec execute 182 sec latency 15.632 ms oleg234-client.virtnet: 1 62091 1.53 MB/sec execute 183 sec latency 17.022 ms oleg234-client.virtnet: 1 62487 1.53 MB/sec execute 184 sec latency 10.582 ms oleg234-client.virtnet: 1 62888 1.53 MB/sec execute 185 sec latency 15.458 ms oleg234-client.virtnet: 1 63691 1.56 MB/sec execute 186 sec latency 15.602 ms oleg234-client.virtnet: 1 64150 1.57 MB/sec execute 187 sec latency 15.524 ms oleg234-client.virtnet: 1 64428 1.56 MB/sec execute 188 sec latency 15.052 ms oleg234-client.virtnet: 1 64772 1.56 MB/sec execute 189 sec latency 17.951 ms oleg234-client.virtnet: 1 65104 1.56 MB/sec execute 190 sec latency 15.423 ms oleg234-client.virtnet: 1 65494 1.57 MB/sec execute 191 sec latency 16.787 ms oleg234-client.virtnet: 1 65850 1.56 MB/sec execute 192 sec latency 18.017 ms oleg234-client.virtnet: 1 66258 1.56 MB/sec execute 193 sec latency 16.838 ms oleg234-client.virtnet: 1 66785 1.58 MB/sec execute 194 sec latency 15.983 ms oleg234-client.virtnet: 1 67428 1.60 MB/sec execute 195 sec latency 15.137 ms oleg234-client.virtnet: 1 67758 1.60 MB/sec execute 196 sec latency 14.548 ms oleg234-client.virtnet: 1 68085 1.59 MB/sec execute 197 sec latency 15.700 ms test_70b fail mds2 10 times Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 11:31:56 (1713367916) shut down Failover mds2 to oleg234-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 11:32:12 (1713367932)
targets are mounted 11:32:12 (1713367932) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2388 1285300 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2052 1285636 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 37608 3568640 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 11900 3593352 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 49508 7161992 1% /mnt/lustre test_70b fail mds1 11 times Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:32:25 (1713367945) shut down Failover mds1 to oleg234-server mount facets: mds1 oleg234-client.virtnet: 1 68191 1.58 MB/sec execute 198 sec latency 638.516 ms oleg234-client.virtnet: 1 68511 1.58 MB/sec execute 199 sec latency 15.663 ms oleg234-client.virtnet: 1 68926 1.59 MB/sec execute 200 sec latency 16.133 ms oleg234-client.virtnet: 1 69110 1.58 MB/sec execute 201 sec latency 446.104 ms oleg234-client.virtnet: 1 69110 1.58 MB/sec execute 202 sec latency 1446.315 ms oleg234-client.virtnet: 1 69110 1.57 MB/sec execute 203 sec latency 2446.568 ms oleg234-client.virtnet: 1 69110 1.56 MB/sec execute 204 sec latency 3446.797 ms oleg234-client.virtnet: 1 69110 1.55 MB/sec execute 205 sec latency 4447.010 ms oleg234-client.virtnet: 1 69110 1.55 MB/sec execute 206 sec latency 5447.220 ms oleg234-client.virtnet: 1 69110 1.54 MB/sec execute 207 sec latency 6447.515 ms oleg234-client.virtnet: 1 69110 1.53 MB/sec execute 208 sec latency 7447.755 ms oleg234-client.virtnet: 1 69110 1.52 MB/sec execute 209 sec latency 8447.934 ms oleg234-client.virtnet: 1 69110 1.52 MB/sec execute 210 sec latency 9448.104 ms oleg234-client.virtnet: 1 69110 1.51 MB/sec execute 211 sec latency 10448.361 ms oleg234-client.virtnet: 1 69110 1.50 MB/sec execute 212 sec latency 11448.592 ms
oleg234-client.virtnet: 1 69110 1.50 MB/sec execute 213 sec latency 12448.800 ms oleg234-client.virtnet: 1 69110 1.49 MB/sec execute 214 sec latency 13448.986 ms oleg234-client.virtnet: 1 69110 1.48 MB/sec execute 215 sec latency 14449.166 ms oleg234-client.virtnet: 1 69110 1.47 MB/sec execute 216 sec latency 15449.343 ms oleg234-client.virtnet: 1 69110 1.47 MB/sec execute 217 sec latency 16449.537 ms oleg234-client.virtnet: 1 69110 1.46 MB/sec execute 218 sec latency 17449.733 ms oleg234-client.virtnet: 1 69110 1.45 MB/sec execute 219 sec latency 18450.002 ms oleg234-client.virtnet: 1 69110 1.45 MB/sec execute 220 sec latency 19450.235 ms oleg234-client.virtnet: 1 69390 1.44 MB/sec execute 221 sec latency 19597.385 ms oleg234-client.virtnet: 1 69779 1.44 MB/sec execute 222 sec latency 13.678 ms oleg234-client.virtnet: 1 70296 1.45 MB/sec execute 223 sec latency 15.711 ms oleg234-client.virtnet: 1 70918 1.48 MB/sec execute 224 sec latency 15.727 ms oleg234-client.virtnet: 1 71229 1.48 MB/sec execute 225 sec latency 16.182 ms oleg234-client.virtnet: 1 71480 1.47 MB/sec execute 226 sec latency 15.512 ms oleg234-client.virtnet: 1 71823 1.47 MB/sec execute 227 sec latency 15.999 ms oleg234-client.virtnet: 1 72142 1.47 MB/sec execute 228 sec latency 16.154 ms oleg234-client.virtnet: 1 72559 1.47 MB/sec execute 229 sec latency 15.864 ms oleg234-client.virtnet: 1 73009 1.47 MB/sec execute 230 sec latency 15.280 ms oleg234-client.virtnet: 1 73405 1.47 MB/sec execute 231 sec latency 14.997 ms oleg234-client.virtnet: 1 73927 1.49 MB/sec execute 232 sec latency 15.968 ms oleg234-client.virtnet: 1 74565 1.50 MB/sec execute 233 sec latency 15.414 ms oleg234-client.virtnet: 1 74862 1.50 MB/sec execute 234 sec latency 15.228 ms oleg234-client.virtnet: 1 75195 1.50 MB/sec execute 235 sec latency 15.749 ms oleg234-client.virtnet: 1 75487 1.49 MB/sec execute 236 sec latency 20.004 ms oleg234-client.virtnet: 1 75909 1.50 MB/sec execute 237 sec latency 15.782 ms 
oleg234-client.virtnet: 1 76244 1.50 MB/sec execute 238 sec latency 16.266 ms oleg234-client.virtnet: 1 76604 1.49 MB/sec execute 239 sec latency 14.011 ms oleg234-client.virtnet: 1 76986 1.49 MB/sec execute 240 sec latency 15.236 ms oleg234-client.virtnet: 1 77555 1.51 MB/sec execute 241 sec latency 15.788 ms Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:32:40 (1713367960) targets are mounted 11:32:40 (1713367960) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec oleg234-client.virtnet: 1 78185 1.53 MB/sec execute 242 sec latency 15.525 ms oleg234-client.virtnet: 1 78470 1.53 MB/sec execute 243 sec latency 15.785 ms oleg234-client.virtnet: 1 78797 1.52 MB/sec execute 244 sec latency 18.919 ms oleg234-client.virtnet: 1 79086 1.52 MB/sec execute 245 sec latency 17.010 ms oleg234-client.virtnet: 1 79519 1.53 MB/sec execute 246 sec latency 15.890 ms oleg234-client.virtnet: 1 79899 1.52 MB/sec execute 247 sec latency 13.763 ms oleg234-client.virtnet: 1 80440 1.52 MB/sec execute 248 sec latency 9.304 ms oleg234-client.virtnet: 1 81272 1.54 MB/sec execute 249 sec latency 13.634 ms oleg234-client.virtnet: 1 81903 1.56 MB/sec execute 250 sec latency 13.734 ms oleg234-client.virtnet: 1 82321 1.55 MB/sec execute 251 sec latency 12.950 ms oleg234-client.virtnet: 1 82746 1.55 MB/sec execute 252 sec latency 12.351 ms oleg234-client.virtnet: 1 83213 1.56 MB/sec execute 253 sec latency 14.148 ms oleg234-client.virtnet: 1 83671 1.56 MB/sec execute 254 sec latency 14.876 ms oleg234-client.virtnet: 1 84209 1.56 MB/sec execute 255 sec latency 12.367 ms oleg234-client.virtnet: 1 84997 1.58
MB/sec execute 256 sec latency 16.004 ms oleg234-client.virtnet: 1 85485 1.59 MB/sec execute 257 sec latency 12.315 ms oleg234-client.virtnet: 1 85923 1.59 MB/sec execute 258 sec latency 12.775 ms oleg234-client.virtnet: 1 86338 1.59 MB/sec execute 259 sec latency 12.550 ms oleg234-client.virtnet: 1 86769 1.59 MB/sec execute 260 sec latency 14.008 ms oleg234-client.virtnet: 1 87194 1.59 MB/sec execute 261 sec latency 15.914 ms oleg234-client.virtnet: 1 87674 1.59 MB/sec execute 262 sec latency 11.359 ms oleg234-client.virtnet: 1 88667 1.62 MB/sec execute 263 sec latency 12.999 ms oleg234-client.virtnet: 1 89014 1.62 MB/sec execute 264 sec latency 16.425 ms oleg234-client.virtnet: 1 89328 1.61 MB/sec execute 265 sec latency 14.755 ms oleg234-client.virtnet: 1 89655 1.61 MB/sec execute 266 sec latency 17.344 ms oleg234-client.virtnet: 1 90120 1.62 MB/sec execute 267 sec latency 15.519 ms oleg234-client.virtnet: 1 90430 1.62 MB/sec execute 268 sec latency 17.450 ms oleg234-client.virtnet: 1 90800 1.61 MB/sec execute 269 sec latency 11.967 ms oleg234-client.virtnet: 1 91183 1.61 MB/sec execute 270 sec latency 15.201 ms oleg234-client.virtnet: 1 91744 1.63 MB/sec execute 271 sec latency 15.678 ms oleg234-client.virtnet: 1 92376 1.64 MB/sec execute 272 sec latency 15.997 ms oleg234-client.virtnet: 1 92665 1.64 MB/sec execute 273 sec latency 15.254 ms oleg234-client.virtnet: 1 93017 1.63 MB/sec execute 274 sec latency 20.638 ms oleg234-client.virtnet: 1 93409 1.63 MB/sec execute 275 sec latency 16.741 ms oleg234-client.virtnet: 1 93831 1.64 MB/sec execute 276 sec latency 15.311 ms oleg234-client.virtnet: 1 94258 1.63 MB/sec execute 277 sec latency 15.926 ms oleg234-client.virtnet: 1 94678 1.63 MB/sec execute 278 sec latency 13.482 ms oleg234-client.virtnet: 1 95240 1.65 MB/sec execute 279 sec latency 15.530 ms oleg234-client.virtnet: 1 95850 1.66 MB/sec execute 280 sec latency 15.293 ms oleg234-client.virtnet: 1 96155 1.66 MB/sec execute 281 sec latency 14.989 ms 
oleg234-client.virtnet: 1 96478 1.65 MB/sec execute 282 sec latency 15.975 ms oleg234-client.virtnet: 1 96775 1.65 MB/sec execute 283 sec latency 20.154 ms oleg234-client.virtnet: 1 97201 1.66 MB/sec execute 284 sec latency 15.210 ms oleg234-client.virtnet: 1 97511 1.65 MB/sec execute 285 sec latency 16.438 ms oleg234-client.virtnet: 1 97877 1.65 MB/sec execute 286 sec latency 14.464 ms oleg234-client.virtnet: 1 98284 1.65 MB/sec execute 287 sec latency 15.574 ms oleg234-client.virtnet: 1 98840 1.66 MB/sec execute 288 sec latency 15.104 ms oleg234-client.virtnet: 1 99480 1.67 MB/sec execute 289 sec latency 14.838 ms oleg234-client.virtnet: 1 99772 1.67 MB/sec execute 290 sec latency 15.737 ms oleg234-client.virtnet: 1 100151 1.67 MB/sec execute 291 sec latency 17.801 ms oleg234-client.virtnet: 1 100482 1.67 MB/sec execute 292 sec latency 16.195 ms oleg234-client.virtnet: 1 100873 1.67 MB/sec execute 293 sec latency 15.193 ms oleg234-client.virtnet: 1 101195 1.67 MB/sec execute 294 sec latency 16.586 ms oleg234-client.virtnet: 1 101599 1.66 MB/sec execute 295 sec latency 11.415 ms oleg234-client.virtnet: 1 101968 1.67 MB/sec execute 296 sec latency 14.993 ms oleg234-client.virtnet: 1 102865 1.69 MB/sec execute 297 sec latency 14.917 ms oleg234-client.virtnet: 1 103186 1.69 MB/sec execute 298 sec latency 15.602 ms oleg234-client.virtnet: 1 103502 1.69 MB/sec execute 299 sec latency 15.661 ms oleg234-client.virtnet: 1 cleanup 300 sec oleg234-client.virtnet: 0 cleanup 300 sec oleg234-client.virtnet: oleg234-client.virtnet: Operation Count AvgLat MaxLat oleg234-client.virtnet: ---------------------------------------- oleg234-client.virtnet: NTCreateX 16253 11.086 19782.713 oleg234-client.virtnet: Close 11903 1.659 9.038 oleg234-client.virtnet: Rename 687 10.757 18.236 oleg234-client.virtnet: Unlink 3313 2.767 7.798 oleg234-client.virtnet: Qpathinfo 14693 3.349 19641.326 oleg234-client.virtnet: Qfileinfo 2551 0.204 2.577 oleg234-client.virtnet: Qfsinfo 2714 0.344 14.530 
oleg234-client.virtnet: Sfileinfo 1313 5.759 12.581 oleg234-client.virtnet: Find 5695 0.594 12.803 oleg234-client.virtnet: WriteX 8008 1.297 8.421 oleg234-client.virtnet: ReadX 25455 0.025 7.902 oleg234-client.virtnet: LockX 52 1.636 2.032 oleg234-client.virtnet: UnlockX 52 1.731 3.630 oleg234-client.virtnet: Flush 1146 8.627 638.404 oleg234-client.virtnet: oleg234-client.virtnet: Throughput 1.68714 MB/sec 1 clients 1 procs max_latency=19782.748 ms oleg234-client.virtnet: stopping dbench on /mnt/lustre/d70b.replay-single/oleg234-client.virtnet at Wed Apr 17 11:33:35 EDT 2024 with return code 0 oleg234-client.virtnet: clean dbench files on /mnt/lustre/d70b.replay-single/oleg234-client.virtnet oleg234-client.virtnet: /mnt/lustre/d70b.replay-single/oleg234-client.virtnet /mnt/lustre/d70b.replay-single/oleg234-client.virtnet oleg234-client.virtnet: removed directory: 'clients/client0/~dmtmp/PWRPNT' oleg234-client.virtnet: removed directory: 'clients/client0/~dmtmp/WORDPRO' oleg234-client.virtnet: removed directory: 'clients/client0/~dmtmp/WORD' oleg234-client.virtnet: removed directory: 'clients/client0/~dmtmp/EXCEL' oleg234-client.virtnet: removed directory: 'clients/client0/~dmtmp/ACCESS' oleg234-client.virtnet: removed directory: 'clients/client0/~dmtmp/PM' oleg234-client.virtnet: removed directory: 'clients/client0/~dmtmp/SEED' oleg234-client.virtnet: removed directory: 'clients/client0/~dmtmp/COREL' oleg234-client.virtnet: removed directory: 'clients/client0/~dmtmp/PARADOX' oleg234-client.virtnet: removed directory: 'clients/client0/~dmtmp' oleg234-client.virtnet: removed directory: 'clients/client0' oleg234-client.virtnet: removed directory: 'clients' oleg234-client.virtnet: removed 'client.txt' oleg234-client.virtnet: /mnt/lustre/d70b.replay-single/oleg234-client.virtnet oleg234-client.virtnet: dbench successfully finished PASS 70b (363s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70c: tar 2mdts 
recovery ============ 11:33:39 (1713368019) Starting client oleg234-client.virtnet: -o user_xattr,flock oleg234-server@tcp:/lustre /mnt/lustre Started clients oleg234-client.virtnet: 192.168.202.134@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Started tar 2236 striped dir -i0 -c2 -H crush2 /mnt/lustre/d70c.replay-single tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 6628 1281060 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 4456 1283232 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 14468 3579168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 10312 3585216 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 24780 7164384 1% /mnt/lustre test_70c fail mds2 1 times Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 11:35:55 (1713368155) shut down Failover mds2 to oleg234-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 11:36:10 (1713368170) targets are mounted 11:36:10 (1713368170) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets UUID 
1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 6908 1280780 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 4292 1283396 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 6764 3586048 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 24008 3564388 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 30772 7150436 1% /mnt/lustre test_70c fail mds1 2 times Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:38:37 (1713368317) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:38:52 (1713368332) targets are mounted 11:38:52 (1713368332) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 70c (350s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70d: mkdir/rmdir striped dir 2mdts recovery ========================================================== 11:39:30 (1713368370) Starting client oleg234-client.virtnet: -o user_xattr,flock oleg234-server@tcp:/lustre /mnt/lustre Started clients oleg234-client.virtnet: 192.168.202.134@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Started 5574 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 4028 1283660 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3584 1284104 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3604396 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3603772 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7208168 1% 
/mnt/lustre test_70d fail mds2 1 times Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 11:41:45 (1713368505) shut down Failover mds2 to oleg234-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 11:41:59 (1713368519) targets are mounted 11:41:59 (1713368519) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 4696 1282992 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 4584 1283104 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3604396 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3603772 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7208168 1% /mnt/lustre test_70d fail mds2 2 times Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 11:44:22 (1713368662) shut down Failover mds2 to oleg234-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 11:44:37 (1713368677) targets are mounted 11:44:37 (1713368677) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec /home/green/git/lustre-release/lustre/tests/test-framework.sh: line 4710: 5574 Killed ( while true; do $LFS mkdir -i0 -c2 
$DIR/$tdir/test || { echo "mkdir fails"; break; }; $LFS mkdir -i1 -c2 $DIR/$tdir/test1 || { echo "mkdir fails"; break; }; touch $DIR/$tdir/test/a || { echo "touch fails"; break; }; mkdir $DIR/$tdir/test/b || { echo "mkdir fails"; break; }; rm -rf $DIR/$tdir/test || { echo "rmdir fails"; ls -lR $DIR/$tdir; break; }; touch $DIR/$tdir/test1/a || { echo "touch fails"; break; }; mkdir $DIR/$tdir/test1/b || { echo "mkdir fails"; break; }; rm -rf $DIR/$tdir/test1 || { echo "rmdir fails"; ls -lR $DIR/$tdir/test1; break; }; done ) (wd: ~) PASS 70d (316s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70e: rename cross-MDT with random fails ========================================================== 11:44:48 (1713368688) debug=+ha Starting client oleg234-client.virtnet: -o user_xattr,flock oleg234-server@tcp:/lustre /mnt/lustre Started clients oleg234-client.virtnet: 192.168.202.134@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Started PID=26629 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3348 1284340 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3148 1284540 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3604396 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3603772 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7208168 1% /mnt/lustre test_70e fail mds1 1 times Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:47:03 (1713368823) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:47:17 (1713368837) targets are mounted 11:47:17 
(1713368837) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2768 1284920 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2236 1285452 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3604396 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3603772 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7208168 1% /mnt/lustre test_70e fail mds1 2 times Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:49:39 (1713368979) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:49:53 (1713368993) targets are mounted 11:49:53 (1713368993) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 70e (314s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70f: OSS O_DIRECT recovery with 1 clients ========================================================== 11:50:04 (1713369004) mount clients oleg234-client.virtnet ... Starting client oleg234-client.virtnet: -o user_xattr,flock oleg234-server@tcp:/lustre /mnt/lustre Started clients oleg234-client.virtnet: 192.168.202.134@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2524 1285164 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2240 1285448 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3599240 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3601352 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7200592 1% /mnt/lustre Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... test_70f failing OST 1 times Failing ost1 on oleg234-server Stopping /mnt/lustre-ost1 (opts:) on oleg234-server 11:50:11 (1713369011) shut down Failover ost1 to oleg234-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-OST0000 11:50:25 (1713369025) targets are mounted 11:50:25 (1713369025) facet_failover done ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2524 1285164 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2240 1285448 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3586760 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3593040 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7179800 1% /mnt/lustre ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
test_70f failing OST 2 times Failing ost1 on oleg234-server Stopping /mnt/lustre-ost1 (opts:) on oleg234-server 11:50:38 (1713369038) shut down Failover ost1 to oleg234-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-OST0000 11:50:52 (1713369052) targets are mounted 11:50:52 (1713369052) facet_failover done ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2524 1285164 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2240 1285448 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 5684 3593048 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3576416 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 7232 7169464 1% /mnt/lustre Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
test_70f failing OST 3 times Failing ost1 on oleg234-server Stopping /mnt/lustre-ost1 (opts:) on oleg234-server 11:51:05 (1713369065) shut down Failover ost1 to oleg234-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Started lustre-OST0000 11:51:19 (1713369079) targets are mounted 11:51:19 (1713369079) facet_failover done Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2524 1285164 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2240 1285448 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3567992 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3580536 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7148528 1% /mnt/lustre ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
test_70f failing OST 4 times Failing ost1 on oleg234-server Stopping /mnt/lustre-ost1 (opts:) on oleg234-server 11:51:32 (1713369092) shut down Failover ost1 to oleg234-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-OST0000 11:51:46 (1713369106) targets are mounted 11:51:46 (1713369106) facet_failover done ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2524 1285164 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2240 1285448 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3592952 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 5644 3593064 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 7232 7186016 1% /mnt/lustre ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
test_70f failing OST 5 times Failing ost1 on oleg234-server Stopping /mnt/lustre-ost1 (opts:) on oleg234-server 11:51:58 (1713369118) shut down Failover ost1 to oleg234-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-OST0000 11:52:12 (1713369132) targets are mounted 11:52:12 (1713369132) facet_failover done ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... ldlm.namespaces.MGC192.168.202.134@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aab95000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aab95000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg234-client.virtnet' ... 
PASS 70f (135s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 71a: mkdir/rmdir striped dir with 2 mdts recovery ========================================================== 11:52:21 (1713369141) Starting client oleg234-client.virtnet: -o user_xattr,flock oleg234-server@tcp:/lustre /mnt/lustre Started clients oleg234-client.virtnet: 192.168.202.134@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Started 13774 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2880 1284808 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 4032 1283656 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3040 1284648 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 4204 1283484 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre fail mds2 mds1 1 times Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:54:38 (1713369278) shut down Failover mds1 to oleg234-server Failover mds2 to oleg234-server mount facets: mds2 mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super 
lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 11:54:52 (1713369292) targets are mounted 11:54:52 (1713369292) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3080 1284608 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 4228 1283460 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3252 1284436 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2348 1285340 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre fail mds2 mds1 2 times Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:57:21 (1713369441) shut down Failover mds1 to oleg234-server Failover mds2 to oleg234-server mount facets: mds2 mount facets: mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all 
pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 11:57:35 (1713369455) targets are mounted 11:57:35 (1713369455) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec /home/green/git/lustre-release/lustre/tests/test-framework.sh: line 4710: 13774 Killed ( while true; do $LFS mkdir -i0 -c2 $DIR/$tdir/test; rmdir $DIR/$tdir/test; done ) (wd: ~) PASS 71a (329s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 73a: open(O_CREAT), unlink, replay, reconnect before open replay, close ========================================================== 11:57:50 (1713369470) multiop /mnt/lustre/f73a.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7528 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 4188 1283500 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 4016 1283672 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre fail_loc=0x80000302 Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:57:54 (1713369474) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:58:08 (1713369488) targets are mounted 11:58:08 (1713369488) facet_failover 
done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 73a (37s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 73b: open(O_CREAT), unlink, replay, reconnect at open_replay reply, close ========================================================== 11:58:28 (1713369508) multiop /mnt/lustre/f73b.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7528 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2420 1285268 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2012 1285676 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre fail_loc=0x80000157 Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:58:32 (1713369512) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:58:46 (1713369526) targets are mounted 11:58:46 (1713369526) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 73b (39s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 74: Ensure applications don't fail waiting for OST recovery ========================================================== 11:59:09 (1713369549) Stopping clients: oleg234-client.virtnet /mnt/lustre (opts:) Stopping client 
oleg234-client.virtnet /mnt/lustre opts: Stopping /mnt/lustre-ost1 (opts:) on oleg234-server Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:59:11 (1713369551) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:59:24 (1713369564) targets are mounted 11:59:24 (1713369564) facet_failover done Starting client oleg234-client.virtnet: -o user_xattr,flock oleg234-server@tcp:/lustre /mnt/lustre Started clients oleg234-client.virtnet: 192.168.202.134@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-OST0000 PASS 74 (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80a: DNE: create remote dir, drop update rep from MDT0, fail MDT0 ========================================================== 11:59:37 (1713369577) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2436 1285252 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2052 1285636 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 11:59:40 
(1713369580) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 11:59:54 (1713369594) targets are mounted 11:59:54 (1713369594) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.05 seconds: 412.74 ops/second PASS 80a (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80b: DNE: create remote dir, drop update rep from MDT0, fail MDT1 ========================================================== 12:00:03 (1713369603) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2464 1285224 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2052 1285636 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2464 1285224 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2052 1285636 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 12:00:14 (1713369614) shut down Failover mds2 to oleg234-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace 
dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 12:00:28 (1713369628) targets are mounted 12:00:28 (1713369628) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.04 seconds: 457.49 ops/second PASS 80b (32s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80c: DNE: create remote dir, drop update rep from MDT1, fail MDT[0,1] ========================================================== 12:00:37 (1713369637) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2500 1285188 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2088 1285600 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2500 1285188 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2088 1285600 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 12:00:43 (1713369643) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 12:00:56 (1713369656) targets are mounted 12:00:56 (1713369656) facet_failover done 
oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 12:01:03 (1713369663) shut down Failover mds2 to oleg234-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 12:01:17 (1713369677) targets are mounted 12:01:17 (1713369677) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.05 seconds: 433.70 ops/second PASS 80c (47s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80d: DNE: create remote dir, drop update rep from MDT1, fail 2 MDTs ========================================================== 12:01:26 (1713369686) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2536 1285152 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2052 1285636 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2536 1285152 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2052 1285636 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg234-server Stopping 
/mnt/lustre-mds1 (opts:) on oleg234-server Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 12:01:41 (1713369701) shut down Failover mds1 to oleg234-server mount facets: mds1 Failover mds2 to oleg234-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 12:02:02 (1713369722) targets are mounted 12:02:02 (1713369722) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.04 seconds: 454.59 ops/second PASS 80d (57s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80e: DNE: create remote dir, drop MDT1 rep, fail MDT0 ========================================================== 12:02:24 (1713369744) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2464 1285224 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2052 1285636 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 12:02:31 
(1713369751) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 12:02:45 (1713369765) targets are mounted 12:02:45 (1713369765) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.07 seconds: 274.59 ops/second PASS 80e (29s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80f: DNE: create remote dir, drop MDT1 rep, fail MDT1 ========================================================== 12:02:55 (1713369775) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2464 1285224 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2052 1285636 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 12:03:00 (1713369780) shut down Failover mds2 to oleg234-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 12:03:13 (1713369793) targets are mounted 12:03:13 (1713369793) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid 
mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.09 seconds: 212.50 ops/second PASS 80f (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80g: DNE: create remote dir, drop MDT1 rep, fail MDT0, then MDT1 ========================================================== 12:03:23 (1713369803) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2464 1285224 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2052 1285636 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2464 1285224 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2052 1285636 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 12:03:32 (1713369812) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 12:03:46 (1713369826) targets are mounted 12:03:46 (1713369826) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 12:03:53 (1713369833) shut down Failover mds2 to oleg234-server mount facets: 
mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 12:04:07 (1713369847) targets are mounted 12:04:07 (1713369847) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.07 seconds: 303.34 ops/second PASS 80g (51s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80h: DNE: create remote dir, drop MDT1 rep, fail 2 MDTs ========================================================== 12:04:16 (1713369856) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2464 1285224 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2052 1285636 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2464 1285224 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2052 1285636 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 12:04:28 (1713369868) shut down Failover mds1 to oleg234-server mount facets: mds1 Failover mds2 to oleg234-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov 
/dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 12:04:54 (1713369894) targets are mounted 12:04:54 (1713369894) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.06 seconds: 354.37 ops/second PASS 80h (47s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81a: DNE: unlink remote dir, drop MDT0 update rep, fail MDT1 ========================================================== 12:05:05 (1713369905) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2460 1285228 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2052 1285636 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 12:05:16 (1713369916) shut down Failover mds2 to oleg234-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started 
lustre-MDT0001 12:05:30 (1713369930) targets are mounted 12:05:30 (1713369930) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81a (33s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81b: DNE: unlink remote dir, drop MDT0 update reply, fail MDT0 ========================================================== 12:05:40 (1713369940) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2496 1285192 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2056 1285632 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 12:05:45 (1713369945) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 12:06:01 (1713369961) targets are mounted 12:06:01 (1713369961) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81b (29s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81c: DNE: unlink remote dir, drop MDT0 update reply, fail MDT0,MDT1 ========================================================== 12:06:11 (1713369971) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2460 
1285228 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2056 1285632 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2460 1285228 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2056 1285632 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 12:06:17 (1713369977) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 12:06:32 (1713369992) targets are mounted 12:06:32 (1713369992) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 12:06:39 (1713369999) shut down Failover mds2 to oleg234-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 12:06:53 (1713370013) targets are mounted 12:06:53 (1713370013) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) 
mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81c (49s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81d: DNE: unlink remote dir, drop MDT0 update reply, fail 2 MDTs ========================================================== 12:07:02 (1713370022) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2496 1285192 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2496 1285192 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 12:07:17 (1713370037) shut down Failover mds1 to oleg234-server mount facets: mds1 Failover mds2 to oleg234-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 12:07:37 
(1713370057) targets are mounted 12:07:37 (1713370057) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81d (44s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81e: DNE: unlink remote dir, drop MDT1 req reply, fail MDT0 ========================================================== 12:07:48 (1713370068) fail_loc=0x119 fail_loc=0 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2460 1285228 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 12:07:54 (1713370074) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 12:08:09 (1713370089) targets are mounted 12:08:09 (1713370089) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81e (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81f: DNE: unlink remote dir, drop MDT1 req reply, fail MDT1 ========================================================== 12:08:18 (1713370098) 
fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2460 1285228 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 12:08:23 (1713370103) shut down Failover mds2 to oleg234-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 12:08:38 (1713370118) targets are mounted 12:08:38 (1713370118) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81f (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81g: DNE: unlink remote dir, drop req reply, fail M0, then M1 ========================================================== 12:08:48 (1713370128) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2460 1285228 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2460 1285228 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 
1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 12:08:57 (1713370137) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 12:09:12 (1713370152) targets are mounted 12:09:12 (1713370152) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 12:09:20 (1713370160) shut down Failover mds2 to oleg234-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 12:09:35 (1713370175) targets are mounted 12:09:35 (1713370175) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81g (55s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81h: DNE: unlink remote dir, drop request reply, fail 2 MDTs ========================================================== 12:09:45 (1713370185) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2460 1285228 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 
1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2460 1285228 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 12:09:55 (1713370195) shut down Failover mds1 to oleg234-server mount facets: mds1 Failover mds2 to oleg234-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 12:10:21 (1713370221) targets are mounted 12:10:21 (1713370221) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81h (45s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 84a: stale open during export disconnect ========================================================== 12:10:32 
(1713370232) fail_loc=0x80000144 total: 1 open/close in 0.01 seconds: 120.36 ops/second oleg234-client: error: invalid path '/mnt/lustre': Input/output error pdsh@oleg234-client: oleg234-client: ssh exited with exit code 5 PASS 84a (6s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 85a: check the cancellation of unused locks during recovery(IBITS) ========================================================== 12:10:40 (1713370240) before recovery: unused locks count = 201 Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 12:10:42 (1713370242) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 12:10:56 (1713370256) targets are mounted 12:10:56 (1713370256) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec after recovery: unused locks count = 101 PASS 85a (23s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 85b: check the cancellation of unused locks during recovery(EXTENT) ========================================================== 12:11:05 (1713370265) before recovery: unused locks count = 100 Failing ost1 on oleg234-server Stopping /mnt/lustre-ost1 (opts:) on oleg234-server 12:11:10 (1713370270) shut down Failover ost1 to oleg234-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config 
ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-OST0000 12:11:24 (1713370284) targets are mounted 12:11:24 (1713370284) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec after recovery: unused locks count = 0 PASS 85b (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 86: umount server after clear nid_stats should not hit LBUG ========================================================== 12:11:33 (1713370293) Stopping clients: oleg234-client.virtnet /mnt/lustre (opts:) Stopping client oleg234-client.virtnet /mnt/lustre opts: mdt.lustre-MDT0000.exports.clear=0 mdt.lustre-MDT0001.exports.clear=0 Stopping /mnt/lustre-mds1 (opts:) on oleg234-server Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 Starting client oleg234-client.virtnet: -o user_xattr,flock oleg234-server@tcp:/lustre /mnt/lustre Started clients oleg234-client.virtnet: 192.168.202.134@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) PASS 86 (12s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 87a: write replay ================== 12:11:47 (1713370307) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2456 1285232 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2064 1285624 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1788 3605232 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1748 3605272 1% /mnt/lustre[OST:1] 
filesystem_summary: 7666232 3536 7210504 1% /mnt/lustre 8+0 records in 8+0 records out 8388608 bytes (8.4 MB) copied, 0.141089 s, 59.5 MB/s Failing ost1 on oleg234-server Stopping /mnt/lustre-ost1 (opts:) on oleg234-server 12:11:52 (1713370312) shut down Failover ost1 to oleg234-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-OST0000 12:12:07 (1713370327) targets are mounted 12:12:07 (1713370327) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec 8+0 records in 8+0 records out 8388608 bytes (8.4 MB) copied, 0.0385775 s, 217 MB/s PASS 87a (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 87b: write replay with changed data (checksum resend) ========================================================== 12:12:13 (1713370333) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2456 1285232 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2064 1285624 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 9980 3597040 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1748 3605272 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 11728 7202312 1% /mnt/lustre 8+0 records in 8+0 records out 8388608 bytes (8.4 MB) copied, 0.127086 s, 66.0 MB/s 8+0 records in 8+0 records out 8 bytes (8 B) copied, 0.0014705 s, 5.4 kB/s Failing ost1 on oleg234-server Stopping /mnt/lustre-ost1 (opts:) on oleg234-server 12:12:18 (1713370338) shut down Failover ost1 to oleg234-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey 
/mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-OST0000 12:12:32 (1713370352) targets are mounted 12:12:32 (1713370352) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec 0+1 records in 0+1 records out 72 bytes (72 B) copied, 0.00459012 s, 15.7 kB/s PASS 87b (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 88: MDS should not assign same objid to different files ========================================================== 12:12:39 (1713370359) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2460 1285228 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2064 1285624 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 9984 3597036 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1748 3605272 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 11732 7202308 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2460 1285228 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2064 1285624 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 9984 3597036 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1748 3605272 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 11732 7202308 1% /mnt/lustre before test: last_id = 13633, next_id = 13604 Creating to objid 13633 on ost lustre-OST0000... 
total: 31 open/close in 0.10 seconds: 311.32 ops/second total: 8 open/close in 0.02 seconds: 369.40 ops/second before recovery: last_id = 13665, next_id = 13642 Stopping /mnt/lustre-mds1 (opts:) on oleg234-server Stopping /mnt/lustre-ost1 (opts:) on oleg234-server Failover mds1 to oleg234-server Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 Failover ost1 to oleg234-server Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-OST0000 after recovery: last_id = 13673, next_id = 13642 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0449198 s, 11.7 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0417884 s, 12.5 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.036714 s, 14.3 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0456716 s, 11.5 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0410805 s, 12.8 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0346361 s, 15.1 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0313119 s, 16.7 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0274458 s, 19.1 MB/s -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13604 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13605 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13606 -rw-r--r-- 1 root root 0 Apr 17 12:12 
/mnt/lustre/d88.replay-single/f-13607 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13608 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13609 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13610 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13611 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13612 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13613 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13614 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13615 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13616 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13617 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13618 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13619 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13620 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13621 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13622 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13623 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13624 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13625 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13626 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13627 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13628 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13629 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13630 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13631 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13632 -rw-r--r-- 1 root root 0 Apr 17 12:12 
/mnt/lustre/d88.replay-single/f-13633 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13634 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13635 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13636 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13637 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13638 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13639 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13640 -rw-r--r-- 1 root root 0 Apr 17 12:12 /mnt/lustre/d88.replay-single/f-13641 -rw-r--r-- 1 root root 524288 Apr 17 12:13 /mnt/lustre/d88.replay-single/f-13645 -rw-r--r-- 1 root root 524288 Apr 17 12:13 /mnt/lustre/d88.replay-single/f-13646 -rw-r--r-- 1 root root 524288 Apr 17 12:13 /mnt/lustre/d88.replay-single/f-13647 -rw-r--r-- 1 root root 524288 Apr 17 12:13 /mnt/lustre/d88.replay-single/f-13648 -rw-r--r-- 1 root root 524288 Apr 17 12:13 /mnt/lustre/d88.replay-single/f-13649 -rw-r--r-- 1 root root 524288 Apr 17 12:13 /mnt/lustre/d88.replay-single/f-13650 -rw-r--r-- 1 root root 524288 Apr 17 12:13 /mnt/lustre/d88.replay-single/f-13651 -rw-r--r-- 1 root root 524288 Apr 17 12:13 /mnt/lustre/d88.replay-single/f-13652 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0439445 s, 11.9 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0397461 s, 13.2 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0422401 s, 12.4 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0411652 s, 12.7 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0389442 s, 13.5 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0403891 s, 13.0 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0387104 s, 13.5 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0459934 s, 11.4 MB/s 
PASS 88 (53s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 89: no disk space leak on late ost connection ========================================================== 12:13:34 (1713370414) Waiting for orphan cleanup... osp.lustre-OST0000-osc-MDT0000.old_sync_processed osp.lustre-OST0000-osc-MDT0001.old_sync_processed osp.lustre-OST0001-osc-MDT0000.old_sync_processed osp.lustre-OST0001-osc-MDT0001.old_sync_processed wait 40 secs maximumly for oleg234-server mds-ost sync done. Waiting for MDT destroys to complete 10+0 records in 10+0 records out 10485760 bytes (10 MB) copied, 0.0890681 s, 118 MB/s Stopping /mnt/lustre-ost1 (opts:) on oleg234-server Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 12:13:42 (1713370422) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 12:13:56 (1713370436) targets are mounted 12:13:56 (1713370436) facet_failover done Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-OST0000 Starting client: oleg234-client.virtnet: -o user_xattr,flock oleg234-server@tcp:/lustre /mnt/lustre osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 67 sec Waiting for orphan cleanup... 
osp.lustre-OST0000-osc-MDT0000.old_sync_processed osp.lustre-OST0000-osc-MDT0001.old_sync_processed osp.lustre-OST0001-osc-MDT0000.old_sync_processed osp.lustre-OST0001-osc-MDT0001.old_sync_processed wait 40 secs maximumly for oleg234-server mds-ost sync done. Waiting for MDT destroys to complete free_before: 7646308 free_after: 7646308 PASS 89 (111s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 90: lfs find identifies the missing striped file segments ========================================================== 12:15:27 (1713370527) Create the files Fail ost2 lustre-OST0001_UUID, display the list of affected files Stopping /mnt/lustre-ost2 (opts:) on oleg234-server General Query: lfs find /mnt/lustre/d90.replay-single /mnt/lustre/d90.replay-single /mnt/lustre/d90.replay-single/f0 /mnt/lustre/d90.replay-single/f1 /mnt/lustre/d90.replay-single/all Querying files on shutdown ost2: lfs find --obd lustre-OST0001_UUID /mnt/lustre/d90.replay-single/f1 /mnt/lustre/d90.replay-single/all Check getstripe: /home/green/git/lustre-release/lustre/utils/lfs getstripe -r --obd lustre-OST0001_UUID /mnt/lustre/d90.replay-single/f1 lmm_stripe_count: 1 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 1 obdidx objid objid group 1 13603 0x3523 0x2c0000401 * /mnt/lustre/d90.replay-single/all lmm_stripe_count: 2 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 0 obdidx objid objid group 1 13602 0x3522 0x2c0000401 * /mnt/lustre/d90.replay-single/all /mnt/lustre/d90.replay-single/f1 Failover ost2 to oleg234-server Starting ost2: -o localrecov /dev/mapper/ost2_flakey /mnt/lustre-ost2 seq.cli-lustre-OST0001-super.width=65536 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-OST0001 PASS 90 (20s) 
debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 93a: replay + reconnect ============ 12:15:48 (1713370548) 1+0 records in 1+0 records out 1024 bytes (1.0 kB) copied, 0.00200948 s, 510 kB/s fail_val=40 fail_loc=0x715 Failing ost1 on oleg234-server Stopping /mnt/lustre-ost1 (opts:) on oleg234-server 12:15:50 (1713370550) shut down Failover ost1 to oleg234-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-OST0000 12:16:03 (1713370563) targets are mounted 12:16:03 (1713370563) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec PASS 93a (59s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 93b: replay + reconnect on mds ===== 12:16:48 (1713370608) total: 20 open/close in 0.07 seconds: 279.36 ops/second fail_val=80 fail_loc=0x715 Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 12:16:50 (1713370610) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 12:17:04 (1713370624) targets are mounted 12:17:04 (1713370624) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid 
mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 93b (102s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 100a: DNE: create striped dir, drop update rep from MDT1, fail MDT1 ========================================================== 12:18:32 (1713370712) fail_loc=0x1701 Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 12:18:33 (1713370713) shut down Failover mds2 to oleg234-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 12:18:46 (1713370726) targets are mounted 12:18:46 (1713370726) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec lmv_stripe_count: 2 lmv_stripe_offset: 0 lmv_hash_type: crush mdtidx FID[seq:oid:ver] 0 [0x200033850:0x1:0x0] 1 [0x24000b3ca:0x1:0x0] total: 20 open/close in 0.05 seconds: 377.72 ops/second PASS 100a (21s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 100b: DNE: create striped dir, fail MDT0 ========================================================== 12:18:55 (1713370735) fail_loc=0x119 Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 12:18:56 (1713370736) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 12:19:10 
(1713370750) targets are mounted 12:19:10 (1713370750) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec lmv_stripe_count: 2 lmv_stripe_offset: 0 lmv_hash_type: crush mdtidx FID[seq:oid:ver] 0 [0x200033850:0x4:0x0] 1 [0x24000b3ca:0x4:0x0] total: 20 open/close in 0.08 seconds: 251.49 ops/second PASS 100b (22s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 100c: DNE: create striped dir, abort_recov_mdt mds2 ========================================================== 12:19:19 (1713370759) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2552 1285136 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2140 1285548 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3588840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3605256 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Stopping /mnt/lustre-mds2 (opts:) on oleg234-server Failover mds2 to oleg234-server Starting mds2: -o localrecov -o abort_recov_mdt /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 total: 20 open/close in 0.09 seconds: 215.63 ops/second Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 12:19:40 (1713370780) shut down Failover mds2 to oleg234-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 12:19:53 (1713370793) targets 
are mounted 12:19:53 (1713370793) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec lmv_stripe_count: 2 lmv_stripe_offset: 1 lmv_hash_type: crush mdtidx FID[seq:oid:ver] 1 [0x24000bb98:0x3:0x0] 0 [0x200033851:0x3:0x0] total: 20 open/close in 0.06 seconds: 351.87 ops/second PASS 100c (41s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 100d: DNE: cancel update logs upon recovery abort ========================================================== 12:20:01 (1713370801) striped dir -i0 -c2 -H all_char /mnt/lustre/d100d.replay-single total: 100 mkdir in 0.30 seconds: 329.28 ops/second lustre-MDT0001-osd [catalog]: [0x24000c368:0x3:0x0] [index]: 00001 [logid]: [0x24000cb38:0x2:0x0] lustre-MDT0000-osp-MDT0001 [catalog]: [0x200034021:0x3:0x0] [index]: 00001 [logid]: [0x200034022:0x2:0x0] [index]: 00002 [logid]: [0x200034022:0x3:0x0] Stopping /mnt/lustre-mds2 (opts:) on oleg234-server Failover mds2 to oleg234-server Starting mds2: -o localrecov -o abort_recovery /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 find: '/mnt/lustre/d100d.replay-single': No such file or directory find: '/mnt/lustre/d100d.replay-single': No such file or directory PASS 100d (23s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 100e: DNE: create striped dir on MDT0 and MDT1, fail MDT0, MDT1 ========================================================== 12:20:25 (1713370825) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2972 1284716 1% 
/mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2516 1285172 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3588840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3605256 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2972 1284716 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2516 1285172 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3588840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3605256 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre lmv_stripe_count: 2 lmv_stripe_offset: 0 lmv_hash_type: crush mdtidx FID[seq:oid:ver] 0 [0x200034020:0x36:0x0] 1 [0x24000bb99:0x36:0x0] Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 12:20:32 (1713370832) shut down Failover mds1 to oleg234-server mount facets: mds1 Failover mds2 to oleg234-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 12:20:57 (1713370857) targets are mounted 12:20:57 (1713370857) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL 
state after 0 sec lmv_stripe_count: 2 lmv_stripe_offset: 0 lmv_hash_type: crush mdtidx FID[seq:oid:ver] 0 [0x200034020:0x36:0x0] 1 [0x24000bb99:0x36:0x0] PASS 100e (42s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 101: Shouldn't reassign precreated objs to other files after recovery ========================================================== 12:21:10 (1713370870) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3016 1284672 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2552 1285136 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3588840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3605256 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Stopping /mnt/lustre-mds1 (opts:) on oleg234-server Failover mds1 to oleg234-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 PASS 101 (45s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 102a: check resend (request lost) with multiple modify RPCs in flight ========================================================== 12:21:57 (1713370917) creating 7 files ... fail_loc=0x159 launch 7 chmod in parallel (12:21:58) ... 
fail_loc=0 done (12:22:14) /mnt/lustre/d102a.replay-single/file-1 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-2 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-3 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-4 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-5 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-6 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-7 has perms 0600 OK PASS 102a (19s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 102b: check resend (reply lost) with multiple modify RPCs in flight ========================================================== 12:22:18 (1713370938) creating 7 files ... fail_loc=0x15a launch 7 chmod in parallel (12:22:19) ... fail_loc=0 done (12:22:35) /mnt/lustre/d102b.replay-single/file-1 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-2 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-3 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-4 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-5 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-6 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-7 has perms 0600 OK PASS 102b (19s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 102c: check replay w/o reconstruction with multiple mod RPCs in flight ========================================================== 12:22:39 (1713370959) creating 7 files ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3184 1284504 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2624 1285064 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3580636 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3597108 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7177744 1% /mnt/lustre fail_loc=0x15a launch 7 chmod in parallel (12:22:43) ... 
fail_loc=0 Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 12:22:46 (1713370966) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 12:23:02 (1713370982) targets are mounted 12:23:02 (1713370982) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec done (12:23:07) /mnt/lustre/d102c.replay-single/file-1 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-2 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-3 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-4 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-5 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-6 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-7 has perms 0600 OK PASS 102c (30s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 102d: check replay & reconstruction with multiple mod RPCs in flight ========================================================== 12:23:11 (1713370991) creating 7 files ... fail_loc=0x15a launch 7 chmod in parallel (12:23:12) ... 
fail_loc=0 Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 12:23:14 (1713370994) shut down Failover mds2 to oleg234-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 12:23:27 (1713371007) targets are mounted 12:23:27 (1713371007) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec done (12:23:34) /mnt/lustre/d102d.replay-single/file-1 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-2 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-3 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-4 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-5 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-6 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-7 has perms 0600 OK PASS 102d (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 103: Check otr_next_id overflow ==== 12:23:37 (1713371017) fail_loc=0x80000162 total: 30 open/close in 0.09 seconds: 319.98 ops/second Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 12:23:39 (1713371019) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 12:23:52 (1713371032) targets are mounted 12:23:52 (1713371032) facet_failover done oleg234-client.virtnet: executing 
wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 103 (22s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110a: DNE: create striped dir, fail MDT1 ========================================================== 12:24:00 (1713371040) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3132 1284556 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2628 1285060 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3580636 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3597108 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7177744 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 12:24:04 (1713371044) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 12:24:18 (1713371058) targets are mounted 12:24:18 (1713371058) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d110a.replay-single/striped_dir has type dir OK PASS 110a (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110b: DNE: create striped dir, fail MDT1 and client ========================================================== 12:24:27 (1713371067) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3136 1284552 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2632 1285056 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3580636 1% 
/mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3597108 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7177744 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 12:24:31 (1713371071) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 12:24:45 (1713371085) targets are mounted 12:24:45 (1713371085) facet_failover done pdsh@oleg234-client: oleg234-client: ssh exited with exit code 95 oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid pdsh@oleg234-client: oleg234-client: ssh exited with exit code 95 Starting client: oleg234-client.virtnet: -o user_xattr,flock oleg234-server@tcp:/lustre /mnt/lustre /mnt/lustre/d110b.replay-single/striped_dir has type dir OK PASS 110b (93s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110c: DNE: create striped dir, fail MDT2 ========================================================== 12:26:03 (1713371163) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3140 1284548 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2632 1285056 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3588840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3605256 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 12:26:07 (1713371167) shut down Failover mds2 to oleg234-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace 
rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 12:26:22 (1713371182) targets are mounted 12:26:22 (1713371182) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d110c.replay-single/striped_dir has type dir OK PASS 110c (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110d: DNE: create striped dir, fail MDT2 and client ========================================================== 12:26:31 (1713371191) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3180 1284508 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2632 1285056 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3588840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3605256 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 12:26:34 (1713371194) shut down Failover mds2 to oleg234-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 12:26:49 (1713371209) targets are mounted 12:26:49 (1713371209) facet_failover done pdsh@oleg234-client: oleg234-client: ssh exited with exit code 95 oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid pdsh@oleg234-client: oleg234-client: ssh exited with exit code 95 Starting client: oleg234-client.virtnet: -o user_xattr,flock oleg234-server@tcp:/lustre /mnt/lustre 
/mnt/lustre/d110d.replay-single/striped_dir has type dir OK PASS 110d (94s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110e: DNE: create striped dir, uncommit on MDT2, fail client/MDT1/MDT2 ========================================================== 12:28:06 (1713371286) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3220 1284468 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2664 1285024 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3588840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3605256 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre df:/mnt/lustre Not a Lustre filesystem Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 12:28:16 (1713371296) shut down Failover mds1 to oleg234-server mount facets: mds1 Failover mds2 to oleg234-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 12:28:41 (1713371321) targets are mounted 12:28:41 (1713371321) facet_failover done pdsh@oleg234-client: oleg234-client: ssh exited with exit code 95 oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid pdsh@oleg234-client: oleg234-client: ssh exited with exit code 95 Starting 
client: oleg234-client.virtnet: -o user_xattr,flock oleg234-server@tcp:/lustre /mnt/lustre /mnt/lustre/d110e.replay-single/striped_dir has type dir OK PASS 110e (109s) debug_raw_pointers=0 debug_raw_pointers=0 SKIP: replay-single test_110f skipping excluded test 110f debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110g: DNE: create striped dir, uncommit on MDT1, fail client/MDT1/MDT2 ========================================================== 12:29:57 (1713371397) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3256 1284432 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2700 1284988 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3588840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3605256 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre df:/mnt/lustre Not a Lustre filesystem Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 12:30:09 (1713371409) shut down Failover mds1 to oleg234-server mount facets: mds1 Failover mds2 to oleg234-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 12:30:31 (1713371431) targets are mounted 12:30:31 (1713371431) facet_failover done pdsh@oleg234-client: oleg234-client: ssh exited with exit code 95 oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) 
mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid pdsh@oleg234-client: oleg234-client: ssh exited with exit code 95 Starting client: oleg234-client.virtnet: -o user_xattr,flock oleg234-server@tcp:/lustre /mnt/lustre /mnt/lustre/d110g.replay-single/striped_dir has type dir OK PASS 110g (110s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111a: DNE: unlink striped dir, fail MDT1 ========================================================== 12:31:49 (1713371509) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3144 1284544 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2636 1285052 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3588840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3605256 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 12:31:53 (1713371513) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 12:32:07 (1713371527) targets are mounted 12:32:07 (1713371527) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/d111a.replay-single/striped_dir: No such file or directory PASS 111a (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111b: DNE: unlink striped dir, fail MDT2 ========================================================== 12:32:16 (1713371536) UUID 1K-blocks Used Available Use% 
Mounted on lustre-MDT0000_UUID 1414116 3148 1284540 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2644 1285044 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3588840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3605256 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 12:32:20 (1713371540) shut down Failover mds2 to oleg234-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 12:32:34 (1713371554) targets are mounted 12:32:34 (1713371554) facet_failover done pdsh@oleg234-client: oleg234-client: ssh exited with exit code 95 oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid pdsh@oleg234-client: oleg234-client: ssh exited with exit code 95 Starting client: oleg234-client.virtnet: -o user_xattr,flock oleg234-server@tcp:/lustre /mnt/lustre Can't lstat /mnt/lustre/d111b.replay-single/striped_dir: No such file or directory PASS 111b (93s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111c: DNE: unlink striped dir, uncommit on MDT1, fail client/MDT1/MDT2 ========================================================== 12:33:51 (1713371631) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3192 1284496 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2640 1285048 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3588840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3605256 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre df:/mnt/lustre Not a Lustre filesystem Failing mds1 on oleg234-server Stopping 
/mnt/lustre-mds1 (opts:) on oleg234-server Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 12:34:05 (1713371645) shut down Failover mds1 to oleg234-server mount facets: mds1 Failover mds2 to oleg234-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 12:34:25 (1713371665) targets are mounted 12:34:25 (1713371665) facet_failover done pdsh@oleg234-client: oleg234-client: ssh exited with exit code 95 oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid pdsh@oleg234-client: oleg234-client: ssh exited with exit code 95 Starting client: oleg234-client.virtnet: -o user_xattr,flock oleg234-server@tcp:/lustre /mnt/lustre Can't lstat /mnt/lustre/d111c.replay-single/striped_dir: No such file or directory PASS 111c (111s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111d: DNE: unlink striped dir, uncommit on MDT2, fail client/MDT1/MDT2 ========================================================== 12:35:43 (1713371743) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3220 1284468 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2640 1285048 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3588840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3605256 1% /mnt/lustre[OST:1] filesystem_summary: 
7666232 19944 7194096 1% /mnt/lustre df:/mnt/lustre Not a Lustre filesystem Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 12:35:51 (1713371751) shut down Failover mds1 to oleg234-server mount facets: mds1 Failover mds2 to oleg234-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 12:36:18 (1713371778) targets are mounted 12:36:18 (1713371778) facet_failover done pdsh@oleg234-client: oleg234-client: ssh exited with exit code 95 oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid pdsh@oleg234-client: oleg234-client: ssh exited with exit code 95 Starting client: oleg234-client.virtnet: -o user_xattr,flock oleg234-server@tcp:/lustre /mnt/lustre Can't lstat /mnt/lustre/d111d.replay-single/striped_dir: No such file or directory PASS 111d (110s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111e: DNE: unlink striped dir, uncommit on MDT2, fail MDT1/MDT2 ========================================================== 12:37:35 (1713371855) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3148 1284540 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2676 1285012 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3588840 1% 
/mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3605256 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3188 1284500 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2668 1285020 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3588840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3605256 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 12:37:44 (1713371864) shut down Failover mds1 to oleg234-server mount facets: mds1 Failover mds2 to oleg234-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 12:38:09 (1713371889) targets are mounted 12:38:09 (1713371889) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/d111e.replay-single/striped_dir: No such file or directory PASS 111e (41s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111f: DNE: unlink striped dir, uncommit 
on MDT1, fail MDT1/MDT2 ========================================================== 12:38:18 (1713371898) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3156 1284532 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2648 1285040 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3588840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3605256 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3156 1284532 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2636 1285052 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3588840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3605256 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 12:38:26 (1713371906) shut down Failover mds1 to oleg234-server mount facets: mds1 Failover mds2 to oleg234-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 12:38:52 (1713371932) targets are mounted 12:38:52 (1713371932) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state 
after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/d111f.replay-single/striped_dir: No such file or directory PASS 111f (43s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111g: DNE: unlink striped dir, fail MDT1/MDT2 ========================================================== 12:39:04 (1713371944) UUID Inodes IUsed IFree IUse% Mounted on lustre-MDT0000_UUID 1024000 550 1023450 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1024000 317 1023683 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 262144 560 261584 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 262144 483 261661 1% /mnt/lustre[OST:1] filesystem_summary: 524112 867 523245 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3164 1284524 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2644 1285044 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3588840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3605256 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3164 1284524 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2644 1285044 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3588840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3605256 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 12:39:12 (1713371952) shut down Failover mds1 to oleg234-server mount facets: mds1 Failover mds2 to oleg234-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace 
neterror ha config ioctl super lfsck all oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 12:39:38 (1713371978) targets are mounted 12:39:38 (1713371978) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/d111g.replay-single/striped_dir: No such file or directory PASS 111g (43s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112a: DNE: cross MDT rename, fail MDT1 ========================================================== 12:39:49 (1713371989) SKIP: replay-single test_112a needs >= 4 MDTs SKIP 112a (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112b: DNE: cross MDT rename, fail MDT2 ========================================================== 12:39:52 (1713371992) SKIP: replay-single test_112b needs >= 4 MDTs SKIP 112b (2s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112c: DNE: cross MDT rename, fail MDT3 ========================================================== 12:39:56 (1713371996) SKIP: replay-single test_112c needs >= 4 MDTs SKIP 112c (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112d: DNE: cross MDT rename, fail MDT4 ========================================================== 12:39:59 (1713371999) SKIP: replay-single test_112d needs >= 4 MDTs SKIP 112d (1s) debug_raw_pointers=0 
debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112e: DNE: cross MDT rename, fail MDT1 and MDT2 ========================================================== 12:40:03 (1713372003) SKIP: replay-single test_112e needs >= 4 MDTs SKIP 112e (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112f: DNE: cross MDT rename, fail MDT1 and MDT3 ========================================================== 12:40:06 (1713372006) SKIP: replay-single test_112f needs >= 4 MDTs SKIP 112f (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112g: DNE: cross MDT rename, fail MDT1 and MDT4 ========================================================== 12:40:09 (1713372009) SKIP: replay-single test_112g needs >= 4 MDTs SKIP 112g (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112h: DNE: cross MDT rename, fail MDT2 and MDT3 ========================================================== 12:40:12 (1713372012) SKIP: replay-single test_112h needs >= 4 MDTs SKIP 112h (2s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112i: DNE: cross MDT rename, fail MDT2 and MDT4 ========================================================== 12:40:16 (1713372016) SKIP: replay-single test_112i needs >= 4 MDTs SKIP 112i (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112j: DNE: cross MDT rename, fail MDT3 and MDT4 ========================================================== 12:40:19 (1713372019) SKIP: replay-single test_112j needs >= 4 MDTs SKIP 112j (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112k: DNE: cross MDT rename, fail MDT1,MDT2,MDT3 ========================================================== 12:40:22 (1713372022) SKIP: 
replay-single test_112k needs >= 4 MDTs SKIP 112k (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112l: DNE: cross MDT rename, fail MDT1,MDT2,MDT4 ========================================================== 12:40:26 (1713372026) SKIP: replay-single test_112l needs >= 4 MDTs SKIP 112l (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112m: DNE: cross MDT rename, fail MDT1,MDT3,MDT4 ========================================================== 12:40:29 (1713372029) SKIP: replay-single test_112m needs >= 4 MDTs SKIP 112m (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112n: DNE: cross MDT rename, fail MDT2,MDT3,MDT4 ========================================================== 12:40:32 (1713372032) SKIP: replay-single test_112n needs >= 4 MDTs SKIP 112n (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 115: failover for create/unlink striped directory ========================================================== 12:40:35 (1713372035) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3160 1284528 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2632 1285056 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3588840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3605256 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre striped dir -i1 -c2 -H crush2 /mnt/lustre/d115.replay-single/test_0 striped dir -i1 -c2 -H crush2 /mnt/lustre/d115.replay-single/test_1 striped dir -i1 -c2 -H crush /mnt/lustre/d115.replay-single/test_2 striped dir -i1 -c2 -H all_char /mnt/lustre/d115.replay-single/test_3 striped dir -i1 -c2 -H crush2 /mnt/lustre/d115.replay-single/test_4 Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 12:40:40 (1713372040) shut down 
Failover mds2 to oleg234-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 12:40:55 (1713372055) targets are mounted 12:40:55 (1713372055) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3212 1284476 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2700 1284988 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3588840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3605256 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre striped dir -i0 -c2 -H fnv_1a_64 /mnt/lustre/d115.replay-single/test_0 striped dir -i0 -c2 -H all_char /mnt/lustre/d115.replay-single/test_1 striped dir -i0 -c2 -H crush /mnt/lustre/d115.replay-single/test_2 striped dir -i0 -c2 -H all_char /mnt/lustre/d115.replay-single/test_3 striped dir -i0 -c2 -H all_char /mnt/lustre/d115.replay-single/test_4 Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 12:41:06 (1713372066) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 12:41:21 (1713372081) targets are mounted 12:41:21 (1713372081) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid 
mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 115 (54s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 116a: large update log master MDT recovery ========================================================== 12:41:31 (1713372091) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3252 1284436 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2764 1284924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3588840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3605256 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre fail_loc=0x80001702 Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 12:41:37 (1713372097) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 12:41:50 (1713372110) targets are mounted 12:41:50 (1713372110) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d116a.replay-single/striped_dir has type dir OK PASS 116a (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 116b: large update log slave MDT recovery ========================================================== 12:41:59 (1713372119) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3304 1284384 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2936 1284752 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3588840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3605256 1% 
/mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre fail_loc=0x80001702 Failing mds2 on oleg234-server Stopping /mnt/lustre-mds2 (opts:) on oleg234-server 12:42:02 (1713372122) shut down Failover mds2 to oleg234-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0001 12:42:16 (1713372136) targets are mounted 12:42:16 (1713372136) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d116b.replay-single/striped_dir has type dir OK PASS 116b (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 117: DNE: cross MDT unlink, fail MDT1 and MDT2 ========================================================== 12:42:24 (1713372144) SKIP: replay-single test_117 needs >= 4 MDTs SKIP 117 (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 118: invalidate osp update will not cause update log corruption ========================================================== 12:42:28 (1713372148) fail_loc=0x1705 lfs setdirstripe: dirstripe error on '/mnt/lustre/d118.replay-single/striped_dir': Input/output error lfs setdirstripe: cannot create dir '/mnt/lustre/d118.replay-single/striped_dir': Input/output error lfs setdirstripe: dirstripe error on '/mnt/lustre/d118.replay-single/striped_dir1': Input/output error lfs setdirstripe: cannot create dir '/mnt/lustre/d118.replay-single/striped_dir1': Input/output error fail_loc=0x0 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3400 1284288 1% /mnt/lustre[MDT:0] 
lustre-MDT0001_UUID 1414116 2816 1284872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3588840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3605256 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 12:42:33 (1713372153) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 12:42:48 (1713372168) targets are mounted 12:42:48 (1713372168) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 118 (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 119: timeout of normal replay does not cause DNE replay fails ========================================================== 12:42:58 (1713372178) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3468 1284220 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2876 1284812 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3588840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3605256 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Stopping /mnt/lustre-mds1 (opts:) on oleg234-server Failover mds1 to oleg234-server fail_loc=0x80000714 fail_val=65 Starting mds1: -o localrecov -o recovery_time_hard=60 /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 
1 Started lustre-MDT0000 oleg234-client.virtnet: executing wait_import_state_mount FULL mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 68 sec mdt.lustre-MDT0000.recovery_time_hard=180 PASS 119 (87s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 120: DNE fail abort should stop both normal and DNE replay ========================================================== 12:44:27 (1713372267) Replay barrier on lustre-MDT0000 Stopping /mnt/lustre-mds1 (opts:) on oleg234-server Failover mds1 to oleg234-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 find: '/mnt/lustre/d120.replay-single': No such file or directory PASS 120 (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 121: lock replay timed out and race ========================================================== 12:44:57 (1713372297) multiop /mnt/lustre/f121.replay-single vs_s TMPPIPE=/tmp/multiop_open_wait_pipe.7528 Stopping /mnt/lustre-mds1 (opts:) on oleg234-server Failover mds1 to oleg234-server fail_loc=0x721 fail_val=0 at_max=0 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 pdsh@oleg234-client: oleg234-client: ssh exited with exit code 5 fail_loc=0x0 at_max=600 PASS 121 (200s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 130a: DoM file create (setstripe) replay 
========================================================== 12:48:19 (1713372499) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3860 1283828 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3180 1284508 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3588840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3605256 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 12:48:24 (1713372504) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 12:48:39 (1713372519) targets are mounted 12:48:39 (1713372519) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 130a (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 130b: DoM file create (inherited) replay ========================================================== 12:48:49 (1713372529) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3864 1283824 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3180 1284508 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3588840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3605256 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 12:48:54 (1713372534) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 
oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 12:49:10 (1713372550) targets are mounted 12:49:10 (1713372550) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 130b (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 131a: DoM file write lock replay === 12:49:19 (1713372559) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3864 1283824 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3180 1284508 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3588840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3605256 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre 1+0 records in 1+0 records out 8 bytes (8 B) copied, 0.00272332 s, 2.9 kB/s Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 12:49:24 (1713372564) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 12:49:40 (1713372580) targets are mounted 12:49:40 (1713372580) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 131a (28s) debug_raw_pointers=0 debug_raw_pointers=0 SKIP: replay-single test_131b skipping excluded test 131b debug_raw_pointers=Y debug_raw_pointers=Y == replay-single 
test 132a: PFL new component instantiate replay ========================================================== 12:49:50 (1713372590) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3876 1283812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3180 1284508 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18180 3588840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3605256 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre 1+0 records in 1+0 records out 1048576 bytes (1.0 MB) copied, 0.0334741 s, 31.3 MB/s /mnt/lustre/f132a.replay-single lcm_layout_gen: 3 lcm_mirror_count: 1 lcm_entry_count: 2 lcme_id: 1 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 0 lcme_extent.e_end: 1048576 lmm_stripe_count: 1 lmm_stripe_size: 1048576 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 1 lmm_objects: - 0: { l_ost_idx: 1, l_fid: [0x2c0000401:0x3c42:0x0] } lcme_id: 2 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 1048576 lcme_extent.e_end: EOF lmm_stripe_count: 2 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 0 lmm_objects: - 0: { l_ost_idx: 0, l_fid: [0x280000401:0x3c8a:0x0] } - 1: { l_ost_idx: 1, l_fid: [0x2c0000401:0x3c43:0x0] } Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 12:49:55 (1713372595) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 12:50:10 (1713372610) targets are mounted 12:50:10 (1713372610) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 
/mnt/lustre/f132a.replay-single lcm_layout_gen: 1 lcm_mirror_count: 1 lcm_entry_count: 2 lcme_id: 1 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 0 lcme_extent.e_end: 1048576 lmm_stripe_count: 1 lmm_stripe_size: 1048576 lmm_pattern: raid0 lmm_layout_gen: 65535 lmm_stripe_offset: 1 lmm_objects: - 0: { l_ost_idx: 1, l_fid: [0x2c0000401:0x3c42:0x0] } lcme_id: 2 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 1048576 lcme_extent.e_end: EOF lmm_stripe_count: 2 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 65535 lmm_stripe_offset: 0 lmm_objects: - 0: { l_ost_idx: 0, l_fid: [0x280000401:0x3c8a:0x0] } - 1: { l_ost_idx: 1, l_fid: [0x2c0000401:0x3c43:0x0] } PASS 132a (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 133: check resend of ongoing requests for lwp during failover ========================================================== 12:50:20 (1713372620) seq.srv-lustre-MDT0001.space=clear Starting client: oleg234-client.virtnet: -o user_xattr,flock oleg234-server@tcp:/lustre /mnt/lustre fail_val=700 fail_loc=0x80000123 Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 12:50:25 (1713372625) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 12:50:39 (1713372639) targets are mounted 12:50:39 (1713372639) facet_failover done PASS 133 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 134: replay creation of a file created in a pool ========================================================== 12:50:47 (1713372647) Creating new pool oleg234-server: Pool lustre.pool_134 created Adding targets to 
pool oleg234-server: OST lustre-OST0001_UUID added to pool lustre.pool_134 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3960 1283728 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3224 1284464 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 19204 3587816 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3605256 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20968 7193072 1% /mnt/lustre Failing mds1 on oleg234-server Stopping /mnt/lustre-mds1 (opts:) on oleg234-server 12:50:58 (1713372658) shut down Failover mds1 to oleg234-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-MDT0000 12:51:13 (1713372673) targets are mounted 12:51:13 (1713372673) facet_failover done oleg234-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Destroy the created pools: pool_134 lustre.pool_134 oleg234-server: OST lustre-OST0001_UUID removed from pool lustre.pool_134 oleg234-server: Pool lustre.pool_134 destroyed PASS 134 (39s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 135: Server failure in lock replay phase ========================================================== 12:51:28 (1713372688) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3940 1283748 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3224 1284464 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 19204 3587816 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1764 3605256 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20968 7193072 1% /mnt/lustre ldlm.cancel_unused_locks_before_replay=0 Stopping /mnt/lustre-ost1 (opts:) 
on oleg234-server Failover ost1 to oleg234-server oleg234-server: oleg234-server.virtnet: executing load_module ../libcfs/libcfs/libcfs fail_loc=0x32d fail_val=20 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-OST0000 oleg234-client.virtnet: executing wait_import_state_mount REPLAY_LOCKS osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in REPLAY_LOCKS state after 0 sec Stopping /mnt/lustre-ost1 (opts:) on oleg234-server Failover ost1 to oleg234-server oleg234-server: oleg234-server.virtnet: executing load_module ../libcfs/libcfs/libcfs fail_loc=0 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all End of sync pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-OST0000 Stopping /mnt/lustre-ost1 (opts:-f) on oleg234-server Stopping /mnt/lustre-ost2 (opts:-f) on oleg234-server Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started lustre-OST0000 Starting ost2: -o localrecov /dev/mapper/ost2_flakey /mnt/lustre-ost2 seq.cli-lustre-OST0001-super.width=65536 oleg234-server: oleg234-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg234-client: oleg234-server: ssh exited with exit code 1 Started 
lustre-OST0001 pdsh@oleg234-client: oleg234-client: ssh exited with exit code 5 ldlm.cancel_unused_locks_before_replay=1 PASS 135 (77s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 136: MDS to disconnect all OSPs first, then cleanup ldlm ========================================================== 12:52:47 (1713372767) SKIP: replay-single test_136 needs > 2 MDTs SKIP 136 (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 200: Dropping one OBD_PING should not cause disconnect ========================================================== 12:52:51 (1713372771) SKIP: replay-single test_200 Need remote client SKIP 200 (1s) debug_raw_pointers=0 debug_raw_pointers=0 == replay-single test complete, duration 8574 sec ======== 12:52:53 (1713372773) === replay-single: start cleanup 12:52:53 (1713372773) === === replay-single: finish cleanup 12:52:57 (1713372777) ===