-----============= acceptance-small: replay-single ============----- Tue Apr 16 10:49:50 EDT 2024 excepting tests: 110f 131b 59 36 === replay-single: start setup 10:49:53 (1713278993) === oleg127-client.virtnet: executing check_config_client /mnt/lustre oleg127-client.virtnet: Checking config lustre mounted on /mnt/lustre Checking servers environments Checking clients oleg127-client.virtnet environments Using TIMEOUT=20 osc.lustre-OST0000-osc-ffff8800b6019800.idle_timeout=debug osc.lustre-OST0001-osc-ffff8800b6019800.idle_timeout=debug disable quota as required oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all osd-ldiskfs.track_declares_assert=1 === replay-single: finish setup 10:50:00 (1713279000) === debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 0a: empty replay =================== 10:50:01 (1713279001) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1772 1285916 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1612 1286076 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3048 7210992 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 10:50:04 (1713279004) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 10:50:18 (1713279018) targets are mounted 10:50:18 (1713279018) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 0a (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 0b: ensure object created after recover exists. (3284) ========================================================== 10:50:27 (1713279027) Failing ost1 on oleg127-server Stopping /mnt/lustre-ost1 (opts:) on oleg127-server 10:50:28 (1713279028) shut down Failover ost1 to oleg127-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-OST0000 10:50:41 (1713279041) targets are mounted 10:50:41 (1713279041) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec total: 20 open/close in 0.08 seconds: 263.30 ops/second - unlinked 0 (time 1713279045 ; total 0 ; last 0) total: 20 unlinks in 0 seconds: inf unlinks/second PASS 0b (19s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 0c: check replay-barrier =========== 10:50:48 (1713279048) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1768 1285920 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1612 1286076 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 10:50:51 (1713279051) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 10:51:05 (1713279065) targets are mounted 10:51:05 (1713279065) facet_failover done Starting client: oleg127-client.virtnet: -o user_xattr,flock oleg127-server@tcp:/lustre /mnt/lustre rm: cannot remove '/mnt/lustre/f0c.replay-single': No such file or directory PASS 0c (89s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 0d: expired recovery with no clients ========================================================== 10:52:18 (1713279138) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1768 1285920 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1612 1286076 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 10:52:21 (1713279141) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 10:52:35 (1713279155) targets are mounted 10:52:35 (1713279155) facet_failover done Starting client: oleg127-client.virtnet: -o user_xattr,flock oleg127-server@tcp:/lustre /mnt/lustre PASS 0d (89s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 1: simple create =================== 10:53:49 (1713279229) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1768 1285920 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1612 1286076 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 10:53:52 (1713279232) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 10:54:06 (1713279246) targets are mounted 10:54:06 (1713279246) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f1.replay-single has type file OK PASS 1 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 2a: touch ========================== 10:54:15 (1713279255) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1768 1285920 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1612 1286076 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 10:54:18 (1713279258) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 10:54:32 (1713279272) targets are mounted 10:54:32 (1713279272) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2a.replay-single has type file OK PASS 2a (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 2b: touch ========================== 10:54:41 (1713279281) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1768 1285920 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1612 1286076 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 10:54:44 (1713279284) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 10:54:58 (1713279298) targets are mounted 10:54:58 (1713279298) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2b.replay-single has type file OK PASS 2b (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 2c: setstripe replay =============== 10:55:06 (1713279306) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1768 1285920 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1612 1286076 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 10:55:10 (1713279310) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 10:55:24 (1713279324) targets are mounted 10:55:24 (1713279324) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2c.replay-single has type file OK PASS 2c (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 2d: setdirstripe replay ============ 10:55:32 (1713279332) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1768 1285920 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1612 1286076 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 10:55:36 (1713279336) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 10:55:50 (1713279350) targets are mounted 10:55:50 (1713279350) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d2d.replay-single has type dir OK PASS 2d (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 2e: O_CREAT|O_EXCL create replay === 10:55:58 (1713279358) fail_loc=0x8000013b UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1852 1285836 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 10:56:03 (1713279363) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 10:56:17 (1713279377) targets are mounted 10:56:17 (1713279377) facet_failover done Succeed in opening file "/mnt/lustre/f2e.replay-single"(flags=O_CREAT) oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2e.replay-single has type file OK PASS 2e (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 3a: replay failed open(O_DIRECTORY) ========================================================== 10:56:25 (1713279385) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1852 1285836 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Error in opening file "/mnt/lustre/f3a.replay-single"(flags=O_DIRECTORY) 20: Not a directory Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 10:56:28 (1713279388) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 10:56:42 (1713279402) targets are mounted 10:56:42 (1713279402) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f3a.replay-single has type file OK PASS 3a (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 3b: replay failed open -ENOMEM ===== 10:56:51 (1713279411) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1852 1285836 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre fail_loc=0x80000114 touch: cannot touch '/mnt/lustre/f3b.replay-single': Cannot allocate memory fail_loc=0 Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 10:56:54 (1713279414) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 10:57:08 (1713279428) targets are mounted 10:57:08 (1713279428) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f3b.replay-single: No such file or directory PASS 3b (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 3c: replay failed open -ENOMEM ===== 10:57:17 (1713279437) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1852 1285836 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre fail_loc=0x80000128 touch: cannot touch '/mnt/lustre/f3c.replay-single': Cannot allocate memory fail_loc=0 Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 10:57:21 (1713279441) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 10:57:34 (1713279454) targets are mounted 10:57:34 (1713279454) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f3c.replay-single: No such file or directory PASS 3c (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 4a: |x| 10 open(O_CREAT)s ========== 10:57:43 (1713279463) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1852 1285836 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 10:57:46 (1713279466) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 10:58:00 (1713279480) targets are mounted 10:58:00 (1713279480) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 4a (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 4b: |x| rm 10 files ================ 10:58:09 (1713279489) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1852 1285836 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605336 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210796 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 10:58:13 (1713279493) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 10:58:26 (1713279506) targets are mounted 10:58:26 (1713279506) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f4b.replay-single-*: No such file or directory PASS 4b (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 5: |x| 220 open(O_CREAT) =========== 10:58:35 (1713279515) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1852 1285836 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 10:58:39 (1713279519) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 10:58:53 (1713279533) targets are mounted 10:58:53 (1713279533) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 5 (30s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 6a: mkdir + contained create ======= 10:59:07 (1713279547) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1884 1285804 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 10:59:10 (1713279550) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 10:59:24 (1713279564) targets are mounted 10:59:24 (1713279564) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d6a.replay-single has type dir OK /mnt/lustre/d6a.replay-single/f6a.replay-single has type file OK PASS 6a (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 6b: |X| rmdir ====================== 10:59:34 (1713279574) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1880 1285808 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 10:59:38 (1713279578) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 10:59:52 (1713279592) targets are mounted 10:59:52 (1713279592) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/d6b.replay-single: No such file or directory PASS 6b (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 7: mkdir |X| contained create ====== 11:00:00 (1713279600) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:00:04 (1713279604) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:00:17 (1713279617) targets are mounted 11:00:17 (1713279617) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d7.replay-single has type dir OK /mnt/lustre/d7.replay-single/f7.replay-single has type file OK PASS 7 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 8: creat open |X| close ============ 11:00:26 (1713279626) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre multiop /mnt/lustre/f8.replay-single vmo_c TMPPIPE=/tmp/multiop_open_wait_pipe.7517 Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:00:29 (1713279629) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:00:43 (1713279643) targets are mounted 11:00:43 (1713279643) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f8.replay-single /mnt/lustre/f8.replay-single has type file OK PASS 8 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 9: |X| create (same inum/gen) ====== 11:00:51 (1713279651) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:00:55 (1713279655) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:01:08 (1713279668) targets are mounted 11:01:08 (1713279668) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec old_inum == 144115305935798546, new_inum == 144115305935798546 old_inum and new_inum match PASS 9 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 10: create |X| rename unlink ======= 11:01:17 (1713279677) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:01:20 (1713279680) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:01:34 (1713279694) targets are mounted 11:01:34 (1713279694) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f10.replay-single: No such file or directory PASS 10 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 11: create open write rename |X| create-old-name read ========================================================== 11:01:42 (1713279702) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre new old Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:01:46 (1713279706) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:01:59 (1713279719) targets are mounted 11:01:59 (1713279719) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec new old PASS 11 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 12: open, unlink |X| close ========= 11:02:08 (1713279728) multiop /mnt/lustre/f12.replay-single vo_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:02:11 (1713279731) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:02:24 (1713279744) targets are mounted 11:02:24 (1713279744) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 12 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 13: open chmod 0 |x| write close === 11:02:33 (1713279753) multiop /mnt/lustre/f13.replay-single vO_wc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 /mnt/lustre/f13.replay-single has perms 00 OK UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:02:37 (1713279757) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:02:51 (1713279771) targets are mounted 11:02:51 (1713279771) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f13.replay-single has perms 00 OK /mnt/lustre/f13.replay-single has size 1 OK PASS 13 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 14: open(O_CREAT), unlink |X| close ========================================================== 11:03:00 (1713279780) multiop /mnt/lustre/f14.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:03:03 (1713279783) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:03:17 (1713279797) targets are mounted 11:03:17 (1713279797) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 14 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 15: open(O_CREAT), unlink |X| touch new, close ========================================================== 11:03:26 (1713279806) multiop /mnt/lustre/f15.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:03:29 (1713279809) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:03:43 (1713279823) targets are mounted 11:03:43 (1713279823) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 15 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 16: |X| open(O_CREAT), unlink, touch new, unlink new ========================================================== 11:03:52 (1713279832) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:03:56 (1713279836) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:04:10 (1713279850) targets are mounted 11:04:10 (1713279850) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 16 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 17: |X| open(O_CREAT), |replay| close ========================================================== 11:04:19 (1713279859) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre multiop /mnt/lustre/f17.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7517 Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:04:23 (1713279863) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:04:37 (1713279877) targets are mounted 11:04:37 (1713279877) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f17.replay-single has type file OK PASS 17 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 18: open(O_CREAT), unlink, touch new, close, touch, unlink ========================================================== 11:04:45 (1713279885) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre multiop /mnt/lustre/f18.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 pid: 24909 will close Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:04:49 (1713279889) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:05:03 (1713279903) targets are mounted 11:05:03 (1713279903) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 18 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 19: mcreate, open, write, rename === 11:05:12 (1713279912) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre old Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:05:16 (1713279916) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:05:30 (1713279930) targets are mounted 11:05:30 (1713279930) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec old PASS 19 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 20a: |X| open(O_CREAT), unlink, replay, close (test mds_cleanup_orphans) ========================================================== 11:05:39 (1713279939) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3116 7210924 1% /mnt/lustre multiop /mnt/lustre/f20a.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:05:43 (1713279943) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:05:58 (1713279958) targets are mounted 11:05:58 (1713279958) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 20a (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 20b: write, unlink, eviction, replay (test mds_cleanup_orphans) ========================================================== 11:06:07 (1713279967) /mnt/lustre/f20b.replay-single lmm_stripe_count: 1 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 0 obdidx objid objid group 0 1090 0x442 0x280000401 10000+0 records in 10000+0 records out 40960000 bytes (41 MB) copied, 1.3397 s, 30.6 MB/s Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:06:12 (1713279972) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:06:25 (1713279985) targets are mounted 11:06:25 (1713279985) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec affected facets: mds1 oleg127-server: oleg127-server.virtnet: executing _wait_recovery_complete *.lustre-MDT0000.recovery_status 1475 oleg127-server: *.lustre-MDT0000.recovery_status status: COMPLETE Waiting for MDT destroys to complete before 3116, after 3116 PASS 20b (29s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 20c: check that client eviction does not affect file content ========================================================== 11:06:38 (1713279998) multiop /mnt/lustre/f20c.replay-single vOw_c TMPPIPE=/tmp/multiop_open_wait_pipe.7517 -rw-r--r-- 1 root root 1 Apr 16 11:06 /mnt/lustre/f20c.replay-single pdsh@oleg127-client: oleg127-client: ssh exited with exit code 5 PASS 20c (5s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 21: |X| open(O_CREAT), unlink touch new, replay, close (test mds_cleanup_orphans) ========================================================== 11:06:45 (1713280005) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605412 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3116 7210884 1% /mnt/lustre multiop /mnt/lustre/f21.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:06:48 (1713280008) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:07:03 (1713280023) targets are mounted 11:07:03 (1713280023) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 21 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 22: open(O_CREAT), |X| unlink, replay, close (test mds_cleanup_orphans) ========================================================== 11:07:12 (1713280032) multiop /mnt/lustre/f22.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605412 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210880 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:07:17 (1713280037) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:07:31 (1713280051) targets are mounted 11:07:31 (1713280051) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 22 (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 23: open(O_CREAT), |X| unlink touch new, replay, close (test mds_cleanup_orphans) ========================================================== 11:07:41 (1713280061) multiop /mnt/lustre/f23.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605412 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210880 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:07:45 (1713280065) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:07:59 (1713280079) targets are mounted 11:07:59 (1713280079) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 23 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 24: open(O_CREAT), replay, unlink, close (test mds_cleanup_orphans) ========================================================== 11:08:07 (1713280087) multiop /mnt/lustre/f24.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605412 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210880 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:08:11 (1713280091) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:08:25 (1713280105) targets are mounted 11:08:25 (1713280105) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 24 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 25: open(O_CREAT), unlink, replay, close (test mds_cleanup_orphans) ========================================================== 11:08:33 (1713280113) multiop /mnt/lustre/f25.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605412 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210880 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:08:37 (1713280117) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:08:51 (1713280131) targets are mounted 11:08:51 (1713280131) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 25 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 26: |X| open(O_CREAT), unlink two, close one, replay, close one (test mds_cleanup_orphans) ========================================================== 11:08:59 (1713280139) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605412 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210880 1% /mnt/lustre multiop /mnt/lustre/f26.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 multiop /mnt/lustre/f26.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:09:02 (1713280142) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:09:16 (1713280156) targets are mounted 11:09:16 (1713280156) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 26 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 27: |X| open(O_CREAT), unlink two, replay, close two (test mds_cleanup_orphans) ========================================================== 11:09:25 (1713280165) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605412 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210880 1% /mnt/lustre multiop /mnt/lustre/f27.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 multiop /mnt/lustre/f27.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:09:28 (1713280168) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:09:42 (1713280182) targets are mounted 11:09:42 (1713280182) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 27 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 28: open(O_CREAT), |X| unlink two, close one, replay, close one (test mds_cleanup_orphans) ========================================================== 11:09:51 (1713280191) multiop /mnt/lustre/f28.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 multiop /mnt/lustre/f28.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605412 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210880 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:09:55 (1713280195) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:10:10 (1713280210) targets are mounted 11:10:10 (1713280210) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 28 (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 29: open(O_CREAT), |X| unlink two, replay, close two (test mds_cleanup_orphans) ========================================================== 11:10:20 (1713280220) multiop /mnt/lustre/f29.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 multiop /mnt/lustre/f29.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605412 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210880 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:10:24 (1713280224) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:10:39 (1713280239) targets are mounted 11:10:39 (1713280239) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 29 (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 30: open(O_CREAT) two, unlink two, replay, close two (test mds_cleanup_orphans) ========================================================== 11:10:48 (1713280248) multiop /mnt/lustre/f30.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 multiop /mnt/lustre/f30.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605412 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210880 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:10:52 (1713280252) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:11:07 (1713280267) targets are mounted 11:11:07 (1713280267) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 30 (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 31: open(O_CREAT) two, unlink one, |X| unlink one, close two (test mds_cleanup_orphans) ========================================================== 11:11:15 (1713280275) multiop /mnt/lustre/f31.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 multiop /mnt/lustre/f31.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605412 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210880 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:11:19 (1713280279) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:11:34 (1713280294) targets are mounted 11:11:34 (1713280294) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 31 (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 32: close() notices client eviction; close() after client eviction ========================================================== 11:11:44 (1713280304) multiop /mnt/lustre/f32.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7517 multiop /mnt/lustre/f32.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7517 pdsh@oleg127-client: oleg127-client: ssh exited with exit code 5 PASS 32 (5s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 33a: fid seq shouldn't be reused after abort recovery ========================================================== 11:11:51 (1713280311) total: 10 open/close in 0.04 seconds: 279.85 ops/second Replay barrier on lustre-MDT0000 Stopping /mnt/lustre-mds1 (opts:) on oleg127-server Failover mds1 to oleg127-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 total: 10 open/close in 0.06 seconds: 165.74 ops/second PASS 33a (20s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 33b: test fid seq allocation ======= 11:12:13 (1713280333) fail_loc=0x1311 total: 10 open/close in 0.03 seconds: 287.50 ops/second Replay barrier on lustre-MDT0000 Stopping /mnt/lustre-mds1 (opts:) on oleg127-server Failover mds1 to oleg127-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 total: 10 open/close in 0.07 seconds: 140.74 ops/second PASS 33b (21s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 34: abort recovery before client does replay (test mds_cleanup_orphans) ========================================================== 11:12:36 (1713280356) multiop /mnt/lustre/f34.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7517 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1908 1285780 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1720 1285968 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605412 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210880 1% /mnt/lustre Stopping /mnt/lustre-mds1 (opts:) on oleg127-server Failover mds1 to oleg127-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 PASS 34 (18s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 35: test recovery from llog for unlink op ========================================================== 11:12:55 (1713280375) fail_loc=0x80000119 Stopping /mnt/lustre-mds1 (opts:) on oleg127-server Failover mds1 to oleg127-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 rm: cannot remove '/mnt/lustre/f35.replay-single': Input/output error oleg127-client: error: invalid path '/mnt/lustre': Input/output error pdsh@oleg127-client: oleg127-client: ssh exited with exit code 5 first stat failed: 5 Can't lstat /mnt/lustre/f35.replay-single: No such file or directory PASS 35 (17s) debug_raw_pointers=0 debug_raw_pointers=0 SKIP: replay-single test_36 skipping ALWAYS excluded test 36 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 37: abort recovery before client does replay (test mds_cleanup_orphans for directories) ========================================================== 11:13:14 (1713280394) multiop /mnt/lustre/d37.replay-single/f37.replay-single vdD_c TMPPIPE=/tmp/multiop_open_wait_pipe.7517 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1980 1285708 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1784 1285904 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605412 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210880 1% /mnt/lustre Stopping /mnt/lustre-mds1 (opts:) on oleg127-server Failover mds1 to oleg127-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 oleg127-client: error: invalid path '/mnt/lustre': Input/output error pdsh@oleg127-client: oleg127-client: ssh exited with exit code 5 first stat failed: 5 PASS 37 (18s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 38: test recovery from unlink llog (test llog_gen_rec) ========================================================== 11:13:34 (1713280414) total: 800 open/close in 2.02 seconds: 395.10 ops/second - unlinked 0 (time 1713280417 ; total 0 ; last 0) total: 400 unlinks in 1 seconds: 400.000000 unlinks/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2108 1285580 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605412 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210880 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:13:41 (1713280421) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:13:55 (1713280435) targets are mounted 11:13:55 (1713280435) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec - unlinked 0 (time 1713280442 ; total 0 ; last 0) total: 400 unlinks in 1 seconds: 400.000000 unlinks/second Can't lstat /mnt/lustre/f38.replay-single-*: No such file or directory PASS 38 (32s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 39: test recovery from unlink llog (test llog_gen_rec) ========================================================== 11:14:08 (1713280448) total: 800 open/close in 2.22 seconds: 360.85 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2112 1285576 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605412 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210880 1% /mnt/lustre - unlinked 0 (time 1713280454 ; total 0 ; last 0) total: 400 unlinks in 1 seconds: 400.000000 unlinks/second Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:14:16 (1713280456) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:14:31 (1713280471) targets are mounted 11:14:31 (1713280471) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec - unlinked 0 (time 1713280479 ; total 0 ; last 0) total: 400 unlinks in 2 seconds: 200.000000 unlinks/second Can't lstat /mnt/lustre/f39.replay-single-*: No such file or directory PASS 39 (37s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 41: read from a valid osc while other oscs are invalid ========================================================== 11:14:46 (1713280486) 1+0 records in 1+0 records out 4096 bytes (4.1 kB) copied, 0.00261825 s, 1.6 MB/s 1+0 records in 1+0 records out 4096 bytes (4.1 kB) copied, 0.00660288 s, 620 kB/s PASS 41 (4s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 42: recovery after ost failure ===== 11:14:52 (1713280492) total: 800 open/close in 3.82 seconds: 209.49 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2136 1285552 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre - unlinked 0 (time 1713280501 ; total 0 ; last 0) total: 400 unlinks in 1 seconds: 400.000000 unlinks/second debug=-1 Failing ost1 on oleg127-server Stopping /mnt/lustre-ost1 (opts:) on oleg127-server 11:15:04 (1713280504) shut down Failover ost1 to oleg127-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-OST0000 11:15:20 (1713280520) targets are mounted 11:15:20 (1713280520) facet_failover done wait for MDS to timeout and recover debug=super ioctl neterror warning dlmtrace error emerg ha rpctrace vfstrace config console lfsck debug=super ioctl neterror warning dlmtrace error emerg ha rpctrace vfstrace config console lfsck - unlinked 0 (time 1713280561 ; total 0 ; last 0) total: 400 unlinks in 1 seconds: 400.000000 unlinks/second Can't lstat /mnt/lustre/f42.replay-single-*: No such file or directory PASS 42 (72s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 43: mds osc import failure during recovery; don't LBUG ========================================================== 11:16:06 (1713280566) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2192 1285496 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre fail_loc=0x80000204 Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:16:11 (1713280571) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:16:25 (1713280585) targets are mounted 11:16:25 (1713280585) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 43 (36s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 44a: race in target handle connect ========================================================== 11:16:44 (1713280604) at_max=40 1 of 10 (1713280605) service : cur 5 worst 5 (at 1713278959, 1648s ago) 4 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2088 1285600 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 2 of 10 (1713280611) service : cur 5 worst 5 (at 1713278959, 1653s ago) 5 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2088 1285600 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 3 of 10 (1713280616) service : cur 5 worst 5 (at 1713278959, 1659s ago) 5 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2088 1285600 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 4 of 10 (1713280622) service : cur 5 worst 5 (at 1713278959, 1664s ago) 5 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2088 1285600 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 5 of 10 (1713280627) service : cur 5 worst 5 (at 1713278959, 1670s ago) 5 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2088 1285600 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 6 of 10 (1713280633) service : cur 5 worst 5 (at 1713278959, 1675s ago) 5 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2088 1285600 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 7 of 10 (1713280638) service : cur 5 worst 5 (at 1713278959, 1681s ago) 5 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2088 1285600 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 8 of 10 (1713280644) service : cur 5 worst 5 (at 1713278959, 1687s ago) 5 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2088 1285600 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 9 of 10 (1713280649) service : cur 5 worst 5 (at 1713278959, 1692s ago) 5 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2088 1285600 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 10 of 10 (1713280655) service : cur 5 worst 5 (at 1713278959, 1698s ago) 5 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2088 1285600 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre fail_loc=0 at_max=600 PASS 44a (58s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 44b: race in target handle connect ========================================================== 11:17:44 (1713280664) 1 of 10 (1713280664) service : cur 5 worst 5 (at 1713278959, 1707s ago) 5 4 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.201.127@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 2 of 10 (1713280705) service : cur 40 worst 40 (at 1713280706, 0s ago) 40 4 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.201.127@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 3 of 10 (1713280725) service : cur 40 worst 40 (at 1713280706, 21s ago) 40 4 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.201.127@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 4 of 10 (1713280746) service : cur 40 worst 40 (at 1713280706, 42s ago) 40 4 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.201.127@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 5 of 10 (1713280766) service : cur 40 worst 40 (at 1713280706, 62s ago) 1 40 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.201.127@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 6 of 10 (1713280787) service : cur 40 worst 40 (at 1713280706, 83s ago) 40 40 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.201.127@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 7 of 10 (1713280808) service : cur 40 worst 40 (at 1713280706, 103s ago) 40 40 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.201.127@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 8 of 10 (1713280828) service : cur 40 worst 40 (at 1713280706, 124s ago) 40 40 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.201.127@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 9 of 10 (1713280849) service : cur 40 worst 40 (at 1713280706, 144s ago) 40 40 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.201.127@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 10 of 10 (1713280869) service : cur 40 worst 40 (at 1713280706, 165s ago) 40 40 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.201.127@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre PASS 44b (227s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 44c: race in target handle connect ========================================================== 11:21:33 (1713280893) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2088 1285600 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre total: 100 create in 0.22 seconds: 446.69 ops/second fail_loc=0x80000712 Stopping /mnt/lustre-mds1 (opts:) on oleg127-server Failover mds1 to oleg127-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 oleg127-client: error: invalid path '/mnt/lustre': Input/output error pdsh@oleg127-client: oleg127-client: ssh exited with exit code 5 first stat failed: 5 unlink(/mnt/lustre/f44c.replay-single-0) error: No such file or directory total: 0 unlinks in 0 seconds: -nan unlinks/second Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:21:54 (1713280914) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:22:07 (1713280927) targets are mounted 11:22:07 (1713280927) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec unlink(/mnt/lustre/f44c.replay-single-0) error: No such file or directory total: 0 unlinks in 0 seconds: -nan unlinks/second PASS 44c (41s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 45: Handle failed close ============ 11:22:15 (1713280935) multiop /mnt/lustre/f45.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7517 /mnt/lustre/f45.replay-single has type file OK PASS 45 (3s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 46: Don't leak file handle after open resend (3325) ========================================================== 11:22:19 (1713280939) fail_loc=0x122 fail_loc=0 Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:22:37 (1713280957) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:22:50 (1713280970) targets are mounted 11:22:50 (1713280970) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec lfs path2fid: cannot get fid for 'f46.replay-single': No such file or directory PASS 46 (38s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 47: MDS->OSC failure during precreate cleanup (2824) ========================================================== 11:22:59 (1713280979) total: 20 open/close in 0.07 seconds: 294.42 ops/second Failing ost1 on oleg127-server Stopping /mnt/lustre-ost1 (opts:) on oleg127-server 11:23:00 (1713280980) shut down Failover ost1 to oleg127-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-OST0000 11:23:14 (1713280994) targets are mounted 11:23:14 (1713280994) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec fail_loc=0x80000204 total: 20 open/close in 0.06 seconds: 341.03 ops/second - unlinked 0 (time 1713281059 ; total 0 ; last 0) total: 20 unlinks in 0 seconds: inf unlinks/second PASS 47 (81s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 48: MDS->OSC failure during precreate cleanup (2824) ========================================================== 11:24:21 (1713281061) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2120 1285568 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1848 1285840 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre total: 20 open/close in 0.07 seconds: 303.83 ops/second Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:24:25 (1713281065) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:24:39 (1713281079) targets are mounted 11:24:39 (1713281079) facet_failover done fail_loc=0x80000216 total: 20 open/close in 0.07 seconds: 270.03 ops/second - unlinked 0 (time 1713281143 ; total 0 ; last 0) total: 40 unlinks in 0 seconds: inf unlinks/second PASS 48 (83s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 50: Double OSC recovery, don't LASSERT (3812) ========================================================== 11:25:45 (1713281145) PASS 50 (7s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 52: time out lock replay (3764) ==== 11:25:54 (1713281154) multiop /mnt/lustre/f52.replay-single vs_s TMPPIPE=/tmp/multiop_open_wait_pipe.7517 fail_loc=0x80000157 Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:25:56 (1713281156) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:26:09 (1713281169) targets are mounted 11:26:09 (1713281169) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0x0 fail_loc=0x0 PASS 52 (80s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53a: |X| close request while two MDC requests in flight ========================================================== 11:27:16 (1713281236) fail_loc=0x80000115 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:27:20 (1713281240) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:27:34 (1713281254) targets are mounted 11:27:34 (1713281254) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53a.replay-single-1/f has type file OK /mnt/lustre/d53a.replay-single-2/f has type file OK PASS 53a (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53b: |X| open request while two MDC requests in flight ========================================================== 11:27:43 (1713281263) multiop /mnt/lustre/d53b.replay-single-1/f vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7517 fail_loc=0x80000107 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:27:47 (1713281267) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:28:01 (1713281281) targets are mounted 11:28:01 (1713281281) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53b.replay-single-1/f has type file OK /mnt/lustre/d53b.replay-single-2/f has type file OK PASS 53b (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53c: |X| open request and close request while two MDC requests in flight ========================================================== 11:28:10 (1713281290) fail_loc=0x80000107 fail_loc=0x80000115 Replay barrier on lustre-MDT0000 Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:28:14 (1713281294) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:28:28 (1713281308) targets are mounted 11:28:28 (1713281308) facet_failover done fail_loc=0 /mnt/lustre/d53c.replay-single-1/f has type file OK /mnt/lustre/d53c.replay-single-2/f has type file OK PASS 53c (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53d: close reply while two MDC requests in flight ========================================================== 11:28:37 (1713281317) fail_loc=0x8000013b fail_loc=0 Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:28:39 (1713281319) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:28:53 (1713281333) targets are mounted 11:28:53 (1713281333) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53d.replay-single-1/f has type file OK /mnt/lustre/d53d.replay-single-2/f has type file OK PASS 53d (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53e: |X| open reply while two MDC requests in flight ========================================================== 11:29:03 (1713281343) fail_loc=0x119 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:29:08 (1713281348) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:29:22 (1713281362) targets are mounted 11:29:22 (1713281362) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53e.replay-single-1/f has type file OK /mnt/lustre/d53e.replay-single-2/f has type file OK PASS 53e (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53f: |X| open reply and close reply while two MDC requests in flight ========================================================== 11:29:30 (1713281370) fail_loc=0x119 fail_loc=0x8000013b Replay barrier on lustre-MDT0000 Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:29:35 (1713281375) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:29:49 (1713281389) targets are mounted 11:29:49 (1713281389) facet_failover done fail_loc=0 /mnt/lustre/d53f.replay-single-1/f has type file OK /mnt/lustre/d53f.replay-single-2/f has type file OK PASS 53f (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53g: |X| drop open reply and close request while close and open are both in flight ========================================================== 11:29:57 (1713281397) fail_loc=0x119 fail_loc=0x80000115 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:30:02 (1713281402) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:30:16 (1713281416) targets are mounted 11:30:16 (1713281416) facet_failover done /mnt/lustre/d53g.replay-single-1/f has type file OK /mnt/lustre/d53g.replay-single-2/f has type file OK PASS 53g (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53h: open request and close reply while two MDC requests in flight ========================================================== 11:30:25 (1713281425) fail_loc=0x80000107 fail_loc=0x8000013b Replay barrier on lustre-MDT0000 Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:30:30 (1713281430) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:30:45 (1713281445) targets are mounted 11:30:45 (1713281445) facet_failover done fail_loc=0 /mnt/lustre/d53h.replay-single-1/f has type file OK /mnt/lustre/d53h.replay-single-2/f has type file OK PASS 53h (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 55: let MDS_CHECK_RESENT return the original return code instead of 0 ========================================================== 11:30:53 (1713281453) fail_loc=0x8000012b fail_loc=0x0 rm: touch: cannot touch '/mnt/lustre/f55.replay-single'cannot remove '/mnt/lustre/f55.replay-single': No such file or directory : No such file or directory PASS 55 (62s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 56: don't replay a symlink open request (3440) ========================================================== 11:31:56 (1713281516) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2128 1285560 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1848 1285840 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:31:59 (1713281519) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:32:13 (1713281533) targets are mounted 11:32:13 (1713281533) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 56 (35s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 57: test recovery from llog for setattr op ========================================================== 11:32:32 (1713281552) fail_loc=0x8000012c UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2128 1285560 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1848 1285840 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:32:36 (1713281556) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:32:51 (1713281571) targets are mounted 11:32:51 (1713281571) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec affected facets: mds1 oleg127-server: oleg127-server.virtnet: executing _wait_recovery_complete *.lustre-MDT0000.recovery_status 1475 oleg127-server: *.lustre-MDT0000.recovery_status status: COMPLETE Waiting for orphan cleanup... osp.lustre-OST0000-osc-MDT0000.old_sync_processed osp.lustre-OST0000-osc-MDT0001.old_sync_processed osp.lustre-OST0001-osc-MDT0000.old_sync_processed osp.lustre-OST0001-osc-MDT0001.old_sync_processed wait 40 secs maximumly for oleg127-server mds-ost sync done. /mnt/lustre/f57.replay-single has type file OK fail_loc=0x0 PASS 57 (29s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 58a: test recovery from llog for setattr op (test llog_gen_rec) ========================================================== 11:33:02 (1713281582) fail_loc=0x8000012c total: 2500 open/close in 7.87 seconds: 317.58 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2424 1285264 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1848 1285840 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:33:16 (1713281596) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:33:31 (1713281611) targets are mounted 11:33:31 (1713281611) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0x0 - unlinked 0 (time 1713281627 ; total 0 ; last 0) total: 2500 unlinks in 4 seconds: 625.000000 unlinks/second PASS 58a (51s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 58b: test replay of setxattr op ==== 11:33:55 (1713281635) Starting client: oleg127-client.virtnet: -o user_xattr,flock oleg127-server@tcp:/lustre /mnt/lustre2 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2404 1285284 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1848 1285840 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:34:00 (1713281640) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:34:14 (1713281654) targets are mounted 11:34:14 (1713281654) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Stopping client oleg127-client.virtnet /mnt/lustre2 (opts:) oleg127-client.virtnet: executing wait_import_state_mount FULL mgc.*.mgs_server_uuid mgc.*.mgs_server_uuid in FULL state after 0 sec PASS 58b (29s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 58c: resend/reconstruct setxattr op ========================================================== 11:34:26 (1713281666) Starting client: oleg127-client.virtnet: -o user_xattr,flock oleg127-server@tcp:/lustre /mnt/lustre2 fail_val=0 fail_loc=0x123 fail_loc=0 fail_loc=0x119 fail_loc=0 Stopping client oleg127-client.virtnet /mnt/lustre2 (opts:) PASS 58c (128s) debug_raw_pointers=0 debug_raw_pointers=0 SKIP: replay-single test_59 skipping ALWAYS excluded test 59 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 60: test llog post recovery init vs llog unlink ========================================================== 11:36:37 (1713281797) total: 200 open/close in 0.50 seconds: 397.02 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2264 1285424 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1848 1285840 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre - unlinked 0 (time 1713281801 ; total 0 ; last 0) total: 100 unlinks in 0 seconds: inf unlinks/second Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:36:42 (1713281802) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:36:56 (1713281816) targets are mounted 11:36:56 (1713281816) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec - unlinked 0 (time 1713281822 ; total 0 ; last 0) total: 100 unlinks in 0 seconds: inf unlinks/second PASS 60 (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 61a: test race llog recovery vs llog cleanup ========================================================== 11:37:06 (1713281826) total: 800 open/close in 1.70 seconds: 470.70 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2348 1285340 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2024 1285664 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre - unlinked 0 (time 1713281831 ; total 0 ; last 0) total: 800 unlinks in 1 seconds: 800.000000 unlinks/second fail_val=0 fail_loc=0x80000221 Failing ost1 on oleg127-server Stopping /mnt/lustre-ost1 (opts:) on oleg127-server 11:37:14 (1713281834) shut down Failover ost1 to oleg127-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-OST0000 11:37:28 (1713281848) targets are mounted 11:37:28 (1713281848) facet_failover done Failing ost1 on oleg127-server Stopping /mnt/lustre-ost1 (opts:) on oleg127-server 11:37:39 (1713281859) shut down Failover ost1 to oleg127-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-OST0000 11:37:53 (1713281873) targets are mounted 11:37:53 (1713281873) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec fail_val=0 fail_loc=0x0 Can't lstat /mnt/lustre/d61a.replay-single/f61a.replay-single-*: No such file or directory PASS 61a (81s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 61b: test race mds llog sync vs llog cleanup ========================================================== 11:38:29 (1713281909) fail_loc=0x8000013a Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:38:30 (1713281910) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:38:43 (1713281923) targets are mounted 11:38:43 (1713281923) facet_failover done Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:38:54 (1713281934) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:39:07 (1713281947) targets are mounted 11:39:07 (1713281947) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1+0 records in 1+0 records out 4096 bytes (4.1 kB) copied, 0.00348746 s, 1.2 MB/s PASS 61b (45s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 61c: test race mds llog sync vs llog cleanup ========================================================== 11:39:16 (1713281956) fail_val=0 fail_loc=0x80000222 Failing ost1 on oleg127-server Stopping /mnt/lustre-ost1 (opts:) on oleg127-server 11:39:28 (1713281968) shut down Failover ost1 to oleg127-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-OST0000 11:39:41 (1713281981) targets are mounted 11:39:41 (1713281981) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec fail_val=0 fail_loc=0x0 PASS 61c (31s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 61d: error in llog_setup should cleanup the llog context correctly ========================================================== 11:39:49 (1713281989) Stopping /mnt/lustre-mds1 (opts:) on oleg127-server fail_loc=0x80000605 Starting mgs: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: mount.lustre: mount /dev/mapper/mds1_flakey at /mnt/lustre-mds1 failed: Operation not supported pdsh@oleg127-client: oleg127-server: ssh exited with exit code 95 Start of /dev/mapper/mds1_flakey on mgs failed 95 fail_loc=0 Starting mgs: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 PASS 61d (12s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 62: don't mis-drop resent replay === 11:40:03 (1713282003) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2340 1285348 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2004 1285684 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3128 7210912 1% /mnt/lustre total: 25 open/close in 0.12 seconds: 211.87 ops/second fail_loc=0x80000707 Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:40:08 (1713282008) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:40:24 (1713282024) targets are mounted 11:40:24 (1713282024) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0 - unlinked 0 (time 1713282086 ; total 0 ; last 0) total: 25 unlinks in 1 seconds: 25.000000 unlinks/second PASS 62 (85s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 65a: AT: verify early replies ====== 11:41:30 (1713282090) at_history=8 at_history=8 debug=other fail_val=11000 fail_loc=0x8000050a 00000100:00001000:3.0:1713282120.788206:0:14583:0:(client.c:537:ptlrpc_at_recv_early_reply()) @@@ Early reply #1, new deadline in 35s (25s) req@ffff8800a8880000 x1796503162825664/t0(0) o101->lustre-MDT0000-mdc-ffff88012a48c000@192.168.201.127@tcp:12/10 lens 664/66320 e 1 to 0 dl 1713282155 ref 2 fl Rpc:PQr/200/ffffffff rc 0/-1 job:'createmany.0' uid:0 gid:0 portal 12 : cur 36 worst 40 (at 1713280705, 1421s ago) 40 40 40 40 portal 29 : cur 5 worst 5 (at 1713279225, 2901s ago) 5 0 0 0 portal 23 : cur 5 worst 5 (at 1713279225, 2901s ago) 5 5 5 5 portal 30 : cur 5 worst 5 (at 1713279229, 2897s ago) 5 0 0 0 portal 17 : cur 5 worst 5 (at 1713279309, 2817s ago) 5 5 5 5 portal 13 : cur 5 worst 5 (at 1713279679, 2447s ago) 5 5 0 0 portal 12 : cur 5 worst 40 (at 1713280705, 1430s ago) 40 40 40 40 portal 29 : cur 5 worst 5 (at 1713279225, 2910s ago) 5 0 0 0 portal 23 : cur 5 worst 5 (at 1713279225, 2910s ago) 5 5 5 5 portal 30 : cur 5 worst 5 (at 1713279229, 2906s ago) 5 0 0 0 portal 17 : cur 5 worst 5 (at 1713279309, 2826s ago) 5 5 5 5 portal 13 : cur 5 worst 5 (at 1713279679, 2456s ago) 5 5 0 0 PASS 65a (46s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 65b: AT: verify early replies on packed reply / bulk ========================================================== 11:42:18 (1713282138) at_history=8 at_history=8 debug=other trace fail_val=11 fail_loc=0x224 fail_loc=0 00000100:00001000:2.0:1713282168.724643:0:1958:0:(client.c:537:ptlrpc_at_recv_early_reply()) @@@ Early reply #1, new deadline in 35s (25s) req@ffff8800a7ed1500 x1796503162835712/t0(0) o4->lustre-OST0000-osc-ffff88012a48c000@192.168.201.127@tcp:6/4 lens 4584/448 e 1 to 0 dl 1713282203 ref 2 fl Rpc:Qr/200/ffffffff rc 0/-1 job:'multiop.0' uid:0 gid:0 debug=super ioctl neterror warning dlmtrace error emerg ha rpctrace vfstrace config console lfsck debug=super ioctl neterror warning dlmtrace error emerg ha rpctrace vfstrace config console lfsck portal 28 : cur 5 worst 5 (at 1713279225, 2949s ago) 5 5 5 5 portal 7 : cur 5 worst 5 (at 1713279226, 2948s ago) 5 5 5 5 portal 6 : cur 36 worst 36 (at 1713282173, 1s ago) 36 0 0 0 portal 17 : cur 5 worst 5 (at 1713279492, 2682s ago) 5 5 0 5 PASS 65b (38s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 66a: AT: verify MDT service time adjusts with no early replies ========================================================== 11:42:57 (1713282177) at_history=8 at_history=8 portal 12 : cur 5 worst 40 (at 1713280705, 1494s ago) 40 40 40 40 fail_val=5000 fail_loc=0x8000050a portal 12 : cur 5 worst 40 (at 1713280705, 1500s ago) 40 40 40 40 fail_val=10000 fail_loc=0x8000050a portal 12 : cur 36 worst 40 (at 1713280705, 1511s ago) 40 40 40 40 fail_loc=0 portal 12 : cur 5 worst 40 (at 1713280705, 1520s ago) 5 40 40 40 Current MDT timeout 5, worst 40 PASS 66a (50s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 66b: AT: verify net latency adjusts ========================================================== 11:43:48 (1713282228) at_history=8 at_history=8 fail_val=10 fail_loc=0x50c fail_loc=0 network timeout orig 5, cur 10, worst 10 PASS 66b (85s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 67a: AT: verify slow request processing doesn't induce reconnects ========================================================== 11:45:15 (1713282315) at_history=8 at_history=8 fail_val=400 fail_loc=0x50a fail_loc=0 0 osc reconnect attempts on gradual slow PASS 67a (74s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 67b: AT: verify instant slowdown doesn't induce reconnects ========================================================== 11:46:32 (1713282392) at_history=8 at_history=8 Creating to objid 5057 on ost lustre-OST0000... fail_val=20000 fail_loc=0x80000223 total: 17 open/close in 0.08 seconds: 200.94 ops/second Connected clients: oleg127-client.virtnet oleg127-client.virtnet service : cur 5 worst 5 (at 1713278967, 3451s ago) 1 1 0 1 phase 2 0 osc reconnect attempts on instant slow fail_loc=0x80000223 fail_loc=0 Connected clients: oleg127-client.virtnet oleg127-client.virtnet service : cur 5 worst 5 (at 1713278967, 3452s ago) 1 1 0 1 0 osc reconnect attempts on 2nd slow PASS 67b (29s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 68: AT: verify slowing locks ======= 11:47:02 (1713282422) at_history=8 at_history=8 /home/green/git/lustre-release/lustre/tests/replay-single.sh: line 1968: $ldlm_enqueue_min: ambiguous redirect fail_val=19 fail_loc=0x80000312 fail_val=25 fail_loc=0x80000312 fail_loc=0 /home/green/git/lustre-release/lustre/tests/replay-single.sh: line 1983: $ldlm_enqueue_min: ambiguous redirect PASS 68 (70s) debug_raw_pointers=0 debug_raw_pointers=0 Cleaning up AT ... debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70a: check multi client t-f ======== 11:48:14 (1713282494) SKIP: replay-single test_70a Need two or more clients, have 1 SKIP 70a (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70b: dbench 2mdts recovery; 1 clients ========================================================== 11:48:18 (1713282498) Starting client oleg127-client.virtnet: -o user_xattr,flock oleg127-server@tcp:/lustre /mnt/lustre Started clients oleg127-client.virtnet: 192.168.201.127@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) striped dir -i0 -c2 -H all_char /mnt/lustre/d70b.replay-single + MISSING_DBENCH_OK= + PATH=/opt/iozone/bin:/opt/iozone/bin:/home/green/git/lustre-release/lustre/tests/mpi:/home/green/git/lustre-release/lustre/tests/racer:/home/green/git/lustre-release/lustre/../lustre-iokit/sgpdd-survey:/home/green/git/lustre-release/lustre/tests:/home/green/git/lustre-release/lustre/utils/gss:/home/green/git/lustre-release/lustre/utils:/opt/iozone/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin::/home/green/git/lustre-release/lustre/utils:/home/green/git/lustre-release/lustre/tests:/sbin:/usr/sbin:/home/green/git/lustre-release/lustre/utils:/home/green/git/lustre-release/lustre/tests/: + DBENCH_LIB= + TESTSUITE=replay-single + TESTNAME=test_70b + MOUNT=/mnt/lustre ++ hostname + DIR=/mnt/lustre/d70b.replay-single/oleg127-client.virtnet + LCTL=/home/green/git/lustre-release/lustre/utils/lctl + rundbench 1 -t 300 dbench: no process found Started rundbench load pid=21704 ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2464 1285224 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2160 1285528 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 26156 3580864 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1640 3594096 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 27796 7174960 1% /mnt/lustre test_70b fail mds1 1 times Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:48:27 (1713282507) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:48:42 (1713282522) targets are mounted 11:48:42 (1713282522) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2424 1285264 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2128 1285560 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 26412 3568868 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1792 3594192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 28204 7163060 1% /mnt/lustre oleg127-client.virtnet: looking for dbench program oleg127-client.virtnet: /usr/bin/dbench oleg127-client.virtnet: creating output directory /mnt/lustre/d70b.replay-single/oleg127-client.virtnet oleg127-client.virtnet: mkdir: created directory '/mnt/lustre/d70b.replay-single/oleg127-client.virtnet' oleg127-client.virtnet: found dbench client file /usr/share/dbench/client.txt oleg127-client.virtnet: '/usr/share/dbench/client.txt' -> 'client.txt' oleg127-client.virtnet: running 'dbench 1 -t 300' on /mnt/lustre/d70b.replay-single/oleg127-client.virtnet at Tue Apr 16 11:48:19 EDT 2024 oleg127-client.virtnet: waiting for dbench pid 21728 oleg127-client.virtnet: dbench version 4.00 - Copyright Andrew Tridgell 1999-2004 oleg127-client.virtnet: oleg127-client.virtnet: Running for 300 seconds with load 'client.txt' and minimum warmup 60 secs oleg127-client.virtnet: failed to create barrier semaphore oleg127-client.virtnet: 0 of 1 processes prepared for launch 0 sec oleg127-client.virtnet: 1 of 1 processes prepared for launch 0 sec oleg127-client.virtnet: releasing clients oleg127-client.virtnet: 1 313 11.48 MB/sec warmup 1 sec latency 21.381 ms oleg127-client.virtnet: 1 673 10.59 MB/sec warmup 2 sec latency 18.102 ms oleg127-client.virtnet: 1 977 7.30 MB/sec warmup 3 sec latency 16.990 ms oleg127-client.virtnet: 1 1448 6.50 MB/sec warmup 4 sec latency 15.101 ms oleg127-client.virtnet: 1 1808 5.27 MB/sec warmup 5 sec latency 17.848 ms oleg127-client.virtnet: 1 2184 4.46 MB/sec warmup 6 sec latency 10.815 ms oleg127-client.virtnet: 1 2596 4.14 MB/sec warmup 7 sec latency 15.604 ms oleg127-client.virtnet: 1 3352 4.42 MB/sec warmup 8 sec latency 26.091 ms oleg127-client.virtnet: 1 3996 4.45 MB/sec warmup 9 sec latency 12.234 ms oleg127-client.virtnet: 1 4083 4.01 MB/sec warmup 10 sec latency 836.363 ms oleg127-client.virtnet: 1 4083 3.64 MB/sec warmup 11 sec latency 1836.624 ms oleg127-client.virtnet: 1 4083 3.34 MB/sec warmup 12 sec latency 2836.885 ms oleg127-client.virtnet: 1 4083 3.08 MB/sec warmup 13 sec latency 3837.136 ms oleg127-client.virtnet: 1 4083 2.86 MB/sec warmup 14 sec latency 4837.409 ms oleg127-client.virtnet: 1 4083 2.67 MB/sec warmup 15 sec latency 5837.654 ms oleg127-client.virtnet: 1 4083 2.51 MB/sec warmup 16 sec latency 6837.899 ms oleg127-client.virtnet: 1 4083 2.36 MB/sec warmup 17 sec latency 7838.136 ms oleg127-client.virtnet: 1 4083 2.23 MB/sec warmup 18 sec latency 8838.365 ms oleg127-client.virtnet: 1 4083 2.11 MB/sec warmup 19 sec latency 9838.594 ms oleg127-client.virtnet: 1 4083 2.00 MB/sec warmup 20 sec latency 10838.802 ms oleg127-client.virtnet: 1 4083 1.91 MB/sec warmup 21 sec latency 11839.089 ms oleg127-client.virtnet: 1 4083 1.82 MB/sec warmup 22 sec latency 12839.266 ms oleg127-client.virtnet: 1 4083 1.74 MB/sec warmup 23 sec latency 13839.488 ms oleg127-client.virtnet: 1 4083 1.67 MB/sec warmup 24 sec latency 14839.727 ms oleg127-client.virtnet: 1 4083 1.60 MB/sec warmup 25 sec latency 15839.968 ms oleg127-client.virtnet: 1 4083 1.54 MB/sec warmup 26 sec latency 16840.200 ms oleg127-client.virtnet: 1 4268 1.49 MB/sec warmup 27 sec latency 17500.659 ms oleg127-client.virtnet: 1 4606 1.50 MB/sec warmup 28 sec latency 16.276 ms oleg127-client.virtnet: 1 5021 1.56 MB/sec warmup 29 sec latency 15.571 ms oleg127-client.virtnet: 1 5334 1.51 MB/sec warmup 30 sec latency 17.735 ms oleg127-client.virtnet: 1 5726 1.48 MB/sec warmup 31 sec latency 12.126 ms oleg127-client.virtnet: 1 6135 1.50 MB/sec warmup 32 sec latency 19.143 ms oleg127-client.virtnet: 1 6436 1.55 MB/sec warmup 33 sec latency 650.103test_70b fail mds2 2 times Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 11:48:55 (1713282535) shut down Failover mds2 to oleg127-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 11:49:11 (1713282551) targets are mounted 11:49:11 (1713282551) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2384 1285304 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2052 1285636 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 37424 3569476 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 12688 3594148 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 50112 7163624 1% /mnt/lustre test_70b fail mds1 3 times Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:49:24 (1713282564) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 ms oleg127-client.virtnet: 1 7092 1.69 MB/sec warmup 34 sec latency 15.999 ms oleg127-client.virtnet: 1 7404 1.68 MB/sec warmup 35 sec latency 22.782 ms oleg127-client.virtnet: 1 7542 1.64 MB/sec warmup 36 sec latency 454.362 ms oleg127-client.virtnet: 1 7542 1.60 MB/sec warmup 37 sec latency 1454.607 ms oleg127-client.virtnet: 1 7542 1.56 MB/sec warmup 38 sec latency 2454.809 ms oleg127-client.virtnet: 1 7542 1.52 MB/sec warmup 39 sec latency 3455.123 ms oleg127-client.virtnet: 1 7542 1.48 MB/sec warmup 40 sec latency 4455.342 ms oleg127-client.virtnet: 1 7542 1.44 MB/sec warmup 41 sec latency 5455.549 ms oleg127-client.virtnet: 1 7542 1.41 MB/sec warmup 42 sec latency 6455.792 ms oleg127-client.virtnet: 1 7542 1.38 MB/sec warmup 43 sec latency 7456.024 ms oleg127-client.virtnet: 1 7542 1.34 MB/sec warmup 44 sec latency 8456.258 ms oleg127-client.virtnet: 1 7542 1.31 MB/sec warmup 45 sec latency 9456.518 ms oleg127-client.virtnet: 1 7542 1.29 MB/sec warmup 46 sec latency 10456.767 ms oleg127-client.virtnet: 1 7542 1.26 MB/sec warmup 47 sec latency 11456.996 ms oleg127-client.virtnet: 1 7542 1.23 MB/sec warmup 48 sec latency 12457.186 ms oleg127-client.virtnet: 1 7542 1.21 MB/sec warmup 49 sec latency 13457.375 ms oleg127-client.virtnet: 1 7542 1.18 MB/sec warmup 50 sec latency 14457.534 ms oleg127-client.virtnet: 1 7542 1.16 MB/sec warmup 51 sec latency 15457.688 ms oleg127-client.virtnet: 1 7542 1.14 MB/sec warmup 52 sec latency 16457.889 ms oleg127-client.virtnet: 1 7542 1.12 MB/sec warmup 53 sec latency 17458.059 ms oleg127-client.virtnet: 1 7542 1.10 MB/sec warmup 54 sec latency 18458.232 ms oleg127-client.virtnet: 1 7542 1.08 MB/sec warmup 55 sec latency 19458.552 ms oleg127-client.virtnet: 1 7701 1.06 MB/sec warmup 56 sec latency 19880.063 ms oleg127-client.virtnet: 1 8030 1.05 MB/sec warmup 57 sec latency 25.474 ms oleg127-client.virtnet: 1 8371 1.06 MB/sec warmup 58 sec latency 16.907 ms oleg127-client.virtnet: 1 8761 1.09 MB/sec warmup 59 sec latency 15.225 ms oleg127-client.virtnet: 1 9532 1.58 MB/sec execute 1 sec latency 16.516 ms oleg127-client.virtnet: 1 10100 3.50 MB/sec execute 2 sec latency 15.748 ms oleg127-client.virtnet: 1 10732 4.03 MB/sec execute 3 sec latency 14.767 ms oleg127-client.virtnet: 1 11021 3.38 MB/sec execute 4 sec latency 15.288 ms oleg127-client.virtnet: 1 11359 2.76 MB/sec execute 5 sec latency 21.539 ms oleg127-client.virtnet: 1 11653 2.42 MB/sec execute 6 sec latency 15.499 ms oleg127-client.virtnet: 1 12121 2.66 MB/sec execute 7 sec latency 15.568 ms oleg127-client.virtnet: 1 12444 2.36 MB/sec execute 8 sec latency 16.361 ms oleg127-client.virtnet: 1 12836 2.15 MB/sec execute 9 sec latency 10.921 ms oleg127-client.virtnet: 1 13255 2.16 MB/sec execute 10 sec latency 15.063 ms oleg127-client.virtnet: 1 14004 2.54 MB/sec execute 11 sec latency 14.957 ms oleg127-client.virtnet: 1 14436 2.70 MB/sec execute 12 sec latency 14.612 ms oleg127-client.virtnet: 1 14673 2.51 MB/sec execute 13 sec latency 15.222 ms oleg127-client.virtnet: 1 15051 2.38 MB/sec execute 14 sec latency 17.582 ms oleg127-client.virtnet: 1 15404 2.32 MB/sec execute 15 sec latency 15.187 ms oleg127-client.virtnet: 1 16033 2.38 MB/sec execute 16 sec latency 15.009 ms oleg127-client.virtnet: 1 16634 2.34 MB/sec execute 17 sec latency 14.826 ms oleg127-client.virtnet: 1 17205 2.51 MB/sec execute 18 sec latency 14.910 ms oleg127-client.virtnet: 1 17876 2.66 MB/sec executoleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:49:39 (1713282579) targets are mounted 11:49:39 (1713282579) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2384 1285304 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2080 1285608 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 37548 3568884 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 12064 3591904 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 49612 7160788 1% /mnt/lustre test_70b fail mds2 4 times Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 11:49:52 (1713282592) shut down Failover mds2 to oleg127-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 11:50:07 (1713282607) targets are mounted 11:50:07 (1713282607) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2384 1285304 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2052 1285636 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 37512 3568732 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 12192 3591824 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 49704 7160556 1% /mnt/lustre test_70b fail mds1 5 times Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:50:21 (1713282621) shut down e 19 sec latency 11.905 ms oleg127-client.virtnet: 1 18162 2.59 MB/sec execute 20 sec latency 15.832 ms oleg127-client.virtnet: 1 18486 2.48 MB/sec execute 21 sec latency 22.168 ms oleg127-client.virtnet: 1 18816 2.44 MB/sec execute 22 sec latency 16.445 ms oleg127-client.virtnet: 1 19233 2.47 MB/sec execute 23 sec latency 15.143 ms oleg127-client.virtnet: 1 19568 2.38 MB/sec execute 24 sec latency 16.698 ms oleg127-client.virtnet: 1 19974 2.31 MB/sec execute 25 sec latency 10.116 ms oleg127-client.virtnet: 1 20377 2.30 MB/sec execute 26 sec latency 15.245 ms oleg127-client.virtnet: 1 21132 2.45 MB/sec execute 27 sec latency 15.249 ms oleg127-client.virtnet: 1 21545 2.52 MB/sec execute 28 sec latency 16.175 ms oleg127-client.virtnet: 1 21775 2.45 MB/sec execute 29 sec latency 16.671 ms oleg127-client.virtnet: 1 22114 2.38 MB/sec execute 30 sec latency 17.153 ms oleg127-client.virtnet: 1 22432 2.35 MB/sec execute 31 sec latency 16.990 ms oleg127-client.virtnet: 1 22827 2.38 MB/sec execute 32 sec latency 16.672 ms oleg127-client.virtnet: 1 22854 2.31 MB/sec execute 33 sec latency 923.058 ms oleg127-client.virtnet: 1 22854 2.24 MB/sec execute 34 sec latency 1923.339 ms oleg127-client.virtnet: 1 22854 2.17 MB/sec execute 35 sec latency 2923.598 ms oleg127-client.virtnet: 1 22854 2.11 MB/sec execute 36 sec latency 3923.989 ms oleg127-client.virtnet: 1 22854 2.06 MB/sec execute 37 sec latency 4924.400 ms oleg127-client.virtnet: 1 22854 2.00 MB/sec execute 38 sec latency 5924.697 ms oleg127-client.virtnet: 1 22854 1.95 MB/sec execute 39 sec latency 6924.995 ms oleg127-client.virtnet: 1 22854 1.90 MB/sec execute 40 sec latency 7925.383 ms oleg127-client.virtnet: 1 22854 1.86 MB/sec execute 41 sec latency 8925.662 ms oleg127-client.virtnet: 1 22854 1.81 MB/sec execute 42 sec latency 9925.961 ms oleg127-client.virtnet: 1 22854 1.77 MB/sec execute 43 sec latency 10926.164 ms oleg127-client.virtnet: 1 22854 1.73 MB/sec execute 44 sec latency 11926.325 ms oleg127-client.virtnet: 1 22854 1.69 MB/sec execute 45 sec latency 12926.498 ms oleg127-client.virtnet: 1 22854 1.65 MB/sec execute 46 sec latency 13926.641 ms oleg127-client.virtnet: 1 22854 1.62 MB/sec execute 47 sec latency 14926.838 ms oleg127-client.virtnet: 1 22854 1.59 MB/sec execute 48 sec latency 15927.164 ms oleg127-client.virtnet: 1 22854 1.55 MB/sec execute 49 sec latency 16927.444 ms oleg127-client.virtnet: 1 22854 1.52 MB/sec execute 50 sec latency 17927.623 ms oleg127-client.virtnet: 1 22854 1.49 MB/sec execute 51 sec latency 18927.868 ms oleg127-client.virtnet: 1 22854 1.46 MB/sec execute 52 sec latency 19928.104 ms oleg127-client.virtnet: 1 23191 1.44 MB/sec execute 53 sec latency 19930.486 ms oleg127-client.virtnet: 1 23615 1.44 MB/sec execute 54 sec latency 14.234 ms oleg127-client.virtnet: 1 23958 1.44 MB/sec execute 55 sec latency 16.105 ms oleg127-client.virtnet: 1 24675 1.52 MB/sec execute 56 sec latency 15.110 ms oleg127-client.virtnet: 1 25099 1.57 MB/sec execute 57 sec latency 15.947 ms oleg127-client.virtnet: 1 25333 1.55 MB/sec execute 58 sec latency 15.505 ms oleg127-client.virtnet: 1 25665 1.53 MB/sec execute 59 sec latency 20.964 ms oleg127-client.virtnet: 1 26014 1.54 MB/sec execute 60 sec latency 15.374 ms oleg127-client.virtnet: 1 26441 1.56 MB/sec execute 61 sec latency 16.377 ms oleg127-client.virtnet: 1 26794 1.54 MB/sec execute 62 sec latency 15.978 ms oleg127-client.virtnet: 1 27224 1.54 MB/sec execute 63 sec latency 26.212 ms oleg127-cliFailover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:50:36 (1713282636) targets are mounted 11:50:36 (1713282636) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2384 1285304 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2076 1285612 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 37560 3568872 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 12064 3591904 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 49624 7160776 1% /mnt/lustre test_70b fail mds2 6 times Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 11:50:49 (1713282649) shut down Failover mds2 to oleg127-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 11:51:05 (1713282665) targets are mounted 11:51:05 (1713282665) facet_failover done ent.virtnet: 1 27651 1.58 MB/sec execute 64 sec latency 15.625 ms oleg127-client.virtnet: 1 28363 1.65 MB/sec execute 65 sec latency 15.093 ms oleg127-client.virtnet: 1 28712 1.65 MB/sec execute 66 sec latency 15.544 ms oleg127-client.virtnet: 1 28974 1.63 MB/sec execute 67 sec latency 15.551 ms oleg127-client.virtnet: 1 29324 1.62 MB/sec execute 68 sec latency 15.859 ms oleg127-client.virtnet: 1 29667 1.61 MB/sec execute 69 sec latency 15.133 ms oleg127-client.virtnet: 1 30069 1.63 MB/sec execute 70 sec latency 15.545 ms oleg127-client.virtnet: 1 30419 1.62 MB/sec execute 71 sec latency 16.846 ms oleg127-client.virtnet: 1 30820 1.61 MB/sec execute 72 sec latency 15.129 ms oleg127-client.virtnet: 1 31346 1.67 MB/sec execute 73 sec latency 15.784 ms oleg127-client.virtnet: 1 31987 1.71 MB/sec execute 74 sec latency 14.760 ms oleg127-client.virtnet: 1 32284 1.71 MB/sec execute 75 sec latency 16.185 ms oleg127-client.virtnet: 1 32585 1.69 MB/sec execute 76 sec latency 18.133 ms oleg127-client.virtnet: 1 32896 1.68 MB/sec execute 77 sec latency 17.706 ms oleg127-client.virtnet: 1 33239 1.67 MB/sec execute 78 sec latency 16.163 ms oleg127-client.virtnet: 1 33637 1.69 MB/sec execute 79 sec latency 16.107 ms oleg127-client.virtnet: 1 33984 1.67 MB/sec execute 80 sec latency 18.419 ms oleg127-client.virtnet: 1 34383 1.67 MB/sec execute 81 sec latency 15.768 ms oleg127-client.virtnet: 1 34896 1.72 MB/sec execute 82 sec latency 16.011 ms oleg127-client.virtnet: 1 35527 1.76 MB/sec execute 83 sec latency 15.460 ms oleg127-client.virtnet: 1 35818 1.75 MB/sec execute 84 sec latency 15.919 ms oleg127-client.virtnet: 1 36099 1.73 MB/sec execute 85 sec latency 16.870 ms oleg127-client.virtnet: 1 36435 1.72 MB/sec execute 86 sec latency 15.535 ms oleg127-client.virtnet: 1 36584 1.72 MB/sec execute 87 sec latency 587.130 ms oleg127-client.virtnet: 1 36995 1.73 MB/sec execute 88 sec latency 16.125 ms oleg127-client.virtnet: 1 37321 1.72 MB/sec execute 89 sec latency 16.007 ms oleg127-client.virtnet: 1 37520 1.70 MB/sec execute 90 sec latency 511.023 ms oleg127-client.virtnet: 1 37520 1.68 MB/sec execute 91 sec latency 1511.271 ms oleg127-client.virtnet: 1 37520 1.66 MB/sec execute 92 sec latency 2511.656 ms oleg127-client.virtnet: 1 37520 1.64 MB/sec execute 93 sec latency 3512.034 ms oleg127-client.virtnet: 1 37520 1.63 MB/sec execute 94 sec latency 4512.304 ms oleg127-client.virtnet: 1 37520 1.61 MB/sec execute 95 sec latency 5512.619 ms oleg127-client.virtnet: 1 37520 1.59 MB/sec execute 96 sec latency 6512.929 ms oleg127-client.virtnet: 1 37520 1.58 MB/sec execute 97 sec latency 7513.162 ms oleg127-client.virtnet: 1 37520 1.56 MB/sec execute 98 sec latency 8513.404 ms oleg127-client.virtnet: 1 37520 1.55 MB/sec execute 99 sec latency 9513.755 ms oleg127-client.virtnet: 1 37520 1.53 MB/sec execute 100 sec latency 10514.040 ms oleg127-client.virtnet: 1 37520 1.51 MB/sec execute 101 sec latency 11514.277 ms oleg127-client.virtnet: 1 37520 1.50 MB/sec execute 102 sec latency 12514.579 ms oleg127-client.virtnet: 1 37520 1.49 MB/sec execute 103 sec latency 13514.810 ms oleg127-client.virtnet: 1 37520 1.47 MB/sec execute 104 sec latency 14515.024 ms oleg127-client.virtnet: 1 37520 1.46 MB/sec execute 105 sec latency 15515.304 ms oleg127-client.virtnet: 1 37520 1.44 MB/sec execute 106 sec latency 16515.618 ms oleg127-client.virtnet: 1 37520 1.43 MB/sec execute 107 sec latency 17515.863 ms oleg127-client.virtnet: 1 37520 1.42 MB/sec exeoleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2384 1285304 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2052 1285636 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 37540 3568776 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 12084 3592008 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 49624 7160784 1% /mnt/lustre test_70b fail mds1 7 times Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:51:18 (1713282678) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:51:33 (1713282693) targets are mounted 11:51:33 (1713282693) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2384 1285304 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2076 1285612 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 37560 3568696 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 12064 3593324 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 49624 7162020 1% /mnt/lustre test_70b fail mds2 8 times Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 11:51:46 (1713282706) shut down cute 108 sec latency 18516.149 ms oleg127-client.virtnet: 1 37520 1.40 MB/sec execute 109 sec latency 19516.512 ms oleg127-client.virtnet: 1 37851 1.40 MB/sec execute 110 sec latency 19673.049 ms oleg127-client.virtnet: 1 38273 1.43 MB/sec execute 111 sec latency 16.021 ms oleg127-client.virtnet: 1 38878 1.45 MB/sec execute 112 sec latency 15.903 ms oleg127-client.virtnet: 1 39334 1.47 MB/sec execute 113 sec latency 15.153 ms oleg127-client.virtnet: 1 39571 1.46 MB/sec execute 114 sec latency 17.196 ms oleg127-client.virtnet: 1 39914 1.45 MB/sec execute 115 sec latency 19.365 ms oleg127-client.virtnet: 1 40235 1.45 MB/sec execute 116 sec latency 15.580 ms oleg127-client.virtnet: 1 40624 1.47 MB/sec execute 117 sec latency 15.597 ms oleg127-client.virtnet: 1 40987 1.46 MB/sec execute 118 sec latency 18.718 ms oleg127-client.virtnet: 1 41639 1.47 MB/sec execute 119 sec latency 14.411 ms oleg127-client.virtnet: 1 42358 1.51 MB/sec execute 120 sec latency 14.841 ms oleg127-client.virtnet: 1 42821 1.53 MB/sec execute 121 sec latency 15.033 ms oleg127-client.virtnet: 1 43060 1.52 MB/sec execute 122 sec latency 15.370 ms oleg127-client.virtnet: 1 43500 1.52 MB/sec execute 123 sec latency 16.031 ms oleg127-client.virtnet: 1 43835 1.51 MB/sec execute 124 sec latency 15.263 ms oleg127-client.virtnet: 1 44246 1.53 MB/sec execute 125 sec latency 16.058 ms oleg127-client.virtnet: 1 44605 1.52 MB/sec execute 126 sec latency 17.050 ms oleg127-client.virtnet: 1 45011 1.52 MB/sec execute 127 sec latency 15.000 ms oleg127-client.virtnet: 1 45570 1.55 MB/sec execute 128 sec latency 15.492 ms oleg127-client.virtnet: 1 46207 1.58 MB/sec execute 129 sec latency 14.169 ms oleg127-client.virtnet: 1 46495 1.57 MB/sec execute 130 sec latency 15.567 ms oleg127-client.virtnet: 1 46816 1.56 MB/sec execute 131 sec latency 15.034 ms oleg127-client.virtnet: 1 47126 1.56 MB/sec execute 132 sec latency 14.963 ms oleg127-client.virtnet: 1 47584 1.58 MB/sec execute 133 sec latency 14.693 ms oleg127-client.virtnet: 1 47932 1.57 MB/sec execute 134 sec latency 16.063 ms oleg127-client.virtnet: 1 48312 1.56 MB/sec execute 135 sec latency 10.972 ms oleg127-client.virtnet: 1 48707 1.56 MB/sec execute 136 sec latency 16.019 ms oleg127-client.virtnet: 1 49352 1.59 MB/sec execute 137 sec latency 16.422 ms oleg127-client.virtnet: 1 49838 1.61 MB/sec execute 138 sec latency 12.986 ms oleg127-client.virtnet: 1 50103 1.61 MB/sec execute 139 sec latency 19.822 ms oleg127-client.virtnet: 1 50433 1.60 MB/sec execute 140 sec latency 21.799 ms oleg127-client.virtnet: 1 50747 1.60 MB/sec execute 141 sec latency 16.060 ms oleg127-client.virtnet: 1 51152 1.61 MB/sec execute 142 sec latency 15.890 ms oleg127-client.virtnet: 1 51479 1.60 MB/sec execute 143 sec latency 17.104 ms oleg127-client.virtnet: 1 51662 1.59 MB/sec execute 144 sec latency 571.063 ms oleg127-client.virtnet: 1 52073 1.59 MB/sec execute 145 sec latency 11.191 ms oleg127-client.virtnet: 1 52596 1.62 MB/sec execute 146 sec latency 16.069 ms oleg127-client.virtnet: 1 52933 1.62 MB/sec execute 147 sec latency 484.711 ms oleg127-client.virtnet: 1 52933 1.61 MB/sec execute 148 sec latency 1484.967 ms oleg127-client.virtnet: 1 52933 1.60 MB/sec execute 149 sec latency 2485.357 ms oleg127-client.virtnet: 1 52933 1.58 MB/sec execute 150 sec latency 3485.644 ms oleg127-client.virtnet: 1 52933 1.57 MB/sec execute 151 sec latency 4485.935 ms oleg127-client.virtnet: 1 52933 1.56 MB/sec execute 152 sec latency 5486.188 ms oleg127-client.virtnet: 1 52933 Failover mds2 to oleg127-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 11:52:02 (1713282722) targets are mounted 11:52:02 (1713282722) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2384 1285304 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2052 1285636 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 37548 3568796 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 12064 3591872 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 49612 7160668 1% /mnt/lustre test_70b fail mds1 9 times Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:52:15 (1713282735) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:52:31 (1713282751) targets are mounted 11:52:31 (1713282751) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1.55 MB/sec execute 153 sec latency 6486.432 ms oleg127-client.virtnet: 1 52933 1.54 MB/sec execute 154 sec latency 7486.648 ms oleg127-client.virtnet: 1 52933 1.53 MB/sec execute 155 sec latency 8486.910 ms oleg127-client.virtnet: 1 52933 1.52 MB/sec execute 156 sec latency 9487.246 ms oleg127-client.virtnet: 1 52933 1.51 MB/sec execute 157 sec latency 10487.516 ms oleg127-client.virtnet: 1 52933 1.50 MB/sec execute 158 sec latency 11487.791 ms oleg127-client.virtnet: 1 52933 1.49 MB/sec execute 159 sec latency 12488.067 ms oleg127-client.virtnet: 1 52933 1.49 MB/sec execute 160 sec latency 13488.289 ms oleg127-client.virtnet: 1 52933 1.48 MB/sec execute 161 sec latency 14488.512 ms oleg127-client.virtnet: 1 52933 1.47 MB/sec execute 162 sec latency 15488.749 ms oleg127-client.virtnet: 1 52933 1.46 MB/sec execute 163 sec latency 16489.067 ms oleg127-client.virtnet: 1 52933 1.45 MB/sec execute 164 sec latency 17489.220 ms oleg127-client.virtnet: 1 52933 1.44 MB/sec execute 165 sec latency 18489.426 ms oleg127-client.virtnet: 1 52933 1.43 MB/sec execute 166 sec latency 19489.639 ms oleg127-client.virtnet: 1 53342 1.45 MB/sec execute 167 sec latency 19701.053 ms oleg127-client.virtnet: 1 53622 1.45 MB/sec execute 168 sec latency 15.648 ms oleg127-client.virtnet: 1 53889 1.44 MB/sec execute 169 sec latency 14.966 ms oleg127-client.virtnet: 1 54168 1.43 MB/sec execute 170 sec latency 20.259 ms oleg127-client.virtnet: 1 54511 1.43 MB/sec execute 171 sec latency 15.648 ms oleg127-client.virtnet: 1 54902 1.44 MB/sec execute 172 sec latency 15.009 ms oleg127-client.virtnet: 1 55302 1.44 MB/sec execute 173 sec latency 16.189 ms oleg127-client.virtnet: 1 55692 1.44 MB/sec execute 174 sec latency 14.948 ms oleg127-client.virtnet: 1 56254 1.46 MB/sec execute 175 sec latency 17.661 ms oleg127-client.virtnet: 1 57056 1.49 MB/sec execute 176 sec latency 23.550 ms oleg127-client.virtnet: 1 57673 1.48 MB/sec execute 177 sec latency 11.272 ms oleg127-client.virtnet: 1 58441 1.50 MB/sec execute 178 sec latency 8.588 ms oleg127-client.virtnet: 1 59209 1.50 MB/sec execute 179 sec latency 8.693 ms oleg127-client.virtnet: 1 60370 1.55 MB/sec execute 180 sec latency 8.043 ms oleg127-client.virtnet: 1 60796 1.55 MB/sec execute 181 sec latency 11.077 ms oleg127-client.virtnet: 1 61242 1.55 MB/sec execute 182 sec latency 15.650 ms oleg127-client.virtnet: 1 61579 1.55 MB/sec execute 183 sec latency 15.209 ms oleg127-client.virtnet: 1 61978 1.56 MB/sec execute 184 sec latency 15.254 ms oleg127-client.virtnet: 1 62340 1.55 MB/sec execute 185 sec latency 16.962 ms oleg127-client.virtnet: 1 62748 1.55 MB/sec execute 186 sec latency 16.835 ms oleg127-client.virtnet: 1 63310 1.57 MB/sec execute 187 sec latency 15.629 ms oleg127-client.virtnet: 1 63944 1.59 MB/sec execute 188 sec latency 15.330 ms oleg127-client.virtnet: 1 64234 1.59 MB/sec execute 189 sec latency 15.083 ms oleg127-client.virtnet: 1 64546 1.58 MB/sec execute 190 sec latency 15.531 ms oleg127-client.virtnet: 1 64842 1.58 MB/sec execute 191 sec latency 15.488 ms oleg127-client.virtnet: 1 65272 1.59 MB/sec execute 192 sec latency 16.170 ms oleg127-client.virtnet: 1 65584 1.58 MB/sec execute 193 sec latency 17.133 ms oleg127-client.virtnet: 1 65942 1.58 MB/sec execute 194 sec latency 12.480 ms oleg127-client.virtnet: 1 66325 1.58 MB/sec execute 195 sec latency 15.738 ms oleg127-client.virtnet: 1 66885 1.60 MB/sec execute 196 sec latency 15.549 ms oleg127-client.virtnet: 1 67528 1.61 MB/sec execute 197 sec latency 15.407 ms olUUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2384 1285304 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2084 1285604 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 37608 3566480 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 12028 3594000 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 49636 7160480 1% /mnt/lustre test_70b fail mds2 10 times Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 11:52:50 (1713282770) shut down Failover mds2 to oleg127-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 11:53:05 (1713282785) targets are mounted 11:53:05 (1713282785) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2384 1285304 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2052 1285636 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 37596 3566668 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 12028 3594828 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 49624 7161496 1% /mnt/lustre test_70b fail mds1 11 times Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:53:18 (1713282798) shut down eg127-client.virtnet: 1 67805 1.61 MB/sec execute 198 sec latency 18.060 ms oleg127-client.virtnet: 1 68121 1.61 MB/sec execute 199 sec latency 15.536 ms oleg127-client.virtnet: 1 68418 1.60 MB/sec execute 200 sec latency 16.236 ms oleg127-client.virtnet: 1 68867 1.61 MB/sec execute 201 sec latency 14.888 ms oleg127-client.virtnet: 1 69196 1.61 MB/sec execute 202 sec latency 17.121 ms oleg127-client.virtnet: 1 69585 1.60 MB/sec execute 203 sec latency 11.745 ms oleg127-client.virtnet: 1 69850 1.60 MB/sec execute 204 sec latency 337.134 ms oleg127-client.virtnet: 1 69850 1.59 MB/sec execute 205 sec latency 1337.320 ms oleg127-client.virtnet: 1 69850 1.59 MB/sec execute 206 sec latency 2337.509 ms oleg127-client.virtnet: 1 69850 1.58 MB/sec execute 207 sec latency 3337.750 ms oleg127-client.virtnet: 1 69850 1.57 MB/sec execute 208 sec latency 4337.956 ms oleg127-client.virtnet: 1 69850 1.56 MB/sec execute 209 sec latency 5338.177 ms oleg127-client.virtnet: 1 69850 1.56 MB/sec execute 210 sec latency 6338.404 ms oleg127-client.virtnet: 1 69850 1.55 MB/sec execute 211 sec latency 7338.632 ms oleg127-client.virtnet: 1 69850 1.54 MB/sec execute 212 sec latency 8338.821 ms oleg127-client.virtnet: 1 69850 1.53 MB/sec execute 213 sec latency 9339.041 ms oleg127-client.virtnet: 1 69850 1.53 MB/sec execute 214 sec latency 10339.322 ms oleg127-client.virtnet: 1 69850 1.52 MB/sec execute 215 sec latency 11339.609 ms oleg127-client.virtnet: 1 69850 1.51 MB/sec execute 216 sec latency 12339.955 ms oleg127-client.virtnet: 1 69850 1.51 MB/sec execute 217 sec latency 13340.198 ms oleg127-client.virtnet: 1 69850 1.50 MB/sec execute 218 sec latency 14340.481 ms oleg127-client.virtnet: 1 69850 1.49 MB/sec execute 219 sec latency 15340.631 ms oleg127-client.virtnet: 1 69850 1.48 MB/sec execute 220 sec latency 16340.830 ms oleg127-client.virtnet: 1 69850 1.48 MB/sec execute 221 sec latency 17341.018 ms oleg127-client.virtnet: 1 69850 1.47 MB/sec execute 222 sec latency 18341.237 ms oleg127-client.virtnet: 1 69850 1.46 MB/sec execute 223 sec latency 19341.465 ms oleg127-client.virtnet: 1 69850 1.46 MB/sec execute 224 sec latency 20341.598 ms oleg127-client.virtnet: 1 69850 1.45 MB/sec execute 225 sec latency 21341.774 ms oleg127-client.virtnet: 1 69850 1.45 MB/sec execute 226 sec latency 22341.977 ms oleg127-client.virtnet: 1 69850 1.44 MB/sec execute 227 sec latency 23342.174 ms oleg127-client.virtnet: 1 69850 1.43 MB/sec execute 228 sec latency 24342.361 ms oleg127-client.virtnet: 1 69850 1.43 MB/sec execute 229 sec latency 25342.553 ms oleg127-client.virtnet: 1 70213 1.44 MB/sec execute 230 sec latency 25699.135 ms oleg127-client.virtnet: 1 70923 1.46 MB/sec execute 231 sec latency 20.422 ms oleg127-client.virtnet: 1 71234 1.46 MB/sec execute 232 sec latency 15.876 ms oleg127-client.virtnet: 1 71557 1.45 MB/sec execute 233 sec latency 15.201 ms oleg127-client.virtnet: 1 71875 1.45 MB/sec execute 234 sec latency 21.986 ms oleg127-client.virtnet: 1 72213 1.45 MB/sec execute 235 sec latency 14.899 ms oleg127-client.virtnet: 1 72622 1.46 MB/sec execute 236 sec latency 16.057 ms oleg127-client.virtnet: 1 72982 1.45 MB/sec execute 237 sec latency 16.691 ms oleg127-client.virtnet: 1 73387 1.45 MB/sec execute 238 sec latency 15.279 ms oleg127-client.virtnet: 1 73925 1.47 MB/sec execute 239 sec latency 14.956 ms oleg127-client.virtnet: 1 74571 1.48 MB/sec execute 240 sec latency 15.478 ms oleg127-client.virtnet: 1 74869 1.48 MB/sec execute 241 sec latency 15.064 ms oleg127-client.virtnet: Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:53:33 (1713282813) targets are mounted 11:53:33 (1713282813) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1 75164 1.48 MB/sec execute 242 sec latency 15.415 ms oleg127-client.virtnet: 1 75475 1.48 MB/sec execute 243 sec latency 23.092 ms oleg127-client.virtnet: 1 75824 1.48 MB/sec execute 244 sec latency 16.027 ms oleg127-client.virtnet: 1 76234 1.48 MB/sec execute 245 sec latency 15.444 ms oleg127-client.virtnet: 1 76597 1.48 MB/sec execute 246 sec latency 17.258 ms oleg127-client.virtnet: 1 76990 1.48 MB/sec execute 247 sec latency 15.189 ms oleg127-client.virtnet: 1 77554 1.49 MB/sec execute 248 sec latency 15.151 ms oleg127-client.virtnet: 1 78183 1.51 MB/sec execute 249 sec latency 16.016 ms oleg127-client.virtnet: 1 78469 1.51 MB/sec execute 250 sec latency 15.857 ms oleg127-client.virtnet: 1 78777 1.50 MB/sec execute 251 sec latency 17.729 ms oleg127-client.virtnet: 1 79076 1.50 MB/sec execute 252 sec latency 16.052 ms oleg127-client.virtnet: 1 79567 1.51 MB/sec execute 253 sec latency 14.823 ms oleg127-client.virtnet: 1 80017 1.51 MB/sec execute 254 sec latency 11.388 ms oleg127-client.virtnet: 1 80506 1.51 MB/sec execute 255 sec latency 11.881 ms oleg127-client.virtnet: 1 81254 1.52 MB/sec execute 256 sec latency 12.921 ms oleg127-client.virtnet: 1 81834 1.54 MB/sec execute 257 sec latency 11.889 ms oleg127-client.virtnet: 1 82159 1.53 MB/sec execute 258 sec latency 13.443 ms oleg127-client.virtnet: 1 82775 1.54 MB/sec execute 259 sec latency 11.233 ms oleg127-client.virtnet: 1 83473 1.54 MB/sec execute 260 sec latency 8.920 ms oleg127-client.virtnet: 1 84507 1.56 MB/sec execute 261 sec latency 7.459 ms oleg127-client.virtnet: 1 85493 1.58 MB/sec execute 262 sec latency 7.415 ms oleg127-client.virtnet: 1 86141 1.58 MB/sec execute 263 sec latency 8.976 ms oleg127-client.virtnet: 1 86953 1.59 MB/sec execute 264 sec latency 8.280 ms oleg127-client.virtnet: 1 87784 1.60 MB/sec execute 265 sec latency 7.153 ms oleg127-client.virtnet: 1 88995 1.63 MB/sec execute 266 sec latency 7.493 ms oleg127-client.virtnet: 1 89638 1.63 MB/sec execute 267 sec latency 9.895 ms oleg127-client.virtnet: 1 90443 1.64 MB/sec execute 268 sec latency 7.595 ms oleg127-client.virtnet: 1 91292 1.64 MB/sec execute 269 sec latency 7.474 ms oleg127-client.virtnet: 1 92503 1.68 MB/sec execute 270 sec latency 7.477 ms oleg127-client.virtnet: 1 93101 1.67 MB/sec execute 271 sec latency 9.322 ms oleg127-client.virtnet: 1 93907 1.69 MB/sec execute 272 sec latency 7.652 ms oleg127-client.virtnet: 1 94713 1.69 MB/sec execute 273 sec latency 8.151 ms oleg127-client.virtnet: 1 95946 1.72 MB/sec execute 274 sec latency 7.583 ms oleg127-client.virtnet: 1 96566 1.72 MB/sec execute 275 sec latency 10.857 ms oleg127-client.virtnet: 1 97346 1.73 MB/sec execute 276 sec latency 7.811 ms oleg127-client.virtnet: 1 98185 1.73 MB/sec execute 277 sec latency 8.559 ms oleg127-client.virtnet: 1 99381 1.76 MB/sec execute 278 sec latency 7.461 ms oleg127-client.virtnet: 1 100026 1.76 MB/sec execute 279 sec latency 8.171 ms oleg127-client.virtnet: 1 100787 1.77 MB/sec execute 280 sec latency 11.414 ms oleg127-client.virtnet: 1 101548 1.77 MB/sec execute 281 sec latency 8.114 ms oleg127-client.virtnet: 1 102714 1.79 MB/sec execute 282 sec latency 7.647 ms oleg127-client.virtnet: 1 103400 1.80 MB/sec execute 283 sec latency 7.352 ms oleg127-client.virtnet: 1 104130 1.81 MB/sec execute 284 sec latency 9.619 ms oleg127-client.virtnet: 1 104933 1.81 MB/sec execute 285 sec latency 8.558 ms oleg127-client.virtnet: 1 105919 1.83 MB/sec execute 286 sec latency 7.547 ms oleg127-client.virtnet: 1 106854 1.85 MB/sec execute 287 sec latency 7.794 ms oleg127-client.virtnet: 1 107535 1.85 MB/sec execute 288 sec latency 9.169 ms oleg127-client.virtnet: 1 108311 1.85 MB/sec execute 289 sec latency 8.036 ms oleg127-client.virtnet: 1 109331 1.87 MB/sec execute 290 sec latency 7.448 ms oleg127-client.virtnet: 1 110309 1.89 MB/sec execute 291 sec latency 7.508 ms oleg127-client.virtnet: 1 110932 1.88 MB/sec execute 292 sec latency 7.863 ms oleg127-client.virtnet: 1 111721 1.89 MB/sec execute 293 sec latency 8.186 ms oleg127-client.virtnet: 1 112565 1.89 MB/sec execute 294 sec latency 7.555 ms oleg127-client.virtnet: 1 113781 1.92 MB/sec execute 295 sec latency 7.609 ms oleg127-client.virtnet: 1 114417 1.92 MB/sec execute 296 sec latency 9.862 ms oleg127-client.virtnet: 1 115202 1.93 MB/sec execute 297 sec latency 7.614 ms oleg127-client.virtnet: 1 116026 1.93 MB/sec execute 298 sec latency 7.878 ms oleg127-client.virtnet: 1 117246 1.96 MB/sec execute 299 sec latency 7.575 ms oleg127-client.virtnet: 1 cleanup 300 sec oleg127-client.virtnet: 0 cleanup 300 sec oleg127-client.virtnet: oleg127-client.virtnet: Operation Count AvgLat MaxLat oleg127-client.virtnet: ---------------------------------------- oleg127-client.virtnet: NTCreateX 18788 9.695 25699.119 oleg127-client.virtnet: Close 13776 1.384 5.733 oleg127-client.virtnet: Rename 796 9.122 20.405 oleg127-client.virtnet: Unlink 3820 2.349 7.073 oleg127-client.virtnet: Qpathinfo 17029 2.878 19930.472 oleg127-client.virtnet: Qfileinfo 2968 0.172 2.414 oleg127-client.virtnet: Qfsinfo 3142 0.289 26.185 oleg127-client.virtnet: Sfileinfo 1522 4.879 14.489 oleg127-client.virtnet: Find 6598 0.492 14.353 oleg127-client.virtnet: WriteX 9286 1.089 7.629 oleg127-client.virtnet: ReadX 29576 0.021 2.707 oleg127-client.virtnet: LockX 62 1.482 2.060 oleg127-client.virtnet: UnlockX 62 1.584 3.696 oleg127-client.virtnet: Flush 1315 7.388 587.108 oleg127-client.virtnet: oleg127-client.virtnet: Throughput 1.95958 MB/sec 1 clients 1 procs max_latency=25699.135 ms oleg127-client.virtnet: stopping dbench on /mnt/lustre/d70b.replay-single/oleg127-client.virtnet at Tue Apr 16 11:54:19 EDT 2024 with return code 0 oleg127-client.virtnet: clean dbench files on /mnt/lustre/d70b.replay-single/oleg127-client.virtnet oleg127-client.virtnet: /mnt/lustre/d70b.replay-single/oleg127-client.virtnet /mnt/lustre/d70b.replay-single/oleg127-client.virtnet oleg127-client.virtnet: removed directory: 'clients/client0/~dmtmp/ACCESS' oleg127-client.virtnet: removed directory: 'clients/client0/~dmtmp/COREL' oleg127-client.virtnet: removed directory: 'clients/client0/~dmtmp/EXCEL' oleg127-client.virtnet: removed directory: 'clients/client0/~dmtmp/SEED' oleg127-client.virtnet: removed directory: 'clients/client0/~dmtmp/WORD' oleg127-client.virtnet: removed directory: 'clients/client0/~dmtmp/PM' oleg127-client.virtnet: removed directory: 'clients/client0/~dmtmp/PARADOX' oleg127-client.virtnet: removed directory: 'clients/client0/~dmtmp/PWRPNT' oleg127-client.virtnet: removed directory: 'clients/client0/~dmtmp/WORDPRO' oleg127-client.virtnet: removed directory: 'clients/client0/~dmtmp' oleg127-client.virtnet: removed directory: 'clients/client0' oleg127-client.virtnet: removed directory: 'clients' oleg127-client.virtnet: removed 'client.txt' oleg127-client.virtnet: /mnt/lustre/d70b.replay-single/oleg127-client.virtnet oleg127-client.virtnet: dbench successfully finished PASS 70b (363s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70c: tar 2mdts recovery ============ 11:54:23 (1713282863) Starting client oleg127-client.virtnet: -o user_xattr,flock oleg127-server@tcp:/lustre /mnt/lustre Started clients oleg127-client.virtnet: 192.168.201.127@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Started tar 2327 striped dir -i0 -c2 -H crush2 /mnt/lustre/d70c.replay-single tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 6752 1280936 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 4500 1283188 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 16912 3577108 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 6940 3587932 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 23852 7165040 1% /mnt/lustre tar: Removing leading `/' from member names test_70c fail mds1 1 times Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 11:56:45 (1713283005) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 11:57:00 (1713283020) targets are mounted 11:57:00 (1713283020) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec tar: Removing leading `/' from hard link targets tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets tar: Removing leading `/' from member names UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 6776 1280912 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 4528 1283160 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 6204 3589872 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 18084 3574904 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 24288 7164776 1% /mnt/lustre tar: Removing leading `/' from hard link targets test_70c fail mds2 2 times Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 11:59:23 (1713283163) shut down Failover mds2 to oleg127-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 11:59:38 (1713283178) targets are mounted 11:59:38 (1713283178) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 70c (370s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70d: mkdir/rmdir striped dir 2mdts recovery ========================================================== 12:00:36 (1713283236) Starting client oleg127-client.virtnet: -o user_xattr,flock oleg127-server@tcp:/lustre /mnt/lustre Started clients oleg127-client.virtnet: 192.168.201.127@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Started 5687 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 7080 1280608 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 4120 1283568 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3604572 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3604300 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7208872 1% /mnt/lustre test_70d fail mds1 1 times Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 12:02:57 (1713283377) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 12:03:12 (1713283392) targets are mounted 12:03:12 (1713283392) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2776 1284912 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 4100 1283588 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3604572 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3604300 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7208872 1% /mnt/lustre test_70d fail mds2 2 times Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 12:05:33 (1713283533) shut down Failover mds2 to oleg127-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 12:05:47 (1713283547) targets are mounted 12:05:47 (1713283547) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec /home/green/git/lustre-release/lustre/tests/test-framework.sh: line 4697: 5687 Killed ( while true; do $LFS mkdir -i0 -c2 $DIR/$tdir/test || { echo "mkdir fails"; break; }; $LFS mkdir -i1 -c2 $DIR/$tdir/test1 || { echo "mkdir fails"; break; }; touch $DIR/$tdir/test/a || { echo "touch fails"; break; }; mkdir $DIR/$tdir/test/b || { echo "mkdir fails"; break; }; rm -rf $DIR/$tdir/test || { echo "rmdir fails"; ls -lR $DIR/$tdir; break; }; touch $DIR/$tdir/test1/a || { echo "touch fails"; break; }; mkdir $DIR/$tdir/test1/b || { echo "mkdir fails"; break; }; rm -rf $DIR/$tdir/test1 || { echo "rmdir fails"; ls -lR $DIR/$tdir/test1; break; }; done ) (wd: ~) PASS 70d (318s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70e: rename cross-MDT with random fails ========================================================== 12:05:56 (1713283556) debug=+ha Starting client oleg127-client.virtnet: -o user_xattr,flock oleg127-server@tcp:/lustre /mnt/lustre Started clients oleg127-client.virtnet: 192.168.201.127@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Started PID=25361 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3220 1284468 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2772 1284916 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3604572 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3604300 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7208872 1% /mnt/lustre test_70e fail mds1 1 times Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 12:08:10 (1713283690) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 12:08:24 (1713283704) targets are mounted 12:08:24 (1713283704) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2772 1284916 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2368 1285320 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3604572 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3604300 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7208872 1% /mnt/lustre test_70e fail mds2 2 times Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 12:10:48 (1713283848) shut down Failover mds2 to oleg127-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 12:11:02 (1713283862) targets are mounted 12:11:02 (1713283862) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 70e (313s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70f: OSS O_DIRECT recovery with 1 clients ========================================================== 12:11:11 (1713283871) mount clients oleg127-client.virtnet ... Starting client oleg127-client.virtnet: -o user_xattr,flock oleg127-server@tcp:/lustre /mnt/lustre Started clients oleg127-client.virtnet: 192.168.201.127@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2752 1284936 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2020 1285668 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 5676 3588864 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3597152 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 7232 7186016 1% /mnt/lustre ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... test_70f failing OST 1 times Failing ost1 on oleg127-server Stopping /mnt/lustre-ost1 (opts:) on oleg127-server 12:11:20 (1713283880) shut down Failover ost1 to oleg127-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Started lustre-OST0000 Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... 12:11:35 (1713283895) targets are mounted 12:11:35 (1713283895) facet_failover done ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2752 1284936 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2020 1285668 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3592960 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3597152 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7190112 1% /mnt/lustre Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... test_70f failing OST 2 times Failing ost1 on oleg127-server Stopping /mnt/lustre-ost1 (opts:) on oleg127-server 12:11:50 (1713283910) shut down Failover ost1 to oleg127-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... Started lustre-OST0000 12:12:06 (1713283926) targets are mounted 12:12:06 (1713283926) facet_failover done ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2752 1284936 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2020 1285668 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3568000 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 4628 3593032 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 6208 7161032 1% /mnt/lustre ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... test_70f failing OST 3 times Failing ost1 on oleg127-server Stopping /mnt/lustre-ost1 (opts:) on oleg127-server 12:12:20 (1713283940) shut down Failover ost1 to oleg127-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Started lustre-OST0000 12:12:37 (1713283957) targets are mounted 12:12:37 (1713283957) facet_failover done Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2752 1284936 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2020 1285668 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3592960 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3597152 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7190112 1% /mnt/lustre ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... test_70f failing OST 4 times Failing ost1 on oleg127-server Stopping /mnt/lustre-ost1 (opts:) on oleg127-server 12:12:51 (1713283971) shut down Failover ost1 to oleg127-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... Started lustre-OST0000 12:13:07 (1713283987) targets are mounted 12:13:07 (1713283987) facet_failover done ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg127-client.virtnet' ... ldlm.namespaces.MGC192.168.201.127@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff88012a48c000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff88012a48c000.lru_size=clear PASS 70f (124s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 71a: mkdir/rmdir striped dir with 2 mdts recovery ========================================================== 12:13:17 (1713283997) Starting client oleg127-client.virtnet: -o user_xattr,flock oleg127-server@tcp:/lustre /mnt/lustre Started clients oleg127-client.virtnet: 192.168.201.127@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Started 18734 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3056 1284632 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3352 1284336 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3272 1284416 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3588 1284100 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre fail mds1 mds2 1 times Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 12:15:35 (1713284135) shut down Failover mds1 to oleg127-server mount facets: mds1 Failover mds2 to oleg127-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 12:16:01 (1713284161) targets are mounted 12:16:01 (1713284161) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2600 1285088 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3040 1284648 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2708 1284980 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3220 1284468 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre fail mds2 mds1 2 times Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 12:18:34 (1713284314) shut down Failover mds1 to oleg127-server Failover mds2 to oleg127-server mount facets: mds2 mount facets: mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 12:18:49 (1713284329) targets are mounted 12:18:49 (1713284329) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec /home/green/git/lustre-release/lustre/tests/test-framework.sh: line 4697: 18734 Killed ( while true; do $LFS mkdir -i0 -c2 $DIR/$tdir/test; rmdir $DIR/$tdir/test; done ) (wd: ~) PASS 71a (349s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 73a: open(O_CREAT), unlink, replay, reconnect before open replay, close ========================================================== 12:19:09 (1713284349) multiop /mnt/lustre/f73a.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3924 1283764 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3724 1283964 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre fail_loc=0x80000302 Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 12:19:13 (1713284353) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 12:19:28 (1713284368) targets are mounted 12:19:28 (1713284368) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 73a (39s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 73b: open(O_CREAT), unlink, replay, reconnect at open_replay reply, close ========================================================== 12:19:49 (1713284389) multiop /mnt/lustre/f73b.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2432 1285256 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2024 1285664 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre fail_loc=0x80000157 Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 12:19:53 (1713284393) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 12:20:08 (1713284408) targets are mounted 12:20:08 (1713284408) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 73b (38s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 74: Ensure applications don't fail waiting for OST recovery ========================================================== 12:20:29 (1713284429) Stopping clients: oleg127-client.virtnet /mnt/lustre (opts:) Stopping client oleg127-client.virtnet /mnt/lustre opts: Stopping /mnt/lustre-ost1 (opts:) on oleg127-server Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 12:20:32 (1713284432) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 12:20:46 (1713284446) targets are mounted 12:20:46 (1713284446) facet_failover done Starting client oleg127-client.virtnet: -o user_xattr,flock oleg127-server@tcp:/lustre /mnt/lustre Started clients oleg127-client.virtnet: 192.168.201.127@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-OST0000 PASS 74 (30s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80a: DNE: create remote dir, drop update rep from MDT0, fail MDT0 ========================================================== 12:21:00 (1713284460) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2436 1285252 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2064 1285624 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 12:21:05 (1713284465) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 12:21:19 (1713284479) targets are mounted 12:21:19 (1713284479) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.05 seconds: 406.31 ops/second PASS 80a (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80b: DNE: create remote dir, drop update rep from MDT0, fail MDT1 ========================================================== 12:21:29 (1713284489) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2476 1285212 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2064 1285624 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2476 1285212 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2064 1285624 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 12:21:42 (1713284502) shut down Failover mds2 to oleg127-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 12:21:56 (1713284516) targets are mounted 12:21:56 (1713284516) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.10 seconds: 200.91 ops/second PASS 80b (35s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80c: DNE: create remote dir, drop update rep from MDT1, fail MDT[0,1] ========================================================== 12:22:06 (1713284526) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2512 1285176 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2100 1285588 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2512 1285176 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2100 1285588 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 12:22:13 (1713284533) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 12:22:28 (1713284548) targets are mounted 12:22:28 (1713284548) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 12:22:36 (1713284556) shut down Failover mds2 to oleg127-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 12:22:51 (1713284571) targets are mounted 12:22:51 (1713284571) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.10 seconds: 205.52 ops/second PASS 80c (53s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80d: DNE: create remote dir, drop update rep from MDT1, fail 2 MDTs ========================================================== 12:23:01 (1713284581) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2548 1285140 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2064 1285624 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2548 1285140 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2064 1285624 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 12:23:19 (1713284599) shut down Failover mds1 to oleg127-server mount facets: mds1 Failover mds2 to oleg127-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 12:23:39 (1713284619) targets are mounted 12:23:39 (1713284619) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.09 seconds: 216.49 ops/second PASS 80d (47s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80e: DNE: create remote dir, drop MDT1 rep, fail MDT0 ========================================================== 12:23:50 (1713284630) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2476 1285212 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2064 1285624 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 12:23:58 (1713284638) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 12:24:13 (1713284653) targets are mounted 12:24:13 (1713284653) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.10 seconds: 204.74 ops/second PASS 80e (31s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80f: DNE: create remote dir, drop MDT1 rep, fail MDT1 ========================================================== 12:24:23 (1713284663) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2476 1285212 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2064 1285624 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 12:24:28 (1713284668) shut down Failover mds2 to oleg127-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 12:24:44 (1713284684) targets are mounted 12:24:44 (1713284684) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.09 seconds: 218.45 ops/second PASS 80f (29s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80g: DNE: create remote dir, drop MDT1 rep, fail MDT0, then MDT1 ========================================================== 12:24:54 (1713284694) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2476 1285212 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2064 1285624 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2476 1285212 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2064 1285624 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 12:25:04 (1713284704) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 12:25:20 (1713284720) targets are mounted 12:25:20 (1713284720) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 12:25:27 (1713284727) shut down Failover mds2 to oleg127-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 12:25:42 (1713284742) targets are mounted 12:25:42 (1713284742) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.10 seconds: 204.50 ops/second PASS 80g (57s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80h: DNE: create remote dir, drop MDT1 rep, fail 2 MDTs ========================================================== 12:25:53 (1713284753) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2476 1285212 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2064 1285624 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2476 1285212 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2064 1285624 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 12:26:06 (1713284766) shut down Failover mds1 to oleg127-server mount facets: mds1 Failover mds2 to oleg127-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 12:26:32 (1713284792) targets are mounted 12:26:32 (1713284792) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.05 seconds: 432.00 ops/second PASS 80h (48s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81a: DNE: unlink remote dir, drop MDT0 update rep, fail MDT1 ========================================================== 12:26:43 (1713284803) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2472 1285216 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2064 1285624 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 12:26:54 (1713284814) shut down Failover mds2 to oleg127-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 12:27:09 (1713284829) targets are mounted 12:27:09 (1713284829) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81a (34s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81b: DNE: unlink remote dir, drop MDT0 update reply, fail MDT0 ========================================================== 12:27:19 (1713284839) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2508 1285180 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2068 1285620 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 12:27:24 (1713284844) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 12:27:40 (1713284860) targets are mounted 12:27:40 (1713284860) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81b (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81c: DNE: unlink remote dir, drop MDT0 update reply, fail MDT0,MDT1 ========================================================== 12:27:50 (1713284870) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2472 1285216 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2068 1285620 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2472 1285216 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2068 1285620 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 12:27:57 (1713284877) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 12:28:13 (1713284893) targets are mounted 12:28:13 (1713284893) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 12:28:20 (1713284900) shut down Failover mds2 to oleg127-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 12:28:35 (1713284915) targets are mounted 12:28:35 (1713284915) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81c (53s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81d: DNE: unlink remote dir, drop MDT0 update reply, fail 2 MDTs ========================================================== 12:28:45 (1713284925) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2508 1285180 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2072 1285616 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2508 1285180 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2072 1285616 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 12:29:01 (1713284941) shut down Failover mds1 to oleg127-server mount facets: mds1 Failover mds2 to oleg127-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 12:29:20 (1713284960) targets are mounted 12:29:20 (1713284960) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81d (43s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81e: DNE: unlink remote dir, drop MDT1 req reply, fail MDT0 ========================================================== 12:29:29 (1713284969) fail_loc=0x119 fail_loc=0 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2472 1285216 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2072 1285616 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 12:29:32 (1713284972) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 12:29:48 (1713284988) targets are mounted 12:29:48 (1713284988) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81e (57s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81f: DNE: unlink remote dir, drop MDT1 req reply, fail MDT1 ========================================================== 12:30:28 (1713285028) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2472 1285216 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2072 1285616 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 12:30:34 (1713285034) shut down Failover mds2 to oleg127-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 12:30:49 (1713285049) targets are mounted 12:30:49 (1713285049) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81f (29s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81g: DNE: unlink remote dir, drop req reply, fail M0, then M1 ========================================================== 12:30:59 (1713285059) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2472 1285216 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2072 1285616 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2472 1285216 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2072 1285616 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 12:31:07 (1713285067) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 12:31:22 (1713285082) targets are mounted 12:31:22 (1713285082) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 12:31:30 (1713285090) shut down Failover mds2 to oleg127-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 12:31:45 (1713285105) targets are mounted 12:31:45 (1713285105) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81g (54s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81h: DNE: unlink remote dir, drop request reply, fail 2 MDTs ========================================================== 12:31:55 (1713285115) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2472 1285216 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2072 1285616 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2472 1285216 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2072 1285616 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 12:32:05 (1713285125) shut down Failover mds1 to oleg127-server mount facets: mds1 Failover mds2 to oleg127-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 12:32:30 (1713285150) targets are mounted 12:32:30 (1713285150) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81h (45s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 84a: stale open during export disconnect ========================================================== 12:32:42 (1713285162) fail_loc=0x80000144 total: 1 open/close in 0.01 seconds: 136.97 ops/second pdsh@oleg127-client: oleg127-client: ssh exited with exit code 5 PASS 84a (5s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 85a: check the cancellation of unused locks during recovery(IBITS) ========================================================== 12:32:50 (1713285170) before recovery: unused locks count = 201 Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 12:32:53 (1713285173) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 12:33:07 (1713285187) targets are mounted 12:33:07 (1713285187) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec after recovery: unused locks count = 101 PASS 85a (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 85b: check the cancellation of unused locks during recovery(EXTENT) ========================================================== 12:33:17 (1713285197) before recovery: unused locks count = 100 Failing ost1 on oleg127-server Stopping /mnt/lustre-ost1 (opts:) on oleg127-server 12:33:24 (1713285204) shut down Failover ost1 to oleg127-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-OST0000 12:33:38 (1713285218) targets are mounted 12:33:38 (1713285218) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec after recovery: unused locks count = 0 PASS 85b (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 86: umount server after clear nid_stats should not hit LBUG ========================================================== 12:33:47 (1713285227) Stopping clients: oleg127-client.virtnet /mnt/lustre (opts:) Stopping client oleg127-client.virtnet /mnt/lustre opts: mdt.lustre-MDT0000.exports.clear=0 mdt.lustre-MDT0001.exports.clear=0 Stopping /mnt/lustre-mds1 (opts:) on oleg127-server Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 Starting client oleg127-client.virtnet: -o user_xattr,flock oleg127-server@tcp:/lustre /mnt/lustre Started clients oleg127-client.virtnet: 192.168.201.127@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) PASS 86 (13s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 87a: write replay ================== 12:34:01 (1713285241) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2468 1285220 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2076 1285612 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1780 3605240 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3536 7210504 1% /mnt/lustre 8+0 records in 8+0 records out 8388608 bytes (8.4 MB) copied, 0.158395 s, 53.0 MB/s Failing ost1 on oleg127-server Stopping /mnt/lustre-ost1 (opts:) on oleg127-server 12:34:05 (1713285245) shut down Failover ost1 to oleg127-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-OST0000 12:34:19 (1713285259) targets are mounted 12:34:19 (1713285259) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec 8+0 records in 8+0 records out 8388608 bytes (8.4 MB) copied, 0.0469434 s, 179 MB/s PASS 87a (23s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 87b: write replay with changed data (checksum resend) ========================================================== 12:34:25 (1713285265) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2468 1285220 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2076 1285612 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 9972 3597048 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 11728 7202312 1% /mnt/lustre 8+0 records in 8+0 records out 8388608 bytes (8.4 MB) copied, 0.10921 s, 76.8 MB/s 8+0 records in 8+0 records out 8 bytes (8 B) copied, 0.00164975 s, 4.8 kB/s Failing ost1 on oleg127-server Stopping /mnt/lustre-ost1 (opts:) on oleg127-server 12:34:30 (1713285270) shut down Failover ost1 to oleg127-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-OST0000 12:34:44 (1713285284) targets are mounted 12:34:44 (1713285284) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec 0+1 records in 0+1 records out 72 bytes (72 B) copied, 0.00258951 s, 27.8 kB/s PASS 87b (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 88: MDS should not assign same objid to different files ========================================================== 12:34:50 (1713285290) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2472 1285216 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2076 1285612 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 9976 3597044 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 11732 7202308 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2472 1285216 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2076 1285612 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 9976 3597044 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 11732 7202308 1% /mnt/lustre before test: last_id = 14977, next_id = 14948 Creating to objid 14977 on ost lustre-OST0000... total: 31 open/close in 0.09 seconds: 350.07 ops/second total: 8 open/close in 0.02 seconds: 321.21 ops/second before recovery: last_id = 15009, next_id = 14986 Stopping /mnt/lustre-mds1 (opts:) on oleg127-server Stopping /mnt/lustre-ost1 (opts:) on oleg127-server Failover mds1 to oleg127-server Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 Failover ost1 to oleg127-server Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-OST0000 after recovery: last_id = 15017, next_id = 14986 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0259067 s, 20.2 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0283274 s, 18.5 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0250181 s, 21.0 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0246883 s, 21.2 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0242193 s, 21.6 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0284703 s, 18.4 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.031904 s, 16.4 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0271947 s, 19.3 MB/s -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14948 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14949 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14950 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14951 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14952 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14953 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14954 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14955 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14956 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14957 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14958 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14959 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14960 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14961 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14962 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14963 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14964 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14965 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14966 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14967 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14968 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14969 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14970 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14971 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14972 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14973 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14974 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14975 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14976 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14977 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14978 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14979 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14980 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14981 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14982 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14983 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14984 -rw-r--r-- 1 root root 0 Apr 16 12:34 /mnt/lustre/d88.replay-single/f-14985 -rw-r--r-- 1 root root 524288 Apr 16 12:35 /mnt/lustre/d88.replay-single/f-14989 -rw-r--r-- 1 root root 524288 Apr 16 12:35 /mnt/lustre/d88.replay-single/f-14990 -rw-r--r-- 1 root root 524288 Apr 16 12:35 /mnt/lustre/d88.replay-single/f-14991 -rw-r--r-- 1 root root 524288 Apr 16 12:35 /mnt/lustre/d88.replay-single/f-14992 -rw-r--r-- 1 root root 524288 Apr 16 12:35 /mnt/lustre/d88.replay-single/f-14993 -rw-r--r-- 1 root root 524288 Apr 16 12:35 /mnt/lustre/d88.replay-single/f-14994 -rw-r--r-- 1 root root 524288 Apr 16 12:35 /mnt/lustre/d88.replay-single/f-14995 -rw-r--r-- 1 root root 524288 Apr 16 12:35 /mnt/lustre/d88.replay-single/f-14996 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0390851 s, 13.4 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0268075 s, 19.6 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0269287 s, 19.5 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0259446 s, 20.2 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0327881 s, 16.0 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0266332 s, 19.7 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0251204 s, 20.9 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0230576 s, 22.7 MB/s PASS 88 (52s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 89: no disk space leak on late ost connection ========================================================== 12:35:44 (1713285344) Waiting for orphan cleanup... osp.lustre-OST0000-osc-MDT0000.old_sync_processed osp.lustre-OST0000-osc-MDT0001.old_sync_processed osp.lustre-OST0001-osc-MDT0000.old_sync_processed osp.lustre-OST0001-osc-MDT0001.old_sync_processed wait 40 secs maximumly for oleg127-server mds-ost sync done. Waiting for MDT destroys to complete 10+0 records in 10+0 records out 10485760 bytes (10 MB) copied, 0.0943971 s, 111 MB/s Stopping /mnt/lustre-ost1 (opts:) on oleg127-server Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 12:35:51 (1713285351) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 12:36:05 (1713285365) targets are mounted 12:36:05 (1713285365) facet_failover done Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-OST0000 Starting client: oleg127-client.virtnet: -o user_xattr,flock oleg127-server@tcp:/lustre /mnt/lustre osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 68 sec Waiting for orphan cleanup... osp.lustre-OST0000-osc-MDT0000.old_sync_processed osp.lustre-OST0000-osc-MDT0001.old_sync_processed osp.lustre-OST0001-osc-MDT0000.old_sync_processed osp.lustre-OST0001-osc-MDT0001.old_sync_processed wait 40 secs maximumly for oleg127-server mds-ost sync done. Waiting for MDT destroys to complete free_before: 7646308 free_after: 7646308 PASS 89 (111s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 90: lfs find identifies the missing striped file segments ========================================================== 12:37:37 (1713285457) Create the files Fail ost2 lustre-OST0001_UUID, display the list of affected files Stopping /mnt/lustre-ost2 (opts:) on oleg127-server General Query: lfs find /mnt/lustre/d90.replay-single /mnt/lustre/d90.replay-single /mnt/lustre/d90.replay-single/f1 /mnt/lustre/d90.replay-single/all /mnt/lustre/d90.replay-single/f0 Querying files on shutdown ost2: lfs find --obd lustre-OST0001_UUID /mnt/lustre/d90.replay-single/f1 /mnt/lustre/d90.replay-single/all Check getstripe: /home/green/git/lustre-release/lustre/utils/lfs getstripe -r --obd lustre-OST0001_UUID /mnt/lustre/d90.replay-single/f1 lmm_stripe_count: 1 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 1 obdidx objid objid group 1 14947 0x3a63 0x2c0000401 * /mnt/lustre/d90.replay-single/all lmm_stripe_count: 2 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 1 obdidx objid objid group 1 14946 0x3a62 0x2c0000401 * /mnt/lustre/d90.replay-single/all /mnt/lustre/d90.replay-single/f1 Failover ost2 to oleg127-server Starting ost2: -o localrecov /dev/mapper/ost2_flakey /mnt/lustre-ost2 seq.cli-lustre-OST0001-super.width=65536 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-OST0001 PASS 90 (21s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 93a: replay + reconnect ============ 12:38:00 (1713285480) 1+0 records in 1+0 records out 1024 bytes (1.0 kB) copied, 0.0028992 s, 353 kB/s fail_val=40 fail_loc=0x715 Failing ost1 on oleg127-server Stopping /mnt/lustre-ost1 (opts:) on oleg127-server 12:38:03 (1713285483) shut down Failover ost1 to oleg127-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-OST0000 12:38:16 (1713285496) targets are mounted 12:38:16 (1713285496) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec PASS 93a (60s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 93b: replay + reconnect on mds ===== 12:39:03 (1713285543) total: 20 open/close in 0.14 seconds: 140.00 ops/second fail_val=80 fail_loc=0x715 Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 12:39:05 (1713285545) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 12:39:19 (1713285559) targets are mounted 12:39:19 (1713285559) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 93b (105s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 100a: DNE: create striped dir, drop update rep from MDT1, fail MDT1 ========================================================== 12:40:49 (1713285649) fail_loc=0x1701 Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 12:40:52 (1713285652) shut down Failover mds2 to oleg127-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 12:41:06 (1713285666) targets are mounted 12:41:06 (1713285666) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec lmv_stripe_count: 2 lmv_stripe_offset: 0 lmv_hash_type: crush mdtidx FID[seq:oid:ver] 0 [0x200033850:0x1:0x0] 1 [0x24000b3ca:0x1:0x0] total: 20 open/close in 0.10 seconds: 199.19 ops/second PASS 100a (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 100b: DNE: create striped dir, fail MDT0 ========================================================== 12:41:16 (1713285676) fail_loc=0x119 Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 12:41:18 (1713285678) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 12:41:32 (1713285692) targets are mounted 12:41:32 (1713285692) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec lmv_stripe_count: 2 lmv_stripe_offset: 0 lmv_hash_type: crush mdtidx FID[seq:oid:ver] 0 [0x200033850:0x4:0x0] 1 [0x24000b3ca:0x4:0x0] total: 20 open/close in 0.10 seconds: 203.37 ops/second PASS 100b (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 100c: DNE: create striped dir, abort_recov_mdt mds2 ========================================================== 12:41:42 (1713285702) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2564 1285124 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2152 1285536 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Stopping /mnt/lustre-mds2 (opts:) on oleg127-server Failover mds2 to oleg127-server Starting mds2: -o localrecov -o abort_recov_mdt /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 total: 20 open/close in 0.11 seconds: 184.17 ops/second Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 12:42:05 (1713285725) shut down Failover mds2 to oleg127-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 12:42:19 (1713285739) targets are mounted 12:42:19 (1713285739) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec lmv_stripe_count: 2 lmv_stripe_offset: 1 lmv_hash_type: crush mdtidx FID[seq:oid:ver] 1 [0x24000bb98:0x3:0x0] 0 [0x200033851:0x3:0x0] total: 20 open/close in 0.09 seconds: 215.10 ops/second PASS 100c (45s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 100d: DNE: cancel update logs upon recovery abort ========================================================== 12:42:29 (1713285749) striped dir -i0 -c2 -H all_char /mnt/lustre/d100d.replay-single total: 100 mkdir in 0.47 seconds: 210.56 ops/second lustre-MDT0001-osd [catalog]: [0x24000c368:0x3:0x0] [index]: 00001 [logid]: [0x24000cb38:0x2:0x0] lustre-MDT0000-osp-MDT0001 [catalog]: [0x200034021:0x3:0x0] [index]: 00001 [logid]: [0x200034022:0x2:0x0] [index]: 00002 [logid]: [0x200034022:0x3:0x0] Stopping /mnt/lustre-mds2 (opts:) on oleg127-server Failover mds2 to oleg127-server Starting mds2: -o localrecov -o abort_recovery /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 find: '/mnt/lustre/d100d.replay-single': No such file or directory find: '/mnt/lustre/d100d.replay-single': No such file or directory PASS 100d (22s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 100e: DNE: create striped dir on MDT0 and MDT1, fail MDT0, MDT1 ========================================================== 12:42:54 (1713285774) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2984 1284704 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2532 1285156 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2984 1284704 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2532 1285156 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre lmv_stripe_count: 2 lmv_stripe_offset: 0 lmv_hash_type: crush mdtidx FID[seq:oid:ver] 0 [0x200034020:0x36:0x0] 1 [0x24000bb99:0x36:0x0] Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 12:43:03 (1713285783) shut down Failover mds1 to oleg127-server mount facets: mds1 Failover mds2 to oleg127-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 12:43:29 (1713285809) targets are mounted 12:43:29 (1713285809) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec lmv_stripe_count: 2 lmv_stripe_offset: 0 lmv_hash_type: crush mdtidx FID[seq:oid:ver] 0 [0x200034020:0x36:0x0] 1 [0x24000bb99:0x36:0x0] PASS 100e (44s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 101: Shouldn't reassign precreated objs to other files after recovery ========================================================== 12:43:40 (1713285820) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3028 1284660 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2568 1285120 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Stopping /mnt/lustre-mds1 (opts:) on oleg127-server Failover mds1 to oleg127-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 PASS 101 (49s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 102a: check resend (request lost) with multiple modify RPCs in flight ========================================================== 12:44:31 (1713285871) creating 7 files ... fail_loc=0x159 launch 7 chmod in parallel (12:44:33) ... fail_loc=0 done (12:44:49) /mnt/lustre/d102a.replay-single/file-1 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-2 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-3 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-4 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-5 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-6 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-7 has perms 0600 OK PASS 102a (19s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 102b: check resend (reply lost) with multiple modify RPCs in flight ========================================================== 12:44:53 (1713285893) creating 7 files ... fail_loc=0x15a launch 7 chmod in parallel (12:44:54) ... fail_loc=0 done (12:45:10) /mnt/lustre/d102b.replay-single/file-1 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-2 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-3 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-4 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-5 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-6 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-7 has perms 0600 OK PASS 102b (19s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 102c: check replay w/o reconstruction with multiple mod RPCs in flight ========================================================== 12:45:14 (1713285914) creating 7 files ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3192 1284496 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2640 1285048 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3580644 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3597100 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7177744 1% /mnt/lustre fail_loc=0x15a launch 7 chmod in parallel (12:45:18) ... fail_loc=0 Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 12:45:20 (1713285920) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 12:45:35 (1713285935) targets are mounted 12:45:35 (1713285935) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec done (12:45:41) /mnt/lustre/d102c.replay-single/file-1 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-2 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-3 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-4 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-5 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-6 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-7 has perms 0600 OK PASS 102c (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 102d: check replay & reconstruction with multiple mod RPCs in flight ========================================================== 12:45:45 (1713285945) creating 7 files ... fail_loc=0x15a launch 7 chmod in parallel (12:45:45) ... fail_loc=0 Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 12:45:48 (1713285948) shut down Failover mds2 to oleg127-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 12:46:02 (1713285962) targets are mounted 12:46:02 (1713285962) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec done (12:46:08) /mnt/lustre/d102d.replay-single/file-1 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-2 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-3 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-4 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-5 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-6 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-7 has perms 0600 OK PASS 102d (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 103: Check otr_next_id overflow ==== 12:46:12 (1713285972) fail_loc=0x80000162 total: 30 open/close in 0.11 seconds: 265.32 ops/second Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 12:46:14 (1713285974) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 12:46:28 (1713285988) targets are mounted 12:46:28 (1713285988) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 103 (23s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110a: DNE: create striped dir, fail MDT1 ========================================================== 12:46:38 (1713285998) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3144 1284544 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2644 1285044 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3580644 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3597100 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7177744 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 12:46:43 (1713286003) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 12:46:57 (1713286017) targets are mounted 12:46:57 (1713286017) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d110a.replay-single/striped_dir has type dir OK PASS 110a (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110b: DNE: create striped dir, fail MDT1 and client ========================================================== 12:47:06 (1713286026) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3148 1284540 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2648 1285040 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3580644 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3597100 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7177744 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 12:47:09 (1713286029) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 12:47:22 (1713286042) targets are mounted 12:47:22 (1713286042) facet_failover done pdsh@oleg127-client: oleg127-client: ssh exited with exit code 95 oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid pdsh@oleg127-client: oleg127-client: ssh exited with exit code 95 Starting client: oleg127-client.virtnet: -o user_xattr,flock oleg127-server@tcp:/lustre /mnt/lustre /mnt/lustre/d110b.replay-single/striped_dir has type dir OK PASS 110b (91s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110c: DNE: create striped dir, fail MDT2 ========================================================== 12:48:38 (1713286118) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3152 1284536 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2648 1285040 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 12:48:41 (1713286121) shut down Failover mds2 to oleg127-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 12:48:55 (1713286135) targets are mounted 12:48:55 (1713286135) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d110c.replay-single/striped_dir has type dir OK PASS 110c (23s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110d: DNE: create striped dir, fail MDT2 and client ========================================================== 12:49:03 (1713286143) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3192 1284496 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2648 1285040 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 12:49:06 (1713286146) shut down Failover mds2 to oleg127-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 12:49:19 (1713286159) targets are mounted 12:49:19 (1713286159) facet_failover done pdsh@oleg127-client: oleg127-client: ssh exited with exit code 95 oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid pdsh@oleg127-client: oleg127-client: ssh exited with exit code 95 Starting client: oleg127-client.virtnet: -o user_xattr,flock oleg127-server@tcp:/lustre /mnt/lustre /mnt/lustre/d110d.replay-single/striped_dir has type dir OK PASS 110d (91s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110e: DNE: create striped dir, uncommit on MDT2, fail client/MDT1/MDT2 ========================================================== 12:50:36 (1713286236) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3232 1284456 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2680 1285008 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre df:/mnt/lustre Not a Lustre filesystem Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 12:50:45 (1713286245) shut down Failover mds1 to oleg127-server mount facets: mds1 Failover mds2 to oleg127-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 12:51:11 (1713286271) targets are mounted 12:51:11 (1713286271) facet_failover done pdsh@oleg127-client: oleg127-client: ssh exited with exit code 95 oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid pdsh@oleg127-client: oleg127-client: ssh exited with exit code 95 Starting client: oleg127-client.virtnet: -o user_xattr,flock oleg127-server@tcp:/lustre /mnt/lustre /mnt/lustre/d110e.replay-single/striped_dir has type dir OK PASS 110e (111s) debug_raw_pointers=0 debug_raw_pointers=0 SKIP: replay-single test_110f skipping excluded test 110f debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110g: DNE: create striped dir, uncommit on MDT1, fail client/MDT1/MDT2 ========================================================== 12:52:29 (1713286349) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3272 1284416 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2716 1284972 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre df:/mnt/lustre Not a Lustre filesystem Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 12:52:44 (1713286364) shut down Failover mds1 to oleg127-server mount facets: mds1 Failover mds2 to oleg127-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 12:53:04 (1713286384) targets are mounted 12:53:04 (1713286384) facet_failover done pdsh@oleg127-client: oleg127-client: ssh exited with exit code 95 oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid pdsh@oleg127-client: oleg127-client: ssh exited with exit code 95 Starting client: oleg127-client.virtnet: -o user_xattr,flock oleg127-server@tcp:/lustre /mnt/lustre /mnt/lustre/d110g.replay-single/striped_dir has type dir OK PASS 110g (112s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111a: DNE: unlink striped dir, fail MDT1 ========================================================== 12:54:23 (1713286463) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3304 1284384 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2652 1285036 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 12:54:26 (1713286466) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 12:54:39 (1713286479) targets are mounted 12:54:39 (1713286479) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/d111a.replay-single/striped_dir: No such file or directory PASS 111a (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111b: DNE: unlink striped dir, fail MDT2 ========================================================== 12:54:48 (1713286488) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3160 1284528 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2660 1285028 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 12:54:52 (1713286492) shut down Failover mds2 to oleg127-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 12:55:05 (1713286505) targets are mounted 12:55:05 (1713286505) facet_failover done pdsh@oleg127-client: oleg127-client: ssh exited with exit code 95 oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid pdsh@oleg127-client: oleg127-client: ssh exited with exit code 95 Starting client: oleg127-client.virtnet: -o user_xattr,flock oleg127-server@tcp:/lustre /mnt/lustre Can't lstat /mnt/lustre/d111b.replay-single/striped_dir: No such file or directory PASS 111b (92s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111c: DNE: unlink striped dir, uncommit on MDT1, fail client/MDT1/MDT2 ========================================================== 12:56:22 (1713286582) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3204 1284484 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2656 1285032 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre df:/mnt/lustre Not a Lustre filesystem Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 12:56:37 (1713286597) shut down Failover mds1 to oleg127-server mount facets: mds1 Failover mds2 to oleg127-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 12:56:57 (1713286617) targets are mounted 12:56:57 (1713286617) facet_failover done pdsh@oleg127-client: oleg127-client: ssh exited with exit code 95 oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid pdsh@oleg127-client: oleg127-client: ssh exited with exit code 95 Starting client: oleg127-client.virtnet: -o user_xattr,flock oleg127-server@tcp:/lustre /mnt/lustre Can't lstat /mnt/lustre/d111c.replay-single/striped_dir: No such file or directory PASS 111c (111s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111d: DNE: unlink striped dir, uncommit on MDT2, fail client/MDT1/MDT2 ========================================================== 12:58:14 (1713286694) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3232 1284456 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2656 1285032 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre df:/mnt/lustre Not a Lustre filesystem Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 12:58:22 (1713286702) shut down Failover mds1 to oleg127-server mount facets: mds1 Failover mds2 to oleg127-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 12:58:47 (1713286727) targets are mounted 12:58:47 (1713286727) facet_failover done pdsh@oleg127-client: oleg127-client: ssh exited with exit code 95 oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid pdsh@oleg127-client: oleg127-client: ssh exited with exit code 95 Starting client: oleg127-client.virtnet: -o user_xattr,flock oleg127-server@tcp:/lustre /mnt/lustre Can't lstat /mnt/lustre/d111d.replay-single/striped_dir: No such file or directory PASS 111d (108s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111e: DNE: unlink striped dir, uncommit on MDT2, fail MDT1/MDT2 ========================================================== 13:00:03 (1713286803) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3160 1284528 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2692 1284996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3200 1284488 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2692 1284996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 13:00:09 (1713286809) shut down Failover mds1 to oleg127-server mount facets: mds1 Failover mds2 to oleg127-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 13:00:35 (1713286835) targets are mounted 13:00:35 (1713286835) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/d111e.replay-single/striped_dir: No such file or directory PASS 111e (41s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111f: DNE: unlink striped dir, uncommit on MDT1, fail MDT1/MDT2 ========================================================== 13:00:45 (1713286845) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3168 1284520 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2664 1285024 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3168 1284520 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2652 1285036 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 13:00:53 (1713286853) shut down Failover mds1 to oleg127-server mount facets: mds1 Failover mds2 to oleg127-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 13:01:19 (1713286879) targets are mounted 13:01:19 (1713286879) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/d111f.replay-single/striped_dir: No such file or directory PASS 111f (50s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111g: DNE: unlink striped dir, fail MDT1/MDT2 ========================================================== 13:01:37 (1713286897) UUID Inodes IUsed IFree IUse% Mounted on lustre-MDT0000_UUID 1024000 554 1023446 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1024000 320 1023680 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 262144 555 261589 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 262144 489 261655 1% /mnt/lustre[OST:1] filesystem_summary: 524118 874 523244 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3176 1284512 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2660 1285028 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3176 1284512 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2660 1285028 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 13:01:45 (1713286905) shut down Failover mds1 to oleg127-server mount facets: mds1 Failover mds2 to oleg127-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 13:02:11 (1713286931) targets are mounted 13:02:11 (1713286931) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/d111g.replay-single/striped_dir: No such file or directory PASS 111g (43s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112a: DNE: cross MDT rename, fail MDT1 ========================================================== 13:02:22 (1713286942) SKIP: replay-single test_112a needs >= 4 MDTs SKIP 112a (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112b: DNE: cross MDT rename, fail MDT2 ========================================================== 13:02:26 (1713286946) SKIP: replay-single test_112b needs >= 4 MDTs SKIP 112b (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112c: DNE: cross MDT rename, fail MDT3 ========================================================== 13:02:29 (1713286949) SKIP: replay-single test_112c needs >= 4 MDTs SKIP 112c (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112d: DNE: cross MDT rename, fail MDT4 ========================================================== 13:02:32 (1713286952) SKIP: replay-single test_112d needs >= 4 MDTs SKIP 112d (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112e: DNE: cross MDT rename, fail MDT1 and MDT2 ========================================================== 13:02:36 (1713286956) SKIP: replay-single test_112e needs >= 4 MDTs SKIP 112e (2s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112f: DNE: cross MDT rename, fail MDT1 and MDT3 ========================================================== 13:02:39 (1713286959) SKIP: replay-single test_112f needs >= 4 MDTs SKIP 112f (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112g: DNE: cross MDT rename, fail MDT1 and MDT4 ========================================================== 13:02:42 (1713286962) SKIP: replay-single test_112g needs >= 4 MDTs SKIP 112g (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112h: DNE: cross MDT rename, fail MDT2 and MDT3 ========================================================== 13:02:45 (1713286965) SKIP: replay-single test_112h needs >= 4 MDTs SKIP 112h (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112i: DNE: cross MDT rename, fail MDT2 and MDT4 ========================================================== 13:02:49 (1713286969) SKIP: replay-single test_112i needs >= 4 MDTs SKIP 112i (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112j: DNE: cross MDT rename, fail MDT3 and MDT4 ========================================================== 13:02:52 (1713286972) SKIP: replay-single test_112j needs >= 4 MDTs SKIP 112j (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112k: DNE: cross MDT rename, fail MDT1,MDT2,MDT3 ========================================================== 13:02:55 (1713286975) SKIP: replay-single test_112k needs >= 4 MDTs SKIP 112k (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112l: DNE: cross MDT rename, fail MDT1,MDT2,MDT4 ========================================================== 13:02:59 (1713286979) SKIP: replay-single test_112l needs >= 4 MDTs SKIP 112l (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112m: DNE: cross MDT rename, fail MDT1,MDT3,MDT4 ========================================================== 13:03:02 (1713286982) SKIP: replay-single test_112m needs >= 4 MDTs SKIP 112m (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112n: DNE: cross MDT rename, fail MDT2,MDT3,MDT4 ========================================================== 13:03:05 (1713286985) SKIP: replay-single test_112n needs >= 4 MDTs SKIP 112n (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 115: failover for create/unlink striped directory ========================================================== 13:03:09 (1713286989) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3172 1284516 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2648 1285040 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre striped dir -i1 -c2 -H fnv_1a_64 /mnt/lustre/d115.replay-single/test_0 striped dir -i1 -c2 -H crush /mnt/lustre/d115.replay-single/test_1 striped dir -i1 -c2 -H crush /mnt/lustre/d115.replay-single/test_2 striped dir -i1 -c2 -H crush /mnt/lustre/d115.replay-single/test_3 striped dir -i1 -c2 -H fnv_1a_64 /mnt/lustre/d115.replay-single/test_4 Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 13:03:14 (1713286994) shut down Failover mds2 to oleg127-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 13:03:29 (1713287009) targets are mounted 13:03:29 (1713287009) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3224 1284464 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2716 1284972 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre striped dir -i0 -c2 -H crush2 /mnt/lustre/d115.replay-single/test_0 striped dir -i0 -c2 -H crush /mnt/lustre/d115.replay-single/test_1 striped dir -i0 -c2 -H all_char /mnt/lustre/d115.replay-single/test_2 striped dir -i0 -c2 -H crush2 /mnt/lustre/d115.replay-single/test_3 striped dir -i0 -c2 -H crush /mnt/lustre/d115.replay-single/test_4 Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 13:03:40 (1713287020) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 13:03:55 (1713287035) targets are mounted 13:03:55 (1713287035) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 115 (54s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 116a: large update log master MDT recovery ========================================================== 13:04:05 (1713287045) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3264 1284424 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2780 1284908 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre fail_loc=0x80001702 Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 13:04:10 (1713287050) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 13:04:25 (1713287065) targets are mounted 13:04:25 (1713287065) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d116a.replay-single/striped_dir has type dir OK PASS 116a (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 116b: large update log slave MDT recovery ========================================================== 13:04:35 (1713287075) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3316 1284372 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2952 1284736 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre fail_loc=0x80001702 Failing mds2 on oleg127-server Stopping /mnt/lustre-mds2 (opts:) on oleg127-server 13:04:40 (1713287080) shut down Failover mds2 to oleg127-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0001 13:04:56 (1713287096) targets are mounted 13:04:56 (1713287096) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d116b.replay-single/striped_dir has type dir OK PASS 116b (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 117: DNE: cross MDT unlink, fail MDT1 and MDT2 ========================================================== 13:05:05 (1713287105) SKIP: replay-single test_117 needs >= 4 MDTs SKIP 117 (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 118: invalidate osp update will not cause update log corruption ========================================================== 13:05:09 (1713287109) fail_loc=0x1705 lfs setdirstripe: dirstripe error on '/mnt/lustre/d118.replay-single/striped_dir': Input/output error lfs setdirstripe: cannot create dir '/mnt/lustre/d118.replay-single/striped_dir': Input/output error lfs setdirstripe: dirstripe error on '/mnt/lustre/d118.replay-single/striped_dir1': Input/output error lfs setdirstripe: cannot create dir '/mnt/lustre/d118.replay-single/striped_dir1': Input/output error fail_loc=0x0 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3412 1284276 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2832 1284856 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 13:05:14 (1713287114) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 13:05:29 (1713287129) targets are mounted 13:05:29 (1713287129) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 118 (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 119: timeout of normal replay does not cause DNE replay fails ========================================================== 13:05:39 (1713287139) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3480 1284208 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2892 1284796 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Stopping /mnt/lustre-mds1 (opts:) on oleg127-server Failover mds1 to oleg127-server fail_loc=0x80000714 fail_val=65 Starting mds1: -o localrecov -o recovery_time_hard=60 /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 oleg127-client.virtnet: executing wait_import_state_mount FULL mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 72 sec mdt.lustre-MDT0000.recovery_time_hard=180 PASS 119 (90s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 120: DNE fail abort should stop both normal and DNE replay ========================================================== 13:07:12 (1713287232) Replay barrier on lustre-MDT0000 Stopping /mnt/lustre-mds1 (opts:) on oleg127-server Failover mds1 to oleg127-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 find: '/mnt/lustre/d120.replay-single': No such file or directory PASS 120 (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 121: lock replay timed out and race ========================================================== 13:07:41 (1713287261) multiop /mnt/lustre/f121.replay-single vs_s TMPPIPE=/tmp/multiop_open_wait_pipe.7517 Stopping /mnt/lustre-mds1 (opts:) on oleg127-server Failover mds1 to oleg127-server fail_loc=0x721 fail_val=0 at_max=0 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 pdsh@oleg127-client: oleg127-client: ssh exited with exit code 5 fail_loc=0x0 at_max=600 PASS 121 (200s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 130a: DoM file create (setstripe) replay ========================================================== 13:11:03 (1713287463) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3872 1283816 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3192 1284496 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 13:11:08 (1713287468) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 13:11:23 (1713287483) targets are mounted 13:11:23 (1713287483) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 130a (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 130b: DoM file create (inherited) replay ========================================================== 13:11:33 (1713287493) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3876 1283812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3192 1284496 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 13:11:38 (1713287498) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 13:11:53 (1713287513) targets are mounted 13:11:53 (1713287513) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 130b (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 131a: DoM file write lock replay === 13:12:03 (1713287523) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3876 1283812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3192 1284496 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre 1+0 records in 1+0 records out 8 bytes (8 B) copied, 0.00253882 s, 3.2 kB/s Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 13:12:08 (1713287528) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 13:12:23 (1713287543) targets are mounted 13:12:23 (1713287543) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 131a (28s) debug_raw_pointers=0 debug_raw_pointers=0 SKIP: replay-single test_131b skipping excluded test 131b debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 132a: PFL new component instantiate replay ========================================================== 13:12:33 (1713287553) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3888 1283800 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3192 1284496 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre 1+0 records in 1+0 records out 1048576 bytes (1.0 MB) copied, 0.0282909 s, 37.1 MB/s /mnt/lustre/f132a.replay-single lcm_layout_gen: 3 lcm_mirror_count: 1 lcm_entry_count: 2 lcme_id: 1 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 0 lcme_extent.e_end: 1048576 lmm_stripe_count: 1 lmm_stripe_size: 1048576 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 0 lmm_objects: - 0: { l_ost_idx: 0, l_fid: [0x280000401:0x41ca:0x0] } lcme_id: 2 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 1048576 lcme_extent.e_end: EOF lmm_stripe_count: 2 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 1 lmm_objects: - 0: { l_ost_idx: 1, l_fid: [0x2c0000401:0x4182:0x0] } - 1: { l_ost_idx: 0, l_fid: [0x280000401:0x41cb:0x0] } Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 13:12:38 (1713287558) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 13:12:54 (1713287574) targets are mounted 13:12:54 (1713287574) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f132a.replay-single lcm_layout_gen: 1 lcm_mirror_count: 1 lcm_entry_count: 2 lcme_id: 1 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 0 lcme_extent.e_end: 1048576 lmm_stripe_count: 1 lmm_stripe_size: 1048576 lmm_pattern: raid0 lmm_layout_gen: 65535 lmm_stripe_offset: 0 lmm_objects: - 0: { l_ost_idx: 0, l_fid: [0x280000401:0x41ca:0x0] } lcme_id: 2 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 1048576 lcme_extent.e_end: EOF lmm_stripe_count: 2 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 65535 lmm_stripe_offset: 1 lmm_objects: - 0: { l_ost_idx: 1, l_fid: [0x2c0000401:0x4182:0x0] } - 1: { l_ost_idx: 0, l_fid: [0x280000401:0x41cb:0x0] } PASS 132a (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 133: check resend of ongoing requests for lwp during failover ========================================================== 13:13:04 (1713287584) seq.srv-lustre-MDT0001.space=clear Starting client: oleg127-client.virtnet: -o user_xattr,flock oleg127-server@tcp:/lustre /mnt/lustre fail_val=700 fail_loc=0x80000123 Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 13:13:09 (1713287589) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 13:13:23 (1713287603) targets are mounted 13:13:23 (1713287603) facet_failover done PASS 133 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 134: replay creation of a file created in a pool ========================================================== 13:13:30 (1713287610) Creating new pool oleg127-server: Pool lustre.pool_134 created Adding targets to pool oleg127-server: OST lustre-OST0001_UUID added to pool lustre.pool_134 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3972 1283716 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3236 1284452 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2796 3604224 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20968 7193072 1% /mnt/lustre Failing mds1 on oleg127-server Stopping /mnt/lustre-mds1 (opts:) on oleg127-server 13:13:41 (1713287621) shut down Failover mds1 to oleg127-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-MDT0000 13:13:57 (1713287637) targets are mounted 13:13:57 (1713287637) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Destroy the created pools: pool_134 lustre.pool_134 oleg127-server: OST lustre-OST0001_UUID removed from pool lustre.pool_134 oleg127-server: Pool lustre.pool_134 destroyed Waiting 90s for 'foo' PASS 134 (41s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 135: Server failure in lock replay phase ========================================================== 13:14:13 (1713287653) Failing ost1 on oleg127-server Stopping /mnt/lustre-ost1 (opts:) on oleg127-server 13:14:15 (1713287655) shut down Failover ost1 to oleg127-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-OST0000 13:14:30 (1713287670) targets are mounted 13:14:30 (1713287670) facet_failover done oleg127-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3952 1283736 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3236 1284452 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2796 3604224 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20968 7193072 1% /mnt/lustre ldlm.cancel_unused_locks_before_replay=0 Stopping /mnt/lustre-ost1 (opts:) on oleg127-server Failover ost1 to oleg127-server oleg127-server: oleg127-server.virtnet: executing load_module ../libcfs/libcfs/libcfs fail_loc=0x32d fail_val=20 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-OST0000 oleg127-client.virtnet: executing wait_import_state_mount REPLAY_LOCKS osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in REPLAY_LOCKS state after 0 sec Stopping /mnt/lustre-ost1 (opts:) on oleg127-server Failover ost1 to oleg127-server oleg127-server: oleg127-server.virtnet: executing load_module ../libcfs/libcfs/libcfs fail_loc=0 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 End of sync oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-OST0000 Stopping /mnt/lustre-ost1 (opts:-f) on oleg127-server Stopping /mnt/lustre-ost2 (opts:-f) on oleg127-server Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-OST0000 Starting ost2: -o localrecov /dev/mapper/ost2_flakey /mnt/lustre-ost2 seq.cli-lustre-OST0001-super.width=65536 oleg127-server: oleg127-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg127-client: oleg127-server: ssh exited with exit code 1 Started lustre-OST0001 pdsh@oleg127-client: oleg127-client: ssh exited with exit code 5 ldlm.cancel_unused_locks_before_replay=1 PASS 135 (97s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 136: MDS to disconnect all OSPs first, then cleanup ldlm ========================================================== 13:15:52 (1713287752) SKIP: replay-single test_136 needs > 2 MDTs SKIP 136 (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 200: Dropping one OBD_PING should not cause disconnect ========================================================== 13:15:55 (1713287755) SKIP: replay-single test_200 Need remote client SKIP 200 (1s) debug_raw_pointers=0 debug_raw_pointers=0 == replay-single test complete, duration 8767 sec ======== 13:15:57 (1713287757) === replay-single: start cleanup 13:15:58 (1713287758) === === replay-single: finish cleanup 13:16:02 (1713287762) ===