-----============= acceptance-small: replay-single ============----- Thu Apr 18 04:40:38 EDT 2024 excepting tests: 110f 131b 59 36 === replay-single: start setup 04:40:42 (1713429642) === oleg316-client.virtnet: executing check_config_client /mnt/lustre oleg316-client.virtnet: Checking config lustre mounted on /mnt/lustre Checking servers environments Checking clients oleg316-client.virtnet environments Using TIMEOUT=20 osc.lustre-OST0000-osc-ffff8800aad97000.idle_timeout=debug osc.lustre-OST0001-osc-ffff8800aad97000.idle_timeout=debug disable quota as required oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all osd-ldiskfs.track_declares_assert=1 === replay-single: finish setup 04:40:51 (1713429651) === debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 0a: empty replay =================== 04:40:52 (1713429652) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1772 1285916 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1612 1286076 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3048 7210992 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:40:56 (1713429656) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:41:12 (1713429672) targets are mounted 04:41:12 (1713429672) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 0a (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 0b: ensure object created after recover exists. (3284) ========================================================== 04:41:21 (1713429681) Failing ost1 on oleg316-server Stopping /mnt/lustre-ost1 (opts:) on oleg316-server 04:41:23 (1713429683) shut down Failover ost1 to oleg316-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-OST0000 04:41:38 (1713429698) targets are mounted 04:41:38 (1713429698) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec total: 20 open/close in 0.16 seconds: 128.98 ops/second - unlinked 0 (time 1713429702 ; total 0 ; last 0) total: 20 unlinks in 0 seconds: inf unlinks/second PASS 0b (23s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 0c: check replay-barrier =========== 04:41:46 (1713429706) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1768 1285920 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1612 1286076 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:41:51 (1713429711) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:42:06 (1713429726) targets are mounted 04:42:06 (1713429726) facet_failover done Starting client: oleg316-client.virtnet: -o user_xattr,flock oleg316-server@tcp:/lustre /mnt/lustre rm: cannot remove '/mnt/lustre/f0c.replay-single': No such file or directory PASS 0c (93s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 0d: expired recovery with no clients ========================================================== 04:43:21 (1713429801) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1768 1285920 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1612 1286076 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:43:25 (1713429805) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:43:40 (1713429820) targets are mounted 04:43:40 (1713429820) facet_failover done Starting client: oleg316-client.virtnet: -o user_xattr,flock oleg316-server@tcp:/lustre /mnt/lustre PASS 0d (92s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 1: simple create =================== 04:44:54 (1713429894) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1768 1285920 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1612 1286076 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:44:58 (1713429898) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:45:13 (1713429913) targets are mounted 04:45:13 (1713429913) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f1.replay-single has type file OK PASS 1 (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 2a: touch ========================== 04:45:22 (1713429922) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1768 1285920 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1612 1286076 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:45:25 (1713429925) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:45:40 (1713429940) targets are mounted 04:45:40 (1713429940) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2a.replay-single has type file OK PASS 2a (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 2b: touch ========================== 04:45:48 (1713429948) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1768 1285920 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1612 1286076 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:45:52 (1713429952) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:46:06 (1713429966) targets are mounted 04:46:06 (1713429966) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2b.replay-single has type file OK PASS 2b (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 2c: setstripe replay =============== 04:46:14 (1713429974) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1768 1285920 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1612 1286076 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:46:17 (1713429977) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:46:31 (1713429991) targets are mounted 04:46:31 (1713429991) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2c.replay-single has type file OK PASS 2c (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 2d: setdirstripe replay ============ 04:46:40 (1713430000) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1768 1285920 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1612 1286076 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:46:43 (1713430003) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:46:57 (1713430017) targets are mounted 04:46:57 (1713430017) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d2d.replay-single has type dir OK PASS 2d (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 2e: O_CREAT|O_EXCL create replay === 04:47:05 (1713430025) fail_loc=0x8000013b UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1852 1285836 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:47:10 (1713430030) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:47:24 (1713430044) targets are mounted 04:47:24 (1713430044) facet_failover done Succeed in opening file "/mnt/lustre/f2e.replay-single"(flags=O_CREAT) oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2e.replay-single has type file OK PASS 2e (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 3a: replay failed open(O_DIRECTORY) ========================================================== 04:47:33 (1713430053) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1852 1285836 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Error in opening file "/mnt/lustre/f3a.replay-single"(flags=O_DIRECTORY) 20: Not a directory Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:47:36 (1713430056) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:47:50 (1713430070) targets are mounted 04:47:50 (1713430070) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f3a.replay-single has type file OK PASS 3a (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 3b: replay failed open -ENOMEM ===== 04:47:59 (1713430079) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1852 1285836 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre fail_loc=0x80000114 touch: cannot touch '/mnt/lustre/f3b.replay-single': Cannot allocate memory fail_loc=0 Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:48:03 (1713430083) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:48:17 (1713430097) targets are mounted 04:48:17 (1713430097) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f3b.replay-single: No such file or directory PASS 3b (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 3c: replay failed open -ENOMEM ===== 04:48:25 (1713430105) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1852 1285836 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre fail_loc=0x80000128 touch: cannot touch '/mnt/lustre/f3c.replay-single': Cannot allocate memory fail_loc=0 Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:48:30 (1713430110) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:48:44 (1713430124) targets are mounted 04:48:44 (1713430124) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f3c.replay-single: No such file or directory PASS 3c (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 4a: |x| 10 open(O_CREAT)s ========== 04:48:53 (1713430133) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1852 1285836 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:48:57 (1713430137) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:49:12 (1713430152) targets are mounted 04:49:12 (1713430152) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 4a (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 4b: |x| rm 10 files ================ 04:49:21 (1713430161) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1852 1285836 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605320 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605336 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210656 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:49:24 (1713430164) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:49:39 (1713430179) targets are mounted 04:49:39 (1713430179) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f4b.replay-single-*: No such file or directory PASS 4b (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 5: |x| 220 open(O_CREAT) =========== 04:49:48 (1713430188) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1852 1285836 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:49:53 (1713430193) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:50:07 (1713430207) targets are mounted 04:50:07 (1713430207) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 5 (32s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 6a: mkdir + contained create ======= 04:50:22 (1713430222) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1884 1285804 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:50:26 (1713430226) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:50:40 (1713430240) targets are mounted 04:50:40 (1713430240) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d6a.replay-single has type dir OK /mnt/lustre/d6a.replay-single/f6a.replay-single has type file OK PASS 6a (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 6b: |X| rmdir ====================== 04:50:51 (1713430251) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1880 1285808 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:50:54 (1713430254) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:51:09 (1713430269) targets are mounted 04:51:09 (1713430269) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/d6b.replay-single: No such file or directory PASS 6b (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 7: mkdir |X| contained create ====== 04:51:17 (1713430277) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1880 1285808 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:51:21 (1713430281) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:51:35 (1713430295) targets are mounted 04:51:35 (1713430295) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d7.replay-single has type dir OK /mnt/lustre/d7.replay-single/f7.replay-single has type file OK PASS 7 (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 8: creat open |X| close ============ 04:51:44 (1713430304) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre multiop /mnt/lustre/f8.replay-single vmo_c TMPPIPE=/tmp/multiop_open_wait_pipe.7537 Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:51:48 (1713430308) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:52:01 (1713430321) targets are mounted 04:52:01 (1713430321) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f8.replay-single /mnt/lustre/f8.replay-single has type file OK PASS 8 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 9: |X| create (same inum/gen) ====== 04:52:10 (1713430330) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:52:14 (1713430334) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:52:28 (1713430348) targets are mounted 04:52:28 (1713430348) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec old_inum == 144115305935798546, new_inum == 144115305935798546 old_inum and new_inum match PASS 9 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 10: create |X| rename unlink ======= 04:52:36 (1713430356) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:52:39 (1713430359) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:52:53 (1713430373) targets are mounted 04:52:53 (1713430373) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f10.replay-single: No such file or directory PASS 10 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 11: create open write rename |X| create-old-name read ========================================================== 04:53:02 (1713430382) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre new old Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:53:05 (1713430385) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:53:19 (1713430399) targets are mounted 04:53:19 (1713430399) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec new old PASS 11 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 12: open, unlink |X| close ========= 04:53:28 (1713430408) multiop /mnt/lustre/f12.replay-single vo_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7537 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:53:31 (1713430411) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:53:45 (1713430425) targets are mounted 04:53:45 (1713430425) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 12 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 13: open chmod 0 |x| write close === 04:53:53 (1713430433) multiop /mnt/lustre/f13.replay-single vO_wc TMPPIPE=/tmp/multiop_open_wait_pipe.7537 /mnt/lustre/f13.replay-single has perms 00 OK UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:53:57 (1713430437) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:54:11 (1713430451) targets are mounted 04:54:11 (1713430451) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f13.replay-single has perms 00 OK /mnt/lustre/f13.replay-single has size 1 OK PASS 13 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 14: open(O_CREAT), unlink |X| close ========================================================== 04:54:19 (1713430459) multiop /mnt/lustre/f14.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7537 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:54:23 (1713430463) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:54:37 (1713430477) targets are mounted 04:54:37 (1713430477) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 14 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 15: open(O_CREAT), unlink |X| touch new, close ========================================================== 04:54:45 (1713430485) multiop /mnt/lustre/f15.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7537 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:54:49 (1713430489) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:55:03 (1713430503) targets are mounted 04:55:03 (1713430503) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 15 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 16: |X| open(O_CREAT), unlink, touch new, unlink new ========================================================== 04:55:11 (1713430511) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:55:14 (1713430514) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:55:28 (1713430528) targets are mounted 04:55:28 (1713430528) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 16 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 17: |X| open(O_CREAT), |replay| close ========================================================== 04:55:37 (1713430537) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre multiop /mnt/lustre/f17.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7537 Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:55:40 (1713430540) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:55:54 (1713430554) targets are mounted 04:55:54 (1713430554) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f17.replay-single has type file OK PASS 17 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 18: open(O_CREAT), unlink, touch new, close, touch, unlink ========================================================== 04:56:02 (1713430562) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre multiop /mnt/lustre/f18.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7537 pid: 24953 will close Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:56:06 (1713430566) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:56:20 (1713430580) targets are mounted 04:56:20 (1713430580) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 18 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 19: mcreate, open, write, rename === 04:56:28 (1713430588) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre old Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:56:32 (1713430592) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:56:46 (1713430606) targets are mounted 04:56:46 (1713430606) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec old PASS 19 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 20a: |X| open(O_CREAT), unlink, replay, close (test mds_cleanup_orphans) ========================================================== 04:56:55 (1713430615) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3116 7210924 1% /mnt/lustre multiop /mnt/lustre/f20a.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7537 Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:56:58 (1713430618) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:57:12 (1713430632) targets are mounted 04:57:12 (1713430632) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 20a (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 20b: write, unlink, eviction, replay (test mds_cleanup_orphans) ========================================================== 04:57:21 (1713430641) /mnt/lustre/f20b.replay-single lmm_stripe_count: 1 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 0 obdidx objid objid group 0 1090 0x442 0x280000401 dd: error writing '/mnt/lustre/f20b.replay-single': Cannot send after transport endpoint shutdown pdsh@oleg316-client: oleg316-client: ssh exited with exit code 5 9093+0 records in 9092+0 records out 37240832 bytes (37 MB) copied, 1.52535 s, 24.4 MB/s Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:57:26 (1713430646) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:57:38 (1713430658) targets are mounted 04:57:38 (1713430658) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec affected facets: mds1 oleg316-server: oleg316-server.virtnet: executing _wait_recovery_complete *.lustre-MDT0000.recovery_status 1475 oleg316-server: *.lustre-MDT0000.recovery_status status: COMPLETE Waiting for MDT destroys to complete before 3116, after 3116 PASS 20b (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 20c: check that client eviction does not affect file content ========================================================== 04:57:50 (1713430670) multiop /mnt/lustre/f20c.replay-single vOw_c TMPPIPE=/tmp/multiop_open_wait_pipe.7537 -rw-r--r-- 1 root root 1 Apr 18 04:57 /mnt/lustre/f20c.replay-single PASS 20c (3s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 21: |X| open(O_CREAT), unlink touch new, replay, close (test mds_cleanup_orphans) ========================================================== 04:57:54 (1713430674) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3116 7210888 1% /mnt/lustre multiop /mnt/lustre/f21.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7537 Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:57:58 (1713430678) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:58:12 (1713430692) targets are mounted 04:58:12 (1713430692) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 21 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 22: open(O_CREAT), |X| unlink, replay, close (test mds_cleanup_orphans) ========================================================== 04:58:21 (1713430701) multiop /mnt/lustre/f22.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7537 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605440 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3116 7210860 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:58:24 (1713430704) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:58:38 (1713430718) targets are mounted 04:58:38 (1713430718) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 22 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 23: open(O_CREAT), |X| unlink touch new, replay, close (test mds_cleanup_orphans) ========================================================== 04:58:47 (1713430727) multiop /mnt/lustre/f23.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7537 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210884 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:58:50 (1713430730) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:59:04 (1713430744) targets are mounted 04:59:04 (1713430744) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 23 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 24: open(O_CREAT), replay, unlink, close (test mds_cleanup_orphans) ========================================================== 04:59:12 (1713430752) multiop /mnt/lustre/f24.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7537 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210884 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:59:16 (1713430756) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:59:30 (1713430770) targets are mounted 04:59:30 (1713430770) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 24 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 25: open(O_CREAT), unlink, replay, close (test mds_cleanup_orphans) ========================================================== 04:59:38 (1713430778) multiop /mnt/lustre/f25.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7537 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210884 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 04:59:42 (1713430782) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 04:59:56 (1713430796) targets are mounted 04:59:56 (1713430796) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 25 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 26: |X| open(O_CREAT), unlink two, close one, replay, close one (test mds_cleanup_orphans) ========================================================== 05:00:04 (1713430804) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210884 1% /mnt/lustre multiop /mnt/lustre/f26.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7537 multiop /mnt/lustre/f26.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7537 Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:00:08 (1713430808) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:00:22 (1713430822) targets are mounted 05:00:22 (1713430822) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 26 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 27: |X| open(O_CREAT), unlink two, replay, close two (test mds_cleanup_orphans) ========================================================== 05:00:30 (1713430830) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210884 1% /mnt/lustre multiop /mnt/lustre/f27.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7537 multiop /mnt/lustre/f27.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7537 Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:00:34 (1713430834) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:00:47 (1713430847) targets are mounted 05:00:47 (1713430847) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 27 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 28: open(O_CREAT), |X| unlink two, close one, replay, close one (test mds_cleanup_orphans) ========================================================== 05:00:56 (1713430856) multiop /mnt/lustre/f28.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7537 multiop /mnt/lustre/f28.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7537 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210884 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:00:59 (1713430859) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:01:13 (1713430873) targets are mounted 05:01:13 (1713430873) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 28 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 29: open(O_CREAT), |X| unlink two, replay, close two (test mds_cleanup_orphans) ========================================================== 05:01:22 (1713430882) multiop /mnt/lustre/f29.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7537 multiop /mnt/lustre/f29.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7537 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210884 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:01:25 (1713430885) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:01:39 (1713430899) targets are mounted 05:01:39 (1713430899) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 29 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 30: open(O_CREAT) two, unlink two, replay, close two (test mds_cleanup_orphans) ========================================================== 05:01:48 (1713430908) multiop /mnt/lustre/f30.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7537 multiop /mnt/lustre/f30.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7537 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210884 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:01:51 (1713430911) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:02:05 (1713430925) targets are mounted 05:02:05 (1713430925) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 30 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 31: open(O_CREAT) two, unlink one, |X| unlink one, close two (test mds_cleanup_orphans) ========================================================== 05:02:13 (1713430933) multiop /mnt/lustre/f31.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7537 multiop /mnt/lustre/f31.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7537 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1876 1285812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210884 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:02:17 (1713430937) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:02:31 (1713430951) targets are mounted 05:02:31 (1713430951) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 31 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 32: close() notices client eviction; close() after client eviction ========================================================== 05:02:39 (1713430959) multiop /mnt/lustre/f32.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7537 multiop /mnt/lustre/f32.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7537 pdsh@oleg316-client: oleg316-client: ssh exited with exit code 5 PASS 32 (5s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 33a: fid seq shouldn't be reused after abort recovery ========================================================== 05:02:45 (1713430965) total: 10 open/close in 0.04 seconds: 283.71 ops/second Replay barrier on lustre-MDT0000 Stopping /mnt/lustre-mds1 (opts:) on oleg316-server Failover mds1 to oleg316-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 total: 10 open/close in 0.03 seconds: 305.73 ops/second PASS 33a (16s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 33b: test fid seq allocation ======= 05:03:03 (1713430983) fail_loc=0x1311 total: 10 open/close in 0.03 seconds: 308.74 ops/second Replay barrier on lustre-MDT0000 Stopping /mnt/lustre-mds1 (opts:) on oleg316-server Failover mds1 to oleg316-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 oleg316-client: error: invalid path '/mnt/lustre': Input/output error pdsh@oleg316-client: oleg316-client: ssh exited with exit code 5 first stat failed: 5 total: 10 open/close in 0.04 seconds: 242.52 ops/second PASS 33b (17s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 34: abort recovery before client does replay (test mds_cleanup_orphans) ========================================================== 05:03:22 (1713431002) multiop /mnt/lustre/f34.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7537 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1908 1285780 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1720 1285968 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210884 1% /mnt/lustre Stopping /mnt/lustre-mds1 (opts:) on oleg316-server Failover mds1 to oleg316-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 oleg316-client: error: invalid path '/mnt/lustre': Input/output error pdsh@oleg316-client: oleg316-client: ssh exited with exit code 5 first stat failed: 5 PASS 34 (18s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 35: test recovery from llog for unlink op ========================================================== 05:03:42 (1713431022) fail_loc=0x80000119 Stopping /mnt/lustre-mds1 (opts:) on oleg316-server Failover mds1 to oleg316-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 rm: cannot remove '/mnt/lustre/f35.replay-single': Input/output error oleg316-client: error: invalid path '/mnt/lustre': Input/output error pdsh@oleg316-client: oleg316-client: ssh exited with exit code 5 first stat failed: 5 Can't lstat /mnt/lustre/f35.replay-single: No such file or directory PASS 35 (18s) debug_raw_pointers=0 debug_raw_pointers=0 SKIP: replay-single test_36 skipping ALWAYS excluded test 36 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 37: abort recovery before client does replay (test mds_cleanup_orphans for directories) ========================================================== 05:04:02 (1713431042) multiop /mnt/lustre/d37.replay-single/f37.replay-single vdD_c TMPPIPE=/tmp/multiop_open_wait_pipe.7537 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1972 1285716 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1784 1285904 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210884 1% /mnt/lustre Stopping /mnt/lustre-mds1 (opts:) on oleg316-server Failover mds1 to oleg316-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 oleg316-client: error: invalid path '/mnt/lustre': Input/output error pdsh@oleg316-client: oleg316-client: ssh exited with exit code 5 first stat failed: 5 PASS 37 (18s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 38: test recovery from unlink llog (test llog_gen_rec) ========================================================== 05:04:22 (1713431062) total: 800 open/close in 2.11 seconds: 379.09 ops/second - unlinked 0 (time 1713431066 ; total 0 ; last 0) total: 400 unlinks in 1 seconds: 400.000000 unlinks/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2104 1285584 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210884 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:04:30 (1713431070) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:04:44 (1713431084) targets are mounted 05:04:44 (1713431084) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec - unlinked 0 (time 1713431090 ; total 0 ; last 0) total: 400 unlinks in 1 seconds: 400.000000 unlinks/second Can't lstat /mnt/lustre/f38.replay-single-*: No such file or directory PASS 38 (33s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 39: test recovery from unlink llog (test llog_gen_rec) ========================================================== 05:04:57 (1713431097) total: 800 open/close in 2.24 seconds: 357.29 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2116 1285572 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210884 1% /mnt/lustre - unlinked 0 (time 1713431103 ; total 0 ; last 0) total: 400 unlinks in 0 seconds: inf unlinks/second Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:05:05 (1713431105) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:05:18 (1713431118) targets are mounted 05:05:18 (1713431118) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec - unlinked 0 (time 1713431125 ; total 0 ; last 0) total: 400 unlinks in 1 seconds: 400.000000 unlinks/second Can't lstat /mnt/lustre/f39.replay-single-*: No such file or directory PASS 39 (33s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 41: read from a valid osc while other oscs are invalid ========================================================== 05:05:31 (1713431131) 1+0 records in 1+0 records out 4096 bytes (4.1 kB) copied, 0.00148101 s, 2.8 MB/s 1+0 records in 1+0 records out 4096 bytes (4.1 kB) copied, 0.0036153 s, 1.1 MB/s PASS 41 (2s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 42: recovery after ost failure ===== 05:05:35 (1713431135) total: 800 open/close in 2.15 seconds: 372.31 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2140 1285548 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre - unlinked 0 (time 1713431141 ; total 0 ; last 0) total: 400 unlinks in 1 seconds: 400.000000 unlinks/second debug=-1 Failing ost1 on oleg316-server Stopping /mnt/lustre-ost1 (opts:) on oleg316-server 05:05:43 (1713431143) shut down Failover ost1 to oleg316-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-OST0000 05:05:57 (1713431157) targets are mounted 05:05:57 (1713431157) facet_failover done wait for MDS to timeout and recover debug=super ioctl neterror warning dlmtrace error emerg ha rpctrace vfstrace config console lfsck debug=super ioctl neterror warning dlmtrace error emerg ha rpctrace vfstrace config console lfsck - unlinked 0 (time 1713431198 ; total 0 ; last 0) total: 400 unlinks in 1 seconds: 400.000000 unlinks/second Can't lstat /mnt/lustre/f42.replay-single-*: No such file or directory PASS 42 (65s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 43: mds osc import failure during recovery; don't LBUG ========================================================== 05:06:42 (1713431202) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2196 1285492 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre fail_loc=0x80000204 Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:06:46 (1713431206) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:07:00 (1713431220) targets are mounted 05:07:00 (1713431220) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 43 (35s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 44a: race in target handle connect ========================================================== 05:07:18 (1713431238) at_max=40 1 of 10 (1713431239) service : cur 5 worst 5 (at 1713429601, 1638s ago) 4 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2092 1285596 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 2 of 10 (1713431245) service : cur 5 worst 5 (at 1713429601, 1644s ago) 5 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2092 1285596 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 3 of 10 (1713431250) service : cur 5 worst 5 (at 1713429601, 1649s ago) 5 5 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2092 1285596 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 4 of 10 (1713431256) service : cur 5 worst 5 (at 1713429601, 1655s ago) 5 5 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2092 1285596 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 5 of 10 (1713431261) service : cur 5 worst 5 (at 1713429601, 1660s ago) 5 5 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2092 1285596 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 6 of 10 (1713431267) service : cur 5 worst 5 (at 1713429601, 1665s ago) 5 5 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2092 1285596 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 7 of 10 (1713431272) service : cur 5 worst 5 (at 1713429601, 1671s ago) 5 5 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2092 1285596 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 8 of 10 (1713431278) service : cur 5 worst 5 (at 1713429601, 1676s ago) 5 5 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2092 1285596 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 9 of 10 (1713431283) service : cur 5 worst 5 (at 1713429601, 1682s ago) 5 5 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2092 1285596 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 10 of 10 (1713431289) service : cur 5 worst 5 (at 1713429601, 1687s ago) 5 5 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2092 1285596 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre fail_loc=0 at_max=600 PASS 44a (58s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 44b: race in target handle connect ========================================================== 05:08:17 (1713431297) 1 of 10 (1713431298) service : cur 5 worst 5 (at 1713429601, 1696s ago) 5 5 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.203.116@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 2 of 10 (1713431338) service : cur 40 worst 40 (at 1713431338, 0s ago) 40 5 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.203.116@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 3 of 10 (1713431359) service : cur 40 worst 40 (at 1713431338, 20s ago) 40 5 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.203.116@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 4 of 10 (1713431379) service : cur 40 worst 40 (at 1713431338, 41s ago) 40 5 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.203.116@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 5 of 10 (1713431400) service : cur 40 worst 40 (at 1713431338, 61s ago) 40 5 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.203.116@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 6 of 10 (1713431420) service : cur 40 worst 40 (at 1713431338, 82s ago) 40 40 5 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.203.116@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 7 of 10 (1713431441) service : cur 40 worst 40 (at 1713431338, 102s ago) 40 40 5 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.203.116@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 8 of 10 (1713431461) service : cur 40 worst 40 (at 1713431338, 123s ago) 40 40 5 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.203.116@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 9 of 10 (1713431482) service : cur 40 worst 40 (at 1713431338, 143s ago) 40 40 5 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.203.116@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 10 of 10 (1713431502) service : cur 40 worst 40 (at 1713431338, 164s ago) 40 40 5 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.203.116@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre PASS 44b (227s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 44c: race in target handle connect ========================================================== 05:12:05 (1713431525) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2092 1285596 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1816 1285872 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre total: 100 create in 0.21 seconds: 479.35 ops/second fail_loc=0x80000712 Stopping /mnt/lustre-mds1 (opts:) on oleg316-server Failover mds1 to oleg316-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 unlink(/mnt/lustre/f44c.replay-single-0) error: No such file or directory total: 0 unlinks in 0 seconds: -nan unlinks/second Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:12:22 (1713431542) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:12:35 (1713431555) targets are mounted 05:12:35 (1713431555) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec unlink(/mnt/lustre/f44c.replay-single-0) error: No such file or directory total: 0 unlinks in 0 seconds: -nan unlinks/second PASS 44c (37s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 45: Handle failed close ============ 05:12:44 (1713431564) multiop /mnt/lustre/f45.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7537 /mnt/lustre/f45.replay-single has type file OK PASS 45 (2s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 46: Don't leak file handle after open resend (3325) ========================================================== 05:12:48 (1713431568) fail_loc=0x122 fail_loc=0 Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:13:06 (1713431586) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:13:18 (1713431598) targets are mounted 05:13:18 (1713431598) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec lfs path2fid: cannot get fid for 'f46.replay-single': No such file or directory PASS 46 (38s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 47: MDS->OSC failure during precreate cleanup (2824) ========================================================== 05:13:27 (1713431607) total: 20 open/close in 0.07 seconds: 300.78 ops/second Failing ost1 on oleg316-server Stopping /mnt/lustre-ost1 (opts:) on oleg316-server 05:13:29 (1713431609) shut down Failover ost1 to oleg316-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-OST0000 05:13:42 (1713431622) targets are mounted 05:13:42 (1713431622) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec fail_loc=0x80000204 total: 20 open/close in 0.06 seconds: 348.54 ops/second - unlinked 0 (time 1713431687 ; total 0 ; last 0) total: 20 unlinks in 0 seconds: inf unlinks/second PASS 47 (81s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 48: MDS->OSC failure during precreate cleanup (2824) ========================================================== 05:14:50 (1713431690) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2124 1285564 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1848 1285840 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre total: 20 open/close in 0.07 seconds: 297.31 ops/second Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:14:53 (1713431693) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:15:07 (1713431707) targets are mounted 05:15:07 (1713431707) facet_failover done fail_loc=0x80000216 total: 20 open/close in 0.07 seconds: 288.77 ops/second - unlinked 0 (time 1713431771 ; total 0 ; last 0) total: 40 unlinks in 0 seconds: inf unlinks/second PASS 48 (82s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 50: Double OSC recovery, don't LASSERT (3812) ========================================================== 05:16:14 (1713431774) PASS 50 (7s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 52: time out lock replay (3764) ==== 05:16:22 (1713431782) multiop /mnt/lustre/f52.replay-single vs_s TMPPIPE=/tmp/multiop_open_wait_pipe.7537 fail_loc=0x80000157 Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:16:24 (1713431784) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:16:37 (1713431797) targets are mounted 05:16:37 (1713431797) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0x0 fail_loc=0x0 PASS 52 (81s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53a: |X| close request while two MDC requests in flight ========================================================== 05:17:45 (1713431865) fail_loc=0x80000115 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:17:49 (1713431869) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:18:03 (1713431883) targets are mounted 05:18:03 (1713431883) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53a.replay-single-1/f has type file OK /mnt/lustre/d53a.replay-single-2/f has type file OK PASS 53a (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53b: |X| open request while two MDC requests in flight ========================================================== 05:18:12 (1713431892) multiop /mnt/lustre/d53b.replay-single-1/f vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7537 fail_loc=0x80000107 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:18:17 (1713431897) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:18:31 (1713431911) targets are mounted 05:18:31 (1713431911) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53b.replay-single-1/f has type file OK /mnt/lustre/d53b.replay-single-2/f has type file OK PASS 53b (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53c: |X| open request and close request while two MDC requests in flight ========================================================== 05:18:40 (1713431920) fail_loc=0x80000107 fail_loc=0x80000115 Replay barrier on lustre-MDT0000 Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:18:45 (1713431925) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:18:59 (1713431939) targets are mounted 05:18:59 (1713431939) facet_failover done fail_loc=0 /mnt/lustre/d53c.replay-single-1/f has type file OK /mnt/lustre/d53c.replay-single-2/f has type file OK PASS 53c (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53d: close reply while two MDC requests in flight ========================================================== 05:19:08 (1713431948) fail_loc=0x8000013b fail_loc=0 Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:19:10 (1713431950) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:19:23 (1713431963) targets are mounted 05:19:23 (1713431963) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53d.replay-single-1/f has type file OK /mnt/lustre/d53d.replay-single-2/f has type file OK PASS 53d (22s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53e: |X| open reply while two MDC requests in flight ========================================================== 05:19:32 (1713431972) fail_loc=0x119 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:19:37 (1713431977) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:19:50 (1713431990) targets are mounted 05:19:50 (1713431990) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53e.replay-single-1/f has type file OK /mnt/lustre/d53e.replay-single-2/f has type file OK PASS 53e (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53f: |X| open reply and close reply while two MDC requests in flight ========================================================== 05:19:59 (1713431999) fail_loc=0x119 fail_loc=0x8000013b Replay barrier on lustre-MDT0000 Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:20:03 (1713432003) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:20:17 (1713432017) targets are mounted 05:20:17 (1713432017) facet_failover done fail_loc=0 /mnt/lustre/d53f.replay-single-1/f has type file OK /mnt/lustre/d53f.replay-single-2/f has type file OK PASS 53f (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53g: |X| drop open reply and close request while close and open are both in flight ========================================================== 05:20:26 (1713432026) fail_loc=0x119 fail_loc=0x80000115 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:20:30 (1713432030) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:20:44 (1713432044) targets are mounted 05:20:44 (1713432044) facet_failover done /mnt/lustre/d53g.replay-single-1/f has type file OK /mnt/lustre/d53g.replay-single-2/f has type file OK PASS 53g (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53h: open request and close reply while two MDC requests in flight ========================================================== 05:20:52 (1713432052) fail_loc=0x80000107 fail_loc=0x8000013b Replay barrier on lustre-MDT0000 Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:20:58 (1713432058) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:21:11 (1713432071) targets are mounted 05:21:11 (1713432071) facet_failover done fail_loc=0 /mnt/lustre/d53h.replay-single-1/f has type file OK /mnt/lustre/d53h.replay-single-2/f has type file OK PASS 53h (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 55: let MDS_CHECK_RESENT return the original return code instead of 0 ========================================================== 05:21:20 (1713432080) fail_loc=0x8000012b fail_loc=0x0 touch: rm: cannot touch '/mnt/lustre/f55.replay-single'cannot remove '/mnt/lustre/f55.replay-single': No such file or directory : No such file or directory PASS 55 (61s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 56: don't replay a symlink open request (3440) ========================================================== 05:22:23 (1713432143) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2132 1285556 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1848 1285840 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:22:26 (1713432146) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:22:40 (1713432160) targets are mounted 05:22:40 (1713432160) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 56 (34s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 57: test recovery from llog for setattr op ========================================================== 05:22:58 (1713432178) fail_loc=0x8000012c UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2132 1285556 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1848 1285840 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:23:02 (1713432182) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:23:15 (1713432195) targets are mounted 05:23:15 (1713432195) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec affected facets: mds1 oleg316-server: oleg316-server.virtnet: executing _wait_recovery_complete *.lustre-MDT0000.recovery_status 1475 oleg316-server: *.lustre-MDT0000.recovery_status status: COMPLETE Waiting for orphan cleanup... osp.lustre-OST0000-osc-MDT0000.old_sync_processed osp.lustre-OST0000-osc-MDT0001.old_sync_processed osp.lustre-OST0001-osc-MDT0000.old_sync_processed osp.lustre-OST0001-osc-MDT0001.old_sync_processed wait 40 secs maximumly for oleg316-server mds-ost sync done. /mnt/lustre/f57.replay-single has type file OK fail_loc=0x0 PASS 57 (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 58a: test recovery from llog for setattr op (test llog_gen_rec) ========================================================== 05:23:28 (1713432208) fail_loc=0x8000012c total: 2500 open/close in 6.20 seconds: 402.98 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2432 1285256 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1848 1285840 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:23:40 (1713432220) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:23:54 (1713432234) targets are mounted 05:23:54 (1713432234) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0x0 - unlinked 0 (time 1713432248 ; total 0 ; last 0) total: 2500 unlinks in 3 seconds: 833.333313 unlinks/second PASS 58a (45s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 58b: test replay of setxattr op ==== 05:24:14 (1713432254) Starting client: oleg316-client.virtnet: -o user_xattr,flock oleg316-server@tcp:/lustre /mnt/lustre2 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2408 1285280 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1848 1285840 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:24:19 (1713432259) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:24:34 (1713432274) targets are mounted 05:24:34 (1713432274) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Stopping client oleg316-client.virtnet /mnt/lustre2 (opts:) oleg316-client.virtnet: executing wait_import_state_mount FULL mgc.*.mgs_server_uuid mgc.*.mgs_server_uuid in FULL state after 0 sec PASS 58b (29s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 58c: resend/reconstruct setxattr op ========================================================== 05:24:45 (1713432285) Starting client: oleg316-client.virtnet: -o user_xattr,flock oleg316-server@tcp:/lustre /mnt/lustre2 fail_val=0 fail_loc=0x123 fail_loc=0 fail_loc=0x119 fail_loc=0 Stopping client oleg316-client.virtnet /mnt/lustre2 (opts:) PASS 58c (129s) debug_raw_pointers=0 debug_raw_pointers=0 SKIP: replay-single test_59 skipping ALWAYS excluded test 59 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 60: test llog post recovery init vs llog unlink ========================================================== 05:26:55 (1713432415) total: 200 open/close in 0.35 seconds: 577.52 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2268 1285420 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1848 1285840 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre - unlinked 0 (time 1713432419 ; total 0 ; last 0) total: 100 unlinks in 0 seconds: inf unlinks/second Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:27:00 (1713432420) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:27:14 (1713432434) targets are mounted 05:27:14 (1713432434) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec - unlinked 0 (time 1713432440 ; total 0 ; last 0) total: 100 unlinks in 0 seconds: inf unlinks/second PASS 60 (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 61a: test race llog recovery vs llog cleanup ========================================================== 05:27:22 (1713432442) total: 800 open/close in 1.48 seconds: 542.09 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2352 1285336 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2020 1285668 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre - unlinked 0 (time 1713432447 ; total 0 ; last 0) total: 800 unlinks in 1 seconds: 800.000000 unlinks/second fail_val=0 fail_loc=0x80000221 Failing ost1 on oleg316-server Stopping /mnt/lustre-ost1 (opts:) on oleg316-server 05:27:30 (1713432450) shut down Failover ost1 to oleg316-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-OST0000 05:27:45 (1713432465) targets are mounted 05:27:45 (1713432465) facet_failover done Failing ost1 on oleg316-server Stopping /mnt/lustre-ost1 (opts:) on oleg316-server 05:27:56 (1713432476) shut down Failover ost1 to oleg316-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-OST0000 05:28:10 (1713432490) targets are mounted 05:28:10 (1713432490) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec fail_val=0 fail_loc=0x0 Can't lstat /mnt/lustre/d61a.replay-single/f61a.replay-single-*: No such file or directory PASS 61a (83s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 61b: test race mds llog sync vs llog cleanup ========================================================== 05:28:46 (1713432526) fail_loc=0x8000013a Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:28:48 (1713432528) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:29:01 (1713432541) targets are mounted 05:29:01 (1713432541) facet_failover done Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:29:12 (1713432552) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:29:25 (1713432565) targets are mounted 05:29:25 (1713432565) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1+0 records in 1+0 records out 4096 bytes (4.1 kB) copied, 0.00353931 s, 1.2 MB/s PASS 61b (46s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 61c: test race mds llog sync vs llog cleanup ========================================================== 05:29:34 (1713432574) fail_val=0 fail_loc=0x80000222 Failing ost1 on oleg316-server Stopping /mnt/lustre-ost1 (opts:) on oleg316-server 05:29:45 (1713432585) shut down Failover ost1 to oleg316-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-OST0000 05:29:59 (1713432599) targets are mounted 05:29:59 (1713432599) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec fail_val=0 fail_loc=0x0 PASS 61c (29s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 61d: error in llog_setup should cleanup the llog context correctly ========================================================== 05:30:05 (1713432605) Stopping /mnt/lustre-mds1 (opts:) on oleg316-server fail_loc=0x80000605 Starting mgs: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: mount.lustre: mount /dev/mapper/mds1_flakey at /mnt/lustre-mds1 failed: Operation not supported pdsh@oleg316-client: oleg316-server: ssh exited with exit code 95 Start of /dev/mapper/mds1_flakey on mgs failed 95 fail_loc=0 Starting mgs: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 PASS 61d (8s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 62: don't mis-drop resent replay === 05:30:14 (1713432614) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2344 1285344 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2004 1285684 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3128 7210912 1% /mnt/lustre total: 25 open/close in 0.06 seconds: 397.19 ops/second fail_loc=0x80000707 Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:30:19 (1713432619) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:30:33 (1713432633) targets are mounted 05:30:33 (1713432633) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0 - unlinked 0 (time 1713432695 ; total 0 ; last 0) total: 25 unlinks in 0 seconds: inf unlinks/second PASS 62 (82s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 65a: AT: verify early replies ====== 05:31:37 (1713432697) at_history=8 at_history=8 debug=other fail_val=11000 fail_loc=0x8000050a 00000100:00001000:1.0:1713432726.987582:0:14576:0:(client.c:537:ptlrpc_at_recv_early_reply()) @@@ Early reply #1, new deadline in 35s (25s) req@ffff8800a82c1f80 x1796661117157824/t0(0) o101->lustre-MDT0000-mdc-ffff880135398800@192.168.203.116@tcp:12/10 lens 664/66320 e 1 to 0 dl 1713432761 ref 2 fl Rpc:PQr/200/ffffffff rc 0/-1 job:'createmany.0' uid:0 gid:0 portal 12 : cur 36 worst 40 (at 1713431338, 1394s ago) 40 40 40 40 portal 29 : cur 5 worst 5 (at 1713429890, 2842s ago) 5 0 0 0 portal 23 : cur 5 worst 5 (at 1713429890, 2842s ago) 5 5 5 5 portal 30 : cur 5 worst 5 (at 1713429895, 2837s ago) 5 0 0 0 portal 17 : cur 5 worst 5 (at 1713429976, 2756s ago) 5 5 0 5 portal 13 : cur 5 worst 5 (at 1713430358, 2374s ago) 5 5 0 0 portal 12 : cur 11 worst 40 (at 1713431338, 1403s ago) 40 40 40 40 portal 29 : cur 5 worst 5 (at 1713429890, 2851s ago) 5 0 0 0 portal 23 : cur 5 worst 5 (at 1713429890, 2851s ago) 5 5 5 5 portal 30 : cur 5 worst 5 (at 1713429895, 2846s ago) 5 0 0 0 portal 17 : cur 5 worst 5 (at 1713429976, 2765s ago) 5 5 0 5 portal 13 : cur 5 worst 5 (at 1713430358, 2383s ago) 5 5 0 0 PASS 65a (45s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 65b: AT: verify early replies on packed reply / bulk ========================================================== 05:32:23 (1713432743) at_history=8 at_history=8 debug=other trace fail_val=11 fail_loc=0x224 fail_loc=0 00000100:00001000:1.0:1713432772.651518:0:1967:0:(client.c:537:ptlrpc_at_recv_early_reply()) @@@ Early reply #1, new deadline in 35s (25s) req@ffff8801353a8700 x1796661117167808/t0(0) o4->lustre-OST0000-osc-ffff880135398800@192.168.203.116@tcp:6/4 lens 4584/448 e 1 to 0 dl 1713432807 ref 2 fl Rpc:Qr/200/ffffffff rc 0/-1 job:'multiop.0' uid:0 gid:0 debug=super ioctl neterror warning dlmtrace error emerg ha rpctrace vfstrace config console lfsck debug=super ioctl neterror warning dlmtrace error emerg ha rpctrace vfstrace config console lfsck portal 28 : cur 5 worst 5 (at 1713429890, 2888s ago) 5 5 5 5 portal 7 : cur 5 worst 5 (at 1713429891, 2887s ago) 5 5 5 5 portal 6 : cur 36 worst 36 (at 1713432777, 1s ago) 36 0 0 0 portal 17 : cur 5 worst 5 (at 1713430164, 2614s ago) 5 5 5 5 PASS 65b (36s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 66a: AT: verify MDT service time adjusts with no early replies ========================================================== 05:33:00 (1713432780) at_history=8 at_history=8 portal 12 : cur 5 worst 40 (at 1713431338, 1465s ago) 5 40 40 40 fail_val=5000 fail_loc=0x8000050a portal 12 : cur 5 worst 40 (at 1713431338, 1471s ago) 5 40 40 40 fail_val=10000 fail_loc=0x8000050a portal 12 : cur 36 worst 40 (at 1713431338, 1481s ago) 36 40 40 40 fail_loc=0 portal 12 : cur 5 worst 40 (at 1713431338, 1491s ago) 36 40 40 40 Current MDT timeout 5, worst 40 PASS 66a (50s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 66b: AT: verify net latency adjusts ========================================================== 05:33:51 (1713432831) at_history=8 at_history=8 fail_val=10 fail_loc=0x50c fail_loc=0 network timeout orig 5, cur 11, worst 11 PASS 66b (84s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 67a: AT: verify slow request processing doesn't induce reconnects ========================================================== 05:35:17 (1713432917) at_history=8 at_history=8 fail_val=400 fail_loc=0x50a fail_loc=0 0 osc reconnect attempts on gradual slow PASS 67a (73s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 67b: AT: verify instant slowdown doesn't induce reconnects ========================================================== 05:36:31 (1713432991) at_history=8 at_history=8 Creating to objid 5057 on ost lustre-OST0000... fail_val=20000 fail_loc=0x80000223 total: 17 open/close in 0.04 seconds: 387.34 ops/second Connected clients: oleg316-client.virtnet oleg316-client.virtnet service : cur 5 worst 5 (at 1713429608, 3407s ago) 1 1 1 1 phase 2 0 osc reconnect attempts on instant slow fail_loc=0x80000223 fail_loc=0 Connected clients: oleg316-client.virtnet oleg316-client.virtnet service : cur 5 worst 5 (at 1713429608, 3408s ago) 1 1 1 1 0 osc reconnect attempts on 2nd slow PASS 67b (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 68: AT: verify slowing locks ======= 05:37:00 (1713433020) at_history=8 at_history=8 /home/green/git/lustre-release/lustre/tests/replay-single.sh: line 1968: $ldlm_enqueue_min: ambiguous redirect fail_val=19 fail_loc=0x80000312 fail_val=25 fail_loc=0x80000312 fail_loc=0 /home/green/git/lustre-release/lustre/tests/replay-single.sh: line 1983: $ldlm_enqueue_min: ambiguous redirect PASS 68 (69s) debug_raw_pointers=0 debug_raw_pointers=0 Cleaning up AT ... debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70a: check multi client t-f ======== 05:38:11 (1713433091) SKIP: replay-single test_70a Need two or more clients, have 1 SKIP 70a (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70b: dbench 2mdts recovery; 1 clients ========================================================== 05:38:14 (1713433094) Starting client oleg316-client.virtnet: -o user_xattr,flock oleg316-server@tcp:/lustre /mnt/lustre Started clients oleg316-client.virtnet: 192.168.203.116@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) striped dir -i0 -c2 -H fnv_1a_64 /mnt/lustre/d70b.replay-single + MISSING_DBENCH_OK= + PATH=/opt/iozone/bin:/opt/iozone/bin:/home/green/git/lustre-release/lustre/tests/mpi:/home/green/git/lustre-release/lustre/tests/racer:/home/green/git/lustre-release/lustre/../lustre-iokit/sgpdd-survey:/home/green/git/lustre-release/lustre/tests:/home/green/git/lustre-release/lustre/utils/gss:/home/green/git/lustre-release/lustre/utils:/opt/iozone/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin::/home/green/git/lustre-release/lustre/utils:/home/green/git/lustre-release/lustre/tests:/sbin:/usr/sbin:/home/green/git/lustre-release/lustre/utils:/home/green/git/lustre-release/lustre/tests/: + DBENCH_LIB= + TESTSUITE=replay-single + TESTNAME=test_70b + MOUNT=/mnt/lustre ++ hostname + DIR=/mnt/lustre/d70b.replay-single/oleg316-client.virtnet + LCTL=/home/green/git/lustre-release/lustre/utils/lctl + rundbench 1 -t 300 dbench: no process found Started rundbench load pid=21684 ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2524 1285164 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2112 1285576 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 26152 3556148 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1956 3594028 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 28108 7150176 1% /mnt/lustre test_70b fail mds1 1 times Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:38:21 (1713433101) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:38:35 (1713433115) targets are mounted 05:38:35 (1713433115) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2480 1285208 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2072 1285616 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 27020 3568260 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 4236 3588888 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 31256 7157148 1% /mnt/lustre test_70b fail mds2 2 times Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server oleg316-client.virtnet: looking for dbench program oleg316-client.virtnet: /usr/bin/dbench oleg316-client.virtnet: creating output directory /mnt/lustre/d70b.replay-single/oleg316-client.virtnet oleg316-client.virtnet: mkdir: created directory '/mnt/lustre/d70b.replay-single/oleg316-client.virtnet' oleg316-client.virtnet: found dbench client file /usr/share/dbench/client.txt oleg316-client.virtnet: '/usr/share/dbench/client.txt' -> 'client.txt' oleg316-client.virtnet: running 'dbench 1 -t 300' on /mnt/lustre/d70b.replay-single/oleg316-client.virtnet at Thu Apr 18 05:38:15 EDT 2024 oleg316-client.virtnet: waiting for dbench pid 21708 oleg316-client.virtnet: dbench version 4.00 - Copyright Andrew Tridgell 1999-2004 oleg316-client.virtnet: oleg316-client.virtnet: Running for 300 seconds with load 'client.txt' and minimum warmup 60 secs oleg316-client.virtnet: failed to create barrier semaphore oleg316-client.virtnet: 0 of 1 processes prepared for launch 0 sec oleg316-client.virtnet: 1 of 1 processes prepared for launch 0 sec oleg316-client.virtnet: releasing clients oleg316-client.virtnet: 1 493 17.20 MB/sec warmup 1 sec latency 16.457 ms oleg316-client.virtnet: 1 1091 11.47 MB/sec warmup 2 sec latency 14.445 ms oleg316-client.virtnet: 1 1785 8.77 MB/sec warmup 3 sec latency 10.874 ms oleg316-client.virtnet: 1 2412 7.01 MB/sec warmup 4 sec latency 13.552 ms oleg316-client.virtnet: 1 3207 6.86 MB/sec warmup 5 sec latency 15.055 ms oleg316-client.virtnet: 1 3916 6.68 MB/sec warmup 6 sec latency 153.265 ms oleg316-client.virtnet: 1 3916 5.72 MB/sec warmup 7 sec latency 1153.459 ms oleg316-client.virtnet: 1 3916 5.01 MB/sec warmup 8 sec latency 2153.667 ms oleg316-client.virtnet: 1 3916 4.45 MB/sec warmup 9 sec latency 3153.976 ms oleg316-client.virtnet: 1 3916 4.01 MB/sec warmup 10 sec latency 4154.207 ms oleg316-client.virtnet: 1 3916 3.64 MB/sec warmup 11 sec latency 5154.398 ms oleg316-client.virtnet: 1 3916 3.34 MB/sec warmup 12 sec latency 6154.627 ms oleg316-client.virtnet: 1 3916 3.08 MB/sec warmup 13 sec latency 7154.848 ms oleg316-client.virtnet: 1 3916 2.86 MB/sec warmup 14 sec latency 8155.093 ms oleg316-client.virtnet: 1 3916 2.67 MB/sec warmup 15 sec latency 9155.279 ms oleg316-client.virtnet: 1 3916 2.50 MB/sec warmup 16 sec latency 10155.490 ms oleg316-client.virtnet: 1 3916 2.36 MB/sec warmup 17 sec latency 11155.768 ms oleg316-client.virtnet: 1 3916 2.23 MB/sec warmup 18 sec latency 12155.930 ms oleg316-client.virtnet: 1 3916 2.11 MB/sec warmup 19 sec latency 13156.142 ms oleg316-client.virtnet: 1 3916 2.00 MB/sec warmup 20 sec latency 14156.281 ms oleg316-client.virtnet: 1 3916 1.91 MB/sec warmup 21 sec latency 15156.430 ms oleg316-client.virtnet: 1 3916 1.82 MB/sec warmup 22 sec latency 16156.649 ms oleg316-client.virtnet: 1 3916 1.74 MB/sec warmup 23 sec latency 17156.964 ms oleg316-client.virtnet: 1 3916 1.67 MB/sec warmup 24 sec latency 18157.180 ms oleg316-client.virtnet: 1 3992 1.60 MB/sec warmup 25 sec latency 18891.142 ms oleg316-client.virtnet: 1 4518 1.57 MB/sec warmup 26 sec latency 11.616 ms oleg316-client.virtnet: 1 5089 1.67 MB/sec warmup 27 sec latency 16.467 ms oleg316-client.virtnet: 1 5405 1.62 MB/sec warmup 28 sec latency 17.120 ms oleg316-client.virtnet: 1 5961 1.62 MB/sec warmup 29 sec latency 11.983 ms oleg316-client.virtnet: 1 6754 1.78 MB/sec warmup 30 sec latency 14.636 ms oleg316-client.virtnet: 1 7358 1.90 MB/sec warmup 31 sec latency 15.332 ms oleg316-client.virtnet: 1 7833 1.86 MB/sec warmup 32 sec latency 12.706 ms oleg316-client.virtnet: 1 8241 1.86 MB/sec warmup 33 sec latency 105:38:48 (1713433128) shut down Failover mds2 to oleg316-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 05:39:03 (1713433143) targets are mounted 05:39:03 (1713433143) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2468 1285220 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2004 1285684 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 37604 3566484 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 12032 3593996 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 49636 7160480 1% /mnt/lustre test_70b fail mds1 3 times Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:39:14 (1713433154) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:39:28 (1713433168) targets are mounted 05:39:28 (1713433168) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 4.190 ms oleg316-client.virtnet: 1 8784 1.90 MB/sec warmup 34 sec latency 12.501 ms oleg316-client.virtnet: 1 9276 1.86 MB/sec warmup 35 sec latency 16.416 ms oleg316-client.virtnet: 1 9688 1.87 MB/sec warmup 36 sec latency 16.424 ms oleg316-client.virtnet: 1 10478 1.99 MB/sec warmup 37 sec latency 14.651 ms oleg316-client.virtnet: 1 11034 2.06 MB/sec warmup 38 sec latency 15.700 ms oleg316-client.virtnet: 1 11422 2.02 MB/sec warmup 39 sec latency 15.566 ms oleg316-client.virtnet: 1 11836 2.01 MB/sec warmup 40 sec latency 12.790 ms oleg316-client.virtnet: 1 12438 2.04 MB/sec warmup 41 sec latency 14.485 ms oleg316-client.virtnet: 1 13013 2.03 MB/sec warmup 42 sec latency 10.687 ms oleg316-client.virtnet: 1 13634 2.11 MB/sec warmup 43 sec latency 15.059 ms oleg316-client.virtnet: 1 14415 2.21 MB/sec warmup 44 sec latency 15.855 ms oleg316-client.virtnet: 1 14726 2.17 MB/sec warmup 45 sec latency 16.558 ms oleg316-client.virtnet: 1 15211 2.14 MB/sec warmup 46 sec latency 18.017 ms oleg316-client.virtnet: 1 15789 2.18 MB/sec warmup 47 sec latency 12.517 ms oleg316-client.virtnet: 1 15900 2.14 MB/sec warmup 48 sec latency 712.880 ms oleg316-client.virtnet: 1 15900 2.10 MB/sec warmup 49 sec latency 1713.016 ms oleg316-client.virtnet: 1 15900 2.05 MB/sec warmup 50 sec latency 2713.179 ms oleg316-client.virtnet: 1 15900 2.01 MB/sec warmup 51 sec latency 3713.353 ms oleg316-client.virtnet: 1 16407 1.99 MB/sec warmup 52 sec latency 3945.152 ms oleg316-client.virtnet: 1 17373 2.09 MB/sec warmup 53 sec latency 10.845 ms oleg316-client.virtnet: 1 18111 2.16 MB/sec warmup 54 sec latency 12.071 ms oleg316-client.virtnet: 1 18579 2.13 MB/sec warmup 55 sec latency 15.504 ms oleg316-client.virtnet: 1 19213 2.17 MB/sec warmup 56 sec latency 11.053 ms oleg316-client.virtnet: 1 19622 2.14 MB/sec warmup 57 sec latency 158.843 ms oleg316-client.virtnet: 1 20379 2.15 MB/sec warmup 58 sec latency 10.117 ms oleg316-client.virtnet: 1 20790 2.19 MB/sec warmup 59 sec latency 661.274 ms oleg316-client.virtnet: 1 20790 0.00 MB/sec execute 1 sec latency 2661.544 ms oleg316-client.virtnet: 1 20790 0.00 MB/sec execute 2 sec latency 3661.700 ms oleg316-client.virtnet: 1 20790 0.00 MB/sec execute 3 sec latency 4661.983 ms oleg316-client.virtnet: 1 20790 0.00 MB/sec execute 4 sec latency 5662.185 ms oleg316-client.virtnet: 1 20790 0.00 MB/sec execute 5 sec latency 6662.276 ms oleg316-client.virtnet: 1 20790 0.00 MB/sec execute 6 sec latency 7662.434 ms oleg316-client.virtnet: 1 20790 0.00 MB/sec execute 7 sec latency 8662.657 ms oleg316-client.virtnet: 1 20790 0.00 MB/sec execute 8 sec latency 9662.978 ms oleg316-client.virtnet: 1 20790 0.00 MB/sec execute 9 sec latency 10663.163 ms oleg316-client.virtnet: 1 20790 0.00 MB/sec execute 10 sec latency 11663.354 ms oleg316-client.virtnet: 1 20790 0.00 MB/sec execute 11 sec latency 12663.517 ms oleg316-client.virtnet: 1 20790 0.00 MB/sec execute 12 sec latency 13663.623 ms oleg316-client.virtnet: 1 20790 0.00 MB/sec execute 13 sec latency 14663.742 ms oleg316-client.virtnet: 1 20790 0.00 MB/sec execute 14 sec latency 15663.929 ms oleg316-client.virtnet: 1 20790 0.00 MB/sec execute 15 sec latency 16664.093 ms oleg316-client.virtnet: 1 20790 0.00 MB/sec execute 16 sec latency 17664.223 ms oleg316-client.virtnet: 1 20795 0.00 MB/sec execute 17 sec latency 18639.064 ms oleg316-client.virtnet: 1 21676 0.36 MB/sec execute 18 sec latency 10.406 ms oleg316-client.virtnet: 1 22199 0.38 MB/sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2436 1285252 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2004 1285684 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 37604 3565860 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 12032 3594140 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 49636 7160000 1% /mnt/lustre test_70b fail mds2 4 times Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 05:39:39 (1713433179) shut down Failover mds2 to oleg316-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 05:39:53 (1713433193) targets are mounted 05:39:53 (1713433193) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2468 1285220 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2004 1285684 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 37728 3566388 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 12084 3593972 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 49812 7160360 1% /mnt/lustre test_70b fail mds1 5 times Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:40:05 (1713433205) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all execute 19 sec latency 15.077 ms oleg316-client.virtnet: 1 22915 0.59 MB/sec execute 20 sec latency 8.641 ms oleg316-client.virtnet: 1 23626 0.64 MB/sec execute 21 sec latency 8.011 ms oleg316-client.virtnet: 1 24685 0.96 MB/sec execute 22 sec latency 8.926 ms oleg316-client.virtnet: 1 25329 1.12 MB/sec execute 23 sec latency 7.871 ms oleg316-client.virtnet: 1 26006 1.15 MB/sec execute 24 sec latency 10.341 ms oleg316-client.virtnet: 1 26643 1.24 MB/sec execute 25 sec latency 14.909 ms oleg316-client.virtnet: 1 27349 1.28 MB/sec execute 26 sec latency 10.318 ms oleg316-client.virtnet: 1 28534 1.61 MB/sec execute 27 sec latency 9.196 ms oleg316-client.virtnet: 1 29114 1.61 MB/sec execute 28 sec latency 10.518 ms oleg316-client.virtnet: 1 29548 1.61 MB/sec execute 29 sec latency 16.038 ms oleg316-client.virtnet: 1 30013 1.67 MB/sec execute 30 sec latency 15.104 ms oleg316-client.virtnet: 1 30395 1.62 MB/sec execute 31 sec latency 16.339 ms oleg316-client.virtnet: 1 30815 1.62 MB/sec execute 32 sec latency 14.166 ms oleg316-client.virtnet: 1 31782 1.80 MB/sec execute 33 sec latency 15.388 ms oleg316-client.virtnet: 1 32242 1.87 MB/sec execute 34 sec latency 12.707 ms oleg316-client.virtnet: 1 32655 1.83 MB/sec execute 35 sec latency 13.738 ms oleg316-client.virtnet: 1 33374 1.92 MB/sec execute 36 sec latency 10.267 ms oleg316-client.virtnet: 1 34036 1.88 MB/sec execute 37 sec latency 10.179 ms oleg316-client.virtnet: 1 34320 1.86 MB/sec execute 38 sec latency 680.004 ms oleg316-client.virtnet: 1 34320 1.82 MB/sec execute 39 sec latency 1680.288 ms oleg316-client.virtnet: 1 34320 1.77 MB/sec execute 40 sec latency 2680.524 ms oleg316-client.virtnet: 1 34320 1.73 MB/sec execute 41 sec latency 3680.744 ms oleg316-client.virtnet: 1 34320 1.69 MB/sec execute 42 sec latency 4680.985 ms oleg316-client.virtnet: 1 35371 1.83 MB/sec execute 43 sec latency 4693.503 ms oleg316-client.virtnet: 1 36003 1.89 MB/sec execute 44 sec latency 10.539 ms oleg316-client.virtnet: 1 36567 1.89 MB/sec execute 45 sec latency 12.781 ms oleg316-client.virtnet: 1 37138 1.92 MB/sec execute 46 sec latency 11.820 ms oleg316-client.virtnet: 1 37710 1.89 MB/sec execute 47 sec latency 16.111 ms oleg316-client.virtnet: 1 38525 1.99 MB/sec execute 48 sec latency 15.152 ms oleg316-client.virtnet: 1 39327 2.08 MB/sec execute 49 sec latency 12.379 ms oleg316-client.virtnet: 1 39454 2.04 MB/sec execute 50 sec latency 701.576 ms oleg316-client.virtnet: 1 39454 2.00 MB/sec execute 51 sec latency 1701.765 ms oleg316-client.virtnet: 1 39454 1.96 MB/sec execute 52 sec latency 2702.036 ms oleg316-client.virtnet: 1 39454 1.93 MB/sec execute 53 sec latency 3702.232 ms oleg316-client.virtnet: 1 39454 1.89 MB/sec execute 54 sec latency 4702.501 ms oleg316-client.virtnet: 1 39454 1.86 MB/sec execute 55 sec latency 5702.694 ms oleg316-client.virtnet: 1 39454 1.82 MB/sec execute 56 sec latency 6702.895 ms oleg316-client.virtnet: 1 39454 1.79 MB/sec execute 57 sec latency 7703.150 ms oleg316-client.virtnet: 1 39454 1.76 MB/sec execute 58 sec latency 8703.397 ms oleg316-client.virtnet: 1 39454 1.73 MB/sec execute 59 sec latency 9703.621 ms oleg316-client.virtnet: 1 39454 1.70 MB/sec execute 60 sec latency 10703.830 ms oleg316-client.virtnet: 1 39454 1.67 MB/sec execute 61 sec latency 11703.943 ms oleg316-client.virtnet: 1 39454 1.65 MB/sec execute 62 sec latency 12704.172 ms oleg316-client.virtnet: 1 39454 1.62 MB/sec execute 63 sec latency 13704.381 ms oleg316-client.virtpdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:40:19 (1713433219) targets are mounted 05:40:19 (1713433219) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2440 1285248 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2004 1285684 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 37420 3569232 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 12060 3594076 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 49480 7163308 1% /mnt/lustre test_70b fail mds2 6 times Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 05:40:32 (1713433232) shut down Failover mds2 to oleg316-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 05:40:46 (1713433246) targets are mounted 05:40:46 (1713433246) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2464 1285224 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2004 1285684 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 37472 3568932 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 12064 3593048 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 49536 7161980 1% /mnt/lustre test_70b fail mds1 7 times Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:40:58 (1713433258) shut down net: 1 39454 1.60 MB/sec execute 64 sec latency 14704.604 ms oleg316-client.virtnet: 1 39454 1.57 MB/sec execute 65 sec latency 15704.795 ms oleg316-client.virtnet: 1 39454 1.55 MB/sec execute 66 sec latency 16705.045 ms oleg316-client.virtnet: 1 39454 1.52 MB/sec execute 67 sec latency 17705.224 ms oleg316-client.virtnet: 1 39454 1.50 MB/sec execute 68 sec latency 18705.357 ms oleg316-client.virtnet: 1 39846 1.49 MB/sec execute 69 sec latency 18907.247 ms oleg316-client.virtnet: 1 40190 1.49 MB/sec execute 70 sec latency 15.725 ms oleg316-client.virtnet: 1 40628 1.51 MB/sec execute 71 sec latency 15.042 ms oleg316-client.virtnet: 1 41028 1.49 MB/sec execute 72 sec latency 16.620 ms oleg316-client.virtnet: 1 41551 1.51 MB/sec execute 73 sec latency 13.601 ms oleg316-client.virtnet: 1 42404 1.58 MB/sec execute 74 sec latency 15.300 ms oleg316-client.virtnet: 1 42822 1.61 MB/sec execute 75 sec latency 15.556 ms oleg316-client.virtnet: 1 43156 1.60 MB/sec execute 76 sec latency 13.043 ms oleg316-client.virtnet: 1 43592 1.59 MB/sec execute 77 sec latency 15.881 ms oleg316-client.virtnet: 1 44023 1.62 MB/sec execute 78 sec latency 16.274 ms oleg316-client.virtnet: 1 44400 1.60 MB/sec execute 79 sec latency 17.231 ms oleg316-client.virtnet: 1 44926 1.60 MB/sec execute 80 sec latency 10.809 ms oleg316-client.virtnet: 1 45366 1.63 MB/sec execute 81 sec latency 16.060 ms oleg316-client.virtnet: 1 46103 1.69 MB/sec execute 82 sec latency 15.776 ms oleg316-client.virtnet: 1 46503 1.69 MB/sec execute 83 sec latency 15.801 ms oleg316-client.virtnet: 1 47106 1.68 MB/sec execute 84 sec latency 10.554 ms oleg316-client.virtnet: 1 47808 1.71 MB/sec execute 85 sec latency 15.469 ms oleg316-client.virtnet: 1 48171 1.70 MB/sec execute 86 sec latency 17.144 ms oleg316-client.virtnet: 1 49023 1.74 MB/sec execute 87 sec latency 8.947 ms oleg316-client.virtnet: 1 49767 1.80 MB/sec execute 88 sec latency 12.853 ms oleg316-client.virtnet: 1 50058 1.79 MB/sec execute 89 sec latency 16.939 ms oleg316-client.virtnet: 1 50459 1.78 MB/sec execute 90 sec latency 16.339 ms oleg316-client.virtnet: 1 50892 1.78 MB/sec execute 91 sec latency 12.366 ms oleg316-client.virtnet: 1 51158 1.79 MB/sec execute 92 sec latency 545.239 ms oleg316-client.virtnet: 1 51158 1.77 MB/sec execute 93 sec latency 1545.501 ms oleg316-client.virtnet: 1 51158 1.75 MB/sec execute 94 sec latency 2545.843 ms oleg316-client.virtnet: 1 51183 1.73 MB/sec execute 95 sec latency 3491.294 ms oleg316-client.virtnet: 1 51782 1.72 MB/sec execute 96 sec latency 10.240 ms oleg316-client.virtnet: 1 52294 1.73 MB/sec execute 97 sec latency 12.112 ms oleg316-client.virtnet: 1 53179 1.80 MB/sec execute 98 sec latency 16.207 ms oleg316-client.virtnet: 1 53549 1.80 MB/sec execute 99 sec latency 14.275 ms oleg316-client.virtnet: 1 53913 1.79 MB/sec execute 100 sec latency 63.763 ms oleg316-client.virtnet: 1 54229 1.78 MB/sec execute 101 sec latency 460.199 ms oleg316-client.virtnet: 1 54888 1.80 MB/sec execute 102 sec latency 12.090 ms oleg316-client.virtnet: 1 55244 1.79 MB/sec execute 103 sec latency 390.203 ms oleg316-client.virtnet: 1 55244 1.77 MB/sec execute 104 sec latency 1390.467 ms oleg316-client.virtnet: 1 55244 1.75 MB/sec execute 105 sec latency 2390.792 ms oleg316-client.virtnet: 1 55244 1.74 MB/sec execute 106 sec latency 3390.948 ms oleg316-client.virtnet: 1 55244 1.72 MB/sec execute 107 sec latency 4391.161 ms oleg316-client.virtnet: 1 55244 1.71 MB/sec execute 108 sec lateFailover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:41:12 (1713433272) targets are mounted 05:41:12 (1713433272) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2444 1285244 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2004 1285684 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 37556 3568552 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 12088 3593720 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 49644 7162272 1% /mnt/lustre test_70b fail mds2 8 times Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 05:41:25 (1713433285) shut down Failover mds2 to oleg316-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 05:41:39 (1713433299) targets are mounted 05:41:39 (1713433299) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2468 1285220 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2004 1285684 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 37428 3568760 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 12060 3592968 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 49488 7161728 1% /mnt/lustre ncy 5391.394 ms oleg316-client.virtnet: 1 55244 1.69 MB/sec execute 109 sec latency 6391.667 ms oleg316-client.virtnet: 1 55244 1.67 MB/sec execute 110 sec latency 7391.831 ms oleg316-client.virtnet: 1 55244 1.66 MB/sec execute 111 sec latency 8392.029 ms oleg316-client.virtnet: 1 55244 1.64 MB/sec execute 112 sec latency 9392.198 ms oleg316-client.virtnet: 1 55244 1.63 MB/sec execute 113 sec latency 10392.356 ms oleg316-client.virtnet: 1 55244 1.62 MB/sec execute 114 sec latency 11392.523 ms oleg316-client.virtnet: 1 55244 1.60 MB/sec execute 115 sec latency 12392.650 ms oleg316-client.virtnet: 1 55244 1.59 MB/sec execute 116 sec latency 13392.898 ms oleg316-client.virtnet: 1 55244 1.57 MB/sec execute 117 sec latency 14393.142 ms oleg316-client.virtnet: 1 55244 1.56 MB/sec execute 118 sec latency 15393.284 ms oleg316-client.virtnet: 1 55244 1.55 MB/sec execute 119 sec latency 16393.621 ms oleg316-client.virtnet: 1 55244 1.53 MB/sec execute 120 sec latency 17393.876 ms oleg316-client.virtnet: 1 55244 1.52 MB/sec execute 121 sec latency 18394.163 ms oleg316-client.virtnet: 1 55419 1.51 MB/sec execute 122 sec latency 18957.789 ms oleg316-client.virtnet: 1 55819 1.52 MB/sec execute 123 sec latency 15.295 ms oleg316-client.virtnet: 1 56772 1.58 MB/sec execute 124 sec latency 15.179 ms oleg316-client.virtnet: 1 57151 1.58 MB/sec execute 125 sec latency 15.269 ms oleg316-client.virtnet: 1 57642 1.57 MB/sec execute 126 sec latency 13.303 ms oleg316-client.virtnet: 1 58348 1.60 MB/sec execute 127 sec latency 8.752 ms oleg316-client.virtnet: 1 58816 1.59 MB/sec execute 128 sec latency 17.304 ms oleg316-client.virtnet: 1 59362 1.59 MB/sec execute 129 sec latency 18.874 ms oleg316-client.virtnet: 1 60353 1.66 MB/sec execute 130 sec latency 10.753 ms oleg316-client.virtnet: 1 60675 1.66 MB/sec execute 131 sec latency 15.440 ms oleg316-client.virtnet: 1 61148 1.65 MB/sec execute 132 sec latency 15.285 ms oleg316-client.virtnet: 1 61735 1.67 MB/sec execute 133 sec latency 15.515 ms oleg316-client.virtnet: 1 62106 1.66 MB/sec execute 134 sec latency 17.779 ms oleg316-client.virtnet: 1 62786 1.66 MB/sec execute 135 sec latency 8.455 ms oleg316-client.virtnet: 1 63864 1.72 MB/sec execute 136 sec latency 10.530 ms oleg316-client.virtnet: 1 64393 1.72 MB/sec execute 137 sec latency 8.676 ms oleg316-client.virtnet: 1 65048 1.72 MB/sec execute 138 sec latency 13.648 ms oleg316-client.virtnet: 1 65467 1.73 MB/sec execute 139 sec latency 15.971 ms oleg316-client.virtnet: 1 66078 1.73 MB/sec execute 140 sec latency 15.765 ms oleg316-client.virtnet: 1 67202 1.78 MB/sec execute 141 sec latency 8.778 ms oleg316-client.virtnet: 1 67803 1.80 MB/sec execute 142 sec latency 13.641 ms oleg316-client.virtnet: 1 68166 1.79 MB/sec execute 143 sec latency 15.049 ms oleg316-client.virtnet: 1 68562 1.79 MB/sec execute 144 sec latency 15.643 ms oleg316-client.virtnet: 1 68575 1.77 MB/sec execute 145 sec latency 965.834 ms oleg316-client.virtnet: 1 68575 1.76 MB/sec execute 146 sec latency 1965.987 ms oleg316-client.virtnet: 1 68575 1.75 MB/sec execute 147 sec latency 2966.130 ms oleg316-client.virtnet: 1 68880 1.76 MB/sec execute 148 sec latency 3477.151 ms oleg316-client.virtnet: 1 69511 1.75 MB/sec execute 149 sec latency 12.471 ms oleg316-client.virtnet: 1 70297 1.78 MB/sec execute 150 sec latency 14.035 ms oleg316-client.virtnet: 1 70938 1.80 MB/sec execute 151 sec latency 17.220 ms oleg316-client.virtnet: 1 71357 1.80 MB/sec execute 152 sec latency 14.820 ms oleg316-client.virtnet: 1 71test_70b fail mds1 9 times Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:41:51 (1713433311) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:42:05 (1713433325) targets are mounted 05:42:05 (1713433325) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2444 1285244 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2004 1285684 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 37512 3566664 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 12232 3593680 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 49744 7160344 1% /mnt/lustre test_70b fail mds2 10 times Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 05:42:17 (1713433337) shut down Failover mds2 to oleg316-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 05:42:31 (1713433351) targets are mounted 05:42:31 (1713433351) facet_failover done 639 1.79 MB/sec execute 153 sec latency 318.139 ms oleg316-client.virtnet: 1 72035 1.79 MB/sec execute 154 sec latency 382.339 ms oleg316-client.virtnet: 1 72658 1.80 MB/sec execute 155 sec latency 16.125 ms oleg316-client.virtnet: 1 72843 1.79 MB/sec execute 156 sec latency 721.358 ms oleg316-client.virtnet: 1 72843 1.78 MB/sec execute 157 sec latency 1721.563 ms oleg316-client.virtnet: 1 72843 1.77 MB/sec execute 158 sec latency 2721.719 ms oleg316-client.virtnet: 1 72843 1.76 MB/sec execute 159 sec latency 3721.842 ms oleg316-client.virtnet: 1 72843 1.75 MB/sec execute 160 sec latency 4722.019 ms oleg316-client.virtnet: 1 72843 1.74 MB/sec execute 161 sec latency 5722.209 ms oleg316-client.virtnet: 1 72843 1.73 MB/sec execute 162 sec latency 6722.390 ms oleg316-client.virtnet: 1 72843 1.72 MB/sec execute 163 sec latency 7722.611 ms oleg316-client.virtnet: 1 72843 1.70 MB/sec execute 164 sec latency 8722.845 ms oleg316-client.virtnet: 1 72843 1.69 MB/sec execute 165 sec latency 9723.023 ms oleg316-client.virtnet: 1 72843 1.68 MB/sec execute 166 sec latency 10723.173 ms oleg316-client.virtnet: 1 72843 1.67 MB/sec execute 167 sec latency 11723.335 ms oleg316-client.virtnet: 1 72843 1.66 MB/sec execute 168 sec latency 12723.516 ms oleg316-client.virtnet: 1 72843 1.65 MB/sec execute 169 sec latency 13723.697 ms oleg316-client.virtnet: 1 72843 1.64 MB/sec execute 170 sec latency 14723.864 ms oleg316-client.virtnet: 1 72843 1.64 MB/sec execute 171 sec latency 15724.074 ms oleg316-client.virtnet: 1 72843 1.63 MB/sec execute 172 sec latency 16724.285 ms oleg316-client.virtnet: 1 72843 1.62 MB/sec execute 173 sec latency 17724.452 ms oleg316-client.virtnet: 1 72843 1.61 MB/sec execute 174 sec latency 18724.558 ms oleg316-client.virtnet: 1 73447 1.61 MB/sec execute 175 sec latency 18747.838 ms oleg316-client.virtnet: 1 74345 1.64 MB/sec execute 176 sec latency 14.215 ms oleg316-client.virtnet: 1 74905 1.66 MB/sec execute 177 sec latency 13.770 ms oleg316-client.virtnet: 1 75229 1.65 MB/sec execute 178 sec latency 14.864 ms oleg316-client.virtnet: 1 75533 1.64 MB/sec execute 179 sec latency 16.087 ms oleg316-client.virtnet: 1 76039 1.66 MB/sec execute 180 sec latency 15.435 ms oleg316-client.virtnet: 1 76643 1.65 MB/sec execute 181 sec latency 13.196 ms oleg316-client.virtnet: 1 77725 1.68 MB/sec execute 182 sec latency 19.668 ms oleg316-client.virtnet: 1 78523 1.71 MB/sec execute 183 sec latency 7.666 ms oleg316-client.virtnet: 1 79230 1.71 MB/sec execute 184 sec latency 8.685 ms oleg316-client.virtnet: 1 80031 1.72 MB/sec execute 185 sec latency 7.874 ms oleg316-client.virtnet: 1 81015 1.75 MB/sec execute 186 sec latency 7.374 ms oleg316-client.virtnet: 1 81992 1.77 MB/sec execute 187 sec latency 7.449 ms oleg316-client.virtnet: 1 82600 1.77 MB/sec execute 188 sec latency 10.940 ms oleg316-client.virtnet: 1 83370 1.78 MB/sec execute 189 sec latency 8.351 ms oleg316-client.virtnet: 1 84212 1.79 MB/sec execute 190 sec latency 8.141 ms oleg316-client.virtnet: 1 85396 1.83 MB/sec execute 191 sec latency 7.831 ms oleg316-client.virtnet: 1 85996 1.83 MB/sec execute 192 sec latency 10.294 ms oleg316-client.virtnet: 1 86762 1.84 MB/sec execute 193 sec latency 8.131 ms oleg316-client.virtnet: 1 87536 1.84 MB/sec execute 194 sec latency 8.560 ms oleg316-client.virtnet: 1 88565 1.87 MB/sec execute 195 sec latency 8.837 ms oleg316-client.virtnet: 1 89141 1.89 MB/sec execute 196 sec latency 10.580 ms oleg316-client.virtnet: 1 89621 1.88 MB/sec execute 197 sec latency 238.96oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2476 1285212 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2004 1285684 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 37636 3568624 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 11952 3593992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 49588 7162616 1% /mnt/lustre test_70b fail mds1 11 times Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:42:42 (1713433362) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:42:55 (1713433375) targets are mounted 05:42:55 (1713433375) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2436 1285252 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2004 1285684 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 37728 3568464 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 12092 3594076 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 49820 7162540 1% /mnt/lustre test_70b fail mds2 12 times Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 05:43:07 (1713433387) shut down 1 ms oleg316-client.virtnet: 1 89621 1.87 MB/sec execute 198 sec latency 1239.202 ms oleg316-client.virtnet: 1 89621 1.86 MB/sec execute 199 sec latency 2239.391 ms oleg316-client.virtnet: 1 89726 1.85 MB/sec execute 200 sec latency 3049.670 ms oleg316-client.virtnet: 1 90515 1.87 MB/sec execute 201 sec latency 8.450 ms oleg316-client.virtnet: 1 91317 1.87 MB/sec execute 202 sec latency 7.668 ms oleg316-client.virtnet: 1 92496 1.91 MB/sec execute 203 sec latency 8.490 ms oleg316-client.virtnet: 1 93080 1.91 MB/sec execute 204 sec latency 8.601 ms oleg316-client.virtnet: 1 93501 1.91 MB/sec execute 205 sec latency 401.774 ms oleg316-client.virtnet: 1 94290 1.92 MB/sec execute 206 sec latency 8.106 ms oleg316-client.virtnet: 1 94725 1.91 MB/sec execute 207 sec latency 467.509 ms oleg316-client.virtnet: 1 94725 1.90 MB/sec execute 208 sec latency 1467.654 ms oleg316-client.virtnet: 1 94725 1.90 MB/sec execute 209 sec latency 2467.813 ms oleg316-client.virtnet: 1 94725 1.89 MB/sec execute 210 sec latency 3467.963 ms oleg316-client.virtnet: 1 94725 1.88 MB/sec execute 211 sec latency 4468.153 ms oleg316-client.virtnet: 1 94725 1.87 MB/sec execute 212 sec latency 5468.328 ms oleg316-client.virtnet: 1 94725 1.86 MB/sec execute 213 sec latency 6468.497 ms oleg316-client.virtnet: 1 94725 1.85 MB/sec execute 214 sec latency 7468.669 ms oleg316-client.virtnet: 1 94725 1.84 MB/sec execute 215 sec latency 8468.834 ms oleg316-client.virtnet: 1 94725 1.83 MB/sec execute 216 sec latency 9469.021 ms oleg316-client.virtnet: 1 94725 1.83 MB/sec execute 217 sec latency 10469.159 ms oleg316-client.virtnet: 1 94725 1.82 MB/sec execute 218 sec latency 11469.272 ms oleg316-client.virtnet: 1 94725 1.81 MB/sec execute 219 sec latency 12469.410 ms oleg316-client.virtnet: 1 94725 1.80 MB/sec execute 220 sec latency 13469.544 ms oleg316-client.virtnet: 1 94725 1.79 MB/sec execute 221 sec latency 14469.683 ms oleg316-client.virtnet: 1 94725 1.78 MB/sec execute 222 sec latency 15469.849 ms oleg316-client.virtnet: 1 94725 1.78 MB/sec execute 223 sec latency 16470.005 ms oleg316-client.virtnet: 1 94725 1.77 MB/sec execute 224 sec latency 17470.177 ms oleg316-client.virtnet: 1 95186 1.78 MB/sec execute 225 sec latency 18067.771 ms oleg316-client.virtnet: 1 96109 1.80 MB/sec execute 226 sec latency 8.255 ms oleg316-client.virtnet: 1 96714 1.80 MB/sec execute 227 sec latency 8.499 ms oleg316-client.virtnet: 1 97465 1.81 MB/sec execute 228 sec latency 8.037 ms oleg316-client.virtnet: 1 98144 1.81 MB/sec execute 229 sec latency 10.279 ms oleg316-client.virtnet: 1 99173 1.84 MB/sec execute 230 sec latency 12.367 ms oleg316-client.virtnet: 1 99597 1.85 MB/sec execute 231 sec latency 18.751 ms oleg316-client.virtnet: 1 99986 1.84 MB/sec execute 232 sec latency 16.466 ms oleg316-client.virtnet: 1 100454 1.84 MB/sec execute 233 sec latency 15.005 ms oleg316-client.virtnet: 1 100970 1.85 MB/sec execute 234 sec latency 17.797 ms oleg316-client.virtnet: 1 101707 1.85 MB/sec execute 235 sec latency 15.134 ms oleg316-client.virtnet: 1 102711 1.87 MB/sec execute 236 sec latency 10.440 ms oleg316-client.virtnet: 1 103200 1.88 MB/sec execute 237 sec latency 13.269 ms oleg316-client.virtnet: 1 103783 1.88 MB/sec execute 238 sec latency 11.090 ms oleg316-client.virtnet: 1 104364 1.89 MB/sec execute 239 sec latency 14.255 ms oleg316-client.virtnet: 1 104899 1.88 MB/sec execute 240 sec latency 12.130 ms oleg316-client.virtnet: 1 105402 1.88 MB/sec execute 241 sec latency 11.486 ms oleg316-client.virtnet: 1 106550 Failover mds2 to oleg316-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 05:43:21 (1713433401) targets are mounted 05:43:21 (1713433401) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec 1.92 MB/sec execute 242 sec latency 11.437 ms oleg316-client.virtnet: 1 107132 1.92 MB/sec execute 243 sec latency 10.535 ms oleg316-client.virtnet: 1 107882 1.93 MB/sec execute 244 sec latency 8.347 ms oleg316-client.virtnet: 1 108605 1.92 MB/sec execute 245 sec latency 8.593 ms oleg316-client.virtnet: 1 109463 1.94 MB/sec execute 246 sec latency 9.650 ms oleg316-client.virtnet: 1 110110 1.96 MB/sec execute 247 sec latency 366.093 ms oleg316-client.virtnet: 1 110110 1.95 MB/sec execute 248 sec latency 1366.316 ms oleg316-client.virtnet: 1 110110 1.94 MB/sec execute 249 sec latency 2366.520 ms oleg316-client.virtnet: 1 110146 1.93 MB/sec execute 250 sec latency 3281.493 ms oleg316-client.virtnet: 1 110526 1.93 MB/sec execute 251 sec latency 12.888 ms oleg316-client.virtnet: 1 110996 1.93 MB/sec execute 252 sec latency 15.977 ms oleg316-client.virtnet: 1 111539 1.94 MB/sec execute 253 sec latency 12.848 ms oleg316-client.virtnet: 1 112346 1.94 MB/sec execute 254 sec latency 9.093 ms oleg316-client.virtnet: 1 113013 1.95 MB/sec execute 255 sec latency 15.229 ms oleg316-client.virtnet: 1 113752 1.97 MB/sec execute 256 sec latency 13.543 ms oleg316-client.virtnet: 1 114243 1.96 MB/sec execute 257 sec latency 13.760 ms oleg316-client.virtnet: 1 114722 1.96 MB/sec execute 258 sec latency 14.931 ms oleg316-client.virtnet: 1 115295 1.97 MB/sec execute 259 sec latency 13.812 ms oleg316-client.virtnet: 1 115961 1.96 MB/sec execute 260 sec latency 8.703 ms oleg316-client.virtnet: 1 116558 1.98 MB/sec execute 261 sec latency 16.091 ms oleg316-client.virtnet: 1 117415 1.99 MB/sec execute 262 sec latency 11.740 ms oleg316-client.virtnet: 1 117813 1.99 MB/sec execute 263 sec latency 14.551 ms oleg316-client.virtnet: 1 118507 2.00 MB/sec execute 264 sec latency 9.206 ms oleg316-client.virtnet: 1 119234 1.99 MB/sec execute 265 sec latency 8.846 ms oleg316-client.virtnet: 1 119985 2.01 MB/sec execute 266 sec latency 14.859 ms oleg316-client.virtnet: 1 120874 2.03 MB/sec execute 267 sec latency 11.745 ms oleg316-client.virtnet: 1 121376 2.02 MB/sec execute 268 sec latency 13.191 ms oleg316-client.virtnet: 1 122108 2.03 MB/sec execute 269 sec latency 7.985 ms oleg316-client.virtnet: 1 122908 2.03 MB/sec execute 270 sec latency 7.910 ms oleg316-client.virtnet: 1 124138 2.06 MB/sec execute 271 sec latency 7.524 ms oleg316-client.virtnet: 1 124627 2.06 MB/sec execute 272 sec latency 11.223 ms oleg316-client.virtnet: 1 125201 2.06 MB/sec execute 273 sec latency 14.990 ms oleg316-client.virtnet: 1 125983 2.07 MB/sec execute 274 sec latency 8.095 ms oleg316-client.virtnet: 1 126788 2.07 MB/sec execute 275 sec latency 7.919 ms oleg316-client.virtnet: 1 127956 2.10 MB/sec execute 276 sec latency 8.089 ms oleg316-client.virtnet: 1 128545 2.10 MB/sec execute 277 sec latency 11.171 ms oleg316-client.virtnet: 1 129324 2.11 MB/sec execute 278 sec latency 8.865 ms oleg316-client.virtnet: 1 130135 2.10 MB/sec execute 279 sec latency 8.223 ms oleg316-client.virtnet: 1 131309 2.13 MB/sec execute 280 sec latency 7.895 ms oleg316-client.virtnet: 1 131937 2.13 MB/sec execute 281 sec latency 8.118 ms oleg316-client.virtnet: 1 132682 2.14 MB/sec execute 282 sec latency 9.998 ms oleg316-client.virtnet: 1 133442 2.14 MB/sec execute 283 sec latency 8.819 ms oleg316-client.virtnet: 1 134208 2.15 MB/sec execute 284 sec latency 13.197 ms oleg316-client.virtnet: 1 135028 2.17 MB/sec execute 285 sec latency 14.789 ms oleg316-client.virtnet: 1 135408 2.16 MB/sec execute 286 sec latency 12.368 ms oleg316-client.virtnet: 1 135764 2.16 MB/sec execute 287 sec latency 19.641 ms oleg316-client.virtnet: 1 136336 2.16 MB/sec execute 288 sec latency 11.958 ms oleg316-client.virtnet: 1 136944 2.16 MB/sec execute 289 sec latency 10.968 ms oleg316-client.virtnet: 1 137813 2.18 MB/sec execute 290 sec latency 10.511 ms oleg316-client.virtnet: 1 138748 2.19 MB/sec execute 291 sec latency 7.849 ms oleg316-client.virtnet: 1 139397 2.19 MB/sec execute 292 sec latency 8.854 ms oleg316-client.virtnet: 1 140088 2.19 MB/sec execute 293 sec latency 9.149 ms oleg316-client.virtnet: 1 140835 2.19 MB/sec execute 294 sec latency 8.303 ms oleg316-client.virtnet: 1 141940 2.22 MB/sec execute 295 sec latency 9.387 ms oleg316-client.virtnet: 1 142332 2.22 MB/sec execute 296 sec latency 13.211 ms oleg316-client.virtnet: 1 142876 2.21 MB/sec execute 297 sec latency 12.156 ms oleg316-client.virtnet: 1 143589 2.22 MB/sec execute 298 sec latency 10.398 ms oleg316-client.virtnet: 1 144366 2.22 MB/sec execute 299 sec latency 9.647 ms oleg316-client.virtnet: 1 cleanup 300 sec oleg316-client.virtnet: 0 cleanup 300 sec oleg316-client.virtnet: oleg316-client.virtnet: Operation Count AvgLat MaxLat oleg316-client.virtnet: ---------------------------------------- oleg316-client.virtnet: NTCreateX 21538 8.431 18957.772 oleg316-client.virtnet: Close 15821 1.031 5.180 oleg316-client.virtnet: Rename 914 10.858 3477.132 oleg316-client.virtnet: Unlink 4353 1.883 5.473 oleg316-client.virtnet: Qpathinfo 19530 1.913 4693.485 oleg316-client.virtnet: Qfileinfo 3407 0.140 2.530 oleg316-client.virtnet: Qfsinfo 3605 0.210 4.798 oleg316-client.virtnet: Sfileinfo 1750 3.593 11.970 oleg316-client.virtnet: Find 7584 0.390 11.330 oleg316-client.virtnet: WriteX 10651 2.608 18747.829 oleg316-client.virtnet: ReadX 33988 0.018 4.171 oleg316-client.virtnet: LockX 70 1.149 2.129 oleg316-client.virtnet: UnlockX 70 1.153 2.212 oleg316-client.virtnet: Flush 1510 5.627 460.182 oleg316-client.virtnet: oleg316-client.virtnet: Throughput 2.22011 MB/sec 1 clients 1 procs max_latency=18957.789 ms oleg316-client.virtnet: stopping dbench on /mnt/lustre/d70b.replay-single/oleg316-client.virtnet at Thu Apr 18 05:44:15 EDT 2024 with return code 0 oleg316-client.virtnet: clean dbench files on /mnt/lustre/d70b.replay-single/oleg316-client.virtnet oleg316-client.virtnet: /mnt/lustre/d70b.replay-single/oleg316-client.virtnet /mnt/lustre/d70b.replay-single/oleg316-client.virtnet oleg316-client.virtnet: removed directory: 'clients/client0/~dmtmp/COREL' oleg316-client.virtnet: removed directory: 'clients/client0/~dmtmp/PARADOX' oleg316-client.virtnet: removed directory: 'clients/client0/~dmtmp/PM' oleg316-client.virtnet: removed directory: 'clients/client0/~dmtmp/PWRPNT' oleg316-client.virtnet: removed directory: 'clients/client0/~dmtmp/WORD' oleg316-client.virtnet: removed directory: 'clients/client0/~dmtmp/EXCEL' oleg316-client.virtnet: removed directory: 'clients/client0/~dmtmp/ACCESS' oleg316-client.virtnet: removed directory: 'clients/client0/~dmtmp/WORDPRO' oleg316-client.virtnet: removed directory: 'clients/client0/~dmtmp/SEED' oleg316-client.virtnet: removed directory: 'clients/client0/~dmtmp' oleg316-client.virtnet: removed directory: 'clients/client0' oleg316-client.virtnet: removed directory: 'clients' oleg316-client.virtnet: removed 'client.txt' oleg316-client.virtnet: /mnt/lustre/d70b.replay-single/oleg316-client.virtnet oleg316-client.virtnet: dbench successfully finished PASS 70b (362s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70c: tar 2mdts recovery ============ 05:44:18 (1713433458) Starting client oleg316-client.virtnet: -o user_xattr,flock oleg316-server@tcp:/lustre /mnt/lustre Started clients oleg316-client.virtnet: 192.168.203.116@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Started tar 3372 striped dir -i0 -c2 -H fnv_1a_64 /mnt/lustre/d70c.replay-single tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 7052 1280636 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 4548 1283140 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 13288 3571860 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 7320 3578652 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20608 7150512 1% /mnt/lustre test_70c fail mds1 1 times Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:46:32 (1713433592) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:46:47 (1713433607) targets are mounted 05:46:47 (1713433607) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 7008 1280680 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 4596 1283092 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 5552 3582668 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 15364 3572760 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20916 7155428 1% /mnt/lustre test_70c fail mds2 2 times Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 05:49:11 (1713433751) shut down Failover mds2 to oleg316-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 05:49:25 (1713433765) targets are mounted 05:49:25 (1713433765) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 70c (336s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70d: mkdir/rmdir striped dir 2mdts recovery ========================================================== 05:49:56 (1713433796) Starting client oleg316-client.virtnet: -o user_xattr,flock oleg316-server@tcp:/lustre /mnt/lustre Started clients oleg316-client.virtnet: 192.168.203.116@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Started 6714 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 4932 1282756 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3924 1283764 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3598840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3599528 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7198368 1% /mnt/lustre test_70d fail mds1 1 times Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:52:16 (1713433936) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:52:31 (1713433951) targets are mounted 05:52:31 (1713433951) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 5184 1282504 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 4300 1283388 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3598840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3599528 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7198368 1% /mnt/lustre test_70d fail mds1 2 times Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:54:57 (1713434097) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:55:11 (1713434111) targets are mounted 05:55:11 (1713434111) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /home/green/git/lustre-release/lustre/tests/test-framework.sh: line 4697: 6714 Killed ( while true; do $LFS mkdir -i0 -c2 $DIR/$tdir/test || { echo "mkdir fails"; break; }; $LFS mkdir -i1 -c2 $DIR/$tdir/test1 || { echo "mkdir fails"; break; }; touch $DIR/$tdir/test/a || { echo "touch fails"; break; }; mkdir $DIR/$tdir/test/b || { echo "mkdir fails"; break; }; rm -rf $DIR/$tdir/test || { echo "rmdir fails"; ls -lR $DIR/$tdir; break; }; touch $DIR/$tdir/test1/a || { echo "touch fails"; break; }; mkdir $DIR/$tdir/test1/b || { echo "mkdir fails"; break; }; rm -rf $DIR/$tdir/test1 || { echo "rmdir fails"; ls -lR $DIR/$tdir/test1; break; }; done ) (wd: ~) PASS 70d (324s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70e: rename cross-MDT with random fails ========================================================== 05:55:22 (1713434122) debug=+ha Starting client oleg316-client.virtnet: -o user_xattr,flock oleg316-server@tcp:/lustre /mnt/lustre Started clients oleg316-client.virtnet: 192.168.203.116@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Started PID=344 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3312 1284376 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3972 1283716 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3598840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3599528 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7198368 1% /mnt/lustre test_70e fail mds1 1 times Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 05:57:37 (1713434257) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 05:57:52 (1713434272) targets are mounted 05:57:52 (1713434272) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2916 1284772 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3888 1283800 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3598840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3599528 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7198368 1% /mnt/lustre test_70e fail mds1 2 times Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 06:00:15 (1713434415) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 06:00:30 (1713434430) targets are mounted 06:00:30 (1713434430) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 70e (320s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70f: OSS O_DIRECT recovery with 1 clients ========================================================== 06:00:44 (1713434444) mount clients oleg316-client.virtnet ... Starting client oleg316-client.virtnet: -o user_xattr,flock oleg316-server@tcp:/lustre /mnt/lustre Started clients oleg316-client.virtnet: 192.168.203.116@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2728 1284960 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3892 1283796 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3580484 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3588836 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7169320 1% /mnt/lustre Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... test_70f failing OST 1 times Failing ost1 on oleg316-server Stopping /mnt/lustre-ost1 (opts:) on oleg316-server 06:00:53 (1713434453) shut down Failover ost1 to oleg316-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-OST0000 06:01:09 (1713434469) targets are mounted 06:01:09 (1713434469) facet_failover done ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2728 1284960 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3892 1283796 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 3624 3588820 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3580524 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 5184 7169344 1% /mnt/lustre ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... test_70f failing OST 2 times Failing ost1 on oleg316-server Stopping /mnt/lustre-ost1 (opts:) on oleg316-server 06:01:22 (1713434482) shut down Failover ost1 to oleg316-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-OST0000 06:01:37 (1713434497) targets are mounted 06:01:37 (1713434497) facet_failover done ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2728 1284960 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3892 1283796 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3582580 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3588836 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7171416 1% /mnt/lustre ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... test_70f failing OST 3 times Failing ost1 on oleg316-server Stopping /mnt/lustre-ost1 (opts:) on oleg316-server 06:01:51 (1713434511) shut down Failover ost1 to oleg316-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Started lustre-OST0000 06:02:06 (1713434526) targets are mounted 06:02:06 (1713434526) facet_failover done Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2728 1284960 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3892 1283796 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3563884 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3580524 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7144408 1% /mnt/lustre ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... test_70f failing OST 4 times Failing ost1 on oleg316-server Stopping /mnt/lustre-ost1 (opts:) on oleg316-server 06:02:19 (1713434539) shut down Failover ost1 to oleg316-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... Started lustre-OST0000 06:02:34 (1713434554) targets are mounted 06:02:34 (1713434554) facet_failover done ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2728 1284960 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3892 1283796 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 6696 3580604 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3576404 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 8256 7157008 1% /mnt/lustre Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... test_70f failing OST 5 times Failing ost1 on oleg316-server Stopping /mnt/lustre-ost1 (opts:) on oleg316-server 06:02:47 (1713434567) shut down Failover ost1 to oleg316-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-OST0000 ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear 06:03:02 (1713434582) targets are mounted 06:03:02 (1713434582) facet_failover done Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... ldlm.namespaces.MGC192.168.203.116@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff880135398800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff880135398800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg316-client.virtnet' ... PASS 70f (145s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 71a: mkdir/rmdir striped dir with 2 mdts recovery ========================================================== 06:03:10 (1713434590) Starting client oleg316-client.virtnet: -o user_xattr,flock oleg316-server@tcp:/lustre /mnt/lustre Started clients oleg316-client.virtnet: 192.168.203.116@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Started 15214 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 4004 1283684 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 6320 1281368 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 4204 1283484 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 4500 1283188 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre fail mds1 mds2 1 times Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 06:05:28 (1713434728) shut down Failover mds1 to oleg316-server mount facets: mds1 Failover mds2 to oleg316-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 06:05:54 (1713434754) targets are mounted 06:05:54 (1713434754) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3876 1283812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 4248 1283440 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 4020 1283668 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2348 1285340 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre fail mds2 mds1 2 times Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 06:08:29 (1713434909) shut down Failover mds1 to oleg316-server Failover mds2 to oleg316-server mount facets: mds2 mount facets: mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 06:08:44 (1713434924) targets are mounted 06:08:44 (1713434924) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec /home/green/git/lustre-release/lustre/tests/test-framework.sh: line 4697: 15214 Killed ( while true; do $LFS mkdir -i0 -c2 $DIR/$tdir/test; rmdir $DIR/$tdir/test; done ) (wd: ~) PASS 71a (347s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 73a: open(O_CREAT), unlink, replay, reconnect before open replay, close ========================================================== 06:08:59 (1713434939) multiop /mnt/lustre/f73a.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7537 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3984 1283704 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3764 1283924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre fail_loc=0x80000302 Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 06:09:05 (1713434945) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 06:09:20 (1713434960) targets are mounted 06:09:20 (1713434960) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 73a (41s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 73b: open(O_CREAT), unlink, replay, reconnect at open_replay reply, close ========================================================== 06:09:42 (1713434982) multiop /mnt/lustre/f73b.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7537 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2448 1285240 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2020 1285668 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre fail_loc=0x80000157 Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 06:09:47 (1713434987) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 06:10:02 (1713435002) targets are mounted 06:10:02 (1713435002) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 73b (43s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 74: Ensure applications don't fail waiting for OST recovery ========================================================== 06:10:27 (1713435027) Stopping clients: oleg316-client.virtnet /mnt/lustre (opts:) Stopping client oleg316-client.virtnet /mnt/lustre opts: Stopping /mnt/lustre-ost1 (opts:) on oleg316-server Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 06:10:31 (1713435031) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 06:10:45 (1713435045) targets are mounted 06:10:45 (1713435045) facet_failover done Starting client oleg316-client.virtnet: -o user_xattr,flock oleg316-server@tcp:/lustre /mnt/lustre Started clients oleg316-client.virtnet: 192.168.203.116@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-OST0000 PASS 74 (31s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80a: DNE: create remote dir, drop update rep from MDT0, fail MDT0 ========================================================== 06:11:00 (1713435060) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2452 1285236 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 06:11:06 (1713435066) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 06:11:21 (1713435081) targets are mounted 06:11:21 (1713435081) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.08 seconds: 263.09 ops/second PASS 80a (29s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80b: DNE: create remote dir, drop update rep from MDT0, fail MDT1 ========================================================== 06:11:32 (1713435092) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2492 1285196 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2492 1285196 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 06:11:45 (1713435105) shut down Failover mds2 to oleg316-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 06:12:01 (1713435121) targets are mounted 06:12:01 (1713435121) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.10 seconds: 203.48 ops/second PASS 80b (38s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80c: DNE: create remote dir, drop update rep from MDT1, fail MDT[0,1] ========================================================== 06:12:11 (1713435131) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2528 1285160 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2096 1285592 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2528 1285160 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2096 1285592 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 06:12:19 (1713435139) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 06:12:34 (1713435154) targets are mounted 06:12:34 (1713435154) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 06:12:42 (1713435162) shut down Failover mds2 to oleg316-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 06:12:57 (1713435177) targets are mounted 06:12:57 (1713435177) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.10 seconds: 208.31 ops/second PASS 80c (54s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80d: DNE: create remote dir, drop update rep from MDT1, fail 2 MDTs ========================================================== 06:13:07 (1713435187) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2564 1285124 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2564 1285124 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 06:13:24 (1713435204) shut down Failover mds1 to oleg316-server mount facets: mds1 Failover mds2 to oleg316-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 06:13:45 (1713435225) targets are mounted 06:13:45 (1713435225) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.09 seconds: 218.56 ops/second PASS 80d (59s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80e: DNE: create remote dir, drop MDT1 rep, fail MDT0 ========================================================== 06:14:08 (1713435248) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2492 1285196 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 06:14:16 (1713435256) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 06:14:31 (1713435271) targets are mounted 06:14:31 (1713435271) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.09 seconds: 213.99 ops/second PASS 80e (58s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80f: DNE: create remote dir, drop MDT1 rep, fail MDT1 ========================================================== 06:15:07 (1713435307) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2492 1285196 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 06:15:12 (1713435312) shut down Failover mds2 to oleg316-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 06:15:28 (1713435328) targets are mounted 06:15:28 (1713435328) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.10 seconds: 193.03 ops/second PASS 80f (29s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80g: DNE: create remote dir, drop MDT1 rep, fail MDT0, then MDT1 ========================================================== 06:15:38 (1713435338) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2492 1285196 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2492 1285196 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 06:15:49 (1713435349) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 06:16:04 (1713435364) targets are mounted 06:16:04 (1713435364) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 06:16:12 (1713435372) shut down Failover mds2 to oleg316-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 06:16:27 (1713435387) targets are mounted 06:16:27 (1713435387) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.10 seconds: 201.80 ops/second PASS 80g (57s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80h: DNE: create remote dir, drop MDT1 rep, fail 2 MDTs ========================================================== 06:16:37 (1713435397) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2492 1285196 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2492 1285196 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 06:16:50 (1713435410) shut down Failover mds1 to oleg316-server mount facets: mds1 Failover mds2 to oleg316-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 06:17:16 (1713435436) targets are mounted 06:17:16 (1713435436) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.09 seconds: 234.95 ops/second PASS 80h (48s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81a: DNE: unlink remote dir, drop MDT0 update rep, fail MDT1 ========================================================== 06:17:27 (1713435447) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2488 1285200 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 06:17:38 (1713435458) shut down Failover mds2 to oleg316-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 06:17:54 (1713435474) targets are mounted 06:17:54 (1713435474) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81a (34s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81b: DNE: unlink remote dir, drop MDT0 update reply, fail MDT0 ========================================================== 06:18:04 (1713435484) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2524 1285164 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2064 1285624 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 06:18:09 (1713435489) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 06:18:24 (1713435504) targets are mounted 06:18:24 (1713435504) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81b (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81c: DNE: unlink remote dir, drop MDT0 update reply, fail MDT0,MDT1 ========================================================== 06:18:34 (1713435514) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2488 1285200 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2064 1285624 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2488 1285200 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2064 1285624 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 06:18:42 (1713435522) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 06:18:57 (1713435537) targets are mounted 06:18:57 (1713435537) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 06:19:05 (1713435545) shut down Failover mds2 to oleg316-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 06:19:20 (1713435560) targets are mounted 06:19:20 (1713435560) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81c (54s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81d: DNE: unlink remote dir, drop MDT0 update reply, fail 2 MDTs ========================================================== 06:19:30 (1713435570) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2524 1285164 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2068 1285620 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2524 1285164 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2068 1285620 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 06:19:45 (1713435585) shut down Failover mds1 to oleg316-server mount facets: mds1 Failover mds2 to oleg316-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 06:20:05 (1713435605) targets are mounted 06:20:05 (1713435605) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81d (43s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81e: DNE: unlink remote dir, drop MDT1 req reply, fail MDT0 ========================================================== 06:20:14 (1713435614) fail_loc=0x119 fail_loc=0 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2488 1285200 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2068 1285620 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 06:20:18 (1713435618) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 06:20:32 (1713435632) targets are mounted 06:20:32 (1713435632) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81e (57s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81f: DNE: unlink remote dir, drop MDT1 req reply, fail MDT1 ========================================================== 06:21:13 (1713435673) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2488 1285200 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2068 1285620 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 06:21:18 (1713435678) shut down Failover mds2 to oleg316-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 06:21:33 (1713435693) targets are mounted 06:21:33 (1713435693) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81f (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81g: DNE: unlink remote dir, drop req reply, fail M0, then M1 ========================================================== 06:21:43 (1713435703) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2488 1285200 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2068 1285620 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2488 1285200 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2068 1285620 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 06:21:51 (1713435711) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 06:22:05 (1713435725) targets are mounted 06:22:05 (1713435725) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 06:22:12 (1713435732) shut down Failover mds2 to oleg316-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 06:22:26 (1713435746) targets are mounted 06:22:26 (1713435746) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81g (50s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81h: DNE: unlink remote dir, drop request reply, fail 2 MDTs ========================================================== 06:22:34 (1713435754) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2488 1285200 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2068 1285620 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2488 1285200 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2068 1285620 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 06:22:42 (1713435762) shut down Failover mds1 to oleg316-server mount facets: mds1 Failover mds2 to oleg316-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 06:23:08 (1713435788) targets are mounted 06:23:08 (1713435788) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81h (47s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 84a: stale open during export disconnect ========================================================== 06:23:23 (1713435803) fail_loc=0x80000144 total: 1 open/close in 0.00 seconds: 242.31 ops/second pdsh@oleg316-client: oleg316-client: ssh exited with exit code 5 PASS 84a (4s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 85a: check the cancellation of unused locks during recovery(IBITS) ========================================================== 06:23:29 (1713435809) before recovery: unused locks count = 201 Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 06:23:31 (1713435811) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 06:23:44 (1713435824) targets are mounted 06:23:44 (1713435824) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec after recovery: unused locks count = 101 PASS 85a (22s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 85b: check the cancellation of unused locks during recovery(EXTENT) ========================================================== 06:23:53 (1713435833) before recovery: unused locks count = 100 Failing ost1 on oleg316-server Stopping /mnt/lustre-ost1 (opts:) on oleg316-server 06:23:58 (1713435838) shut down Failover ost1 to oleg316-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-OST0000 06:24:12 (1713435852) targets are mounted 06:24:12 (1713435852) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec after recovery: unused locks count = 0 PASS 85b (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 86: umount server after clear nid_stats should not hit LBUG ========================================================== 06:24:19 (1713435859) Stopping clients: oleg316-client.virtnet /mnt/lustre (opts:) Stopping client oleg316-client.virtnet /mnt/lustre opts: mdt.lustre-MDT0000.exports.clear=0 mdt.lustre-MDT0001.exports.clear=0 Stopping /mnt/lustre-mds1 (opts:) on oleg316-server Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 Starting client oleg316-client.virtnet: -o user_xattr,flock oleg316-server@tcp:/lustre /mnt/lustre Started clients oleg316-client.virtnet: 192.168.203.116@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) PASS 86 (11s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 87a: write replay ================== 06:24:31 (1713435871) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2484 1285204 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2072 1285616 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1760 3605260 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3536 7210504 1% /mnt/lustre 8+0 records in 8+0 records out 8388608 bytes (8.4 MB) copied, 0.109356 s, 76.7 MB/s Failing ost1 on oleg316-server Stopping /mnt/lustre-ost1 (opts:) on oleg316-server 06:24:34 (1713435874) shut down Failover ost1 to oleg316-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-OST0000 06:24:48 (1713435888) targets are mounted 06:24:48 (1713435888) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec 8+0 records in 8+0 records out 8388608 bytes (8.4 MB) copied, 0.0638269 s, 131 MB/s PASS 87a (23s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 87b: write replay with changed data (checksum resend) ========================================================== 06:24:55 (1713435895) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2484 1285204 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2072 1285616 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 9968 3597052 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1760 3605260 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 11728 7202312 1% /mnt/lustre 8+0 records in 8+0 records out 8388608 bytes (8.4 MB) copied, 0.134484 s, 62.4 MB/s 8+0 records in 8+0 records out 8 bytes (8 B) copied, 0.00255641 s, 3.1 kB/s Failing ost1 on oleg316-server Stopping /mnt/lustre-ost1 (opts:) on oleg316-server 06:25:01 (1713435901) shut down Failover ost1 to oleg316-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-OST0000 06:25:15 (1713435915) targets are mounted 06:25:15 (1713435915) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec 0+1 records in 0+1 records out 72 bytes (72 B) copied, 0.00269601 s, 26.7 kB/s PASS 87b (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 88: MDS should not assign same objid to different files ========================================================== 06:25:21 (1713435921) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2488 1285200 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2072 1285616 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 9972 3597048 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1760 3605260 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 11732 7202308 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2488 1285200 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2072 1285616 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 9972 3597048 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1760 3605260 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 11732 7202308 1% /mnt/lustre before test: last_id = 17153, next_id = 17124 Creating to objid 17153 on ost lustre-OST0000... total: 31 open/close in 0.07 seconds: 429.35 ops/second total: 8 open/close in 0.02 seconds: 408.43 ops/second before recovery: last_id = 17185, next_id = 17162 Stopping /mnt/lustre-mds1 (opts:) on oleg316-server Stopping /mnt/lustre-ost1 (opts:) on oleg316-server Failover mds1 to oleg316-server Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 Failover ost1 to oleg316-server Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-OST0000 after recovery: last_id = 17193, next_id = 17162 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0257366 s, 20.4 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0220621 s, 23.8 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.024063 s, 21.8 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0221467 s, 23.7 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0253721 s, 20.7 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0258848 s, 20.3 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0236112 s, 22.2 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0231467 s, 22.7 MB/s -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17124 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17125 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17126 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17127 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17128 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17129 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17130 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17131 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17132 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17133 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17134 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17135 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17136 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17137 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17138 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17139 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17140 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17141 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17142 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17143 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17144 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17145 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17146 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17147 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17148 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17149 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17150 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17151 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17152 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17153 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17154 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17155 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17156 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17157 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17158 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17159 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17160 -rw-r--r-- 1 root root 0 Apr 18 06:25 /mnt/lustre/d88.replay-single/f-17161 -rw-r--r-- 1 root root 524288 Apr 18 06:26 /mnt/lustre/d88.replay-single/f-17165 -rw-r--r-- 1 root root 524288 Apr 18 06:26 /mnt/lustre/d88.replay-single/f-17166 -rw-r--r-- 1 root root 524288 Apr 18 06:26 /mnt/lustre/d88.replay-single/f-17167 -rw-r--r-- 1 root root 524288 Apr 18 06:26 /mnt/lustre/d88.replay-single/f-17168 -rw-r--r-- 1 root root 524288 Apr 18 06:26 /mnt/lustre/d88.replay-single/f-17169 -rw-r--r-- 1 root root 524288 Apr 18 06:26 /mnt/lustre/d88.replay-single/f-17170 -rw-r--r-- 1 root root 524288 Apr 18 06:26 /mnt/lustre/d88.replay-single/f-17171 -rw-r--r-- 1 root root 524288 Apr 18 06:26 /mnt/lustre/d88.replay-single/f-17172 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0265867 s, 19.7 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0212355 s, 24.7 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0213846 s, 24.5 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0212471 s, 24.7 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0212059 s, 24.7 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0207863 s, 25.2 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0208612 s, 25.1 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0217622 s, 24.1 MB/s PASS 88 (50s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 89: no disk space leak on late ost connection ========================================================== 06:26:13 (1713435973) Waiting for orphan cleanup... osp.lustre-OST0000-osc-MDT0000.old_sync_processed osp.lustre-OST0000-osc-MDT0001.old_sync_processed osp.lustre-OST0001-osc-MDT0000.old_sync_processed osp.lustre-OST0001-osc-MDT0001.old_sync_processed wait 40 secs maximumly for oleg316-server mds-ost sync done. Waiting for MDT destroys to complete 10+0 records in 10+0 records out 10485760 bytes (10 MB) copied, 0.0610499 s, 172 MB/s Stopping /mnt/lustre-ost1 (opts:) on oleg316-server Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 06:26:18 (1713435978) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 06:26:31 (1713435991) targets are mounted 06:26:31 (1713435991) facet_failover done Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-OST0000 Starting client: oleg316-client.virtnet: -o user_xattr,flock oleg316-server@tcp:/lustre /mnt/lustre osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 68 sec Waiting for orphan cleanup... osp.lustre-OST0000-osc-MDT0000.old_sync_processed osp.lustre-OST0000-osc-MDT0001.old_sync_processed osp.lustre-OST0001-osc-MDT0000.old_sync_processed osp.lustre-OST0001-osc-MDT0001.old_sync_processed wait 40 secs maximumly for oleg316-server mds-ost sync done. Waiting for MDT destroys to complete free_before: 7646308 free_after: 7646308 PASS 89 (108s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 90: lfs find identifies the missing striped file segments ========================================================== 06:28:04 (1713436084) Create the files Fail ost2 lustre-OST0001_UUID, display the list of affected files Stopping /mnt/lustre-ost2 (opts:) on oleg316-server General Query: lfs find /mnt/lustre/d90.replay-single /mnt/lustre/d90.replay-single /mnt/lustre/d90.replay-single/f1 /mnt/lustre/d90.replay-single/all /mnt/lustre/d90.replay-single/f0 Querying files on shutdown ost2: lfs find --obd lustre-OST0001_UUID /mnt/lustre/d90.replay-single/f1 /mnt/lustre/d90.replay-single/all Check getstripe: /home/green/git/lustre-release/lustre/utils/lfs getstripe -r --obd lustre-OST0001_UUID /mnt/lustre/d90.replay-single/f1 lmm_stripe_count: 1 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 1 obdidx objid objid group 1 17155 0x4303 0x2c0000401 * /mnt/lustre/d90.replay-single/all lmm_stripe_count: 2 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 1 obdidx objid objid group 1 17154 0x4302 0x2c0000401 * /mnt/lustre/d90.replay-single/all /mnt/lustre/d90.replay-single/f1 Failover ost2 to oleg316-server Starting ost2: -o localrecov /dev/mapper/ost2_flakey /mnt/lustre-ost2 seq.cli-lustre-OST0001-super.width=65536 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-OST0001 PASS 90 (17s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 93a: replay + reconnect ============ 06:28:24 (1713436104) 1+0 records in 1+0 records out 1024 bytes (1.0 kB) copied, 0.00275099 s, 372 kB/s fail_val=40 fail_loc=0x715 Failing ost1 on oleg316-server Stopping /mnt/lustre-ost1 (opts:) on oleg316-server 06:28:26 (1713436106) shut down Failover ost1 to oleg316-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-OST0000 06:28:41 (1713436121) targets are mounted 06:28:41 (1713436121) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec PASS 93a (60s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 93b: replay + reconnect on mds ===== 06:29:27 (1713436167) total: 20 open/close in 0.15 seconds: 136.96 ops/second fail_val=80 fail_loc=0x715 Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 06:29:29 (1713436169) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 06:29:43 (1713436183) targets are mounted 06:29:43 (1713436183) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 93b (103s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 100a: DNE: create striped dir, drop update rep from MDT1, fail MDT1 ========================================================== 06:31:12 (1713436272) fail_loc=0x1701 Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 06:31:15 (1713436275) shut down Failover mds2 to oleg316-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 06:31:29 (1713436289) targets are mounted 06:31:29 (1713436289) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec lmv_stripe_count: 2 lmv_stripe_offset: 0 lmv_hash_type: crush mdtidx FID[seq:oid:ver] 0 [0x2000347f0:0x1:0x0] 1 [0x24000abfa:0x1:0x0] total: 20 open/close in 0.10 seconds: 192.39 ops/second PASS 100a (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 100b: DNE: create striped dir, fail MDT0 ========================================================== 06:31:39 (1713436299) fail_loc=0x119 Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 06:31:41 (1713436301) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 06:31:55 (1713436315) targets are mounted 06:31:55 (1713436315) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec lmv_stripe_count: 2 lmv_stripe_offset: 0 lmv_hash_type: crush mdtidx FID[seq:oid:ver] 0 [0x2000347f0:0x4:0x0] 1 [0x24000abfa:0x4:0x0] total: 20 open/close in 0.08 seconds: 242.72 ops/second PASS 100b (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 100c: DNE: create striped dir, abort_recov_mdt mds2 ========================================================== 06:32:06 (1713436326) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2580 1285108 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2148 1285540 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3588852 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Stopping /mnt/lustre-mds2 (opts:) on oleg316-server Failover mds2 to oleg316-server Starting mds2: -o localrecov -o abort_recov_mdt /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 total: 20 open/close in 0.11 seconds: 183.82 ops/second Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 06:32:29 (1713436349) shut down Failover mds2 to oleg316-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 06:32:43 (1713436363) targets are mounted 06:32:43 (1713436363) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec lmv_stripe_count: 2 lmv_stripe_offset: 1 lmv_hash_type: crush mdtidx FID[seq:oid:ver] 1 [0x24000b3c8:0x3:0x0] 0 [0x2000347f1:0x3:0x0] total: 20 open/close in 0.09 seconds: 212.86 ops/second PASS 100c (44s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 100d: DNE: cancel update logs upon recovery abort ========================================================== 06:32:53 (1713436373) striped dir -i0 -c2 -H crush2 /mnt/lustre/d100d.replay-single total: 100 mkdir in 0.48 seconds: 206.30 ops/second lustre-MDT0001-osd [catalog]: [0x24000bb98:0x3:0x0] [index]: 00001 [logid]: [0x24000c368:0x2:0x0] lustre-MDT0000-osp-MDT0001 [catalog]: [0x200034fc1:0x3:0x0] [index]: 00001 [logid]: [0x200034fc2:0x2:0x0] [index]: 00002 [logid]: [0x200034fc2:0x3:0x0] Stopping /mnt/lustre-mds2 (opts:) on oleg316-server Failover mds2 to oleg316-server Starting mds2: -o localrecov -o abort_recovery /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 find: '/mnt/lustre/d100d.replay-single': No such file or directory find: '/mnt/lustre/d100d.replay-single': No such file or directory PASS 100d (22s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 100e: DNE: create striped dir on MDT0 and MDT1, fail MDT0, MDT1 ========================================================== 06:33:17 (1713436397) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3000 1284688 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2528 1285160 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3588852 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3000 1284688 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2528 1285160 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3588852 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre lmv_stripe_count: 2 lmv_stripe_offset: 0 lmv_hash_type: crush mdtidx FID[seq:oid:ver] 0 [0x200034fc0:0x3a:0x0] 1 [0x24000b3c9:0x3a:0x0] Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 06:33:26 (1713436406) shut down Failover mds1 to oleg316-server mount facets: mds1 Failover mds2 to oleg316-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 06:33:52 (1713436432) targets are mounted 06:33:52 (1713436432) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec lmv_stripe_count: 2 lmv_stripe_offset: 0 lmv_hash_type: crush mdtidx FID[seq:oid:ver] 0 [0x200034fc0:0x3a:0x0] 1 [0x24000b3c9:0x3a:0x0] PASS 100e (44s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 101: Shouldn't reassign precreated objs to other files after recovery ========================================================== 06:34:03 (1713436443) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2836 1284852 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2336 1285352 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3588852 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Stopping /mnt/lustre-mds1 (opts:) on oleg316-server Failover mds1 to oleg316-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 PASS 101 (43s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 102a: check resend (request lost) with multiple modify RPCs in flight ========================================================== 06:34:48 (1713436488) creating 7 files ... fail_loc=0x159 launch 7 chmod in parallel (06:34:49) ... fail_loc=0 done (06:35:05) /mnt/lustre/d102a.replay-single/file-1 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-2 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-3 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-4 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-5 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-6 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-7 has perms 0600 OK PASS 102a (19s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 102b: check resend (reply lost) with multiple modify RPCs in flight ========================================================== 06:35:09 (1713436509) creating 7 files ... fail_loc=0x15a launch 7 chmod in parallel (06:35:10) ... fail_loc=0 done (06:35:26) /mnt/lustre/d102b.replay-single/file-1 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-2 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-3 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-4 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-5 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-6 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-7 has perms 0600 OK PASS 102b (19s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 102c: check replay w/o reconstruction with multiple mod RPCs in flight ========================================================== 06:35:29 (1713436529) creating 7 files ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3000 1284688 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2408 1285280 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3580648 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3597096 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7177744 1% /mnt/lustre fail_loc=0x15a launch 7 chmod in parallel (06:35:32) ... fail_loc=0 Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 06:35:35 (1713436535) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 06:35:48 (1713436548) targets are mounted 06:35:48 (1713436548) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec done (06:35:54) /mnt/lustre/d102c.replay-single/file-1 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-2 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-3 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-4 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-5 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-6 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-7 has perms 0600 OK PASS 102c (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 102d: check replay & reconstruction with multiple mod RPCs in flight ========================================================== 06:35:57 (1713436557) creating 7 files ... fail_loc=0x15a launch 7 chmod in parallel (06:35:58) ... fail_loc=0 Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 06:36:00 (1713436560) shut down Failover mds2 to oleg316-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 06:36:13 (1713436573) targets are mounted 06:36:13 (1713436573) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec done (06:36:19) /mnt/lustre/d102d.replay-single/file-1 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-2 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-3 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-4 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-5 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-6 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-7 has perms 0600 OK PASS 102d (23s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 103: Check otr_next_id overflow ==== 06:36:22 (1713436582) fail_loc=0x80000162 total: 30 open/close in 0.07 seconds: 438.45 ops/second Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 06:36:23 (1713436583) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 06:36:37 (1713436597) targets are mounted 06:36:37 (1713436597) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 103 (21s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110a: DNE: create striped dir, fail MDT1 ========================================================== 06:36:45 (1713436605) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2948 1284740 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2408 1285280 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3580648 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3597096 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7177744 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 06:36:48 (1713436608) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 06:37:02 (1713436622) targets are mounted 06:37:02 (1713436622) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d110a.replay-single/striped_dir has type dir OK PASS 110a (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110b: DNE: create striped dir, fail MDT1 and client ========================================================== 06:37:11 (1713436631) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2956 1284732 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2416 1285272 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3580648 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3597096 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7177744 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 06:37:15 (1713436635) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 06:37:29 (1713436649) targets are mounted 06:37:29 (1713436649) facet_failover done pdsh@oleg316-client: oleg316-client: ssh exited with exit code 95 oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid pdsh@oleg316-client: oleg316-client: ssh exited with exit code 95 Starting client: oleg316-client.virtnet: -o user_xattr,flock oleg316-server@tcp:/lustre /mnt/lustre /mnt/lustre/d110b.replay-single/striped_dir has type dir OK PASS 110b (94s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110c: DNE: create striped dir, fail MDT2 ========================================================== 06:38:46 (1713436726) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2956 1284732 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2416 1285272 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3588852 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 06:38:50 (1713436730) shut down Failover mds2 to oleg316-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 06:39:04 (1713436744) targets are mounted 06:39:04 (1713436744) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d110c.replay-single/striped_dir has type dir OK PASS 110c (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110d: DNE: create striped dir, fail MDT2 and client ========================================================== 06:39:13 (1713436753) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3000 1284688 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2416 1285272 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3588852 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 06:39:17 (1713436757) shut down Failover mds2 to oleg316-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 06:39:31 (1713436771) targets are mounted 06:39:31 (1713436771) facet_failover done pdsh@oleg316-client: oleg316-client: ssh exited with exit code 95 oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid pdsh@oleg316-client: oleg316-client: ssh exited with exit code 95 Starting client: oleg316-client.virtnet: -o user_xattr,flock oleg316-server@tcp:/lustre /mnt/lustre /mnt/lustre/d110d.replay-single/striped_dir has type dir OK PASS 110d (92s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110e: DNE: create striped dir, uncommit on MDT2, fail client/MDT1/MDT2 ========================================================== 06:40:47 (1713436847) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3036 1284652 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2448 1285240 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3588852 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre df:/mnt/lustre Not a Lustre filesystem Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 06:40:53 (1713436853) shut down Failover mds1 to oleg316-server mount facets: mds1 Failover mds2 to oleg316-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 06:41:19 (1713436879) targets are mounted 06:41:19 (1713436879) facet_failover done pdsh@oleg316-client: oleg316-client: ssh exited with exit code 95 oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid pdsh@oleg316-client: oleg316-client: ssh exited with exit code 95 Starting client: oleg316-client.virtnet: -o user_xattr,flock oleg316-server@tcp:/lustre /mnt/lustre /mnt/lustre/d110e.replay-single/striped_dir has type dir OK PASS 110e (106s) debug_raw_pointers=0 debug_raw_pointers=0 SKIP: replay-single test_110f skipping excluded test 110f debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110g: DNE: create striped dir, uncommit on MDT1, fail client/MDT1/MDT2 ========================================================== 06:42:35 (1713436955) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3076 1284612 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2484 1285204 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3588852 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre df:/mnt/lustre Not a Lustre filesystem Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 06:42:47 (1713436967) shut down Failover mds1 to oleg316-server mount facets: mds1 Failover mds2 to oleg316-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 06:43:08 (1713436988) targets are mounted 06:43:08 (1713436988) facet_failover done pdsh@oleg316-client: oleg316-client: ssh exited with exit code 95 oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid pdsh@oleg316-client: oleg316-client: ssh exited with exit code 95 Starting client: oleg316-client.virtnet: -o user_xattr,flock oleg316-server@tcp:/lustre /mnt/lustre /mnt/lustre/d110g.replay-single/striped_dir has type dir OK PASS 110g (109s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111a: DNE: unlink striped dir, fail MDT1 ========================================================== 06:44:25 (1713437065) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2964 1284724 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2420 1285268 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3588852 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 06:44:29 (1713437069) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 06:44:43 (1713437083) targets are mounted 06:44:43 (1713437083) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/d111a.replay-single/striped_dir: No such file or directory PASS 111a (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111b: DNE: unlink striped dir, fail MDT2 ========================================================== 06:44:51 (1713437091) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2968 1284720 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2428 1285260 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3588852 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 06:44:55 (1713437095) shut down Failover mds2 to oleg316-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 06:45:09 (1713437109) targets are mounted 06:45:09 (1713437109) facet_failover done pdsh@oleg316-client: oleg316-client: ssh exited with exit code 95 oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid pdsh@oleg316-client: oleg316-client: ssh exited with exit code 95 Starting client: oleg316-client.virtnet: -o user_xattr,flock oleg316-server@tcp:/lustre /mnt/lustre Can't lstat /mnt/lustre/d111b.replay-single/striped_dir: No such file or directory PASS 111b (92s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111c: DNE: unlink striped dir, uncommit on MDT1, fail client/MDT1/MDT2 ========================================================== 06:46:24 (1713437184) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3012 1284676 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2424 1285264 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3588852 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre df:/mnt/lustre Not a Lustre filesystem Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 06:46:36 (1713437196) shut down Failover mds1 to oleg316-server mount facets: mds1 Failover mds2 to oleg316-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 06:46:56 (1713437216) targets are mounted 06:46:56 (1713437216) facet_failover done pdsh@oleg316-client: oleg316-client: ssh exited with exit code 95 oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid pdsh@oleg316-client: oleg316-client: ssh exited with exit code 95 Starting client: oleg316-client.virtnet: -o user_xattr,flock oleg316-server@tcp:/lustre /mnt/lustre Can't lstat /mnt/lustre/d111c.replay-single/striped_dir: No such file or directory PASS 111c (108s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111d: DNE: unlink striped dir, uncommit on MDT2, fail client/MDT1/MDT2 ========================================================== 06:48:14 (1713437294) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2972 1284716 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2424 1285264 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3588852 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre df:/mnt/lustre Not a Lustre filesystem Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 06:48:20 (1713437300) shut down Failover mds1 to oleg316-server mount facets: mds1 Failover mds2 to oleg316-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 06:48:46 (1713437326) targets are mounted 06:48:46 (1713437326) facet_failover done pdsh@oleg316-client: oleg316-client: ssh exited with exit code 95 oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid pdsh@oleg316-client: oleg316-client: ssh exited with exit code 95 Starting client: oleg316-client.virtnet: -o user_xattr,flock oleg316-server@tcp:/lustre /mnt/lustre Can't lstat /mnt/lustre/d111d.replay-single/striped_dir: No such file or directory PASS 111d (106s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111e: DNE: unlink striped dir, uncommit on MDT2, fail MDT1/MDT2 ========================================================== 06:50:02 (1713437402) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2968 1284720 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2460 1285228 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3588852 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3008 1284680 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2452 1285236 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3588852 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 06:50:08 (1713437408) shut down Failover mds1 to oleg316-server mount facets: mds1 Failover mds2 to oleg316-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 06:50:34 (1713437434) targets are mounted 06:50:34 (1713437434) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/d111e.replay-single/striped_dir: No such file or directory PASS 111e (40s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111f: DNE: unlink striped dir, uncommit on MDT1, fail MDT1/MDT2 ========================================================== 06:50:44 (1713437444) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2976 1284712 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2432 1285256 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3588852 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2976 1284712 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2420 1285268 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3588852 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 06:50:50 (1713437450) shut down Failover mds1 to oleg316-server mount facets: mds1 Failover mds2 to oleg316-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 06:51:16 (1713437476) targets are mounted 06:51:16 (1713437476) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/d111f.replay-single/striped_dir: No such file or directory PASS 111f (40s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111g: DNE: unlink striped dir, fail MDT1/MDT2 ========================================================== 06:51:25 (1713437485) UUID Inodes IUsed IFree IUse% Mounted on lustre-MDT0000_UUID 1024000 555 1023445 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1024000 318 1023682 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 262144 554 261590 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 262144 489 261655 1% /mnt/lustre[OST:1] filesystem_summary: 524118 873 523245 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2984 1284704 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2428 1285260 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3588852 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2984 1284704 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2428 1285260 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3588852 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 06:51:31 (1713437491) shut down Failover mds1 to oleg316-server mount facets: mds1 Failover mds2 to oleg316-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 06:51:57 (1713437517) targets are mounted 06:51:57 (1713437517) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/d111g.replay-single/striped_dir: No such file or directory PASS 111g (40s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112a: DNE: cross MDT rename, fail MDT1 ========================================================== 06:52:07 (1713437527) SKIP: replay-single test_112a needs >= 4 MDTs SKIP 112a (0s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112b: DNE: cross MDT rename, fail MDT2 ========================================================== 06:52:09 (1713437529) SKIP: replay-single test_112b needs >= 4 MDTs SKIP 112b (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112c: DNE: cross MDT rename, fail MDT3 ========================================================== 06:52:11 (1713437531) SKIP: replay-single test_112c needs >= 4 MDTs SKIP 112c (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112d: DNE: cross MDT rename, fail MDT4 ========================================================== 06:52:14 (1713437534) SKIP: replay-single test_112d needs >= 4 MDTs SKIP 112d (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112e: DNE: cross MDT rename, fail MDT1 and MDT2 ========================================================== 06:52:16 (1713437536) SKIP: replay-single test_112e needs >= 4 MDTs SKIP 112e (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112f: DNE: cross MDT rename, fail MDT1 and MDT3 ========================================================== 06:52:19 (1713437539) SKIP: replay-single test_112f needs >= 4 MDTs SKIP 112f (0s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112g: DNE: cross MDT rename, fail MDT1 and MDT4 ========================================================== 06:52:21 (1713437541) SKIP: replay-single test_112g needs >= 4 MDTs SKIP 112g (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112h: DNE: cross MDT rename, fail MDT2 and MDT3 ========================================================== 06:52:23 (1713437543) SKIP: replay-single test_112h needs >= 4 MDTs SKIP 112h (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112i: DNE: cross MDT rename, fail MDT2 and MDT4 ========================================================== 06:52:26 (1713437546) SKIP: replay-single test_112i needs >= 4 MDTs SKIP 112i (0s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112j: DNE: cross MDT rename, fail MDT3 and MDT4 ========================================================== 06:52:28 (1713437548) SKIP: replay-single test_112j needs >= 4 MDTs SKIP 112j (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112k: DNE: cross MDT rename, fail MDT1,MDT2,MDT3 ========================================================== 06:52:30 (1713437550) SKIP: replay-single test_112k needs >= 4 MDTs SKIP 112k (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112l: DNE: cross MDT rename, fail MDT1,MDT2,MDT4 ========================================================== 06:52:33 (1713437553) SKIP: replay-single test_112l needs >= 4 MDTs SKIP 112l (0s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112m: DNE: cross MDT rename, fail MDT1,MDT3,MDT4 ========================================================== 06:52:35 (1713437555) SKIP: replay-single test_112m needs >= 4 MDTs SKIP 112m (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112n: DNE: cross MDT rename, fail MDT2,MDT3,MDT4 ========================================================== 06:52:38 (1713437558) SKIP: replay-single test_112n needs >= 4 MDTs SKIP 112n (0s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 115: failover for create/unlink striped directory ========================================================== 06:52:40 (1713437560) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2980 1284708 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2416 1285272 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3588852 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre striped dir -i1 -c2 -H fnv_1a_64 /mnt/lustre/d115.replay-single/test_0 striped dir -i1 -c2 -H crush2 /mnt/lustre/d115.replay-single/test_1 striped dir -i1 -c2 -H crush2 /mnt/lustre/d115.replay-single/test_2 striped dir -i1 -c2 -H crush /mnt/lustre/d115.replay-single/test_3 striped dir -i1 -c2 -H crush /mnt/lustre/d115.replay-single/test_4 Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 06:52:44 (1713437564) shut down Failover mds2 to oleg316-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 06:52:57 (1713437577) targets are mounted 06:52:57 (1713437577) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3032 1284656 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2484 1285204 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3588852 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre striped dir -i0 -c2 -H fnv_1a_64 /mnt/lustre/d115.replay-single/test_0 striped dir -i0 -c2 -H all_char /mnt/lustre/d115.replay-single/test_1 striped dir -i0 -c2 -H fnv_1a_64 /mnt/lustre/d115.replay-single/test_2 striped dir -i0 -c2 -H all_char /mnt/lustre/d115.replay-single/test_3 striped dir -i0 -c2 -H crush2 /mnt/lustre/d115.replay-single/test_4 Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 06:53:07 (1713437587) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 06:53:20 (1713437600) targets are mounted 06:53:20 (1713437600) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 115 (48s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 116a: large update log master MDT recovery ========================================================== 06:53:29 (1713437609) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3072 1284616 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2548 1285140 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3588852 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre fail_loc=0x80001702 Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 06:53:33 (1713437613) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 06:53:47 (1713437627) targets are mounted 06:53:47 (1713437627) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d116a.replay-single/striped_dir has type dir OK PASS 116a (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 116b: large update log slave MDT recovery ========================================================== 06:53:55 (1713437635) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3124 1284564 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2720 1284968 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3588852 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre fail_loc=0x80001702 Failing mds2 on oleg316-server Stopping /mnt/lustre-mds2 (opts:) on oleg316-server 06:53:59 (1713437639) shut down Failover mds2 to oleg316-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0001 06:54:13 (1713437653) targets are mounted 06:54:13 (1713437653) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d116b.replay-single/striped_dir has type dir OK PASS 116b (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 117: DNE: cross MDT unlink, fail MDT1 and MDT2 ========================================================== 06:54:21 (1713437661) SKIP: replay-single test_117 needs >= 4 MDTs SKIP 117 (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 118: invalidate osp update will not cause update log corruption ========================================================== 06:54:24 (1713437664) fail_loc=0x1705 lfs setdirstripe: dirstripe error on '/mnt/lustre/d118.replay-single/striped_dir': Input/output error lfs setdirstripe: cannot create dir '/mnt/lustre/d118.replay-single/striped_dir': Input/output error lfs setdirstripe: dirstripe error on '/mnt/lustre/d118.replay-single/striped_dir1': Input/output error lfs setdirstripe: cannot create dir '/mnt/lustre/d118.replay-single/striped_dir1': Input/output error fail_loc=0x0 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3220 1284468 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2600 1285088 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3588852 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 06:54:28 (1713437668) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 06:54:41 (1713437681) targets are mounted 06:54:41 (1713437681) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 118 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 119: timeout of normal replay does not cause DNE replay fails ========================================================== 06:54:50 (1713437690) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3288 1284400 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2660 1285028 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3588852 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Stopping /mnt/lustre-mds1 (opts:) on oleg316-server Failover mds1 to oleg316-server fail_loc=0x80000714 fail_val=65 Starting mds1: -o localrecov -o recovery_time_hard=60 /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 oleg316-client.virtnet: executing wait_import_state_mount FULL mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 69 sec mdt.lustre-MDT0000.recovery_time_hard=180 PASS 119 (82s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 120: DNE fail abort should stop both normal and DNE replay ========================================================== 06:56:13 (1713437773) Replay barrier on lustre-MDT0000 Stopping /mnt/lustre-mds1 (opts:) on oleg316-server Failover mds1 to oleg316-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 find: '/mnt/lustre/d120.replay-single': No such file or directory find: '/mnt/lustre/d120.replay-single': No such file or directory PASS 120 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 121: lock replay timed out and race ========================================================== 06:56:39 (1713437799) multiop /mnt/lustre/f121.replay-single vs_s TMPPIPE=/tmp/multiop_open_wait_pipe.7537 Stopping /mnt/lustre-mds1 (opts:) on oleg316-server Failover mds1 to oleg316-server fail_loc=0x721 fail_val=0 at_max=0 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 pdsh@oleg316-client: oleg316-client: ssh exited with exit code 5 fail_loc=0x0 at_max=600 PASS 121 (193s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 130a: DoM file create (setstripe) replay ========================================================== 06:59:54 (1713437994) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3680 1284008 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2956 1284732 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3588852 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 06:59:57 (1713437997) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 07:00:12 (1713438012) targets are mounted 07:00:12 (1713438012) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 130a (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 130b: DoM file create (inherited) replay ========================================================== 07:00:20 (1713438020) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3692 1283996 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2956 1284732 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3588852 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 07:00:24 (1713438024) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 07:00:38 (1713438038) targets are mounted 07:00:38 (1713438038) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 130b (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 131a: DoM file write lock replay === 07:00:49 (1713438049) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3692 1283996 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2956 1284732 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3588852 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre 1+0 records in 1+0 records out 8 bytes (8 B) copied, 0.00170086 s, 4.7 kB/s Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 07:00:53 (1713438053) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 07:01:07 (1713438067) targets are mounted 07:01:07 (1713438067) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 131a (25s) debug_raw_pointers=0 debug_raw_pointers=0 SKIP: replay-single test_131b skipping excluded test 131b debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 132a: PFL new component instantiate replay ========================================================== 07:01:16 (1713438076) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3696 1283992 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2956 1284732 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3588852 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19944 7194096 1% /mnt/lustre 1+0 records in 1+0 records out 1048576 bytes (1.0 MB) copied, 0.0169379 s, 61.9 MB/s /mnt/lustre/f132a.replay-single lcm_layout_gen: 3 lcm_mirror_count: 1 lcm_entry_count: 2 lcme_id: 1 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 0 lcme_extent.e_end: 1048576 lmm_stripe_count: 1 lmm_stripe_size: 1048576 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 0 lmm_objects: - 0: { l_ost_idx: 0, l_fid: [0x280000401:0x4a4a:0x0] } lcme_id: 2 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 1048576 lcme_extent.e_end: EOF lmm_stripe_count: 2 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 1 lmm_objects: - 0: { l_ost_idx: 1, l_fid: [0x2c0000401:0x4a22:0x0] } - 1: { l_ost_idx: 0, l_fid: [0x280000401:0x4a4b:0x0] } Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 07:01:19 (1713438079) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 07:01:33 (1713438093) targets are mounted 07:01:33 (1713438093) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f132a.replay-single lcm_layout_gen: 1 lcm_mirror_count: 1 lcm_entry_count: 2 lcme_id: 1 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 0 lcme_extent.e_end: 1048576 lmm_stripe_count: 1 lmm_stripe_size: 1048576 lmm_pattern: raid0 lmm_layout_gen: 65535 lmm_stripe_offset: 0 lmm_objects: - 0: { l_ost_idx: 0, l_fid: [0x280000401:0x4a4a:0x0] } lcme_id: 2 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 1048576 lcme_extent.e_end: EOF lmm_stripe_count: 2 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 65535 lmm_stripe_offset: 1 lmm_objects: - 0: { l_ost_idx: 1, l_fid: [0x2c0000401:0x4a22:0x0] } - 1: { l_ost_idx: 0, l_fid: [0x280000401:0x4a4b:0x0] } PASS 132a (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 133: check resend of ongoing requests for lwp during failover ========================================================== 07:01:42 (1713438102) seq.srv-lustre-MDT0001.space=clear Starting client: oleg316-client.virtnet: -o user_xattr,flock oleg316-server@tcp:/lustre /mnt/lustre fail_val=700 fail_loc=0x80000123 Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 07:01:47 (1713438107) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 07:02:00 (1713438120) targets are mounted 07:02:00 (1713438120) facet_failover done PASS 133 (22s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 134: replay creation of a file created in a pool ========================================================== 07:02:06 (1713438126) Creating new pool oleg316-server: Pool lustre.pool_134 created Adding targets to pool oleg316-server: OST lustre-OST0001_UUID added to pool lustre.pool_134 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3780 1283908 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3000 1284688 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3588852 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2800 3604220 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20968 7193072 1% /mnt/lustre Failing mds1 on oleg316-server Stopping /mnt/lustre-mds1 (opts:) on oleg316-server 07:02:14 (1713438134) shut down Failover mds1 to oleg316-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-MDT0000 07:02:28 (1713438148) targets are mounted 07:02:28 (1713438148) facet_failover done oleg316-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Destroy the created pools: pool_134 lustre.pool_134 oleg316-server: OST lustre-OST0001_UUID removed from pool lustre.pool_134 oleg316-server: Pool lustre.pool_134 destroyed PASS 134 (35s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 135: Server failure in lock replay phase ========================================================== 07:02:43 (1713438163) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3760 1283928 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3000 1284688 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18168 3588852 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2800 3604220 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20968 7193072 1% /mnt/lustre ldlm.cancel_unused_locks_before_replay=0 Stopping /mnt/lustre-ost1 (opts:) on oleg316-server Failover ost1 to oleg316-server oleg316-server: oleg316-server.virtnet: executing load_module ../libcfs/libcfs/libcfs fail_loc=0x32d fail_val=20 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-OST0000 oleg316-client.virtnet: executing wait_import_state_mount REPLAY_LOCKS osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in REPLAY_LOCKS state after 0 sec Stopping /mnt/lustre-ost1 (opts:) on oleg316-server Failover ost1 to oleg316-server oleg316-server: oleg316-server.virtnet: executing load_module ../libcfs/libcfs/libcfs fail_loc=0 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 End of sync oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-OST0000 Stopping /mnt/lustre-ost1 (opts:-f) on oleg316-server Stopping /mnt/lustre-ost2 (opts:-f) on oleg316-server Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-OST0000 Starting ost2: -o localrecov /dev/mapper/ost2_flakey /mnt/lustre-ost2 seq.cli-lustre-OST0001-super.width=65536 oleg316-server: oleg316-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg316-client: oleg316-server: ssh exited with exit code 1 Started lustre-OST0001 ldlm.cancel_unused_locks_before_replay=1 PASS 135 (50s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 136: MDS to disconnect all OSPs first, then cleanup ldlm ========================================================== 07:03:35 (1713438215) SKIP: replay-single test_136 needs > 2 MDTs SKIP 136 (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 200: Dropping one OBD_PING should not cause disconnect ========================================================== 07:03:38 (1713438218) SKIP: replay-single test_200 Need remote client SKIP 200 (1s) debug_raw_pointers=0 debug_raw_pointers=0 == replay-single test complete, duration 8582 sec ======== 07:03:40 (1713438220) === replay-single: start cleanup 07:03:40 (1713438220) === === replay-single: finish cleanup 07:03:43 (1713438223) ===