-----============= acceptance-small: replay-single ============----- Fri Apr 19 08:49:58 EDT 2024 excepting tests: 110f 131b 59 36 === replay-single: start setup 08:50:00 (1713531000) === oleg122-client.virtnet: executing check_config_client /mnt/lustre oleg122-client.virtnet: Checking config lustre mounted on /mnt/lustre Checking servers environments Checking clients oleg122-client.virtnet environments Using TIMEOUT=20 osc.lustre-OST0000-osc-ffff8800b6ccd800.idle_timeout=debug osc.lustre-OST0001-osc-ffff8800b6ccd800.idle_timeout=debug disable quota as required oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all osd-ldiskfs.track_declares_assert=1 === replay-single: finish setup 08:50:07 (1713531007) === osp.lustre-OST0000-osc-MDT0000.prealloc_force_new_seq=1 osp.lustre-OST0001-osc-MDT0001.prealloc_force_new_seq=1 osp.lustre-OST0000-osc-MDT0001.prealloc_force_new_seq=1 osp.lustre-OST0001-osc-MDT0000.prealloc_force_new_seq=1 Creating to objid 33 on ost lustre-OST0000... Creating to objid 33 on ost lustre-OST0001... Creating to objid 33 on ost lustre-OST0000... Creating to objid 33 on ost lustre-OST0001... total: 33 open/close in 0.12 seconds: 283.22 ops/second total: 33 open/close in 0.13 seconds: 261.52 ops/second total: 33 open/close in 0.13 seconds: 255.98 ops/second total: 33 open/close in 0.13 seconds: 254.16 ops/second osp.lustre-OST0001-osc-MDT0000.prealloc_force_new_seq=0 osp.lustre-OST0000-osc-MDT0000.prealloc_force_new_seq=0 osp.lustre-OST0000-osc-MDT0001.prealloc_force_new_seq=0 osp.lustre-OST0001-osc-MDT0001.prealloc_force_new_seq=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 0a: empty replay =================== 08:50:20 (1713531020) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1844 1285844 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1796 3605224 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1796 3605224 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3592 7210448 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 08:50:23 (1713531023) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 08:50:38 (1713531038) targets are mounted 08:50:38 (1713531038) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 0a (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 0b: ensure object created after recover exists. (3284) ========================================================== 08:50:46 (1713531046) Failing ost1 on oleg122-server Stopping /mnt/lustre-ost1 (opts:) on oleg122-server 08:50:48 (1713531048) shut down Failover ost1 to oleg122-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-OST0000 08:51:01 (1713531061) targets are mounted 08:51:01 (1713531061) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec total: 20 open/close in 0.06 seconds: 317.19 ops/second - unlinked 0 (time 1713531064 ; total 0 ; last 0) total: 20 unlinks in 0 seconds: inf unlinks/second PASS 0b (19s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 0c: check replay-barrier =========== 08:51:07 (1713531067) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1840 1285848 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1812 3605208 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1796 3605224 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3608 7210432 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 08:51:10 (1713531070) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 08:51:24 (1713531084) targets are mounted 08:51:24 (1713531084) facet_failover done Starting client: oleg122-client.virtnet: -o user_xattr,flock oleg122-server@tcp:/lustre /mnt/lustre rm: cannot remove '/mnt/lustre/f0c.replay-single': No such file or directory PASS 0c (89s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 0d: expired recovery with no clients ========================================================== 08:52:38 (1713531158) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1840 1285848 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1812 3605208 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1796 3605224 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3608 7210432 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 08:52:41 (1713531161) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 08:52:55 (1713531175) targets are mounted 08:52:55 (1713531175) facet_failover done Starting client: oleg122-client.virtnet: -o user_xattr,flock oleg122-server@tcp:/lustre /mnt/lustre PASS 0d (90s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 1: simple create =================== 08:54:09 (1713531249) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1840 1285848 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1812 3605208 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1796 3605224 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3608 7210432 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 08:54:13 (1713531253) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 08:54:27 (1713531267) targets are mounted 08:54:27 (1713531267) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f1.replay-single has type file OK PASS 1 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 2a: touch ========================== 08:54:35 (1713531275) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1840 1285848 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1812 3605208 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1796 3605224 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3608 7210432 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 08:54:38 (1713531278) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 08:54:52 (1713531292) targets are mounted 08:54:52 (1713531292) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2a.replay-single has type file OK PASS 2a (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 2b: touch ========================== 08:55:01 (1713531301) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1840 1285848 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1812 3605208 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1796 3605224 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3608 7210432 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 08:55:04 (1713531304) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 08:55:18 (1713531318) targets are mounted 08:55:18 (1713531318) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2b.replay-single has type file OK PASS 2b (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 2c: setstripe replay =============== 08:55:27 (1713531327) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1840 1285848 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1812 3605208 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1796 3605224 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3608 7210432 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 08:55:30 (1713531330) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 08:55:44 (1713531344) targets are mounted 08:55:44 (1713531344) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2c.replay-single has type file OK PASS 2c (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 2d: setdirstripe replay ============ 08:55:52 (1713531352) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1840 1285848 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1688 1286000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1812 3605208 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1796 3605224 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3608 7210432 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 08:55:56 (1713531356) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 08:56:09 (1713531369) targets are mounted 08:56:09 (1713531369) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d2d.replay-single has type dir OK PASS 2d (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 2e: O_CREAT|O_EXCL create replay === 08:56:18 (1713531378) fail_loc=0x8000013b UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1924 1285764 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1812 3605208 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1796 3605224 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3608 7210432 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 08:56:22 (1713531382) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 08:56:36 (1713531396) targets are mounted 08:56:36 (1713531396) facet_failover done Succeed in opening file "/mnt/lustre/f2e.replay-single"(flags=O_CREAT) oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2e.replay-single has type file OK PASS 2e (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 3a: replay failed open(O_DIRECTORY) ========================================================== 08:56:45 (1713531405) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1924 1285764 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1812 3605208 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1796 3605224 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3608 7210432 1% /mnt/lustre Error in opening file "/mnt/lustre/f3a.replay-single"(flags=O_DIRECTORY) 20: Not a directory Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 08:56:48 (1713531408) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 08:57:01 (1713531421) targets are mounted 08:57:02 (1713531422) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f3a.replay-single has type file OK PASS 3a (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 3b: replay failed open -ENOMEM ===== 08:57:10 (1713531430) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1924 1285764 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1812 3605208 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1796 3605224 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3608 7210432 1% /mnt/lustre fail_loc=0x80000114 touch: cannot touch '/mnt/lustre/f3b.replay-single': Cannot allocate memory fail_loc=0 Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 08:57:14 (1713531434) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 08:57:28 (1713531448) targets are mounted 08:57:28 (1713531448) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f3b.replay-single: No such file or directory PASS 3b (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 3c: replay failed open -ENOMEM ===== 08:57:36 (1713531456) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1924 1285764 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1812 3605208 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1796 3605224 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3608 7210432 1% /mnt/lustre fail_loc=0x80000128 touch: cannot touch '/mnt/lustre/f3c.replay-single': Cannot allocate memory fail_loc=0 Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 08:57:40 (1713531460) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 08:57:54 (1713531474) targets are mounted 08:57:54 (1713531474) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f3c.replay-single: No such file or directory PASS 3c (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 4a: |x| 10 open(O_CREAT)s ========== 08:58:03 (1713531483) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1924 1285764 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1812 3605208 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1796 3605224 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3608 7210432 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 08:58:06 (1713531486) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 08:58:20 (1713531500) targets are mounted 08:58:20 (1713531500) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 4a (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 4b: |x| rm 10 files ================ 08:58:29 (1713531509) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1924 1285764 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1832 3605048 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1816 3605064 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3648 7210112 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 08:58:32 (1713531512) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 08:58:46 (1713531526) targets are mounted 08:58:46 (1713531526) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f4b.replay-single-*: No such file or directory PASS 4b (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 5: |x| 220 open(O_CREAT) =========== 08:58:55 (1713531535) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1924 1285764 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1832 3605188 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1816 3605204 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3648 7210392 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 08:58:59 (1713531539) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 08:59:13 (1713531553) targets are mounted 08:59:13 (1713531553) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 5 (31s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 6a: mkdir + contained create ======= 08:59:28 (1713531568) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1956 1285732 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1832 3605188 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1816 3605204 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3648 7210392 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 08:59:31 (1713531571) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 08:59:45 (1713531585) targets are mounted 08:59:45 (1713531585) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d6a.replay-single has type dir OK /mnt/lustre/d6a.replay-single/f6a.replay-single has type file OK PASS 6a (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 6b: |X| rmdir ====================== 08:59:56 (1713531596) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1952 1285736 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1832 3605188 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1816 3605204 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3648 7210392 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 08:59:59 (1713531599) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:00:13 (1713531613) targets are mounted 09:00:13 (1713531613) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/d6b.replay-single: No such file or directory PASS 6b (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 7: mkdir |X| contained create ====== 09:00:22 (1713531622) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1948 1285740 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1832 3605188 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1816 3605204 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3648 7210392 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:00:25 (1713531625) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:00:39 (1713531639) targets are mounted 09:00:39 (1713531639) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d7.replay-single has type dir OK /mnt/lustre/d7.replay-single/f7.replay-single has type file OK PASS 7 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 8: creat open |X| close ============ 09:00:47 (1713531647) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1948 1285740 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1832 3605188 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1816 3605204 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3648 7210392 1% /mnt/lustre multiop /mnt/lustre/f8.replay-single vmo_c TMPPIPE=/tmp/multiop_open_wait_pipe.7346 Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:00:50 (1713531650) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:01:04 (1713531664) targets are mounted 09:01:04 (1713531664) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f8.replay-single /mnt/lustre/f8.replay-single has type file OK PASS 8 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 9: |X| create (same inum/gen) ====== 09:01:13 (1713531673) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1948 1285740 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1832 3605188 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1816 3605204 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3648 7210392 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:01:16 (1713531676) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:01:30 (1713531690) targets are mounted 09:01:30 (1713531690) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec old_inum == 144115305935798546, new_inum == 144115305935798546 old_inum and new_inum match PASS 9 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 10: create |X| rename unlink ======= 09:01:38 (1713531698) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1948 1285740 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1832 3605188 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1816 3605204 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3648 7210392 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:01:42 (1713531702) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:01:56 (1713531716) targets are mounted 09:01:56 (1713531716) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f10.replay-single: No such file or directory PASS 10 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 11: create open write rename |X| create-old-name read ========================================================== 09:02:04 (1713531724) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1948 1285740 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1832 3605188 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1816 3605204 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3648 7210392 1% /mnt/lustre new old Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:02:07 (1713531727) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:02:21 (1713531741) targets are mounted 09:02:21 (1713531741) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec new old PASS 11 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 12: open, unlink |X| close ========= 09:02:30 (1713531750) multiop /mnt/lustre/f12.replay-single vo_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7346 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1948 1285740 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1836 3605184 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1820 3605200 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3656 7210384 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:02:33 (1713531753) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:02:47 (1713531767) targets are mounted 09:02:47 (1713531767) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 12 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 13: open chmod 0 |x| write close === 09:02:55 (1713531775) multiop /mnt/lustre/f13.replay-single vO_wc TMPPIPE=/tmp/multiop_open_wait_pipe.7346 /mnt/lustre/f13.replay-single has perms 00 OK UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1948 1285740 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1836 3605184 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1820 3605200 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3656 7210384 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:02:59 (1713531779) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:03:13 (1713531793) targets are mounted 09:03:13 (1713531793) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f13.replay-single has perms 00 OK /mnt/lustre/f13.replay-single has size 1 OK PASS 13 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 14: open(O_CREAT), unlink |X| close ========================================================== 09:03:21 (1713531801) multiop /mnt/lustre/f14.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7346 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1948 1285740 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1836 3605184 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1820 3605200 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3656 7210384 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:03:24 (1713531804) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:03:38 (1713531818) targets are mounted 09:03:38 (1713531818) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 14 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 15: open(O_CREAT), unlink |X| touch new, close ========================================================== 09:03:47 (1713531827) multiop /mnt/lustre/f15.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7346 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1948 1285740 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1836 3605184 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1820 3605200 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3656 7210384 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:03:51 (1713531831) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:04:06 (1713531846) targets are mounted 09:04:06 (1713531846) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 15 (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 16: |X| open(O_CREAT), unlink, touch new, unlink new ========================================================== 09:04:15 (1713531855) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1948 1285740 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1836 3605184 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1820 3605200 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3656 7210384 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:04:18 (1713531858) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:04:33 (1713531873) targets are mounted 09:04:33 (1713531873) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 16 (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 17: |X| open(O_CREAT), |replay| close ========================================================== 09:04:43 (1713531883) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1948 1285740 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1836 3605184 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1820 3605200 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3656 7210384 1% /mnt/lustre multiop /mnt/lustre/f17.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7346 Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:04:47 (1713531887) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:05:01 (1713531901) targets are mounted 09:05:01 (1713531901) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f17.replay-single has type file OK PASS 17 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 18: open(O_CREAT), unlink, touch new, close, touch, unlink ========================================================== 09:05:10 (1713531910) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1948 1285740 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1836 3605184 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1820 3605200 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3656 7210384 1% /mnt/lustre multiop /mnt/lustre/f18.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7346 pid: 24989 will close Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:05:13 (1713531913) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:05:28 (1713531928) targets are mounted 09:05:28 (1713531928) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 18 (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 19: mcreate, open, write, rename === 09:05:37 (1713531937) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1948 1285740 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1836 3605184 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1820 3605200 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3656 7210384 1% /mnt/lustre old Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:05:41 (1713531941) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:05:55 (1713531955) targets are mounted 09:05:55 (1713531955) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec old PASS 19 (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 20a: |X| open(O_CREAT), unlink, replay, close (test mds_cleanup_orphans) ========================================================== 09:06:06 (1713531966) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1948 1285740 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1840 3605180 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1820 3605200 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3660 7210380 1% /mnt/lustre multiop /mnt/lustre/f20a.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7346 Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:06:10 (1713531970) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:06:25 (1713531985) targets are mounted 09:06:25 (1713531985) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 20a (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 20b: write, unlink, eviction, replay (test mds_cleanup_orphans) ========================================================== 09:06:34 (1713531994) /mnt/lustre/f20b.replay-single lmm_stripe_count: 1 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 0 obdidx objid objid group 0 1122 0x462 0x280000402 10000+0 records in 10000+0 records out 40960000 bytes (41 MB) copied, 1.48388 s, 27.6 MB/s Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:06:38 (1713531998) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:06:51 (1713532011) targets are mounted 09:06:52 (1713532012) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec affected facets: mds1 oleg122-server: oleg122-server.virtnet: executing _wait_recovery_complete *.lustre-MDT0000.recovery_status 1475 oleg122-server: *.lustre-MDT0000.recovery_status status: COMPLETE Waiting for MDT destroys to complete before 3660, after 3660 PASS 20b (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 20c: check that client eviction does not affect file content ========================================================== 09:07:04 (1713532024) multiop /mnt/lustre/f20c.replay-single vOw_c TMPPIPE=/tmp/multiop_open_wait_pipe.7346 -rw-r--r-- 1 root root 1 Apr 19 09:07 /mnt/lustre/f20c.replay-single pdsh@oleg122-client: oleg122-client: ssh exited with exit code 5 PASS 20c (5s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 21: |X| open(O_CREAT), unlink touch new, replay, close (test mds_cleanup_orphans) ========================================================== 09:07:11 (1713532031) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1948 1285740 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1840 3605128 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1820 3605200 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3660 7210328 1% /mnt/lustre multiop /mnt/lustre/f21.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7346 Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:07:15 (1713532035) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:07:29 (1713532049) targets are mounted 09:07:29 (1713532049) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 21 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 22: open(O_CREAT), |X| unlink, replay, close (test mds_cleanup_orphans) ========================================================== 09:07:38 (1713532058) multiop /mnt/lustre/f22.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7346 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1948 1285740 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1840 3605128 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605168 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3664 7210296 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:07:41 (1713532061) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:07:57 (1713532077) targets are mounted 09:07:57 (1713532077) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 22 (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 23: open(O_CREAT), |X| unlink touch new, replay, close (test mds_cleanup_orphans) ========================================================== 09:08:07 (1713532087) multiop /mnt/lustre/f23.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7346 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1948 1285740 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1840 3605128 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3664 7210324 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:08:11 (1713532091) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:08:27 (1713532107) targets are mounted 09:08:27 (1713532107) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 23 (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 24: open(O_CREAT), replay, unlink, close (test mds_cleanup_orphans) ========================================================== 09:08:36 (1713532116) multiop /mnt/lustre/f24.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7346 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1948 1285740 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1840 3605128 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3664 7210324 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:08:41 (1713532121) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:08:56 (1713532136) targets are mounted 09:08:56 (1713532136) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 24 (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 25: open(O_CREAT), unlink, replay, close (test mds_cleanup_orphans) ========================================================== 09:09:06 (1713532146) multiop /mnt/lustre/f25.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7346 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1948 1285740 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1840 3605128 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3664 7210324 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:09:11 (1713532151) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:09:26 (1713532166) targets are mounted 09:09:26 (1713532166) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 25 (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 26: |X| open(O_CREAT), unlink two, close one, replay, close one (test mds_cleanup_orphans) ========================================================== 09:09:36 (1713532176) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1948 1285740 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1840 3605128 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3664 7210324 1% /mnt/lustre multiop /mnt/lustre/f26.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7346 multiop /mnt/lustre/f26.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7346 Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:09:41 (1713532181) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:09:57 (1713532197) targets are mounted 09:09:57 (1713532197) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 26 (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 27: |X| open(O_CREAT), unlink two, replay, close two (test mds_cleanup_orphans) ========================================================== 09:10:07 (1713532207) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1948 1285740 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1840 3605128 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3664 7210324 1% /mnt/lustre multiop /mnt/lustre/f27.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7346 multiop /mnt/lustre/f27.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7346 Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:10:12 (1713532212) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:10:27 (1713532227) targets are mounted 09:10:27 (1713532227) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 27 (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 28: open(O_CREAT), |X| unlink two, close one, replay, close one (test mds_cleanup_orphans) ========================================================== 09:10:37 (1713532237) multiop /mnt/lustre/f28.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7346 multiop /mnt/lustre/f28.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7346 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1948 1285740 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1840 3605128 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3664 7210324 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:10:42 (1713532242) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:10:57 (1713532257) targets are mounted 09:10:57 (1713532257) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 28 (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 29: open(O_CREAT), |X| unlink two, replay, close two (test mds_cleanup_orphans) ========================================================== 09:11:07 (1713532267) multiop /mnt/lustre/f29.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7346 multiop /mnt/lustre/f29.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7346 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1948 1285740 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1840 3605128 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3664 7210324 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:11:12 (1713532272) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:11:27 (1713532287) targets are mounted 09:11:27 (1713532287) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 29 (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 30: open(O_CREAT) two, unlink two, replay, close two (test mds_cleanup_orphans) ========================================================== 09:11:37 (1713532297) multiop /mnt/lustre/f30.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7346 multiop /mnt/lustre/f30.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7346 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1948 1285740 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1840 3605128 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3664 7210324 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:11:41 (1713532301) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:11:56 (1713532316) targets are mounted 09:11:56 (1713532316) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 30 (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 31: open(O_CREAT) two, unlink one, |X| unlink one, close two (test mds_cleanup_orphans) ========================================================== 09:12:06 (1713532326) multiop /mnt/lustre/f31.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7346 multiop /mnt/lustre/f31.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7346 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1948 1285740 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1764 1285924 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1840 3605128 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3664 7210324 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:12:11 (1713532331) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:12:26 (1713532346) targets are mounted 09:12:26 (1713532346) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 31 (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 32: close() notices client eviction; close() after client eviction ========================================================== 09:12:36 (1713532356) multiop /mnt/lustre/f32.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7346 multiop /mnt/lustre/f32.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7346 PASS 32 (4s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 33a: fid seq shouldn't be reused after abort recovery ========================================================== 09:12:42 (1713532362) total: 10 open/close in 0.07 seconds: 141.55 ops/second Replay barrier on lustre-MDT0000 Stopping /mnt/lustre-mds1 (opts:) on oleg122-server Failover mds1 to oleg122-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 total: 10 open/close in 0.04 seconds: 285.09 ops/second PASS 33a (20s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 33b: test fid seq allocation ======= 09:13:03 (1713532383) fail_loc=0x1311 total: 10 open/close in 0.05 seconds: 216.99 ops/second Replay barrier on lustre-MDT0000 Stopping /mnt/lustre-mds1 (opts:) on oleg122-server Failover mds1 to oleg122-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 total: 10 open/close in 0.05 seconds: 217.24 ops/second PASS 33b (18s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 34: abort recovery before client does replay (test mds_cleanup_orphans) ========================================================== 09:13:23 (1713532403) multiop /mnt/lustre/f34.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7346 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1980 1285708 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1796 1285892 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1840 3605128 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3664 7210324 1% /mnt/lustre Stopping /mnt/lustre-mds1 (opts:) on oleg122-server Failover mds1 to oleg122-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 PASS 34 (19s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 35: test recovery from llog for unlink op ========================================================== 09:13:44 (1713532424) fail_loc=0x80000119 Stopping /mnt/lustre-mds1 (opts:) on oleg122-server Failover mds1 to oleg122-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 rm: cannot remove '/mnt/lustre/f35.replay-single': Input/output error Can't lstat /mnt/lustre/f35.replay-single: No such file or directory PASS 35 (18s) debug_raw_pointers=0 debug_raw_pointers=0 SKIP: replay-single test_36 skipping ALWAYS excluded test 36 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 37: abort recovery before client does replay (test mds_cleanup_orphans for directories) ========================================================== 09:14:04 (1713532444) multiop /mnt/lustre/d37.replay-single/f37.replay-single vdD_c TMPPIPE=/tmp/multiop_open_wait_pipe.7346 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2052 1285636 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1860 1285828 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1840 3605128 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3664 7210324 1% /mnt/lustre Stopping /mnt/lustre-mds1 (opts:) on oleg122-server Failover mds1 to oleg122-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 PASS 37 (20s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 38: test recovery from unlink llog (test llog_gen_rec) ========================================================== 09:14:27 (1713532467) total: 800 open/close in 4.74 seconds: 168.66 ops/second - unlinked 0 (time 1713532474 ; total 0 ; last 0) total: 400 unlinks in 1 seconds: 400.000000 unlinks/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2180 1285508 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1892 1285796 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1840 3605128 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3664 7210324 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:14:40 (1713532480) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:14:56 (1713532496) targets are mounted 09:14:56 (1713532496) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec - unlinked 0 (time 1713532503 ; total 0 ; last 0) total: 400 unlinks in 2 seconds: 200.000000 unlinks/second Can't lstat /mnt/lustre/f38.replay-single-*: No such file or directory PASS 38 (42s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 39: test recovery from unlink llog (test llog_gen_rec) ========================================================== 09:15:11 (1713532511) total: 800 open/close in 4.87 seconds: 164.26 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2184 1285504 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1892 1285796 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1840 3605128 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3664 7210324 1% /mnt/lustre - unlinked 0 (time 1713532522 ; total 0 ; last 0) total: 400 unlinks in 1 seconds: 400.000000 unlinks/second Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:15:25 (1713532525) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:15:41 (1713532541) targets are mounted 09:15:41 (1713532541) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec - unlinked 0 (time 1713532549 ; total 0 ; last 0) total: 400 unlinks in 2 seconds: 200.000000 unlinks/second Can't lstat /mnt/lustre/f39.replay-single-*: No such file or directory PASS 39 (44s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 41: read from a valid osc while other oscs are invalid ========================================================== 09:15:57 (1713532557) 1+0 records in 1+0 records out 4096 bytes (4.1 kB) copied, 0.00272286 s, 1.5 MB/s 1+0 records in 1+0 records out 4096 bytes (4.1 kB) copied, 0.00385902 s, 1.1 MB/s PASS 41 (4s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 42: recovery after ost failure ===== 09:16:03 (1713532563) total: 800 open/close in 4.61 seconds: 173.51 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2208 1285480 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1892 1285796 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1844 3605176 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3668 7210372 1% /mnt/lustre - unlinked 0 (time 1713532573 ; total 0 ; last 0) total: 400 unlinks in 1 seconds: 400.000000 unlinks/second debug=-1 Failing ost1 on oleg122-server Stopping /mnt/lustre-ost1 (opts:) on oleg122-server 09:16:17 (1713532577) shut down Failover ost1 to oleg122-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-OST0000 09:16:32 (1713532592) targets are mounted 09:16:32 (1713532592) facet_failover done wait for MDS to timeout and recover debug=super ioctl neterror warning dlmtrace error emerg ha rpctrace vfstrace config console lfsck debug=super ioctl neterror warning dlmtrace error emerg ha rpctrace vfstrace config console lfsck - unlinked 0 (time 1713532633 ; total 0 ; last 0) total: 400 unlinks in 1 seconds: 400.000000 unlinks/second Can't lstat /mnt/lustre/f42.replay-single-*: No such file or directory PASS 42 (73s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 43: mds osc import failure during recovery; don't LBUG ========================================================== 09:17:18 (1713532638) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2264 1285424 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1892 1285796 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1844 3605176 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3668 7210372 1% /mnt/lustre fail_loc=0x80000204 Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:17:23 (1713532643) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:17:38 (1713532658) targets are mounted 09:17:38 (1713532658) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 43 (37s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 44a: race in target handle connect ========================================================== 09:17:57 (1713532677) at_max=40 1 of 10 (1713532679) service : cur 5 worst 5 (at 1713530967, 1711s ago) 4 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2160 1285528 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1892 1285796 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1844 3605176 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3668 7210372 1% /mnt/lustre 2 of 10 (1713532684) service : cur 5 worst 5 (at 1713530967, 1717s ago) 5 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2160 1285528 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1892 1285796 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1844 3605176 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3668 7210372 1% /mnt/lustre 3 of 10 (1713532690) service : cur 5 worst 5 (at 1713530967, 1723s ago) 5 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2160 1285528 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1892 1285796 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1844 3605176 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3668 7210372 1% /mnt/lustre 4 of 10 (1713532695) service : cur 5 worst 5 (at 1713530967, 1728s ago) 5 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2160 1285528 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1892 1285796 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1844 3605176 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3668 7210372 1% /mnt/lustre 5 of 10 (1713532701) service : cur 5 worst 5 (at 1713530967, 1734s ago) 5 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2160 1285528 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1892 1285796 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1844 3605176 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3668 7210372 1% /mnt/lustre 6 of 10 (1713532706) service : cur 5 worst 5 (at 1713530967, 1739s ago) 5 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2160 1285528 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1892 1285796 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1844 3605176 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3668 7210372 1% /mnt/lustre 7 of 10 (1713532712) service : cur 5 worst 5 (at 1713530967, 1745s ago) 5 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2160 1285528 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1892 1285796 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1844 3605176 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3668 7210372 1% /mnt/lustre 8 of 10 (1713532717) service : cur 5 worst 5 (at 1713530967, 1750s ago) 5 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2160 1285528 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1892 1285796 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1844 3605176 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3668 7210372 1% /mnt/lustre 9 of 10 (1713532723) service : cur 5 worst 5 (at 1713530967, 1756s ago) 5 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2160 1285528 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1892 1285796 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1844 3605176 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3668 7210372 1% /mnt/lustre 10 of 10 (1713532729) service : cur 5 worst 5 (at 1713530967, 1762s ago) 5 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2160 1285528 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1892 1285796 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1844 3605176 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3668 7210372 1% /mnt/lustre fail_loc=0 at_max=600 PASS 44a (60s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 44b: race in target handle connect ========================================================== 09:18:59 (1713532739) 1 of 10 (1713532739) service : cur 5 worst 5 (at 1713530967, 1772s ago) 5 4 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.201.122@tcp:/lustre 7666232 3668 7210372 1% /mnt/lustre 2 of 10 (1713532780) service : cur 40 worst 40 (at 1713532780, 0s ago) 40 5 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.201.122@tcp:/lustre 7666232 3668 7210372 1% /mnt/lustre 3 of 10 (1713532801) service : cur 40 worst 40 (at 1713532780, 21s ago) 40 5 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.201.122@tcp:/lustre 7666232 3668 7210372 1% /mnt/lustre 4 of 10 (1713532821) service : cur 40 worst 40 (at 1713532780, 41s ago) 40 5 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.201.122@tcp:/lustre 7666232 3668 7210372 1% /mnt/lustre 5 of 10 (1713532842) service : cur 40 worst 40 (at 1713532780, 62s ago) 40 5 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.201.122@tcp:/lustre 7666232 3668 7210372 1% /mnt/lustre 6 of 10 (1713532862) service : cur 40 worst 40 (at 1713532780, 82s ago) 40 5 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.201.122@tcp:/lustre 7666232 3668 7210372 1% /mnt/lustre 7 of 10 (1713532883) service : cur 40 worst 40 (at 1713532780, 103s ago) 40 5 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.201.122@tcp:/lustre 7666232 3668 7210372 1% /mnt/lustre 8 of 10 (1713532904) service : cur 40 worst 40 (at 1713532780, 124s ago) 40 5 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.201.122@tcp:/lustre 7666232 3668 7210372 1% /mnt/lustre 9 of 10 (1713532924) service : cur 40 worst 40 (at 1713532780, 144s ago) 1 40 5 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.201.122@tcp:/lustre 7666232 3668 7210372 1% /mnt/lustre 10 of 10 (1713532945) service : cur 40 worst 40 (at 1713532780, 165s ago) 40 40 5 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.201.122@tcp:/lustre 7666232 3668 7210372 1% /mnt/lustre PASS 44b (228s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 44c: race in target handle connect ========================================================== 09:22:50 (1713532970) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2160 1285528 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1892 1285796 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1844 3605176 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3668 7210372 1% /mnt/lustre total: 100 create in 0.42 seconds: 239.34 ops/second fail_loc=0x80000712 Stopping /mnt/lustre-mds1 (opts:) on oleg122-server Failover mds1 to oleg122-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 unlink(/mnt/lustre/f44c.replay-single-0) error: No such file or directory total: 0 unlinks in 0 seconds: -nan unlinks/second Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:23:10 (1713532990) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:23:24 (1713533004) targets are mounted 09:23:24 (1713533004) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec unlink(/mnt/lustre/f44c.replay-single-0) error: No such file or directory total: 0 unlinks in 0 seconds: -nan unlinks/second PASS 44c (42s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 45: Handle failed close ============ 09:23:34 (1713533014) multiop /mnt/lustre/f45.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7346 /mnt/lustre/f45.replay-single has type file OK PASS 45 (4s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 46: Don't leak file handle after open resend (3325) ========================================================== 09:23:39 (1713533019) fail_loc=0x122 fail_loc=0 Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:23:57 (1713533037) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:24:10 (1713533050) targets are mounted 09:24:10 (1713533050) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec lfs path2fid: cannot get fid for 'f46.replay-single': No such file or directory PASS 46 (39s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 47: MDS->OSC failure during precreate cleanup (2824) ========================================================== 09:24:20 (1713533060) total: 20 open/close in 0.11 seconds: 176.61 ops/second Failing ost1 on oleg122-server Stopping /mnt/lustre-ost1 (opts:) on oleg122-server 09:24:21 (1713533061) shut down Failover ost1 to oleg122-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-OST0000 09:24:36 (1713533076) targets are mounted 09:24:36 (1713533076) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec fail_loc=0x80000204 total: 20 open/close in 0.09 seconds: 225.36 ops/second - unlinked 0 (time 1713533141 ; total 0 ; last 0) total: 20 unlinks in 0 seconds: inf unlinks/second PASS 47 (83s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 48: MDS->OSC failure during precreate cleanup (2824) ========================================================== 09:25:45 (1713533145) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2192 1285496 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1924 1285764 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1844 3605176 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3668 7210372 1% /mnt/lustre total: 20 open/close in 0.13 seconds: 155.14 ops/second Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:25:49 (1713533149) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:26:03 (1713533163) targets are mounted 09:26:03 (1713533163) facet_failover done fail_loc=0x80000216 total: 20 open/close in 0.12 seconds: 163.33 ops/second - unlinked 0 (time 1713533227 ; total 0 ; last 0) total: 40 unlinks in 1 seconds: 40.000000 unlinks/second PASS 48 (84s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 50: Double OSC recovery, don't LASSERT (3812) ========================================================== 09:27:11 (1713533231) PASS 50 (7s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 52: time out lock replay (3764) ==== 09:27:20 (1713533240) multiop /mnt/lustre/f52.replay-single vs_s TMPPIPE=/tmp/multiop_open_wait_pipe.7346 fail_loc=0x80000157 Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:27:22 (1713533242) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:27:36 (1713533256) targets are mounted 09:27:36 (1713533256) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0x0 fail_loc=0x0 PASS 52 (81s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53a: |X| close request while two MDC requests in flight ========================================================== 09:28:43 (1713533323) fail_loc=0x80000115 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:28:49 (1713533329) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:29:05 (1713533345) targets are mounted 09:29:05 (1713533345) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53a.replay-single-1/f has type file OK /mnt/lustre/d53a.replay-single-2/f has type file OK PASS 53a (29s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53b: |X| open request while two MDC requests in flight ========================================================== 09:29:14 (1713533354) multiop /mnt/lustre/d53b.replay-single-1/f vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7346 fail_loc=0x80000107 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:29:19 (1713533359) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:29:33 (1713533373) targets are mounted 09:29:33 (1713533373) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53b.replay-single-1/f has type file OK /mnt/lustre/d53b.replay-single-2/f has type file OK PASS 53b (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53c: |X| open request and close request while two MDC requests in flight ========================================================== 09:29:43 (1713533383) fail_loc=0x80000107 fail_loc=0x80000115 Replay barrier on lustre-MDT0000 Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:29:49 (1713533389) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:30:03 (1713533403) targets are mounted 09:30:03 (1713533403) facet_failover done fail_loc=0 /mnt/lustre/d53c.replay-single-1/f has type file OK /mnt/lustre/d53c.replay-single-2/f has type file OK PASS 53c (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53d: close reply while two MDC requests in flight ========================================================== 09:30:12 (1713533412) fail_loc=0x8000013b fail_loc=0 Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:30:16 (1713533416) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:30:29 (1713533429) targets are mounted 09:30:29 (1713533429) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53d.replay-single-1/f has type file OK /mnt/lustre/d53d.replay-single-2/f has type file OK PASS 53d (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53e: |X| open reply while two MDC requests in flight ========================================================== 09:30:38 (1713533438) fail_loc=0x119 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:30:44 (1713533444) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:30:59 (1713533459) targets are mounted 09:30:59 (1713533459) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53e.replay-single-1/f has type file OK /mnt/lustre/d53e.replay-single-2/f has type file OK PASS 53e (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53f: |X| open reply and close reply while two MDC requests in flight ========================================================== 09:31:08 (1713533468) fail_loc=0x119 fail_loc=0x8000013b Replay barrier on lustre-MDT0000 Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:31:13 (1713533473) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:31:28 (1713533488) targets are mounted 09:31:28 (1713533488) facet_failover done fail_loc=0 /mnt/lustre/d53f.replay-single-1/f has type file OK /mnt/lustre/d53f.replay-single-2/f has type file OK PASS 53f (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53g: |X| drop open reply and close request while close and open are both in flight ========================================================== 09:31:37 (1713533497) fail_loc=0x119 fail_loc=0x80000115 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:31:43 (1713533503) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:31:58 (1713533518) targets are mounted 09:31:58 (1713533518) facet_failover done /mnt/lustre/d53g.replay-single-1/f has type file OK /mnt/lustre/d53g.replay-single-2/f has type file OK PASS 53g (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53h: open request and close reply while two MDC requests in flight ========================================================== 09:32:07 (1713533527) fail_loc=0x80000107 fail_loc=0x8000013b Replay barrier on lustre-MDT0000 Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:32:13 (1713533533) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:32:28 (1713533548) targets are mounted 09:32:28 (1713533548) facet_failover done fail_loc=0 /mnt/lustre/d53h.replay-single-1/f has type file OK /mnt/lustre/d53h.replay-single-2/f has type file OK PASS 53h (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 55: let MDS_CHECK_RESENT return the original return code instead of 0 ========================================================== 09:32:37 (1713533557) fail_loc=0x8000012b fail_loc=0x0 touch: cannot touch '/mnt/lustre/f55.replay-single': No such file or directory rm: cannot remove '/mnt/lustre/f55.replay-single': No such file or directory PASS 55 (62s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 56: don't replay a symlink open request (3440) ========================================================== 09:33:41 (1713533621) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2200 1285488 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1924 1285764 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1844 3605176 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3668 7210372 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:33:46 (1713533626) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:34:01 (1713533641) targets are mounted 09:34:01 (1713533641) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 56 (37s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 57: test recovery from llog for setattr op ========================================================== 09:34:20 (1713533660) fail_loc=0x8000012c UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2200 1285488 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1924 1285764 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1844 3605176 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3668 7210372 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:34:24 (1713533664) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:34:40 (1713533680) targets are mounted 09:34:40 (1713533680) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec affected facets: mds1 oleg122-server: oleg122-server.virtnet: executing _wait_recovery_complete *.lustre-MDT0000.recovery_status 1475 oleg122-server: *.lustre-MDT0000.recovery_status status: COMPLETE Waiting for orphan cleanup... osp.lustre-OST0000-osc-MDT0000.old_sync_processed osp.lustre-OST0000-osc-MDT0001.old_sync_processed osp.lustre-OST0001-osc-MDT0000.old_sync_processed osp.lustre-OST0001-osc-MDT0001.old_sync_processed wait 40 secs maximumly for oleg122-server mds-ost sync done. /mnt/lustre/f57.replay-single has type file OK fail_loc=0x0 PASS 57 (30s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 58a: test recovery from llog for setattr op (test llog_gen_rec) ========================================================== 09:34:52 (1713533692) fail_loc=0x8000012c total: 2500 open/close in 8.42 seconds: 296.76 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2492 1285196 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1924 1285764 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1844 3605176 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3668 7210372 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:35:07 (1713533707) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:35:22 (1713533722) targets are mounted 09:35:22 (1713533722) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0x0 - unlinked 0 (time 1713533741 ; total 0 ; last 0) total: 2500 unlinks in 4 seconds: 625.000000 unlinks/second PASS 58a (55s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 58b: test replay of setxattr op ==== 09:35:49 (1713533749) Starting client: oleg122-client.virtnet: -o user_xattr,flock oleg122-server@tcp:/lustre /mnt/lustre2 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2476 1285212 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1924 1285764 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1844 3605176 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3668 7210372 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:35:54 (1713533754) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:36:09 (1713533769) targets are mounted 09:36:09 (1713533769) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Stopping client oleg122-client.virtnet /mnt/lustre2 (opts:) oleg122-client.virtnet: executing wait_import_state_mount FULL mgc.*.mgs_server_uuid mgc.*.mgs_server_uuid in FULL state after 0 sec PASS 58b (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 58c: resend/reconstruct setxattr op ========================================================== 09:36:19 (1713533779) Starting client: oleg122-client.virtnet: -o user_xattr,flock oleg122-server@tcp:/lustre /mnt/lustre2 fail_val=0 fail_loc=0x123 fail_loc=0 fail_loc=0x119 fail_loc=0 Stopping client oleg122-client.virtnet /mnt/lustre2 (opts:) PASS 58c (128s) debug_raw_pointers=0 debug_raw_pointers=0 SKIP: replay-single test_59 skipping ALWAYS excluded test 59 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 60: test llog post recovery init vs llog unlink ========================================================== 09:38:30 (1713533910) total: 200 open/close in 0.71 seconds: 283.36 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2336 1285352 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1924 1285764 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1844 3605176 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3668 7210372 1% /mnt/lustre - unlinked 0 (time 1713533915 ; total 0 ; last 0) total: 100 unlinks in 0 seconds: inf unlinks/second Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:38:36 (1713533916) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:38:51 (1713533931) targets are mounted 09:38:51 (1713533931) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec - unlinked 0 (time 1713533937 ; total 0 ; last 0) total: 100 unlinks in 1 seconds: 100.000000 unlinks/second PASS 60 (29s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 61a: test race llog recovery vs llog cleanup ========================================================== 09:39:01 (1713533941) total: 800 open/close in 2.48 seconds: 322.59 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2348 1285340 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2020 1285668 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1844 3605176 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3668 7210372 1% /mnt/lustre - unlinked 0 (time 1713533948 ; total 0 ; last 0) total: 800 unlinks in 1 seconds: 800.000000 unlinks/second fail_val=0 fail_loc=0x80000221 Failing ost1 on oleg122-server Stopping /mnt/lustre-ost1 (opts:) on oleg122-server 09:39:11 (1713533951) shut down Failover ost1 to oleg122-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-OST0000 09:39:27 (1713533967) targets are mounted 09:39:27 (1713533967) facet_failover done Failing ost1 on oleg122-server Stopping /mnt/lustre-ost1 (opts:) on oleg122-server 09:39:39 (1713533979) shut down Failover ost1 to oleg122-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-OST0000 09:39:53 (1713533993) targets are mounted 09:39:53 (1713533993) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec fail_val=0 fail_loc=0x0 Can't lstat /mnt/lustre/d61a.replay-single/f61a.replay-single-*: No such file or directory PASS 61a (88s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 61b: test race mds llog sync vs llog cleanup ========================================================== 09:40:32 (1713534032) fail_loc=0x8000013a Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:40:34 (1713534034) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:40:48 (1713534048) targets are mounted 09:40:48 (1713534048) facet_failover done Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:41:00 (1713534060) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:41:14 (1713534074) targets are mounted 09:41:14 (1713534074) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1+0 records in 1+0 records out 4096 bytes (4.1 kB) copied, 0.00454511 s, 901 kB/s PASS 61b (49s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 61c: test race mds llog sync vs llog cleanup ========================================================== 09:41:24 (1713534084) fail_val=0 fail_loc=0x80000222 Failing ost1 on oleg122-server Stopping /mnt/lustre-ost1 (opts:) on oleg122-server 09:41:36 (1713534096) shut down Failover ost1 to oleg122-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-OST0000 09:41:51 (1713534111) targets are mounted 09:41:51 (1713534111) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec fail_val=0 fail_loc=0x0 PASS 61c (33s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 61d: error in llog_setup should cleanup the llog context correctly ========================================================== 09:41:59 (1713534119) Stopping /mnt/lustre-mds1 (opts:) on oleg122-server fail_loc=0x80000605 Starting mgs: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: mount.lustre: mount /dev/mapper/mds1_flakey at /mnt/lustre-mds1 failed: Operation not supported pdsh@oleg122-client: oleg122-server: ssh exited with exit code 95 Start of /dev/mapper/mds1_flakey on mgs failed 95 fail_loc=0 Starting mgs: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 PASS 61d (11s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 62: don't mis-drop resent replay === 09:42:13 (1713534133) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2340 1285348 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2004 1285684 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1848 3605172 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1824 3605196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3672 7210368 1% /mnt/lustre total: 25 open/close in 0.13 seconds: 191.59 ops/second fail_loc=0x80000707 Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:42:18 (1713534138) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:42:33 (1713534153) targets are mounted 09:42:33 (1713534153) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0 - unlinked 0 (time 1713534219 ; total 0 ; last 0) total: 25 unlinks in 0 seconds: inf unlinks/second PASS 62 (88s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 65a: AT: verify early replies ====== 09:43:43 (1713534223) at_history=8 at_history=8 debug=other fail_val=11000 fail_loc=0x8000050a 00000100:00001000:3.0:1713534253.945530:0:14681:0:(client.c:537:ptlrpc_at_recv_early_reply()) @@@ Early reply #1, new deadline in 35s (25s) req@ffff8800a884b480 x1796767414521728/t0(0) o101->lustre-MDT0000-mdc-ffff8800aa134800@192.168.201.122@tcp:12/10 lens 664/66320 e 1 to 0 dl 1713534288 ref 2 fl Rpc:PQr/200/ffffffff rc 0/-1 job:'createmany.0' uid:0 gid:0 portal 12 : cur 36 worst 40 (at 1713532780, 1479s ago) 36 40 40 40 portal 29 : cur 5 worst 5 (at 1713531245, 3014s ago) 5 0 0 0 portal 23 : cur 5 worst 5 (at 1713531245, 3014s ago) 5 5 5 5 portal 30 : cur 5 worst 5 (at 1713531250, 3009s ago) 5 0 0 5 portal 17 : cur 5 worst 5 (at 1713531329, 2930s ago) 5 5 0 5 portal 13 : cur 5 worst 5 (at 1713531701, 2558s ago) 5 5 0 0 portal 12 : cur 5 worst 40 (at 1713532780, 1488s ago) 36 40 40 40 portal 29 : cur 5 worst 5 (at 1713531245, 3023s ago) 5 0 0 0 portal 23 : cur 5 worst 5 (at 1713531245, 3023s ago) 5 5 5 5 portal 30 : cur 5 worst 5 (at 1713531250, 3018s ago) 5 0 0 5 portal 17 : cur 5 worst 5 (at 1713531329, 2939s ago) 5 5 0 5 portal 13 : cur 5 worst 5 (at 1713531701, 2567s ago) 5 5 0 0 PASS 65a (47s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 65b: AT: verify early replies on packed reply / bulk ========================================================== 09:44:32 (1713534272) at_history=8 at_history=8 debug=other trace fail_val=11 fail_loc=0x224 fail_loc=0 00000100:00001000:0.0:1713534302.489605:0:1878:0:(client.c:537:ptlrpc_at_recv_early_reply()) @@@ Early reply #1, new deadline in 35s (25s) req@ffff880136bd9180 x1796767414531712/t0(0) o4->lustre-OST0000-osc-ffff8800aa134800@192.168.201.122@tcp:6/4 lens 4584/448 e 1 to 0 dl 1713534337 ref 2 fl Rpc:Qr/200/ffffffff rc 0/-1 job:'multiop.0' uid:0 gid:0 debug=super ioctl neterror warning dlmtrace error emerg ha rpctrace vfstrace config console lfsck debug=super ioctl neterror warning dlmtrace error emerg ha rpctrace vfstrace config console lfsck portal 28 : cur 5 worst 5 (at 1713531245, 3063s ago) 5 5 5 5 portal 7 : cur 5 worst 5 (at 1713531247, 3061s ago) 5 5 5 5 portal 17 : cur 5 worst 5 (at 1713531302, 3006s ago) 5 0 5 5 portal 6 : cur 36 worst 36 (at 1713534307, 1s ago) 36 5 0 0 PASS 65b (38s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 66a: AT: verify MDT service time adjusts with no early replies ========================================================== 09:45:12 (1713534312) at_history=8 at_history=8 portal 12 : cur 5 worst 40 (at 1713532780, 1555s ago) 36 40 40 40 fail_val=5000 fail_loc=0x8000050a portal 12 : cur 5 worst 40 (at 1713532780, 1561s ago) 36 40 40 40 fail_val=10000 fail_loc=0x8000050a portal 12 : cur 36 worst 40 (at 1713532780, 1572s ago) 36 40 40 40 fail_loc=0 portal 12 : cur 5 worst 40 (at 1713532780, 1581s ago) 36 40 40 40 Current MDT timeout 5, worst 40 PASS 66a (51s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 66b: AT: verify net latency adjusts ========================================================== 09:46:05 (1713534365) at_history=8 at_history=8 fail_val=10 fail_loc=0x50c fail_loc=0 network timeout orig 5, cur 10, worst 10 PASS 66b (95s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 67a: AT: verify slow request processing doesn't induce reconnects ========================================================== 09:47:42 (1713534462) at_history=8 at_history=8 fail_val=400 fail_loc=0x50a fail_loc=0 0 osc reconnect attempts on gradual slow PASS 67a (75s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 67b: AT: verify instant slowdown doesn't induce reconnects ========================================================== 09:48:59 (1713534539) at_history=8 at_history=8 Creating to objid 5089 on ost lustre-OST0000... fail_val=20000 fail_loc=0x80000223 total: 18 open/close in 0.09 seconds: 198.48 ops/second Connected clients: oleg122-client.virtnet oleg122-client.virtnet service : cur 5 worst 5 (at 1713530976, 3588s ago) 1 0 1 1 phase 2 0 osc reconnect attempts on instant slow fail_loc=0x80000223 fail_loc=0 Connected clients: oleg122-client.virtnet oleg122-client.virtnet service : cur 5 worst 5 (at 1713530976, 3590s ago) 1 0 1 1 0 osc reconnect attempts on 2nd slow PASS 67b (29s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 68: AT: verify slowing locks ======= 09:49:30 (1713534570) at_history=8 at_history=8 /home/green/git/lustre-release/lustre/tests/replay-single.sh: line 1970: $ldlm_enqueue_min: ambiguous redirect fail_val=19 fail_loc=0x80000312 fail_val=25 fail_loc=0x80000312 fail_loc=0 /home/green/git/lustre-release/lustre/tests/replay-single.sh: line 1985: $ldlm_enqueue_min: ambiguous redirect PASS 68 (70s) debug_raw_pointers=0 debug_raw_pointers=0 Cleaning up AT ... debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70a: check multi client t-f ======== 09:50:42 (1713534642) SKIP: replay-single test_70a Need two or more clients, have 1 SKIP 70a (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70b: dbench 2mdts recovery; 1 clients ========================================================== 09:50:46 (1713534646) Starting client oleg122-client.virtnet: -o user_xattr,flock oleg122-server@tcp:/lustre /mnt/lustre Started clients oleg122-client.virtnet: 192.168.201.122@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) striped dir -i0 -c2 -H all_char /mnt/lustre/d70b.replay-single + MISSING_DBENCH_OK= + PATH=/opt/iozone/bin:/opt/iozone/bin:/home/green/git/lustre-release/lustre/tests/mpi:/home/green/git/lustre-release/lustre/tests/racer:/home/green/git/lustre-release/lustre/../lustre-iokit/sgpdd-survey:/home/green/git/lustre-release/lustre/tests:/home/green/git/lustre-release/lustre/utils/gss:/home/green/git/lustre-release/lustre/utils:/opt/iozone/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin::/home/green/git/lustre-release/lustre/utils:/home/green/git/lustre-release/lustre/tests:/sbin:/usr/sbin:/home/green/git/lustre-release/lustre/utils:/home/green/git/lustre-release/lustre/tests/: + DBENCH_LIB= + TESTSUITE=replay-single + TESTNAME=test_70b + MOUNT=/mnt/lustre ++ hostname + DIR=/mnt/lustre/d70b.replay-single/oleg122-client.virtnet + LCTL=/home/green/git/lustre-release/lustre/utils/lctl + rundbench 1 -t 300 dbench: no process found Started rundbench load pid=21812 ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2512 1285176 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2112 1285576 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1936 3594044 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 26404 3580616 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 28340 7174660 1% /mnt/lustre test_70b fail mds1 1 times Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:50:55 (1713534655) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:51:11 (1713534671) targets are mounted 09:51:11 (1713534671) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2472 1285216 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2072 1285616 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 2104 3590936 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 26796 3568380 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 28900 7159316 1% /mnt/lustre oleg122-client.virtnet: looking for dbench program oleg122-client.virtnet: /usr/bin/dbench oleg122-client.virtnet: creating output directory /mnt/lustre/d70b.replay-single/oleg122-client.virtnet oleg122-client.virtnet: mkdir: created directory '/mnt/lustre/d70b.replay-single/oleg122-client.virtnet' oleg122-client.virtnet: found dbench client file /usr/share/dbench/client.txt oleg122-client.virtnet: '/usr/share/dbench/client.txt' -> 'client.txt' oleg122-client.virtnet: running 'dbench 1 -t 300' on /mnt/lustre/d70b.replay-single/oleg122-client.virtnet at Fri Apr 19 09:50:47 EDT 2024 oleg122-client.virtnet: waiting for dbench pid 21837 oleg122-client.virtnet: dbench version 4.00 - Copyright Andrew Tridgell 1999-2004 oleg122-client.virtnet: oleg122-client.virtnet: Running for 300 seconds with load 'client.txt' and minimum warmup 60 secs oleg122-client.virtnet: failed to create barrier semaphore oleg122-client.virtnet: 0 of 1 processes prepared for launch 0 sec oleg122-client.virtnet: 1 of 1 processes prepared for launch 0 sec oleg122-client.virtnet: releasing clients oleg122-client.virtnet: 1 289 10.70 MB/sec warmup 1 sec latency 24.057 ms oleg122-client.virtnet: 1 669 10.58 MB/sec warmup 2 sec latency 20.250 ms oleg122-client.virtnet: 1 1095 7.64 MB/sec warmup 3 sec latency 16.793 ms oleg122-client.virtnet: 1 1481 6.51 MB/sec warmup 4 sec latency 255.327 ms oleg122-client.virtnet: 1 1777 5.26 MB/sec warmup 5 sec latency 686.817 ms oleg122-client.virtnet: 1 2428 4.68 MB/sec warmup 6 sec latency 16.070 ms oleg122-client.virtnet: 1 3125 4.85 MB/sec warmup 7 sec latency 15.902 ms oleg122-client.virtnet: 1 3204 4.28 MB/sec warmup 8 sec latency 896.111 ms oleg122-client.virtnet: 1 3204 3.81 MB/sec warmup 9 sec latency 1896.278 ms oleg122-client.virtnet: 1 3204 3.43 MB/sec warmup 10 sec latency 2896.488 ms oleg122-client.virtnet: 1 3204 3.12 MB/sec warmup 11 sec latency 3896.697 ms oleg122-client.virtnet: 1 3204 2.86 MB/sec warmup 12 sec latency 4896.910 ms oleg122-client.virtnet: 1 3204 2.64 MB/sec warmup 13 sec latency 5897.102 ms oleg122-client.virtnet: 1 3204 2.45 MB/sec warmup 14 sec latency 6897.272 ms oleg122-client.virtnet: 1 3204 2.29 MB/sec warmup 15 sec latency 7897.459 ms oleg122-client.virtnet: 1 3204 2.14 MB/sec warmup 16 sec latency 8897.745 ms oleg122-client.virtnet: 1 3204 2.02 MB/sec warmup 17 sec latency 9898.018 ms oleg122-client.virtnet: 1 3204 1.90 MB/sec warmup 18 sec latency 10898.209 ms oleg122-client.virtnet: 1 3204 1.80 MB/sec warmup 19 sec latency 11898.388 ms oleg122-client.virtnet: 1 3204 1.71 MB/sec warmup 20 sec latency 12898.572 ms oleg122-client.virtnet: 1 3204 1.63 MB/sec warmup 21 sec latency 13898.811 ms oleg122-client.virtnet: 1 3204 1.56 MB/sec warmup 22 sec latency 14898.985 ms oleg122-client.virtnet: 1 3204 1.49 MB/sec warmup 23 sec latency 15899.177 ms oleg122-client.virtnet: 1 3204 1.43 MB/sec warmup 24 sec latency 16899.368 ms oleg122-client.virtnet: 1 3204 1.37 MB/sec warmup 25 sec latency 17899.596 ms oleg122-client.virtnet: 1 3204 1.32 MB/sec warmup 26 sec latency 18899.814 ms oleg122-client.virtnet: 1 3204 1.27 MB/sec warmup 27 sec latency 19899.974 ms oleg122-client.virtnet: 1 3708 1.38 MB/sec warmup 28 sec latency 20060.611 ms oleg122-client.virtnet: 1 3987 1.38 MB/sec warmup 29 sec latency 16.145 ms oleg122-client.virtnet: 1 4338 1.35 MB/sec warmup 30 sec latency 26.615 ms oleg122-client.virtnet: 1 4798 1.36 MB/sec warmup 31 sec latency 15.675 ms oleg122-client.virtnet: 1 5476 1.42 MB/sec warmup 32 sec latency 17.505 ms oleg122-client.virtnet: 1 6117 1.46 MB/sec warmup 33 sec latetest_70b fail mds2 2 times Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 09:51:24 (1713534684) shut down Failover mds2 to oleg122-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 09:51:39 (1713534699) targets are mounted 09:51:39 (1713534699) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2464 1285224 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2004 1285684 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 12360 3593580 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 37808 3568624 2% /mnt/lustre[OST:1] filesystem_summary: 7666232 50168 7162204 1% /mnt/lustre test_70b fail mds1 3 times Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:51:52 (1713534712) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 ncy 16.385 ms oleg122-client.virtnet: 1 6908 1.60 MB/sec warmup 34 sec latency 16.177 ms oleg122-client.virtnet: 1 7388 1.68 MB/sec warmup 35 sec latency 17.174 ms oleg122-client.virtnet: 1 7687 1.64 MB/sec warmup 36 sec latency 15.713 ms oleg122-client.virtnet: 1 8187 1.65 MB/sec warmup 37 sec latency 16.607 ms oleg122-client.virtnet: 1 8794 1.70 MB/sec warmup 38 sec latency 15.355 ms oleg122-client.virtnet: 1 9551 1.70 MB/sec warmup 39 sec latency 17.865 ms oleg122-client.virtnet: 1 10290 1.81 MB/sec warmup 40 sec latency 15.746 ms oleg122-client.virtnet: 1 10912 1.90 MB/sec warmup 41 sec latency 15.689 ms oleg122-client.virtnet: 1 11192 1.86 MB/sec warmup 42 sec latency 15.720 ms oleg122-client.virtnet: 1 11651 1.84 MB/sec warmup 43 sec latency 19.334 ms oleg122-client.virtnet: 1 12271 1.90 MB/sec warmup 44 sec latency 15.082 ms oleg122-client.virtnet: 1 13056 1.90 MB/sec warmup 45 sec latency 16.651 ms oleg122-client.virtnet: 1 13650 1.98 MB/sec warmup 46 sec latency 16.067 ms oleg122-client.virtnet: 1 14415 2.07 MB/sec warmup 47 sec latency 16.117 ms oleg122-client.virtnet: 1 14661 2.03 MB/sec warmup 48 sec latency 15.477 ms oleg122-client.virtnet: 1 15125 2.00 MB/sec warmup 49 sec latency 16.243 ms oleg122-client.virtnet: 1 15714 2.05 MB/sec warmup 50 sec latency 16.310 ms oleg122-client.virtnet: 1 16584 2.05 MB/sec warmup 51 sec latency 15.616 ms oleg122-client.virtnet: 1 17073 2.09 MB/sec warmup 52 sec latency 227.651 ms oleg122-client.virtnet: 1 17073 2.05 MB/sec warmup 53 sec latency 1227.880 ms oleg122-client.virtnet: 1 17073 2.01 MB/sec warmup 54 sec latency 2228.105 ms oleg122-client.virtnet: 1 17073 1.98 MB/sec warmup 55 sec latency 3228.306 ms oleg122-client.virtnet: 1 17401 1.98 MB/sec warmup 56 sec latency 3750.880 ms oleg122-client.virtnet: 1 18010 2.04 MB/sec warmup 57 sec latency 15.431 ms oleg122-client.virtnet: 1 18278 2.01 MB/sec warmup 58 sec latency 16.128 ms oleg122-client.virtnet: 1 18736 1.99 MB/sec warmup 59 sec latency 17.760 ms oleg122-client.virtnet: 1 20182 1.87 MB/sec execute 1 sec latency 18.940 ms oleg122-client.virtnet: 1 20377 1.45 MB/sec execute 2 sec latency 518.038 ms oleg122-client.virtnet: 1 21169 3.13 MB/sec execute 3 sec latency 15.615 ms oleg122-client.virtnet: 1 21692 3.49 MB/sec execute 4 sec latency 15.132 ms oleg122-client.virtnet: 1 21779 2.79 MB/sec execute 5 sec latency 609.787 ms oleg122-client.virtnet: 1 21779 2.33 MB/sec execute 6 sec latency 1609.972 ms oleg122-client.virtnet: 1 21779 1.99 MB/sec execute 7 sec latency 2610.142 ms oleg122-client.virtnet: 1 21779 1.75 MB/sec execute 8 sec latency 3610.360 ms oleg122-client.virtnet: 1 21779 1.55 MB/sec execute 9 sec latency 4610.600 ms oleg122-client.virtnet: 1 21779 1.40 MB/sec execute 10 sec latency 5610.853 ms oleg122-client.virtnet: 1 21779 1.27 MB/sec execute 11 sec latency 6611.084 ms oleg122-client.virtnet: 1 21779 1.16 MB/sec execute 12 sec latency 7611.376 ms oleg122-client.virtnet: 1 21779 1.07 MB/sec execute 13 sec latency 8611.669 ms oleg122-client.virtnet: 1 21779 1.00 MB/sec execute 14 sec latency 9611.891 ms oleg122-client.virtnet: 1 21779 0.93 MB/sec execute 15 sec latency 10612.140 ms oleg122-client.virtnet: 1 21779 0.87 MB/sec execute 16 sec latency 11612.343 ms oleg122-client.virtnet: 1 21779 0.82 MB/sec execute 17 sec latency 12612.496 ms oleg122-client.virtnet: 1 21779 0.78 MB/sec execute 18 sec latency 13612.696 ms oleg122-client.virtnet: 1 21779 0.73 MB/sec executeoleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:52:08 (1713534728) targets are mounted 09:52:08 (1713534728) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2432 1285256 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2004 1285684 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 12192 3594056 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 37836 3568676 2% /mnt/lustre[OST:1] filesystem_summary: 7666232 50028 7162732 1% /mnt/lustre test_70b fail mds2 4 times Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 09:52:21 (1713534741) shut down Failover mds2 to oleg122-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 09:52:36 (1713534756) targets are mounted 09:52:36 (1713534756) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2464 1285224 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2004 1285684 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 12312 3593708 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 37832 3568480 2% /mnt/lustre[OST:1] filesystem_summary: 7666232 50144 7162188 1% /mnt/lustre test_70b fail mds1 5 times Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:52:49 (1713534769) shut down 19 sec latency 14612.893 ms oleg122-client.virtnet: 1 21779 0.70 MB/sec execute 20 sec latency 15613.079 ms oleg122-client.virtnet: 1 21779 0.66 MB/sec execute 21 sec latency 16613.308 ms oleg122-client.virtnet: 1 21779 0.63 MB/sec execute 22 sec latency 17613.495 ms oleg122-client.virtnet: 1 21779 0.61 MB/sec execute 23 sec latency 18613.669 ms oleg122-client.virtnet: 1 21779 0.58 MB/sec execute 24 sec latency 19613.776 ms oleg122-client.virtnet: 1 21927 0.56 MB/sec execute 25 sec latency 20107.541 ms oleg122-client.virtnet: 1 22400 0.61 MB/sec execute 26 sec latency 17.057 ms oleg122-client.virtnet: 1 22973 0.71 MB/sec execute 27 sec latency 18.596 ms oleg122-client.virtnet: 1 23742 0.75 MB/sec execute 28 sec latency 18.421 ms oleg122-client.virtnet: 1 24338 0.92 MB/sec execute 29 sec latency 16.453 ms oleg122-client.virtnet: 1 25096 1.09 MB/sec execute 30 sec latency 15.921 ms oleg122-client.virtnet: 1 25333 1.07 MB/sec execute 31 sec latency 16.733 ms oleg122-client.virtnet: 1 25788 1.06 MB/sec execute 32 sec latency 20.881 ms oleg122-client.virtnet: 1 26368 1.16 MB/sec execute 33 sec latency 15.763 ms oleg122-client.virtnet: 1 27229 1.18 MB/sec execute 34 sec latency 17.540 ms oleg122-client.virtnet: 1 27793 1.30 MB/sec execute 35 sec latency 16.117 ms oleg122-client.virtnet: 1 28556 1.41 MB/sec execute 36 sec latency 15.502 ms oleg122-client.virtnet: 1 28844 1.41 MB/sec execute 37 sec latency 15.586 ms oleg122-client.virtnet: 1 29252 1.39 MB/sec execute 38 sec latency 19.251 ms oleg122-client.virtnet: 1 29820 1.47 MB/sec execute 39 sec latency 16.775 ms oleg122-client.virtnet: 1 30620 1.45 MB/sec execute 40 sec latency 17.519 ms oleg122-client.virtnet: 1 31185 1.54 MB/sec execute 41 sec latency 16.171 ms oleg122-client.virtnet: 1 31993 1.66 MB/sec execute 42 sec latency 16.352 ms oleg122-client.virtnet: 1 32377 1.66 MB/sec execute 43 sec latency 15.988 ms oleg122-client.virtnet: 1 32775 1.63 MB/sec execute 44 sec latency 18.014 ms oleg122-client.virtnet: 1 33399 1.70 MB/sec execute 45 sec latency 15.716 ms oleg122-client.virtnet: 1 34334 1.70 MB/sec execute 46 sec latency 11.875 ms oleg122-client.virtnet: 1 34944 1.78 MB/sec execute 47 sec latency 15.575 ms oleg122-client.virtnet: 1 35731 1.88 MB/sec execute 48 sec latency 15.590 ms oleg122-client.virtnet: 1 35973 1.85 MB/sec execute 49 sec latency 42.523 ms oleg122-client.virtnet: 1 35973 1.81 MB/sec execute 50 sec latency 1042.745 ms oleg122-client.virtnet: 1 35973 1.77 MB/sec execute 51 sec latency 2042.998 ms oleg122-client.virtnet: 1 35973 1.74 MB/sec execute 52 sec latency 3043.229 ms oleg122-client.virtnet: 1 36374 1.72 MB/sec execute 53 sec latency 3305.794 ms oleg122-client.virtnet: 1 36961 1.77 MB/sec execute 54 sec latency 15.918 ms oleg122-client.virtnet: 1 37807 1.77 MB/sec execute 55 sec latency 17.724 ms oleg122-client.virtnet: 1 38426 1.84 MB/sec execute 56 sec latency 15.693 ms oleg122-client.virtnet: 1 39191 1.90 MB/sec execute 57 sec latency 15.891 ms oleg122-client.virtnet: 1 39479 1.89 MB/sec execute 58 sec latency 16.213 ms oleg122-client.virtnet: 1 39911 1.87 MB/sec execute 59 sec latency 18.594 ms oleg122-client.virtnet: 1 40469 1.91 MB/sec execute 60 sec latency 15.320 ms oleg122-client.virtnet: 1 41258 1.89 MB/sec execute 61 sec latency 59.094 ms oleg122-client.virtnet: 1 41258 1.86 MB/sec execute 62 sec latency 1059.316 ms oleg122-client.virtnet: 1 41258 1.83 MB/sec execute 63 sec latency 2059.534 ms oleg122-client.virtnet: 1 Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:53:04 (1713534784) targets are mounted 09:53:04 (1713534784) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2440 1285248 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2004 1285684 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 12312 3593932 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 37924 3568104 2% /mnt/lustre[OST:1] filesystem_summary: 7666232 50236 7162036 1% /mnt/lustre test_70b fail mds2 6 times Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 09:53:17 (1713534797) shut down Failover mds2 to oleg122-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 09:53:33 (1713534813) targets are mounted 09:53:33 (1713534813) facet_failover done 41258 1.80 MB/sec execute 64 sec latency 3059.725 ms oleg122-client.virtnet: 1 41258 1.77 MB/sec execute 65 sec latency 4059.939 ms oleg122-client.virtnet: 1 41258 1.75 MB/sec execute 66 sec latency 5060.159 ms oleg122-client.virtnet: 1 41258 1.72 MB/sec execute 67 sec latency 6060.370 ms oleg122-client.virtnet: 1 41258 1.70 MB/sec execute 68 sec latency 7060.651 ms oleg122-client.virtnet: 1 41258 1.67 MB/sec execute 69 sec latency 8060.851 ms oleg122-client.virtnet: 1 41258 1.65 MB/sec execute 70 sec latency 9061.037 ms oleg122-client.virtnet: 1 41258 1.62 MB/sec execute 71 sec latency 10061.221 ms oleg122-client.virtnet: 1 41258 1.60 MB/sec execute 72 sec latency 11061.408 ms oleg122-client.virtnet: 1 41258 1.58 MB/sec execute 73 sec latency 12061.607 ms oleg122-client.virtnet: 1 41258 1.56 MB/sec execute 74 sec latency 13061.792 ms oleg122-client.virtnet: 1 41258 1.54 MB/sec execute 75 sec latency 14061.979 ms oleg122-client.virtnet: 1 41258 1.52 MB/sec execute 76 sec latency 15062.175 ms oleg122-client.virtnet: 1 41258 1.50 MB/sec execute 77 sec latency 16062.484 ms oleg122-client.virtnet: 1 41258 1.48 MB/sec execute 78 sec latency 17062.676 ms oleg122-client.virtnet: 1 41258 1.46 MB/sec execute 79 sec latency 18062.864 ms oleg122-client.virtnet: 1 41258 1.44 MB/sec execute 80 sec latency 19063.094 ms oleg122-client.virtnet: 1 41322 1.42 MB/sec execute 81 sec latency 19992.004 ms oleg122-client.virtnet: 1 41936 1.47 MB/sec execute 82 sec latency 15.118 ms oleg122-client.virtnet: 1 42674 1.53 MB/sec execute 83 sec latency 15.968 ms oleg122-client.virtnet: 1 42994 1.53 MB/sec execute 84 sec latency 16.120 ms oleg122-client.virtnet: 1 43344 1.52 MB/sec execute 85 sec latency 16.451 ms oleg122-client.virtnet: 1 43829 1.52 MB/sec execute 86 sec latency 15.428 ms oleg122-client.virtnet: 1 44559 1.54 MB/sec execute 87 sec latency 15.902 ms oleg122-client.virtnet: 1 45193 1.55 MB/sec execute 88 sec latency 16.038 ms oleg122-client.virtnet: 1 46125 1.64 MB/sec execute 89 sec latency 16.484 ms oleg122-client.virtnet: 1 46520 1.64 MB/sec execute 90 sec latency 16.857 ms oleg122-client.virtnet: 1 46882 1.63 MB/sec execute 91 sec latency 21.881 ms oleg122-client.virtnet: 1 47365 1.63 MB/sec execute 92 sec latency 17.198 ms oleg122-client.virtnet: 1 48062 1.65 MB/sec execute 93 sec latency 17.361 ms oleg122-client.virtnet: 1 48720 1.66 MB/sec execute 94 sec latency 15.954 ms oleg122-client.virtnet: 1 49508 1.71 MB/sec execute 95 sec latency 15.974 ms oleg122-client.virtnet: 1 50007 1.74 MB/sec execute 96 sec latency 15.576 ms oleg122-client.virtnet: 1 50356 1.72 MB/sec execute 97 sec latency 15.652 ms oleg122-client.virtnet: 1 50868 1.72 MB/sec execute 98 sec latency 17.469 ms oleg122-client.virtnet: 1 51570 1.74 MB/sec execute 99 sec latency 16.852 ms oleg122-client.virtnet: 1 52278 1.75 MB/sec execute 100 sec latency 15.473 ms oleg122-client.virtnet: 1 53234 1.83 MB/sec execute 101 sec latency 14.558 ms oleg122-client.virtnet: 1 53631 1.82 MB/sec execute 102 sec latency 16.033 ms oleg122-client.virtnet: 1 54037 1.81 MB/sec execute 103 sec latency 16.822 ms oleg122-client.virtnet: 1 54639 1.84 MB/sec execute 104 sec latency 15.926 ms oleg122-client.virtnet: 1 55131 1.82 MB/sec execute 105 sec latency 272.321 ms oleg122-client.virtnet: 1 55131 1.81 MB/sec execute 106 sec latency 1272.579 ms oleg122-client.virtnet: 1 55131 1.79 MB/sec execute 107 sec latency 2272.826 ms oleg122-client.virtnet: 1 55131 1.77 MB/sec execute 108 secoleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2472 1285216 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2004 1285684 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 12188 3593840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 37844 3568196 2% /mnt/lustre[OST:1] filesystem_summary: 7666232 50032 7162036 1% /mnt/lustre test_70b fail mds1 7 times Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:53:46 (1713534826) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:54:01 (1713534841) targets are mounted 09:54:01 (1713534841) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2440 1285248 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2004 1285684 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 12192 3593688 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 39128 3567184 2% /mnt/lustre[OST:1] filesystem_summary: 7666232 51320 7160872 1% /mnt/lustre test_70b fail mds2 8 times Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 09:54:15 (1713534855) shut down latency 3273.033 ms oleg122-client.virtnet: 1 55299 1.76 MB/sec execute 109 sec latency 4147.807 ms oleg122-client.virtnet: 1 55845 1.76 MB/sec execute 110 sec latency 16.024 ms oleg122-client.virtnet: 1 56748 1.83 MB/sec execute 111 sec latency 16.318 ms oleg122-client.virtnet: 1 57125 1.83 MB/sec execute 112 sec latency 16.272 ms oleg122-client.virtnet: 1 57467 1.82 MB/sec execute 113 sec latency 18.239 ms oleg122-client.virtnet: 1 57943 1.82 MB/sec execute 114 sec latency 17.851 ms oleg122-client.virtnet: 1 58418 1.83 MB/sec execute 115 sec latency 227.984 ms oleg122-client.virtnet: 1 59169 1.83 MB/sec execute 116 sec latency 453.168 ms oleg122-client.virtnet: 1 59751 1.86 MB/sec execute 117 sec latency 16.359 ms oleg122-client.virtnet: 1 60455 1.89 MB/sec execute 118 sec latency 104.248 ms oleg122-client.virtnet: 1 60455 1.87 MB/sec execute 119 sec latency 1104.462 ms oleg122-client.virtnet: 1 60455 1.86 MB/sec execute 120 sec latency 2104.742 ms oleg122-client.virtnet: 1 60455 1.84 MB/sec execute 121 sec latency 3105.011 ms oleg122-client.virtnet: 1 60455 1.83 MB/sec execute 122 sec latency 4105.270 ms oleg122-client.virtnet: 1 60455 1.81 MB/sec execute 123 sec latency 5105.514 ms oleg122-client.virtnet: 1 60455 1.80 MB/sec execute 124 sec latency 6105.751 ms oleg122-client.virtnet: 1 60455 1.78 MB/sec execute 125 sec latency 7106.017 ms oleg122-client.virtnet: 1 60455 1.77 MB/sec execute 126 sec latency 8106.334 ms oleg122-client.virtnet: 1 60455 1.76 MB/sec execute 127 sec latency 9106.673 ms oleg122-client.virtnet: 1 60455 1.74 MB/sec execute 128 sec latency 10107.018 ms oleg122-client.virtnet: 1 60455 1.73 MB/sec execute 129 sec latency 11107.267 ms oleg122-client.virtnet: 1 60455 1.71 MB/sec execute 130 sec latency 12107.479 ms oleg122-client.virtnet: 1 60455 1.70 MB/sec execute 131 sec latency 13107.713 ms oleg122-client.virtnet: 1 60455 1.69 MB/sec execute 132 sec latency 14108.022 ms oleg122-client.virtnet: 1 60455 1.68 MB/sec execute 133 sec latency 15108.241 ms oleg122-client.virtnet: 1 60455 1.66 MB/sec execute 134 sec latency 16108.451 ms oleg122-client.virtnet: 1 60455 1.65 MB/sec execute 135 sec latency 17108.706 ms oleg122-client.virtnet: 1 60455 1.64 MB/sec execute 136 sec latency 18109.031 ms oleg122-client.virtnet: 1 60455 1.63 MB/sec execute 137 sec latency 19109.314 ms oleg122-client.virtnet: 1 60524 1.62 MB/sec execute 138 sec latency 19968.804 ms oleg122-client.virtnet: 1 60775 1.61 MB/sec execute 139 sec latency 21.431 ms oleg122-client.virtnet: 1 61120 1.60 MB/sec execute 140 sec latency 18.764 ms oleg122-client.virtnet: 1 61580 1.61 MB/sec execute 141 sec latency 16.431 ms oleg122-client.virtnet: 1 62259 1.62 MB/sec execute 142 sec latency 17.626 ms oleg122-client.virtnet: 1 62911 1.62 MB/sec execute 143 sec latency 15.787 ms oleg122-client.virtnet: 1 63690 1.66 MB/sec execute 144 sec latency 17.537 ms oleg122-client.virtnet: 1 64174 1.68 MB/sec execute 145 sec latency 16.161 ms oleg122-client.virtnet: 1 64489 1.67 MB/sec execute 146 sec latency 15.776 ms oleg122-client.virtnet: 1 64996 1.67 MB/sec execute 147 sec latency 19.993 ms oleg122-client.virtnet: 1 65646 1.68 MB/sec execute 148 sec latency 18.038 ms oleg122-client.virtnet: 1 66326 1.68 MB/sec execute 149 sec latency 15.916 ms oleg122-client.virtnet: 1 67058 1.71 MB/sec execute 150 sec latency 16.216 ms oleg122-client.virtnet: 1 67674 1.74 MB/sec execute 151 sec latency 16.818 ms oleg122-client.virtnet: 1 67966 1.73 MB/sec execute 152 sec latency 17.306 ms oleg122-clientFailover mds2 to oleg122-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 09:54:30 (1713534870) targets are mounted 09:54:30 (1713534870) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2464 1285224 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2004 1285684 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 12224 3593804 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 38160 3568092 2% /mnt/lustre[OST:1] filesystem_summary: 7666232 50384 7161896 1% /mnt/lustre test_70b fail mds1 9 times Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:54:43 (1713534883) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:54:58 (1713534898) targets are mounted 09:54:58 (1713534898) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec .virtnet: 1 68416 1.72 MB/sec execute 153 sec latency 22.235 ms oleg122-client.virtnet: 1 69088 1.74 MB/sec execute 154 sec latency 15.902 ms oleg122-client.virtnet: 1 69898 1.74 MB/sec execute 155 sec latency 17.879 ms oleg122-client.virtnet: 1 70768 1.78 MB/sec execute 156 sec latency 16.229 ms oleg122-client.virtnet: 1 71265 1.79 MB/sec execute 157 sec latency 17.083 ms oleg122-client.virtnet: 1 71611 1.78 MB/sec execute 158 sec latency 17.254 ms oleg122-client.virtnet: 1 72096 1.78 MB/sec execute 159 sec latency 19.670 ms oleg122-client.virtnet: 1 72790 1.79 MB/sec execute 160 sec latency 17.911 ms oleg122-client.virtnet: 1 73475 1.80 MB/sec execute 161 sec latency 16.611 ms oleg122-client.virtnet: 1 74270 1.83 MB/sec execute 162 sec latency 16.140 ms oleg122-client.virtnet: 1 74802 1.84 MB/sec execute 163 sec latency 102.396 ms oleg122-client.virtnet: 1 74802 1.83 MB/sec execute 164 sec latency 1102.616 ms oleg122-client.virtnet: 1 74802 1.82 MB/sec execute 165 sec latency 2102.835 ms oleg122-client.virtnet: 1 74802 1.81 MB/sec execute 166 sec latency 3103.049 ms oleg122-client.virtnet: 1 75013 1.80 MB/sec execute 167 sec latency 3226.667 ms oleg122-client.virtnet: 1 75459 1.79 MB/sec execute 168 sec latency 20.986 ms oleg122-client.virtnet: 1 76036 1.81 MB/sec execute 169 sec latency 15.748 ms oleg122-client.virtnet: 1 76883 1.81 MB/sec execute 170 sec latency 18.580 ms oleg122-client.virtnet: 1 77444 1.83 MB/sec execute 171 sec latency 16.954 ms oleg122-client.virtnet: 1 78195 1.85 MB/sec execute 172 sec latency 15.792 ms oleg122-client.virtnet: 1 78489 1.85 MB/sec execute 173 sec latency 16.689 ms oleg122-client.virtnet: 1 78867 1.84 MB/sec execute 174 sec latency 18.241 ms oleg122-client.virtnet: 1 79337 1.84 MB/sec execute 175 sec latency 17.288 ms oleg122-client.virtnet: 1 79471 1.85 MB/sec execute 176 sec latency 902.801 ms oleg122-client.virtnet: 1 79471 1.84 MB/sec execute 177 sec latency 1902.967 ms oleg122-client.virtnet: 1 79471 1.83 MB/sec execute 178 sec latency 2903.181 ms oleg122-client.virtnet: 1 79471 1.82 MB/sec execute 179 sec latency 3903.338 ms oleg122-client.virtnet: 1 79471 1.81 MB/sec execute 180 sec latency 4903.498 ms oleg122-client.virtnet: 1 79471 1.80 MB/sec execute 181 sec latency 5903.672 ms oleg122-client.virtnet: 1 79471 1.79 MB/sec execute 182 sec latency 6903.896 ms oleg122-client.virtnet: 1 79471 1.78 MB/sec execute 183 sec latency 7904.141 ms oleg122-client.virtnet: 1 79471 1.77 MB/sec execute 184 sec latency 8904.348 ms oleg122-client.virtnet: 1 79471 1.76 MB/sec execute 185 sec latency 9904.565 ms oleg122-client.virtnet: 1 79471 1.75 MB/sec execute 186 sec latency 10904.743 ms oleg122-client.virtnet: 1 79471 1.74 MB/sec execute 187 sec latency 11904.906 ms oleg122-client.virtnet: 1 79471 1.73 MB/sec execute 188 sec latency 12905.075 ms oleg122-client.virtnet: 1 79471 1.72 MB/sec execute 189 sec latency 13905.257 ms oleg122-client.virtnet: 1 79471 1.71 MB/sec execute 190 sec latency 14905.438 ms oleg122-client.virtnet: 1 79471 1.70 MB/sec execute 191 sec latency 15905.614 ms oleg122-client.virtnet: 1 79471 1.69 MB/sec execute 192 sec latency 16905.867 ms oleg122-client.virtnet: 1 79471 1.68 MB/sec execute 193 sec latency 17906.043 ms oleg122-client.virtnet: 1 79471 1.68 MB/sec execute 194 sec latency 18906.242 ms oleg122-client.virtnet: 1 79471 1.67 MB/sec execute 195 sec latency 19906.371 ms oleg122-client.virtnet: 1 80065 1.66 MB/sec execute 196 sec latency 20059.746 ms oleg122-client.virtnet: 1 80668 1.UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2436 1285252 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2004 1285684 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 12320 3591688 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 37920 3568308 2% /mnt/lustre[OST:1] filesystem_summary: 7666232 50240 7159996 1% /mnt/lustre test_70b fail mds2 10 times Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 09:55:12 (1713534912) shut down Failover mds2 to oleg122-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 09:55:27 (1713534927) targets are mounted 09:55:27 (1713534927) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2464 1285224 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2004 1285684 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 12356 3593448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 37688 3568556 2% /mnt/lustre[OST:1] filesystem_summary: 7666232 50044 7162004 1% /mnt/lustre test_70b fail mds1 11 times Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:55:40 (1713534940) shut down 66 MB/sec execute 197 sec latency 15.181 ms oleg122-client.virtnet: 1 81455 1.69 MB/sec execute 198 sec latency 15.588 ms oleg122-client.virtnet: 1 81942 1.70 MB/sec execute 199 sec latency 15.162 ms oleg122-client.virtnet: 1 82255 1.70 MB/sec execute 200 sec latency 19.737 ms oleg122-client.virtnet: 1 82769 1.70 MB/sec execute 201 sec latency 16.057 ms oleg122-client.virtnet: 1 83399 1.70 MB/sec execute 202 sec latency 17.598 ms oleg122-client.virtnet: 1 84082 1.71 MB/sec execute 203 sec latency 15.192 ms oleg122-client.virtnet: 1 84823 1.73 MB/sec execute 204 sec latency 16.169 ms oleg122-client.virtnet: 1 85451 1.75 MB/sec execute 205 sec latency 15.856 ms oleg122-client.virtnet: 1 85734 1.74 MB/sec execute 206 sec latency 15.643 ms oleg122-client.virtnet: 1 86226 1.74 MB/sec execute 207 sec latency 16.852 ms oleg122-client.virtnet: 1 86825 1.75 MB/sec execute 208 sec latency 15.736 ms oleg122-client.virtnet: 1 87594 1.75 MB/sec execute 209 sec latency 18.115 ms oleg122-client.virtnet: 1 88187 1.77 MB/sec execute 210 sec latency 16.440 ms oleg122-client.virtnet: 1 88951 1.79 MB/sec execute 211 sec latency 16.079 ms oleg122-client.virtnet: 1 89188 1.78 MB/sec execute 212 sec latency 15.694 ms oleg122-client.virtnet: 1 89649 1.77 MB/sec execute 213 sec latency 17.785 ms oleg122-client.virtnet: 1 90226 1.79 MB/sec execute 214 sec latency 15.268 ms oleg122-client.virtnet: 1 91080 1.79 MB/sec execute 215 sec latency 18.163 ms oleg122-client.virtnet: 1 91645 1.80 MB/sec execute 216 sec latency 15.990 ms oleg122-client.virtnet: 1 92408 1.82 MB/sec execute 217 sec latency 15.136 ms oleg122-client.virtnet: 1 92702 1.82 MB/sec execute 218 sec latency 16.742 ms oleg122-client.virtnet: 1 93080 1.81 MB/sec execute 219 sec latency 105.029 ms oleg122-client.virtnet: 1 93080 1.80 MB/sec execute 220 sec latency 1105.235 ms oleg122-client.virtnet: 1 93080 1.80 MB/sec execute 221 sec latency 2105.389 ms oleg122-client.virtnet: 1 93080 1.79 MB/sec execute 222 sec latency 3105.605 ms oleg122-client.virtnet: 1 93080 1.78 MB/sec execute 223 sec latency 4105.887 ms oleg122-client.virtnet: 1 93395 1.78 MB/sec execute 224 sec latency 4562.534 ms oleg122-client.virtnet: 1 94048 1.79 MB/sec execute 225 sec latency 17.982 ms oleg122-client.virtnet: 1 94750 1.79 MB/sec execute 226 sec latency 15.399 ms oleg122-client.virtnet: 1 95586 1.81 MB/sec execute 227 sec latency 16.081 ms oleg122-client.virtnet: 1 96110 1.82 MB/sec execute 228 sec latency 15.395 ms oleg122-client.virtnet: 1 96422 1.81 MB/sec execute 229 sec latency 15.615 ms oleg122-client.virtnet: 1 96688 1.81 MB/sec execute 230 sec latency 506.992 ms oleg122-client.virtnet: 1 97241 1.82 MB/sec execute 231 sec latency 15.774 ms oleg122-client.virtnet: 1 98113 1.82 MB/sec execute 232 sec latency 17.084 ms oleg122-client.virtnet: 1 98214 1.81 MB/sec execute 233 sec latency 749.958 ms oleg122-client.virtnet: 1 98214 1.81 MB/sec execute 234 sec latency 1750.154 ms oleg122-client.virtnet: 1 98214 1.80 MB/sec execute 235 sec latency 2750.356 ms oleg122-client.virtnet: 1 98214 1.79 MB/sec execute 236 sec latency 3750.514 ms oleg122-client.virtnet: 1 98214 1.78 MB/sec execute 237 sec latency 4750.687 ms oleg122-client.virtnet: 1 98214 1.78 MB/sec execute 238 sec latency 5750.866 ms oleg122-client.virtnet: 1 98214 1.77 MB/sec execute 239 sec latency 6751.077 ms oleg122-client.virtnet: 1 98214 1.76 MB/sec execute 240 sec latency 7751.310 ms oleg122-client.virtnet: 1 98214 1.75 MB/sec execute 241 sec latency 8751.553 ms oleg122-client.viFailover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:55:56 (1713534956) targets are mounted 09:55:56 (1713534956) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec rtnet: 1 98214 1.75 MB/sec execute 242 sec latency 9751.769 ms oleg122-client.virtnet: 1 98214 1.74 MB/sec execute 243 sec latency 10751.974 ms oleg122-client.virtnet: 1 98214 1.73 MB/sec execute 244 sec latency 11752.239 ms oleg122-client.virtnet: 1 98214 1.72 MB/sec execute 245 sec latency 12752.438 ms oleg122-client.virtnet: 1 98214 1.72 MB/sec execute 246 sec latency 13752.599 ms oleg122-client.virtnet: 1 98214 1.71 MB/sec execute 247 sec latency 14752.810 ms oleg122-client.virtnet: 1 98214 1.70 MB/sec execute 248 sec latency 15753.007 ms oleg122-client.virtnet: 1 98214 1.70 MB/sec execute 249 sec latency 16753.223 ms oleg122-client.virtnet: 1 98214 1.69 MB/sec execute 250 sec latency 17753.418 ms oleg122-client.virtnet: 1 98214 1.68 MB/sec execute 251 sec latency 18753.620 ms oleg122-client.virtnet: 1 98214 1.68 MB/sec execute 252 sec latency 19753.725 ms oleg122-client.virtnet: 1 98413 1.67 MB/sec execute 253 sec latency 20246.788 ms oleg122-client.virtnet: 1 99193 1.69 MB/sec execute 254 sec latency 16.035 ms oleg122-client.virtnet: 1 99673 1.70 MB/sec execute 255 sec latency 17.208 ms oleg122-client.virtnet: 1 99981 1.70 MB/sec execute 256 sec latency 16.404 ms oleg122-client.virtnet: 1 100450 1.70 MB/sec execute 257 sec latency 18.838 ms oleg122-client.virtnet: 1 101013 1.70 MB/sec execute 258 sec latency 16.211 ms oleg122-client.virtnet: 1 101779 1.71 MB/sec execute 259 sec latency 17.439 ms oleg122-client.virtnet: 1 102370 1.72 MB/sec execute 260 sec latency 16.348 ms oleg122-client.virtnet: 1 103130 1.74 MB/sec execute 261 sec latency 16.195 ms oleg122-client.virtnet: 1 103370 1.73 MB/sec execute 262 sec latency 15.935 ms oleg122-client.virtnet: 1 103838 1.73 MB/sec execute 263 sec latency 18.419 ms oleg122-client.virtnet: 1 104419 1.74 MB/sec execute 264 sec latency 16.265 ms oleg122-client.virtnet: 1 105270 1.74 MB/sec execute 265 sec latency 17.753 ms oleg122-client.virtnet: 1 105826 1.75 MB/sec execute 266 sec latency 16.429 ms oleg122-client.virtnet: 1 106585 1.77 MB/sec execute 267 sec latency 15.617 ms oleg122-client.virtnet: 1 106879 1.76 MB/sec execute 268 sec latency 16.123 ms oleg122-client.virtnet: 1 107277 1.76 MB/sec execute 269 sec latency 17.127 ms oleg122-client.virtnet: 1 107856 1.77 MB/sec execute 270 sec latency 15.993 ms oleg122-client.virtnet: 1 108632 1.77 MB/sec execute 271 sec latency 18.214 ms oleg122-client.virtnet: 1 109199 1.78 MB/sec execute 272 sec latency 16.730 ms oleg122-client.virtnet: 1 109981 1.80 MB/sec execute 273 sec latency 16.073 ms oleg122-client.virtnet: 1 110368 1.80 MB/sec execute 274 sec latency 15.624 ms oleg122-client.virtnet: 1 110724 1.79 MB/sec execute 275 sec latency 18.185 ms oleg122-client.virtnet: 1 111199 1.79 MB/sec execute 276 sec latency 16.324 ms oleg122-client.virtnet: 1 111890 1.80 MB/sec execute 277 sec latency 17.157 ms oleg122-client.virtnet: 1 112561 1.80 MB/sec execute 278 sec latency 15.482 ms oleg122-client.virtnet: 1 113346 1.82 MB/sec execute 279 sec latency 16.434 ms oleg122-client.virtnet: 1 113836 1.82 MB/sec execute 280 sec latency 15.759 ms oleg122-client.virtnet: 1 114135 1.82 MB/sec execute 281 sec latency 15.713 ms oleg122-client.virtnet: 1 114616 1.82 MB/sec execute 282 sec latency 22.163 ms oleg122-client.virtnet: 1 115203 1.82 MB/sec execute 283 sec latency 16.690 ms oleg122-client.virtnet: 1 115979 1.82 MB/sec execute 284 sec latency 17.522 ms oleg122-client.virtnet: 1 116709 1.84 MB/sec execute 285 sec latency 15.966 ms oleg122-client.virtnet: 1 117336 1.85 MB/sec execute 286 sec latency 15.536 ms oleg122-client.virtnet: 1 117590 1.85 MB/sec execute 287 sec latency 16.288 ms oleg122-client.virtnet: 1 118070 1.84 MB/sec execute 288 sec latency 18.896 ms oleg122-client.virtnet: 1 118680 1.85 MB/sec execute 289 sec latency 20.111 ms oleg122-client.virtnet: 1 119480 1.85 MB/sec execute 290 sec latency 17.244 ms oleg122-client.virtnet: 1 120071 1.87 MB/sec execute 291 sec latency 15.825 ms oleg122-client.virtnet: 1 120831 1.88 MB/sec execute 292 sec latency 16.307 ms oleg122-client.virtnet: 1 121076 1.87 MB/sec execute 293 sec latency 16.152 ms oleg122-client.virtnet: 1 121508 1.87 MB/sec execute 294 sec latency 22.203 ms oleg122-client.virtnet: 1 122061 1.88 MB/sec execute 295 sec latency 15.676 ms oleg122-client.virtnet: 1 122893 1.88 MB/sec execute 296 sec latency 17.670 ms oleg122-client.virtnet: 1 123546 1.89 MB/sec execute 297 sec latency 15.831 ms oleg122-client.virtnet: 1 124297 1.90 MB/sec execute 298 sec latency 16.568 ms oleg122-client.virtnet: 1 124596 1.90 MB/sec execute 299 sec latency 16.055 ms oleg122-client.virtnet: 1 cleanup 300 sec oleg122-client.virtnet: 0 cleanup 300 sec oleg122-client.virtnet: oleg122-client.virtnet: Operation Count AvgLat MaxLat oleg122-client.virtnet: ---------------------------------------- oleg122-client.virtnet: NTCreateX 18275 7.625 20246.778 oleg122-client.virtnet: Close 13415 0.558 4.989 oleg122-client.virtnet: Rename 772 11.638 18.563 oleg122-client.virtnet: Unlink 3703 3.613 6.901 oleg122-client.virtnet: Qpathinfo 16576 5.035 20059.707 oleg122-client.virtnet: Qfileinfo 2890 0.197 2.616 oleg122-client.virtnet: Qfsinfo 3040 0.305 4.789 oleg122-client.virtnet: Sfileinfo 1473 10.585 4562.473 oleg122-client.virtnet: Find 6404 0.607 11.821 oleg122-client.virtnet: WriteX 9023 1.365 8.177 oleg122-client.virtnet: ReadX 28676 0.023 1.484 oleg122-client.virtnet: LockX 60 1.929 2.201 oleg122-client.virtnet: UnlockX 60 1.982 2.378 oleg122-client.virtnet: Flush 1281 9.616 518.024 oleg122-client.virtnet: oleg122-client.virtnet: Throughput 1.90117 MB/sec 1 clients 1 procs max_latency=20246.788 ms oleg122-client.virtnet: stopping dbench on /mnt/lustre/d70b.replay-single/oleg122-client.virtnet at Fri Apr 19 09:56:48 EDT 2024 with return code 0 oleg122-client.virtnet: clean dbench files on /mnt/lustre/d70b.replay-single/oleg122-client.virtnet oleg122-client.virtnet: /mnt/lustre/d70b.replay-single/oleg122-client.virtnet /mnt/lustre/d70b.replay-single/oleg122-client.virtnet oleg122-client.virtnet: removed directory: 'clients/client0/~dmtmp/WORDPRO' oleg122-client.virtnet: removed directory: 'clients/client0/~dmtmp/SEED' oleg122-client.virtnet: removed directory: 'clients/client0/~dmtmp/PARADOX' oleg122-client.virtnet: removed directory: 'clients/client0/~dmtmp/COREL' oleg122-client.virtnet: removed directory: 'clients/client0/~dmtmp/EXCEL' oleg122-client.virtnet: removed directory: 'clients/client0/~dmtmp/WORD' oleg122-client.virtnet: removed directory: 'clients/client0/~dmtmp/PM' oleg122-client.virtnet: removed directory: 'clients/client0/~dmtmp/PWRPNT' oleg122-client.virtnet: removed directory: 'clients/client0/~dmtmp/ACCESS' oleg122-client.virtnet: removed directory: 'clients/client0/~dmtmp' oleg122-client.virtnet: removed directory: 'clients/client0' oleg122-client.virtnet: removed directory: 'clients' oleg122-client.virtnet: removed 'client.txt' oleg122-client.virtnet: /mnt/lustre/d70b.replay-single/oleg122-client.virtnet oleg122-client.virtnet: dbench successfully finished PASS 70b (364s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70c: tar 2mdts recovery ============ 09:56:52 (1713535012) Starting client oleg122-client.virtnet: -o user_xattr,flock oleg122-server@tcp:/lustre /mnt/lustre Started clients oleg122-client.virtnet: 192.168.201.122@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Started tar 2433 striped dir -i0 -c2 -H fnv_1a_64 /mnt/lustre/d70c.replay-single tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets tar: Removing leading `/' from member names UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 6632 1281056 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 4292 1283396 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 14636 3578960 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 9496 3584884 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 24132 7163844 1% /mnt/lustre tar: Removing leading `/' from hard link targets test_70c fail mds1 1 times Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 09:59:07 (1713535147) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 09:59:21 (1713535161) targets are mounted 09:59:21 (1713535161) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 7072 1280616 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 4488 1283200 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 7148 3576396 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 10596 3561972 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 17744 7138368 1% /mnt/lustre test_70c fail mds1 2 times Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 10:01:48 (1713535308) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 10:02:01 (1713535321) targets are mounted 10:02:01 (1713535321) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets PASS 70c (369s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70d: mkdir/rmdir striped dir 2mdts recovery ========================================================== 10:03:03 (1713535383) Starting client oleg122-client.virtnet: -o user_xattr,flock oleg122-server@tcp:/lustre /mnt/lustre Started clients oleg122-client.virtnet: 192.168.201.122@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Started 5801 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 4188 1283500 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3616 1284072 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3595456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3598124 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7193580 1% /mnt/lustre test_70d fail mds1 1 times Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 10:05:23 (1713535523) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 10:05:39 (1713535539) targets are mounted 10:05:39 (1713535539) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 4244 1283444 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3592 1284096 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3595456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3598124 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7193580 1% /mnt/lustre test_70d fail mds1 2 times Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 10:08:06 (1713535686) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 10:08:21 (1713535701) targets are mounted 10:08:21 (1713535701) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /home/green/git/lustre-release/lustre/tests/test-framework.sh: line 4661: 5801 Killed ( while true; do $LFS mkdir -i0 -c2 $DIR/$tdir/test || { echo "mkdir fails"; break; }; $LFS mkdir -i1 -c2 $DIR/$tdir/test1 || { echo "mkdir fails"; break; }; touch $DIR/$tdir/test/a || { echo "touch fails"; break; }; mkdir $DIR/$tdir/test/b || { echo "mkdir fails"; break; }; rm -rf $DIR/$tdir/test || { echo "rmdir fails"; ls -lR $DIR/$tdir; break; }; touch $DIR/$tdir/test1/a || { echo "touch fails"; break; }; mkdir $DIR/$tdir/test1/b || { echo "mkdir fails"; break; }; rm -rf $DIR/$tdir/test1 || { echo "rmdir fails"; ls -lR $DIR/$tdir/test1; break; }; done ) (wd: ~) PASS 70d (326s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70e: rename cross-MDT with random fails ========================================================== 10:08:31 (1713535711) debug=+ha Starting client oleg122-client.virtnet: -o user_xattr,flock oleg122-server@tcp:/lustre /mnt/lustre Started clients oleg122-client.virtnet: 192.168.201.122@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Started PID=27314 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 4784 1282904 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3692 1283996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3595456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3598124 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7193580 1% /mnt/lustre test_70e fail mds2 1 times Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 10:10:47 (1713535847) shut down Failover mds2 to oleg122-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 10:11:02 (1713535862) targets are mounted 10:11:02 (1713535862) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3132 1284556 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2096 1285592 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3595456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3598124 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7193580 1% /mnt/lustre test_70e fail mds1 2 times Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 10:13:23 (1713536003) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 10:13:38 (1713536018) targets are mounted 10:13:38 (1713536018) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 70e (318s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70f: OSS O_DIRECT recovery with 1 clients ========================================================== 10:13:52 (1713536032) mount clients oleg122-client.virtnet ... Starting client oleg122-client.virtnet: -o user_xattr,flock oleg122-server@tcp:/lustre /mnt/lustre Started clients oleg122-client.virtnet: 192.168.201.122@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2480 1285208 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2052 1285636 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3585448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3592760 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7178208 1% /mnt/lustre ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... test_70f failing OST 1 times Failing ost1 on oleg122-server Stopping /mnt/lustre-ost1 (opts:) on oleg122-server 10:14:01 (1713536041) shut down Failover ost1 to oleg122-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... Started lustre-OST0000 10:14:17 (1713536057) targets are mounted 10:14:17 (1713536057) facet_failover done ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2480 1285208 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2052 1285636 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3580208 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3588568 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7168776 1% /mnt/lustre Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... test_70f failing OST 2 times Failing ost1 on oleg122-server Stopping /mnt/lustre-ost1 (opts:) on oleg122-server 10:14:31 (1713536071) shut down Failover ost1 to oleg122-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... Started lustre-OST0000 10:14:47 (1713536087) targets are mounted 10:14:47 (1713536087) facet_failover done ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2480 1285208 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2052 1285636 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3574016 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3584448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7158464 1% /mnt/lustre ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... test_70f failing OST 3 times Failing ost1 on oleg122-server Stopping /mnt/lustre-ost1 (opts:) on oleg122-server 10:15:02 (1713536102) shut down Failover ost1 to oleg122-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... Started lustre-OST0000 10:15:18 (1713536118) targets are mounted 10:15:18 (1713536118) facet_failover done ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2480 1285208 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2052 1285636 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3592688 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3596880 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7189568 1% /mnt/lustre ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... test_70f failing OST 4 times Failing ost1 on oleg122-server Stopping /mnt/lustre-ost1 (opts:) on oleg122-server 10:15:32 (1713536132) shut down Failover ost1 to oleg122-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... Started lustre-OST0000 10:15:49 (1713536149) targets are mounted 10:15:49 (1713536149) facet_failover done ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg122-client.virtnet' ... ldlm.namespaces.MGC192.168.201.122@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa134800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa134800.lru_size=clear PASS 70f (124s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 71a: mkdir/rmdir striped dir with 2 mdts recovery ========================================================== 10:15:59 (1713536159) Starting client oleg122-client.virtnet: -o user_xattr,flock oleg122-server@tcp:/lustre /mnt/lustre Started clients oleg122-client.virtnet: 192.168.201.122@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Started 7424 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2992 1284696 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3396 1284292 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3140 1284548 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3536 1284152 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre fail mds2 mds1 1 times Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 10:18:24 (1713536304) shut down Failover mds1 to oleg122-server Failover mds2 to oleg122-server mount facets: mds2 mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 10:18:39 (1713536319) targets are mounted 10:18:39 (1713536319) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 4256 1283432 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2864 1284824 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 4452 1283236 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3072 1284616 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre fail mds1 mds2 2 times Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 10:21:15 (1713536475) shut down Failover mds1 to oleg122-server mount facets: mds1 Failover mds2 to oleg122-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 10:21:41 (1713536501) targets are mounted 10:21:41 (1713536501) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec /home/green/git/lustre-release/lustre/tests/test-framework.sh: line 4661: 7424 Killed ( while true; do $LFS mkdir -i0 -c2 $DIR/$tdir/test; rmdir $DIR/$tdir/test; done ) (wd: ~) PASS 71a (364s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 73a: open(O_CREAT), unlink, replay, reconnect before open replay, close ========================================================== 10:22:05 (1713536525) multiop /mnt/lustre/f73a.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7346 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3884 1283804 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3252 1284436 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre fail_loc=0x80000302 Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 10:22:09 (1713536529) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 10:22:25 (1713536545) targets are mounted 10:22:25 (1713536545) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 73a (43s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 73b: open(O_CREAT), unlink, replay, reconnect at open_replay reply, close ========================================================== 10:22:50 (1713536570) multiop /mnt/lustre/f73b.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7346 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2444 1285244 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2020 1285668 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre fail_loc=0x80000157 Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 10:22:55 (1713536575) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 10:23:10 (1713536590) targets are mounted 10:23:10 (1713536590) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 73b (43s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 74: Ensure applications don't fail waiting for OST recovery ========================================================== 10:23:35 (1713536615) Stopping clients: oleg122-client.virtnet /mnt/lustre (opts:) Stopping client oleg122-client.virtnet /mnt/lustre opts: Stopping /mnt/lustre-ost1 (opts:) on oleg122-server Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 10:23:38 (1713536618) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 10:23:52 (1713536632) targets are mounted 10:23:52 (1713536632) facet_failover done Starting client oleg122-client.virtnet: -o user_xattr,flock oleg122-server@tcp:/lustre /mnt/lustre Started clients oleg122-client.virtnet: 192.168.201.122@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-OST0000 PASS 74 (30s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80a: DNE: create remote dir, drop update rep from MDT0, fail MDT0 ========================================================== 10:24:07 (1713536647) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2448 1285240 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 10:24:12 (1713536652) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 10:24:27 (1713536667) targets are mounted 10:24:27 (1713536667) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.10 seconds: 198.95 ops/second PASS 80a (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80b: DNE: create remote dir, drop update rep from MDT0, fail MDT1 ========================================================== 10:24:37 (1713536677) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2488 1285200 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2488 1285200 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 10:24:50 (1713536690) shut down Failover mds2 to oleg122-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 10:25:05 (1713536705) targets are mounted 10:25:05 (1713536705) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.10 seconds: 194.58 ops/second PASS 80b (36s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80c: DNE: create remote dir, drop update rep from MDT1, fail MDT[0,1] ========================================================== 10:25:15 (1713536715) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2524 1285164 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2096 1285592 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2524 1285164 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2096 1285592 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 10:25:22 (1713536722) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 10:25:38 (1713536738) targets are mounted 10:25:38 (1713536738) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 10:25:45 (1713536745) shut down Failover mds2 to oleg122-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 10:26:00 (1713536760) targets are mounted 10:26:00 (1713536760) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.05 seconds: 373.52 ops/second PASS 80c (53s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80d: DNE: create remote dir, drop update rep from MDT1, fail 2 MDTs ========================================================== 10:26:10 (1713536770) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2560 1285128 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2560 1285128 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 10:26:27 (1713536787) shut down Failover mds1 to oleg122-server mount facets: mds1 Failover mds2 to oleg122-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 10:26:48 (1713536808) targets are mounted 10:26:48 (1713536808) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.10 seconds: 192.72 ops/second PASS 80d (48s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80e: DNE: create remote dir, drop MDT1 rep, fail MDT0 ========================================================== 10:27:00 (1713536820) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2488 1285200 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 10:27:08 (1713536828) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 10:27:23 (1713536843) targets are mounted 10:27:23 (1713536843) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.10 seconds: 198.94 ops/second PASS 80e (31s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80f: DNE: create remote dir, drop MDT1 rep, fail MDT1 ========================================================== 10:27:34 (1713536854) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2488 1285200 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 10:27:39 (1713536859) shut down Failover mds2 to oleg122-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 10:27:54 (1713536874) targets are mounted 10:27:54 (1713536874) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.10 seconds: 195.42 ops/second PASS 80f (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80g: DNE: create remote dir, drop MDT1 rep, fail MDT0, then MDT1 ========================================================== 10:28:04 (1713536884) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2488 1285200 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2488 1285200 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 10:28:15 (1713536895) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 10:28:30 (1713536910) targets are mounted 10:28:30 (1713536910) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 10:28:38 (1713536918) shut down Failover mds2 to oleg122-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 10:28:53 (1713536933) targets are mounted 10:28:53 (1713536933) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.10 seconds: 202.94 ops/second PASS 80g (57s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80h: DNE: create remote dir, drop MDT1 rep, fail 2 MDTs ========================================================== 10:29:03 (1713536943) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2488 1285200 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2488 1285200 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 10:29:15 (1713536955) shut down Failover mds1 to oleg122-server mount facets: mds1 Failover mds2 to oleg122-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 10:29:41 (1713536981) targets are mounted 10:29:41 (1713536981) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.10 seconds: 195.23 ops/second PASS 80h (48s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81a: DNE: unlink remote dir, drop MDT0 update rep, fail MDT1 ========================================================== 10:29:53 (1713536993) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2484 1285204 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 10:30:03 (1713537003) shut down Failover mds2 to oleg122-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 10:30:18 (1713537018) targets are mounted 10:30:18 (1713537018) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81a (33s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81b: DNE: unlink remote dir, drop MDT0 update reply, fail MDT0 ========================================================== 10:30:28 (1713537028) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2520 1285168 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2064 1285624 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 10:30:33 (1713537033) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 10:30:49 (1713537049) targets are mounted 10:30:49 (1713537049) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81b (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81c: DNE: unlink remote dir, drop MDT0 update reply, fail MDT0,MDT1 ========================================================== 10:30:58 (1713537058) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2484 1285204 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2064 1285624 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2484 1285204 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2064 1285624 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 10:31:06 (1713537066) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 10:31:21 (1713537081) targets are mounted 10:31:21 (1713537081) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 10:31:29 (1713537089) shut down Failover mds2 to oleg122-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 10:31:44 (1713537104) targets are mounted 10:31:44 (1713537104) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81c (54s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81d: DNE: unlink remote dir, drop MDT0 update reply, fail 2 MDTs ========================================================== 10:31:54 (1713537114) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2520 1285168 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2068 1285620 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2520 1285168 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2068 1285620 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 10:32:09 (1713537129) shut down Failover mds1 to oleg122-server mount facets: mds1 Failover mds2 to oleg122-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 10:32:29 (1713537149) targets are mounted 10:32:29 (1713537149) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81d (44s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81e: DNE: unlink remote dir, drop MDT1 req reply, fail MDT0 ========================================================== 10:32:40 (1713537160) fail_loc=0x119 fail_loc=0 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2484 1285204 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2068 1285620 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 10:32:45 (1713537165) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 10:33:01 (1713537181) targets are mounted 10:33:01 (1713537181) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81e (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81f: DNE: unlink remote dir, drop MDT1 req reply, fail MDT1 ========================================================== 10:33:10 (1713537190) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2484 1285204 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2068 1285620 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 10:33:16 (1713537196) shut down Failover mds2 to oleg122-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 10:33:31 (1713537211) targets are mounted 10:33:31 (1713537211) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81f (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81g: DNE: unlink remote dir, drop req reply, fail M0, then M1 ========================================================== 10:33:41 (1713537221) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2484 1285204 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2068 1285620 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2484 1285204 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2068 1285620 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 10:33:48 (1713537228) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 10:34:04 (1713537244) targets are mounted 10:34:04 (1713537244) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 10:34:11 (1713537251) shut down Failover mds2 to oleg122-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 10:34:26 (1713537266) targets are mounted 10:34:26 (1713537266) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81g (53s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81h: DNE: unlink remote dir, drop request reply, fail 2 MDTs ========================================================== 10:34:36 (1713537276) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2484 1285204 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2068 1285620 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2484 1285204 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2068 1285620 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1852 3605168 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1828 3605192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3680 7210360 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 10:34:45 (1713537285) shut down Failover mds1 to oleg122-server mount facets: mds1 Failover mds2 to oleg122-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 10:35:11 (1713537311) targets are mounted 10:35:11 (1713537311) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81h (45s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 84a: stale open during export disconnect ========================================================== 10:35:23 (1713537323) fail_loc=0x80000144 total: 1 open/close in 0.01 seconds: 107.81 ops/second pdsh@oleg122-client: oleg122-client: ssh exited with exit code 5 PASS 84a (5s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 85a: check the cancellation of unused locks during recovery(IBITS) ========================================================== 10:35:31 (1713537331) before recovery: unused locks count = 201 Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 10:35:34 (1713537334) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 10:35:48 (1713537348) targets are mounted 10:35:48 (1713537348) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec after recovery: unused locks count = 101 PASS 85a (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 85b: check the cancellation of unused locks during recovery(EXTENT) ========================================================== 10:35:58 (1713537358) before recovery: unused locks count = 100 Failing ost1 on oleg122-server Stopping /mnt/lustre-ost1 (opts:) on oleg122-server 10:36:05 (1713537365) shut down Failover ost1 to oleg122-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-OST0000 10:36:20 (1713537380) targets are mounted 10:36:20 (1713537380) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec after recovery: unused locks count = 0 PASS 85b (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 86: umount server after clear nid_stats should not hit LBUG ========================================================== 10:36:28 (1713537388) Stopping clients: oleg122-client.virtnet /mnt/lustre (opts:) Stopping client oleg122-client.virtnet /mnt/lustre opts: mdt.lustre-MDT0000.exports.clear=0 mdt.lustre-MDT0001.exports.clear=0 Stopping /mnt/lustre-mds1 (opts:) on oleg122-server Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 Starting client oleg122-client.virtnet: -o user_xattr,flock oleg122-server@tcp:/lustre /mnt/lustre Started clients oleg122-client.virtnet: 192.168.201.122@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) PASS 86 (13s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 87a: write replay ================== 10:36:44 (1713537404) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2480 1285208 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2072 1285616 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 2052 3604968 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 4080 7209960 1% /mnt/lustre 8+0 records in 8+0 records out 8388608 bytes (8.4 MB) copied, 0.16187 s, 51.8 MB/s Failing ost1 on oleg122-server Stopping /mnt/lustre-ost1 (opts:) on oleg122-server 10:36:49 (1713537409) shut down Failover ost1 to oleg122-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-OST0000 10:37:05 (1713537425) targets are mounted 10:37:05 (1713537425) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec 8+0 records in 8+0 records out 8388608 bytes (8.4 MB) copied, 0.0704851 s, 119 MB/s PASS 87a (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 87b: write replay with changed data (checksum resend) ========================================================== 10:37:13 (1713537433) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2480 1285208 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2072 1285616 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 10244 3596776 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 12272 7201768 1% /mnt/lustre 8+0 records in 8+0 records out 8388608 bytes (8.4 MB) copied, 0.160709 s, 52.2 MB/s 8+0 records in 8+0 records out 8 bytes (8 B) copied, 0.00264116 s, 3.0 kB/s Failing ost1 on oleg122-server Stopping /mnt/lustre-ost1 (opts:) on oleg122-server 10:37:19 (1713537439) shut down Failover ost1 to oleg122-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-OST0000 10:37:35 (1713537455) targets are mounted 10:37:35 (1713537455) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec 0+1 records in 0+1 records out 72 bytes (72 B) copied, 0.00540259 s, 13.3 kB/s PASS 87b (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 88: MDS should not assign same objid to different files ========================================================== 10:37:43 (1713537463) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2484 1285204 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2072 1285616 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 10248 3596772 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 12276 7201764 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2484 1285204 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2072 1285616 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 10248 3596772 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 12276 7201764 1% /mnt/lustre before test: last_id = 16577, next_id = 16548 Creating to objid 16577 on ost lustre-OST0000... total: 31 open/close in 0.16 seconds: 198.19 ops/second total: 8 open/close in 0.04 seconds: 207.12 ops/second before recovery: last_id = 16609, next_id = 16586 Stopping /mnt/lustre-mds1 (opts:) on oleg122-server Stopping /mnt/lustre-ost1 (opts:) on oleg122-server Failover mds1 to oleg122-server Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 Failover ost1 to oleg122-server Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-OST0000 after recovery: last_id = 16617, next_id = 16586 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0423488 s, 12.4 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0391091 s, 13.4 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0341811 s, 15.3 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0299126 s, 17.5 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0274386 s, 19.1 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.041647 s, 12.6 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0429511 s, 12.2 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0413515 s, 12.7 MB/s -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16548 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16549 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16550 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16551 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16552 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16553 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16554 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16555 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16556 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16557 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16558 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16559 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16560 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16561 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16562 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16563 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16564 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16565 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16566 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16567 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16568 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16569 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16570 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16571 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16572 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16573 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16574 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16575 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16576 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16577 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16578 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16579 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16580 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16581 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16582 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16583 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16584 -rw-r--r-- 1 root root 0 Apr 19 10:37 /mnt/lustre/d88.replay-single/f-16585 -rw-r--r-- 1 root root 524288 Apr 19 10:38 /mnt/lustre/d88.replay-single/f-16589 -rw-r--r-- 1 root root 524288 Apr 19 10:38 /mnt/lustre/d88.replay-single/f-16590 -rw-r--r-- 1 root root 524288 Apr 19 10:38 /mnt/lustre/d88.replay-single/f-16591 -rw-r--r-- 1 root root 524288 Apr 19 10:38 /mnt/lustre/d88.replay-single/f-16592 -rw-r--r-- 1 root root 524288 Apr 19 10:38 /mnt/lustre/d88.replay-single/f-16593 -rw-r--r-- 1 root root 524288 Apr 19 10:38 /mnt/lustre/d88.replay-single/f-16594 -rw-r--r-- 1 root root 524288 Apr 19 10:38 /mnt/lustre/d88.replay-single/f-16595 -rw-r--r-- 1 root root 524288 Apr 19 10:38 /mnt/lustre/d88.replay-single/f-16596 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0514714 s, 10.2 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0466646 s, 11.2 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.039044 s, 13.4 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0512129 s, 10.2 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0378828 s, 13.8 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0384305 s, 13.6 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0383317 s, 13.7 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.042218 s, 12.4 MB/s PASS 88 (58s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 89: no disk space leak on late ost connection ========================================================== 10:38:43 (1713537523) Waiting for orphan cleanup... osp.lustre-OST0000-osc-MDT0000.old_sync_processed osp.lustre-OST0000-osc-MDT0001.old_sync_processed osp.lustre-OST0001-osc-MDT0000.old_sync_processed osp.lustre-OST0001-osc-MDT0001.old_sync_processed wait 40 secs maximumly for oleg122-server mds-ost sync done. Waiting for MDT destroys to complete 10+0 records in 10+0 records out 10485760 bytes (10 MB) copied, 0.0811845 s, 129 MB/s Stopping /mnt/lustre-ost1 (opts:) on oleg122-server Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 10:38:51 (1713537531) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 10:39:05 (1713537545) targets are mounted 10:39:05 (1713537545) facet_failover done Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-OST0000 Starting client: oleg122-client.virtnet: -o user_xattr,flock oleg122-server@tcp:/lustre /mnt/lustre osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 68 sec Waiting for orphan cleanup... osp.lustre-OST0000-osc-MDT0000.old_sync_processed osp.lustre-OST0000-osc-MDT0001.old_sync_processed osp.lustre-OST0001-osc-MDT0000.old_sync_processed osp.lustre-OST0001-osc-MDT0001.old_sync_processed wait 40 secs maximumly for oleg122-server mds-ost sync done. Waiting for MDT destroys to complete free_before: 7645764 free_after: 7645764 PASS 89 (111s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 90: lfs find identifies the missing striped file segments ========================================================== 10:40:36 (1713537636) Create the files Fail ost1 lustre-OST0000_UUID, display the list of affected files Stopping /mnt/lustre-ost1 (opts:) on oleg122-server General Query: lfs find /mnt/lustre/d90.replay-single /mnt/lustre/d90.replay-single /mnt/lustre/d90.replay-single/f0 /mnt/lustre/d90.replay-single/f1 /mnt/lustre/d90.replay-single/all Querying files on shutdown ost1: lfs find --obd lustre-OST0000_UUID /mnt/lustre/d90.replay-single/f0 /mnt/lustre/d90.replay-single/all Check getstripe: /home/green/git/lustre-release/lustre/utils/lfs getstripe -r --obd lustre-OST0000_UUID /mnt/lustre/d90.replay-single/f0 lmm_stripe_count: 1 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 0 obdidx objid objid group 0 16619 0x40eb 0x280000402 * /mnt/lustre/d90.replay-single/all lmm_stripe_count: 2 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 1 obdidx objid objid group 0 16618 0x40ea 0x280000402 * /mnt/lustre/d90.replay-single/all /mnt/lustre/d90.replay-single/f0 Failover ost1 to oleg122-server Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-OST0000 PASS 90 (21s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 93a: replay + reconnect ============ 10:40:59 (1713537659) 1+0 records in 1+0 records out 1024 bytes (1.0 kB) copied, 0.0026266 s, 390 kB/s fail_val=40 fail_loc=0x715 Failing ost1 on oleg122-server Stopping /mnt/lustre-ost1 (opts:) on oleg122-server 10:41:02 (1713537662) shut down Failover ost1 to oleg122-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-OST0000 10:41:16 (1713537676) targets are mounted 10:41:16 (1713537676) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec PASS 93a (61s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 93b: replay + reconnect on mds ===== 10:42:02 (1713537722) total: 20 open/close in 0.15 seconds: 135.65 ops/second fail_val=80 fail_loc=0x715 Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 10:42:05 (1713537725) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 10:42:19 (1713537739) targets are mounted 10:42:19 (1713537739) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 93b (105s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 100a: DNE: create striped dir, drop update rep from MDT1, fail MDT1 ========================================================== 10:43:49 (1713537829) fail_loc=0x1701 Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 10:43:51 (1713537831) shut down Failover mds2 to oleg122-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 10:44:05 (1713537845) targets are mounted 10:44:05 (1713537845) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec lmv_stripe_count: 2 lmv_stripe_offset: 0 lmv_hash_type: crush mdtidx FID[seq:oid:ver] 0 [0x2000347f0:0x1:0x0] 1 [0x24000a42a:0x1:0x0] total: 20 open/close in 0.10 seconds: 195.05 ops/second PASS 100a (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 100b: DNE: create striped dir, fail MDT0 ========================================================== 10:44:15 (1713537855) fail_loc=0x119 Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 10:44:18 (1713537858) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 10:44:32 (1713537872) targets are mounted 10:44:32 (1713537872) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec lmv_stripe_count: 2 lmv_stripe_offset: 0 lmv_hash_type: crush mdtidx FID[seq:oid:ver] 0 [0x2000347f0:0x4:0x0] 1 [0x24000a42a:0x4:0x0] total: 20 open/close in 0.10 seconds: 191.24 ops/second PASS 100b (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 100c: DNE: create striped dir, abort_recov_mdt mds2 ========================================================== 10:44:42 (1713537882) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2576 1285112 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2148 1285540 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3588576 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7193568 1% /mnt/lustre Stopping /mnt/lustre-mds2 (opts:) on oleg122-server Failover mds2 to oleg122-server Starting mds2: -o localrecov -o abort_recov_mdt /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 total: 20 open/close in 0.10 seconds: 193.46 ops/second Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 10:45:05 (1713537905) shut down Failover mds2 to oleg122-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 10:45:19 (1713537919) targets are mounted 10:45:19 (1713537919) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec lmv_stripe_count: 2 lmv_stripe_offset: 1 lmv_hash_type: crush mdtidx FID[seq:oid:ver] 1 [0x24000abf8:0x3:0x0] 0 [0x2000347f1:0x3:0x0] total: 20 open/close in 0.10 seconds: 209.51 ops/second PASS 100c (45s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 100d: DNE: cancel update logs upon recovery abort ========================================================== 10:45:29 (1713537929) striped dir -i0 -c2 -H crush2 /mnt/lustre/d100d.replay-single total: 100 mkdir in 0.48 seconds: 206.53 ops/second lustre-MDT0001-osd [catalog]: [0x24000b3c8:0x3:0x0] [index]: 00001 [logid]: [0x24000bb98:0x2:0x0] lustre-MDT0000-osp-MDT0001 [catalog]: [0x200034fc1:0x3:0x0] [index]: 00001 [logid]: [0x200034fc2:0x2:0x0] [index]: 00002 [logid]: [0x200034fc2:0x3:0x0] Stopping /mnt/lustre-mds2 (opts:) on oleg122-server Failover mds2 to oleg122-server Starting mds2: -o localrecov -o abort_recovery /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 find: '/mnt/lustre/d100d.replay-single': No such file or directory find: '/mnt/lustre/d100d.replay-single': No such file or directory PASS 100d (22s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 100e: DNE: create striped dir on MDT0 and MDT1, fail MDT0, MDT1 ========================================================== 10:45:54 (1713537954) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2992 1284696 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2528 1285160 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3588576 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7193568 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2996 1284692 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2528 1285160 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3588576 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7193568 1% /mnt/lustre lmv_stripe_count: 2 lmv_stripe_offset: 0 lmv_hash_type: crush mdtidx FID[seq:oid:ver] 0 [0x200034fc0:0x3a:0x0] 1 [0x24000abf9:0x3a:0x0] Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 10:46:03 (1713537963) shut down Failover mds1 to oleg122-server mount facets: mds1 Failover mds2 to oleg122-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 10:46:28 (1713537988) targets are mounted 10:46:28 (1713537988) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec lmv_stripe_count: 2 lmv_stripe_offset: 0 lmv_hash_type: crush mdtidx FID[seq:oid:ver] 0 [0x200034fc0:0x3a:0x0] 1 [0x24000abf9:0x3a:0x0] PASS 100e (44s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 101: Shouldn't reassign precreated objs to other files after recovery ========================================================== 10:46:40 (1713538000) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3040 1284648 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2564 1285124 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3588576 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7193568 1% /mnt/lustre Stopping /mnt/lustre-mds1 (opts:) on oleg122-server Failover mds1 to oleg122-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 PASS 101 (51s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 102a: check resend (request lost) with multiple modify RPCs in flight ========================================================== 10:47:34 (1713538054) creating 7 files ... fail_loc=0x159 launch 7 chmod in parallel (10:47:35) ... fail_loc=0 done (10:47:51) /mnt/lustre/d102a.replay-single/file-1 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-2 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-3 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-4 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-5 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-6 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-7 has perms 0600 OK PASS 102a (18s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 102b: check resend (reply lost) with multiple modify RPCs in flight ========================================================== 10:47:55 (1713538075) creating 7 files ... fail_loc=0x15a launch 7 chmod in parallel (10:47:56) ... fail_loc=0 done (10:48:12) /mnt/lustre/d102b.replay-single/file-1 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-2 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-3 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-4 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-5 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-6 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-7 has perms 0600 OK PASS 102b (19s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 102c: check replay w/o reconstruction with multiple mod RPCs in flight ========================================================== 10:48:16 (1713538096) creating 7 files ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3208 1284480 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2636 1285052 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3580372 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3596788 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7177160 1% /mnt/lustre fail_loc=0x15a launch 7 chmod in parallel (10:48:20) ... fail_loc=0 Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 10:48:22 (1713538102) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 10:48:38 (1713538118) targets are mounted 10:48:38 (1713538118) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec done (10:48:44) /mnt/lustre/d102c.replay-single/file-1 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-2 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-3 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-4 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-5 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-6 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-7 has perms 0600 OK PASS 102c (30s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 102d: check replay & reconstruction with multiple mod RPCs in flight ========================================================== 10:48:48 (1713538128) creating 7 files ... fail_loc=0x15a launch 7 chmod in parallel (10:48:49) ... fail_loc=0 Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 10:48:52 (1713538132) shut down Failover mds2 to oleg122-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 10:49:06 (1713538146) targets are mounted 10:49:06 (1713538146) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec done (10:49:12) /mnt/lustre/d102d.replay-single/file-1 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-2 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-3 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-4 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-5 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-6 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-7 has perms 0600 OK PASS 102d (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 103: Check otr_next_id overflow ==== 10:49:16 (1713538156) fail_loc=0x80000162 total: 30 open/close in 0.15 seconds: 197.91 ops/second Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 10:49:19 (1713538159) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 10:49:33 (1713538173) targets are mounted 10:49:33 (1713538173) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 103 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110a: DNE: create striped dir, fail MDT1 ========================================================== 10:49:43 (1713538183) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3156 1284532 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2640 1285048 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3580372 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3596788 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7177160 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 10:49:48 (1713538188) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 10:50:03 (1713538203) targets are mounted 10:50:03 (1713538203) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d110a.replay-single/striped_dir has type dir OK PASS 110a (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110b: DNE: create striped dir, fail MDT1 and client ========================================================== 10:50:13 (1713538213) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3160 1284528 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2644 1285044 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3580372 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3596788 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7177160 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 10:50:18 (1713538218) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 10:50:34 (1713538234) targets are mounted 10:50:34 (1713538234) facet_failover done pdsh@oleg122-client: oleg122-client: ssh exited with exit code 95 oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid pdsh@oleg122-client: oleg122-client: ssh exited with exit code 95 Starting client: oleg122-client.virtnet: -o user_xattr,flock oleg122-server@tcp:/lustre /mnt/lustre /mnt/lustre/d110b.replay-single/striped_dir has type dir OK PASS 110b (98s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110c: DNE: create striped dir, fail MDT2 ========================================================== 10:51:53 (1713538313) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3164 1284524 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2644 1285044 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3588576 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7193568 1% /mnt/lustre Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 10:51:58 (1713538318) shut down Failover mds2 to oleg122-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 10:52:13 (1713538333) targets are mounted 10:52:13 (1713538333) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d110c.replay-single/striped_dir has type dir OK PASS 110c (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110d: DNE: create striped dir, fail MDT2 and client ========================================================== 10:52:23 (1713538343) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3204 1284484 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2688 1285000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3588576 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7193568 1% /mnt/lustre Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 10:52:28 (1713538348) shut down Failover mds2 to oleg122-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 10:52:44 (1713538364) targets are mounted 10:52:44 (1713538364) facet_failover done pdsh@oleg122-client: oleg122-client: ssh exited with exit code 95 oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid pdsh@oleg122-client: oleg122-client: ssh exited with exit code 95 Starting client: oleg122-client.virtnet: -o user_xattr,flock oleg122-server@tcp:/lustre /mnt/lustre /mnt/lustre/d110d.replay-single/striped_dir has type dir OK PASS 110d (96s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110e: DNE: create striped dir, uncommit on MDT2, fail client/MDT1/MDT2 ========================================================== 10:54:01 (1713538441) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3152 1284536 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2636 1285052 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3588576 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7193568 1% /mnt/lustre df:/mnt/lustre Not a Lustre filesystem Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 10:54:11 (1713538451) shut down Failover mds1 to oleg122-server mount facets: mds1 Failover mds2 to oleg122-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 10:54:37 (1713538477) targets are mounted 10:54:37 (1713538477) facet_failover done pdsh@oleg122-client: oleg122-client: ssh exited with exit code 95 oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid pdsh@oleg122-client: oleg122-client: ssh exited with exit code 95 Starting client: oleg122-client.virtnet: -o user_xattr,flock oleg122-server@tcp:/lustre /mnt/lustre /mnt/lustre/d110e.replay-single/striped_dir has type dir OK PASS 110e (111s) debug_raw_pointers=0 debug_raw_pointers=0 SKIP: replay-single test_110f skipping excluded test 110f debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110g: DNE: create striped dir, uncommit on MDT1, fail client/MDT1/MDT2 ========================================================== 10:55:55 (1713538555) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3192 1284496 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2672 1285016 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3588576 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7193568 1% /mnt/lustre df:/mnt/lustre Not a Lustre filesystem Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 10:56:04 (1713538564) shut down Failover mds1 to oleg122-server mount facets: mds1 Failover mds2 to oleg122-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 10:56:30 (1713538590) targets are mounted 10:56:30 (1713538590) facet_failover done pdsh@oleg122-client: oleg122-client: ssh exited with exit code 95 oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid pdsh@oleg122-client: oleg122-client: ssh exited with exit code 95 Starting client: oleg122-client.virtnet: -o user_xattr,flock oleg122-server@tcp:/lustre /mnt/lustre /mnt/lustre/d110g.replay-single/striped_dir has type dir OK PASS 110g (112s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111a: DNE: unlink striped dir, fail MDT1 ========================================================== 10:57:49 (1713538669) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3168 1284520 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2636 1285052 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3588576 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7193568 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 10:57:54 (1713538674) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 10:58:09 (1713538689) targets are mounted 10:58:09 (1713538689) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/d111a.replay-single/striped_dir: No such file or directory PASS 111a (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111b: DNE: unlink striped dir, fail MDT2 ========================================================== 10:58:19 (1713538699) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3172 1284516 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2656 1285032 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3588576 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7193568 1% /mnt/lustre Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 10:58:24 (1713538704) shut down Failover mds2 to oleg122-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 10:58:39 (1713538719) targets are mounted 10:58:40 (1713538720) facet_failover done pdsh@oleg122-client: oleg122-client: ssh exited with exit code 95 oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid pdsh@oleg122-client: oleg122-client: ssh exited with exit code 95 Starting client: oleg122-client.virtnet: -o user_xattr,flock oleg122-server@tcp:/lustre /mnt/lustre Can't lstat /mnt/lustre/d111b.replay-single/striped_dir: No such file or directory PASS 111b (96s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111c: DNE: unlink striped dir, uncommit on MDT1, fail client/MDT1/MDT2 ========================================================== 10:59:57 (1713538797) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3168 1284520 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2640 1285048 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3588576 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7193568 1% /mnt/lustre df:/mnt/lustre Not a Lustre filesystem Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 11:00:12 (1713538812) shut down Failover mds1 to oleg122-server mount facets: mds1 Failover mds2 to oleg122-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 11:00:32 (1713538832) targets are mounted 11:00:32 (1713538832) facet_failover done pdsh@oleg122-client: oleg122-client: ssh exited with exit code 95 oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid pdsh@oleg122-client: oleg122-client: ssh exited with exit code 95 Starting client: oleg122-client.virtnet: -o user_xattr,flock oleg122-server@tcp:/lustre /mnt/lustre Can't lstat /mnt/lustre/d111c.replay-single/striped_dir: No such file or directory PASS 111c (111s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111d: DNE: unlink striped dir, uncommit on MDT2, fail client/MDT1/MDT2 ========================================================== 11:01:51 (1713538911) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3176 1284512 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2604 1285084 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3588576 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7193568 1% /mnt/lustre df:/mnt/lustre Not a Lustre filesystem Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 11:02:00 (1713538920) shut down Failover mds1 to oleg122-server mount facets: mds1 Failover mds2 to oleg122-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 11:02:26 (1713538946) targets are mounted 11:02:26 (1713538946) facet_failover done pdsh@oleg122-client: oleg122-client: ssh exited with exit code 95 oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid pdsh@oleg122-client: oleg122-client: ssh exited with exit code 95 Starting client: oleg122-client.virtnet: -o user_xattr,flock oleg122-server@tcp:/lustre /mnt/lustre Can't lstat /mnt/lustre/d111d.replay-single/striped_dir: No such file or directory PASS 111d (111s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111e: DNE: unlink striped dir, uncommit on MDT2, fail MDT1/MDT2 ========================================================== 11:03:43 (1713539023) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3172 1284516 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2688 1285000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3588576 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7193568 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3212 1284476 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2688 1285000 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3588576 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7193568 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 11:03:52 (1713539032) shut down Failover mds1 to oleg122-server mount facets: mds1 Failover mds2 to oleg122-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 11:04:18 (1713539058) targets are mounted 11:04:18 (1713539058) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/d111e.replay-single/striped_dir: No such file or directory PASS 111e (42s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111f: DNE: unlink striped dir, uncommit on MDT1, fail MDT1/MDT2 ========================================================== 11:04:28 (1713539068) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3180 1284508 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2660 1285028 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3588576 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7193568 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3180 1284508 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2648 1285040 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3588576 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7193568 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 11:04:37 (1713539077) shut down Failover mds1 to oleg122-server mount facets: mds1 Failover mds2 to oleg122-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 11:05:04 (1713539104) targets are mounted 11:05:04 (1713539104) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/d111f.replay-single/striped_dir: No such file or directory PASS 111f (52s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111g: DNE: unlink striped dir, fail MDT1/MDT2 ========================================================== 11:05:23 (1713539123) UUID Inodes IUsed IFree IUse% Mounted on lustre-MDT0000_UUID 1024000 556 1023444 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1024000 320 1023680 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 262144 590 261554 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 262144 526 261618 1% /mnt/lustre[OST:1] filesystem_summary: 524048 876 523172 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3188 1284500 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2656 1285032 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3588576 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7193568 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3188 1284500 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2656 1285032 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3588576 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7193568 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 11:05:32 (1713539132) shut down Failover mds1 to oleg122-server mount facets: mds1 Failover mds2 to oleg122-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 11:05:58 (1713539158) targets are mounted 11:05:58 (1713539158) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/d111g.replay-single/striped_dir: No such file or directory PASS 111g (44s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112a: DNE: cross MDT rename, fail MDT1 ========================================================== 11:06:09 (1713539169) SKIP: replay-single test_112a needs >= 4 MDTs SKIP 112a (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112b: DNE: cross MDT rename, fail MDT2 ========================================================== 11:06:13 (1713539173) SKIP: replay-single test_112b needs >= 4 MDTs SKIP 112b (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112c: DNE: cross MDT rename, fail MDT3 ========================================================== 11:06:16 (1713539176) SKIP: replay-single test_112c needs >= 4 MDTs SKIP 112c (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112d: DNE: cross MDT rename, fail MDT4 ========================================================== 11:06:19 (1713539179) SKIP: replay-single test_112d needs >= 4 MDTs SKIP 112d (2s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112e: DNE: cross MDT rename, fail MDT1 and MDT2 ========================================================== 11:06:23 (1713539183) SKIP: replay-single test_112e needs >= 4 MDTs SKIP 112e (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112f: DNE: cross MDT rename, fail MDT1 and MDT3 ========================================================== 11:06:26 (1713539186) SKIP: replay-single test_112f needs >= 4 MDTs SKIP 112f (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112g: DNE: cross MDT rename, fail MDT1 and MDT4 ========================================================== 11:06:30 (1713539190) SKIP: replay-single test_112g needs >= 4 MDTs SKIP 112g (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112h: DNE: cross MDT rename, fail MDT2 and MDT3 ========================================================== 11:06:33 (1713539193) SKIP: replay-single test_112h needs >= 4 MDTs SKIP 112h (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112i: DNE: cross MDT rename, fail MDT2 and MDT4 ========================================================== 11:06:36 (1713539196) SKIP: replay-single test_112i needs >= 4 MDTs SKIP 112i (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112j: DNE: cross MDT rename, fail MDT3 and MDT4 ========================================================== 11:06:40 (1713539200) SKIP: replay-single test_112j needs >= 4 MDTs SKIP 112j (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112k: DNE: cross MDT rename, fail MDT1,MDT2,MDT3 ========================================================== 11:06:43 (1713539203) SKIP: replay-single test_112k needs >= 4 MDTs SKIP 112k (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112l: DNE: cross MDT rename, fail MDT1,MDT2,MDT4 ========================================================== 11:06:46 (1713539206) SKIP: replay-single test_112l needs >= 4 MDTs SKIP 112l (2s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112m: DNE: cross MDT rename, fail MDT1,MDT3,MDT4 ========================================================== 11:06:50 (1713539210) SKIP: replay-single test_112m needs >= 4 MDTs SKIP 112m (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112n: DNE: cross MDT rename, fail MDT2,MDT3,MDT4 ========================================================== 11:06:53 (1713539213) SKIP: replay-single test_112n needs >= 4 MDTs SKIP 112n (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 115: failover for create/unlink striped directory ========================================================== 11:06:57 (1713539217) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3184 1284504 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2644 1285044 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3588576 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7193568 1% /mnt/lustre striped dir -i1 -c2 -H all_char /mnt/lustre/d115.replay-single/test_0 striped dir -i1 -c2 -H crush2 /mnt/lustre/d115.replay-single/test_1 striped dir -i1 -c2 -H crush /mnt/lustre/d115.replay-single/test_2 striped dir -i1 -c2 -H all_char /mnt/lustre/d115.replay-single/test_3 striped dir -i1 -c2 -H crush2 /mnt/lustre/d115.replay-single/test_4 Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 11:07:02 (1713539222) shut down Failover mds2 to oleg122-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 11:07:17 (1713539237) targets are mounted 11:07:17 (1713539237) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3236 1284452 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2712 1284976 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3588576 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7193568 1% /mnt/lustre striped dir -i0 -c2 -H fnv_1a_64 /mnt/lustre/d115.replay-single/test_0 striped dir -i0 -c2 -H crush2 /mnt/lustre/d115.replay-single/test_1 striped dir -i0 -c2 -H fnv_1a_64 /mnt/lustre/d115.replay-single/test_2 striped dir -i0 -c2 -H crush /mnt/lustre/d115.replay-single/test_3 striped dir -i0 -c2 -H fnv_1a_64 /mnt/lustre/d115.replay-single/test_4 Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 11:07:28 (1713539248) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 11:07:43 (1713539263) targets are mounted 11:07:43 (1713539263) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 115 (54s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 116a: large update log master MDT recovery ========================================================== 11:07:54 (1713539274) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3276 1284412 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2776 1284912 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3588576 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7193568 1% /mnt/lustre fail_loc=0x80001702 Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 11:07:59 (1713539279) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 11:08:15 (1713539295) targets are mounted 11:08:15 (1713539295) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d116a.replay-single/striped_dir has type dir OK PASS 116a (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 116b: large update log slave MDT recovery ========================================================== 11:08:25 (1713539305) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3328 1284360 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2948 1284740 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3588576 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7193568 1% /mnt/lustre fail_loc=0x80001702 Failing mds2 on oleg122-server Stopping /mnt/lustre-mds2 (opts:) on oleg122-server 11:08:30 (1713539310) shut down Failover mds2 to oleg122-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0001 11:08:45 (1713539325) targets are mounted 11:08:45 (1713539325) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d116b.replay-single/striped_dir has type dir OK PASS 116b (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 117: DNE: cross MDT unlink, fail MDT1 and MDT2 ========================================================== 11:08:55 (1713539335) SKIP: replay-single test_117 needs >= 4 MDTs SKIP 117 (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 118: invalidate osp update will not cause update log corruption ========================================================== 11:08:59 (1713539339) fail_loc=0x1705 lfs setdirstripe: dirstripe error on '/mnt/lustre/d118.replay-single/striped_dir': Input/output error lfs setdirstripe: cannot create dir '/mnt/lustre/d118.replay-single/striped_dir': Input/output error lfs setdirstripe: dirstripe error on '/mnt/lustre/d118.replay-single/striped_dir1': Input/output error lfs setdirstripe: cannot create dir '/mnt/lustre/d118.replay-single/striped_dir1': Input/output error fail_loc=0x0 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3424 1284264 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2828 1284860 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3588576 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7193568 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 11:09:04 (1713539344) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 11:09:20 (1713539360) targets are mounted 11:09:20 (1713539360) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 118 (29s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 119: timeout of normal replay does not cause DNE replay fails ========================================================== 11:09:30 (1713539370) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3492 1284196 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2888 1284800 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3588576 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7193568 1% /mnt/lustre Stopping /mnt/lustre-mds1 (opts:) on oleg122-server Failover mds1 to oleg122-server fail_loc=0x80000714 fail_val=65 Starting mds1: -o localrecov -o recovery_time_hard=60 /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 oleg122-client.virtnet: executing wait_import_state_mount FULL mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 67 sec mdt.lustre-MDT0000.recovery_time_hard=180 PASS 119 (86s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 120: DNE fail abort should stop both normal and DNE replay ========================================================== 11:10:58 (1713539458) Replay barrier on lustre-MDT0000 Stopping /mnt/lustre-mds1 (opts:) on oleg122-server Failover mds1 to oleg122-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 find: '/mnt/lustre/d120.replay-single': No such file or directory PASS 120 (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 121: lock replay timed out and race ========================================================== 11:11:28 (1713539488) multiop /mnt/lustre/f121.replay-single vs_s TMPPIPE=/tmp/multiop_open_wait_pipe.7346 Stopping /mnt/lustre-mds1 (opts:) on oleg122-server Failover mds1 to oleg122-server fail_loc=0x721 fail_val=0 at_max=0 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 pdsh@oleg122-client: oleg122-client: ssh exited with exit code 5 fail_loc=0x0 at_max=600 PASS 121 (200s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 130a: DoM file create (setstripe) replay ========================================================== 11:14:51 (1713539691) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3884 1283804 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3192 1284496 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3588576 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7193568 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 11:14:55 (1713539695) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 11:15:11 (1713539711) targets are mounted 11:15:11 (1713539711) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 130a (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 130b: DoM file create (inherited) replay ========================================================== 11:15:21 (1713539721) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3896 1283792 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3192 1284496 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3588576 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7193568 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 11:15:26 (1713539726) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 11:15:42 (1713539742) targets are mounted 11:15:42 (1713539742) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 130b (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 131a: DoM file write lock replay === 11:15:52 (1713539752) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3896 1283792 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3192 1284496 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3588576 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7193568 1% /mnt/lustre 1+0 records in 1+0 records out 8 bytes (8 B) copied, 0.00233938 s, 3.4 kB/s Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 11:15:57 (1713539757) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 11:16:12 (1713539772) targets are mounted 11:16:12 (1713539772) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 131a (28s) debug_raw_pointers=0 debug_raw_pointers=0 SKIP: replay-single test_131b skipping excluded test 131b debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 132a: PFL new component instantiate replay ========================================================== 11:16:23 (1713539783) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3900 1283788 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3192 1284496 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3588576 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 2028 3604992 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20472 7193568 1% /mnt/lustre 1+0 records in 1+0 records out 1048576 bytes (1.0 MB) copied, 0.0284701 s, 36.8 MB/s /mnt/lustre/f132a.replay-single lcm_layout_gen: 3 lcm_mirror_count: 1 lcm_entry_count: 2 lcme_id: 1 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 0 lcme_extent.e_end: 1048576 lmm_stripe_count: 1 lmm_stripe_size: 1048576 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 0 lmm_objects: - 0: { l_ost_idx: 0, l_fid: [0x280000402:0x480a:0x0] } lcme_id: 2 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 1048576 lcme_extent.e_end: EOF lmm_stripe_count: 2 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 1 lmm_objects: - 0: { l_ost_idx: 1, l_fid: [0x2c0000402:0x4802:0x0] } - 1: { l_ost_idx: 0, l_fid: [0x280000402:0x480b:0x0] } Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 11:16:28 (1713539788) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 11:16:44 (1713539804) targets are mounted 11:16:44 (1713539804) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f132a.replay-single lcm_layout_gen: 1 lcm_mirror_count: 1 lcm_entry_count: 2 lcme_id: 1 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 0 lcme_extent.e_end: 1048576 lmm_stripe_count: 1 lmm_stripe_size: 1048576 lmm_pattern: raid0 lmm_layout_gen: 65535 lmm_stripe_offset: 0 lmm_objects: - 0: { l_ost_idx: 0, l_fid: [0x280000402:0x480a:0x0] } lcme_id: 2 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 1048576 lcme_extent.e_end: EOF lmm_stripe_count: 2 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 65535 lmm_stripe_offset: 1 lmm_objects: - 0: { l_ost_idx: 1, l_fid: [0x2c0000402:0x4802:0x0] } - 1: { l_ost_idx: 0, l_fid: [0x280000402:0x480b:0x0] } PASS 132a (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 133: check resend of ongoing requests for lwp during failover ========================================================== 11:16:54 (1713539814) seq.srv-lustre-MDT0001.space=clear Starting client: oleg122-client.virtnet: -o user_xattr,flock oleg122-server@tcp:/lustre /mnt/lustre fail_val=700 fail_loc=0x80000123 Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 11:16:59 (1713539819) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 11:17:13 (1713539833) targets are mounted 11:17:13 (1713539833) facet_failover done PASS 133 (23s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 134: replay creation of a file created in a pool ========================================================== 11:17:19 (1713539839) Creating new pool oleg122-server: Pool lustre.pool_134 created Adding targets to pool oleg122-server: OST lustre-OST0001_UUID added to pool lustre.pool_134 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3984 1283704 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3236 1284452 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3588576 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 3052 3603968 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 21496 7192544 1% /mnt/lustre Failing mds1 on oleg122-server Stopping /mnt/lustre-mds1 (opts:) on oleg122-server 11:17:29 (1713539849) shut down Failover mds1 to oleg122-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-MDT0000 11:17:45 (1713539865) targets are mounted 11:17:45 (1713539865) facet_failover done oleg122-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Destroy the created pools: pool_134 lustre.pool_134 oleg122-server: OST lustre-OST0001_UUID removed from pool lustre.pool_134 oleg122-server: Pool lustre.pool_134 destroyed PASS 134 (40s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 135: Server failure in lock replay phase ========================================================== 11:18:01 (1713539881) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3964 1283724 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3236 1284452 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18444 3588576 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 3052 3603968 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 21496 7192544 1% /mnt/lustre ldlm.cancel_unused_locks_before_replay=0 Stopping /mnt/lustre-ost1 (opts:) on oleg122-server Failover ost1 to oleg122-server oleg122-server: oleg122-server.virtnet: executing load_module ../libcfs/libcfs/libcfs fail_loc=0x32d fail_val=20 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-OST0000 oleg122-client.virtnet: executing wait_import_state_mount REPLAY_LOCKS osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in REPLAY_LOCKS state after 0 sec Stopping /mnt/lustre-ost1 (opts:) on oleg122-server Failover ost1 to oleg122-server oleg122-server: oleg122-server.virtnet: executing load_module ../libcfs/libcfs/libcfs fail_loc=0 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all End of sync pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-OST0000 Stopping /mnt/lustre-ost1 (opts:-f) on oleg122-server Stopping /mnt/lustre-ost2 (opts:-f) on oleg122-server Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-OST0000 Starting ost2: -o localrecov /dev/mapper/ost2_flakey /mnt/lustre-ost2 seq.cli-lustre-OST0001-super.width=65536 oleg122-server: oleg122-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg122-client: oleg122-server: ssh exited with exit code 1 Started lustre-OST0001 pdsh@oleg122-client: oleg122-client: ssh exited with exit code 5 ldlm.cancel_unused_locks_before_replay=1 PASS 135 (71s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 136: MDS to disconnect all OSPs first, then cleanup ldlm ========================================================== 11:19:14 (1713539954) SKIP: replay-single test_136 needs > 2 MDTs SKIP 136 (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 200: Dropping one OBD_PING should not cause disconnect ========================================================== 11:19:18 (1713539958) SKIP: replay-single test_200 Need remote client SKIP 200 (1s) debug_raw_pointers=0 debug_raw_pointers=0 == replay-single test complete, duration 8962 sec ======== 11:19:20 (1713539960) === replay-single: start cleanup 11:19:20 (1713539960) === === replay-single: finish cleanup 11:19:24 (1713539964) ===