== replay-dual test 26: dbench and tar with mds failover ========================================================== 20:54:27 (1713488067) Starting client oleg440-client.virtnet: -o user_xattr,flock oleg440-server@tcp:/lustre /mnt/lustre Started clients oleg440-client.virtnet: 192.168.204.140@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Started tar loop with pid 6297 Started dbench loop with 6298 striped dir -i0 -c2 -H all_char /mnt/lustre2/d26.replay-dual/run_dbench striped dir -i0 -c2 -H crush2 /mnt/lustre/d26.replay-dual/run_tar looking for dbench program /usr/bin/dbench found dbench client file /usr/share/dbench/client.txt '/usr/share/dbench/client.txt' -> 'client.txt' running 'dbench 1 -D /mnt/lustre2/d26.replay-dual/run_dbench -t 100' on /mnt/lustre at Thu Apr 18 20:54:29 EDT 2024 waiting for dbench pid 6339 dbench version 4.00 - Copyright Andrew Tridgell 1999-2004 Running for 100 seconds with load 'client.txt' and minimum warmup 20 secs failed to create barrier semaphore 0 of 1 processes prepared for launch 0 sec 1 of 1 processes prepared for launch 0 sec releasing clients UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2272 1285416 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2080 1285608 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 26120 3555840 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 27644 7161336 1% /mnt/lustre 1 231 8.88 MB/sec warmup 1 sec latency 20.416 ms 1 481 8.38 MB/sec warmup 2 sec latency 21.181 ms 1 711 7.07 MB/sec warmup 3 sec latency 288.297 ms 1 985 5.48 MB/sec warmup 4 sec latency 14.681 ms test_26 fail mds1 1 times 1 1389 5.20 MB/sec warmup 5 sec latency 15.434 ms Failing mds1 on oleg440-server Stopping /mnt/lustre-mds1 (opts:) on oleg440-server 1 1555 4.35 MB/sec warmup 6 sec latency 416.545 ms 20:54:35 (1713488075) shut down 1 1555 3.73 MB/sec warmup 7 sec latency 1416.771 ms 1 1555 3.26 MB/sec warmup 8 sec latency 2416.999 ms 1 1555 2.90 MB/sec warmup 9 sec latency 3417.207 ms 1 1555 2.61 MB/sec warmup 10 sec latency 4417.415 ms 1 1555 2.37 MB/sec warmup 11 sec latency 5417.587 ms 1 1555 2.17 MB/sec warmup 12 sec latency 6417.734 ms 1 1555 2.01 MB/sec warmup 13 sec latency 7417.896 ms 1 1555 1.86 MB/sec warmup 14 sec latency 8418.078 ms 1 1555 1.74 MB/sec warmup 15 sec latency 9418.220 ms 1 1555 1.63 MB/sec warmup 16 sec latency 10418.417 ms Failover mds1 to oleg440-server mount facets: mds1 1 1555 1.54 MB/sec warmup 17 sec latency 11418.625 ms 1 1555 1.45 MB/sec warmup 18 sec latency 12418.794 ms Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 1 1555 1.37 MB/sec warmup 19 sec latency 13418.928 ms oleg440-server: oleg440-server.virtnet: executing set_default_debug -1 all pdsh@oleg440-client: oleg440-server: ssh exited with exit code 1 Started lustre-MDT0000 20:54:49 (1713488089) targets are mounted 20:54:49 (1713488089) facet_failover done 1 1555 0.00 MB/sec execute 1 sec latency 15419.244 ms 1 1555 0.00 MB/sec execute 2 sec latency 16419.430 ms 1 1555 0.00 MB/sec execute 3 sec latency 17419.630 ms 1 1555 0.00 MB/sec execute 4 sec latency 18419.771 ms oleg440-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1 1791 0.04 MB/sec execute 5 sec latency 18722.178 ms 1 2115 0.10 MB/sec execute 6 sec latency 12.792 ms 1 2389 0.27 MB/sec execute 7 sec latency 47.613 ms 1 2797 0.73 MB/sec execute 8 sec latency 21.566 ms UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2956 1284732 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2592 1285096 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 31508 3565300 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 6220 3586732 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 37728 7152032 1% /mnt/lustre 1 3354 1.03 MB/sec execute 9 sec latency 14.691 ms 1 3763 1.37 MB/sec execute 10 sec latency 13.228 ms 1 3979 1.27 MB/sec execute 11 sec latency 15.381 ms 1 4271 1.19 MB/sec execute 12 sec latency 13.112 ms test_26 fail mds2 2 times 1 4552 1.15 MB/sec execute 13 sec latency 14.193 ms Failing mds2 on oleg440-server Stopping /mnt/lustre-mds2 (opts:) on oleg440-server 1 4740 1.15 MB/sec execute 14 sec latency 423.672 ms 20:55:03 (1713488103) shut down 1 4740 1.07 MB/sec execute 15 sec latency 1423.786 ms 1 4740 1.00 MB/sec execute 16 sec latency 2423.883 ms 1 4740 0.94 MB/sec execute 17 sec latency 3424.003 ms 1 4740 0.89 MB/sec execute 18 sec latency 4424.116 ms 1 4740 0.84 MB/sec execute 19 sec latency 5424.214 ms 1 4740 0.80 MB/sec execute 20 sec latency 6424.325 ms 1 4740 0.76 MB/sec execute 21 sec latency 7424.433 ms 1 4740 0.73 MB/sec execute 22 sec latency 8424.550 ms 1 4740 0.70 MB/sec execute 23 sec latency 9424.680 ms 1 4740 0.67 MB/sec execute 24 sec latency 10424.767 ms Failover mds2 to oleg440-server mount facets: mds2 1 4740 0.64 MB/sec execute 25 sec latency 11424.880 ms 1 4740 0.62 MB/sec execute 26 sec latency 12425.001 ms Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 1 4740 0.59 MB/sec execute 27 sec latency 13425.119 ms oleg440-server: oleg440-server.virtnet: executing set_default_debug -1 all pdsh@oleg440-client: oleg440-server: ssh exited with exit code 1 1 4740 0.57 MB/sec execute 28 sec latency 14425.242 ms Started lustre-MDT0001 20:55:17 (1713488117) targets are mounted 20:55:17 (1713488117) facet_failover done 1 4740 0.55 MB/sec execute 29 sec latency 15425.378 ms 1 4740 0.53 MB/sec execute 30 sec latency 16425.616 ms 1 4740 0.52 MB/sec execute 31 sec latency 17425.886 ms 1 4740 0.50 MB/sec execute 32 sec latency 18426.100 ms oleg440-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec 1 5089 0.58 MB/sec execute 33 sec latency 18453.583 ms 1 5375 0.57 MB/sec execute 34 sec latency 17.013 ms 1 5743 0.56 MB/sec execute 35 sec latency 10.975 ms 1 6109 0.61 MB/sec execute 36 sec latency 42.998 ms UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 4532 1283156 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2428 1285260 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 39680 3557704 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 10544 3582580 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 50224 7140284 1% /mnt/lustre 1 6578 0.72 MB/sec execute 37 sec latency 20.814 ms 1 7164 0.83 MB/sec execute 38 sec latency 15.455 ms 1 7427 0.84 MB/sec execute 39 sec latency 18.401 ms 1 7691 0.83 MB/sec execute 40 sec latency 14.724 ms test_26 fail mds1 3 times Failing mds1 on oleg440-server Stopping /mnt/lustre-mds1 (opts:) on oleg440-server 1 8002 0.82 MB/sec execute 41 sec latency 16.760 ms 1 8106 0.81 MB/sec execute 42 sec latency 561.668 ms 1 8106 0.79 MB/sec execute 43 sec latency 1561.831 ms 1 8106 0.77 MB/sec execute 44 sec latency 2561.994 ms 1 8106 0.76 MB/sec execute 45 sec latency 3562.162 ms 1 8106 0.74 MB/sec execute 46 sec latency 4562.322 ms 1 8106 0.72 MB/sec execute 47 sec latency 5562.485 ms 20:55:36 (1713488136) shut down 1 8106 0.71 MB/sec execute 48 sec latency 6562.664 ms 1 8106 0.69 MB/sec execute 49 sec latency 7562.820 ms 1 8106 0.68 MB/sec execute 50 sec latency 8562.942 ms 1 8106 0.67 MB/sec execute 51 sec latency 9563.090 ms 1 8106 0.65 MB/sec execute 52 sec latency 10563.189 ms 1 8106 0.64 MB/sec execute 53 sec latency 11563.333 ms 1 8106 0.63 MB/sec execute 54 sec latency 12563.498 ms 1 8106 0.62 MB/sec execute 55 sec latency 13563.689 ms 1 8106 0.61 MB/sec execute 56 sec latency 14563.887 ms 1 8106 0.60 MB/sec execute 57 sec latency 15564.128 ms Failover mds1 to oleg440-server mount facets: mds1 1 8106 0.59 MB/sec execute 58 sec latency 16564.407 ms 1 8106 0.58 MB/sec execute 59 sec latency 17564.508 ms Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 1 8106 0.57 MB/sec execute 60 sec latency 18564.610 ms oleg440-server: oleg440-server.virtnet: executing set_default_debug -1 all pdsh@oleg440-client: oleg440-server: ssh exited with exit code 1 Started lustre-MDT0000 20:55:50 (1713488150) targets are mounted 20:55:50 (1713488150) facet_failover done 1 8106 0.56 MB/sec execute 61 sec latency 19564.773 ms 1 8106 0.55 MB/sec execute 62 sec latency 20564.900 ms 1 8106 0.54 MB/sec execute 63 sec latency 21565.017 ms 1 8106 0.53 MB/sec execute 64 sec latency 22565.138 ms 1 8106 0.52 MB/sec execute 65 sec latency 23565.260 ms oleg440-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid 1 8243 0.53 MB/sec execute 66 sec latency 24160.617 ms mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1 8601 0.57 MB/sec execute 67 sec latency 15.593 ms 1 8895 0.57 MB/sec execute 68 sec latency 16.292 ms 1 9254 0.56 MB/sec execute 69 sec latency 12.473 ms 1 9565 0.58 MB/sec execute 70 sec latency 15.572 ms UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 5108 1282580 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3096 1284592 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 52048 3548496 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 20292 3574380 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 72340 7122876 2% /mnt/lustre 1 10068 0.64 MB/sec execute 71 sec latency 17.827 ms 1 10611 0.70 MB/sec execute 72 sec latency 16.070 ms 1 10911 0.71 MB/sec execute 73 sec latency 14.296 ms 1 11136 0.71 MB/sec execute 74 sec latency 15.704 ms test_26 fail mds2 4 times Failing mds2 on oleg440-server Stopping /mnt/lustre-mds2 (opts:) on oleg440-server 1 11407 0.70 MB/sec execute 75 sec latency 64.403 ms 20:56:05 (1713488165) shut down 1 11407 0.69 MB/sec execute 76 sec latency 1064.620 ms 1 11407 0.68 MB/sec execute 77 sec latency 2064.814 ms 1 11407 0.67 MB/sec execute 78 sec latency 3065.013 ms 1 11407 0.67 MB/sec execute 79 sec latency 4065.212 ms 1 11407 0.66 MB/sec execute 80 sec latency 5065.379 ms 1 11407 0.65 MB/sec execute 81 sec latency 6065.570 ms 1 11407 0.64 MB/sec execute 82 sec latency 7065.757 ms 1 11407 0.63 MB/sec execute 83 sec latency 8065.936 ms 1 11407 0.63 MB/sec execute 84 sec latency 9066.127 ms 1 11407 0.62 MB/sec execute 85 sec latency 10066.315 ms Failover mds2 to oleg440-server mount facets: mds2 1 11407 0.61 MB/sec execute 86 sec latency 11066.478 ms 1 11407 0.60 MB/sec execute 87 sec latency 12066.617 ms Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 1 11407 0.60 MB/sec execute 88 sec latency 13066.766 ms oleg440-server: oleg440-server.virtnet: executing set_default_debug -1 all pdsh@oleg440-client: oleg440-server: ssh exited with exit code 1 1 11407 0.59 MB/sec execute 89 sec latency 14066.879 ms Started lustre-MDT0001 20:56:18 (1713488178) targets are mounted 20:56:18 (1713488178) facet_failover done 1 11407 0.58 MB/sec execute 90 sec latency 15067.055 ms 1 11407 0.58 MB/sec execute 91 sec latency 16067.241 ms 1 11407 0.57 MB/sec execute 92 sec latency 17067.429 ms 1 11407 0.57 MB/sec execute 93 sec latency 18067.616 ms 1 11439 0.56 MB/sec execute 94 sec latency 18958.758 ms oleg440-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec 1 11720 0.57 MB/sec execute 95 sec latency 16.983 ms 1 12115 0.60 MB/sec execute 96 sec latency 14.325 ms 1 12386 0.59 MB/sec execute 97 sec latency 24.502 ms 1 12737 0.59 MB/sec execute 98 sec latency 13.212 ms UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 5272 1282416 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3284 1284404 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 53912 3549212 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 24600 3578684 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 78512 7127896 2% /mnt/lustre 1 13083 0.60 MB/sec execute 99 sec latency 51.511 ms 1 cleanup 100 sec 0 cleanup 101 sec Operation Count AvgLat MaxLat ---------------------------------------- NTCreateX 2055 29.429 24160.604 Close 1530 13.674 18453.565 Rename 86 11.684 17.803 Unlink 392 4.602 18.504 Qpathinfo 1911 2.271 16.891 Qfileinfo 336 0.454 2.409 Qfsinfo 325 0.900 4.966 Sfileinfo 166 6.322 16.469 Find 717 28.451 18722.160 WriteX 1016 2.131 16.889 ReadX 3191 0.116 43.924 LockX 6 1.472 1.647 UnlockX 6 1.511 1.650 Flush 139 9.669 281.869 Throughput 0.599552 MB/sec 1 clients 1 procs max_latency=24160.617 ms stopping dbench on /mnt/lustre at Thu Apr 18 20:56:30 EDT 2024 with return code 0 clean dbench files on /mnt/lustre /mnt/lustre /mnt/lustre removed 'client.txt' /mnt/lustre dbench successfully finished striped dir -i0 -c2 -H crush /mnt/lustre2/d26.replay-dual/run_dbench looking for dbench program /usr/bin/dbench found dbench client file /usr/share/dbench/client.txt '/usr/share/dbench/client.txt' -> 'client.txt' running 'dbench 1 -D /mnt/lustre2/d26.replay-dual/run_dbench -t 100' on /mnt/lustre at Thu Apr 18 20:56:31 EDT 2024 waiting for dbench pid 10906 dbench version 4.00 - Copyright Andrew Tridgell 1999-2004 Running for 100 seconds with load 'client.txt' and minimum warmup 20 secs 0 of 1 processes prepared for launch 0 sec test_26 fail mds1 5 times 1 of 1 processes prepared for launch 0 sec releasing clients Failing mds1 on oleg440-server Stopping /mnt/lustre-mds1 (opts:) on oleg440-server 1 232 8.88 MB/sec warmup 1 sec latency 19.664 ms 1 339 6.13 MB/sec warmup 2 sec latency 581.890 ms 1 339 4.09 MB/sec warmup 3 sec latency 1582.074 ms 1 339 3.07 MB/sec warmup 4 sec latency 2582.270 ms 1 339 2.45 MB/sec warmup 5 sec latency 3582.444 ms 1 339 2.04 MB/sec warmup 6 sec latency 4582.615 ms 20:56:38 (1713488198) shut down 1 339 1.75 MB/sec warmup 7 sec latency 5582.767 ms 1 339 1.53 MB/sec warmup 8 sec latency 6582.904 ms 1 339 1.36 MB/sec warmup 9 sec latency 7583.031 ms 1 339 1.23 MB/sec warmup 10 sec latency 8583.195 ms 1 339 1.11 MB/sec warmup 11 sec latency 9583.383 ms 1 339 1.02 MB/sec warmup 12 sec latency 10583.574 ms 1 339 0.94 MB/sec warmup 13 sec latency 11583.749 ms 1 339 0.88 MB/sec warmup 14 sec latency 12583.907 ms 1 339 0.82 MB/sec warmup 15 sec latency 13584.078 ms 1 339 0.77 MB/sec warmup 16 sec latency 14584.290 ms Failover mds1 to oleg440-server mount facets: mds1 1 339 0.72 MB/sec warmup 17 sec latency 15584.465 ms 1 339 0.68 MB/sec warmup 18 sec latency 16584.585 ms Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 1 339 0.65 MB/sec warmup 19 sec latency 17584.742 ms oleg440-server: oleg440-server.virtnet: executing set_default_debug -1 all pdsh@oleg440-client: oleg440-server: ssh exited with exit code 1 Started lustre-MDT0000 20:56:52 (1713488212) targets are mounted 20:56:52 (1713488212) facet_failover done 1 339 0.00 MB/sec execute 1 sec latency 19585.123 ms 1 339 0.00 MB/sec execute 2 sec latency 20585.260 ms 1 339 0.00 MB/sec execute 3 sec latency 21585.449 ms 1 339 0.00 MB/sec execute 4 sec latency 22585.600 ms 1 339 0.00 MB/sec execute 5 sec latency 23585.751 ms oleg440-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1 618 1.45 MB/sec execute 6 sec latency 23638.501 ms 1 902 1.35 MB/sec execute 7 sec latency 20.161 ms 1 1188 1.35 MB/sec execute 8 sec latency 19.851 ms 1 1572 1.54 MB/sec execute 9 sec latency 31.019 ms UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 4296 1283392 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2508 1285180 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 31728 3564072 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 12160 3584912 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 43888 7148984 1% /mnt/lustre 1 1898 1.41 MB/sec execute 10 sec latency 23.042 ms 1 2269 1.33 MB/sec execute 11 sec latency 13.565 ms 1 2607 1.40 MB/sec execute 12 sec latency 15.743 ms 1 3305 1.78 MB/sec execute 13 sec latency 13.686 ms test_26 fail mds2 6 times Failing mds2 on oleg440-server Stopping /mnt/lustre-mds2 (opts:) on oleg440-server 1 3760 1.96 MB/sec execute 14 sec latency 10.544 ms 20:57:06 (1713488226) shut down 1 3816 1.83 MB/sec execute 15 sec latency 769.339 ms 1 3816 1.72 MB/sec execute 16 sec latency 1769.498 ms 1 3816 1.62 MB/sec execute 17 sec latency 2769.650 ms 1 3816 1.53 MB/sec execute 18 sec latency 3769.772 ms 1 3816 1.45 MB/sec execute 19 sec latency 4769.952 ms 1 3816 1.37 MB/sec execute 20 sec latency 5770.266 ms 1 3816 1.31 MB/sec execute 21 sec latency 6770.399 ms 1 3816 1.25 MB/sec execute 22 sec latency 7770.510 ms 1 3816 1.20 MB/sec execute 23 sec latency 8770.611 ms 1 3816 1.15 MB/sec execute 24 sec latency 9770.781 ms Failover mds2 to oleg440-server mount facets: mds2 1 3816 1.10 MB/sec execute 25 sec latency 10770.904 ms 1 3816 1.06 MB/sec execute 26 sec latency 11771.067 ms Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 1 3816 1.02 MB/sec execute 27 sec latency 12771.184 ms 1 3816 0.98 MB/sec execute 28 sec latency 13771.404 ms oleg440-server: oleg440-server.virtnet: executing set_default_debug -1 all pdsh@oleg440-client: oleg440-server: ssh exited with exit code 1 Started lustre-MDT0001 20:57:20 (1713488240) targets are mounted 20:57:20 (1713488240) facet_failover done 1 3816 0.95 MB/sec execute 29 sec latency 14771.596 ms 1 3816 0.92 MB/sec execute 30 sec latency 15771.788 ms 1 3816 0.89 MB/sec execute 31 sec latency 16771.943 ms 1 3816 0.86 MB/sec execute 32 sec latency 17772.108 ms 1 3844 0.83 MB/sec execute 33 sec latency 18643.796 ms oleg440-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec 1 4064 0.82 MB/sec execute 34 sec latency 14.589 ms striped dir -i0 -c2 -H all_char /mnt/lustre/d26.replay-dual/run_tar 1 4311 0.80 MB/sec execute 35 sec latency 29.807 ms 1 4557 0.80 MB/sec execute 36 sec latency 19.965 ms 1 4942 0.89 MB/sec execute 37 sec latency 16.414 ms UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2644 1285044 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2244 1285444 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 33196 3569056 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 8776 3593932 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 41972 7162988 1% /mnt/lustre 1 5190 0.87 MB/sec execute 38 sec latency 15.203 ms 1 5327 0.85 MB/sec execute 39 sec latency 416.680 ms 1 5664 0.84 MB/sec execute 40 sec latency 16.687 ms 1 6012 0.85 MB/sec execute 41 sec latency 13.821 ms test_26 fail mds1 7 times Failing mds1 on oleg440-server Stopping /mnt/lustre-mds1 (opts:) on oleg440-server 1 6542 0.96 MB/sec execute 42 sec latency 14.781 ms 20:57:34 (1713488254) shut down 1 7113 1.05 MB/sec execute 43 sec latency 16.419 ms 1 7402 1.06 MB/sec execute 44 sec latency 15.324 ms 1 7627 1.04 MB/sec execute 45 sec latency 16.439 ms 1 7627 1.02 MB/sec execute 46 sec latency 1016.570 ms 1 7627 1.00 MB/sec execute 47 sec latency 2016.733 ms 1 7627 0.98 MB/sec execute 48 sec latency 3016.882 ms 1 7627 0.96 MB/sec execute 49 sec latency 4017.007 ms 1 7627 0.94 MB/sec execute 50 sec latency 5017.178 ms 1 7627 0.92 MB/sec execute 51 sec latency 6017.343 ms 1 7627 0.90 MB/sec execute 52 sec latency 7017.512 ms Failover mds1 to oleg440-server mount facets: mds1 1 7627 0.89 MB/sec execute 53 sec latency 8017.664 ms 1 7627 0.87 MB/sec execute 54 sec latency 9017.789 ms Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 1 7627 0.85 MB/sec execute 55 sec latency 10017.914 ms 1 7627 0.84 MB/sec execute 56 sec latency 11018.044 ms oleg440-server: oleg440-server.virtnet: executing set_default_debug -1 all pdsh@oleg440-client: oleg440-server: ssh exited with exit code 1 Started lustre-MDT0000 20:57:48 (1713488268) targets are mounted 20:57:48 (1713488268) facet_failover done 1 7627 0.82 MB/sec execute 57 sec latency 12018.180 ms 1 7627 0.81 MB/sec execute 58 sec latency 13018.312 ms 1 7627 0.80 MB/sec execute 59 sec latency 14018.463 ms 1 7627 0.78 MB/sec execute 60 sec latency 15018.607 ms 1 7627 0.77 MB/sec execute 61 sec latency 16018.734 ms oleg440-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1 7921 0.77 MB/sec execute 62 sec latency 16050.761 ms 1 8241 0.78 MB/sec execute 63 sec latency 20.945 ms tar: Unexpected EOF in archive tar: Unexpected EOF in archive 1 8621 0.81 MB/sec execute 64 sec latency 14.083 ms 1 8916 0.80 MB/sec execute 65 sec latency 21.713 ms tar: Error is not recoverable: exiting now dbench killed by signal 15 stopping dbench on /mnt/lustre at Thu Apr 18 20:57:57 EDT 2024 with return code 0 10906 pts/0 S+ 0:00 dbench -c client.txt 1 -D /mnt/lustre2/d26.replay-dual/run_dbench -t 100 killed dbench main pid 10906 clean dbench files on /mnt/lustre /mnt/lustre /mnt/lustre removed 'client.txt' /mnt/lustre dbench successfully finished