-----============= acceptance-small: insanity ============----- Thu Apr 18 04:40:32 EDT 2024 excepting tests: === insanity: start setup 04:40:37 (1713429637) === oleg312-client.virtnet: executing check_config_client /mnt/lustre oleg312-client.virtnet: Checking config lustre mounted on /mnt/lustre Checking servers environments Checking clients oleg312-client.virtnet environments Using TIMEOUT=20 osc.lustre-OST0000-osc-ffff8800b58ff800.idle_timeout=debug osc.lustre-OST0001-osc-ffff8800b58ff800.idle_timeout=debug disable quota as required oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all osd-ldiskfs.track_declares_assert=1 === insanity: finish setup 04:40:46 (1713429646) === debug_raw_pointers=Y debug_raw_pointers=Y == insanity test 0: Fail all nodes, independently ======== 04:40:47 (1713429647) Failing mds1 on oleg312-server Stopping /mnt/lustre-mds1 (opts:) on oleg312-server 04:40:49 (1713429649) shut down Failover mds1 to oleg312-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-MDT0000 04:41:02 (1713429662) targets are mounted 04:41:02 (1713429662) facet_failover done oleg312-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing mds2 on oleg312-server Stopping /mnt/lustre-mds2 (opts:) on oleg312-server 04:41:10 (1713429670) shut down Failover mds2 to oleg312-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-MDT0001 04:41:24 (1713429684) targets are mounted 04:41:24 (1713429684) facet_failover done oleg312-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec Failing ost1 on oleg312-server Stopping /mnt/lustre-ost1 (opts:) on oleg312-server 04:41:31 (1713429691) shut down Failover ost1 to oleg312-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-OST0000 04:41:46 (1713429706) targets are mounted 04:41:46 (1713429706) facet_failover done oleg312-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec Failing ost2 on oleg312-server Stopping /mnt/lustre-ost2 (opts:) on oleg312-server 04:41:51 (1713429711) shut down Failover ost2 to oleg312-server mount facets: ost2 Starting ost2: -o localrecov /dev/mapper/ost2_flakey /mnt/lustre-ost2 seq.cli-lustre-OST0001-super.width=65536 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-OST0001 04:42:05 (1713429725) targets are mounted 04:42:05 (1713429725) facet_failover done oleg312-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0001-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0001-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec PASS 0 (84s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == insanity test 1: MDS/MDS failure ====================== 04:42:13 (1713429733) Stopping /mnt/lustre-mds1 (opts:) on oleg312-server Failover mds1 to oleg312-server Stopping /mnt/lustre-mds2 (opts:) on oleg312-server Reintegrating MDS2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-MDT0001 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-MDT0000 Verify reintegration PASS 1 (175s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == insanity test 2: Second Failure Mode: MDS/OST Thu Apr 18 04:45:09 EDT 2024 ========================================================== 04:45:10 (1713429910) Verify Lustre filesystem is up and running Stopping /mnt/lustre-mds1 (opts:) on oleg312-server Failover mds1 to oleg312-server Stopping /mnt/lustre-mds2 (opts:) on oleg312-server Failover mds2 to oleg312-server Stopping /mnt/lustre-ost1 (opts:) on oleg312-server Reintegrating OST Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-OST0000 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-MDT0000 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-MDT0001 Verify reintegration PASS 2 (165s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == insanity test 3: Third Failure Mode: MDS/CLIENT Thu Apr 18 04:47:56 EDT 2024 ========================================================== 04:47:57 (1713430077) Verify Lustre filesystem is up and running Failing mds1 on oleg312-server Stopping /mnt/lustre-mds1 (opts:) on oleg312-server 04:47:58 (1713430078) shut down Failover mds1 to oleg312-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-MDT0000 04:48:11 (1713430091) targets are mounted 04:48:11 (1713430091) facet_failover done oleg312-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing mds2 on oleg312-server Stopping /mnt/lustre-mds2 (opts:) on oleg312-server 04:48:18 (1713430098) shut down Failover mds2 to oleg312-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-MDT0001 04:48:31 (1713430111) targets are mounted 04:48:31 (1713430111) facet_failover done oleg312-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec Test Lustre stability after MDS failover Failing 2 CLIENTS Request fail clients: 2, to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENT failure Reintegrating CLIENTS PASS 3 (48s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == insanity test 4: Fourth Failure Mode: OST/MDS Thu Apr 18 04:48:46 EDT 2024 ========================================================== 04:48:47 (1713430127) Fourth Failure Mode: OST/MDS Thu Apr 18 04:48:47 EDT 2024 Stopping /mnt/lustre-ost1 (opts:) on oleg312-server Test Lustre stability after OST failure Stopping /mnt/lustre-mds1 (opts:) on oleg312-server Failover mds1 to oleg312-server Stopping /mnt/lustre-mds2 (opts:) on oleg312-server Failover mds2 to oleg312-server Reintegrating OST Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-OST0000 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-MDT0000 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-MDT0001 Test Lustre stability after MDS failover PASS 4 (170s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == insanity test 5: Fifth Failure Mode: OST/OST Thu Apr 18 04:51:37 EDT 2024 ========================================================== 04:51:39 (1713430299) Fifth Failure Mode: OST/OST Thu Apr 18 04:51:39 EDT 2024 Verify Lustre filesystem is up and running Stopping /mnt/lustre-ost1 (opts:) on oleg312-server Test Lustre stability after OST failure Stopping /mnt/lustre-ost2 (opts:) on oleg312-server Test Lustre stability after OST failure Reintegrating OSTs Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-OST0000 Starting ost2: -o localrecov /dev/mapper/ost2_flakey /mnt/lustre-ost2 seq.cli-lustre-OST0001-super.width=65536 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-OST0001 PASS 5 (63s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == insanity test 6: Sixth Failure Mode: OST/CLIENT Thu Apr 18 04:52:43 EDT 2024 ========================================================== 04:52:44 (1713430364) Sixth Failure Mode: OST/CLIENT Thu Apr 18 04:52:44 EDT 2024 Verify Lustre filesystem is up and running Stopping /mnt/lustre-ost1 (opts:) on oleg312-server Test Lustre stability after OST failure DFPIDA=23602 Failing CLIENTs Request fail clients: , to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENTs failure DFPIDB=23951 Reintegrating OST/CLIENTs Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-OST0000 Verifying mount PASS 6 (34s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == insanity test 7: Seventh Failure Mode: CLIENT/MDS Thu Apr 18 04:53:19 EDT 2024 ========================================================== 04:53:19 (1713430399) Seventh Failure Mode: CLIENT/MDS Thu Apr 18 04:53:20 EDT 2024 Verify Lustre filesystem is up and running Part 1: Failing CLIENT Request fail clients: 2, to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENTs failure oleg312-client: total 0 oleg312-client: -rw-r--r-- 1 root root 0 Apr 18 04:53 oleg312-client.virtnet_testfile Wait 1 minutes Verify Lustre filesystem is up and running oleg312-client: rm: cannot remove '/mnt/lustre/d0.insanity/oleg312-client.virtnet_testfile': No such file or directory pdsh@oleg312-client: oleg312-client: ssh exited with exit code 1 Failing mds1 on oleg312-server Stopping /mnt/lustre-mds1 (opts:) on oleg312-server 04:54:26 (1713430466) shut down Failover mds1 to oleg312-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-MDT0000 04:54:39 (1713430479) targets are mounted 04:54:39 (1713430479) facet_failover done oleg312-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing mds2 on oleg312-server Stopping /mnt/lustre-mds2 (opts:) on oleg312-server 04:54:46 (1713430486) shut down Failover mds2 to oleg312-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-MDT0001 04:54:59 (1713430499) targets are mounted 04:54:59 (1713430499) facet_failover done oleg312-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec oleg312-client: total 0 Reintegrating CLIENTs wait 1 minutes PASS 7 (169s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == insanity test 8: Eighth Failure Mode: CLIENT/OST Thu Apr 18 04:56:08 EDT 2024 ========================================================== 04:56:09 (1713430569) Eighth Failure Mode: CLIENT/OST Thu Apr 18 04:56:10 EDT 2024 Verify Lustre filesystem is up and running Failing CLIENTs Request fail clients: 2, to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENTs failure oleg312-client: total 0 oleg312-client: -rw-r--r-- 1 root root 0 Apr 18 04:56 oleg312-client.virtnet_testfile Wait 1 minutes Verify Lustre filesystem is up and running Stopping /mnt/lustre-ost1 (opts:) on oleg312-server Test Lustre stability after OST failure Reintegrating CLIENTs/OST Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-OST0000 Wait 1 minutes PASS 8 (148s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == insanity test 9: Ninth Failure Mode: CLIENT/CLIENT Thu Apr 18 04:58:38 EDT 2024 ========================================================== 04:58:39 (1713430719) Verify Lustre filesystem is up and running Failing CLIENTs Request fail clients: 2, to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENTs failure oleg312-client: total 0 oleg312-client: -rw-r--r-- 1 root root 0 Apr 18 04:58 oleg312-client.virtnet_testfile oleg312-client: -rw-r--r-- 1 root root 0 Apr 18 04:57 oleg312-client.virtnet_testfile2 Wait 1 minutes Verify Lustre filesystem is up and running Failing CLIENTs Request fail clients: 2, to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENTs failure oleg312-client: total 0 oleg312-client: -rw-r--r-- 1 root root 0 Apr 18 04:59 oleg312-client.virtnet_testfile oleg312-client: -rw-r--r-- 1 root root 0 Apr 18 04:57 oleg312-client.virtnet_testfile2 Reintegrating CLIENTs/CLIENTs Wait 1 minutes PASS 9 (130s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == insanity test 10: Tenth Failure Mode: MDT0/OST/MDT1 Thu Apr 18 05:00:49 EDT 2024 ========================================================== 05:00:50 (1713430850) Stopping /mnt/lustre-mds1 (opts:) on oleg312-server Failover mds1 to oleg312-server Stopping /mnt/lustre-ost1 (opts:) on oleg312-server Reintegrating OST Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-OST0000 Stopping /mnt/lustre-mds2 (opts:) on oleg312-server Failover mds2 to oleg312-server Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-MDT0000 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-MDT0001 Verify reintegration PASS 10 (159s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == insanity test 11: Eleventh Failure Mode: MDS0/CLIENT/MDS1 Thu Apr 18 05:03:29 EDT 2024 ========================================================== 05:03:30 (1713431010) Verify Lustre filesystem is up and running Failing mds1 on oleg312-server Stopping /mnt/lustre-mds1 (opts:) on oleg312-server 05:03:32 (1713431012) shut down Failover mds1 to oleg312-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-MDT0000 05:03:45 (1713431025) targets are mounted 05:03:45 (1713431025) facet_failover done oleg312-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Test Lustre stability after MDS failover Failing 2 CLIENTS Request fail clients: 2, to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENT failure Reintegrating CLIENTS Failing mds2 on oleg312-server Stopping /mnt/lustre-mds2 (opts:) on oleg312-server 05:03:55 (1713431035) shut down Failover mds2 to oleg312-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-MDT0001 05:04:08 (1713431048) targets are mounted 05:04:08 (1713431048) facet_failover done oleg312-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 11 (48s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == insanity test 12: Twelve Failure Mode: MDS0,MDS1/OST0, OST1/CLIENTS Thu Apr 18 05:04:18 EDT 2024 ========================================================== 05:04:19 (1713431059) Verify Lustre filesystem is up and running Failing mds1 on oleg312-server Stopping /mnt/lustre-mds1 (opts:) on oleg312-server Failing mds2 on oleg312-server Stopping /mnt/lustre-mds2 (opts:) on oleg312-server 05:04:22 (1713431062) shut down Failover mds1 to oleg312-server mount facets: mds1 Failover mds2 to oleg312-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 05:04:48 (1713431088) targets are mounted 05:04:48 (1713431088) facet_failover done oleg312-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec Failing ost1 on oleg312-server Stopping /mnt/lustre-ost1 (opts:) on oleg312-server Failing ost2 on oleg312-server Stopping /mnt/lustre-ost2 (opts:) on oleg312-server 05:04:58 (1713431098) shut down Failover ost1 to oleg312-server mount facets: ost1 Failover ost2 to oleg312-server mount facets: ost2 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 Starting ost2: -o localrecov /dev/mapper/ost2_flakey /mnt/lustre-ost2 seq.cli-lustre-OST0001-super.width=65536 seq.cli-lustre-OST0000-super.width=65536 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-OST0000 Started lustre-OST0001 05:05:12 (1713431112) targets are mounted 05:05:12 (1713431112) facet_failover done oleg312-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid,osc.lustre-OST0001-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec osc.lustre-OST0001-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec Failing 2 CLIENTS Request fail clients: 2, to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENT failure Reintegrating CLIENTS PASS 12 (68s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == insanity test 13: Thirteen Failure Mode: MDS0,MDS1/CLIENTS/OST0,OST1 Thu Apr 18 05:05:28 EDT 2024 ========================================================== 05:05:29 (1713431129) Verify Lustre filesystem is up and running Failing mds1 on oleg312-server Stopping /mnt/lustre-mds1 (opts:) on oleg312-server Failing mds2 on oleg312-server Stopping /mnt/lustre-mds2 (opts:) on oleg312-server 05:05:31 (1713431131) shut down Failover mds1 to oleg312-server mount facets: mds1 Failover mds2 to oleg312-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 05:05:57 (1713431157) targets are mounted 05:05:57 (1713431157) facet_failover done oleg312-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec Failing 2 CLIENTS Request fail clients: 2, to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENT failure Reintegrating CLIENTS Failing ost1 on oleg312-server Stopping /mnt/lustre-ost1 (opts:) on oleg312-server Failing ost2 on oleg312-server Stopping /mnt/lustre-ost2 (opts:) on oleg312-server 05:06:12 (1713431172) shut down Failover ost1 to oleg312-server mount facets: ost1 Failover ost2 to oleg312-server mount facets: ost2 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 Starting ost2: -o localrecov /dev/mapper/ost2_flakey /mnt/lustre-ost2 seq.cli-lustre-OST0000-super.width=65536 seq.cli-lustre-OST0001-super.width=65536 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-OST0001 Started lustre-OST0000 05:06:25 (1713431185) targets are mounted 05:06:25 (1713431185) facet_failover done oleg312-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid,osc.lustre-OST0001-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec osc.lustre-OST0001-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec PASS 13 (67s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == insanity test 14: Fourteen Failure Mode: OST0,OST1/CLIENTS/MDS0,MDS1 Thu Apr 18 05:06:37 EDT 2024 ========================================================== 05:06:38 (1713431198) Verify Lustre filesystem is up and running Failing ost1 on oleg312-server Stopping /mnt/lustre-ost1 (opts:) on oleg312-server Failing ost2 on oleg312-server Stopping /mnt/lustre-ost2 (opts:) on oleg312-server 05:06:40 (1713431200) shut down Failover ost1 to oleg312-server mount facets: ost1 Failover ost2 to oleg312-server mount facets: ost2 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 Starting ost2: -o localrecov /dev/mapper/ost2_flakey /mnt/lustre-ost2 seq.cli-lustre-OST0000-super.width=65536 seq.cli-lustre-OST0001-super.width=65536 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-OST0000 Started lustre-OST0001 05:06:53 (1713431213) targets are mounted 05:06:53 (1713431213) facet_failover done oleg312-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid,osc.lustre-OST0001-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec osc.lustre-OST0001-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec Failing 2 CLIENTS Request fail clients: 2, to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENT failure Reintegrating CLIENTS Failing mds1 on oleg312-server Stopping /mnt/lustre-mds1 (opts:) on oleg312-server Failing mds2 on oleg312-server Stopping /mnt/lustre-mds2 (opts:) on oleg312-server 05:07:05 (1713431225) shut down Failover mds1 to oleg312-server mount facets: mds1 Failover mds2 to oleg312-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg312-server: oleg312-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 pdsh@oleg312-client: oleg312-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 05:07:31 (1713431251) targets are mounted 05:07:31 (1713431251) facet_failover done oleg312-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 14 (62s) debug_raw_pointers=0 debug_raw_pointers=0 == insanity test complete, duration 1627 sec ============= 05:07:40 (1713431260) === insanity: start cleanup 05:07:41 (1713431261) === === insanity: finish cleanup 05:07:41 (1713431261) ===