-----============= acceptance-small: insanity ============----- Tue Apr 16 15:40:33 EDT 2024 excepting tests: === insanity: start setup 15:40:36 (1713296436) === oleg349-client.virtnet: executing check_config_client /mnt/lustre oleg349-client.virtnet: Checking config lustre mounted on /mnt/lustre Checking servers environments Checking clients oleg349-client.virtnet environments Using TIMEOUT=20 osc.lustre-OST0000-osc-ffff8800b6e76000.idle_timeout=debug osc.lustre-OST0001-osc-ffff8800b6e76000.idle_timeout=debug disable quota as required oleg349-server: oleg349-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all === insanity: finish setup 15:40:42 (1713296442) === debug_raw_pointers=Y debug_raw_pointers=Y == insanity test 0: Fail all nodes, independently ======== 15:40:43 (1713296443) Failing mds1 on oleg349-server Stopping /mnt/lustre-mds1 (opts:) on oleg349-server 15:40:45 (1713296445) shut down Failover mds1 to oleg349-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg349-server: oleg349-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg349-client: oleg349-server: ssh exited with exit code 1 Started lustre-MDT0000 15:40:57 (1713296457) targets are mounted 15:40:57 (1713296457) facet_failover done oleg349-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing ost1 on oleg349-server Stopping /mnt/lustre-ost1 (opts:) on oleg349-server 15:41:01 (1713296461) shut down Failover ost1 to oleg349-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg349-server: oleg349-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg349-client: oleg349-server: ssh exited with exit code 1 Started lustre-OST0000 15:41:14 (1713296474) targets are mounted 15:41:14 (1713296474) facet_failover done oleg349-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec Failing ost2 on oleg349-server Stopping /mnt/lustre-ost2 (opts:) on oleg349-server 15:41:19 (1713296479) shut down Failover ost2 to oleg349-server mount facets: ost2 Starting ost2: -o localrecov lustre-ost2/ost2 /mnt/lustre-ost2 seq.cli-lustre-OST0001-super.width=65536 oleg349-server: oleg349-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg349-client: oleg349-server: ssh exited with exit code 1 Started lustre-OST0001 15:41:31 (1713296491) targets are mounted 15:41:31 (1713296491) facet_failover done oleg349-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0001-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0001-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec PASS 0 (53s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == insanity test 1: MDS/MDS failure ====================== 15:41:37 (1713296497) SKIP: insanity test_1 needs >= 2 MDTs SKIP 1 (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == insanity test 2: Second Failure Mode: MDS/OST Tue Apr 16 15:41:38 EDT 2024 ========================================================== 15:41:39 (1713296499) Verify Lustre filesystem is up and running Stopping /mnt/lustre-mds1 (opts:) on oleg349-server Failover mds1 to oleg349-server Stopping /mnt/lustre-ost1 (opts:) on oleg349-server Reintegrating OST Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg349-server: oleg349-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg349-client: oleg349-server: ssh exited with exit code 1 Started lustre-OST0000 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg349-server: oleg349-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg349-client: oleg349-server: ssh exited with exit code 1 Started lustre-MDT0000 Verify reintegration PASS 2 (144s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == insanity test 3: Third Failure Mode: MDS/CLIENT Tue Apr 16 15:44:04 EDT 2024 ========================================================== 15:44:05 (1713296645) Verify Lustre filesystem is up and running Failing mds1 on oleg349-server Stopping /mnt/lustre-mds1 (opts:) on oleg349-server 15:44:06 (1713296646) shut down Failover mds1 to oleg349-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg349-server: oleg349-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg349-client: oleg349-server: ssh exited with exit code 1 Started lustre-MDT0000 15:44:19 (1713296659) targets are mounted 15:44:19 (1713296659) facet_failover done oleg349-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Test Lustre stability after MDS failover Failing 2 CLIENTS Request fail clients: 2, to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENT failure Reintegrating CLIENTS PASS 3 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == insanity test 4: Fourth Failure Mode: OST/MDS Tue Apr 16 15:44:30 EDT 2024 ========================================================== 15:44:31 (1713296671) Fourth Failure Mode: OST/MDS Tue Apr 16 15:44:32 EDT 2024 Stopping /mnt/lustre-ost1 (opts:) on oleg349-server Test Lustre stability after OST failure Stopping /mnt/lustre-mds1 (opts:) on oleg349-server Failover mds1 to oleg349-server Reintegrating OST Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg349-server: oleg349-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg349-client: oleg349-server: ssh exited with exit code 1 Started lustre-OST0000 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg349-server: oleg349-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg349-client: oleg349-server: ssh exited with exit code 1 Started lustre-MDT0000 Test Lustre stability after MDS failover PASS 4 (149s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == insanity test 5: Fifth Failure Mode: OST/OST Tue Apr 16 15:47:00 EDT 2024 ========================================================== 15:47:01 (1713296821) Fifth Failure Mode: OST/OST Tue Apr 16 15:47:01 EDT 2024 Verify Lustre filesystem is up and running Stopping /mnt/lustre-ost1 (opts:) on oleg349-server Test Lustre stability after OST failure Stopping /mnt/lustre-ost2 (opts:) on oleg349-server Test Lustre stability after OST failure Reintegrating OSTs Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg349-server: oleg349-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg349-client: oleg349-server: ssh exited with exit code 1 Started lustre-OST0000 Starting ost2: -o localrecov lustre-ost2/ost2 /mnt/lustre-ost2 seq.cli-lustre-OST0001-super.width=65536 oleg349-server: oleg349-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg349-client: oleg349-server: ssh exited with exit code 1 Started lustre-OST0001 PASS 5 (62s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == insanity test 6: Sixth Failure Mode: OST/CLIENT Tue Apr 16 15:48:03 EDT 2024 ========================================================== 15:48:04 (1713296884) Sixth Failure Mode: OST/CLIENT Tue Apr 16 15:48:05 EDT 2024 Verify Lustre filesystem is up and running Stopping /mnt/lustre-ost1 (opts:) on oleg349-server Test Lustre stability after OST failure DFPIDA=19371 Failing CLIENTs Request fail clients: , to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENTs failure DFPIDB=19712 Reintegrating OST/CLIENTs Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg349-server: oleg349-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg349-client: oleg349-server: ssh exited with exit code 1 Started lustre-OST0000 Verifying mount PASS 6 (34s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == insanity test 7: Seventh Failure Mode: CLIENT/MDS Tue Apr 16 15:48:39 EDT 2024 ========================================================== 15:48:40 (1713296920) Seventh Failure Mode: CLIENT/MDS Tue Apr 16 15:48:40 EDT 2024 Verify Lustre filesystem is up and running Part 1: Failing CLIENT Request fail clients: 2, to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENTs failure oleg349-client: total 1 oleg349-client: -rw-r--r-- 1 root root 0 Apr 16 15:48 oleg349-client.virtnet_testfile Wait 1 minutes Verify Lustre filesystem is up and running oleg349-client: rm: cannot remove '/mnt/lustre/d0.insanity/oleg349-client.virtnet_testfile': No such file or directory pdsh@oleg349-client: oleg349-client: ssh exited with exit code 1 Failing mds1 on oleg349-server Stopping /mnt/lustre-mds1 (opts:) on oleg349-server 15:49:47 (1713296987) shut down Failover mds1 to oleg349-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg349-server: oleg349-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg349-client: oleg349-server: ssh exited with exit code 1 Started lustre-MDT0000 15:50:00 (1713297000) targets are mounted 15:50:00 (1713297000) facet_failover done oleg349-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec oleg349-client: total 0 Reintegrating CLIENTs wait 1 minutes PASS 7 (146s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == insanity test 8: Eighth Failure Mode: CLIENT/OST Tue Apr 16 15:51:06 EDT 2024 ========================================================== 15:51:07 (1713297067) Eighth Failure Mode: CLIENT/OST Tue Apr 16 15:51:08 EDT 2024 Verify Lustre filesystem is up and running Failing CLIENTs Request fail clients: 2, to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENTs failure oleg349-client: total 1 oleg349-client: -rw-r--r-- 1 root root 0 Apr 16 15:51 oleg349-client.virtnet_testfile Wait 1 minutes Verify Lustre filesystem is up and running Stopping /mnt/lustre-ost1 (opts:) on oleg349-server Test Lustre stability after OST failure Reintegrating CLIENTs/OST Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg349-server: oleg349-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg349-client: oleg349-server: ssh exited with exit code 1 Started lustre-OST0000 Wait 1 minutes PASS 8 (148s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == insanity test 9: Ninth Failure Mode: CLIENT/CLIENT Tue Apr 16 15:53:36 EDT 2024 ========================================================== 15:53:37 (1713297217) Verify Lustre filesystem is up and running Failing CLIENTs Request fail clients: 2, to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENTs failure oleg349-client: total 1 oleg349-client: -rw-r--r-- 1 root root 0 Apr 16 15:53 oleg349-client.virtnet_testfile oleg349-client: -rw-r--r-- 1 root root 0 Apr 16 15:52 oleg349-client.virtnet_testfile2 Wait 1 minutes Verify Lustre filesystem is up and running Failing CLIENTs Request fail clients: 2, to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENTs failure oleg349-client: total 1 oleg349-client: -rw-r--r-- 1 root root 0 Apr 16 15:54 oleg349-client.virtnet_testfile oleg349-client: -rw-r--r-- 1 root root 0 Apr 16 15:52 oleg349-client.virtnet_testfile2 Reintegrating CLIENTs/CLIENTs Wait 1 minutes PASS 9 (130s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == insanity test 10: Tenth Failure Mode: MDT0/OST/MDT1 Tue Apr 16 15:55:48 EDT 2024 ========================================================== 15:55:49 (1713297349) SKIP: insanity test_10 needs >= 2 MDTs SKIP 10 (0s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == insanity test 11: Eleventh Failure Mode: MDS0/CLIENT/MDS1 Tue Apr 16 15:55:50 EDT 2024 ========================================================== 15:55:51 (1713297351) SKIP: insanity test_11 needs >= 2 MDTs SKIP 11 (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == insanity test 12: Twelve Failure Mode: MDS0,MDS1/OST0, OST1/CLIENTS Tue Apr 16 15:55:52 EDT 2024 ========================================================== 15:55:53 (1713297353) SKIP: insanity test_12 needs >= 2 MDTs SKIP 12 (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == insanity test 13: Thirteen Failure Mode: MDS0,MDS1/CLIENTS/OST0,OST1 Tue Apr 16 15:55:54 EDT 2024 ========================================================== 15:55:55 (1713297355) SKIP: insanity test_13 needs >= 2 MDTs SKIP 13 (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == insanity test 14: Fourteen Failure Mode: OST0,OST1/CLIENTS/MDS0,MDS1 Tue Apr 16 15:55:57 EDT 2024 ========================================================== 15:55:58 (1713297358) SKIP: insanity test_14 needs >= 2 MDTs SKIP 14 (0s) debug_raw_pointers=0 debug_raw_pointers=0 == insanity test complete, duration 925 sec ============== 15:55:59 (1713297359) === insanity: start cleanup 15:55:59 (1713297359) === === insanity: finish cleanup 15:55:59 (1713297359) ===