-----============= acceptance-small: sanity-scrub ============----- Thu Apr 18 04:40:21 EDT 2024 excepting tests: Loading modules from /home/green/git/lustre-release/lustre detected 4 online CPUs by sysfs Force libcfs to create 2 CPU partitions libkmod: kmod_module_get_holders: could not open '/sys/module/pcc_cpufreq/holders': No such file or directory libkmod: kmod_module_get_holders: could not open '/sys/module/acpi_cpufreq/holders': No such file or directory loading modules on: 'oleg239-server' oleg239-server: oleg239-server.virtnet: executing load_modules_local oleg239-server: Loading modules from /home/green/git/lustre-release/lustre oleg239-server: detected 4 online CPUs by sysfs oleg239-server: Force libcfs to create 2 CPU partitions client=34553366 MDS=34553366 OSS=34553366 Stopping clients: oleg239-client.virtnet /mnt/lustre (opts:) Stopping client oleg239-client.virtnet /mnt/lustre opts: Stopping clients: oleg239-client.virtnet /mnt/lustre2 (opts:) Stopping /mnt/lustre-mds1 (opts:-f) on oleg239-server Stopping /mnt/lustre-ost1 (opts:-f) on oleg239-server Stopping /mnt/lustre-ost2 (opts:-f) on oleg239-server unloading modules on: 'oleg239-server' oleg239-server: oleg239-server.virtnet: executing unload_modules_local modules unloaded. === sanity-scrub: start setup 04:40:41 (1713429641) === Stopping clients: oleg239-client.virtnet /mnt/lustre (opts:-f) Stopping clients: oleg239-client.virtnet /mnt/lustre2 (opts:-f) pdsh@oleg239-client: oleg239-server: ssh exited with exit code 2 oleg239-server: oleg239-server.virtnet: executing set_hostid Loading modules from /home/green/git/lustre-release/lustre detected 4 online CPUs by sysfs Force libcfs to create 2 CPU partitions ../libcfs/libcfs/libcfs options: 'cpu_npartitions=2' libkmod: kmod_module_get_holders: could not open '/sys/module/intel_rapl/holders': No such file or directory ptlrpc/ptlrpc options: 'lbug_on_grant_miscount=1' quota/lquota options: 'hash_lqs_cur_bits=3' loading modules on: 'oleg239-server' oleg239-server: oleg239-server.virtnet: executing load_modules_local oleg239-server: Loading modules from /home/green/git/lustre-release/lustre oleg239-server: detected 4 online CPUs by sysfs oleg239-server: Force libcfs to create 2 CPU partitions oleg239-server: ptlrpc/ptlrpc options: 'lbug_on_grant_miscount=1' oleg239-server: quota/lquota options: 'hash_lqs_cur_bits=3' Formatting mgs, mds, osts Format mds1: lustre-mdt1/mdt1 Format ost1: lustre-ost1/ost1 Format ost2: lustre-ost2/ost2 Checking servers environments Checking clients oleg239-client.virtnet environments Loading modules from /home/green/git/lustre-release/lustre detected 4 online CPUs by sysfs Force libcfs to create 2 CPU partitions loading modules on: 'oleg239-server' oleg239-server: oleg239-server.virtnet: executing load_modules_local oleg239-server: Loading modules from /home/green/git/lustre-release/lustre oleg239-server: detected 4 online CPUs by sysfs oleg239-server: Force libcfs to create 2 CPU partitions Setup mgs, mdt, osts Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 Commit the device label on lustre-mdt1/mdt1 Started lustre-MDT0000 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 Commit the device label on lustre-ost1/ost1 Started lustre-OST0000 Starting ost2: -o localrecov lustre-ost2/ost2 /mnt/lustre-ost2 seq.cli-lustre-OST0001-super.width=65536 oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 Commit the device label on lustre-ost2/ost2 Started lustre-OST0001 Starting client: oleg239-client.virtnet: -o user_xattr,flock oleg239-server@tcp:/lustre /mnt/lustre Starting client oleg239-client.virtnet: -o user_xattr,flock oleg239-server@tcp:/lustre /mnt/lustre Started clients oleg239-client.virtnet: 192.168.202.139@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Using TIMEOUT=20 osc.lustre-OST0000-osc-ffff8800aa96d800.idle_timeout=debug osc.lustre-OST0001-osc-ffff8800aa96d800.idle_timeout=debug setting jobstats to procname_uid Setting lustre.sys.jobid_var from disable to procname_uid Waiting 90s for 'procname_uid' Updated after 6s: want 'procname_uid' got 'procname_uid' disable quota as required === sanity-scrub: finish setup 04:41:38 (1713429698) === debug_raw_pointers=Y debug_raw_pointers=Y == sanity-scrub test 0: Do not auto trigger OI scrub for non-backup/restore case ========================================================== 04:41:39 (1713429699) preparing... Thu Apr 18 04:41:39 EDT 2024 creating 0 files on mds1 1+0 records in 1+0 records out 1048576 bytes (1.0 MB) copied, 0.0117992 s, 88.9 MB/s prepared Thu Apr 18 04:41:41 EDT 2024. stop mds1 starting MDTs without disabling OI scrub oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 Starting client: oleg239-client.virtnet: -o user_xattr,flock oleg239-server@tcp:/lustre /mnt/lustre PASS 0 (13s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-scrub test 1a: Auto trigger initial OI scrub when server mounts ========================================================== 04:41:53 (1713429713) preparing... Thu Apr 18 04:41:54 EDT 2024 creating 0 files on mds1 1+0 records in 1+0 records out 1048576 bytes (1.0 MB) copied, 0.0113926 s, 92.0 MB/s prepared Thu Apr 18 04:41:55 EDT 2024. stop mds1 start mds1 without disabling OI scrub: -o localrecov -o user_xattr oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 Starting client: oleg239-client.virtnet: -o user_xattr,flock oleg239-server@tcp:/lustre /mnt/lustre fail_loc=0x193 fail_loc=0 192.168.202.139@tcp:/lustre /mnt/lustre lustre rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project 0 0 Stopping client oleg239-client.virtnet /mnt/lustre (opts:) stop mds1 start mds1 with disabling OI scrub: -o localrecov -o user_xattr,noscrub oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 PASS 1a (17s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-scrub test 1b: Trigger OI scrub when MDT mounts for OI files remove/recreate case ========================================================== 04:42:12 (1713429732) Checking servers environments Checking clients oleg239-client.virtnet environments Loading modules from /home/green/git/lustre-release/lustre detected 4 online CPUs by sysfs Force libcfs to create 2 CPU partitions loading modules on: 'oleg239-server' oleg239-server: oleg239-server.virtnet: executing load_modules_local oleg239-server: Loading modules from /home/green/git/lustre-release/lustre oleg239-server: detected 4 online CPUs by sysfs oleg239-server: Force libcfs to create 2 CPU partitions Setup mgs, mdt, osts Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg239-server: mount.lustre: according to /etc/mtab lustre-mdt1/mdt1 is already mounted on /mnt/lustre-mds1 pdsh@oleg239-client: oleg239-server: ssh exited with exit code 17 Start of lustre-mdt1/mdt1 on mds1 failed 17 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 oleg239-server: mount.lustre: according to /etc/mtab lustre-ost1/ost1 is already mounted on /mnt/lustre-ost1 pdsh@oleg239-client: oleg239-server: ssh exited with exit code 17 seq.cli-lustre-OST0000-super.width=65536 Start of lustre-ost1/ost1 on ost1 failed 17 Starting ost2: -o localrecov lustre-ost2/ost2 /mnt/lustre-ost2 oleg239-server: mount.lustre: according to /etc/mtab lustre-ost2/ost2 is already mounted on /mnt/lustre-ost2 pdsh@oleg239-client: oleg239-server: ssh exited with exit code 17 seq.cli-lustre-OST0001-super.width=65536 Start of lustre-ost2/ost2 on ost2 failed 17 Starting client: oleg239-client.virtnet: -o user_xattr,flock oleg239-server@tcp:/lustre /mnt/lustre Starting client oleg239-client.virtnet: -o user_xattr,flock oleg239-server@tcp:/lustre /mnt/lustre Started clients oleg239-client.virtnet: 192.168.202.139@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Using TIMEOUT=20 osc.lustre-OST0000-osc-ffff8800b3ac9000.idle_timeout=debug osc.lustre-OST0001-osc-ffff8800b3ac9000.idle_timeout=debug disable quota as required preparing... Thu Apr 18 04:42:29 EDT 2024 creating 0 files on mds1 1+0 records in 1+0 records out 1048576 bytes (1.0 MB) copied, 0.0103105 s, 102 MB/s prepared Thu Apr 18 04:42:30 EDT 2024. fail_loc=0x198 fail_loc=0 stop mds1 start MDTs without disabling OI scrub oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 Starting client: oleg239-client.virtnet: -o user_xattr,flock oleg239-server@tcp:/lustre /mnt/lustre PASS 1b (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-scrub test 1c: Auto detect kinds of OI file(s) removed/recreated cases ========================================================== 04:42:42 (1713429762) SKIP: sanity-scrub test_1c ldiskfs special test SKIP 1c (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-scrub test 2: Trigger OI scrub when MDT mounts for backup/restore case ========================================================== 04:42:45 (1713429765) SKIP: sanity-scrub test_2 ldiskfs special test SKIP 2 (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-scrub test 4a: Auto trigger OI scrub if bad OI mapping was found (1) ========================================================== 04:42:47 (1713429767) preparing... Thu Apr 18 04:42:48 EDT 2024 creating 0 files on mds1 1+0 records in 1+0 records out 1048576 bytes (1.0 MB) copied, 0.0101651 s, 103 MB/s prepared Thu Apr 18 04:42:49 EDT 2024. fail_loc=0x193 fail_loc=0 stop mds1 starting MDTs with OI scrub disabled oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 Starting client: oleg239-client.virtnet: -o user_xattr,flock oleg239-server@tcp:/lustre /mnt/lustre PASS 4a (21s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-scrub test 4b: Auto trigger OI scrub if bad OI mapping was found (2) ========================================================== 04:43:09 (1713429789) SKIP: sanity-scrub test_4b ldiskfs special test SKIP 4b (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-scrub test 4c: Auto trigger OI scrub if bad OI mapping was found (3) ========================================================== 04:43:12 (1713429792) SKIP: sanity-scrub test_4c ldiskfs special test SKIP 4c (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-scrub test 4d: FID in LMA mismatch with object FID won't block create ========================================================== 04:43:15 (1713429795) SKIP: sanity-scrub test_4d ldiskfs only test SKIP 4d (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-scrub test 4e: FID reuse can be fixed ========== 04:43:18 (1713429798) SKIP: sanity-scrub test_4e ldiskfs only test SKIP 4e (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-scrub test 5: OI scrub state machine =========== 04:43:21 (1713429801) oleg239-server: oleg239-server.virtnet: executing set_hostid oleg239-server: oleg239-server.virtnet: executing load_modules_local oleg239-server: oleg239-server.virtnet: executing load_modules_local oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 Using TIMEOUT=20 preparing... Thu Apr 18 04:44:47 EDT 2024 creating 100 files on mds1 1+0 records in 1+0 records out 1048576 bytes (1.0 MB) copied, 0.0128303 s, 81.7 MB/s prepared Thu Apr 18 04:44:48 EDT 2024. fail_loc=0x193 fail_loc=0 stop mds1 starting MDTs with OI scrub disabled (1) oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 Starting client: oleg239-client.virtnet: -o user_xattr,flock oleg239-server@tcp:/lustre /mnt/lustre fail_val=3 fail_loc=0x190 192.168.202.139@tcp:/lustre /mnt/lustre lustre rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project 0 0 Stopping client oleg239-client.virtnet /mnt/lustre (opts:) fail_loc=0x191 stopping mds1 fail_loc=0 fail_val=0 starting MDTs with OI scrub disabled (2) oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 stopping mds1 fail_val=3 fail_loc=0x190 starting MDTs without disabling OI scrub oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 fail_loc=0x192 Waiting 6s for 'failed' Starting client: oleg239-client.virtnet: -o user_xattr,flock oleg239-server@tcp:/lustre /mnt/lustre fail_loc=0 fail_val=0 File: '/mnt/lustre/d5.sanity-scrub/mds1/sanity-scrub.sh' Size: 40777 Blocks: 131 IO Block: 4194304 regular file Device: 2c54f966h/743766374d Inode: 144115205272502341 Links: 1 Access: (0755/-rwxr-xr-x) Uid: ( 0/ root) Gid: ( 0/ root) Access: 2024-04-18 04:44:48.000000000 -0400 Modify: 2024-04-18 04:44:48.000000000 -0400 Change: 2024-04-18 04:44:49.000000000 -0400 Birth: - PASS 5 (116s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-scrub test 6: OI scrub resumes from last checkpoint ========================================================== 04:45:19 (1713429919) preparing... Thu Apr 18 04:45:20 EDT 2024 creating 100 files on mds1 1+0 records in 1+0 records out 1048576 bytes (1.0 MB) copied, 0.0114093 s, 91.9 MB/s prepared Thu Apr 18 04:45:22 EDT 2024. fail_loc=0x193 fail_loc=0 stop mds1 starting MDTs with OI scrub disabled oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 Starting client: oleg239-client.virtnet: -o user_xattr,flock oleg239-server@tcp:/lustre /mnt/lustre fail_val=2 fail_loc=0x190 fail_loc=0x192 Waiting 6s for 'failed' fail_val=3 fail_loc=0x190 File: '/mnt/lustre/d6.sanity-scrub/mds1/sanity-scrub.sh' Size: 40777 Blocks: 131 IO Block: 4194304 regular file Device: 2c54f966h/743766374d Inode: 144115305935798341 Links: 1 Access: (0755/-rwxr-xr-x) Uid: ( 0/ root) Gid: ( 0/ root) Access: 2024-04-18 04:45:21.000000000 -0400 Modify: 2024-04-18 04:45:21.000000000 -0400 Change: 2024-04-18 04:45:22.000000000 -0400 Birth: - 192.168.202.139@tcp:/lustre /mnt/lustre lustre rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project 0 0 Stopping client oleg239-client.virtnet /mnt/lustre (opts:) fail_loc=0x191 stopping mds1 fail_val=3 fail_loc=0x190 starting MDTs without disabling OI scrub oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 fail_loc=0 fail_val=0 Waiting 6s for 'completed' PASS 6 (33s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-scrub test 7: System is available during OI scrub scanning ========================================================== 04:45:54 (1713429954) Checking servers environments Checking clients oleg239-client.virtnet environments Loading modules from /home/green/git/lustre-release/lustre detected 4 online CPUs by sysfs Force libcfs to create 2 CPU partitions loading modules on: 'oleg239-server' oleg239-server: oleg239-server.virtnet: executing load_modules_local oleg239-server: Loading modules from /home/green/git/lustre-release/lustre oleg239-server: detected 4 online CPUs by sysfs oleg239-server: Force libcfs to create 2 CPU partitions Setup mgs, mdt, osts Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg239-server: mount.lustre: according to /etc/mtab lustre-mdt1/mdt1 is already mounted on /mnt/lustre-mds1 pdsh@oleg239-client: oleg239-server: ssh exited with exit code 17 Start of lustre-mdt1/mdt1 on mds1 failed 17 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 oleg239-server: mount.lustre: according to /etc/mtab lustre-ost1/ost1 is already mounted on /mnt/lustre-ost1 pdsh@oleg239-client: oleg239-server: ssh exited with exit code 17 seq.cli-lustre-OST0000-super.width=65536 Start of lustre-ost1/ost1 on ost1 failed 17 Starting ost2: -o localrecov lustre-ost2/ost2 /mnt/lustre-ost2 oleg239-server: mount.lustre: according to /etc/mtab lustre-ost2/ost2 is already mounted on /mnt/lustre-ost2 pdsh@oleg239-client: oleg239-server: ssh exited with exit code 17 seq.cli-lustre-OST0001-super.width=65536 Start of lustre-ost2/ost2 on ost2 failed 17 Starting client: oleg239-client.virtnet: -o user_xattr,flock oleg239-server@tcp:/lustre /mnt/lustre Starting client oleg239-client.virtnet: -o user_xattr,flock oleg239-server@tcp:/lustre /mnt/lustre Started clients oleg239-client.virtnet: 192.168.202.139@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Using TIMEOUT=20 osc.lustre-OST0000-osc-ffff8800ad823000.idle_timeout=debug osc.lustre-OST0001-osc-ffff8800ad823000.idle_timeout=debug disable quota as required preparing... Thu Apr 18 04:46:12 EDT 2024 creating 500 files on mds1 1+0 records in 1+0 records out 1048576 bytes (1.0 MB) copied, 0.0156456 s, 67.0 MB/s prepared Thu Apr 18 04:46:15 EDT 2024. fail_loc=0x193 fail_loc=0 stop mds1 starting MDTs with OI scrub disabled oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 Starting client: oleg239-client.virtnet: -o user_xattr,flock oleg239-server@tcp:/lustre /mnt/lustre fail_val=3 fail_loc=0x190 File: '/mnt/lustre/d7.sanity-scrub/mds1/f7.sanity-scrub300' Size: 0 Blocks: 0 IO Block: 4194304 regular empty file Device: 2c54f966h/743766374d Inode: 144115373044662655 Links: 1 Access: (0444/-r--r--r--) Uid: ( 0/ root) Gid: ( 0/ root) Access: 2024-04-18 04:46:15.000000000 -0400 Modify: 2024-04-18 04:46:15.000000000 -0400 Change: 2024-04-18 04:46:15.000000000 -0400 Birth: - fail_loc=0 fail_val=0 Waiting 6s for 'completed' Updated after 2s: want 'completed' got 'completed' PASS 7 (37s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-scrub test 8: Control OI scrub manually ======== 04:46:33 (1713429993) preparing... Thu Apr 18 04:46:36 EDT 2024 creating 128 files on mds1 1+0 records in 1+0 records out 1048576 bytes (1.0 MB) copied, 0.0116202 s, 90.2 MB/s prepared Thu Apr 18 04:46:38 EDT 2024. fail_loc=0x193 fail_loc=0 stop mds1 starting MDTs with OI scrub disabled oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 fail_val=1 fail_loc=0x190 Started LFSCK on the device lustre-MDT0000: scrub Stopped LFSCK on the device lustre-MDT0000. Started LFSCK on the device lustre-MDT0000: scrub fail_loc=0 fail_val=0 Waiting 6s for 'completed' Updated after 2s: want 'completed' got 'completed' PASS 8 (22s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-scrub test 9: OI scrub speed control =========== 04:46:56 (1713430016) SKIP: sanity-scrub test_9 test scrub speed only on ldiskfs SKIP 9 (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-scrub test 10a: non-stopped OI scrub should auto restarts after MDS remount (1) ========================================================== 04:46:59 (1713430019) Checking servers environments Checking clients oleg239-client.virtnet environments Loading modules from /home/green/git/lustre-release/lustre detected 4 online CPUs by sysfs Force libcfs to create 2 CPU partitions loading modules on: 'oleg239-server' oleg239-server: oleg239-server.virtnet: executing load_modules_local oleg239-server: Loading modules from /home/green/git/lustre-release/lustre oleg239-server: detected 4 online CPUs by sysfs oleg239-server: Force libcfs to create 2 CPU partitions Setup mgs, mdt, osts Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg239-server: mount.lustre: according to /etc/mtab lustre-mdt1/mdt1 is already mounted on /mnt/lustre-mds1 pdsh@oleg239-client: oleg239-server: ssh exited with exit code 17 Start of lustre-mdt1/mdt1 on mds1 failed 17 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 oleg239-server: mount.lustre: according to /etc/mtab lustre-ost1/ost1 is already mounted on /mnt/lustre-ost1 pdsh@oleg239-client: oleg239-server: ssh exited with exit code 17 seq.cli-lustre-OST0000-super.width=65536 Start of lustre-ost1/ost1 on ost1 failed 17 Starting ost2: -o localrecov lustre-ost2/ost2 /mnt/lustre-ost2 oleg239-server: mount.lustre: according to /etc/mtab lustre-ost2/ost2 is already mounted on /mnt/lustre-ost2 pdsh@oleg239-client: oleg239-server: ssh exited with exit code 17 seq.cli-lustre-OST0001-super.width=65536 Start of lustre-ost2/ost2 on ost2 failed 17 Starting client: oleg239-client.virtnet: -o user_xattr,flock oleg239-server@tcp:/lustre /mnt/lustre Starting client oleg239-client.virtnet: -o user_xattr,flock oleg239-server@tcp:/lustre /mnt/lustre Started clients oleg239-client.virtnet: 192.168.202.139@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Using TIMEOUT=20 osc.lustre-OST0000-osc-ffff8800a8b2f000.idle_timeout=debug osc.lustre-OST0001-osc-ffff8800a8b2f000.idle_timeout=debug disable quota as required preparing... Thu Apr 18 04:47:17 EDT 2024 creating 0 files on mds1 1+0 records in 1+0 records out 1048576 bytes (1.0 MB) copied, 0.0125327 s, 83.7 MB/s prepared Thu Apr 18 04:47:18 EDT 2024. fail_loc=0x193 fail_loc=0 stop mds1 starting mds with OI scrub disabled (1) oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 Starting client: oleg239-client.virtnet: -o user_xattr,flock oleg239-server@tcp:/lustre /mnt/lustre fail_val=1 fail_loc=0x190 192.168.202.139@tcp:/lustre /mnt/lustre lustre rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project 0 0 Stopping client oleg239-client.virtnet /mnt/lustre (opts:) stopping mds1 starting MDTs with OI scrub disabled (2) oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 stopping mds1 starting MDTs without disabling OI scrub oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 fail_loc=0 fail_val=0 Waiting 6s for 'completed' PASS 10a (42s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-scrub test 11: OI scrub skips the new created objects only once ========================================================== 04:47:43 (1713430063) SKIP: sanity-scrub test_11 ldiskfs special test SKIP 11 (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-scrub test 12: OI scrub can rebuild invalid /O entries ========================================================== 04:47:46 (1713430066) Checking servers environments Checking clients oleg239-client.virtnet environments Loading modules from /home/green/git/lustre-release/lustre detected 4 online CPUs by sysfs Force libcfs to create 2 CPU partitions loading modules on: 'oleg239-server' oleg239-server: oleg239-server.virtnet: executing load_modules_local oleg239-server: Loading modules from /home/green/git/lustre-release/lustre oleg239-server: detected 4 online CPUs by sysfs oleg239-server: Force libcfs to create 2 CPU partitions Setup mgs, mdt, osts Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg239-server: mount.lustre: according to /etc/mtab lustre-mdt1/mdt1 is already mounted on /mnt/lustre-mds1 pdsh@oleg239-client: oleg239-server: ssh exited with exit code 17 Start of lustre-mdt1/mdt1 on mds1 failed 17 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 oleg239-server: mount.lustre: according to /etc/mtab lustre-ost1/ost1 is already mounted on /mnt/lustre-ost1 pdsh@oleg239-client: oleg239-server: ssh exited with exit code 17 seq.cli-lustre-OST0000-super.width=65536 Start of lustre-ost1/ost1 on ost1 failed 17 Starting ost2: -o localrecov lustre-ost2/ost2 /mnt/lustre-ost2 oleg239-server: mount.lustre: according to /etc/mtab lustre-ost2/ost2 is already mounted on /mnt/lustre-ost2 pdsh@oleg239-client: oleg239-server: ssh exited with exit code 17 seq.cli-lustre-OST0001-super.width=65536 Start of lustre-ost2/ost2 on ost2 failed 17 Starting client: oleg239-client.virtnet: -o user_xattr,flock oleg239-server@tcp:/lustre /mnt/lustre Starting client oleg239-client.virtnet: -o user_xattr,flock oleg239-server@tcp:/lustre /mnt/lustre Started clients oleg239-client.virtnet: 192.168.202.139@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Using TIMEOUT=20 osc.lustre-OST0000-osc-ffff8800aabb2800.idle_timeout=debug osc.lustre-OST0001-osc-ffff8800aabb2800.idle_timeout=debug disable quota as required fail_loc=0x195 total: 64 open/close in 0.28 seconds: 231.95 ops/second 192.168.202.139@tcp:/lustre /mnt/lustre lustre rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project 0 0 Stopping client oleg239-client.virtnet /mnt/lustre (opts:) Stopping /mnt/lustre-ost1 (opts:) on oleg239-server fail_loc=0x233 Starting ost1: -o localrecov -o user_xattr,noscrub lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 Started lustre-OST0000 Starting client: oleg239-client.virtnet: -o user_xattr,flock oleg239-server@tcp:/lustre /mnt/lustre Started LFSCK on the device lustre-OST0000: scrub fail_loc=0 PASS 12 (213s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-scrub test 13: OI scrub can rebuild missed /O entries ========================================================== 04:51:21 (1713430281) fail_loc=0x196 total: 64 open/close in 0.30 seconds: 211.40 ops/second fail_loc=0 192.168.202.139@tcp:/lustre /mnt/lustre lustre rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project 0 0 Stopping client oleg239-client.virtnet /mnt/lustre (opts:) Stopping /mnt/lustre-ost1 (opts:) on oleg239-server Starting ost1: -o localrecov -o user_xattr,noscrub lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 Started lustre-OST0000 Starting client: oleg239-client.virtnet: -o user_xattr,flock oleg239-server@tcp:/lustre /mnt/lustre Started LFSCK on the device lustre-OST0000: scrub PASS 13 (196s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-scrub test 14: OI scrub can repair OST objects under lost+found ========================================================== 04:54:39 (1713430479) SKIP: sanity-scrub test_14 ldiskfs special test SKIP 14 (0s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-scrub test 15: Dryrun mode OI scrub ============ 04:54:41 (1713430481) oleg239-server: oleg239-server.virtnet: executing set_hostid libkmod: kmod_module_get_holders: could not open '/sys/module/intel_rapl/holders': No such file or directory oleg239-server: oleg239-server.virtnet: executing load_modules_local libkmod: kmod_module_get_holders: could not open '/sys/module/pcc_cpufreq/holders': No such file or directory oleg239-server: oleg239-server.virtnet: executing load_modules_local oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 Using TIMEOUT=20 preparing... Thu Apr 18 04:55:37 EDT 2024 creating 20 files on mds1 1+0 records in 1+0 records out 1048576 bytes (1.0 MB) copied, 0.00819751 s, 128 MB/s prepared Thu Apr 18 04:55:37 EDT 2024. fail_loc=0x193 fail_loc=0 stop mds1 starting MDTs with OI scrub disabled oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 Started LFSCK on the device lustre-MDT0000: scrub Started LFSCK on the device lustre-MDT0000: scrub Started LFSCK on the device lustre-MDT0000: scrub Started LFSCK on the device lustre-MDT0000: scrub PASS 15 (71s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-scrub test 16: Initial OI scrub can rebuild crashed index objects ========================================================== 04:55:53 (1713430553) Checking servers environments Checking clients oleg239-client.virtnet environments Loading modules from /home/green/git/lustre-release/lustre detected 4 online CPUs by sysfs Force libcfs to create 2 CPU partitions loading modules on: 'oleg239-server' oleg239-server: oleg239-server.virtnet: executing load_modules_local oleg239-server: Loading modules from /home/green/git/lustre-release/lustre oleg239-server: detected 4 online CPUs by sysfs oleg239-server: Force libcfs to create 2 CPU partitions oleg239-server: libkmod: kmod_module_get_holders: could not open '/sys/module/acpi_cpufreq/holders': No such file or directory Setup mgs, mdt, osts Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg239-server: mount.lustre: according to /etc/mtab lustre-mdt1/mdt1 is already mounted on /mnt/lustre-mds1 pdsh@oleg239-client: oleg239-server: ssh exited with exit code 17 Start of lustre-mdt1/mdt1 on mds1 failed 17 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 oleg239-server: mount.lustre: according to /etc/mtab lustre-ost1/ost1 is already mounted on /mnt/lustre-ost1 pdsh@oleg239-client: oleg239-server: ssh exited with exit code 17 seq.cli-lustre-OST0000-super.width=65536 Start of lustre-ost1/ost1 on ost1 failed 17 Starting ost2: -o localrecov lustre-ost2/ost2 /mnt/lustre-ost2 oleg239-server: mount.lustre: according to /etc/mtab lustre-ost2/ost2 is already mounted on /mnt/lustre-ost2 pdsh@oleg239-client: oleg239-server: ssh exited with exit code 17 seq.cli-lustre-OST0001-super.width=65536 Start of lustre-ost2/ost2 on ost2 failed 17 Starting client: oleg239-client.virtnet: -o user_xattr,flock oleg239-server@tcp:/lustre /mnt/lustre Starting client oleg239-client.virtnet: -o user_xattr,flock oleg239-server@tcp:/lustre /mnt/lustre Started clients oleg239-client.virtnet: 192.168.202.139@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Using TIMEOUT=20 osc.lustre-OST0000-osc-ffff88012c52d000.idle_timeout=debug osc.lustre-OST0001-osc-ffff88012c52d000.idle_timeout=debug disable quota as required fail_loc=0x199 preparing... Thu Apr 18 04:56:09 EDT 2024 creating 0 files on mds1 1+0 records in 1+0 records out 1048576 bytes (1.0 MB) copied, 0.00845239 s, 124 MB/s prepared Thu Apr 18 04:56:10 EDT 2024. stop mds1 fail_loc=0 starting MDTs without disabling OI scrub oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 Starting client: oleg239-client.virtnet: -o user_xattr,flock oleg239-server@tcp:/lustre /mnt/lustre PASS 16 (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-scrub test 17a: ENOSPC on OI insert shouldn't leak inodes ========================================================== 04:56:21 (1713430581) SKIP: sanity-scrub test_17a ldiskfs only test SKIP 17a (0s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-scrub test 17b: ENOSPC on .. insertion shouldn't leak inodes ========================================================== 04:56:23 (1713430583) SKIP: sanity-scrub test_17b ldiskfs only test SKIP 17b (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-scrub test 18: test mount -o resetoi to recreate OI files ========================================================== 04:56:25 (1713430585) preparing... Thu Apr 18 04:56:26 EDT 2024 creating 10 files on mds1 1+0 records in 1+0 records out 1048576 bytes (1.0 MB) copied, 0.00893181 s, 117 MB/s prepared Thu Apr 18 04:56:27 EDT 2024. stop mds1 oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 Starting client: oleg239-client.virtnet: -o user_xattr,flock oleg239-server@tcp:/lustre /mnt/lustre oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 [ 1076.731486] Lustre: lustre-MDT0000: reset Object Index mappings Starting client: oleg239-client.virtnet: -o user_xattr,flock oleg239-server@tcp:/lustre /mnt/lustre PASS 18 (15s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-scrub test 19: LFSCK can fix multiple linked files on OST ========================================================== 04:56:41 (1713430601) total: 64 open/close in 0.14 seconds: 447.32 ops/second stopall link /mnt/lustre-ost1/O/240000400/d2/227 to /mnt/lustre-ost1/O/240000400/d2/226 oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 Started LFSCK on the device lustre-OST0000: scrub Stopping /mnt/lustre-ost1 (opts:) on oleg239-server File: '/mnt/lustre-ost1/O/240000400/d2/226' Size: 0 Blocks: 1 IO Block: 4096 regular empty file Device: 29h/41d Inode: 1115 Links: 1 Access: (7666/-rwSrwSrwT) Uid: ( 0/ root) Gid: ( 0/ root) Access: 1969-12-31 19:00:00.000000000 -0500 Modify: 1969-12-31 19:00:00.000000000 -0500 Change: 2024-04-18 04:57:08.567000000 -0400 Birth: - oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 PASS 19 (165s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-scrub test 20: Don't trigger OI scrub for irreparable oi repeatedly ========================================================== 04:59:28 (1713430768) SKIP: sanity-scrub test_20 ldiskfs only test SKIP 20 (0s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-scrub test 21: don't hang MDS recovery when failed to get update log ========================================================== 04:59:30 (1713430770) SKIP: sanity-scrub test_21 needs >= 2 MDTs SKIP 21 (1s) debug_raw_pointers=0 debug_raw_pointers=0 === sanity-scrub: start setup 04:59:31 (1713430771) === Stopping clients: oleg239-client.virtnet /mnt/lustre (opts:-f) Stopping clients: oleg239-client.virtnet /mnt/lustre2 (opts:-f) Stopping /mnt/lustre-ost1 (opts:-f) on oleg239-server oleg239-server: oleg239-server.virtnet: executing set_hostid Loading modules from /home/green/git/lustre-release/lustre detected 4 online CPUs by sysfs Force libcfs to create 2 CPU partitions loading modules on: 'oleg239-server' oleg239-server: oleg239-server.virtnet: executing load_modules_local oleg239-server: Loading modules from /home/green/git/lustre-release/lustre oleg239-server: detected 4 online CPUs by sysfs oleg239-server: Force libcfs to create 2 CPU partitions Formatting mgs, mds, osts Format mds1: lustre-mdt1/mdt1 Format ost1: lustre-ost1/ost1 Format ost2: lustre-ost2/ost2 Checking servers environments Checking clients oleg239-client.virtnet environments Loading modules from /home/green/git/lustre-release/lustre detected 4 online CPUs by sysfs Force libcfs to create 2 CPU partitions loading modules on: 'oleg239-server' oleg239-server: oleg239-server.virtnet: executing load_modules_local oleg239-server: Loading modules from /home/green/git/lustre-release/lustre oleg239-server: detected 4 online CPUs by sysfs oleg239-server: Force libcfs to create 2 CPU partitions Setup mgs, mdt, osts Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 Commit the device label on lustre-mdt1/mdt1 Started lustre-MDT0000 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 Commit the device label on lustre-ost1/ost1 Started lustre-OST0000 Starting ost2: -o localrecov lustre-ost2/ost2 /mnt/lustre-ost2 seq.cli-lustre-OST0001-super.width=65536 oleg239-server: oleg239-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg239-client: oleg239-server: ssh exited with exit code 1 Commit the device label on lustre-ost2/ost2 Started lustre-OST0001 Starting client: oleg239-client.virtnet: -o user_xattr,flock oleg239-server@tcp:/lustre /mnt/lustre Starting client oleg239-client.virtnet: -o user_xattr,flock oleg239-server@tcp:/lustre /mnt/lustre Started clients oleg239-client.virtnet: 192.168.202.139@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Using TIMEOUT=20 osc.lustre-OST0000-osc-ffff8800a789d800.idle_timeout=debug osc.lustre-OST0001-osc-ffff8800a789d800.idle_timeout=debug disable quota as required === sanity-scrub: finish setup 05:00:08 (1713430808) === == sanity-scrub test complete, duration 1187 sec ========= 05:00:08 (1713430808) === sanity-scrub: start cleanup 05:00:09 (1713430809) === === sanity-scrub: finish cleanup 05:00:09 (1713430809) ===