-----============= acceptance-small: sanity-lfsck ============----- Fri Apr 19 08:35:02 EDT 2024 excepting tests: 23b Loading modules from /home/green/git/lustre-release/lustre detected 4 online CPUs by sysfs Force libcfs to create 2 CPU partitions loading modules on: 'oleg445-server' oleg445-server: oleg445-server.virtnet: executing load_modules_local oleg445-server: Loading modules from /home/green/git/lustre-release/lustre oleg445-server: detected 4 online CPUs by sysfs oleg445-server: Force libcfs to create 2 CPU partitions client=34553315 MDS=34553315 OSS=34553315 Stopping clients: oleg445-client.virtnet /mnt/lustre (opts:) Stopping client oleg445-client.virtnet /mnt/lustre opts: Stopping clients: oleg445-client.virtnet /mnt/lustre2 (opts:) Stopping /mnt/lustre-mds1 (opts:-f) on oleg445-server Stopping /mnt/lustre-ost1 (opts:-f) on oleg445-server Stopping /mnt/lustre-ost2 (opts:-f) on oleg445-server unloading modules on: 'oleg445-server' oleg445-server: oleg445-server.virtnet: executing unload_modules_local modules unloaded. 
=== sanity-lfsck: start setup 08:35:15 (1713530115) === Stopping clients: oleg445-client.virtnet /mnt/lustre (opts:-f) Stopping clients: oleg445-client.virtnet /mnt/lustre2 (opts:-f) pdsh@oleg445-client: oleg445-server: ssh exited with exit code 2 oleg445-server: oleg445-server.virtnet: executing set_hostid Loading modules from /home/green/git/lustre-release/lustre detected 4 online CPUs by sysfs Force libcfs to create 2 CPU partitions ../libcfs/libcfs/libcfs options: 'cpu_npartitions=2' ptlrpc/ptlrpc options: 'lbug_on_grant_miscount=1' quota/lquota options: 'hash_lqs_cur_bits=3' loading modules on: 'oleg445-server' oleg445-server: oleg445-server.virtnet: executing load_modules_local oleg445-server: Loading modules from /home/green/git/lustre-release/lustre oleg445-server: detected 4 online CPUs by sysfs oleg445-server: Force libcfs to create 2 CPU partitions oleg445-server: ptlrpc/ptlrpc options: 'lbug_on_grant_miscount=1' oleg445-server: quota/lquota options: 'hash_lqs_cur_bits=3' Formatting mgs, mds, osts Format mds1: lustre-mdt1/mdt1 Format ost1: lustre-ost1/ost1 Format ost2: lustre-ost2/ost2 Checking servers environments Checking clients oleg445-client.virtnet environments Loading modules from /home/green/git/lustre-release/lustre detected 4 online CPUs by sysfs Force libcfs to create 2 CPU partitions loading modules on: 'oleg445-server' oleg445-server: oleg445-server.virtnet: executing load_modules_local oleg445-server: Loading modules from /home/green/git/lustre-release/lustre oleg445-server: detected 4 online CPUs by sysfs oleg445-server: Force libcfs to create 2 CPU partitions Setup mgs, mdt, osts Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg445-server: oleg445-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg445-client: oleg445-server: ssh exited with exit code 1 Commit the device label on lustre-mdt1/mdt1 Started lustre-MDT0000 Starting ost1: -o localrecov 
lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg445-server: oleg445-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg445-client: oleg445-server: ssh exited with exit code 1 Commit the device label on lustre-ost1/ost1 Started lustre-OST0000 Starting ost2: -o localrecov lustre-ost2/ost2 /mnt/lustre-ost2 seq.cli-lustre-OST0001-super.width=65536 oleg445-server: oleg445-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg445-client: oleg445-server: ssh exited with exit code 1 Commit the device label on lustre-ost2/ost2 Started lustre-OST0001 Starting client: oleg445-client.virtnet: -o user_xattr,flock oleg445-server@tcp:/lustre /mnt/lustre Starting client oleg445-client.virtnet: -o user_xattr,flock oleg445-server@tcp:/lustre /mnt/lustre Started clients oleg445-client.virtnet: 192.168.204.145@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Using TIMEOUT=20 osc.lustre-OST0000-osc-ffff8800ab1e3800.idle_timeout=debug osc.lustre-OST0001-osc-ffff8800ab1e3800.idle_timeout=debug setting jobstats to procname_uid Setting lustre.sys.jobid_var from disable to procname_uid Waiting 90s for 'procname_uid' Updated after 6s: want 'procname_uid' got 'procname_uid' disable quota as required === sanity-lfsck: finish setup 08:36:05 (1713530165) === debug_raw_pointers=Y debug_raw_pointers=Y == sanity-lfsck test 0: Control LFSCK manually =========== 08:36:06 (1713530166) preparing... 3 * 3 files will be created Fri Apr 19 08:36:06 EDT 2024. total: 3 mkdir in 0.01 seconds: 527.90 ops/second total: 3 create in 0.00 seconds: 638.69 ops/second total: 3 mkdir in 0.01 seconds: 502.51 ops/second prepared Fri Apr 19 08:36:07 EDT 2024. 
fail_val=3
fail_loc=0x1600
Started LFSCK on the device lustre-MDT0000: scrub namespace
name: lfsck_namespace
magic: 0xa06249ff
version: 2
status: scanning-phase1
flags:
param:
last_completed_time: N/A
time_since_last_completed: N/A
latest_start_time: 1713530168
time_since_latest_start: 1 seconds
last_checkpoint_time: N/A
time_since_last_checkpoint: N/A
latest_start_position: 314, N/A, N/A
last_checkpoint_position: N/A, N/A, N/A
first_failure_position: N/A, N/A, N/A
checked_phase1: 0
checked_phase2: 0
updated_phase1: 0
updated_phase2: 0
failed_phase1: 0
failed_phase2: 0
directories: 0
dirent_repaired: 0
linkea_repaired: 0
nlinks_repaired: 0
multiple_linked_checked: 0
multiple_linked_repaired: 0
unknown_inconsistency: 0
unmatched_pairs_repaired: 0
dangling_repaired: 0
multiple_referenced_repaired: 0
bad_file_type_repaired: 0
lost_dirent_repaired: 0
local_lost_found_scanned: 0
local_lost_found_moved: 0
local_lost_found_skipped: 0
local_lost_found_failed: 0
striped_dirs_scanned: 0
striped_dirs_repaired: 0
striped_dirs_failed: 0
striped_dirs_disabled: 0
striped_dirs_skipped: 0
striped_shards_scanned: 0
striped_shards_repaired: 0
striped_shards_failed: 0
striped_shards_skipped: 0
name_hash_repaired: 0
linkea_overflow_cleared: 0
agent_entries_repaired: 0
success_count: 0
run_time_phase1: 1 seconds
run_time_phase2: 0 seconds
average_speed_phase1: 0 items/sec
average_speed_phase2: N/A
average_speed_total: 0 items/sec
real_time_speed_phase1: 0 items/sec
real_time_speed_phase2: N/A
current_position: 313, N/A, N/A
Stopped LFSCK on the device lustre-MDT0000.
Started LFSCK on the device lustre-MDT0000: scrub namespace fail_loc=0 fail_val=0 Started LFSCK on the device lustre-MDT0000: scrub namespace stopall, should NOT crash LU-3649 Stopping clients: oleg445-client.virtnet /mnt/lustre (opts:) Stopping client oleg445-client.virtnet /mnt/lustre opts: Stopping clients: oleg445-client.virtnet /mnt/lustre2 (opts:) Stopping /mnt/lustre-mds1 (opts:-f) on oleg445-server Stopping /mnt/lustre-ost1 (opts:-f) on oleg445-server Stopping /mnt/lustre-ost2 (opts:-f) on oleg445-server PASS 0 (18s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-lfsck test 1a: LFSCK can find out and repair crashed FID-in-dirent ========================================================== 08:36:26 (1713530186) Checking servers environments Checking clients oleg445-client.virtnet environments Loading modules from /home/green/git/lustre-release/lustre detected 4 online CPUs by sysfs Force libcfs to create 2 CPU partitions loading modules on: 'oleg445-server' oleg445-server: oleg445-server.virtnet: executing load_modules_local oleg445-server: Loading modules from /home/green/git/lustre-release/lustre oleg445-server: detected 4 online CPUs by sysfs oleg445-server: Force libcfs to create 2 CPU partitions Setup mgs, mdt, osts Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg445-server: oleg445-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg445-client: oleg445-server: ssh exited with exit code 1 Started lustre-MDT0000 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg445-server: oleg445-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg445-client: oleg445-server: ssh exited with exit code 1 Started lustre-OST0000 Starting ost2: -o localrecov lustre-ost2/ost2 /mnt/lustre-ost2 
seq.cli-lustre-OST0001-super.width=65536 oleg445-server: oleg445-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg445-client: oleg445-server: ssh exited with exit code 1 Started lustre-OST0001 Starting client: oleg445-client.virtnet: -o user_xattr,flock oleg445-server@tcp:/lustre /mnt/lustre Starting client oleg445-client.virtnet: -o user_xattr,flock oleg445-server@tcp:/lustre /mnt/lustre Started clients oleg445-client.virtnet: 192.168.204.145@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Using TIMEOUT=20 osc.lustre-OST0000-osc-ffff8800b5c82000.idle_timeout=debug osc.lustre-OST0001-osc-ffff8800b5c82000.idle_timeout=debug disable quota as required preparing... 1 * 1 files will be created Fri Apr 19 08:36:51 EDT 2024. total: 1 mkdir in 0.00 seconds: 500.51 ops/second total: 1 create in 0.00 seconds: 628.93 ops/second total: 1 mkdir in 0.00 seconds: 628.93 ops/second prepared Fri Apr 19 08:36:51 EDT 2024. 
fail_loc=0x1501 fail_loc=0 192.168.204.145@tcp:/lustre /mnt/lustre lustre rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project 0 0 Stopping client oleg445-client.virtnet /mnt/lustre (opts:) Started LFSCK on the device lustre-MDT0000: scrub namespace Starting client: oleg445-client.virtnet: -o user_xattr,flock oleg445-server@tcp:/lustre /mnt/lustre fail_loc=0x1505 fail_loc=0 PASS 1a (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-lfsck test 1b: LFSCK can find out and repair the missing FID-in-LMA ========================================================== 08:36:56 (1713530216) SKIP: sanity-lfsck test_1b OI Scrub not implemented for ZFS SKIP 1b (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-lfsck test 1c: LFSCK can find out and repair lost FID-in-dirent ========================================================== 08:36:58 (1713530218) preparing... 1 * 1 files will be created Fri Apr 19 08:36:59 EDT 2024. total: 1 mkdir in 0.00 seconds: 512.56 ops/second total: 1 create in 0.00 seconds: 684.00 ops/second total: 1 mkdir in 0.00 seconds: 522.20 ops/second prepared Fri Apr 19 08:37:00 EDT 2024. fail_loc=0x1504 fail_loc=0 192.168.204.145@tcp:/lustre /mnt/lustre lustre rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project 0 0 Stopping client oleg445-client.virtnet /mnt/lustre (opts:) Started LFSCK on the device lustre-MDT0000: scrub namespace Starting client: oleg445-client.virtnet: -o user_xattr,flock oleg445-server@tcp:/lustre /mnt/lustre fail_loc=0x1505 fail_loc=0 PASS 1c (5s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-lfsck test 2a: LFSCK can find out and repair crashed linkEA entry ========================================================== 08:37:04 (1713530224) preparing... 
1 * 1 files will be created Fri Apr 19 08:37:05 EDT 2024. total: 1 mkdir in 0.00 seconds: 510.44 ops/second total: 1 create in 0.00 seconds: 752.48 ops/second total: 1 mkdir in 0.00 seconds: 620.73 ops/second prepared Fri Apr 19 08:37:06 EDT 2024. fail_loc=0x1603 fail_loc=0 192.168.204.145@tcp:/lustre /mnt/lustre lustre rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project 0 0 Stopping client oleg445-client.virtnet /mnt/lustre (opts:) Started LFSCK on the device lustre-MDT0000: scrub namespace Starting client: oleg445-client.virtnet: -o user_xattr,flock oleg445-server@tcp:/lustre /mnt/lustre PASS 2a (4s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == sanity-lfsck test 2b: LFSCK can find out and remove invalid linkEA entry ========================================================== 08:37:10 (1713530230) preparing... 1 * 1 files will be created Fri Apr 19 08:37:10 EDT 2024. total: 1 mkdir in 0.00 seconds: 459.75 ops/second total: 1 create in 0.00 seconds: 517.05 ops/second total: 1 mkdir in 0.00 seconds: 469.74 ops/second prepared Fri Apr 19 08:37:11 EDT 2024. fail_loc=0x1604 fail_loc=0 192.168.204.145@tcp:/lustre /mnt/lustre lustre rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project 0 0 Stopping client oleg445-client.virtnet /mnt/lustre (opts:) Started LFSCK on the device lustre-MDT0000: scrub namespace Starting client: oleg445-client.virtnet: -o user_xattr,flock oleg445-server@tcp:/lustre /mnt/lustre PASS 2b (4s) debug_raw_pointers=0