== sanity-dom test sanityn: Run sanityn with Data-on-MDT files ========================================================== 06:56:07 (1713437767)
excepting tests: 28 102
skipping tests SLOW=no: 33a
=== sanityn: start setup 06:56:10 (1713437770) ===
Starting client oleg405-client.virtnet: -o user_xattr,flock oleg405-server@tcp:/lustre /mnt/lustre2
Started clients oleg405-client.virtnet: 192.168.204.105@tcp:/lustre on /mnt/lustre2 type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project)
oleg405-client.virtnet: executing check_config_client /mnt/lustre
oleg405-client.virtnet: Checking config lustre mounted on /mnt/lustre
Checking servers environments
Checking clients oleg405-client.virtnet environments
Using TIMEOUT=20
osc.lustre-OST0000-osc-ffff8800a9ea0000.idle_timeout=debug
osc.lustre-OST0000-osc-ffff8800aa0e3800.idle_timeout=debug
osc.lustre-OST0001-osc-ffff8800a9ea0000.idle_timeout=debug
osc.lustre-OST0001-osc-ffff8800aa0e3800.idle_timeout=debug
disable quota as required
oleg405-server: oleg405-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
=== sanityn: finish setup 06:56:16 (1713437776) ===
1+0 records in
1+0 records out
1048576 bytes (1.0 MB) copied, 0.0115159 s, 91.1 MB/s
running as uid/gid/euid/egid 500/500/500/500, groups: [true]
running as uid/gid/euid/egid 500/500/500/500, groups: [touch] [/mnt/lustre/d0_runas_test/f9206]
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 1: Check attribute updates on 2 mount points ========================================================== 06:56:17 (1713437777)
/mnt/lustre/f1.sanityn has type file OK
/mnt/lustre/f1.sanityn has perms 0777 OK
/mnt/lustre/f1.sanityn has type file OK
/mnt/lustre/f1.sanityn has perms 0666 OK
/mnt/lustre/f1.sanityn: absent OK
PASS 1 (1s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 2a: check cached attribute updates on 2 mtpt's ================================================================== 06:56:19 (1713437779)
-rw-r--r-- 1 root root 0 Apr 18 06:56 /mnt/lustre2/f2a
/mnt/lustre/f2a has type file OK
/mnt/lustre/f2a has perms 0777 OK
PASS 2a (2s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 2b: check cached attribute updates on 2 mtpt's ================================================================== 06:56:22 (1713437782)
-rw-r--r-- 1 root root 0 Apr 18 06:56 /mnt/lustre2/f2b
/mnt/lustre2/f2b has type file OK
/mnt/lustre2/f2b has perms 0777 OK
PASS 2b (1s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 2c: check cached attribute updates on 2 mtpt's root ============================================================= 06:56:25 (1713437785)
/mnt/lustre2 has type dir OK
/mnt/lustre2 has perms 0777 OK
PASS 2c (1s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 2d: check cached attribute updates on 2 mtpt's root ============================================================= 06:56:28 (1713437788)
/mnt/lustre2 has type dir OK
/mnt/lustre2 has perms 0755 OK
PASS 2d (1s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 2e: check chmod on root is propagated to others ========================================================== 06:56:30 (1713437790)
total 265
-rwxrwxrwx 1 root root 0 Apr 18 06:56 f2a
-rwxrwxrwx 1 root root 0 Apr 18 06:56 f2b
-rw-r--r-- 1 root root 805750 Apr 18 06:48 ffsx.sanity-dom
-rw-r--r-- 1 root root 0 Apr 18 06:48 ffsx.sanity-dom.fsxgood
-rw-r--r-- 1 root root 73236 Apr 18 06:48 ffsx.sanity-dom.fsxlog
total 265
-rwxrwxrwx 1 root root 0 Apr 18 06:56 f2a
-rwxrwxrwx 1 root root 0 Apr 18 06:56 f2b
-rw-r--r-- 1 root root 805750 Apr 18 06:48 ffsx.sanity-dom
-rw-r--r-- 1 root root 0 Apr 18 06:48 ffsx.sanity-dom.fsxgood
-rw-r--r-- 1 root root 73236 Apr 18 06:48 ffsx.sanity-dom.fsxlog
running as uid/gid/euid/egid 500/500/500/500, groups: [dd] [if=/dev/zero] [of=/mnt/lustre2/f2e.sanityn] [count=1]
1+0 records in
1+0 records out
512 bytes (512 B) copied, 0.00163898 s, 312 kB/s
PASS 2e (2s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 2f: check attr/owner updates on DNE with 2 mtpt's ========================================================== 06:56:33 (1713437793)
SKIP: sanityn test_2f needs >= 2 MDTs
SKIP 2f (1s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 2g: check blocks update on sync write ==== 06:56:35 (1713437795)
2+0 records in
2+0 records out
2097152 bytes (2.1 MB) copied, 0.0751432 s, 27.9 MB/s
/mnt/lustre/f2g.sanityn has 4109 blocks
/mnt/lustre2/f2g.sanityn has 4109 blocks
PASS 2g (2s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 4: fstat validation on multiple mount points ==================================================================== 06:56:38 (1713437798)
PASS 4 (2s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 5: create a file on one mount, truncate it on the other ========================================================== 06:56:42 (1713437802)
/mnt/lustre/f5 has type file OK
/mnt/lustre/f5 has size 100 OK
PASS 5 (1s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 6: remove of open file on other node ============================================================================ 06:56:45 (1713437805)
opening
writing
unlinking /mnt/lustre2/f6.sanityn
accessing (1)
seeking (1)
accessing (2)
fstat...
reading
comparing data
truncating
seeking (2)
writing again
seeking (3)
reading again
comparing data again
closing
SUCCESS - goto beer
PASS 6 (1s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 7: remove of open directory on other node ======================================================================= 06:56:47 (1713437807)
creating directory /mnt/lustre/d7
opening directory
unlinking /mnt/lustre/d7
Ok, everything goes well.
PASS 7 (2s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 8: remove of open special file on other node ==================================================================== 06:56:50 (1713437810)
creating special file /mnt/lustre/f8.sanityn
opening file
unlinking /mnt/lustre/f8.sanityn
Ok, everything goes well.
PASS 8 (2s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 9a: append of file with sub-page size on multiple mounts ========================================================== 06:56:53 (1713437813)
PASS 9a (2s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 9b: append to striped sparse file ======== 06:56:56 (1713437816)
1+0 records in
1+0 records out
1048576 bytes (1.0 MB) copied, 0.00918963 s, 114 MB/s
3+0 records in
3+0 records out
3 bytes (3 B) copied, 0.0188647 s, 0.2 kB/s
Data read (expecting 'foo'): foo
PASS 9b (2s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 10a: write of file with sub-page size on multiple mounts ========================================================== 06:56:59 (1713437819)
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00127544 s, 0.8 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00281067 s, 0.4 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00199076 s, 0.5 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00202645 s, 0.5 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00241359 s, 0.4 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00280097 s, 0.4 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00200918 s, 0.5 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00197122 s, 0.5 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.0020451 s, 0.5 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00252126 s, 0.4 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00227534 s, 0.4 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00200376 s, 0.5 kB/s
PASS 10a (2s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 10b: write of file with sub-page size on multiple mounts ========================================================== 06:57:02 (1713437822)
1+0 records in
1+0 records out
3072 bytes (3.1 kB) copied, 0.00211813 s, 1.5 MB/s
1+0 records in
1+0 records out
4096 bytes (4.1 kB) copied, 0.00653731 s, 627 kB/s
1+0 records in
1+0 records out
3072 bytes (3.1 kB) copied, 0.00017333 s, 17.7 MB/s
PASS 10b (1s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 11: execution of file opened for write should return error ============================================================== 06:57:05 (1713437825)
multiop /mnt/lustre/d11/f vO_c
TMPPIPE=/tmp/multiop_open_wait_pipe.9206
/home/green/git/lustre-release/lustre/tests/sanityn.sh: line 301: /mnt/lustre2/d11/f: Text file busy
PASS 11 (1s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 12: test lock ordering (link, stat, unlink) ========================================================== 06:57:08 (1713437828)
warning: '-runtime' deprecated, use '-t runtime' instead
 - link 3280 (time 1713437838.46 total 10.00 last 327.97)
 - link 8246 (time 1713437848.47 total 20.00 last 496.50)
 - link 10000 (time 1713437851.78 total 23.32 last 528.96)
 - link 15203 (time 1713437861.78 total 33.32 last 520.22)
 - link 20000 (time 1713437871.02 total 42.56 last 519.45)
 - link 25120 (time 1713437881.02 total 52.56 last 511.93)
 - link 28591 (time 1713437891.02 total 62.56 last 347.05)
 - link 30000 (time 1713437895.17 total 66.70 last 339.92)
 - link 33806 (time 1713437905.17 total 76.70 last 380.58)
 - link 37617 (time 1713437915.17 total 86.71 last 380.97)
 - link 40000 (time 1713437922.06 total 93.60 last 345.81)
total: 42300 link in 100.00 seconds: 423.00 ops/second
using seed 3149013047
running for 100 seconds
 - stat 3624 (time 1713437840 ; total 11 ; last 11)
 - stat 10000 (time 1713437846 ; total 17 ; last 6)
 - stat 19286 (time 1713437857 ; total 28 ; last 11)
 - stat 20000 (time 1713437857 ; total 28 ; last 0)
 - stat 29095 (time 1713437868 ; total 39 ; last 11)
 - stat 30000 (time 1713437869 ; total 40 ; last 1)
 - stat 39917 (time 1713437880 ; total 51 ; last 11)
 - stat 40000 (time 1713437880 ; total 51 ; last 0)
 - stat 46419 (time 1713437891 ; total 62 ; last 11)
 - stat 50000 (time 1713437897 ; total 68 ; last 6)
 - stat 56906 (time 1713437908 ; total 79 ; last 11)
 - stat 60000 (time 1713437913 ; total 84 ; last 5)
 - stat 66757 (time 1713437924 ; total 95 ; last 11)
 - stat 70000 (time 1713437928 ; total 99 ; last 4)
total: 70454 stats in 100 seconds: 704.539978 stats/second
 - unlinked 0 (time 1713437879 ; total 0 ; last 0)
 - unlinked 10000 (time 1713437929 ; total 50 ; last 50)
 - unlinked 20000 (time 1713437953 ; total 74 ; last 24)
 - unlinked 30000 (time 1713437977 ; total 98 ; last 24)
 - unlinked 40000 (time 1713438000 ; total 121 ; last 23)
unlink(/mnt/lustre2/lockdir/lockfile42300) error: No such file or directory
total: 42299 unlinks in 127 seconds: 333.062988 unlinks/second
/home/green/git/lustre-release/lustre/tests/lockorder.sh: line 77: kill: (24139) - No such process
/home/green/git/lustre-release/lustre/tests/lockorder.sh: line 78: kill: (24141) - No such process
PASS 12 (179s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 14aa: execution of file open for write returns -ETXTBSY ========================================================== 07:00:08 (1713438008)
multiop /mnt/lustre/d14aa.sanityn/f14aa.sanityn vOw_c
TMPPIPE=/tmp/multiop_open_wait_pipe.9206
/home/green/git/lustre-release/lustre/tests/sanityn.sh: line 336: /mnt/lustre2/d14aa.sanityn/f14aa.sanityn: Text file busy
/home/green/git/lustre-release/lustre/tests/test-framework.sh: line 4697: 25027 Terminated $MULTIOP_PROG $FILE v$ARGS > $TMPPIPE (wd: ~)
PASS 14aa (2s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 14ab: open(RDWR) of executing file returns -ETXTBSY ========================================================== 07:00:11 (1713438011)
open(O_RDWR|O_CREAT): Text file busy
/home/green/git/lustre-release/lustre/tests/test-framework.sh: line 4697: 25764 Terminated $DIR1/$tdir/sleep 60 (wd: ~)
PASS 14ab (2s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 14b: truncate of executing file returns -ETXTBSY ================================================================ 07:00:14 (1713438014)
truncate: cannot truncate '/mnt/lustre2/d14b.sanityn/sleep' to length 60: Text file busy
/home/green/git/lustre-release/lustre/tests/sanityn.sh: line 351: 26501 Terminated $DIR1/$tdir/sleep 60
PASS 14b (2s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 14c: open(O_TRUNC) of executing file return -ETXTBSY ============================================================ 07:00:17 (1713438017)
cp: cannot create regular file '/mnt/lustre2/d14c.sanityn/sleep': Text file busy
/home/green/git/lustre-release/lustre/tests/sanityn.sh: line 363: 27240 Terminated $DIR1/$tdir/sleep 60
PASS 14c (2s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 14d: chmod of executing file is still possible ================================================================== 07:00:20 (1713438020)
chmod
/home/green/git/lustre-release/lustre/tests/sanityn.sh: line 374: 27978 Terminated $DIR1/$tdir/sleep 60
PASS 14d (2s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 17: resource creation/LVB creation race ========================================================================= 07:00:23 (1713438023)
fail_loc=0x8000030a
PASS 17 (4s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 19: test concurrent uncached read races ========================================================================= 07:00:29 (1713438029)
oleg405-server: error: get_param: param_path 'osd-*/lustre-MDT*/read_cache_enable': No such file or directory
pdsh@oleg405-client: oleg405-server: ssh exited with exit code 2
SKIP: sanityn test_19 not cache-capable obdfilter
SKIP 19 (1s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 20: test extra readahead page left in cache ============================================================== 07:00:31 (1713438031)
PASS 20 (2s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 23: others should see updated atime while another read============================================================== 07:00:34 (1713438034)
now is 1713438035
starting reads
multiop /mnt/lustre/f23.sanityn vor20_c
TMPPIPE=/tmp/multiop_open_wait_pipe.9206
new atime is 1713438096
PASS 23 (63s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 27: align non-overlapping extent locks from request ============================================================= 07:01:38 (1713438098)
4+0 records in
4+0 records out
16793600 bytes (17 MB) copied, 0.143572 s, 117 MB/s
dd 1 started
dd 2 started
1+0 records in
1+0 records out
15728640 bytes (16 MB) copied, 0.142562 s, 110 MB/s
1+0 records in
1+0 records out
8192 bytes (8.2 kB) copied, 0.00421601 s, 1.9 MB/s
dd 3 finished
PASS 27 (5s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 39a: file mtime does not change after rename ========================================================== 07:01:45 (1713438105)
repeat after cancel_lru_locks
PASS 39a (1s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 39b: file mtime the same on clients with/out lock ========================================================== 07:01:48 (1713438108)
repeat after cancel_lru_locks
PASS 39b (3s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 39c: check truncate mtime update ================================================================================ 07:01:53 (1713438113)
repeat after cancel_lru_locks
PASS 39c (3s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 39d: sync write should update mtime ====== 07:01:57 (1713438117)
fail_loc=0x411
fail_loc=0
PASS 39d (2s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 51a: layout lock: refresh layout should work ========================================================== 07:02:01 (1713438121)
0+1 records in
0+1 records out
159 bytes (159 B) copied, 0.00725966 s, 21.9 kB/s
/home/green/git/lustre-release/lustre/tests/sanityn.sh: line 3445: kill: (3620) - No such process
PASS 51a (4s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 51c: layout lock: IT_LAYOUT blocked and correct layout can be returned ========================================================== 07:02:06 (1713438126)
fail_loc=0x172
Setting layout to have 2 stripes ...
1+0 records in
1+0 records out
1024 bytes (1.0 kB) copied, 0.00469535 s, 218 kB/s
PASS 51c (6s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 51d: layout lock: losing layout lock should clean up memory map region ========================================================== 07:02:14 (1713438134)
1+0 records in
1+0 records out
1048576 bytes (1.0 MB) copied, 0.00871524 s, 120 MB/s
Before revoking layout lock: 1024 KB mapped
PASS 51d (3s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 107a: Basic grouplock conflict =========== 07:02:20 (1713438140)
10+0 records in
10+0 records out
10485760 bytes (10 MB) copied, 0.147383 s, 71.1 MB/s
/mnt/lustre/f107a.sanityn
  lcm_layout_gen: 3
  lcm_mirror_count: 1
  lcm_entry_count: 2
    lcme_id: 1
    lcme_mirror_id: 0
    lcme_flags: init
    lcme_extent.e_start: 0
    lcme_extent.e_end: 1048576
      lmm_stripe_count: 0
      lmm_stripe_size: 1048576
      lmm_pattern: mdt
      lmm_layout_gen: 0
      lmm_stripe_offset: 0
    lcme_id: 2
    lcme_mirror_id: 0
    lcme_flags: init
    lcme_extent.e_start: 1048576
    lcme_extent.e_end: EOF
      lmm_stripe_count: 1
      lmm_stripe_size: 1048576
      lmm_pattern: raid0
      lmm_layout_gen: 0
      lmm_stripe_offset: 1
      lmm_objects:
      - 0: { l_ost_idx: 1, l_fid: [0x280000400:0x3db:0x0] }
multiop /mnt/lustre/f107a.sanityn vOG14091995_g14091995c
TMPPIPE=/tmp/multiop_open_wait_pipe.9206
multiop /mnt/lustre2/f107a.sanityn vO_G16022000r10g16022000c
TMPPIPE=/tmp/multiop_open_wait_pipe.9206
First grouplock blocks second one
PASS 107a (4s)
debug_raw_pointers=1
debug_raw_pointers=1
debug_raw_pointers=Y
debug_raw_pointers=Y
== sanityn test 107b: Grouplock is added to the head of waiting list ========================================================== 07:02:26 (1713438146)
10+0 records in
10+0 records out
10485760 bytes (10 MB) copied, 0.107524 s, 97.5 MB/s
/mnt/lustre/f107b.sanityn
  lcm_layout_gen: 3
  lcm_mirror_count: 1
  lcm_entry_count: 2
    lcme_id: 1
    lcme_mirror_id: 0
    lcme_flags: init
    lcme_extent.e_start: 0
    lcme_extent.e_end: 1048576
      lmm_stripe_count: 0
      lmm_stripe_size: 1048576
      lmm_pattern: mdt
      lmm_layout_gen: 0
      lmm_stripe_offset: 0
    lcme_id: 2
    lcme_mirror_id: 0
    lcme_flags: init
    lcme_extent.e_start: 1048576
    lcme_extent.e_end: EOF
      lmm_stripe_count: 1
      lmm_stripe_size: 1048576
      lmm_pattern: raid0
      lmm_layout_gen: 0
      lmm_stripe_offset: 0
      lmm_objects:
      - 0: { l_ost_idx: 0, l_fid: [0x240000400:0x3dd:0x0] }
multiop /mnt/lustre/f107b.sanityn vOG14091995_g14091995c
TMPPIPE=/tmp/multiop_open_wait_pipe.9206
Grouplock blocks IO
First grouplock blocks second one
Second grouplock blocks IO
PASS 107b (8s)
debug_raw_pointers=1
debug_raw_pointers=1
cleanup: ======================================================
== sanityn test complete, duration 388 sec =============== 07:02:35 (1713438155)
=== sanityn: start cleanup 07:02:36 (1713438156) ===
Stopping clients: oleg405-client.virtnet /mnt/lustre2 (opts:)
Stopping client oleg405-client.virtnet /mnt/lustre2 opts:
=== sanityn: finish cleanup 07:02:36 (1713438156) ===