== sanity-dom test sanityn: Run sanityn with Data-on-MDT files ========================================================== 04:50:46 (1713343846)
excepting tests: 28
skipping tests SLOW=no: 33a
Starting client oleg241-client.virtnet: -o user_xattr,flock oleg241-server@tcp:/lustre /mnt/lustre2
Started clients oleg241-client.virtnet:
192.168.202.141@tcp:/lustre on /mnt/lustre2 type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt)
oleg241-client.virtnet: executing check_config_client /mnt/lustre
oleg241-client.virtnet: Checking config lustre mounted on /mnt/lustre
Checking servers environments
Checking clients oleg241-client.virtnet environments
Using TIMEOUT=20
osc.lustre-OST0000-osc-ffff8800a9627800.idle_timeout=debug
osc.lustre-OST0000-osc-ffff8800b5cb2000.idle_timeout=debug
osc.lustre-OST0001-osc-ffff8800a9627800.idle_timeout=debug
osc.lustre-OST0001-osc-ffff8800b5cb2000.idle_timeout=debug
disable quota as required
oleg241-server: oleg241-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all 8
osd-ldiskfs.track_declares_assert=1
1+0 records in
1+0 records out
1048576 bytes (1.0 MB) copied, 0.0115343 s, 90.9 MB/s
running as uid/gid/euid/egid 500/500/500/500, groups: [true]
running as uid/gid/euid/egid 500/500/500/500, groups: [touch] [/mnt/lustre/d0_runas_test/f5368]

== sanityn test 1: Check attribute updates on 2 mount points ========================================================== 04:50:56 (1713343856)
/mnt/lustre/f1.sanityn has type file OK
/mnt/lustre/f1.sanityn has perms 0777 OK
/mnt/lustre/f1.sanityn has type file OK
/mnt/lustre/f1.sanityn has perms 0666 OK
/mnt/lustre/f1.sanityn: absent OK
PASS 1 (1s)

== sanityn test 2a: check cached attribute updates on 2 mtpt's ================================================================== 04:50:57 (1713343857)
-rw-r--r-- 1 root root 0 Apr 17 04:50 /mnt/lustre2/f2a
/mnt/lustre/f2a has type file OK
/mnt/lustre/f2a has perms 0777 OK
PASS 2a (2s)

== sanityn test 2b: check cached attribute updates on 2 mtpt's ================================================================== 04:50:59 (1713343859)
-rw-r--r-- 1 root root 0 Apr 17 04:50 /mnt/lustre2/f2b
/mnt/lustre2/f2b has type file OK
/mnt/lustre2/f2b has perms 0777 OK
PASS 2b (1s)

== sanityn test 2c: check cached attribute updates on 2 mtpt's root ============================================================= 04:51:00 (1713343860)
/mnt/lustre2 has type dir OK
/mnt/lustre2 has perms 0777 OK
PASS 2c (2s)

== sanityn test 2d: check cached attribute updates on 2 mtpt's root ============================================================= 04:51:02 (1713343862)
/mnt/lustre2 has type dir OK
/mnt/lustre2 has perms 0755 OK
PASS 2d (1s)

== sanityn test 2e: check chmod on root is propagated to others ========================================================== 04:51:03 (1713343863)
total 296
-rwxrwxrwx 1 root root 0 Apr 17 04:50 f2a
-rwxrwxrwx 1 root root 0 Apr 17 04:50 f2b
-rw-r--r-- 1 root root 1934406 Apr 17 04:44 ffsx.sanity-dom
-rw-r--r-- 1 root root 0 Apr 17 04:44 ffsx.sanity-dom.fsxgood
-rw-r--r-- 1 root root 107186 Apr 17 04:44 ffsx.sanity-dom.fsxlog
total 296
-rwxrwxrwx 1 root root 0 Apr 17 04:50 f2a
-rwxrwxrwx 1 root root 0 Apr 17 04:50 f2b
-rw-r--r-- 1 root root 1934406 Apr 17 04:44 ffsx.sanity-dom
-rw-r--r-- 1 root root 0 Apr 17 04:44 ffsx.sanity-dom.fsxgood
-rw-r--r-- 1 root root 107186 Apr 17 04:44 ffsx.sanity-dom.fsxlog
running as uid/gid/euid/egid 500/500/500/500, groups: [dd] [if=/dev/zero] [of=/mnt/lustre2/f2e.sanityn] [count=1]
1+0 records in
1+0 records out
512 bytes (512 B) copied, 0.00129771 s, 395 kB/s
PASS 2e (2s)

== sanityn test 2f: check attr/owner updates on DNE with 2 mtpt's ========================================================== 04:51:05 (1713343865)
/mnt/lustre2/d2f.sanityn/remote_dir/f2f.sanityn has type file OK
/mnt/lustre2/d2f.sanityn/remote_dir/f2f.sanityn has perms 0777 OK
/mnt/lustre2/d2f.sanityn/remote_dir/f2f.sanityn is owned by user #500 OK
/mnt/lustre2/d2f.sanityn/remote_dir/f2f.sanityn is owned by group #500 OK
Can't lstat /mnt/lustre2/d2f.sanityn/remote_dir/f2f.sanityn: No such file or directory
PASS 2f (1s)

== sanityn test 2g: check blocks update on sync write ==== 04:51:06 (1713343866)
2+0 records in
2+0 records out
2097152 bytes (2.1 MB) copied, 0.0524044 s, 40.0 MB/s
/mnt/lustre/f2g.sanityn has 4104 blocks
/mnt/lustre2/f2g.sanityn has 4104 blocks
PASS 2g (2s)

== sanityn test 4: fstat validation on multiple mount points ==================================================================== 04:51:08 (1713343868)
PASS 4 (2s)

== sanityn test 5: create a file on one mount, truncate it on the other ========================================================== 04:51:10 (1713343870)
/mnt/lustre/f5 has type file OK
/mnt/lustre/f5 has size 100 OK
PASS 5 (2s)

== sanityn test 6: remove of open file on other node ============================================================================ 04:51:12 (1713343872)
opening
writing
unlinking /mnt/lustre2/f6.sanityn
accessing (1)
seeking (1)
accessing (2)
fstat...
reading
comparing data
truncating
seeking (2)
writing again
seeking (3)
reading again
comparing data again
closing
SUCCESS - goto beer
PASS 6 (1s)

== sanityn test 7: remove of open directory on other node ======================================================================= 04:51:13 (1713343873)
creating directory /mnt/lustre/d7
opening directory
unlinking /mnt/lustre/d7
Ok, everything goes well.
PASS 7 (2s)

== sanityn test 8: remove of open special file on other node ==================================================================== 04:51:15 (1713343875)
creating special file /mnt/lustre/f8.sanityn
opening file
unlinking /mnt/lustre/f8.sanityn
Ok, everything goes well.
PASS 8 (1s)

== sanityn test 9a: append of file with sub-page size on multiple mounts ========================================================== 04:51:16 (1713343876)
PASS 9a (2s)

== sanityn test 9b: append to striped sparse file ======== 04:51:18 (1713343878)
1+0 records in
1+0 records out
1048576 bytes (1.0 MB) copied, 0.012449 s, 84.2 MB/s
3+0 records in
3+0 records out
3 bytes (3 B) copied, 0.0108789 s, 0.3 kB/s
Data read (expecting 'foo'): foo
PASS 9b (1s)

== sanityn test 10a: write of file with sub-page size on multiple mounts ========================================================== 04:51:19 (1713343879)
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.0013471 s, 0.7 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00134923 s, 0.7 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00274899 s, 0.4 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00135329 s, 0.7 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00222952 s, 0.4 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00137765 s, 0.7 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00217369 s, 0.5 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00136169 s, 0.7 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00209263 s, 0.5 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00115815 s, 0.9 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00238538 s, 0.4 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00130358 s, 0.8 kB/s
PASS 10a (2s)

== sanityn test 10b: write of file with sub-page size on multiple mounts ========================================================== 04:51:21 (1713343881)
1+0 records in
1+0 records out
3072 bytes (3.1 kB) copied, 0.00138494 s, 2.2 MB/s
1+0 records in
1+0 records out
4096 bytes (4.1 kB) copied, 0.0065184 s, 628 kB/s
1+0 records in
1+0 records out
3072 bytes (3.1 kB) copied, 0.00022836 s, 13.5 MB/s
PASS 10b (1s)

== sanityn test 11: execution of file opened for write should return error ============================================================== 04:51:22 (1713343882)
striped dir -i1 -c2 -H all_char /mnt/lustre/d11
multiop /mnt/lustre/d11/f vO_c
TMPPIPE=/tmp/multiop_open_wait_pipe.5368
/home/green/git/lustre-release/lustre/tests/sanityn.sh: line 273: /mnt/lustre2/d11/f: Text file busy
PASS 11 (2s)

== sanityn test 12: test lock ordering (link, stat, unlink) ========================================================== 04:51:24 (1713343884)
warning: '-runtime' deprecated, use '-t runtime' instead
- link 3170 (time 1713343894.82 total 10.00 last 316.97)
- link 8026 (time 1713343904.82 total 20.00 last 485.54)
- link 10000 (time 1713343908.37 total 23.55 last 555.67)
- link 15406 (time 1713343918.38 total 33.56 last 540.53)
- link 20000 (time 1713343926.68 total 41.86 last 552.98)
- link 24879 (time 1713343936.69 total 51.87 last 487.75)
- link 28061 (time 1713343946.69 total 61.87 last 318.06)
- link 30000 (time 1713343953.04 total 68.22 last 305.55)
- link 32936 (time 1713343963.04 total 78.22 last 293.50)
- link 35997 (time 1713343973.04 total 88.22 last 306.06)
- link 39339 (time 1713343983.04 total 98.23 last 334.08)
total: 39889 link in 100.00 seconds: 398.88 ops/second
using seed 4228238509
running for 100 seconds
- stat 3207 (time 1713343896 ; total 11 ; last 11)
- stat 10000 (time 1713343902 ; total 17 ; last 6)
- stat 20000 (time 1713343911 ; total 26 ; last 9)
- stat 30000 (time 1713343920 ; total 35 ; last 9)
- stat 40000 (time 1713343930 ; total 45 ; last 10)
- stat 50000 (time 1713343939 ; total 54 ; last 9)
- stat 60000 (time 1713343947 ; total 62 ; last 8)
- stat 70000 (time 1713343956 ; total 71 ; last 9)
- stat 80000 (time 1713343965 ; total 80 ; last 9)
- stat 90000 (time 1713343974 ; total 89 ; last 9)
- stat 100000 (time 1713343981 ; total 96 ; last 7)
total: 103699 stats in 100 seconds: 1036.989990 stats/second
- unlinked 0 (time 1713343935 ; total 0 ; last 0)
- unlinked 10000 (time 1713343987 ; total 52 ; last 52)
- unlinked 20000 (time 1713344010 ; total 75 ; last 23)
- unlinked 30000 (time 1713344032 ; total 97 ; last 22)
unlink(/mnt/lustre2/lockdir/lockfile39889) error: No such file or directory
total: 39888 unlinks in 119 seconds: 335.193268 unlinks/second
/home/green/git/lustre-release/lustre/tests/lockorder.sh: line 77: kill: (18948) - No such process
/home/green/git/lustre-release/lustre/tests/lockorder.sh: line 78: kill: (18951) - No such process
PASS 12 (171s)

== sanityn test 14aa: execution of file open for write returns -ETXTBSY ========================================================== 04:54:16 (1713344056)
striped dir -i0 -c2 -H fnv_1a_64 /mnt/lustre/d14aa.sanityn
multiop /mnt/lustre/d14aa.sanityn/f14aa.sanityn vOw_c
TMPPIPE=/tmp/multiop_open_wait_pipe.5368
/home/green/git/lustre-release/lustre/tests/sanityn.sh: line 308: /mnt/lustre2/d14aa.sanityn/f14aa.sanityn: Text file busy
/home/green/git/lustre-release/lustre/tests/test-framework.sh: line 4261: 19732 Terminated $MULTIOP_PROG $FILE v$ARGS > $TMPPIPE (wd: ~)
PASS 14aa (2s)

== sanityn test 14ab: open(RDWR) of executing file returns -ETXTBSY ========================================================== 04:54:17 (1713344057)
striped dir -i0 -c2 -H all_char /mnt/lustre/d14ab.sanityn
open(O_RDWR|O_CREAT): Text file busy
/home/green/git/lustre-release/lustre/tests/test-framework.sh: line 4261: 20374 Terminated $DIR1/$tdir/sleep 60 (wd: ~)
PASS 14ab (1s)

== sanityn test 14b: truncate of executing file returns -ETXTBSY ================================================================ 04:54:18 (1713344058)
striped dir -i0 -c2 -H crush /mnt/lustre/d14b.sanityn
truncate: cannot truncate '/mnt/lustre2/d14b.sanityn/sleep' to length 60: Text file busy
/home/green/git/lustre-release/lustre/tests/sanityn.sh: line 323: 21012 Terminated $DIR1/$tdir/sleep 60
PASS 14b (2s)

== sanityn test 14c: open(O_TRUNC) of executing file return -ETXTBSY ============================================================ 04:54:20 (1713344060)
striped dir -i0 -c2 -H crush /mnt/lustre/d14c.sanityn
cp: cannot create regular file '/mnt/lustre2/d14c.sanityn/sleep': Text file busy
/home/green/git/lustre-release/lustre/tests/sanityn.sh: line 335: 21653 Terminated $DIR1/$tdir/sleep 60
PASS 14c (1s)

== sanityn test 14d: chmod of executing file is still possible ================================================================== 04:54:21 (1713344061)
striped dir -i0 -c2 -H crush /mnt/lustre/d14d.sanityn
chmod
/home/green/git/lustre-release/lustre/tests/sanityn.sh: line 346: 22294 Terminated $DIR1/$tdir/sleep 60
PASS 14d (2s)

== sanityn test 17: resource creation/LVB creation race ========================================================================= 04:54:23 (1713344063)
fail_loc=0x8000030a
PASS 17 (4s)

== sanityn test 19: test concurrent uncached read races ========================================================================= 04:54:27 (1713344067)
32+0 records in
32+0 records out
16777216 bytes (17 MB) copied, 0.147874 s, 113 MB/s
loop 5
loop 10
loop 15
loop 20
PASS 19 (6s)

== sanityn test 20: test extra readahead page left in cache ============================================================== 04:54:33 (1713344073)
striped dir -i0 -c2 -H crush /mnt/lustre/d20.sanityn
PASS 20 (2s)

== sanityn test 23: others should see updated atime while another read============================================================== 04:54:35 (1713344075)
now is 1713344075
starting reads
multiop /mnt/lustre/f23.sanityn vor20_c
TMPPIPE=/tmp/multiop_open_wait_pipe.5368
new atime is 1713344136
PASS 23 (63s)

== sanityn test 27: align non-overlapping extent locks from request ============================================================= 04:55:38 (1713344138)
4+0 records in
4+0 records out
16793600 bytes (17 MB) copied, 0.159005 s, 106 MB/s
dd 1 started
dd 2 started
1+0 records in
1+0 records out
15728640 bytes (16 MB) copied, 0.151288 s, 104 MB/s
1+0 records in
1+0 records out
8192 bytes (8.2 kB) copied, 0.00459529 s, 1.8 MB/s
dd 3 finished
PASS 27 (4s)

== sanityn test 39a: test from 11063 ============================================================================================ 04:55:42 (1713344142)
repeat after cancel_lru_locks
PASS 39a (1s)

== sanityn test 39b: 11063 problem 1 ============================================================================================ 04:55:43 (1713344143)
repeat after cancel_lru_locks
PASS 39b (3s)

== sanityn test 39c: check truncate mtime update ================================================================================ 04:55:46 (1713344146)
repeat after cancel_lru_locks
PASS 39c (2s)

== sanityn test 39d: sync write should update mtime ====== 04:55:48 (1713344148)
fail_loc=0x411
fail_loc=0
PASS 39d (2s)

== sanityn test 51a: layout lock: refresh layout should work ========================================================== 04:55:50 (1713344150)
0+1 records in
0+1 records out
159 bytes (159 B) copied, 0.00404054 s, 39.4 kB/s
/home/green/git/lustre-release/lustre/tests/sanityn.sh: line 3175: kill: (30515) - No such process
PASS 51a (3s)

== sanityn test 51c: layout lock: IT_LAYOUT blocked and correct layout can be returned ========================================================== 04:55:53 (1713344153)
fail_loc=0x172
Setting layout to have 2 stripes ...
1+0 records in
1+0 records out
1024 bytes (1.0 kB) copied, 0.0074099 s, 138 kB/s
PASS 51c (6s)

== sanityn test 51d: layout lock: losing layout lock should clean up memory map region ========================================================== 04:55:59 (1713344159)
1+0 records in
1+0 records out
1048576 bytes (1.0 MB) copied, 0.0102486 s, 102 MB/s
Before revoking layout lock: 1024 KB mapped
PASS 51d (3s)

== sanityn test 107a: Basic grouplock conflict =========== 04:56:03 (1713344163)
10+0 records in
10+0 records out
10485760 bytes (10 MB) copied, 0.322886 s, 32.5 MB/s
/mnt/lustre/f107a.sanityn
  lcm_layout_gen: 3
  lcm_mirror_count: 1
  lcm_entry_count: 2
    lcme_id: 1
    lcme_mirror_id: 0
    lcme_flags: init
    lcme_extent.e_start: 0
    lcme_extent.e_end: 1048576
      lmm_stripe_count: 0
      lmm_stripe_size: 1048576
      lmm_pattern: mdt
      lmm_layout_gen: 0
      lmm_stripe_offset: 0
    lcme_id: 2
    lcme_mirror_id: 0
    lcme_flags: init
    lcme_extent.e_start: 1048576
    lcme_extent.e_end: EOF
      lmm_stripe_count: 1
      lmm_stripe_size: 1048576
      lmm_pattern: raid0
      lmm_layout_gen: 0
      lmm_stripe_offset: 0
      lmm_objects:
      - 0: { l_ost_idx: 0, l_fid: [0x100000000:0x1fc:0x0] }
multiop /mnt/lustre/f107a.sanityn vOG14091995_g14091995c
TMPPIPE=/tmp/multiop_open_wait_pipe.5368
multiop /mnt/lustre2/f107a.sanityn vO_G16022000r10g16022000c
TMPPIPE=/tmp/multiop_open_wait_pipe.5368
First grouplock blocks second one
PASS 107a (6s)

== sanityn test 107b: Grouplock is added to the head of waiting list ========================================================== 04:56:10 (1713344170)
10+0 records in
10+0 records out
10485760 bytes (10 MB) copied, 0.266152 s, 39.4 MB/s
/mnt/lustre/f107b.sanityn
  lcm_layout_gen: 3
  lcm_mirror_count: 1
  lcm_entry_count: 2
    lcme_id: 1
    lcme_mirror_id: 0
    lcme_flags: init
    lcme_extent.e_start: 0
    lcme_extent.e_end: 1048576
      lmm_stripe_count: 0
      lmm_stripe_size: 1048576
      lmm_pattern: mdt
      lmm_layout_gen: 0
      lmm_stripe_offset: 0
    lcme_id: 2
    lcme_mirror_id: 0
    lcme_flags: init
    lcme_extent.e_start: 1048576
    lcme_extent.e_end: EOF
      lmm_stripe_count: 1
      lmm_stripe_size: 1048576
      lmm_pattern: raid0
      lmm_layout_gen: 0
      lmm_stripe_offset: 1
      lmm_objects:
      - 0: { l_ost_idx: 1, l_fid: [0x100010000:0x1fa:0x0] }
multiop /mnt/lustre/f107b.sanityn vOG14091995_g14091995c
TMPPIPE=/tmp/multiop_open_wait_pipe.5368
Grouplock blocks IO
First grouplock blocks second one
Second grouplock blocks IO
PASS 107b (11s)
cleanup: ======================================================
== sanityn test complete, duration 336 sec =============== 04:56:22 (1713344182)
Stopping clients: oleg241-client.virtnet /mnt/lustre2 (opts:)
Stopping client oleg241-client.virtnet /mnt/lustre2 opts: