== sanity-dom test sanityn: Run sanityn with Data-on-MDT files ========================================================== 17:07:07 (1713388027)
excepting tests: 28 102
skipping tests SLOW=no: 33a
Starting client oleg248-client.virtnet: -o user_xattr,flock oleg248-server@tcp:/lustre /mnt/lustre2
Started clients oleg248-client.virtnet:
192.168.202.148@tcp:/lustre on /mnt/lustre2 type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt)
oleg248-client.virtnet: executing check_config_client /mnt/lustre
oleg248-client.virtnet: Checking config lustre mounted on /mnt/lustre
Checking servers environments
Checking clients oleg248-client.virtnet environments
Using TIMEOUT=20
osc.lustre-OST0000-osc-ffff8800b3ca7000.idle_timeout=debug
osc.lustre-OST0000-osc-ffff88012bfb1000.idle_timeout=debug
osc.lustre-OST0001-osc-ffff8800b3ca7000.idle_timeout=debug
osc.lustre-OST0001-osc-ffff88012bfb1000.idle_timeout=debug
disable quota as required
oleg248-server: oleg248-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all 8
1+0 records in
1+0 records out
1048576 bytes (1.0 MB) copied, 0.019648 s, 53.4 MB/s
running as uid/gid/euid/egid 500/500/500/500, groups: [true]
running as uid/gid/euid/egid 500/500/500/500, groups: [touch] [/mnt/lustre/d0_runas_test/f31252]

== sanityn test 1: Check attribute updates on 2 mount points ========================================================== 17:07:21 (1713388041)
/mnt/lustre/f1.sanityn has type file OK
/mnt/lustre/f1.sanityn has perms 0777 OK
/mnt/lustre/f1.sanityn has type file OK
/mnt/lustre/f1.sanityn has perms 0666 OK
/mnt/lustre/f1.sanityn: absent OK
PASS 1 (2s)

== sanityn test 2a: check cached attribute updates on 2 mtpt's ================================================================== 17:07:23 (1713388043)
-rw-r--r-- 1 root root 0 Apr 17 17:07 /mnt/lustre2/f2a
/mnt/lustre/f2a has type file OK
/mnt/lustre/f2a has perms 0777 OK
PASS 2a (2s)

== sanityn test 2b: check cached attribute updates on 2 mtpt's ================================================================== 17:07:25 (1713388045)
-rw-r--r-- 1 root root 0 Apr 17 17:07 /mnt/lustre2/f2b
/mnt/lustre2/f2b has type file OK
/mnt/lustre2/f2b has perms 0777 OK
PASS 2b (2s)

== sanityn test 2c: check cached attribute updates on 2 mtpt's root ============================================================= 17:07:27 (1713388047)
/mnt/lustre2 has type dir OK
/mnt/lustre2 has perms 0777 OK
PASS 2c (2s)

== sanityn test 2d: check cached attribute updates on 2 mtpt's root ============================================================= 17:07:29 (1713388049)
/mnt/lustre2 has type dir OK
/mnt/lustre2 has perms 0755 OK
PASS 2d (2s)

== sanityn test 2e: check chmod on root is propagated to others ========================================================== 17:07:32 (1713388052)
total 331
-rwxrwxrwx 1 root root 0 Apr 17 17:07 f2a
-rwxrwxrwx 1 root root 0 Apr 17 17:07 f2b
-rw-r--r-- 1 root root 1388728 Apr 17 17:02 ffsx.sanity-dom
-rw-r--r-- 1 root root 0 Apr 17 17:01 ffsx.sanity-dom.fsxgood
-rw-r--r-- 1 root root 80774 Apr 17 17:02 ffsx.sanity-dom.fsxlog
total 331
-rwxrwxrwx 1 root root 0 Apr 17 17:07 f2a
-rwxrwxrwx 1 root root 0 Apr 17 17:07 f2b
-rw-r--r-- 1 root root 1388728 Apr 17 17:02 ffsx.sanity-dom
-rw-r--r-- 1 root root 0 Apr 17 17:01 ffsx.sanity-dom.fsxgood
-rw-r--r-- 1 root root 80774 Apr 17 17:02 ffsx.sanity-dom.fsxlog
running as uid/gid/euid/egid 500/500/500/500, groups: [dd] [if=/dev/zero] [of=/mnt/lustre2/f2e.sanityn] [count=1]
1+0 records in
1+0 records out
512 bytes (512 B) copied, 0.00285649 s, 179 kB/s
PASS 2e (3s)

== sanityn test 2f: check attr/owner updates on DNE with 2 mtpt's ========================================================== 17:07:34 (1713388054)
SKIP: sanityn test_2f needs >= 2 MDTs
SKIP 2f (1s)

== sanityn test 2g: check blocks update on sync write ==== 17:07:35 (1713388055)
2+0 records in
2+0 records out
2097152 bytes (2.1 MB) copied, 0.157147 s, 13.3 MB/s
/mnt/lustre/f2g.sanityn has 4109 blocks
/mnt/lustre2/f2g.sanityn has 4109 blocks
PASS 2g (2s)

== sanityn test 4: fstat validation on multiple mount points ==================================================================== 17:07:37 (1713388057)
PASS 4 (3s)

== sanityn test 5: create a file on one mount, truncate it on the other ========================================================== 17:07:40 (1713388060)
/mnt/lustre/f5 has type file OK
/mnt/lustre/f5 has size 100 OK
PASS 5 (1s)

== sanityn test 6: remove of open file on other node ============================================================================ 17:07:41 (1713388061)
opening
writing
unlinking /mnt/lustre2/f6.sanityn
accessing (1)
seeking (1)
accessing (2)
fstat...
reading
comparing data
truncating
seeking (2)
writing again
seeking (3)
reading again
comparing data again
closing
SUCCESS - goto beer
PASS 6 (2s)

== sanityn test 7: remove of open directory on other node ======================================================================= 17:07:43 (1713388063)
creating directory /mnt/lustre/d7
opening directory
unlinking /mnt/lustre/d7
Ok, everything goes well.
PASS 7 (2s)

== sanityn test 8: remove of open special file on other node ==================================================================== 17:07:45 (1713388065)
creating special file /mnt/lustre/f8.sanityn
opening file
unlinking /mnt/lustre/f8.sanityn
Ok, everything goes well.
PASS 8 (1s)

== sanityn test 9a: append of file with sub-page size on multiple mounts ========================================================== 17:07:46 (1713388066)
PASS 9a (3s)

== sanityn test 9b: append to striped sparse file ======== 17:07:49 (1713388069)
1+0 records in
1+0 records out
1048576 bytes (1.0 MB) copied, 0.0125974 s, 83.2 MB/s
3+0 records in
3+0 records out
3 bytes (3 B) copied, 0.0183526 s, 0.2 kB/s
Data read (expecting 'foo'): foo
PASS 9b (2s)

== sanityn test 10a: write of file with sub-page size on multiple mounts ========================================================== 17:07:51 (1713388071)
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00106454 s, 0.9 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.0020048 s, 0.5 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00249687 s, 0.4 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00130127 s, 0.8 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00235077 s, 0.4 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00177397 s, 0.6 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00253248 s, 0.4 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00235458 s, 0.4 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00382134 s, 0.3 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00277089 s, 0.4 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00353045 s, 0.3 kB/s
1+0 records in
1+0 records out
1 byte (1 B) copied, 0.00258213 s, 0.4 kB/s
PASS 10a (2s)

== sanityn test 10b: write of file with sub-page size on multiple mounts ========================================================== 17:07:53 (1713388073)
1+0 records in
1+0 records out
3072 bytes (3.1 kB) copied, 0.0021846 s, 1.4 MB/s
1+0 records in
1+0 records out
4096 bytes (4.1 kB) copied, 0.011872 s, 345 kB/s
1+0 records in
1+0 records out
3072 bytes (3.1 kB) copied, 0.000283606 s, 10.8 MB/s
PASS 10b (2s)

== sanityn test 11: execution of file opened for write should return error ============================================================== 17:07:55 (1713388075)
multiop /mnt/lustre/d11/f vO_c
TMPPIPE=/tmp/multiop_open_wait_pipe.31252
/home/green/git/lustre-release/lustre/tests/sanityn.sh: line 273: /mnt/lustre2/d11/f: Text file busy
PASS 11 (2s)

== sanityn test 12: test lock ordering (link, stat, unlink) ========================================================== 17:07:57 (1713388077)
warning: '-runtime' deprecated, use '-t runtime' instead
- link 3173 (time 1713388088.36 total 10.00 last 317.30)
- link 9019 (time 1713388098.36 total 20.00 last 584.51)
- link 15059 (time 1713388108.36 total 30.00 last 603.97)
- link 20000 (time 1713388116.53 total 38.18 last 604.43)
- link 26055 (time 1713388126.53 total 48.18 last 605.50)
- link 30000 (time 1713388135.96 total 57.60 last 418.47)
- link 33506 (time 1713388145.96 total 67.61 last 350.55)
- link 36181 (time 1713388155.96 total 77.61 last 267.50)
- link 39593 (time 1713388165.96 total 87.61 last 341.19)
- link 43081 (time 1713388175.96 total 97.61 last 348.78)
total: 43937 link in 100.00 seconds: 439.36 ops/second
using seed 3343540005
running for 100 seconds
- stat 4011 (time 1713388090 ; total 11 ; last 11)
- stat 10000 (time 1713388095 ; total 16 ; last 5)
- stat 20000 (time 1713388103 ; total 24 ; last 8)
- stat 30000 (time 1713388112 ; total 33 ; last 9)
- stat 40000 (time 1713388120 ; total 41 ; last 8)
- stat 50000 (time 1713388128 ; total 49 ; last 8)
- stat 60000 (time 1713388137 ; total 58 ; last 9)
- stat 70000 (time 1713388146 ; total 67 ; last 9)
- stat 80000 (time 1713388156 ; total 77 ; last 10)
- stat 90000 (time 1713388164 ; total 85 ; last 8)
- stat 100000 (time 1713388173 ; total 94 ; last 9)
total: 106532 stats in 100 seconds: 1065.319946 stats/second
- unlinked 0 (time 1713388129 ; total 0 ; last 0)
- unlinked 10000 (time 1713388179 ; total 50 ; last 50)
- unlinked 20000 (time 1713388222 ; total 93 ; last 43)
- unlinked 30000 (time 1713388266 ; total 137 ; last 44)
- unlinked 40000 (time 1713388307 ; total 178 ; last 41)
unlink(/mnt/lustre2/lockdir/lockfile43937) error: No such file or directory
total: 43936 unlinks in 194 seconds: 226.474228 unlinks/second
/home/green/git/lustre-release/lustre/tests/lockorder.sh: line 77: kill: (11762) - No such process
/home/green/git/lustre-release/lustre/tests/lockorder.sh: line 78: kill: (11765) - No such process
PASS 12 (248s)

== sanityn test 14aa: execution of file open for write returns -ETXTBSY ========================================================== 17:12:05 (1713388325)
multiop /mnt/lustre/d14aa.sanityn/f14aa.sanityn vOw_c
TMPPIPE=/tmp/multiop_open_wait_pipe.31252
/home/green/git/lustre-release/lustre/tests/sanityn.sh: line 308: /mnt/lustre2/d14aa.sanityn/f14aa.sanityn: Text file busy
/home/green/git/lustre-release/lustre/tests/test-framework.sh: line 4261: 12591 Terminated $MULTIOP_PROG $FILE v$ARGS > $TMPPIPE (wd: ~)
PASS 14aa (2s)

== sanityn test 14ab: open(RDWR) of executing file returns -ETXTBSY ========================================================== 17:12:07 (1713388327)
open(O_RDWR|O_CREAT): Text file busy
/home/green/git/lustre-release/lustre/tests/test-framework.sh: line 4261: 13221 Terminated $DIR1/$tdir/sleep 60 (wd: ~)
PASS 14ab (2s)

== sanityn test 14b: truncate of executing file returns -ETXTBSY ================================================================ 17:12:09 (1713388329)
truncate: cannot truncate '/mnt/lustre2/d14b.sanityn/sleep' to length 60: Text file busy
/home/green/git/lustre-release/lustre/tests/sanityn.sh: line 323: 13862 Terminated $DIR1/$tdir/sleep 60
PASS 14b (2s)

== sanityn test 14c: open(O_TRUNC) of executing file return -ETXTBSY ============================================================ 17:12:11 (1713388331)
cp: cannot create regular file '/mnt/lustre2/d14c.sanityn/sleep': Text file busy
/home/green/git/lustre-release/lustre/tests/sanityn.sh: line 335: 14492 Terminated $DIR1/$tdir/sleep 60
PASS 14c (2s)

== sanityn test 14d: chmod of executing file is still possible ================================================================== 17:12:13 (1713388333)
chmod
/home/green/git/lustre-release/lustre/tests/sanityn.sh: line 346: 15121 Terminated $DIR1/$tdir/sleep 60
PASS 14d (3s)

== sanityn test 17: resource creation/LVB creation race ========================================================================= 17:12:16 (1713388336)
fail_loc=0x8000030a
PASS 17 (4s)

== sanityn test 19: test concurrent uncached read races ========================================================================= 17:12:20 (1713388340)
oleg248-server: error: get_param: param_path 'osd-*/lustre-MDT*/read_cache_enable': No such file or directory
pdsh@oleg248-client: oleg248-server: ssh exited with exit code 2
SKIP: sanityn test_19 not cache-capable obdfilter
SKIP 19 (2s)

== sanityn test 20: test extra readahead page left in cache ============================================================== 17:12:22 (1713388342)
PASS 20 (2s)

== sanityn test 23: others should see updated atime while another read============================================================== 17:12:24 (1713388344)
now is 1713388345
starting reads
multiop /mnt/lustre/f23.sanityn vor20_c
TMPPIPE=/tmp/multiop_open_wait_pipe.31252
new atime is 1713388406
PASS 23 (64s)

== sanityn test 27: align non-overlapping extent locks from request ============================================================= 17:13:28 (1713388408)
4+0 records in
4+0 records out
16793600 bytes (17 MB) copied, 0.189555 s, 88.6 MB/s
dd 1 started
dd 2 started
1+0 records in
1+0 records out
15728640 bytes (16 MB) copied, 0.187775 s, 83.8 MB/s
1+0 records in
1+0 records out
8192 bytes (8.2 kB) copied, 0.00692529 s, 1.2 MB/s
dd 3 finished
PASS 27 (5s)

== sanityn test 39a: test from 11063 ============================================================================================ 17:13:33 (1713388413)
repeat after cancel_lru_locks
PASS 39a (2s)

== sanityn test 39b: 11063 problem 1 ============================================================================================ 17:13:35 (1713388415)
repeat after cancel_lru_locks
PASS 39b (4s)

== sanityn test 39c: check truncate mtime update ================================================================================ 17:13:39 (1713388419)
repeat after cancel_lru_locks
PASS 39c (3s)

== sanityn test 39d: sync write should update mtime ====== 17:13:42 (1713388422)
fail_loc=0x411
fail_loc=0
PASS 39d (2s)

== sanityn test 51a: layout lock: refresh layout should work ========================================================== 17:13:44 (1713388424)
0+1 records in
0+1 records out
159 bytes (159 B) copied, 0.00477062 s, 33.3 kB/s
/home/green/git/lustre-release/lustre/tests/sanityn.sh: line 3175: kill: (22053) - No such process
PASS 51a (4s)

== sanityn test 51c: layout lock: IT_LAYOUT blocked and correct layout can be returned ========================================================== 17:13:48 (1713388428)
fail_loc=0x172
Setting layout to have 2 stripes ...
1+0 records in
1+0 records out
1024 bytes (1.0 kB) copied, 0.00617793 s, 166 kB/s
PASS 51c (7s)

== sanityn test 51d: layout lock: losing layout lock should clean up memory map region ========================================================== 17:13:55 (1713388435)
1+0 records in
1+0 records out
1048576 bytes (1.0 MB) copied, 0.0125016 s, 83.9 MB/s
Before revoking layout lock: 1024 KB mapped
PASS 51d (3s)

== sanityn test 107a: Basic grouplock conflict =========== 17:13:58 (1713388438)
10+0 records in
10+0 records out
10485760 bytes (10 MB) copied, 0.112218 s, 93.4 MB/s
/mnt/lustre/f107a.sanityn
  lcm_layout_gen: 3
  lcm_mirror_count: 1
  lcm_entry_count: 2
    lcme_id: 1
    lcme_mirror_id: 0
    lcme_flags: init
    lcme_extent.e_start: 0
    lcme_extent.e_end: 1048576
      lmm_stripe_count: 0
      lmm_stripe_size: 1048576
      lmm_pattern: mdt
      lmm_layout_gen: 0
      lmm_stripe_offset: 0
    lcme_id: 2
    lcme_mirror_id: 0
    lcme_flags: init
    lcme_extent.e_start: 1048576
    lcme_extent.e_end: EOF
      lmm_stripe_count: 1
      lmm_stripe_size: 1048576
      lmm_pattern: raid0
      lmm_layout_gen: 0
      lmm_stripe_offset: 0
      lmm_objects:
      - 0: { l_ost_idx: 0, l_fid: [0x100000000:0x3dc:0x0] }
multiop /mnt/lustre/f107a.sanityn vOG14091995_g14091995c
TMPPIPE=/tmp/multiop_open_wait_pipe.31252
multiop /mnt/lustre2/f107a.sanityn vO_G16022000r10g16022000c
TMPPIPE=/tmp/multiop_open_wait_pipe.31252
First grouplock blocks second one
PASS 107a (4s)

== sanityn test 107b: Grouplock is added to the head of waiting list ========================================================== 17:14:02 (1713388442)
10+0 records in
10+0 records out
10485760 bytes (10 MB) copied, 0.152375 s, 68.8 MB/s
/mnt/lustre/f107b.sanityn
  lcm_layout_gen: 3
  lcm_mirror_count: 1
  lcm_entry_count: 2
    lcme_id: 1
    lcme_mirror_id: 0
    lcme_flags: init
    lcme_extent.e_start: 0
    lcme_extent.e_end: 1048576
      lmm_stripe_count: 0
      lmm_stripe_size: 1048576
      lmm_pattern: mdt
      lmm_layout_gen: 0
      lmm_stripe_offset: 0
    lcme_id: 2
    lcme_mirror_id: 0
    lcme_flags: init
    lcme_extent.e_start: 1048576
    lcme_extent.e_end: EOF
      lmm_stripe_count: 1
      lmm_stripe_size: 1048576
      lmm_pattern: raid0
      lmm_layout_gen: 0
      lmm_stripe_offset: 1
      lmm_objects:
      - 0: { l_ost_idx: 1, l_fid: [0x100010000:0x3db:0x0] }
multiop /mnt/lustre/f107b.sanityn vOG14091995_g14091995c
TMPPIPE=/tmp/multiop_open_wait_pipe.31252
Grouplock blocks IO
First grouplock blocks second one
Second grouplock blocks IO
PASS 107b (9s)

cleanup: ======================================================
== sanityn test complete, duration 423 sec =============== 17:14:11 (1713388451)
Stopping clients: oleg248-client.virtnet /mnt/lustre2 (opts:)
Stopping client oleg248-client.virtnet /mnt/lustre2 opts: