[ 0.000000] Initializing cgroup subsys cpuset [ 0.000000] Initializing cgroup subsys cpu [ 0.000000] Initializing cgroup subsys cpuacct [ 0.000000] Linux version 3.10.0-7.9-debug (green@centos7-base) (gcc version 4.8.5 20150623 (Red Hat 4.8.5-44) (GCC) ) #1 SMP Sat Mar 26 23:28:42 EDT 2022 [ 0.000000] Command line: rd.shell root=nbd:192.168.200.253:centos7:ext4:ro:-p,-b4096 ro crashkernel=128M panic=1 nomodeset ipmtu=9000 ip=dhcp rd.neednet=1 init_on_free=off mitigations=off console=ttyS1,115200 audit=0 [ 0.000000] e820: BIOS-provided physical RAM map: [ 0.000000] BIOS-e820: [mem 0x0000000000000000-0x000000000009fbff] usable [ 0.000000] BIOS-e820: [mem 0x000000000009fc00-0x000000000009ffff] reserved [ 0.000000] BIOS-e820: [mem 0x00000000000f0000-0x00000000000fffff] reserved [ 0.000000] BIOS-e820: [mem 0x0000000000100000-0x00000000bffcdfff] usable [ 0.000000] BIOS-e820: [mem 0x00000000bffce000-0x00000000bfffffff] reserved [ 0.000000] BIOS-e820: [mem 0x00000000feffc000-0x00000000feffffff] reserved [ 0.000000] BIOS-e820: [mem 0x00000000fffc0000-0x00000000ffffffff] reserved [ 0.000000] BIOS-e820: [mem 0x0000000100000000-0x000000013edfffff] usable [ 0.000000] NX (Execute Disable) protection: active [ 0.000000] SMBIOS 3.0.0 present. [ 0.000000] DMI: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.16.3-1.fc39 04/01/2014 [ 0.000000] Hypervisor detected: KVM [ 0.000000] e820: last_pfn = 0x13ee00 max_arch_pfn = 0x400000000 [ 0.000000] PAT configuration [0-7]: WB WC UC- UC WB WP UC- UC [ 0.000000] e820: last_pfn = 0xbffce max_arch_pfn = 0x400000000 [ 0.000000] found SMP MP-table at [mem 0x000f53f0-0x000f53ff] mapped at [ffffffffff2003f0] [ 0.000000] Using GB pages for direct mapping [ 0.000000] RAMDISK: [mem 0xbc2e2000-0xbffbffff] [ 0.000000] Early table checksum verification disabled [ 0.000000] ACPI: RSDP 00000000000f5200 00014 (v00 BOCHS ) [ 0.000000] ACPI: RSDT 00000000bffe1d87 00034 (v01 BOCHS BXPC 00000001 BXPC 00000001) [ 0.000000] ACPI: FACP 00000000bffe1c23 00074 (v01 BOCHS BXPC 00000001 BXPC 00000001) [ 0.000000] ACPI: DSDT 00000000bffe0040 01BE3 (v01 BOCHS BXPC 00000001 BXPC 00000001) [ 0.000000] ACPI: FACS 00000000bffe0000 00040 [ 0.000000] ACPI: APIC 00000000bffe1c97 00090 (v03 BOCHS BXPC 00000001 BXPC 00000001) [ 0.000000] ACPI: HPET 00000000bffe1d27 00038 (v01 BOCHS BXPC 00000001 BXPC 00000001) [ 0.000000] ACPI: WAET 00000000bffe1d5f 00028 (v01 BOCHS BXPC 00000001 BXPC 00000001) [ 0.000000] No NUMA configuration found [ 0.000000] Faking a node at [mem 0x0000000000000000-0x000000013edfffff] [ 0.000000] NODE_DATA(0) allocated [mem 0x13e5e3000-0x13e609fff] [ 0.000000] Reserving 128MB of memory at 768MB for crashkernel (System RAM: 4077MB) [ 0.000000] kvm-clock: cpu 0, msr 1:3e592001, primary cpu clock [ 0.000000] kvm-clock: Using msrs 4b564d01 and 4b564d00 [ 0.000000] kvm-clock: using sched offset of 321303219 cycles [ 0.000000] Zone ranges: [ 0.000000] DMA [mem 0x00001000-0x00ffffff] [ 0.000000] DMA32 [mem 0x01000000-0xffffffff] [ 0.000000] Normal [mem 0x100000000-0x13edfffff] [ 0.000000] Movable zone start for each node [ 0.000000] Early memory node ranges [ 0.000000] node 0: [mem 0x00001000-0x0009efff] [ 0.000000] node 0: [mem 0x00100000-0xbffcdfff] [ 0.000000] node 0: [mem 0x100000000-0x13edfffff] [ 0.000000] Initmem setup node 0 [mem 0x00001000-0x13edfffff] [ 0.000000] ACPI: PM-Timer IO Port: 0x608 [ 0.000000] ACPI: LAPIC (acpi_id[0x00] lapic_id[0x00] enabled) [ 0.000000] ACPI: LAPIC (acpi_id[0x01] lapic_id[0x01] enabled) [ 0.000000] ACPI: LAPIC (acpi_id[0x02] lapic_id[0x02] enabled) [ 0.000000] ACPI: LAPIC (acpi_id[0x03] lapic_id[0x03] enabled) [ 0.000000] ACPI: LAPIC_NMI (acpi_id[0xff] dfl dfl lint[0x1]) [ 0.000000] ACPI: IOAPIC (id[0x00] address[0xfec00000] gsi_base[0]) [ 0.000000] IOAPIC[0]: apic_id 0, version 17, address 0xfec00000, GSI 0-23 [ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl) [ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 5 global_irq 5 high level) [ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 high level) [ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 10 global_irq 10 high level) [ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 11 global_irq 11 high level) [ 0.000000] Using ACPI (MADT) for SMP configuration information [ 0.000000] ACPI: HPET id: 0x8086a201 base: 0xfed00000 [ 0.000000] smpboot: Allowing 4 CPUs, 0 hotplug CPUs [ 0.000000] PM: Registered nosave memory: [mem 0x0009f000-0x0009ffff] [ 0.000000] PM: Registered nosave memory: [mem 0x000a0000-0x000effff] [ 0.000000] PM: Registered nosave memory: [mem 0x000f0000-0x000fffff] [ 0.000000] PM: Registered nosave memory: [mem 0xbffce000-0xbfffffff] [ 0.000000] PM: Registered nosave memory: [mem 0xc0000000-0xfeffbfff] [ 0.000000] PM: Registered nosave memory: [mem 0xfeffc000-0xfeffffff] [ 0.000000] PM: Registered nosave memory: [mem 0xff000000-0xfffbffff] [ 0.000000] PM: Registered nosave memory: [mem 0xfffc0000-0xffffffff] [ 0.000000] e820: [mem 0xc0000000-0xfeffbfff] available for PCI devices [ 0.000000] Booting paravirtualized kernel on KVM [ 0.000000] setup_percpu: NR_CPUS:5120 nr_cpumask_bits:4 nr_cpu_ids:4 nr_node_ids:1 [ 0.000000] percpu: Embedded 38 pages/cpu s115176 r8192 d32280 u524288 [ 0.000000] KVM setup async PF for cpu 0 [ 0.000000] kvm-stealtime: cpu 0, msr 13e2135c0 [ 0.000000] PV qspinlock hash table entries: 256 (order: 0, 4096 bytes) [ 0.000000] Built 1 zonelists in Node order, mobility grouping on. Total pages: 1027487 [ 0.000000] Policy zone: Normal [ 0.000000] Kernel command line: rd.shell root=nbd:192.168.200.253:centos7:ext4:ro:-p,-b4096 ro crashkernel=128M panic=1 nomodeset ipmtu=9000 ip=dhcp rd.neednet=1 init_on_free=off mitigations=off console=ttyS1,115200 audit=0 [ 0.000000] audit: disabled (until reboot) [ 0.000000] PID hash table entries: 4096 (order: 3, 32768 bytes) [ 0.000000] x86/fpu: xstate_offset[2]: 0240, xstate_sizes[2]: 0100 [ 0.000000] xsave: enabled xstate_bv 0x7, cntxt size 0x340 using standard form [ 0.000000] Memory: 3820268k/5224448k available (8172k kernel code, 1049168k absent, 355012k reserved, 5773k data, 2532k init) [ 0.000000] SLUB: HWalign=64, Order=0-3, MinObjects=0, CPUs=4, Nodes=1 [ 0.000000] Hierarchical RCU implementation. [ 0.000000] RCU restricting CPUs from NR_CPUS=5120 to nr_cpu_ids=4. [ 0.000000] Offload RCU callbacks from all CPUs [ 0.000000] Offload RCU callbacks from CPUs: 0-3. [ 0.000000] NR_IRQS:327936 nr_irqs:456 0 [ 0.000000] Console: colour *CGA 80x25 [ 0.000000] console [ttyS1] enabled [ 0.000000] allocated 25165824 bytes of page_cgroup [ 0.000000] please try 'cgroup_disable=memory' option if you don't want memory cgroups [ 0.000000] kmemleak: Kernel memory leak detector disabled [ 0.000000] tsc: Detected 2399.998 MHz processor [ 0.442428] Calibrating delay loop (skipped) preset value.. 4799.99 BogoMIPS (lpj=2399998) [ 0.445173] pid_max: default: 32768 minimum: 301 [ 0.447342] Security Framework initialized [ 0.448923] SELinux: Initializing. [ 0.451918] Dentry cache hash table entries: 524288 (order: 10, 4194304 bytes) [ 0.456456] Inode-cache hash table entries: 262144 (order: 9, 2097152 bytes) [ 0.461253] Mount-cache hash table entries: 8192 (order: 4, 65536 bytes) [ 0.463977] Mountpoint-cache hash table entries: 8192 (order: 4, 65536 bytes) [ 0.466063] Initializing cgroup subsys memory [ 0.467331] Initializing cgroup subsys devices [ 0.469012] Initializing cgroup subsys freezer [ 0.470681] Initializing cgroup subsys net_cls [ 0.472303] Initializing cgroup subsys blkio [ 0.474138] Initializing cgroup subsys perf_event [ 0.476290] Initializing cgroup subsys hugetlb [ 0.477996] Initializing cgroup subsys pids [ 0.479314] Initializing cgroup subsys net_prio [ 0.481210] x86/cpu: User Mode Instruction Prevention (UMIP) activated [ 0.484460] Last level iTLB entries: 4KB 0, 2MB 0, 4MB 0 [ 0.485996] Last level dTLB entries: 4KB 0, 2MB 0, 4MB 0 [ 0.487429] tlb_flushall_shift: 6 [ 0.488944] FEATURE SPEC_CTRL Present [ 0.490345] FEATURE IBPB_SUPPORT Present [ 0.491478] Spectre V2 : Enabling Indirect Branch Prediction Barrier [ 0.493636] Spectre V2 : Vulnerable [ 0.495617] Speculative Store Bypass: Vulnerable [ 0.498231] debug: unmapping init [mem 0xffffffff82019000-0xffffffff8201ffff] [ 0.506264] ACPI: Core revision 20130517 [ 0.508805] ACPI: All ACPI Tables successfully acquired [ 0.510364] ftrace: allocating 30294 entries in 119 pages [ 0.560442] Enabling x2apic [ 0.561564] Enabled x2apic [ 0.562909] Switched APIC routing to physical x2apic. [ 0.566477] ..TIMER: vector=0x30 apic1=0 pin1=2 apic2=-1 pin2=-1 [ 0.568179] smpboot: CPU0: Intel(R) Xeon(R) CPU E5-2695 v2 @ 2.40GHz (fam: 06, model: 3e, stepping: 04) [ 0.573071] Performance Events: IvyBridge events, full-width counters, Intel PMU driver. [ 0.576141] ... version: 2 [ 0.577219] ... bit width: 48 [ 0.578427] ... generic registers: 4 [ 0.579639] ... value mask: 0000ffffffffffff [ 0.581567] ... max period: 00007fffffffffff [ 0.583264] ... fixed-purpose events: 3 [ 0.584227] ... event mask: 000000070000000f [ 0.586168] KVM setup paravirtual spinlock [ 0.589776] smpboot: Booting Node 0, Processors #1[ 0.591286] kvm-clock: cpu 1, msr 1:3e592041, secondary cpu clock [ 0.601875] KVM setup async PF for cpu 1 [ 0.603150] kvm-stealtime: cpu 1, msr 13e2935c0 #2[ 0.606413] kvm-clock: cpu 2, msr 1:3e592081, secondary cpu clock [ 0.611569] KVM setup async PF for cpu 2 [ 0.612118] kvm-clock: cpu 3, msr 1:3e5920c1, secondary cpu clock #3 OK [ 0.617467] kvm-stealtime: cpu 2, msr 13e3135c0 [ 0.620043] Brought up 4 CPUs [ 0.620063] KVM setup async PF for cpu 3 [ 0.620072] kvm-stealtime: cpu 3, msr 13e3935c0 [ 0.624602] smpboot: Max logical packages: 1 [ 0.626225] smpboot: Total of 4 processors activated (19199.98 BogoMIPS) [ 0.630580] devtmpfs: initialized [ 0.632112] x86/mm: Memory block size: 128MB [ 0.637704] EVM: security.selinux [ 0.638592] EVM: security.ima [ 0.639366] EVM: security.capability [ 0.644247] atomic64 test passed for x86-64 platform with CX8 and with SSE [ 0.646691] NET: Registered protocol family 16 [ 0.648540] cpuidle: using governor haltpoll [ 0.650718] ACPI: bus type PCI registered [ 0.652127] acpiphp: ACPI Hot Plug PCI Controller Driver version: 0.5 [ 0.654753] PCI: Using configuration type 1 for base access [ 0.656852] core: PMU erratum BJ122, BV98, HSD29 worked around, HT is on [ 0.666993] ACPI: Added _OSI(Module Device) [ 0.668991] ACPI: Added _OSI(Processor Device) [ 0.670898] ACPI: Added _OSI(3.0 _SCP Extensions) [ 0.672995] ACPI: Added _OSI(Processor Aggregator Device) [ 0.675695] ACPI: Added _OSI(Linux-Dell-Video) [ 0.683082] ACPI: Interpreter enabled [ 0.684506] ACPI: (supports S0 S3 S4 S5) [ 0.686170] ACPI: Using IOAPIC for interrupt routing [ 0.687977] PCI: Using host bridge windows from ACPI; if necessary, use "pci=nocrs" and report a bug [ 0.691471] ACPI: Enabled 2 GPEs in block 00 to 0F [ 0.698892] ACPI: PCI Root Bridge [PCI0] (domain 0000 [bus 00-ff]) [ 0.700940] acpi PNP0A03:00: _OSC: OS supports [ASPM ClockPM Segments MSI] [ 0.702973] acpi PNP0A03:00: _OSC failed (AE_NOT_FOUND); disabling ASPM [ 0.705143] acpi PNP0A03:00: fail to add MMCONFIG information, can't access extended PCI configuration space under this bridge. [ 0.709538] acpiphp: Slot [2] registered [ 0.710940] acpiphp: Slot [5] registered [ 0.712084] acpiphp: Slot [6] registered [ 0.713379] acpiphp: Slot [7] registered [ 0.714784] acpiphp: Slot [8] registered [ 0.716232] acpiphp: Slot [9] registered [ 0.717562] acpiphp: Slot [10] registered [ 0.718852] acpiphp: Slot [3] registered [ 0.720155] acpiphp: Slot [4] registered [ 0.721524] acpiphp: Slot [11] registered [ 0.722892] acpiphp: Slot [12] registered [ 0.724286] acpiphp: Slot [13] registered [ 0.725353] acpiphp: Slot [14] registered [ 0.726738] acpiphp: Slot [15] registered [ 0.727747] acpiphp: Slot [16] registered [ 0.729136] acpiphp: Slot [17] registered [ 0.730532] acpiphp: Slot [18] registered [ 0.731716] acpiphp: Slot [19] registered [ 0.733156] acpiphp: Slot [20] registered [ 0.735189] acpiphp: Slot [21] registered [ 0.736612] acpiphp: Slot [22] registered [ 0.738069] acpiphp: Slot [23] registered [ 0.739526] acpiphp: Slot [24] registered [ 0.740674] acpiphp: Slot [25] registered [ 0.741367] acpiphp: Slot [26] registered [ 0.742309] acpiphp: Slot [27] registered [ 0.743671] acpiphp: Slot [28] registered [ 0.744865] acpiphp: Slot [29] registered [ 0.745987] acpiphp: Slot [30] registered [ 0.747411] acpiphp: Slot [31] registered [ 0.748716] PCI host bridge to bus 0000:00 [ 0.750117] pci_bus 0000:00: root bus resource [io 0x0000-0x0cf7 window] [ 0.752504] pci_bus 0000:00: root bus resource [io 0x0d00-0xffff window] [ 0.754832] pci_bus 0000:00: root bus resource [mem 0x000a0000-0x000bffff window] [ 0.756784] pci_bus 0000:00: root bus resource [mem 0xc0000000-0xfebfffff window] [ 0.758703] pci_bus 0000:00: root bus resource [mem 0x380000000000-0x38007fffffff window] [ 0.760959] pci_bus 0000:00: root bus resource [bus 00-ff] [ 0.775313] pci 0000:00:01.1: legacy IDE quirk: reg 0x10: [io 0x01f0-0x01f7] [ 0.779973] pci 0000:00:01.1: legacy IDE quirk: reg 0x14: [io 0x03f6] [ 0.781597] pci 0000:00:01.1: legacy IDE quirk: reg 0x18: [io 0x0170-0x0177] [ 0.783523] pci 0000:00:01.1: legacy IDE quirk: reg 0x1c: [io 0x0376] [ 0.786808] pci 0000:00:01.3: quirk: [io 0x0600-0x063f] claimed by PIIX4 ACPI [ 0.788886] pci 0000:00:01.3: quirk: [io 0x0700-0x070f] claimed by PIIX4 SMB [ 1.049743] ACPI: PCI Interrupt Link [LNKA] (IRQs 5 *10 11) [ 1.054071] ACPI: PCI Interrupt Link [LNKB] (IRQs 5 *10 11) [ 1.056193] ACPI: PCI Interrupt Link [LNKC] (IRQs 5 10 *11) [ 1.057680] ACPI: PCI Interrupt Link [LNKD] (IRQs 5 10 *11) [ 1.060224] ACPI: PCI Interrupt Link [LNKS] (IRQs *9) [ 1.064185] vgaarb: loaded [ 1.065522] SCSI subsystem initialized [ 1.067243] ACPI: bus type USB registered [ 1.068769] usbcore: registered new interface driver usbfs [ 1.070715] usbcore: registered new interface driver hub [ 1.072569] usbcore: registered new device driver usb [ 1.074799] PCI: Using ACPI for IRQ routing [ 1.077119] NetLabel: Initializing [ 1.078266] NetLabel: domain hash size = 128 [ 1.079685] NetLabel: protocols = UNLABELED CIPSOv4 [ 1.081423] NetLabel: unlabeled traffic allowed by default [ 1.084382] hpet0: at MMIO 0xfed00000, IRQs 2, 8, 0 [ 1.086482] hpet0: 3 comparators, 64-bit 100.000000 MHz counter [ 1.091288] amd_nb: Cannot enumerate AMD northbridges [ 1.095205] Switched to clocksource kvm-clock [ 1.116045] pnp: PnP ACPI init [ 1.117262] ACPI: bus type PNP registered [ 1.119715] pnp: PnP ACPI: found 6 devices [ 1.121162] ACPI: bus type PNP unregistered [ 1.140432] NET: Registered protocol family 2 [ 1.142511] TCP established hash table entries: 32768 (order: 6, 262144 bytes) [ 1.145675] TCP bind hash table entries: 32768 (order: 8, 1048576 bytes) [ 1.148683] TCP: Hash tables configured (established 32768 bind 32768) [ 1.150069] TCP: reno registered [ 1.150715] UDP hash table entries: 2048 (order: 5, 196608 bytes) [ 1.152089] UDP-Lite hash table entries: 2048 (order: 5, 196608 bytes) [ 1.154237] NET: Registered protocol family 1 [ 1.158630] RPC: Registered named UNIX socket transport module. [ 1.161505] RPC: Registered udp transport module. [ 1.164080] RPC: Registered tcp transport module. [ 1.166739] RPC: Registered tcp NFSv4.1 backchannel transport module. [ 1.171007] pci 0000:00:00.0: Limiting direct PCI/PCI transfers [ 1.175077] pci 0000:00:01.0: PIIX3: Enabling Passive Release [ 1.177838] pci 0000:00:01.0: Activating ISA DMA hang workarounds [ 1.180972] Unpacking initramfs... [ 2.552022] debug: unmapping init [mem 0xffff8800bc2e2000-0xffff8800bffbffff] [ 2.555850] PCI-DMA: Using software bounce buffering for IO (SWIOTLB) [ 2.558267] software IO TLB [mem 0xb82e2000-0xbc2e2000] (64MB) mapped at [ffff8800b82e2000-ffff8800bc2e1fff] [ 2.563022] RAPL PMU: API unit is 2^-32 Joules, 3 fixed counters, 10737418240 ms ovfl timer [ 2.565780] RAPL PMU: hw unit of domain pp0-core 2^-0 Joules [ 2.567420] RAPL PMU: hw unit of domain package 2^-0 Joules [ 2.569179] RAPL PMU: hw unit of domain dram 2^-0 Joules [ 2.575070] cryptomgr_test (52) used greatest stack depth: 14128 bytes left [ 2.579263] futex hash table entries: 1024 (order: 4, 65536 bytes) [ 2.579325] Initialise system trusted keyring [ 2.610999] HugeTLB registered 1 GB page size, pre-allocated 0 pages [ 2.613005] HugeTLB registered 2 MB page size, pre-allocated 0 pages [ 2.620530] zpool: loaded [ 2.621467] zbud: loaded [ 2.622838] VFS: Disk quotas dquot_6.6.0 [ 2.624212] Dquot-cache hash table entries: 512 (order 0, 4096 bytes) [ 2.627569] NFS: Registering the id_resolver key type [ 2.629408] Key type id_resolver registered [ 2.630876] Key type id_legacy registered [ 2.632388] nfs4filelayout_init: NFSv4 File Layout Driver Registering... [ 2.635726] Key type big_key registered [ 2.641111] cryptomgr_test (58) used greatest stack depth: 13968 bytes left [ 2.650142] NET: Registered protocol family 38 [ 2.652546] Key type asymmetric registered [ 2.653898] Asymmetric key parser 'x509' registered [ 2.655643] Block layer SCSI generic (bsg) driver version 0.4 loaded (major 250) [ 2.658077] io scheduler noop registered [ 2.659131] io scheduler deadline registered (default) [ 2.660653] io scheduler cfq registered [ 2.662019] io scheduler mq-deadline registered [ 2.663410] io scheduler kyber registered [ 2.666281] pci_hotplug: PCI Hot Plug PCI Core version: 0.5 [ 2.668151] pciehp: PCI Express Hot Plug Controller Driver version: 0.4 [ 2.670652] input: Power Button as /devices/LNXSYSTM:00/LNXPWRBN:00/input/input0 [ 2.673033] ACPI: Power Button [PWRF] [ 2.675902] GHES: HEST is not enabled! [ 2.734575] ACPI: PCI Interrupt Link [LNKB] enabled at IRQ 10 [ 2.799998] ACPI: PCI Interrupt Link [LNKA] enabled at IRQ 11 [ 2.924750] ACPI: PCI Interrupt Link [LNKC] enabled at IRQ 11 [ 2.990685] ACPI: PCI Interrupt Link [LNKD] enabled at IRQ 10 [ 3.115374] Serial: 8250/16550 driver, 4 ports, IRQ sharing enabled [ 3.145317] 00:03: ttyS1 at I/O 0x2f8 (irq = 3) is a 16550A [ 3.175357] 00:04: ttyS0 at I/O 0x3f8 (irq = 4) is a 16550A [ 3.178822] Non-volatile memory driver v1.3 [ 3.180377] Linux agpgart interface v0.103 [ 3.182201] crash memory driver: version 1.1 [ 3.184217] nbd: registered device at major 43 [ 3.199861] virtio_blk virtio1: [vda] 60784 512-byte logical blocks (31.1 MB/29.6 MiB) [ 3.225777] virtio_blk virtio2: [vdb] 2097152 512-byte logical blocks (1.07 GB/1.00 GiB) [ 3.240232] virtio_blk virtio3: [vdc] 5120000 512-byte logical blocks (2.62 GB/2.44 GiB) [ 3.256504] virtio_blk virtio4: [vdd] 5120000 512-byte logical blocks (2.62 GB/2.44 GiB) [ 3.270128] virtio_blk virtio5: [vde] 8388608 512-byte logical blocks (4.29 GB/4.00 GiB) [ 3.283759] virtio_blk virtio6: [vdf] 8388608 512-byte logical blocks (4.29 GB/4.00 GiB) [ 3.289315] rdac: device handler registered [ 3.290530] hp_sw: device handler registered [ 3.291987] emc: device handler registered [ 3.293177] libphy: Fixed MDIO Bus: probed [ 3.302918] ehci_hcd: USB 2.0 'Enhanced' Host Controller (EHCI) Driver [ 3.305006] ehci-pci: EHCI PCI platform driver [ 3.306627] ohci_hcd: USB 1.1 'Open' Host Controller (OHCI) Driver [ 3.308298] ohci-pci: OHCI PCI platform driver [ 3.309363] uhci_hcd: USB Universal Host Controller Interface driver [ 3.311716] i8042: PNP: PS/2 Controller [PNP0303:KBD,PNP0f13:MOU] at 0x60,0x64 irq 1,12 [ 3.315206] serio: i8042 KBD port at 0x60,0x64 irq 1 [ 3.316206] serio: i8042 AUX port at 0x60,0x64 irq 12 [ 3.317791] mousedev: PS/2 mouse device common for all mice [ 3.321332] rtc_cmos 00:05: RTC can wake from S4 [ 3.323988] input: AT Translated Set 2 keyboard as /devices/platform/i8042/serio0/input/input1 [ 3.325024] rtc_cmos 00:05: rtc core: registered rtc_cmos as rtc0 [ 3.325260] rtc_cmos 00:05: alarms up to one day, y3k, 242 bytes nvram, hpet irqs [ 3.331363] hidraw: raw HID events driver (C) Jiri Kosina [ 3.331638] usbcore: registered new interface driver usbhid [ 3.331639] usbhid: USB HID core driver [ 3.331708] drop_monitor: Initializing network drop monitor service [ 3.331782] Netfilter messages via NETLINK v0.30. [ 3.331853] TCP: cubic registered [ 3.331875] Initializing XFRM netlink socket [ 3.332750] NET: Registered protocol family 10 [ 3.333373] NET: Registered protocol family 17 [ 3.333416] Key type dns_resolver registered [ 3.334480] mce: Using 10 MCE banks [ 3.335584] Loading compiled-in X.509 certificates [ 3.336538] Loaded X.509 cert 'Magrathea: Glacier signing key: e34d0e1b7fcf5b414cce75d36d8482945c781ed6' [ 3.336594] registered taskstats version 1 [ 3.340341] modprobe (72) used greatest stack depth: 13456 bytes left [ 3.346170] Key type trusted registered [ 3.351960] Key type encrypted registered [ 3.352011] IMA: No TPM chip found, activating TPM-bypass! (rc=-19) [ 3.354482] BERT: Boot Error Record Table support is disabled. Enable it by using bert_enable as kernel parameter. [ 3.355916] rtc_cmos 00:05: setting system clock to 2024-04-17 08:42:06 UTC (1713343326) [ 3.372095] debug: unmapping init [mem 0xffffffff81da0000-0xffffffff82018fff] [ 3.374938] Write protecting the kernel read-only data: 12288k [ 3.379371] debug: unmapping init [mem 0xffff8800017fe000-0xffff8800017fffff] [ 3.381958] debug: unmapping init [mem 0xffff880001b9b000-0xffff880001bfffff] [ 3.391461] random: systemd: uninitialized urandom read (16 bytes read) [ 3.395029] random: systemd: uninitialized urandom read (16 bytes read) [ 3.397317] random: systemd: uninitialized urandom read (16 bytes read) [ 3.401238] systemd[1]: systemd 219 running in system mode. (+PAM +AUDIT +SELINUX +IMA -APPARMOR +SMACK +SYSVINIT +UTMP +LIBCRYPTSETUP +GCRYPT +GNUTLS +ACL +XZ +LZ4 -SECCOMP +BLKID +ELFUTILS +KMOD +IDN) [ 3.406766] systemd[1]: Detected virtualization kvm. [ 3.408538] systemd[1]: Detected architecture x86-64. [ 3.410072] systemd[1]: Running in initial RAM disk. Welcome to CentOS Linux 7 (Core) dracut-033-572.el7 (Initramfs)! [ 3.415168] systemd[1]: No hostname configured. [ 3.417014] systemd[1]: Set hostname to . [ 3.419020] random: systemd: uninitialized urandom read (16 bytes read) [ 3.421175] systemd[1]: Initializing machine ID from random generator. [ 3.477516] dracut-rootfs-g (86) used greatest stack depth: 13264 bytes left [ 3.481943] random: systemd: uninitialized urandom read (16 bytes read) [ 3.484332] random: systemd: uninitialized urandom read (16 bytes read) [ 3.486815] random: systemd: uninitialized urandom read (16 bytes read) [ 3.489278] random: systemd: uninitialized urandom read (16 bytes read) [ 3.493203] random: systemd: uninitialized urandom read (16 bytes read) [ 3.496031] random: systemd: uninitialized urandom read (16 bytes read) [ 3.506733] systemd[1]: Reached target Local File Systems. [ OK ] Reached target Local File Systems. [ 3.511802] systemd[1]: Reached target Swap. [ OK ] Reached target Swap. [ 3.516133] systemd[1]: Created slice Root Slice. [ OK ] Created slice Root Slice. [ 3.520996] systemd[1]: Created slice System Slice. [ OK ] Created slice System Slice. [ 3.525525] systemd[1]: Reached target Timers. [ OK ] Reached target Timers. [ 3.529777] systemd[1]: Reached target Slices. [ OK ] Reached target Slices. [ 3.533937] systemd[1]: Listening on udev Control Socket. [ OK ] Listening on udev Control Socket. [ 3.539020] systemd[1]: Listening on udev Kernel Socket. [ OK ] Listening on udev Kernel Socket. [ 3.544018] systemd[1]: Listening on Journal Socket. [ OK ] Listening on Journal Socket. [ 3.549941] systemd[1]: Starting Load Kernel Modules... Starting Load Kernel Modules... [ 3.555534] systemd[1]: Starting Create list of required static device nodes for the current kernel... Starting Create list of required st... nodes for the current kernel... [ 3.563644] systemd[1]: Starting dracut cmdline hook... Starting dracut cmdline hook... [ 3.569313] systemd[1]: Starting Setup Virtual Console... Starting Setup Virtual Console... [ 3.573960] tsc: Refined TSC clocksource calibration: 2399.957 MHz [ 3.574391] systemd[1]: Starting Journal Service... Starting Journal Service... [ 3.581291] systemd[1]: Reached target Sockets. [ OK ] Reached target Sockets. [ 3.587026] systemd[1]: Started Load Kernel Modules. [ OK ] Started Load Kernel Modules. [ 3.596976] systemd[1]: Started Create list of required static device nodes for the current kernel. [ OK ] Started Create list of required sta...ce nodes for the current kernel. [ 3.605630] systemd[1]: Started Setup Virtual Console. [ OK ] Started Setup Virtual Console. [ 3.615742] systemd[1]: Starting Create Static Device Nodes in /dev... Starting Create Static Device Nodes in /dev... [ 3.622325] systemd[1]: Starting Apply Kernel Variables... Starting Apply Kernel Variables... [ 3.627510] systemd[1]: Started Create Static Device Nodes in /dev. [ OK ] Started Create Static Device Nodes in /dev. [ 3.632753] systemd[1]: Started Journal Service. [ OK ] Started Journal Service. [[ 3.639214] random: fast init done  OK ] Started Apply Kernel Variables. [ OK ] Started dracut cmdline hook. Starting dracut pre-udev hook... [ OK ] Started dracut pre-udev hook. Starting udev Kernel Device Manager... [ OK ] Started udev Kernel Device Manager. Starting dracut pre-trigger hook... [ OK ] Started dracut pre-trigger hook. Starting udev Coldplug all Devices... [ 4.226351] input: ImExPS/2 Generic Explorer Mouse as /devices/platform/i8042/serio1/input/input2 Mounting Configuration File System... [ OK ] Mounted Configuration File System. [ OK ] Started udev Coldplug all Devices. Starting Show Plymouth Boot Screen... [ OK ] Reached target System Initialization. Starting dracut initqueue hook... [ 4.309259] scsi host0: ata_piix [ 4.318831] scsi host1: ata_piix [ 4.320531] ata1: PATA max MWDMA2 cmd 0x1f0 ctl 0x3f6 bmdma 0xc320 irq 14 [ 4.323468] ata2: PATA max MWDMA2 cmd 0x170 ctl 0x376 bmdma 0xc328 irq 15 [ OK ] Started Show Plymouth Boot Screen. [ OK ] Started Forward Password Requests to Plymouth Directory Watch. [ OK ] Reached target Paths. [ OK ] Reached target Basic System. %G[ 4.555194] ip (321) used greatest stack depth: 13080 bytes left [ 4.617021] ip (344) used greatest stack depth: 12336 bytes left [ 6.311966] dracut-initqueue[276]: RTNETLINK answers: File exists [ 7.150265] dracut-initqueue[276]: bs=4096, sz=32212254720 bytes [ OK ] Started dracut initqueue hook. [ OK ] Reached target Initrd Root File System. Starting Reload Configuration from the Real Root... Mounting /sysroot... [ OK ] Reached target Remote File Systems (Pre). [ OK ] Reached target Remote File Systems. [ 10.939064] EXT4-fs (nbd0): mounted filesystem with ordered data mode. Opts: (null) [ OK ] Mounted /sysroot. [ OK ] Started Reload Configuration from the Real Root. [ OK ] Reached target Initrd File Systems. [ OK ] Reached target Initrd Default Target. Starting dracut pre-pivot and cleanup hook... [ OK ] Started dracut pre-pivot and cleanup hook. Starting Cleaning Up and Shutting Down Daemons... Starting Plymouth switch root service... [ OK ] Stopped target Timers. [ OK ] Stopped dracut pre-pivot and cleanup hook. [ OK ] Stopped target Initrd Default Target. [ OK ] Stopped target Basic System. [ OK ] Stopped target System Initialization. [ OK ] Stopped target Local File Systems. [ OK ] Stopped Apply Kernel Variables. [ OK ] Stopped Load Kernel Modules. [ OK ] Stopped target Swap. [ OK ] Stopped target Paths. [ OK ] Stopped target Sockets. [ OK ] Stopped target Slices. [ OK ] Stopped target Remote File Systems. [ OK ] Stopped target Remote File Systems (Pre). [ OK ] Stopped dracut initqueue hook. [ OK ] Stopped udev Coldplug all Devices. [ OK ] Stopped dracut pre-trigger hook. Stopping udev Kernel Device Manager... [ OK ] Started Cleaning Up and Shutting Down Daemons. [ OK ] Stopped udev Kernel Device Manager. [ OK ] Stopped dracut pre-udev hook. [ OK ] Stopped dracut cmdline hook. [ OK ] Stopped Create Static Device Nodes in /dev. [ OK ] Stopped Create list of required sta...ce nodes for the current kernel. [ OK ] Closed udev Control Socket. [ OK ] Closed udev Kernel Socket. Starting Cleanup udevd DB... [ OK ] Started Cleanup udevd DB. [ OK ] Reached target Switch Root. [ OK ] Started Plymouth switch root service. Starting Switch Root... [ 11.456224] systemd-journald[106]: Received SIGTERM from PID 1 (systemd). [ 11.770902] SELinux: Disabled at runtime. [ 11.854197] ip_tables: (C) 2000-2006 Netfilter Core Team [ 11.858554] systemd[1]: Inserted module 'ip_tables' Welcome to CentOS Linux 7 (Core)! [ OK ] Stopped Switch Root. [ OK ] Stopped Journal Service. Starting Journal Service... [ OK ] Listening on Delayed Shutdown Socket. [ OK ] Listening on udev Control Socket. Mounting Debug File System... Starting Read and set NIS domainname from /etc/sysconfig/network... [ OK ] Reached target rpc_pipefs.target. [ OK ] Stopped target Switch Root. [ OK ] Stopped target Initrd Root File System. [ OK ] Stopped target Initrd File Systems. [ OK ] Listening on /dev/initctl Compatibility Named Pipe. [ OK ] Created slice system-getty.slice. Starting Create list of required st... nodes for the current kernel... Mounting Huge Pages File System... Starting Load Kernel Modules... Starting Remount Root and Kernel File Systems... Mounting POSIX Message Queue File System... [ OK ] Created slice system-serial\x2dgetty.slice. [ OK ] Listening on udev Kernel Socket. [ OK ] Created slice system-selinux\x2dpol...grate\x2dlocal\x2dchanges.slice. [ OK ] Started Forward Password Requests to Wall Directory Watch. [ OK ] Created slice User and Session Slice. [ OK ] Reached target Slices. Starting udev Coldplug all Devices... [ OK ] Reached target Local Encrypted Volumes. [ OK ] Set up automount Arbitrary Executab...ats File System Automount Point. Starting Set Up Additional Binary Formats... [ OK ] Mounted Huge Pages File System. [ OK ] Mounted POSIX Message Queue File System. [ OK ] Mounted Debug File System. [ OK ] Started Create list of required sta...ce nodes for the current kernel. [ OK ] Started Load Kernel Modules. Starting Apply Kernel Variables... Starting Create Static Device Nodes in /dev... [ OK ] Started Journal Service. [ OK ] Started Read and set NIS domainname from /etc/sysconfig/network. Mounting Arbitrary Executable File Formats File System... [ OK ] Started Apply Kernel Variables. [ OK ] Mounted Arbitrary Executable File Formats File System. [ OK ] Started udev Coldplug all Devices. [FAILED] Failed to start Remount Root and Kernel File Systems. See 'systemctl status systemd-remount-fs.service' for details. [ OK ] Started Create Static Device Nodes in /dev. Starting udev Kernel Device Manager... [ OK ] Reached target Local File Systems (Pre). Mounting /mnt... Starting Configure read-only root support... Starting Flush Journal to Persistent Storage... [ OK ] Mounted /mnt. [ OK ] Started Set Up Additional Binary Formats. [ 12.642651] systemd-journald[568]: Received request to flush runtime journal from PID 1 [ OK ] Started Flush Journal to Persistent Storage. [ OK ] Started udev Kernel Device Manager. [ 12.907550] piix4_smbus 0000:00:01.3: SMBus Host Controller at 0x700, revision 0 [ OK ] Found device /dev/ttyS1. [ OK ] Found device /dev/ttyS0. [ 12.985265] input: PC Speaker as /devices/platform/pcspkr/input/input3 [ 13.058501] cryptd: max_cpu_qlen set to 1000 [ OK ] Found device /dev/vda. Mounting /home/green/git/lustre-release... [ OK ] Found device /dev/disk/by-label/SWAP. Activating swap /dev/disk/by-label/SWAP... [ 13.114441] AVX version of gcm_enc/dec engaged. [ 13.116188] AES CTR mode by8 optimization enabled [ 13.125764] squashfs: version 4.0 (2009/01/31) Phillip Lougher [ OK ] Mounted /home/green/git/lustre-release. [ 13.134829] Adding 1048572k swap on /dev/vdb. Priority:-2 extents:1 across:1048572k FS [ OK ] Activated swap /dev/disk/by-label/SWAP. [ OK ] Reached target Swap. [ 13.178846] alg: No test for __gcm-aes-aesni (__driver-gcm-aes-aesni) [ 13.183035] alg: No test for __generic-gcm-aes-aesni (__driver-generic-gcm-aes-aesni) %G[ 13.350943] EDAC MC: Ver: 3.0.0 [ 13.362339] EDAC sbridge: Ver: 1.1.2 [ 16.436907] mount.nfs (773) used greatest stack depth: 10704 bytes left [ OK ] Started Configure read-only root support. [ OK ] Reached target Local File Systems. Starting Preprocess NFS configuration... Starting Tell Plymouth To Write Out Runtime Data... Starting Rebuild Journal Catalog... Starting Mark the need to relabel after reboot... Starting Load/Save Random Seed... Starting Create Volatile Files and Directories... [ OK ] Started Mark the need to relabel after reboot. [ OK ] Started Load/Save Random Seed. [FAILED] Failed to start Create Volatile Files and Directories. See 'systemctl status systemd-tmpfiles-setup.service' for details. [ OK ] Started Preprocess NFS configuration. Starting Update UTMP about System Boot/Shutdown... [ OK ] Started Tell Plymouth To Write Out Runtime Data. [FAILED] Failed to start Rebuild Journal Catalog. See 'systemctl status systemd-journal-catalog-update.service' for details. Starting Update is Completed... [ OK ] Started Update UTMP about System Boot/Shutdown. [ OK ] Started Update is Completed. [ OK ] Reached target System Initialization. [ OK ] Listening on D-Bus System Message Bus Socket. [ OK ] Listening on RPCbind Server Activation Socket. [ OK ] Reached target Sockets. [ OK ] Started Flexible branding. [ OK ] Reached target Paths. [ OK ] Started Daily Cleanup of Temporary Directories. [ OK ] Reached target Timers. [ OK ] Reached target Basic System. Starting Dump dmesg to /var/log/dmesg... Starting GSSAPI Proxy Daemon... [ OK ] Started D-Bus System Message Bus. Starting Network Manager... Starting Login Service... [ OK ] Started Dump dmesg to /var/log/dmesg. [ OK ] Started GSSAPI Proxy Daemon. [ OK ] Reached target NFS client services. [ OK ] Reached target Remote File Systems (Pre). [ OK ] Reached target Remote File Systems. Starting Permit User Sessions... [ OK ] Started Permit User Sessions. [ OK ] Started Login Service. [ OK ] Started Network Manager. Starting Network Manager Wait Online... [ OK ] Reached target Network. Starting /etc/rc.d/rc.local Compatibility... Starting OpenSSH server daemon... Starting Hostname Service... [ OK ] Started OpenSSH server daemon. [ OK ] Started /etc/rc.d/rc.local Compatibility. [ OK ] Started Hostname Service. Starting Network Manager Script Dispatcher Service... Starting Terminate Plymouth Boot Screen... Starting Wait for Plymouth Boot Screen to Quit... CentOS Linux 7 (Core) Kernel 3.10.0-7.9-debug on an x86_64 oleg150-server login: [ 31.951976] libcfs: loading out-of-tree module taints kernel. [ 31.953722] libcfs: module verification failed: signature and/or required key missing - tainting kernel [ 31.975249] LNet: HW NUMA nodes: 1, HW CPU cores: 4, npartitions: 1 [ 31.980810] alg: No test for adler32 (adler32-zlib) [ 32.767894] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_hostid [ 38.399024] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing load_modules_local [ 38.756148] Lustre: Lustre: Build Version: 2.15.4_17_g4937c3c [ 38.937368] LNet: Added LNI 192.168.201.150@tcp [8/256/0/180] [ 38.939937] LNet: Accept secure, port 988 [ 40.490960] Key type lgssc registered [ 40.842030] Lustre: Echo OBD driver; http://www.lustre.org/ [ 41.399907] icp: module license 'CDDL' taints kernel. [ 41.402203] Disabling lock debugging due to kernel taint [ 44.187185] ZFS: Loaded module v0.8.6-1, ZFS pool version 5000, ZFS filesystem version 5 [ 46.919841] vdc: vdc1 vdc9 [ 49.720936] vde: vde1 vde9 [ 52.693173] vdf: vdf1 vdf9 [ 57.024305] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing load_modules_local [ 60.243256] Lustre: Setting parameter lustre-MDT0000.mdt.identity_upcall in log lustre-MDT0000 [ 60.317339] Lustre: ctl-lustre-MDT0000: No data found on store. Initialize space: rc = -61 [ 60.355941] Lustre: lustre-MDT0000: new disk, initializing [ 60.474671] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 60.493587] Lustre: ctl-lustre-MDT0000: super-sequence allocation rc = 0 [0x0000000200000400-0x0000000240000400]:0:mdt [ 60.555495] mount.lustre (6526) used greatest stack depth: 10112 bytes left [ 61.547959] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all 8 [ 65.104840] random: crng init done [ 65.261375] Lustre: lustre-OST0000: new disk, initializing [ 65.264976] Lustre: srv-lustre-OST0000: No data found on store. Initialize space: rc = -61 [ 65.268517] Lustre: Skipped 1 previous similar message [ 65.299467] Lustre: lustre-OST0000: Imperative Recovery not enabled, recovery window 60-180 [ 66.540335] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all 8 [ 70.539245] Lustre: lustre-OST0001: new disk, initializing [ 70.542103] Lustre: srv-lustre-OST0001: No data found on store. Initialize space: rc = -61 [ 70.579795] Lustre: lustre-OST0001: Imperative Recovery not enabled, recovery window 60-180 [ 71.845291] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all 8 [ 76.881390] Lustre: DEBUG MARKER: Using TIMEOUT=20 [ 80.146086] Lustre: Modifying parameter general.lod.*.mdt_hash in log params [ 86.045282] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing check_logdir /tmp/testlogs/ [ 87.095757] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing yml_node [ 88.060469] Lustre: DEBUG MARKER: Client: 2.15.4.17 [ 88.678659] Lustre: DEBUG MARKER: MDS: 2.15.4.17 [ 89.307341] Lustre: DEBUG MARKER: OSS: 2.15.4.17 [ 89.702166] Lustre: DEBUG MARKER: -----============= acceptance-small: replay-dual ============----- Wed Apr 17 04:43:32 EDT 2024 [ 91.223588] Lustre: DEBUG MARKER: excepting tests: 14b 21b 21b [ 91.587590] Lustre: DEBUG MARKER: skipping tests SLOW=no: 21b [ 93.127479] Lustre: DEBUG MARKER: oleg150-client.virtnet: executing check_config_client /mnt/lustre [ 97.172478] Lustre: DEBUG MARKER: Using TIMEOUT=20 [ 98.002927] Lustre: Modifying parameter general.lod.*.mdt_hash in log params [ 98.781062] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 99.268857] Lustre: DEBUG MARKER: == replay-dual test 0a: expired recovery with lost client ========================================================== 04:43:41 (1713343421) [ 100.151684] LustreError: 14308:0:(osd_handler.c:694:osd_ro()) lustre-MDT0000: *** setting device osd-zfs read-only *** [ 100.384039] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 101.233123] Lustre: Failing over lustre-MDT0000 [ 101.278204] LustreError: 14596:0:(lprocfs_jobstats.c:137:job_stat_exit()) should not have any items [ 101.368488] Lustre: server umount lustre-MDT0000 complete [ 111.469977] Lustre: 2861:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713343427/real 1713343427] req@ffff88008d77aac0 x1796570729170560/t0(0) o400->lustre-MDT0000-lwp-OST0001@0@lo:12/10 lens 224/224 e 0 to 1 dl 1713343434 ref 1 fl Rpc:XNQr/0/ffffffff rc 0/-1 job:'kworker/u8:0.0' [ 111.478016] Lustre: lustre-MDT0000-lwp-OST0000: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 111.478020] LustreError: 166-1: MGC192.168.201.150@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 111.487709] Lustre: 2861:0:(client.c:2295:ptlrpc_expire_one_request()) Skipped 2 previous similar messages [ 116.477999] Lustre: 2862:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713343432/real 1713343432] req@ffff88008d77b900 x1796570729170752/t0(0) o400->lustre-MDT0000-lwp-OST0001@0@lo:12/10 lens 224/224 e 0 to 1 dl 1713343439 ref 1 fl Rpc:XNQr/0/ffffffff rc 0/-1 job:'kworker/u8:0.0' [ 116.488071] Lustre: 2862:0:(client.c:2295:ptlrpc_expire_one_request()) Skipped 1 previous similar message [ 117.492015] Lustre: Evicted from MGS (at 192.168.201.150@tcp) after server handle changed from 0x1b3448e9e3556056 to 0x1b3448e9e3556c9d [ 117.495559] Lustre: MGC192.168.201.150@tcp: Connection restored to (at 0@lo) [ 117.600720] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 118.461379] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 118.599917] Lustre: 2862:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713343434/real 1713343434] req@ffff88008d77c740 x1796570729170944/t0(0) o400->lustre-MDT0000-lwp-OST0001@0@lo:12/10 lens 224/224 e 0 to 1 dl 1713343441 ref 1 fl Rpc:XNQr/0/ffffffff rc 0/-1 job:'kworker/u8:0.0' [ 125.751959] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 128.607694] Lustre: lustre-MDT0000-lwp-OST0001: Connection restored to 192.168.201.150@tcp (at 0@lo) [ 191.566962] Lustre: lustre-MDT0000: recovery is timed out, evict stale exports [ 191.568965] Lustre: lustre-MDT0000: disconnecting 1 stale clients [ 191.570488] LustreError: 15462:0:(tgt_grant.c:257:tgt_grant_sanity_check()) mdt_obd_disconnect: tot_granted 2097152 != fo_tot_granted 4194304 [ 191.580026] Lustre: 15462:0:(ldlm_lib.c:2830:target_recovery_thread()) too long recovery - read logs [ 191.583082] LustreError: dumping log to /tmp/lustre-log.1713343514.15462 [ 191.600007] Lustre: lustre-MDT0000: Recovery over after 1:06, of 2 clients 1 recovered and 1 was evicted. [ 191.617220] Lustre: lustre-OST0000: deleting orphan objects from 0x0:28 to 0x0:65 [ 191.617228] Lustre: lustre-OST0001: deleting orphan objects from 0x0:28 to 0x0:65 [ 204.646425] Lustre: DEBUG MARKER: == replay-dual test 0b: lost client during waiting for next transno ========================================================== 04:45:27 (1713343527) [ 205.441483] LustreError: 16702:0:(osd_handler.c:694:osd_ro()) lustre-MDT0000: *** setting device osd-zfs read-only *** [ 205.679298] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 206.228138] Lustre: Failing over lustre-MDT0000 [ 206.261606] LustreError: 16938:0:(lprocfs_jobstats.c:137:job_stat_exit()) should not have any items [ 206.264141] LustreError: 16938:0:(lprocfs_jobstats.c:137:job_stat_exit()) Skipped 6 previous similar messages [ 206.335100] Lustre: server umount lustre-MDT0000 complete [ 215.750021] Lustre: 2860:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713343531/real 1713343531] req@ffff88008d4a9300 x1796570729182720/t0(0) o400->lustre-MDT0000-lwp-OST0001@0@lo:12/10 lens 224/224 e 0 to 1 dl 1713343538 ref 1 fl Rpc:XNQr/0/ffffffff rc 0/-1 job:'kworker/u8:2.0' [ 215.750167] Lustre: lustre-MDT0000-lwp-OST0000: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 215.750169] Lustre: Skipped 1 previous similar message [ 215.750236] LustreError: 166-1: MGC192.168.201.150@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 215.781616] Lustre: 2860:0:(client.c:2295:ptlrpc_expire_one_request()) Skipped 2 previous similar messages [ 220.749968] Lustre: 2862:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713343536/real 1713343536] req@ffff88008d4aaac0 x1796570729182912/t0(0) o400->lustre-MDT0000-lwp-OST0001@0@lo:12/10 lens 224/224 e 0 to 1 dl 1713343543 ref 1 fl Rpc:XNQr/0/ffffffff rc 0/-1 job:'kworker/u8:2.0' [ 220.757586] Lustre: 2862:0:(client.c:2295:ptlrpc_expire_one_request()) Skipped 1 previous similar message [ 221.788064] Lustre: Evicted from MGS (at 192.168.201.150@tcp) after server handle changed from 0x1b3448e9e3556c9d to 0x1b3448e9e35582ed [ 221.792448] Lustre: MGC192.168.201.150@tcp: Connection restored to 192.168.201.150@tcp (at 0@lo) [ 221.794987] Lustre: Skipped 1 previous similar message [ 221.931671] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 222.769281] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 222.953066] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 231.905056] Lustre: lustre-MDT0000-lwp-OST0001: Connection restored to 192.168.201.150@tcp (at 0@lo) [ 233.365842] Lustre: lustre-MDT0000: Denying connection for new client 6d1faadc-0e9d-4031-b972-efdf11822e20 (at 192.168.201.50@tcp), waiting for 2 known clients (0 recovered, 1 in progress, and 0 evicted) to recover in 0:55 [ 238.380039] Lustre: lustre-MDT0000: Denying connection for new client 6d1faadc-0e9d-4031-b972-efdf11822e20 (at 192.168.201.50@tcp), waiting for 2 known clients (0 recovered, 1 in progress, and 0 evicted) to recover in 0:50 [ 243.387887] Lustre: lustre-MDT0000: Denying connection for new client 6d1faadc-0e9d-4031-b972-efdf11822e20 (at 192.168.201.50@tcp), waiting for 2 known clients (0 recovered, 1 in progress, and 0 evicted) to recover in 0:45 [ 248.396184] Lustre: lustre-MDT0000: Denying connection for new client 6d1faadc-0e9d-4031-b972-efdf11822e20 (at 192.168.201.50@tcp), waiting for 2 known clients (0 recovered, 1 in progress, and 0 evicted) to recover in 0:40 [ 253.403862] Lustre: lustre-MDT0000: Denying connection for new client 6d1faadc-0e9d-4031-b972-efdf11822e20 (at 192.168.201.50@tcp), waiting for 2 known clients (0 recovered, 1 in progress, and 0 evicted) to recover in 0:35 [ 263.419858] Lustre: lustre-MDT0000: Denying connection for new client 6d1faadc-0e9d-4031-b972-efdf11822e20 (at 192.168.201.50@tcp), waiting for 2 known clients (0 recovered, 1 in progress, and 0 evicted) to recover in 0:25 [ 263.425198] Lustre: Skipped 1 previous similar message [ 276.982569] Lustre: lustre-OST0000: haven't heard from client 40e47947-d664-42c2-85ea-3daf5c9f6727 (at 192.168.201.50@tcp) in 54 seconds. I think it's dead, and I am evicting it. exp ffff88008d476000, cur 1713343600 expire 1713343570 last 1713343546 [ 283.452001] Lustre: lustre-MDT0000: Denying connection for new client 6d1faadc-0e9d-4031-b972-efdf11822e20 (at 192.168.201.50@tcp), waiting for 2 known clients (0 recovered, 1 in progress, and 0 evicted) to recover in 0:05 [ 283.458143] Lustre: Skipped 3 previous similar messages [ 288.566944] Lustre: lustre-MDT0000: recovery is timed out, evict stale exports [ 288.569231] Lustre: lustre-MDT0000: disconnecting 1 stale clients [ 318.507729] Lustre: lustre-MDT0000: Denying connection for new client 6d1faadc-0e9d-4031-b972-efdf11822e20 (at 192.168.201.50@tcp), waiting for 2 known clients (0 recovered, 1 in progress, and 1 evicted) to recover in 0:00 [ 318.511612] Lustre: Skipped 6 previous similar messages [ 318.566976] Lustre: lustre-MDT0000: recovery is timed out, evict stale exports [ 318.569417] Lustre: lustre-MDT0000: disconnecting 1 stale clients [ 318.571177] LustreError: 17810:0:(tgt_grant.c:257:tgt_grant_sanity_check()) mdt_obd_disconnect: tot_granted 0 != fo_tot_granted 2097152 [ 318.583691] Lustre: lustre-MDT0000: Recovery over after 1:36, of 2 clients 0 recovered and 2 were evicted. [ 318.597644] Lustre: lustre-OST0000: deleting orphan objects from 0x0:28 to 0x0:97 [ 318.597746] Lustre: lustre-OST0001: deleting orphan objects from 0x0:67 to 0x0:97 [ 325.001248] Lustre: DEBUG MARKER: == replay-dual test 1: |X| simple create ================= 04:47:27 (1713343647) [ 325.827388] LustreError: 19145:0:(osd_handler.c:694:osd_ro()) lustre-MDT0000: *** setting device osd-zfs read-only *** [ 326.049270] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 326.552167] Lustre: Failing over lustre-MDT0000 [ 326.585130] LustreError: 19379:0:(lprocfs_jobstats.c:137:job_stat_exit()) should not have any items [ 326.587765] LustreError: 19379:0:(lprocfs_jobstats.c:137:job_stat_exit()) Skipped 8 previous similar messages [ 326.660215] Lustre: server umount lustre-MDT0000 complete [ 334.061985] Lustre: 2861:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713343650/real 1713343650] req@ffff88008dbf97c0 x1796570729198720/t0(0) o400->lustre-MDT0000-lwp-OST0000@0@lo:12/10 lens 224/224 e 0 to 1 dl 1713343657 ref 1 fl Rpc:XNQr/0/ffffffff rc 0/-1 job:'kworker/u8:0.0' [ 334.070022] Lustre: lustre-MDT0000-lwp-OST0001: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 334.070024] LustreError: 166-1: MGC192.168.201.150@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 334.070025] Lustre: Skipped 1 previous similar message [ 334.082226] Lustre: 2861:0:(client.c:2295:ptlrpc_expire_one_request()) Skipped 3 previous similar messages [ 340.086149] Lustre: Evicted from MGS (at 192.168.201.150@tcp) after server handle changed from 0x1b3448e9e35582ed to 0x1b3448e9e35589c3 [ 340.091406] Lustre: MGC192.168.201.150@tcp: Connection restored to 192.168.201.150@tcp (at 0@lo) [ 340.094015] Lustre: Skipped 1 previous similar message [ 340.207595] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 341.042406] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 341.661227] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 341.696389] Lustre: lustre-MDT0000: Recovery over after 0:01, of 2 clients 2 recovered and 0 were evicted. [ 341.712054] Lustre: lustre-OST0000: deleting orphan objects from 0x0:99 to 0x0:129 [ 341.712554] Lustre: lustre-OST0001: deleting orphan objects from 0x0:99 to 0x0:129 [ 343.991245] Lustre: DEBUG MARKER: oleg150-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 344.339749] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 346.900111] Lustre: DEBUG MARKER: == replay-dual test 2: |X| mkdir adir ==================== 04:47:49 (1713343669) [ 347.625527] LustreError: 21578:0:(osd_handler.c:694:osd_ro()) lustre-MDT0000: *** setting device osd-zfs read-only *** [ 347.846008] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 348.312192] Lustre: Failing over lustre-MDT0000 [ 348.351504] LustreError: 21792:0:(lprocfs_jobstats.c:137:job_stat_exit()) should not have any items [ 348.353967] LustreError: 21792:0:(lprocfs_jobstats.c:137:job_stat_exit()) Skipped 6 previous similar messages [ 348.422592] Lustre: server umount lustre-MDT0000 complete [ 357.214081] Lustre: 2862:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713343673/real 1713343673] req@ffff88008d77f6c0 x1796570729204160/t0(0) o400->MGC192.168.201.150@tcp@0@lo:26/25 lens 224/224 e 0 to 1 dl 1713343680 ref 1 fl Rpc:XNQr/0/ffffffff rc 0/-1 job:'kworker/u8:0.0' [ 357.223821] Lustre: 2862:0:(client.c:2295:ptlrpc_expire_one_request()) Skipped 3 previous similar messages [ 357.227058] LustreError: 166-1: MGC192.168.201.150@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 363.231761] Lustre: Evicted from MGS (at 192.168.201.150@tcp) after server handle changed from 0x1b3448e9e35589c3 to 0x1b3448e9e3558f03 [ 363.235648] Lustre: MGC192.168.201.150@tcp: Connection restored to 192.168.201.150@tcp (at 0@lo) [ 363.348825] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 364.191289] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 364.709284] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 364.728170] Lustre: lustre-MDT0000: Recovery over after 0:01, of 2 clients 2 recovered and 0 were evicted. [ 364.742712] Lustre: lustre-OST0000: deleting orphan objects from 0x0:99 to 0x0:161 [ 364.742716] Lustre: lustre-OST0001: deleting orphan objects from 0x0:99 to 0x0:161 [ 367.080853] Lustre: DEBUG MARKER: oleg150-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 367.436660] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 370.004832] Lustre: DEBUG MARKER: == replay-dual test 3: |X| mkdir adir, mkdir adir/bdir === 04:48:12 (1713343692) [ 370.782982] LustreError: 24019:0:(osd_handler.c:694:osd_ro()) lustre-MDT0000: *** setting device osd-zfs read-only *** [ 371.017333] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 371.540920] Lustre: Failing over lustre-MDT0000 [ 371.577328] LustreError: 24242:0:(lprocfs_jobstats.c:137:job_stat_exit()) should not have any items [ 371.580191] LustreError: 24242:0:(lprocfs_jobstats.c:137:job_stat_exit()) Skipped 6 previous similar messages [ 371.655609] Lustre: server umount lustre-MDT0000 complete [ 379.342045] Lustre: lustre-MDT0000-lwp-OST0000: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 379.347681] Lustre: Skipped 1 previous similar message [ 379.350037] LustreError: 166-1: MGC192.168.201.150@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 385.349777] Lustre: Evicted from MGS (at 192.168.201.150@tcp) after server handle changed from 0x1b3448e9e3558f03 to 0x1b3448e9e355940b [ 385.353645] Lustre: MGC192.168.201.150@tcp: Connection restored to 192.168.201.150@tcp (at 0@lo) [ 385.356779] Lustre: Skipped 2 previous similar messages [ 386.261838] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 387.749125] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 387.773491] Lustre: lustre-MDT0000: Recovery over after 0:01, of 2 clients 2 recovered and 0 were evicted. [ 387.788865] Lustre: lustre-OST0001: deleting orphan objects from 0x0:99 to 0x0:193 [ 387.788881] Lustre: lustre-OST0000: deleting orphan objects from 0x0:99 to 0x0:193 [ 389.134083] Lustre: DEBUG MARKER: oleg150-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 389.482151] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 392.124455] Lustre: DEBUG MARKER: == replay-dual test 4: |X| mkdir adir (-EEXIST), mkdir adir/bdir ========================================================== 04:48:34 (1713343714) [ 392.873293] LustreError: 26516:0:(osd_handler.c:694:osd_ro()) lustre-MDT0000: *** setting device osd-zfs read-only *** [ 393.100194] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 393.593384] Lustre: Failing over lustre-MDT0000 [ 393.636555] LustreError: 26711:0:(lprocfs_jobstats.c:137:job_stat_exit()) should not have any items [ 393.639235] LustreError: 26711:0:(lprocfs_jobstats.c:137:job_stat_exit()) Skipped 6 previous similar messages [ 393.720456] Lustre: server umount lustre-MDT0000 complete [ 402.461915] Lustre: 2861:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713343718/real 1713343718] req@ffff88009f590000 x1796570729214976/t0(0) o400->lustre-MDT0000-lwp-OST0001@0@lo:12/10 lens 224/224 e 0 to 1 dl 1713343725 ref 1 fl Rpc:XNQr/0/ffffffff rc 0/-1 job:'kworker/u8:0.0' [ 402.461952] LustreError: 166-1: MGC192.168.201.150@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 402.473034] Lustre: 2861:0:(client.c:2295:ptlrpc_expire_one_request()) Skipped 7 previous similar messages [ 402.475581] Lustre: lustre-MDT0000-lwp-OST0001: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 402.479343] Lustre: Skipped 1 previous similar message [ 408.481812] Lustre: Evicted from MGS (at 192.168.201.150@tcp) after server handle changed from 0x1b3448e9e355940b to 0x1b3448e9e35599b4 [ 408.611566] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 408.613985] Lustre: Skipped 1 previous similar message [ 409.534087] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 410.792224] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 410.814820] Lustre: lustre-MDT0000: Recovery over after 0:01, of 2 clients 2 recovered and 0 were evicted. [ 410.829428] Lustre: lustre-OST0001: deleting orphan objects from 0x0:99 to 0x0:225 [ 410.829429] Lustre: lustre-OST0000: deleting orphan objects from 0x0:99 to 0x0:225 [ 412.536238] Lustre: DEBUG MARKER: oleg150-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 412.913889] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 415.569046] Lustre: DEBUG MARKER: == replay-dual test 5: open, unlink |X| close ============ 04:48:58 (1713343738) [ 416.336229] LustreError: 28990:0:(osd_handler.c:694:osd_ro()) lustre-MDT0000: *** setting device osd-zfs read-only *** [ 416.585268] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 417.108824] Lustre: Failing over lustre-MDT0000 [ 417.145106] LustreError: 29202:0:(lprocfs_jobstats.c:137:job_stat_exit()) should not have any items [ 417.147545] LustreError: 29202:0:(lprocfs_jobstats.c:137:job_stat_exit()) Skipped 6 previous similar messages [ 417.230508] Lustre: server umount lustre-MDT0000 complete [ 424.605988] Lustre: lustre-MDT0000-lwp-OST0000: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 424.605990] LustreError: 166-1: MGC192.168.201.150@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 430.615960] Lustre: Evicted from MGS (at 192.168.201.150@tcp) after server handle changed from 0x1b3448e9e35599b4 to 0x1b3448e9e3559f9c [ 430.620296] Lustre: MGC192.168.201.150@tcp: Connection restored to 192.168.201.150@tcp (at 0@lo) [ 430.622389] Lustre: Skipped 3 previous similar messages [ 431.691977] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 433.832414] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 433.869575] Lustre: lustre-MDT0000: Recovery over after 0:01, of 2 clients 2 recovered and 0 were evicted. [ 433.888153] Lustre: lustre-OST0000: deleting orphan objects from 0x0:99 to 0x0:257 [ 433.888232] Lustre: lustre-OST0001: deleting orphan objects from 0x0:99 to 0x0:257 [ 435.201784] Lustre: DEBUG MARKER: oleg150-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 435.601322] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 438.270398] Lustre: DEBUG MARKER: == replay-dual test 6: open1, open2, unlink |X| close1 [fail mds1] close2 ========================================================== 04:49:20 (1713343760) [ 439.285614] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 447.774026] Lustre: lustre-MDT0000-lwp-OST0001: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 454.790576] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 455.892562] Lustre: lustre-OST0001: deleting orphan objects from 0x0:99 to 0x0:289 [ 455.892632] Lustre: lustre-OST0000: deleting orphan objects from 0x0:99 to 0x0:289 [ 457.882029] Lustre: DEBUG MARKER: oleg150-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 458.275342] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 460.971571] Lustre: DEBUG MARKER: == replay-dual test 8: replay of resent request ========== 04:49:43 (1713343783) [ 461.737243] LustreError: 1525:0:(osd_handler.c:694:osd_ro()) lustre-MDT0000: *** setting device osd-zfs read-only *** [ 461.739912] LustreError: 1525:0:(osd_handler.c:694:osd_ro()) Skipped 1 previous similar message [ 461.980912] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 462.232919] Lustre: *** cfs_fail_loc=119, val=2147483648*** [ 462.234607] LustreError: 32393:0:(ldlm_lib.c:3225:target_send_reply_msg()) @@@ dropping reply req@ffff88008d7a1300 x1796570723997440/t38654705670(0) o36->6d1faadc-0e9d-4031-b972-efdf11822e20@192.168.201.50@tcp:396/0 lens 512/448 e 0 to 0 dl 1713343791 ref 1 fl Interpret:/0/0 rc 0/0 job:'mcreate.0' [ 469.252154] Lustre: lustre-MDT0000: Client 6d1faadc-0e9d-4031-b972-efdf11822e20 (at 192.168.201.50@tcp) reconnecting [ 469.256461] Lustre: 553:0:(mdt_recovery.c:200:mdt_req_from_lrd()) @@@ restoring transno req@ffff88008dbf8000 x1796570723997440/t38654705670(0) o36->6d1faadc-0e9d-4031-b972-efdf11822e20@192.168.201.50@tcp:403/0 lens 512/448 e 0 to 0 dl 1713343798 ref 1 fl Interpret:/2/0 rc 0/0 job:'mcreate.0' [ 469.968965] Lustre: Failing over lustre-MDT0000 [ 469.970299] Lustre: Skipped 1 previous similar message [ 470.000756] LustreError: 1912:0:(lprocfs_jobstats.c:137:job_stat_exit()) should not have any items [ 470.003200] LustreError: 1912:0:(lprocfs_jobstats.c:137:job_stat_exit()) Skipped 12 previous similar messages [ 470.084388] Lustre: server umount lustre-MDT0000 complete [ 470.086466] Lustre: Skipped 1 previous similar message [ 480.910024] Lustre: 2859:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713343797/real 1713343797] req@ffff88009f25c740 x1796570729231936/t0(0) o400->lustre-MDT0000-lwp-OST0001@0@lo:12/10 lens 224/224 e 0 to 1 dl 1713343804 ref 1 fl Rpc:XNQr/0/ffffffff rc 0/-1 job:'kworker/u8:0.0' [ 480.918023] Lustre: lustre-MDT0000-lwp-OST0000: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 480.920308] LustreError: 166-1: MGC192.168.201.150@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 480.920310] LustreError: Skipped 1 previous similar message [ 480.932340] Lustre: 2859:0:(client.c:2295:ptlrpc_expire_one_request()) Skipped 12 previous similar messages [ 486.938184] Lustre: Evicted from MGS (at 192.168.201.150@tcp) after server handle changed from 0x1b3448e9e355a4b9 to 0x1b3448e9e355a958 [ 486.941985] Lustre: Skipped 1 previous similar message [ 487.087813] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 487.091516] Lustre: Skipped 2 previous similar messages [ 487.284195] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 487.287627] Lustre: Skipped 1 previous similar message [ 487.310444] Lustre: lustre-MDT0000: Recovery over after 0:01, of 2 clients 2 recovered and 0 were evicted. [ 487.313663] Lustre: Skipped 1 previous similar message [ 487.332520] Lustre: lustre-OST0000: deleting orphan objects from 0x0:99 to 0x0:321 [ 487.332745] Lustre: lustre-OST0001: deleting orphan objects from 0x0:99 to 0x0:321 [ 487.997156] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 491.004987] Lustre: DEBUG MARKER: oleg150-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 491.393461] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 494.108673] Lustre: DEBUG MARKER: == replay-dual test 9: resending a replayed create ======= 04:50:16 (1713343816) [ 495.159288] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 510.095823] Lustre: MGC192.168.201.150@tcp: Connection restored to 192.168.201.150@tcp (at 0@lo) [ 510.098164] Lustre: Skipped 5 previous similar messages [ 510.323733] Lustre: *** cfs_fail_loc=119, val=2147483648*** [ 510.325401] LustreError: 5327:0:(ldlm_lib.c:3225:target_send_reply_msg()) @@@ dropping reply req@ffff88008d778980 x1796570724003200/t42949672966(42949672966) o36->6d1faadc-0e9d-4031-b972-efdf11822e20@192.168.201.50@tcp:444/0 lens 520/448 e 0 to 0 dl 1713343839 ref 1 fl Complete:/4/0 rc 0/0 job:'mcreate.0' [ 511.202541] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 517.332844] Lustre: lustre-MDT0000: Client 6d1faadc-0e9d-4031-b972-efdf11822e20 (at 192.168.201.50@tcp) reconnected, waiting for 2 clients in recovery for 0:58 [ 517.362616] Lustre: lustre-OST0001: deleting orphan objects from 0x0:99 to 0x0:353 [ 517.362617] Lustre: lustre-OST0000: deleting orphan objects from 0x0:99 to 0x0:353 [ 518.771493] Lustre: DEBUG MARKER: oleg150-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 519.193074] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 522.044456] Lustre: DEBUG MARKER: == replay-dual test 10: resending a replayed unlink ====== 04:50:44 (1713343844) [ 523.063647] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 537.016144] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 538.355817] Lustre: *** cfs_fail_loc=119, val=2147483648*** [ 538.357500] LustreError: 8175:0:(ldlm_lib.c:3225:target_send_reply_msg()) @@@ dropping reply req@ffff88008d01f6c0 x1796570724009024/t47244640264(47244640264) o36->6d1faadc-0e9d-4031-b972-efdf11822e20@192.168.201.50@tcp:472/0 lens 504/456 e 0 to 0 dl 1713343867 ref 1 fl Complete:/4/0 rc 0/0 job:'munlink.0' [ 545.364754] Lustre: lustre-MDT0000: Client 6d1faadc-0e9d-4031-b972-efdf11822e20 (at 192.168.201.50@tcp) reconnected, waiting for 2 clients in recovery for 0:58 [ 545.391231] Lustre: lustre-OST0000: deleting orphan objects from 0x0:99 to 0x0:385 [ 545.391232] Lustre: lustre-OST0001: deleting orphan objects from 0x0:99 to 0x0:385 [ 546.705200] Lustre: DEBUG MARKER: oleg150-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 547.063794] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 549.946330] Lustre: DEBUG MARKER: == replay-dual test 11: both clients timeout during replay ========================================================== 04:51:12 (1713343872) [ 550.737614] LustreError: 9797:0:(osd_handler.c:694:osd_ro()) lustre-MDT0000: *** setting device osd-zfs read-only *** [ 550.741216] LustreError: 9797:0:(osd_handler.c:694:osd_ro()) Skipped 2 previous similar messages [ 550.979058] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 551.742740] Lustre: Failing over lustre-MDT0000 [ 551.744190] Lustre: Skipped 2 previous similar messages [ 551.785496] LustreError: 10056:0:(lprocfs_jobstats.c:137:job_stat_exit()) should not have any items [ 551.787534] LustreError: 10056:0:(lprocfs_jobstats.c:137:job_stat_exit()) Skipped 14 previous similar messages [ 551.885474] Lustre: server umount lustre-MDT0000 complete [ 551.886589] Lustre: Skipped 2 previous similar messages [ 559.086090] Lustre: lustre-MDT0000-lwp-OST0001: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 559.091007] Lustre: Skipped 3 previous similar messages [ 563.748402] LustreError: 166-1: MGC192.168.201.150@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 563.751450] LustreError: Skipped 2 previous similar messages [ 563.753648] Lustre: Evicted from MGS (at 192.168.201.150@tcp) after server handle changed from 0x1b3448e9e355b3b5 to 0x1b3448e9e355b8e7 [ 563.756843] Lustre: Skipped 2 previous similar messages [ 564.796687] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 566.392376] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 566.395898] Lustre: Skipped 2 previous similar messages [ 566.405457] Lustre: *** cfs_fail_loc=119, val=2147483648*** [ 566.407402] LustreError: 10940:0:(ldlm_lib.c:3225:target_send_reply_msg()) @@@ dropping reply req@ffff880137142140 x1796570724014272/t51539607558(51539607558) o36->6d1faadc-0e9d-4031-b972-efdf11822e20@192.168.201.50@tcp:500/0 lens 520/448 e 0 to 0 dl 1713343895 ref 1 fl Complete:/4/0 rc 0/0 job:'mcreate.0' [ 566.627416] Lustre: DEBUG MARKER: oleg150-client.virtnet: executing wait_import_state_mount FULL mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 573.411687] Lustre: lustre-MDT0000: Client 6d1faadc-0e9d-4031-b972-efdf11822e20 (at 192.168.201.50@tcp) reconnected, waiting for 2 clients in recovery for 0:58 [ 573.429705] Lustre: lustre-MDT0000: Recovery over after 0:07, of 2 clients 2 recovered and 0 were evicted. [ 573.432329] Lustre: Skipped 2 previous similar messages [ 573.445245] Lustre: lustre-OST0000: deleting orphan objects from 0x0:99 to 0x0:417 [ 573.445276] Lustre: lustre-OST0001: deleting orphan objects from 0x0:99 to 0x0:417 [ 574.104014] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 7 sec [ 575.882209] Lustre: DEBUG MARKER: == replay-dual test 12: open resend timeout ============== 04:51:38 (1713343898) [ 576.905047] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 594.437027] Lustre: *** cfs_fail_loc=302, val=2147483648*** [ 594.883302] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 601.455249] Lustre: lustre-MDT0000: Client 6d1faadc-0e9d-4031-b972-efdf11822e20 (at 192.168.201.50@tcp) reconnected, waiting for 2 clients in recovery for 0:58 [ 601.495540] Lustre: lustre-OST0000: deleting orphan objects from 0x0:99 to 0x0:449 [ 601.495640] Lustre: lustre-OST0001: deleting orphan objects from 0x0:99 to 0x0:449 [ 602.906122] Lustre: DEBUG MARKER: == replay-dual test 13: close resend timeout ============= 04:52:05 (1713343925) [ 603.911292] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 612.040067] Lustre: 2861:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713343928/real 1713343928] req@ffff88008d4bf200 x1796570729260352/t0(0) o400->lustre-MDT0000-lwp-OST0000@0@lo:12/10 lens 224/224 e 0 to 1 dl 1713343935 ref 1 fl Rpc:XNQr/0/ffffffff rc 0/-1 job:'kworker/u8:0.0' [ 612.049512] Lustre: 2861:0:(client.c:2295:ptlrpc_expire_one_request()) Skipped 19 previous similar messages [ 616.761125] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 616.763083] Lustre: Skipped 4 previous similar messages [ 616.781157] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 617.649926] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 622.497306] Lustre: lustre-OST0000: deleting orphan objects from 0x0:99 to 0x0:481 [ 622.497361] Lustre: lustre-OST0001: deleting orphan objects from 0x0:99 to 0x0:481 [ 623.931664] Lustre: DEBUG MARKER: SKIP: replay-dual test_14b skipping ALWAYS excluded test 14b [ 624.297667] Lustre: DEBUG MARKER: == replay-dual test 15a: timeout waiting for lost client during replay, 1 client completes ========================================================== 04:52:26 (1713343946) [ 625.327594] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 639.767544] Lustre: MGC192.168.201.150@tcp: Connection restored to 192.168.201.150@tcp (at 0@lo) [ 639.770346] Lustre: Skipped 13 previous similar messages [ 639.912081] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 640.791494] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 715.566982] Lustre: lustre-MDT0000: recovery is timed out, evict stale exports [ 715.569775] Lustre: lustre-MDT0000: disconnecting 1 stale clients [ 715.685434] Lustre: lustre-MDT0000: Recovery over after 1:06, of 2 clients 1 recovered and 1 was evicted. [ 715.687811] Lustre: Skipped 2 previous similar messages [ 715.703684] Lustre: lustre-OST0000: deleting orphan objects from 0x0:494 to 0x0:513 [ 715.704480] Lustre: lustre-OST0001: deleting orphan objects from 0x0:495 to 0x0:513 [ 717.024703] Lustre: DEBUG MARKER: oleg150-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 717.388692] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 720.366063] Lustre: DEBUG MARKER: == replay-dual test 15c: remove multiple OST orphans ===== 04:54:02 (1713344042) [ 721.166339] LustreError: 20045:0:(osd_handler.c:694:osd_ro()) lustre-MDT0000: *** setting device osd-zfs read-only *** [ 721.169644] LustreError: 20045:0:(osd_handler.c:694:osd_ro()) Skipped 3 previous similar messages [ 721.407598] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 738.830515] Lustre: Failing over lustre-MDT0000 [ 738.832630] Lustre: Skipped 3 previous similar messages [ 738.869682] LustreError: 21236:0:(lprocfs_jobstats.c:137:job_stat_exit()) should not have any items [ 738.872240] LustreError: 21236:0:(lprocfs_jobstats.c:137:job_stat_exit()) Skipped 23 previous similar messages [ 738.886143] Lustre: lustre-MDT0000: Not available for connect from 192.168.201.50@tcp (stopping) [ 739.008668] Lustre: server umount lustre-MDT0000 complete [ 739.009884] Lustre: Skipped 3 previous similar messages [ 747.038041] Lustre: lustre-MDT0000-lwp-OST0000: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 747.042324] Lustre: Skipped 6 previous similar messages [ 747.046014] LustreError: 166-1: MGC192.168.201.150@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 747.049292] LustreError: Skipped 3 previous similar messages [ 753.052079] Lustre: Evicted from MGS (at 192.168.201.150@tcp) after server handle changed from 0x1b3448e9e355cc3c to 0x1b3448e9e3572160 [ 753.056451] Lustre: Skipped 3 previous similar messages [ 753.211243] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 754.127441] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 754.896056] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 754.898739] Lustre: Skipped 3 previous similar messages [ 820.566987] Lustre: lustre-MDT0000: recovery is timed out, evict stale exports [ 820.569485] Lustre: lustre-MDT0000: disconnecting 1 stale clients [ 820.610780] Lustre: lustre-OST0000: deleting orphan objects from 0x0:494 to 0x0:1537 [ 820.611205] Lustre: lustre-OST0001: deleting orphan objects from 0x0:495 to 0x0:1537 [ 822.376904] Lustre: DEBUG MARKER: oleg150-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 822.878826] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 826.958024] Lustre: DEBUG MARKER: == replay-dual test 16: fail MDS during recovery (3571) == 04:55:49 (1713344149) [ 828.396468] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 829.390675] Lustre: lustre-MDT0000: Not available for connect from 192.168.201.50@tcp (stopping) [ 847.552509] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 848.754243] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 870.198370] LustreError: 25816:0:(ldlm_lib.c:2883:target_stop_recovery_thread()) lustre-MDT0000: Aborting recovery [ 870.201516] Lustre: 25154:0:(ldlm_lib.c:2289:target_recovery_overseer()) recovery is aborted, evict exports in recovery [ 870.205999] Lustre: 25154:0:(ldlm_lib.c:1804:abort_req_replay_queue()) @@@ aborted: req@ffff88008b095a40 x1796570725089280/t0(73014444037) o101->6d1faadc-0e9d-4031-b972-efdf11822e20@192.168.201.50@tcp:53/0 lens 592/0 e 1 to 0 dl 1713344203 ref 1 fl Complete:/4/ffffffff rc 0/-1 job:'createmany.0' [ 870.226831] Lustre: lustre-MDT0000: Not available for connect from 192.168.201.50@tcp (stopping) [ 870.233939] Lustre: 25154:0:(mdt_handler.c:7532:mdt_postrecov()) lustre-MDT0000: auto trigger paused LFSCK failed: rc = -6 [ 880.525957] Lustre: 2861:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713344196/real 1713344196] req@ffff88008c1617c0 x1796570729302336/t0(0) o400->MGC192.168.201.150@tcp@0@lo:26/25 lens 224/224 e 0 to 1 dl 1713344203 ref 1 fl Rpc:XNQr/0/ffffffff rc 0/-1 job:'kworker/u8:2.0' [ 880.542825] Lustre: 2861:0:(client.c:2295:ptlrpc_expire_one_request()) Skipped 20 previous similar messages [ 886.780466] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 886.784170] Lustre: Skipped 3 previous similar messages [ 886.822541] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 888.099625] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 897.749137] Lustre: lustre-MDT0000-lwp-OST0000: Connection restored to 192.168.201.150@tcp (at 0@lo) [ 897.753918] Lustre: Skipped 9 previous similar messages [ 967.566979] Lustre: lustre-MDT0000: recovery is timed out, evict stale exports [ 967.571382] Lustre: lustre-MDT0000: disconnecting 1 stale clients [ 967.815600] Lustre: lustre-OST0001: deleting orphan objects from 0x0:1551 to 0x0:1569 [ 967.815775] Lustre: lustre-OST0000: deleting orphan objects from 0x0:1550 to 0x0:1569 [ 969.508829] Lustre: DEBUG MARKER: oleg150-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 970.000006] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 973.641452] Lustre: DEBUG MARKER: == replay-dual test 17: fail OST during recovery (3571) == 04:58:15 (1713344295) [ 975.254436] Lustre: DEBUG MARKER: ost1 REPLAY BARRIER on lustre-OST0000 [ 976.656791] LustreError: 137-5: lustre-OST0000_UUID: not available for connect from 192.168.201.50@tcp (no target). If you are running an HA pair check that the target is mounted on the other server. [ 977.814353] LustreError: 11-0: lustre-OST0000-osc-MDT0000: operation ost_destroy to node 0@lo failed: rc = -107 [ 977.816902] LustreError: 137-5: lustre-OST0000_UUID: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 977.823424] LustreError: Skipped 4 previous similar messages [ 981.661257] LustreError: 137-5: lustre-OST0000_UUID: not available for connect from 192.168.201.50@tcp (no target). If you are running an HA pair check that the target is mounted on the other server. [ 986.669288] LustreError: 137-5: lustre-OST0000_UUID: not available for connect from 192.168.201.50@tcp (no target). If you are running an HA pair check that the target is mounted on the other server. [ 986.675238] LustreError: Skipped 1 previous similar message [ 988.627803] Lustre: lustre-OST0000: in recovery but waiting for the first client to connect [ 989.790531] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 1011.057397] Lustre: Failing over lustre-OST0000 [ 1011.059140] Lustre: Skipped 3 previous similar messages [ 1011.062605] LustreError: 29967:0:(ldlm_lib.c:2883:target_stop_recovery_thread()) lustre-OST0000: Aborting recovery [ 1011.065842] Lustre: 29266:0:(ldlm_lib.c:2289:target_recovery_overseer()) recovery is aborted, evict exports in recovery [ 1011.069082] Lustre: 29266:0:(ldlm_lib.c:2289:target_recovery_overseer()) Skipped 2 previous similar messages [ 1011.072218] Lustre: lustre-OST0000: Recovery over after 0:21, of 3 clients 0 recovered and 3 were evicted. [ 1011.078016] Lustre: Skipped 3 previous similar messages [ 1011.081745] Lustre: 29266:0:(ofd_obd.c:554:ofd_postrecov()) lustre-OST0000: auto trigger paused LFSCK failed: rc = -6 [ 1011.099388] Lustre: server umount lustre-OST0000 complete [ 1011.101802] Lustre: Skipped 3 previous similar messages [ 1018.685108] LustreError: 137-5: lustre-OST0000_UUID: not available for connect from 192.168.201.50@tcp (no target). If you are running an HA pair check that the target is mounted on the other server. [ 1018.690688] LustreError: Skipped 1 previous similar message [ 1023.720400] Lustre: lustre-OST0000: in recovery but waiting for the first client to connect [ 1024.821478] Lustre: lustre-OST0000: Will be in recovery for at least 1:00, or until 3 clients reconnect [ 1024.826875] Lustre: Skipped 3 previous similar messages [ 1025.044024] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 1090.567510] Lustre: lustre-OST0000: recovery is timed out, evict stale exports [ 1090.573797] Lustre: lustre-OST0000: disconnecting 1 stale clients [ 1090.638722] Lustre: lustre-OST0000: deleting orphan objects from 0x0:1583 to 0x0:1601 [ 1093.006074] Lustre: DEBUG MARKER: oleg150-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid [ 1093.592475] Lustre: DEBUG MARKER: osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec [ 1098.058680] Lustre: DEBUG MARKER: == replay-dual test 18: ldlm_handle_enqueue succeeds on evicted export (3822) ========================================================== 05:00:20 (1713344420) [ 1100.443034] LustreError: 27171:0:(fail.c:138:__cfs_fail_timeout_set()) cfs_fail_timeout id 30b sleeping for 40000ms [ 1140.484980] LustreError: 27171:0:(fail.c:149:__cfs_fail_timeout_set()) cfs_fail_timeout id 30b awake [ 1143.773444] Lustre: DEBUG MARKER: == replay-dual test 19: resend of open request =========== 05:01:06 (1713344466) [ 1144.922983] LustreError: 434:0:(osd_handler.c:694:osd_ro()) lustre-MDT0000: *** setting device osd-zfs read-only *** [ 1144.926638] LustreError: 434:0:(osd_handler.c:694:osd_ro()) Skipped 2 previous similar messages [ 1145.252673] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 1145.673921] Lustre: *** cfs_fail_loc=157, val=2147483648*** [ 1145.676343] LustreError: 26502:0:(ldlm_lib.c:3225:target_send_reply_msg()) @@@ dropping reply req@ffff8800a55f1300 x1796570725131520/t77309411443(0) o101->6d1faadc-0e9d-4031-b972-efdf11822e20@192.168.201.50@tcp:394/0 lens 664/656 e 0 to 0 dl 1713344544 ref 1 fl Interpret:/0/0 rc 0/0 job:'createmany.0' [ 1222.686948] Lustre: lustre-MDT0000: Client 6d1faadc-0e9d-4031-b972-efdf11822e20 (at 192.168.201.50@tcp) reconnecting [ 1222.697318] Lustre: 26501:0:(mdt_recovery.c:200:mdt_req_from_lrd()) @@@ restoring transno req@ffff880098990980 x1796570725131520/t77309411443(0) o101->6d1faadc-0e9d-4031-b972-efdf11822e20@192.168.201.50@tcp:471/0 lens 664/3424 e 0 to 0 dl 1713344621 ref 1 fl Interpret:H/2/0 rc 0/0 job:'createmany.0' [ 1223.805591] LustreError: 929:0:(lprocfs_jobstats.c:137:job_stat_exit()) should not have any items [ 1223.808656] LustreError: 929:0:(lprocfs_jobstats.c:137:job_stat_exit()) Skipped 16 previous similar messages [ 1232.038034] LustreError: 166-1: MGC192.168.201.150@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 1232.041421] LustreError: Skipped 2 previous similar messages [ 1238.045139] Lustre: Evicted from MGS (at 192.168.201.150@tcp) after server handle changed from 0x1b3448e9e357616a to 0x1b3448e9e35778af [ 1238.052278] Lustre: Skipped 2 previous similar messages [ 1238.301100] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 1239.467179] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 1239.808850] Lustre: lustre-OST0000: deleting orphan objects from 0x0:1603 to 0x0:1633 [ 1239.808856] Lustre: lustre-OST0001: deleting orphan objects from 0x0:1584 to 0x0:1601 [ 1242.873336] Lustre: DEBUG MARKER: oleg150-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 1243.222894] Lustre: lustre-MDT0000-lwp-OST0001: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 1243.228198] Lustre: Skipped 7 previous similar messages [ 1243.276816] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 1246.634027] Lustre: DEBUG MARKER: == replay-dual test 20: recovery time is not increasing == 05:02:48 (1713344568) [ 1248.247798] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 1267.593238] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 1402.566989] Lustre: lustre-MDT0000: recovery is timed out, evict stale exports [ 1402.569450] Lustre: lustre-MDT0000: disconnecting 1 stale clients [ 1402.577073] Lustre: 4238:0:(ldlm_lib.c:1971:extend_recovery_timer()) lustre-MDT0000: extended recovery timer reached hard limit: 180, extend: 1 [ 1402.613622] Lustre: lustre-OST0000: deleting orphan objects from 0x0:1603 to 0x0:1665 [ 1402.613627] Lustre: lustre-OST0001: deleting orphan objects from 0x0:1603 to 0x0:1633 [ 1403.948936] Lustre: DEBUG MARKER: oleg150-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 1404.326487] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 1406.771629] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 1416.631510] Lustre: 2862:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713344732/real 1713344732] req@ffff88008a94c280 x1796570729366080/t0(0) o400->MGC192.168.201.150@tcp@0@lo:26/25 lens 224/224 e 0 to 1 dl 1713344739 ref 1 fl Rpc:XNQr/0/ffffffff rc 0/-1 job:'kworker/u8:1.0' [ 1416.641823] Lustre: 2862:0:(client.c:2295:ptlrpc_expire_one_request()) Skipped 25 previous similar messages [ 1422.650500] Lustre: MGC192.168.201.150@tcp: Connection restored to 192.168.201.150@tcp (at 0@lo) [ 1422.653489] Lustre: Skipped 8 previous similar messages [ 1422.717321] Lustre: lustre-MDT0000: Not available for connect from 192.168.201.50@tcp (not set up) [ 1422.800940] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 1422.803438] Lustre: Skipped 4 previous similar messages [ 1422.826688] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 1422.829764] Lustre: Skipped 1 previous similar message [ 1423.950144] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 1558.568665] Lustre: lustre-MDT0000: recovery is timed out, evict stale exports [ 1558.574486] Lustre: lustre-MDT0000: disconnecting 1 stale clients [ 1558.586350] Lustre: 6621:0:(ldlm_lib.c:1971:extend_recovery_timer()) lustre-MDT0000: extended recovery timer reached hard limit: 180, extend: 1 [ 1558.591315] Lustre: 6621:0:(ldlm_lib.c:1971:extend_recovery_timer()) Skipped 2 previous similar messages [ 1558.634931] Lustre: lustre-MDT0000: Recovery over after 2:15, of 2 clients 1 recovered and 1 was evicted. [ 1558.638967] Lustre: Skipped 3 previous similar messages [ 1558.660467] Lustre: lustre-OST0001: deleting orphan objects from 0x0:1635 to 0x0:1665 [ 1558.660472] Lustre: lustre-OST0000: deleting orphan objects from 0x0:1603 to 0x0:1697 [ 1560.614733] Lustre: DEBUG MARKER: oleg150-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 1561.163252] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 1564.537667] Lustre: DEBUG MARKER: == replay-dual test 21a: commit on sharing =============== 05:08:06 (1713344886) [ 1566.161880] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 1566.872345] Lustre: Failing over lustre-MDT0000 [ 1566.873972] Lustre: Skipped 3 previous similar messages [ 1567.011083] Lustre: server umount lustre-MDT0000 complete [ 1567.012613] Lustre: Skipped 3 previous similar messages [ 1581.224268] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 1582.130038] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 1582.134660] Lustre: Skipped 3 previous similar messages [ 1582.313008] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 1716.567049] Lustre: lustre-MDT0000: recovery is timed out, evict stale exports [ 1716.569965] Lustre: lustre-MDT0000: disconnecting 1 stale clients [ 1716.601288] Lustre: lustre-OST0000: deleting orphan objects from 0x0:1699 to 0x0:1729 [ 1716.601355] Lustre: lustre-OST0001: deleting orphan objects from 0x0:1635 to 0x0:1697 [ 1718.865980] Lustre: DEBUG MARKER: SKIP: replay-dual test_21b skipping SLOW test 21b [ 1719.316283] Lustre: DEBUG MARKER: == replay-dual test 22a: c1 lfs mkdir -i 1 dir1, M1 drop reply [ 1719.717259] Lustre: DEBUG MARKER: SKIP: replay-dual test_22a needs >= 2 MDTs [ 1720.186209] Lustre: DEBUG MARKER: == replay-dual test 22b: c1 lfs mkdir -i 1 d1, M1 drop reply [ 1720.576356] Lustre: DEBUG MARKER: SKIP: replay-dual test_22b needs >= 2 MDTs [ 1721.003780] Lustre: DEBUG MARKER: == replay-dual test 22c: c1 lfs mkdir -i 1 d1, M1 drop update [ 1721.372119] Lustre: DEBUG MARKER: SKIP: replay-dual test_22c needs >= 2 MDTs [ 1721.797082] Lustre: DEBUG MARKER: == replay-dual test 22d: c1 lfs mkdir -i 1 d1, M1 drop update [ 1722.163100] Lustre: DEBUG MARKER: SKIP: replay-dual test_22d needs >= 2 MDTs [ 1722.603414] Lustre: DEBUG MARKER: == replay-dual test 23a: c1 rmdir d1, M1 drop reply and fail, client2 mkdir d1 ========================================================== 05:10:45 (1713345045) [ 1723.000442] Lustre: DEBUG MARKER: SKIP: replay-dual test_23a needs >= 2 MDTs [ 1723.447651] Lustre: DEBUG MARKER: == replay-dual test 23b: c1 rmdir d1, M1 drop reply and fail M0/M1, c2 mkdir d1 ========================================================== 05:10:45 (1713345045) [ 1723.851358] Lustre: DEBUG MARKER: SKIP: replay-dual test_23b needs >= 2 MDTs [ 1724.290025] Lustre: DEBUG MARKER: == replay-dual test 23c: c1 rmdir d1, M0 drop update reply and fail M0, c2 mkdir d1 ========================================================== 05:10:46 (1713345046) [ 1724.676792] Lustre: DEBUG MARKER: SKIP: replay-dual test_23c needs >= 2 MDTs [ 1725.126649] Lustre: DEBUG MARKER: == replay-dual test 23d: c1 rmdir d1, M0 drop update reply and fail M0/M1, c2 mkdir d1 ========================================================== 05:10:47 (1713345047) [ 1725.505885] Lustre: DEBUG MARKER: SKIP: replay-dual test_23d needs >= 2 MDTs [ 1725.964725] Lustre: DEBUG MARKER: == replay-dual test 24: reconstruct on non-existing object ========================================================== 05:10:48 (1713345048) [ 1726.278809] Lustre: *** cfs_fail_loc=119, val=2147483648*** [ 1726.280580] LustreError: 9355:0:(ldlm_lib.c:3225:target_send_reply_msg()) @@@ dropping reply req@ffff8800989917c0 x1796570725173376/t94489280517(0) o36->6d1faadc-0e9d-4031-b972-efdf11822e20@192.168.201.50@tcp:219/0 lens 488/456 e 0 to 0 dl 1713345124 ref 1 fl Interpret:/0/0 rc 0/0 job:'truncate.0' [ 1803.286913] Lustre: lustre-MDT0000: Client 6d1faadc-0e9d-4031-b972-efdf11822e20 (at 192.168.201.50@tcp) reconnecting [ 1803.292936] Lustre: 9355:0:(mdt_recovery.c:200:mdt_req_from_lrd()) @@@ restoring transno req@ffff880137141c80 x1796570725173376/t94489280517(0) o36->6d1faadc-0e9d-4031-b972-efdf11822e20@192.168.201.50@tcp:296/0 lens 488/3152 e 0 to 0 dl 1713345201 ref 1 fl Interpret:/2/0 rc 0/0 job:'truncate.0' [ 1804.876424] Lustre: DEBUG MARKER: == replay-dual test 25: replay|resend ==================== 05:12:07 (1713345127) [ 1805.320519] Lustre: *** cfs_fail_loc=304, val=0*** [ 1806.729455] LustreError: 12739:0:(lprocfs_jobstats.c:137:job_stat_exit()) should not have any items [ 1806.732679] LustreError: 12739:0:(lprocfs_jobstats.c:137:job_stat_exit()) Skipped 32 previous similar messages [ 1806.758376] LustreError: 11-0: lustre-OST0000-osc-MDT0000: operation ost_statfs to node 0@lo failed: rc = -107 [ 1806.761509] LustreError: Skipped 3 previous similar messages [ 1806.763297] Lustre: lustre-OST0000-osc-MDT0000: Connection to lustre-OST0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 1806.769316] Lustre: Skipped 6 previous similar messages [ 1806.772047] LustreError: 137-5: lustre-OST0000_UUID: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 1806.777633] LustreError: Skipped 1 previous similar message [ 1808.302794] LustreError: 137-5: lustre-OST0000_UUID: not available for connect from 192.168.201.50@tcp (no target). If you are running an HA pair check that the target is mounted on the other server. [ 1808.309131] LustreError: Skipped 1 previous similar message [ 1811.528084] LustreError: 137-5: lustre-OST0000_UUID: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 1816.535080] LustreError: 137-5: lustre-OST0000_UUID: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 1816.541118] LustreError: Skipped 2 previous similar messages [ 1819.941658] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 1820.848257] Lustre: lustre-OST0000: deleting orphan objects from 0x0:1732 to 0x0:1761 [ 1823.060254] Lustre: DEBUG MARKER: oleg150-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid [ 1823.452470] Lustre: DEBUG MARKER: osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec [ 1826.527813] Lustre: DEBUG MARKER: == replay-dual test 26: dbench and tar with mds failover ========================================================== 05:12:28 (1713345148) [ 1829.542823] LustreError: 15149:0:(osd_handler.c:694:osd_ro()) lustre-MDT0000: *** setting device osd-zfs read-only *** [ 1829.547497] LustreError: 15149:0:(osd_handler.c:694:osd_ro()) Skipped 3 previous similar messages [ 1829.850252] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 1832.242301] Lustre: DEBUG MARKER: test_26 fail mds1 1 times [ 1840.918039] LustreError: 166-1: MGC192.168.201.150@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 1840.922774] LustreError: Skipped 3 previous similar messages [ 1846.926619] Lustre: Evicted from MGS (at 192.168.201.150@tcp) after server handle changed from 0x1b3448e9e3578884 to 0x1b3448e9e3582f26 [ 1846.930323] Lustre: Skipped 3 previous similar messages [ 1847.119621] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 1847.122371] Lustre: Skipped 1 previous similar message [ 1848.070124] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 1861.969087] Lustre: lustre-OST0000: deleting orphan objects from 0x0:1895 to 0x0:1921 [ 1861.969827] Lustre: lustre-OST0001: deleting orphan objects from 0x0:1832 to 0x0:1857 [ 1863.460199] Lustre: DEBUG MARKER: oleg150-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 1863.890899] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 1868.395851] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 1870.807085] Lustre: DEBUG MARKER: test_26 fail mds1 2 times [ 1886.185618] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 1887.983655] Lustre: lustre-OST0000: deleting orphan objects from 0x0:2095 to 0x0:2113 [ 1887.983678] Lustre: lustre-OST0001: deleting orphan objects from 0x0:2031 to 0x0:2049 [ 1889.498908] Lustre: DEBUG MARKER: oleg150-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 1889.931364] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 1894.569039] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 1896.961691] Lustre: DEBUG MARKER: test_26 fail mds1 3 times [ 1913.431033] mount.lustre (26919) used greatest stack depth: 10008 bytes left [ 1914.357594] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 1926.855424] Lustre: lustre-OST0001: deleting orphan objects from 0x0:2199 to 0x0:2241 [ 1926.855448] Lustre: lustre-OST0000: deleting orphan objects from 0x0:2264 to 0x0:2305 [ 1928.327754] Lustre: DEBUG MARKER: oleg150-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 1928.763446] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 1933.228460] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 1935.610171] Lustre: DEBUG MARKER: test_26 fail mds1 4 times [ 1952.484435] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 1952.912559] Lustre: lustre-OST0000: deleting orphan objects from 0x0:2481 to 0x0:2497 [ 1952.912626] Lustre: lustre-OST0001: deleting orphan objects from 0x0:2418 to 0x0:2433 [ 1955.678625] Lustre: DEBUG MARKER: oleg150-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 1956.152555] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 1960.547471] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 1962.972631] Lustre: DEBUG MARKER: test_26 fail mds1 5 times [ 1963.547491] LustreError: 32129:0:(ldlm_lockd.c:1427:ldlm_handle_enqueue0()) ### lock on destroyed export ffff8801326ae800 ns: mdt-lustre-MDT0000_UUID lock: ffff88012b835440/0x1b3448e9e35d2f5b lrc: 3/0,0 mode: CW/CW res: [0x20000afe1:0x1e0:0x0].0x0 bits 0x5/0x0 rrc: 2 type: IBT gid 0 flags: 0x50200000000000 nid: 192.168.201.50@tcp remote: 0xa67f8ad09748bda9 expref: 4 pid: 32129 timeout: 0 lvb_type: 0 [ 1966.582821] Lustre: lustre-MDT0000: Not available for connect from 0@lo (stopping) [ 1966.585350] Lustre: Skipped 1 previous similar message [ 1985.777745] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 1990.837566] Lustre: lustre-OST0001: deleting orphan objects from 0x0:2637 to 0x0:2657 [ 1990.837756] Lustre: lustre-OST0000: deleting orphan objects from 0x0:2702 to 0x0:2721 [ 1992.313728] Lustre: DEBUG MARKER: oleg150-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 1992.716542] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 1997.085573] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 1999.466218] Lustre: DEBUG MARKER: test_26 fail mds1 6 times [ 2017.987249] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 2018.934909] Lustre: 2861:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713345266/real 1713345266] req@ffff88008b764280 x1796570729665856/t0(0) o400->lustre-MDT0000-lwp-OST0000@0@lo:12/10 lens 224/224 e 0 to 1 dl 1713345342 ref 1 fl Rpc:XNQr/0/ffffffff rc 0/-1 job:'kworker/u8:2.0' [ 2018.944058] Lustre: 2861:0:(client.c:2295:ptlrpc_expire_one_request()) Skipped 60 previous similar messages [ 2034.780008] Lustre: lustre-OST0000: deleting orphan objects from 0x0:2910 to 0x0:2945 [ 2034.780452] Lustre: lustre-OST0001: deleting orphan objects from 0x0:2845 to 0x0:2881 [ 2036.415992] Lustre: DEBUG MARKER: oleg150-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 2036.873683] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 2064.100909] Lustre: DEBUG MARKER: == replay-dual test 28: lock replay should be ordered: waiting after granted ========================================================== 05:16:26 (1713345386) [ 2067.071878] Lustre: *** cfs_fail_loc=32a, val=0*** [ 2067.072543] LustreError: 11-0: lustre-OST0000-osc-MDT0000: operation ost_destroy to node 0@lo failed: rc = -19 [ 2067.072987] Lustre: lustre-OST0000: Not available for connect from 0@lo (stopping) [ 2067.072988] Lustre: Skipped 2 previous similar messages [ 2067.080855] LustreError: 16838:0:(ldlm_resource.c:1126:ldlm_resource_complain()) filter-lustre-OST0000_UUID: namespace resource [0x7a4:0x0:0x0].0x0 (ffff88012fcd7d00) refcount nonzero (2) after lock cleanup; forcing cleanup. [ 2076.998664] LustreError: 137-5: lustre-OST0000_UUID: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 2077.003046] LustreError: Skipped 2 previous similar messages [ 2084.977883] Lustre: lustre-OST0000: Imperative Recovery not enabled, recovery window 60-180 [ 2084.981906] Lustre: Skipped 8 previous similar messages [ 2085.908857] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 2086.962308] Lustre: *** cfs_fail_loc=32a, val=0*** [ 2086.963954] Lustre: Skipped 1 previous similar message [ 2086.987722] Lustre: lustre-OST0000-osc-MDT0000: Connection restored to 192.168.201.150@tcp (at 0@lo) [ 2086.990201] Lustre: lustre-OST0000: deleting orphan objects from 0x0:3108 to 0x0:3137 [ 2086.997499] Lustre: Skipped 24 previous similar messages [ 2088.960389] Lustre: DEBUG MARKER: oleg150-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid [ 2089.332380] Lustre: DEBUG MARKER: osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec [ 2093.946820] Lustre: DEBUG MARKER: == replay-dual test 29: replay vs update with the same xid ========================================================== 05:16:56 (1713345416) [ 2094.300268] Lustre: DEBUG MARKER: SKIP: replay-dual test_29 needs >= 2 MDTs [ 2094.703145] Lustre: DEBUG MARKER: == replay-dual test 30: layout lock replay is not blocked on IO ========================================================== 05:16:57 (1713345417) [ 2104.853984] Lustre: ll_ost00_003: service thread pid 8892 was inactive for 40.025 seconds. The thread might be hung, or it might only be slow and will resume later. Dumping the stack trace for debugging purposes: [ 2104.853987] Pid: 17677, comm: ll_ost00_008 3.10.0-7.9-debug #1 SMP Sat Mar 26 23:28:42 EDT 2022 [ 2104.853997] Call Trace: [ 2104.854104] [<0>] ldlm_completion_ast+0x7f7/0xa50 [ptlrpc] [ 2104.854157] [<0>] ldlm_cli_enqueue_local+0x259/0x880 [ptlrpc] [ 2104.854178] [<0>] ofd_destroy_by_fid+0x1d1/0x520 [ofd] [ 2104.854183] [<0>] ofd_destroy_hdl+0x20f/0xa80 [ofd] [ 2104.854240] [<0>] tgt_request_handle+0x93a/0x19c0 [ptlrpc] [ 2104.854278] [<0>] ptlrpc_server_handle_request+0x250/0xc30 [ptlrpc] [ 2104.854314] [<0>] ptlrpc_main+0xbd9/0x15f0 [ptlrpc] [ 2104.854322] [<0>] kthread+0xe4/0xf0 [ 2104.854327] [<0>] ret_from_fork_nospec_begin+0x7/0x21 [ 2104.854342] [<0>] 0xfffffffffffffffe [ 2104.854347] Pid: 8133, comm: ll_ost00_002 3.10.0-7.9-debug #1 SMP Sat Mar 26 23:28:42 EDT 2022 [ 2104.854347] Call Trace: [ 2104.854429] [<0>] ldlm_completion_ast+0x7f7/0xa50 [ptlrpc] [ 2104.854464] [<0>] ldlm_cli_enqueue_local+0x259/0x880 [ptlrpc] [ 2104.854481] [<0>] ofd_destroy_by_fid+0x1d1/0x520 [ofd] [ 2104.854486] [<0>] ofd_destroy_hdl+0x20f/0xa80 [ofd] [ 2104.854539] [<0>] tgt_request_handle+0x93a/0x19c0 [ptlrpc] [ 2104.854579] [<0>] ptlrpc_server_handle_request+0x250/0xc30 [ptlrpc] [ 2104.854617] [<0>] ptlrpc_main+0xbd9/0x15f0 [ptlrpc] [ 2104.854624] [<0>] kthread+0xe4/0xf0 [ 2104.854627] [<0>] ret_from_fork_nospec_begin+0x7/0x21 [ 2104.854640] [<0>] 0xfffffffffffffffe [ 2104.854647] Lustre: ll_ost00_000: service thread pid 8131 was inactive for 40.026 seconds. Watchdog stack traces are limited to 3 per 300 seconds, skipping this one. [ 2104.898879] Lustre: Skipped 2 previous similar messages [ 2104.900646] Pid: 8892, comm: ll_ost00_003 3.10.0-7.9-debug #1 SMP Sat Mar 26 23:28:42 EDT 2022 [ 2104.904179] Call Trace: [ 2104.905325] [<0>] ldlm_completion_ast+0x7f7/0xa50 [ptlrpc] [ 2104.907616] [<0>] ldlm_cli_enqueue_local+0x259/0x880 [ptlrpc] [ 2104.909815] [<0>] ofd_destroy_by_fid+0x1d1/0x520 [ofd] [ 2104.911845] [<0>] ofd_destroy_hdl+0x20f/0xa80 [ofd] [ 2104.913835] [<0>] tgt_request_handle+0x93a/0x19c0 [ptlrpc] [ 2104.915468] [<0>] ptlrpc_server_handle_request+0x250/0xc30 [ptlrpc] [ 2104.917288] [<0>] ptlrpc_main+0xbd9/0x15f0 [ptlrpc] [ 2104.918191] [<0>] kthread+0xe4/0xf0 [ 2104.918995] [<0>] ret_from_fork_nospec_begin+0x7/0x21 [ 2104.920261] [<0>] 0xfffffffffffffffe [ 2113.939288] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 2127.144435] Lustre: lustre-OST0000: deleting orphan objects from 0x0:3108 to 0x0:3169 [ 2127.144455] Lustre: lustre-OST0001: deleting orphan objects from 0x0:3043 to 0x0:3073 [ 2128.392012] Lustre: DEBUG MARKER: oleg150-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 2128.726497] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 2131.265446] Lustre: DEBUG MARKER: == replay-dual test 31: deadlock on file_remove_privs and occupied mod rpc slots ========================================================== 05:17:33 (1713345453) [ 2132.166487] LustreError: 11-0: lustre-OST0000-osc-MDT0000: operation ost_statfs to node 0@lo failed: rc = -107 [ 2132.169692] LustreError: Skipped 7 previous similar messages [ 2132.171901] LustreError: 137-5: lustre-OST0000_UUID: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 2132.177149] LustreError: Skipped 5 previous similar messages [ 2149.790825] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug -1 all 8 [ 2152.886998] Lustre: DEBUG MARKER: oleg150-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid [ 2153.285223] Lustre: DEBUG MARKER: osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in IDLE [ 2156.170480] Lustre: DEBUG MARKER: == replay-dual test 32: gap in update llog shouldn't break recovery ========================================================== 05:17:58 (1713345478) [ 2156.505580] Lustre: DEBUG MARKER: SKIP: replay-dual test_32 needs >= 2 MDTs [ 2156.899937] Lustre: DEBUG MARKER: == replay-dual test 33: Check for OBD_INCOMPAT_MULTI_RPCS in last_rcvd after abort_recovery ========================================================== 05:17:59 (1713345479) [ 2157.263742] Lustre: DEBUG MARKER: SKIP: replay-dual test_33 ldiskfs only test [ 2157.627285] Lustre: DEBUG MARKER: == replay-dual test complete, duration 2068 sec ========== 05:18:00 (1713345480) [ 2171.433282] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all 8 [ 2173.107583] Lustre: lustre-MDT0000: Recovery over after 0:01, of 1 clients 1 recovered and 0 were evicted. [ 2173.110013] Lustre: Skipped 11 previous similar messages [ 2173.122049] Lustre: lustre-OST0001: deleting orphan objects from 0x0:3043 to 0x0:3105 [ 2173.122050] Lustre: lustre-OST0000: deleting orphan objects from 0x0:3209 to 0x0:3233 [ 2174.263892] Lustre: DEBUG MARKER: oleg150-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 2174.587058] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 2175.928186] Lustre: 8892:0:(service.c:2333:ptlrpc_server_handle_request()) @@@ Request took longer than estimated (56/56s); client may timeout req@ffff8800a0ead580 x1796570730166656/t4294973215(0) o6->lustre-MDT0000-mdtlov_UUID@0@lo:538/0 lens 544/432 e 2 to 0 dl 1713345443 ref 1 fl Complete:/0/0 rc 0/0 job:'osp-syn-1-0.0' [ 2175.928401] LustreError: 17715:0:(osd_handler.c:224:osd_trans_start()) lustre-OST0001: can't assign tx: rc = -2 [ 2175.929569] LustreError: 17653:0:(client.c:1256:ptlrpc_import_delay_req()) @@@ IMP_CLOSED req@ffff88008aff3440 x1796570730202176/t0(0) o104->lustre-OST0001@192.168.201.50@tcp:15/16 lens 328/224 e 0 to 0 dl 0 ref 1 fl Rpc:QU/0/ffffffff rc 0/-1 job:'' [ 2175.950118] Lustre: 8892:0:(service.c:2333:ptlrpc_server_handle_request()) Skipped 7 previous similar messages [ 2176.561288] Lustre: server umount lustre-MDT0000 complete [ 2176.562782] Lustre: Skipped 11 previous similar messages [ 2177.595334] LustreError: 9584:0:(ldlm_lockd.c:2521:ldlm_cancel_handler()) ldlm_cancel from 0@lo arrived at 1713345500 with bad export cookie 1960271907192533728 [ 2193.809207] Lustre: DEBUG MARKER: oleg150-server.virtnet: executing unload_modules_local [ 2194.307260] Key type lgssc unregistered [ 2194.362224] LNet: 27702:0:(lib-ptl.c:958:lnet_clear_lazy_portal()) Active lazy portal 0 on exit [ 2194.365243] LNet: Removed LNI 192.168.201.150@tcp