[ 0.000000] Initializing cgroup subsys cpuset
[ 0.000000] Initializing cgroup subsys cpu
[ 0.000000] Initializing cgroup subsys cpuacct
[ 0.000000] Linux version 3.10.0-7.9-debug (green@centos7-base) (gcc version 4.8.5 20150623 (Red Hat 4.8.5-44) (GCC) ) #1 SMP Sat Mar 26 23:28:42 EDT 2022
[ 0.000000] Command line: rd.shell root=nbd:192.168.200.253:centos7:ext4:ro:-p,-b4096 ro crashkernel=128M panic=1 nomodeset ipmtu=9000 ip=dhcp rd.neednet=1 init_on_free=off mitigations=off console=ttyS1,115200 audit=0
[ 0.000000] e820: BIOS-provided physical RAM map:
[ 0.000000] BIOS-e820: [mem 0x0000000000000000-0x000000000009fbff] usable
[ 0.000000] BIOS-e820: [mem 0x000000000009fc00-0x000000000009ffff] reserved
[ 0.000000] BIOS-e820: [mem 0x00000000000f0000-0x00000000000fffff] reserved
[ 0.000000] BIOS-e820: [mem 0x0000000000100000-0x00000000bffcdfff] usable
[ 0.000000] BIOS-e820: [mem 0x00000000bffce000-0x00000000bfffffff] reserved
[ 0.000000] BIOS-e820: [mem 0x00000000feffc000-0x00000000feffffff] reserved
[ 0.000000] BIOS-e820: [mem 0x00000000fffc0000-0x00000000ffffffff] reserved
[ 0.000000] BIOS-e820: [mem 0x0000000100000000-0x000000013edfffff] usable
[ 0.000000] NX (Execute Disable) protection: active
[ 0.000000] SMBIOS 3.0.0 present.
[ 0.000000] DMI: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.16.3-1.fc39 04/01/2014
[ 0.000000] Hypervisor detected: KVM
[ 0.000000] e820: last_pfn = 0x13ee00 max_arch_pfn = 0x400000000
[ 0.000000] PAT configuration [0-7]: WB WC UC- UC WB WP UC- UC
[ 0.000000] e820: last_pfn = 0xbffce max_arch_pfn = 0x400000000
[ 0.000000] found SMP MP-table at [mem 0x000f53f0-0x000f53ff] mapped at [ffffffffff2003f0]
[ 0.000000] Using GB pages for direct mapping
[ 0.000000] RAMDISK: [mem 0xbc2e2000-0xbffbffff]
[ 0.000000] Early table checksum verification disabled
[ 0.000000] ACPI: RSDP 00000000000f5200 00014 (v00 BOCHS )
[ 0.000000] ACPI: RSDT 00000000bffe1d87 00034 (v01 BOCHS BXPC 00000001 BXPC 00000001)
[ 0.000000] ACPI: FACP 00000000bffe1c23 00074 (v01 BOCHS BXPC 00000001 BXPC 00000001)
[ 0.000000] ACPI: DSDT 00000000bffe0040 01BE3 (v01 BOCHS BXPC 00000001 BXPC 00000001)
[ 0.000000] ACPI: FACS 00000000bffe0000 00040
[ 0.000000] ACPI: APIC 00000000bffe1c97 00090 (v03 BOCHS BXPC 00000001 BXPC 00000001)
[ 0.000000] ACPI: HPET 00000000bffe1d27 00038 (v01 BOCHS BXPC 00000001 BXPC 00000001)
[ 0.000000] ACPI: WAET 00000000bffe1d5f 00028 (v01 BOCHS BXPC 00000001 BXPC 00000001)
[ 0.000000] No NUMA configuration found
[ 0.000000] Faking a node at [mem 0x0000000000000000-0x000000013edfffff]
[ 0.000000] NODE_DATA(0) allocated [mem 0x13e5e3000-0x13e609fff]
[ 0.000000] Reserving 128MB of memory at 768MB for crashkernel (System RAM: 4077MB)
[ 0.000000] kvm-clock: cpu 0, msr 1:3e592001, primary cpu clock
[ 0.000000] kvm-clock: Using msrs 4b564d01 and 4b564d00
[ 0.000000] kvm-clock: using sched offset of 373351048 cycles
[ 0.000000] Zone ranges:
[ 0.000000] DMA [mem 0x00001000-0x00ffffff]
[ 0.000000] DMA32 [mem 0x01000000-0xffffffff]
[ 0.000000] Normal [mem 0x100000000-0x13edfffff]
[ 0.000000] Movable zone start for each node
[ 0.000000] Early memory node ranges
[ 0.000000] node 0: [mem 0x00001000-0x0009efff]
[ 0.000000] node 0: [mem 0x00100000-0xbffcdfff]
[ 0.000000] node 0: [mem 0x100000000-0x13edfffff]
[ 0.000000] Initmem setup node 0 [mem 0x00001000-0x13edfffff]
[ 0.000000] ACPI: PM-Timer IO Port: 0x608
[ 0.000000] ACPI: LAPIC (acpi_id[0x00] lapic_id[0x00] enabled)
[ 0.000000] ACPI: LAPIC (acpi_id[0x01] lapic_id[0x01] enabled)
[ 0.000000] ACPI: LAPIC (acpi_id[0x02] lapic_id[0x02] enabled)
[ 0.000000] ACPI: LAPIC (acpi_id[0x03] lapic_id[0x03] enabled)
[ 0.000000] ACPI: LAPIC_NMI (acpi_id[0xff] dfl dfl lint[0x1])
[ 0.000000] ACPI: IOAPIC (id[0x00] address[0xfec00000] gsi_base[0])
[ 0.000000] IOAPIC[0]: apic_id 0, version 17, address 0xfec00000, GSI 0-23
[ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl)
[ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 5 global_irq 5 high level)
[ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 high level)
[ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 10 global_irq 10 high level)
[ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 11 global_irq 11 high level)
[ 0.000000] Using ACPI (MADT) for SMP configuration information
[ 0.000000] ACPI: HPET id: 0x8086a201 base: 0xfed00000
[ 0.000000] smpboot: Allowing 4 CPUs, 0 hotplug CPUs
[ 0.000000] PM: Registered nosave memory: [mem 0x0009f000-0x0009ffff]
[ 0.000000] PM: Registered nosave memory: [mem 0x000a0000-0x000effff]
[ 0.000000] PM: Registered nosave memory: [mem 0x000f0000-0x000fffff]
[ 0.000000] PM: Registered nosave memory: [mem 0xbffce000-0xbfffffff]
[ 0.000000] PM: Registered nosave memory: [mem 0xc0000000-0xfeffbfff]
[ 0.000000] PM: Registered nosave memory: [mem 0xfeffc000-0xfeffffff]
[ 0.000000] PM: Registered nosave memory: [mem 0xff000000-0xfffbffff]
[ 0.000000] PM: Registered nosave memory: [mem 0xfffc0000-0xffffffff]
[ 0.000000] e820: [mem 0xc0000000-0xfeffbfff] available for PCI devices
[ 0.000000] Booting paravirtualized kernel on KVM
[ 0.000000] setup_percpu: NR_CPUS:5120 nr_cpumask_bits:4 nr_cpu_ids:4 nr_node_ids:1
[ 0.000000] percpu: Embedded 38 pages/cpu s115176 r8192 d32280 u524288
[ 0.000000] KVM setup async PF for cpu 0
[ 0.000000] kvm-stealtime: cpu 0, msr 13e2135c0
[ 0.000000] PV qspinlock hash table entries: 256 (order: 0, 4096 bytes)
[ 0.000000] Built 1 zonelists in Node order, mobility grouping on. Total pages: 1027487
[ 0.000000] Policy zone: Normal
[ 0.000000] Kernel command line: rd.shell root=nbd:192.168.200.253:centos7:ext4:ro:-p,-b4096 ro crashkernel=128M panic=1 nomodeset ipmtu=9000 ip=dhcp rd.neednet=1 init_on_free=off mitigations=off console=ttyS1,115200 audit=0
[ 0.000000] audit: disabled (until reboot)
[ 0.000000] PID hash table entries: 4096 (order: 3, 32768 bytes)
[ 0.000000] x86/fpu: xstate_offset[2]: 0240, xstate_sizes[2]: 0100
[ 0.000000] xsave: enabled xstate_bv 0x7, cntxt size 0x340 using standard form
[ 0.000000] Memory: 3820268k/5224448k available (8172k kernel code, 1049168k absent, 355012k reserved, 5773k data, 2532k init)
[ 0.000000] SLUB: HWalign=64, Order=0-3, MinObjects=0, CPUs=4, Nodes=1
[ 0.000000] Hierarchical RCU implementation.
[ 0.000000] RCU restricting CPUs from NR_CPUS=5120 to nr_cpu_ids=4.
[ 0.000000] Offload RCU callbacks from all CPUs
[ 0.000000] Offload RCU callbacks from CPUs: 0-3.
[ 0.000000] NR_IRQS:327936 nr_irqs:456 0
[ 0.000000] Console: colour *CGA 80x25
[ 0.000000] console [ttyS1] enabled
[ 0.000000] allocated 25165824 bytes of page_cgroup
[ 0.000000] please try 'cgroup_disable=memory' option if you don't want memory cgroups
[ 0.000000] kmemleak: Kernel memory leak detector disabled
[ 0.000000] tsc: Detected 2399.998 MHz processor
[ 0.437399] Calibrating delay loop (skipped) preset value.. 4799.99 BogoMIPS (lpj=2399998)
[ 0.440001] pid_max: default: 32768 minimum: 301
[ 0.441355] Security Framework initialized
[ 0.442498] SELinux: Initializing.
[ 0.445211] Dentry cache hash table entries: 524288 (order: 10, 4194304 bytes)
[ 0.449483] Inode-cache hash table entries: 262144 (order: 9, 2097152 bytes)
[ 0.452307] Mount-cache hash table entries: 8192 (order: 4, 65536 bytes)
[ 0.453553] Mountpoint-cache hash table entries: 8192 (order: 4, 65536 bytes)
[ 0.456246] Initializing cgroup subsys memory
[ 0.457695] Initializing cgroup subsys devices
[ 0.459057] Initializing cgroup subsys freezer
[ 0.459986] Initializing cgroup subsys net_cls
[ 0.461409] Initializing cgroup subsys blkio
[ 0.462883] Initializing cgroup subsys perf_event
[ 0.464502] Initializing cgroup subsys hugetlb
[ 0.466155] Initializing cgroup subsys pids
[ 0.467562] Initializing cgroup subsys net_prio
[ 0.469550] x86/cpu: User Mode Instruction Prevention (UMIP) activated
[ 0.472958] Last level iTLB entries: 4KB 0, 2MB 0, 4MB 0
[ 0.475201] Last level dTLB entries: 4KB 0, 2MB 0, 4MB 0
[ 0.476465] tlb_flushall_shift: 6
[ 0.477162] FEATURE SPEC_CTRL Present
[ 0.478184] FEATURE IBPB_SUPPORT Present
[ 0.479038] Spectre V2 : Enabling Indirect Branch Prediction Barrier
[ 0.480836] Spectre V2 : Vulnerable
[ 0.481560] Speculative Store Bypass: Vulnerable
[ 0.484429] debug: unmapping init [mem 0xffffffff82019000-0xffffffff8201ffff]
[ 0.492054] ACPI: Core revision 20130517
[ 0.495336] ACPI: All ACPI Tables successfully acquired
[ 0.496738] ftrace: allocating 30294 entries in 119 pages
[ 0.549048] Enabling x2apic
[ 0.550128] Enabled x2apic
[ 0.551540] Switched APIC routing to physical x2apic.
[ 0.555270] ..TIMER: vector=0x30 apic1=0 pin1=2 apic2=-1 pin2=-1
[ 0.557562] smpboot: CPU0: Intel(R) Xeon(R) CPU E5-2695 v2 @ 2.40GHz (fam: 06, model: 3e, stepping: 04)
[ 0.561536] Performance Events: IvyBridge events, full-width counters, Intel PMU driver.
[ 0.564192] ... version: 2
[ 0.565407] ... bit width: 48
[ 0.566471] ... generic registers: 4
[ 0.567587] ... value mask: 0000ffffffffffff
[ 0.569358] ... max period: 00007fffffffffff
[ 0.571011] ... fixed-purpose events: 3
[ 0.572202] ... event mask: 000000070000000f
[ 0.573804] KVM setup paravirtual spinlock
[ 0.577176] smpboot: Booting Node 0, Processors #1
[ 0.578919] kvm-clock: cpu 1, msr 1:3e592041, secondary cpu clock
[ 0.586366] KVM setup async PF for cpu 1
[ 0.588540] kvm-stealtime: cpu 1, msr 13e2935c0
#2[ 0.593857] kvm-clock: cpu 2, msr 1:3e592081, secondary cpu clock
[ 0.600468] KVM setup async PF for cpu 2
[ 0.602695] kvm-clock: cpu 3, msr 1:3e5920c1, secondary cpu clock
#3 OK
[ 0.609419] kvm-stealtime: cpu 2, msr 13e3135c0
[ 0.614449] KVM setup async PF for cpu 3
[ 0.614470] Brought up 4 CPUs
[ 0.614471] smpboot: Max logical packages: 1
[ 0.614474] smpboot: Total of 4 processors activated (19199.98 BogoMIPS)
[ 0.645220] kvm-stealtime: cpu 3, msr 13e3935c0
[ 0.651175] devtmpfs: initialized
[ 0.653446] x86/mm: Memory block size: 128MB
[ 0.660707] EVM: security.selinux
[ 0.661947] EVM: security.ima
[ 0.662734] EVM: security.capability
[ 0.669909] atomic64 test passed for x86-64 platform with CX8 and with SSE
[ 0.672740] NET: Registered protocol family 16
[ 0.674773] cpuidle: using governor haltpoll
[ 0.677016] ACPI: bus type PCI registered
[ 0.678354] acpiphp: ACPI Hot Plug PCI Controller Driver version: 0.5
[ 0.681186] PCI: Using configuration type 1 for base access
[ 0.683315] core: PMU erratum BJ122, BV98, HSD29 worked around, HT is on
[ 0.693224] ACPI: Added _OSI(Module Device)
[ 0.696185] ACPI: Added _OSI(Processor Device)
[ 0.698072] ACPI: Added _OSI(3.0 _SCP Extensions)
[ 0.699899] ACPI: Added _OSI(Processor Aggregator Device)
[ 0.701789] ACPI: Added _OSI(Linux-Dell-Video)
[ 0.707058] ACPI: Interpreter enabled
[ 0.708348] ACPI: (supports S0 S3 S4 S5)
[ 0.710140] ACPI: Using IOAPIC for interrupt routing
[ 0.711767] PCI: Using host bridge windows from ACPI; if necessary, use "pci=nocrs" and report a bug
[ 0.716862] ACPI: Enabled 2 GPEs in block 00 to 0F
[ 0.724806] ACPI: PCI Root Bridge [PCI0] (domain 0000 [bus 00-ff])
[ 0.727131] acpi PNP0A03:00: _OSC: OS supports [ASPM ClockPM Segments MSI]
[ 0.729587] acpi PNP0A03:00: _OSC failed (AE_NOT_FOUND); disabling ASPM
[ 0.732057] acpi PNP0A03:00: fail to add MMCONFIG information, can't access extended PCI configuration space under this bridge.
[ 0.736228] acpiphp: Slot [2] registered
[ 0.737681] acpiphp: Slot [5] registered
[ 0.738886] acpiphp: Slot [6] registered
[ 0.739997] acpiphp: Slot [7] registered
[ 0.741526] acpiphp: Slot [8] registered
[ 0.742942] acpiphp: Slot [9] registered
[ 0.746901] acpiphp: Slot [10] registered
[ 0.748532] acpiphp: Slot [3] registered
[ 0.750120] acpiphp: Slot [4] registered
[ 0.751831] acpiphp: Slot [11] registered
[ 0.753314] acpiphp: Slot [12] registered
[ 0.754854] acpiphp: Slot [13] registered
[ 0.756610] acpiphp: Slot [14] registered
[ 0.758094] acpiphp: Slot [15] registered
[ 0.759320] acpiphp: Slot [16] registered
[ 0.760876] acpiphp: Slot [17] registered
[ 0.762233] acpiphp: Slot [18] registered
[ 0.763414] acpiphp: Slot [19] registered
[ 0.764862] acpiphp: Slot [20] registered
[ 0.765965] acpiphp: Slot [21] registered
[ 0.767357] acpiphp: Slot [22] registered
[ 0.768877] acpiphp: Slot [23] registered
[ 0.770451] acpiphp: Slot [24] registered
[ 0.772044] acpiphp: Slot [25] registered
[ 0.773673] acpiphp: Slot [26] registered
[ 0.775293] acpiphp: Slot [27] registered
[ 0.776641] acpiphp: Slot [28] registered
[ 0.778116] acpiphp: Slot [29] registered
[ 0.779514] acpiphp: Slot [30] registered
[ 0.781126] acpiphp: Slot [31] registered
[ 0.782651] PCI host bridge to bus 0000:00
[ 0.784105] pci_bus 0000:00: root bus resource [io 0x0000-0x0cf7 window]
[ 0.786437] pci_bus 0000:00: root bus resource [io 0x0d00-0xffff window]
[ 0.788816] pci_bus 0000:00: root bus resource [mem 0x000a0000-0x000bffff window]
[ 0.791297] pci_bus 0000:00: root bus resource [mem 0xc0000000-0xfebfffff window]
[ 0.793976] pci_bus 0000:00: root bus resource [mem 0x380000000000-0x38007fffffff window]
[ 0.807238] pci_bus 0000:00: root bus resource [bus 00-ff]
[ 0.825264] pci 0000:00:01.1: legacy IDE quirk: reg 0x10: [io 0x01f0-0x01f7]
[ 0.827910] pci 0000:00:01.1: legacy IDE quirk: reg 0x14: [io 0x03f6]
[ 0.830383] pci 0000:00:01.1: legacy IDE quirk: reg 0x18: [io 0x0170-0x0177]
[ 0.832770] pci 0000:00:01.1: legacy IDE quirk: reg 0x1c: [io 0x0376]
[ 0.836279] pci 0000:00:01.3: quirk: [io 0x0600-0x063f] claimed by PIIX4 ACPI
[ 0.838631] pci 0000:00:01.3: quirk: [io 0x0700-0x070f] claimed by PIIX4 SMB
[ 1.139138] ACPI: PCI Interrupt Link [LNKA] (IRQs 5 *10 11)
[ 1.144828] ACPI: PCI Interrupt Link [LNKB] (IRQs 5 *10 11)
[ 1.151631] ACPI: PCI Interrupt Link [LNKC] (IRQs 5 10 *11)
[ 1.154312] ACPI: PCI Interrupt Link [LNKD] (IRQs 5 10 *11)
[ 1.156511] ACPI: PCI Interrupt Link [LNKS] (IRQs *9)
[ 1.160791] vgaarb: loaded
[ 1.161945] SCSI subsystem initialized
[ 1.164749] ACPI: bus type USB registered
[ 1.166432] usbcore: registered new interface driver usbfs
[ 1.168048] usbcore: registered new interface driver hub
[ 1.170711] usbcore: registered new device driver usb
[ 1.173262] PCI: Using ACPI for IRQ routing
[ 1.175420] NetLabel: Initializing
[ 1.179206] NetLabel: domain hash size = 128
[ 1.180924] NetLabel: protocols = UNLABELED CIPSOv4
[ 1.182830] NetLabel: unlabeled traffic allowed by default
[ 1.185107] hpet0: at MMIO 0xfed00000, IRQs 2, 8, 0
[ 1.187169] hpet0: 3 comparators, 64-bit 100.000000 MHz counter
[ 1.193356] amd_nb: Cannot enumerate AMD northbridges
[ 1.195688] Switched to clocksource kvm-clock
[ 1.215943] pnp: PnP ACPI init
[ 1.217075] ACPI: bus type PNP registered
[ 1.219288] pnp: PnP ACPI: found 6 devices
[ 1.220541] ACPI: bus type PNP unregistered
[ 1.236987] NET: Registered protocol family 2
[ 1.239396] TCP established hash table entries: 32768 (order: 6, 262144 bytes)
[ 1.242363] TCP bind hash table entries: 32768 (order: 8, 1048576 bytes)
[ 1.245450] TCP: Hash tables configured (established 32768 bind 32768)
[ 1.248117] TCP: reno registered
[ 1.250374] UDP hash table entries: 2048 (order: 5, 196608 bytes)
[ 1.258931] UDP-Lite hash table entries: 2048 (order: 5, 196608 bytes)
[ 1.261186] NET: Registered protocol family 1
[ 1.263416] RPC: Registered named UNIX socket transport module.
[ 1.265193] RPC: Registered udp transport module.
[ 1.266667] RPC: Registered tcp transport module.
[ 1.267763] RPC: Registered tcp NFSv4.1 backchannel transport module.
[ 1.270037] pci 0000:00:00.0: Limiting direct PCI/PCI transfers
[ 1.272003] pci 0000:00:01.0: PIIX3: Enabling Passive Release
[ 1.274568] pci 0000:00:01.0: Activating ISA DMA hang workarounds
[ 1.278019] Unpacking initramfs...
[ 2.697124] debug: unmapping init [mem 0xffff8800bc2e2000-0xffff8800bffbffff]
[ 2.701043] PCI-DMA: Using software bounce buffering for IO (SWIOTLB)
[ 2.703177] software IO TLB [mem 0xb82e2000-0xbc2e2000] (64MB) mapped at [ffff8800b82e2000-ffff8800bc2e1fff]
[ 2.706847] RAPL PMU: API unit is 2^-32 Joules, 3 fixed counters, 10737418240 ms ovfl timer
[ 2.709728] RAPL PMU: hw unit of domain pp0-core 2^-0 Joules
[ 2.711925] RAPL PMU: hw unit of domain package 2^-0 Joules
[ 2.713907] RAPL PMU: hw unit of domain dram 2^-0 Joules
[ 2.721356] cryptomgr_test (52) used greatest stack depth: 14480 bytes left
[ 2.722647] futex hash table entries: 1024 (order: 4, 65536 bytes)
[ 2.722690] Initialise system trusted keyring
[ 2.754040] HugeTLB registered 1 GB page size, pre-allocated 0 pages
[ 2.756420] HugeTLB registered 2 MB page size, pre-allocated 0 pages
[ 2.763731] zpool: loaded
[ 2.764831] zbud: loaded
[ 2.766174] VFS: Disk quotas dquot_6.6.0
[ 2.767769] Dquot-cache hash table entries: 512 (order 0, 4096 bytes)
[ 2.771138] NFS: Registering the id_resolver key type
[ 2.772740] Key type id_resolver registered
[ 2.773827] Key type id_legacy registered
[ 2.774924] nfs4filelayout_init: NFSv4 File Layout Driver Registering...
[ 2.777527] Key type big_key registered
[ 2.785834] cryptomgr_test (58) used greatest stack depth: 14048 bytes left
[ 2.795886] cryptomgr_test (63) used greatest stack depth: 13984 bytes left
[ 2.802088] NET: Registered protocol family 38
[ 2.805920] Key type asymmetric registered
[ 2.807485] Asymmetric key parser 'x509' registered
[ 2.809024] Block layer SCSI generic (bsg) driver version 0.4 loaded (major 250)
[ 2.813039] io scheduler noop registered
[ 2.814563] io scheduler deadline registered (default)
[ 2.817955] io scheduler cfq registered
[ 2.819020] io scheduler mq-deadline registered
[ 2.820134] io scheduler kyber registered
[ 2.825770] pci_hotplug: PCI Hot Plug PCI Core version: 0.5
[ 2.829389] pciehp: PCI Express Hot Plug Controller Driver version: 0.4
[ 2.833515] input: Power Button as /devices/LNXSYSTM:00/LNXPWRBN:00/input/input0
[ 2.836895] ACPI: Power Button [PWRF]
[ 2.838982] GHES: HEST is not enabled!
[ 2.905587] ACPI: PCI Interrupt Link [LNKB] enabled at IRQ 10
[ 2.968462] ACPI: PCI Interrupt Link [LNKA] enabled at IRQ 11
[ 3.095692] ACPI: PCI Interrupt Link [LNKC] enabled at IRQ 11
[ 3.160127] ACPI: PCI Interrupt Link [LNKD] enabled at IRQ 10
[ 3.286591] Serial: 8250/16550 driver, 4 ports, IRQ sharing enabled
[ 3.316461] 00:03: ttyS1 at I/O 0x2f8 (irq = 3) is a 16550A
[ 3.347383] 00:04: ttyS0 at I/O 0x3f8 (irq = 4) is a 16550A
[ 3.353495] Non-volatile memory driver v1.3
[ 3.355066] Linux agpgart interface v0.103
[ 3.356750] crash memory driver: version 1.1
[ 3.358647] nbd: registered device at major 43
[ 3.377860] virtio_blk virtio1: [vda] 60784 512-byte logical blocks (31.1 MB/29.6 MiB)
[ 3.400824] virtio_blk virtio2: [vdb] 2097152 512-byte logical blocks (1.07 GB/1.00 GiB)
[ 3.420123] virtio_blk virtio3: [vdc] 5120000 512-byte logical blocks (2.62 GB/2.44 GiB)
[ 3.443203] virtio_blk virtio4: [vdd] 5120000 512-byte logical blocks (2.62 GB/2.44 GiB)
[ 3.458062] virtio_blk virtio5: [vde] 8388608 512-byte logical blocks (4.29 GB/4.00 GiB)
[ 3.474009] virtio_blk virtio6: [vdf] 8388608 512-byte logical blocks (4.29 GB/4.00 GiB)
[ 3.484072] rdac: device handler registered
[ 3.486356] hp_sw: device handler registered
[ 3.487999] emc: device handler registered
[ 3.489787] libphy: Fixed MDIO Bus: probed
[ 3.498202] ehci_hcd: USB 2.0 'Enhanced' Host Controller (EHCI) Driver
[ 3.499718] ehci-pci: EHCI PCI platform driver
[ 3.501226] ohci_hcd: USB 1.1 'Open' Host Controller (OHCI) Driver
[ 3.503566] ohci-pci: OHCI PCI platform driver
[ 3.504893] uhci_hcd: USB Universal Host Controller Interface driver
[ 3.507058] i8042: PNP: PS/2 Controller [PNP0303:KBD,PNP0f13:MOU] at 0x60,0x64 irq 1,12
[ 3.511124] serio: i8042 KBD port at 0x60,0x64 irq 1
[ 3.512514] serio: i8042 AUX port at 0x60,0x64 irq 12
[ 3.514766] mousedev: PS/2 mouse device common for all mice
[ 3.518534] input: AT Translated Set 2 keyboard as /devices/platform/i8042/serio0/input/input1
[ 3.520879] rtc_cmos 00:05: RTC can wake from S4
[ 3.525328] rtc_cmos 00:05: rtc core: registered rtc_cmos as rtc0
[ 3.527931] rtc_cmos 00:05: alarms up to one day, y3k, 242 bytes nvram, hpet irqs
[ 3.534158] hidraw: raw HID events driver (C) Jiri Kosina
[ 3.536703] usbcore: registered new interface driver usbhid
[ 3.539605] usbhid: USB HID core driver
[ 3.541233] drop_monitor: Initializing network drop monitor service
[ 3.543488] Netfilter messages via NETLINK v0.30.
[ 3.545734] TCP: cubic registered
[ 3.547270] Initializing XFRM netlink socket
[ 3.549305] NET: Registered protocol family 10
[ 3.551769] NET: Registered protocol family 17
[ 3.552585] Key type dns_resolver registered
[ 3.553840] mce: Using 10 MCE banks
[ 3.555633] Loading compiled-in X.509 certificates
[ 3.558054] Loaded X.509 cert 'Magrathea: Glacier signing key: e34d0e1b7fcf5b414cce75d36d8482945c781ed6'
[ 3.560946] registered taskstats version 1
[ 3.565901] modprobe (72) used greatest stack depth: 13456 bytes left
[ 3.573788] Key type trusted registered
[ 3.585355] Key type encrypted registered
[ 3.586954] IMA: No TPM chip found, activating TPM-bypass! (rc=-19)
[ 3.593911] BERT: Boot Error Record Table support is disabled. Enable it by using bert_enable as kernel parameter.
[ 3.601868] rtc_cmos 00:05: setting system clock to 2024-04-17 08:42:26 UTC (1713343346)
[ 3.605058] debug: unmapping init [mem 0xffffffff81da0000-0xffffffff82018fff]
[ 3.609133] Write protecting the kernel read-only data: 12288k
[ 3.612279] debug: unmapping init [mem 0xffff8800017fe000-0xffff8800017fffff]
[ 3.615077] debug: unmapping init [mem 0xffff880001b9b000-0xffff880001bfffff]
[ 3.633236] random: systemd: uninitialized urandom read (16 bytes read)
[ 3.637664] random: systemd: uninitialized urandom read (16 bytes read)
[ 3.640283] random: systemd: uninitialized urandom read (16 bytes read)
[ 3.644765] systemd[1]: systemd 219 running in system mode. (+PAM +AUDIT +SELINUX +IMA -APPARMOR +SMACK +SYSVINIT +UTMP +LIBCRYPTSETUP +GCRYPT +GNUTLS +ACL +XZ +LZ4 -SECCOMP +BLKID +ELFUTILS +KMOD +IDN)
[ 3.652691] systemd[1]: Detected virtualization kvm.
[ 3.654657] systemd[1]: Detected architecture x86-64.
[ 3.663091] systemd[1]: Running in initial RAM disk.

Welcome to CentOS Linux 7 (Core) dracut-033-572.el7 (Initramfs)!

[ 3.672919] systemd[1]: No hostname configured.
[ 3.674563] systemd[1]: Set hostname to .
[ 3.676535] random: systemd: uninitialized urandom read (16 bytes read)
[ 3.678894] systemd[1]: Initializing machine ID from random generator.
[ 3.711159] ln (88) used greatest stack depth: 13024 bytes left
[ 3.718726] tsc: Refined TSC clocksource calibration: 2399.949 MHz
[ 3.761883] random: systemd: uninitialized urandom read (16 bytes read)
[ 3.763875] random: systemd: uninitialized urandom read (16 bytes read)
[ 3.770850] random: systemd: uninitialized urandom read (16 bytes read)
[ 3.773490] random: systemd: uninitialized urandom read (16 bytes read)
[ 3.776767] random: systemd: uninitialized urandom read (16 bytes read)
[ 3.779136] random: systemd: uninitialized urandom read (16 bytes read)
[ 3.789791] systemd[1]: Reached target Timers.
[ OK ] Reached target Timers.
[ 3.794850] systemd[1]: Created slice Root Slice.
[ OK ] Created slice Root Slice.
[ 3.798582] systemd[1]: Listening on udev Kernel Socket.
[ OK ] Listening on udev Kernel Socket.
[ 3.802818] systemd[1]: Listening on Journal Socket.
[ OK ] Listening on Journal Socket.
[ 3.807365] systemd[1]: Listening on udev Control Socket.
[ OK ] Listening on udev Control Socket.
[ 3.814568] systemd[1]: Reached target Sockets.
[ OK ] Reached target Sockets.
[ 3.820372] systemd[1]: Reached target Local File Systems.
[ OK ] Reached target Local File Systems.
[ 3.827518] systemd[1]: Created slice System Slice.
[ OK ] Created slice System Slice.
[ 3.833974] systemd[1]: Starting Create list of required static device nodes for the current kernel...
Starting Create list of required st... nodes for the current kernel...
[ 3.847138] systemd[1]: Starting Journal Service...
Starting Journal Service...
[ 3.851253] systemd[1]: Reached target Slices.
[ OK ] Reached target Slices.
[ 3.859975] systemd[1]: Starting Load Kernel Modules...
Starting Load Kernel Modules...
[ 3.868978] systemd[1]: Starting Setup Virtual Console...
Starting Setup Virtual Console...
[ 3.881082] systemd[1]: Starting dracut cmdline hook...
Starting dracut cmdline hook...
[ 3.886929] systemd[1]: Reached target Swap.
[ OK ] Reached target Swap.
[ 3.894184] systemd[1]: Started Journal Service.
[ OK ] Started Journal Service.
[ OK ] Started Create list of required sta...ce nodes for the current kernel.
[ OK ] Started Load Kernel Modules.
[ OK ] Started Setup Virtual Console.
Starting Apply Kernel Variables...
Starting Create Static Device Nodes in /dev...
[ OK ] Started Apply Kernel Variables.
[ OK ] Started Create Static Device Nodes in /dev.
[ OK ] Started dracut cmdline hook.
Starting dracut pre-udev hook...
[ 4.438770] random: fast init done
[ 4.441719] input: ImExPS/2 Generic Explorer Mouse as /devices/platform/i8042/serio1/input/input2
[ OK ] Started dracut pre-udev hook.
Starting udev Kernel Device Manager...
[ OK ] Started udev Kernel Device Manager.
Starting dracut pre-trigger hook...
[ OK ] Started dracut pre-trigger hook.
Starting udev Coldplug all Devices...
Mounting Configuration File System...
[ OK ] Mounted Configuration File System.
[ OK ] Started udev Coldplug all Devices.
Starting dracut initqueue hook...
[ OK ] Reached target System Initialization.
Starting Show Plymouth Boot Screen...
[ 4.841379] scsi host0: ata_piix
[ 4.842474] scsi host1: ata_piix
[ 4.849587] ata1: PATA max MWDMA2 cmd 0x1f0 ctl 0x3f6 bmdma 0xc320 irq 14
[ 4.856113] ata2: PATA max MWDMA2 cmd 0x170 ctl 0x376 bmdma 0xc328 irq 15
[ OK ] Started Show Plymouth Boot Screen.
[ OK ] Started Forward Password Requests to Plymouth Directory Watch.
[ OK ] Reached target Paths.
[ OK ] Reached target Basic System.
[ 5.177448] ip (345) used greatest stack depth: 12336 bytes left
[ 6.760672] dracut-initqueue[277]: RTNETLINK answers: File exists
[ 7.560394] dracut-initqueue[277]: bs=4096, sz=32212254720 bytes
[ OK ] Started dracut initqueue hook.
[ OK ] Reached target Remote File Systems (Pre).
[ OK ] Reached target Remote File Systems.
Mounting /sysroot...
[ OK ] Reached target Initrd Root File System.
Starting Reload Configuration from the Real Root...
[ 11.459585] EXT4-fs (nbd0): mounted filesystem with ordered data mode. Opts: (null)
[ OK ] Mounted /sysroot.
[ OK ] Started Reload Configuration from the Real Root.
[ OK ] Reached target Initrd File Systems.
[ OK ] Reached target Initrd Default Target.
Starting dracut pre-pivot and cleanup hook...
[ OK ] Started dracut pre-pivot and cleanup hook.
Starting Cleaning Up and Shutting Down Daemons...
[ OK ] Stopped dracut pre-pivot and cleanup hook.
[ OK ] Stopped target Remote File Systems.
[ OK ] Stopped target Timers.
[ OK ] Stopped target Remote File Systems (Pre).
[ OK ] Stopped dracut initqueue hook.
[ OK ] Stopped target Initrd Default Target.
[ OK ] Stopped target Basic System.
[ OK ] Stopped target System Initialization.
[ OK ] Stopped udev Coldplug all Devices.
[ OK ] Stopped dracut pre-trigger hook.
[ OK ] Stopped Apply Kernel Variables.
[ OK ] Stopped target Local File Systems.
[ OK ] Stopped target Swap.
[ OK ] Stopped target Slices.
[ OK ] Stopped target Sockets.
Starting Plymouth switch root service...
[ OK ] Stopped Load Kernel Modules.
Stopping udev Kernel Device Manager...
[ OK ] Stopped target Paths.
[ OK ] Stopped udev Kernel Device Manager.
[ OK ] Stopped dracut pre-udev hook.
[ OK ] Stopped dracut cmdline hook.
[ OK ] Stopped Create Static Device Nodes in /dev.
[ OK ] Stopped Create list of required sta...ce nodes for the current kernel.
[ OK ] Closed udev Kernel Socket.
[ OK ] Closed udev Control Socket.
Starting Cleanup udevd DB...
[ OK ] Started Cleaning Up and Shutting Down Daemons.
[ OK ] Started Plymouth switch root service.
[ OK ] Started Cleanup udevd DB.
[ OK ] Reached target Switch Root.
Starting Switch Root...
[ 11.974930] systemd-journald[102]: Received SIGTERM from PID 1 (systemd).
[ 12.249036] SELinux: Disabled at runtime.
[ 12.323230] ip_tables: (C) 2000-2006 Netfilter Core Team
[ 12.327412] systemd[1]: Inserted module 'ip_tables'

Welcome to CentOS Linux 7 (Core)!
[ OK ] Stopped Switch Root.
[ OK ] Stopped Journal Service.
Starting Journal Service...
[ OK ] Reached target Local Encrypted Volumes.
[ OK ] Set up automount Arbitrary Executab...ats File System Automount Point.
[ OK ] Created slice system-getty.slice.
Mounting POSIX Message Queue File System...
[ OK ] Stopped target Switch Root.
Mounting Huge Pages File System...
[ OK ] Stopped target Initrd File Systems.
Starting Read and set NIS domainname from /etc/sysconfig/network...
[ OK ] Listening on Delayed Shutdown Socket.
[ OK ] Listening on udev Control Socket.
Mounting Debug File System...
Starting Load Kernel Modules...
Starting Set Up Additional Binary Formats...
[ OK ] Listening on /dev/initctl Compatibility Named Pipe.
[ OK ] Stopped target Initrd Root File System.
[ OK ] Started Forward Password Requests to Wall Directory Watch.
Starting Remount Root and Kernel File Systems...
[ OK ] Created slice User and Session Slice.
[ OK ] Reached target Slices.
[ OK ] Reached target rpc_pipefs.target.
[ OK ] Created slice system-serial\x2dgetty.slice.
[ OK ] Listening on udev Kernel Socket.
Starting udev Coldplug all Devices...
[ OK ] Created slice system-selinux\x2dpol...grate\x2dlocal\x2dchanges.slice.
Starting Create list of required st... nodes for the current kernel...
[ OK ] Started Load Kernel Modules.
Starting Apply Kernel Variables...
[ OK ] Mounted Huge Pages File System.
[ OK ] Mounted POSIX Message Queue File System.
[ OK ] Mounted Debug File System.
[ OK ] Started Create list of required sta...ce nodes for the current kernel.
Starting Create Static Device Nodes in /dev...
[ OK ] Started Journal Service.
Mounting Arbitrary Executable File Formats File System...
[ OK ] Started Read and set NIS domainname from /etc/sysconfig/network.
[ OK ] Mounted Arbitrary Executable File Formats File System.
[ OK ] Started Apply Kernel Variables.
[ OK ] Started udev Coldplug all Devices.
[ OK ] Started Set Up Additional Binary Formats.
[FAILED] Failed to start Remount Root and Kernel File Systems.
See 'systemctl status systemd-remount-fs.service' for details.
[ OK ] Started Create Static Device Nodes in /dev.
Starting udev Kernel Device Manager...
Starting Flush Journal to Persistent Storage...
Starting Configure read-only root support...
[ OK ] Reached target Local File Systems (Pre).
Mounting /mnt...
[ OK ] Mounted /mnt.
[ 12.910171] systemd-journald[569]: Received request to flush runtime journal from PID 1
[ OK ] Started Flush Journal to Persistent Storage.
[ OK ] Started udev Kernel Device Manager.
[ 13.058705] input: PC Speaker as /devices/platform/pcspkr/input/input3
[ 13.102116] piix4_smbus 0000:00:01.3: SMBus Host Controller at 0x700, revision 0
[ OK ] Found device /dev/ttyS0.
[ OK ] Found device /dev/ttyS1.
[ 13.152410] cryptd: max_cpu_qlen set to 1000
[ OK ] Found device /dev/vda.
Mounting /home/green/git/lustre-release...
[ OK ] Found device /dev/disk/by-label/SWAP.
Activating swap /dev/disk/by-label/SWAP...
[ 13.249246] AVX version of gcm_enc/dec engaged.
[ 13.251496] AES CTR mode by8 optimization enabled
[ 13.276498] Adding 1048572k swap on /dev/vdb. Priority:-2 extents:1 across:1048572k FS
[ 13.279320] alg: No test for __gcm-aes-aesni (__driver-gcm-aes-aesni)
[ OK ] Activated swap /dev/disk/by-label/SWAP.
[ OK ] Reached target Swap.
[ 13.287501] alg: No test for __generic-gcm-aes-aesni (__driver-generic-gcm-aes-aesni)
[ 13.292349] squashfs: version 4.0 (2009/01/31) Phillip Lougher
[ OK ] Mounted /home/green/git/lustre-release.
[ 13.437015] EDAC MC: Ver: 3.0.0
[ 13.467217] EDAC sbridge: Ver: 1.1.2
[ 15.725190] mount.nfs (773) used greatest stack depth: 10712 bytes left
[ OK ] Started Configure read-only root support.
Starting Load/Save Random Seed...
[ OK ] Reached target Local File Systems.
Starting Rebuild Journal Catalog...
Starting Preprocess NFS configuration...
Starting Mark the need to relabel after reboot...
Starting Create Volatile Files and Directories...
Starting Tell Plymouth To Write Out Runtime Data...
[ OK ] Started Load/Save Random Seed.
[ OK ] Started Mark the need to relabel after reboot.
[ OK ] Started Preprocess NFS configuration.
[FAILED] Failed to start Rebuild Journal Catalog.
See 'systemctl status systemd-journal-catalog-update.service' for details.
Starting Update is Completed...
[FAILED] Failed to start Create Volatile Files and Directories.
See 'systemctl status systemd-tmpfiles-setup.service' for details.
[ OK ] Started Tell Plymouth To Write Out Runtime Data.
Starting Update UTMP about System Boot/Shutdown...
[ OK ] Started Update is Completed.
[ OK ] Started Update UTMP about System Boot/Shutdown.
[ OK ] Reached target System Initialization.
[ OK ] Listening on D-Bus System Message Bus Socket.
[ OK ] Started Flexible branding.
[ OK ] Reached target Paths.
[ OK ] Listening on RPCbind Server Activation Socket.
[ OK ] Reached target Sockets.
[ OK ] Reached target Basic System.
Starting Login Service...
Starting Dump dmesg to /var/log/dmesg...
[ OK ] Started D-Bus System Message Bus.
Starting Network Manager...
Starting GSSAPI Proxy Daemon...
[ OK ] Started Daily Cleanup of Temporary Directories.
[ OK ] Reached target Timers.
[ OK ] Started Dump dmesg to /var/log/dmesg.
[ OK ] Started Login Service.
[ OK ] Started GSSAPI Proxy Daemon.
[ OK ] Reached target NFS client services.
[ OK ] Reached target Remote File Systems (Pre).
[ OK ] Reached target Remote File Systems.
Starting Permit User Sessions...
[ OK ] Started Permit User Sessions.
[ OK ] Started Network Manager.
Starting Network Manager Wait Online...
[ OK ] Reached target Network.
Starting OpenSSH server daemon...
Starting /etc/rc.d/rc.local Compatibility...
Starting Hostname Service...
[ OK ] Started OpenSSH server daemon.
[ OK ] Started /etc/rc.d/rc.local Compatibility.
[ OK ] Started Hostname Service.
Starting Wait for Plymouth Boot Screen to Quit...
Starting Terminate Plymouth Boot Screen...
Starting Network Manager Script Dispatcher Service...

CentOS Linux 7 (Core)
Kernel 3.10.0-7.9-debug on an x86_64

oleg109-server login:
[ 24.624436] device-mapper: uevent: version 1.0.3
[ 24.627202] device-mapper: ioctl: 4.37.1-ioctl (2018-04-03) initialised: dm-devel@redhat.com
[ 30.047511] libcfs: loading out-of-tree module taints kernel.
[ 30.049593] libcfs: module verification failed: signature and/or required key missing - tainting kernel
[ 30.065481] LNet: HW NUMA nodes: 1, HW CPU cores: 4, npartitions: 1
[ 30.071348] alg: No test for adler32 (adler32-zlib)
[ 30.874924] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_hostid
[ 36.179347] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing load_modules_local
[ 36.567089] Lustre: Lustre: Build Version: 2.15.4_17_g4937c3c
[ 36.762085] LNet: Added LNI 192.168.201.109@tcp [8/256/0/180]
[ 36.764225] LNet: Accept secure, port 988
[ 38.320970] Key type lgssc registered
[ 38.656120] Lustre: Echo OBD driver; http://www.lustre.org/
[ 41.932422] icp: module license 'CDDL' taints kernel.
[ 41.934351] Disabling lock debugging due to kernel taint
[ 44.778118] ZFS: Loaded module v0.8.6-1, ZFS pool version 5000, ZFS filesystem version 5
[ 47.903728] LDISKFS-fs (vdc): mounted filesystem with ordered data mode. Opts: errors=remount-ro
[ 53.152916] LDISKFS-fs (vdd): mounted filesystem with ordered data mode. Opts: errors=remount-ro
[ 55.888660] LDISKFS-fs (vde): file extents enabled, maximum tree depth=5
[ 55.893815] LDISKFS-fs (vde): mounted filesystem with ordered data mode. Opts: errors=remount-ro
[ 58.586188] LDISKFS-fs (vdf): file extents enabled, maximum tree depth=5
[ 58.590903] LDISKFS-fs (vdf): mounted filesystem with ordered data mode. Opts: errors=remount-ro
[ 61.963445] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing load_modules_local
[ 65.544821] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: errors=remount-ro
[ 65.573401] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc
[ 66.656234] Lustre: Setting parameter lustre-MDT0000.mdt.identity_upcall in log lustre-MDT0000
[ 66.666228] Lustre: ctl-lustre-MDT0000: No data found on store. Initialize space: rc = -61
[ 66.702776] Lustre: lustre-MDT0000: new disk, initializing
[ 66.728119] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180
[ 66.736637] Lustre: ctl-lustre-MDT0000: super-sequence allocation rc = 0 [0x0000000200000400-0x0000000240000400]:0:mdt
[ 66.767407] mount.lustre (6825) used greatest stack depth: 10336 bytes left
[ 67.727029] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all 8
[ 72.220988] LDISKFS-fs (dm-1): mounted filesystem with ordered data mode. Opts: errors=remount-ro
[ 72.245224] LDISKFS-fs (dm-1): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc
[ 72.268287] Lustre: Setting parameter lustre-MDT0001.mdt.identity_upcall in log lustre-MDT0001
[ 72.275542] Lustre: srv-lustre-MDT0001: No data found on store. Initialize space: rc = -61
[ 72.278093] Lustre: Skipped 1 previous similar message
[ 72.316744] Lustre: lustre-MDT0001: new disk, initializing
[ 72.335825] Lustre: lustre-MDT0001: Imperative Recovery not enabled, recovery window 60-180
[ 72.345785] Lustre: ctl-lustre-MDT0000: super-sequence allocation rc = 0 [0x0000000240000400-0x0000000280000400]:1:mdt
[ 72.349696] Lustre: cli-ctl-lustre-MDT0001: Allocated super-sequence [0x0000000240000400-0x0000000280000400]:1:mdt]
[ 73.339471] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all 8
[ 77.774985] LDISKFS-fs (dm-2): file extents enabled, maximum tree depth=5
[ 77.779267] LDISKFS-fs (dm-2): mounted filesystem with ordered data mode. Opts: errors=remount-ro
[ 77.801125] LDISKFS-fs (dm-2): file extents enabled, maximum tree depth=5
[ 77.804503] LDISKFS-fs (dm-2): mounted filesystem with ordered data mode. Opts: errors=remount-ro,no_mbcache,nodelalloc
[ 77.806739] Lustre: lustre-OST0000-osd: enabled 'large_dir' feature on device /dev/mapper/ost1_flakey
[ 77.872688] Lustre: lustre-OST0000: new disk, initializing
[ 77.875026] Lustre: srv-lustre-OST0000: No data found on store. Initialize space: rc = -61
[ 77.891096] Lustre: lustre-OST0000: Imperative Recovery not enabled, recovery window 60-180
[ 78.880454] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all 8
[ 83.410744] LDISKFS-fs (dm-3): file extents enabled, maximum tree depth=5
[ 83.416640] LDISKFS-fs (dm-3): mounted filesystem with ordered data mode. Opts: errors=remount-ro
[ 83.436138] LDISKFS-fs (dm-3): file extents enabled, maximum tree depth=5
[ 83.440739] LDISKFS-fs (dm-3): mounted filesystem with ordered data mode. Opts: errors=remount-ro,no_mbcache,nodelalloc
[ 83.445208] Lustre: lustre-OST0001-osd: enabled 'large_dir' feature on device /dev/mapper/ost2_flakey
[ 83.473776] Lustre: lustre-OST0001: new disk, initializing
[ 83.476165] Lustre: srv-lustre-OST0001: No data found on store. Initialize space: rc = -61
[ 83.490797] Lustre: lustre-OST0001: Imperative Recovery not enabled, recovery window 60-180
[ 84.058972] Lustre: ctl-lustre-MDT0000: super-sequence allocation rc = 0 [0x0000000280000400-0x00000002c0000400]:0:ost
[ 84.063609] Lustre: cli-lustre-OST0000-super: Allocated super-sequence [0x0000000280000400-0x00000002c0000400]:0:ost]
[ 84.520210] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all 8
[ 89.380903] Lustre: DEBUG MARKER: Using TIMEOUT=20
[ 95.573732] Lustre: Modifying parameter general.lod.*.mdt_hash in log params
[ 101.295138] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing check_logdir /tmp/testlogs/
[ 102.237775] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing yml_node
[ 103.162463] Lustre: DEBUG MARKER: Client: 2.15.4.17
[ 103.744892] Lustre: DEBUG MARKER: MDS: 2.15.4.17
[ 104.336140] Lustre: DEBUG MARKER: OSS: 2.15.4.17
[ 104.684804] Lustre: DEBUG MARKER: -----============= acceptance-small: recovery-small ============----- Wed Apr 17 04:44:07 EDT 2024
[ 105.828133] random: crng init done
[ 105.947322] Lustre: DEBUG MARKER: excepting tests: 136
[ 107.338166] Lustre: DEBUG MARKER: oleg109-client.virtnet: executing check_config_client /mnt/lustre
[ 111.950600] Lustre: DEBUG MARKER: Using TIMEOUT=20
[ 112.755263] Lustre: Modifying parameter general.lod.*.mdt_hash in log params
[ 113.493509] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8
[ 114.277038] Lustre: DEBUG MARKER: == recovery-small test 1: create, chmod, stat: drop req, drop rep ========================================================== 04:44:16 (1713343456)
[ 114.543708] Lustre: *** cfs_fail_loc=123, val=2147483648***
[ 126.555930] Lustre: lustre-MDT0000: Client 83c7955a-247c-4189-8c36-57924ee75526 (at 192.168.201.9@tcp) reconnecting
[ 127.031630] Lustre: *** cfs_fail_loc=119, val=2147483648***
[ 127.033328] LustreError: 8624:0:(ldlm_lib.c:3225:target_send_reply_msg()) @@@ dropping reply req@ffff880130d62ac0 x1796570742789888/t4294967300(0) o36->83c7955a-247c-4189-8c36-57924ee75526@192.168.201.9@tcp:80/0 lens 520/448 e 0 to 0 dl 1713343475 ref 1 fl Interpret:/0/0 rc 0/0 job:'mcreate.0'
[ 134.047039] Lustre: lustre-MDT0000: Client 83c7955a-247c-4189-8c36-57924ee75526 (at 192.168.201.9@tcp) reconnecting
[ 134.057193] Lustre: 8624:0:(mdt_recovery.c:200:mdt_req_from_lrd()) @@@ restoring transno req@ffff880130d663c0 x1796570742789888/t4294967300(0) o36->83c7955a-247c-4189-8c36-57924ee75526@192.168.201.9@tcp:87/0 lens 520/448 e 0 to 0 dl 1713343482 ref 1 fl Interpret:/2/0 rc 0/0 job:'mcreate.0'
[ 134.542198] Lustre: *** cfs_fail_loc=123, val=2147483648***
[ 141.560501] Lustre: lustre-MDT0000: Client 83c7955a-247c-4189-8c36-57924ee75526 (at 192.168.201.9@tcp) reconnecting
[ 142.071492] Lustre: *** cfs_fail_loc=119, val=2147483648***
[ 142.073643] LustreError: 6847:0:(ldlm_lib.c:3225:target_send_reply_msg()) @@@ dropping reply req@ffff8801299397c0 x1796570742791424/t4294967302(0) o36->83c7955a-247c-4189-8c36-57924ee75526@192.168.201.9@tcp:95/0 lens 488/456 e 0 to 0 dl 1713343490 ref 1 fl Interpret:/0/0 rc 0/0 job:'tchmod.0'
[ 149.086712] Lustre: lustre-MDT0000: Client 83c7955a-247c-4189-8c36-57924ee75526 (at 192.168.201.9@tcp) reconnecting
[ 149.097149] Lustre: 6847:0:(mdt_recovery.c:200:mdt_req_from_lrd()) @@@ restoring transno req@ffff880082e3b900 x1796570742791424/t4294967302(0) o36->83c7955a-247c-4189-8c36-57924ee75526@192.168.201.9@tcp:102/0 lens 488/3152 e 0 to 0 dl 1713343497 ref 1 fl Interpret:/2/0 rc 0/0 job:'tchmod.0'
[ 149.577071] Lustre: *** cfs_fail_loc=123, val=2147483648***
[ 156.592540] Lustre: lustre-MDT0000: Client 83c7955a-247c-4189-8c36-57924ee75526 (at 192.168.201.9@tcp) reconnecting
[ 157.065710] Lustre: *** cfs_fail_loc=122, val=2147483648***
[ 157.067460] LustreError: 6846:0:(ldlm_lib.c:3225:target_send_reply_msg()) @@@ dropping reply req@ffff880082e3bdc0 x1796570742792448/t0(0) o34->83c7955a-247c-4189-8c36-57924ee75526@192.168.201.9@tcp:110/0 lens 472/464 e 0 to 0 dl 1713343505 ref 1 fl Interpret:/0/0 rc 0/0 job:'statone.0'
[ 165.757692] Lustre: DEBUG MARKER: == recovery-small test 4: open: drop req, drop rep ======= 04:45:08 (1713343508)
[ 166.065244] Lustre: *** cfs_fail_loc=123, val=2147483648***
[ 173.087116] Lustre: lustre-MDT0000: Client 83c7955a-247c-4189-8c36-57924ee75526 (at 192.168.201.9@tcp) reconnecting
[ 173.090818] Lustre: Skipped 1 previous similar message
[ 173.576793] Lustre: *** cfs_fail_loc=122, val=2147483648***
[ 173.578551] LustreError: 6850:0:(ldlm_lib.c:3225:target_send_reply_msg()) @@@ dropping reply req@ffff880082e84c00 x1796570742794624/t4294967308(0) o35->83c7955a-247c-4189-8c36-57924ee75526@192.168.201.9@tcp:127/0 lens 392/456 e 0 to 0 dl 1713343522 ref 1 fl Interpret:/0/0 rc 0/0 job:'cat.0'
[ 180.579217] Lustre: 6850:0:(mdt_recovery.c:200:mdt_req_from_lrd()) @@@ restoring transno req@ffff880130d584c0 x1796570742794624/t4294967308(0) o35->83c7955a-247c-4189-8c36-57924ee75526@192.168.201.9@tcp:134/0 lens 392/456 e 0 to 0 dl 1713343529 ref 1 fl Interpret:/2/0 rc 0/0 job:'cat.0'
[ 182.114578] Lustre: DEBUG MARKER: == recovery-small test 5: rename: drop req, drop rep ===== 04:45:24 (1713343524)
[ 182.383494] Lustre: *** cfs_fail_loc=123, val=2147483648***
[ 189.399830] Lustre: lustre-MDT0000: Client 83c7955a-247c-4189-8c36-57924ee75526 (at 192.168.201.9@tcp) reconnecting
[ 189.403421] Lustre: Skipped 1 previous similar message
[ 189.895196] Lustre: *** cfs_fail_loc=119, val=2147483648***
[ 189.897288] LustreError: 6848:0:(ldlm_lib.c:3225:target_send_reply_msg()) @@@ dropping reply req@ffff880082e41300 x1796570742797056/t4294967312(0) o36->83c7955a-247c-4189-8c36-57924ee75526@192.168.201.9@tcp:143/0 lens 552/456 e 0 to 0 dl 1713343538 ref 1 fl Interpret:/0/0 rc 0/0 job:'mv.0'
[ 196.896270] Lustre: 7944:0:(mdt_recovery.c:200:mdt_req_from_lrd()) @@@ restoring transno req@ffff880130d63900 x1796570742797056/t4294967312(0) o36->83c7955a-247c-4189-8c36-57924ee75526@192.168.201.9@tcp:150/0 lens 552/2888 e 0 to 0 dl 1713343545 ref 1 fl Interpret:/2/0 rc 0/0 job:'mv.0'
[ 198.534863] Lustre: DEBUG MARKER: == recovery-small test 6: link, unlink: drop req, drop rep ========================================================== 04:45:41 (1713343541)
[ 198.806871] Lustre: *** cfs_fail_loc=123, val=2147483648***
[ 206.276552] Lustre: *** cfs_fail_loc=119, val=2147483648***
[ 206.278481] LustreError: 6847:0:(ldlm_lib.c:3225:target_send_reply_msg()) @@@ dropping reply req@ffff880082c7ed40 x1796570742799360/t4294967317(0) o36->83c7955a-247c-4189-8c36-57924ee75526@192.168.201.9@tcp:160/0 lens 512/440 e 0 to 0 dl 1713343555 ref 1 fl Interpret:/0/0 rc 0/0 job:'mlink.0'
[ 213.278014] Lustre: 6847:0:(mdt_recovery.c:200:mdt_req_from_lrd()) @@@ restoring transno req@ffff880082502ac0 x1796570742799360/t4294967317(0) o36->83c7955a-247c-4189-8c36-57924ee75526@192.168.201.9@tcp:167/0 lens 512/440 e 0 to 0 dl 1713343562 ref 1 fl Interpret:/2/0 rc 0/0 job:'mlink.0'
[ 228.234957] Lustre: lustre-MDT0000: Client 83c7955a-247c-4189-8c36-57924ee75526 (at 192.168.201.9@tcp) reconnecting
[ 228.238019] Lustre: Skipped 4 previous similar messages
[ 228.244902] Lustre: 8624:0:(mdt_recovery.c:200:mdt_req_from_lrd()) @@@ restoring transno req@ffff880137123440 x1796570742801152/t4294967319(0) o36->83c7955a-247c-4189-8c36-57924ee75526@192.168.201.9@tcp:182/0 lens 504/2888 e 0 to 0 dl 1713343577 ref 1 fl Interpret:/2/0 rc 0/0 job:'munlink.0'
[ 229.870743] Lustre: DEBUG MARKER: == recovery-small test 8: touch: drop rep (bug 1423) ===== 04:46:12 (1713343572)
[ 230.114811] Lustre: *** cfs_fail_loc=119, val=2147483648***
[ 230.116165] Lustre: Skipped 1 previous similar message
[ 230.117583] LustreError: 8624:0:(ldlm_lib.c:3225:target_send_reply_msg()) @@@ dropping reply req@ffff88008274f6c0 x1796570742802240/t4294967322(0) o36->83c7955a-247c-4189-8c36-57924ee75526@192.168.201.9@tcp:184/0 lens 488/456 e 0 to 0 dl 1713343579 ref 1 fl Interpret:/0/0 rc 0/0 job:'touch.0'
[ 230.126849] LustreError: 8624:0:(ldlm_lib.c:3225:target_send_reply_msg()) Skipped 1 previous similar message
[ 238.683017] Lustre: DEBUG MARKER: == recovery-small test 9: pause bulk on OST (bug 1420) === 04:46:21 (1713343581)
[ 239.184306] LustreError: 9277:0:(fail.c:138:__cfs_fail_timeout_set()) cfs_fail_timeout id 214 sleeping for 5000ms
[ 244.188758] LustreError: 9277:0:(fail.c:149:__cfs_fail_timeout_set()) cfs_fail_timeout id 214 awake
[ 246.036624] Lustre: DEBUG MARKER: == recovery-small test 10a: finish request on server after client eviction (bug 1521) ========================================================== 04:46:28 (1713343588)
[ 256.340749] Lustre: 10636:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713343592/real 1713343592] req@ffff880082f73900 x1796570747038592/t0(0) o104->lustre-OST0000@192.168.201.9@tcp:15/16 lens 328/224 e 0 to 1 dl 1713343599 ref 1 fl Rpc:XQr/0/ffffffff rc 0/-1 job:''
[ 256.349335] Lustre: 10636:0:(client.c:2295:ptlrpc_expire_one_request()) Skipped 2 previous similar messages
[ 257.102733] Lustre: 7944:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713343589/real 1713343589] req@ffff88012ca276c0 x1796570747037184/t0(0) o104->lustre-MDT0000@192.168.201.9@tcp:15/16 lens 328/224 e 0 to 1 dl 1713343600 ref 1 fl Rpc:XQr/0/ffffffff rc 0/-1 job:''
[ 263.340766] Lustre: 9271:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713343599/real 1713343599] req@ffff880082e80000 x1796570747038656/t0(0) o104->lustre-OST0000@192.168.201.9@tcp:15/16 lens 328/224 e 0 to 1 dl 1713343606 ref 1 fl Rpc:XQr/2/ffffffff rc 0/-1 job:''
[ 263.350086] Lustre: 9271:0:(client.c:2295:ptlrpc_expire_one_request()) Skipped 2 previous similar messages
[ 268.109746] Lustre: 7944:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713343600/real 1713343600] req@ffff88012ca276c0 x1796570747037184/t0(0) o104->lustre-MDT0000@192.168.201.9@tcp:15/16 lens 328/224 e 0 to 1 dl 1713343611 ref 1 fl Rpc:XQr/2/ffffffff rc 0/-1 job:''
[ 277.340749] Lustre: 9272:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713343613/real 1713343613] req@ffff880082f1e3c0 x1796570747038720/t0(0) o104->lustre-OST0001@192.168.201.9@tcp:15/16 lens 328/224 e 0 to 1 dl 1713343620 ref 1 fl Rpc:XQr/2/ffffffff rc 0/-1 job:''
[ 277.349061] Lustre: 9272:0:(client.c:2295:ptlrpc_expire_one_request()) Skipped 4 previous similar messages
[ 286.138767] Lustre: mdt00_003: service thread pid 7944 was inactive for 40.036 seconds. The thread might be hung, or it might only be slow and will resume later. Dumping the stack trace for debugging purposes:
[ 286.145379] Pid: 7944, comm: mdt00_003 3.10.0-7.9-debug #1 SMP Sat Mar 26 23:28:42 EDT 2022
[ 286.148386] Call Trace:
[ 286.149526] [<0>] ptlrpc_set_wait+0x7cf/0x850 [ptlrpc]
[ 286.151451] [<0>] ldlm_run_ast_work+0xd5/0x3d0 [ptlrpc]
[ 286.153106] [<0>] ldlm_handle_conflict_lock+0x6e/0x300 [ptlrpc]
[ 286.154786] [<0>] ldlm_lock_enqueue+0x5c3/0xb40 [ptlrpc]
[ 286.156040] [<0>] ldlm_cli_enqueue_local+0x1ec/0x880 [ptlrpc]
[ 286.158153] [<0>] mdt_object_local_lock+0x527/0xb90 [mdt]
[ 286.160072] [<0>] mdt_object_lock_internal+0x70/0x390 [mdt]
[ 286.162118] [<0>] mdt_reint_object_lock+0x2c/0x60 [mdt]
[ 286.164086] [<0>] mdt_reint_striped_lock+0x89/0x6a0 [mdt]
[ 286.166036] [<0>] mdt_attr_set+0xad/0x910 [mdt]
[ 286.167493] [<0>] mdt_reint_setattr+0x73a/0x1210 [mdt]
[ 286.168838] [<0>] mdt_reint_rec+0x87/0x240 [mdt]
[ 286.170130] [<0>] mdt_reint_internal+0x76c/0xb50 [mdt]
[ 286.171442] [<0>] mdt_reint+0x67/0x150 [mdt]
[ 286.173270] [<0>] tgt_request_handle+0x93a/0x19c0 [ptlrpc]
[ 286.175592] [<0>] ptlrpc_server_handle_request+0x250/0xc30 [ptlrpc]
[ 286.178861] [<0>] ptlrpc_main+0xbd9/0x15f0 [ptlrpc]
[ 286.181435] [<0>] kthread+0xe4/0xf0
[ 286.182981] [<0>] ret_from_fork_nospec_begin+0x7/0x21
[ 286.184810] [<0>] 0xfffffffffffffffe
[ 289.466720] Lustre: ll_ost00_002: service thread pid 9272 was inactive for 40.125 seconds. The thread might be hung, or it might only be slow and will resume later. Dumping the stack trace for debugging purposes:
[ 289.466727] Lustre: ll_ost00_001: service thread pid 9271 was inactive for 40.125 seconds. The thread might be hung, or it might only be slow and will resume later. Dumping the stack trace for debugging purposes:
[ 289.466731] Lustre: ll_ost00_004: service thread pid 10636 was inactive for 40.125 seconds. Watchdog stack traces are limited to 3 per 300 seconds, skipping this one.
[ 289.466734] Pid: 9271, comm: ll_ost00_001 3.10.0-7.9-debug #1 SMP Sat Mar 26 23:28:42 EDT 2022 [ 289.466738] Call Trace: [ 289.466825] [<0>] ptlrpc_set_wait+0x7cf/0x850 [ptlrpc] [ 289.466858] [<0>] ldlm_run_ast_work+0xd5/0x3d0 [ptlrpc] [ 289.466886] [<0>] ldlm_handle_conflict_lock+0x6e/0x300 [ptlrpc] [ 289.466917] [<0>] ldlm_lock_enqueue+0x5c3/0xb40 [ptlrpc] [ 289.466952] [<0>] ldlm_cli_enqueue_local+0x377/0x880 [ptlrpc] [ 289.466968] [<0>] ofd_destroy_by_fid+0x1d1/0x520 [ofd] [ 289.466974] [<0>] ofd_destroy_hdl+0x20f/0xa80 [ofd] [ 289.467025] [<0>] tgt_request_handle+0x93a/0x19c0 [ptlrpc] [ 289.467066] [<0>] ptlrpc_server_handle_request+0x250/0xc30 [ptlrpc] [ 289.467104] [<0>] ptlrpc_main+0xbd9/0x15f0 [ptlrpc] [ 289.467111] [<0>] kthread+0xe4/0xf0 [ 289.467114] [<0>] ret_from_fork_nospec_begin+0x7/0x21 [ 289.467126] [<0>] 0xfffffffffffffffe [ 289.503821] Pid: 9272, comm: ll_ost00_002 3.10.0-7.9-debug #1 SMP Sat Mar 26 23:28:42 EDT 2022 [ 289.506703] Call Trace: [ 289.508610] [<0>] ptlrpc_set_wait+0x7cf/0x850 [ptlrpc] [ 289.510310] [<0>] ldlm_run_ast_work+0xd5/0x3d0 [ptlrpc] [ 289.511925] [<0>] ldlm_handle_conflict_lock+0x6e/0x300 [ptlrpc] [ 289.513967] [<0>] ldlm_lock_enqueue+0x5c3/0xb40 [ptlrpc] [ 289.516230] [<0>] ldlm_cli_enqueue_local+0x377/0x880 [ptlrpc] [ 289.518186] [<0>] ofd_destroy_by_fid+0x1d1/0x520 [ofd] [ 289.519644] [<0>] ofd_destroy_hdl+0x20f/0xa80 [ofd] [ 289.521233] [<0>] tgt_request_handle+0x93a/0x19c0 [ptlrpc] [ 289.523366] [<0>] ptlrpc_server_handle_request+0x250/0xc30 [ptlrpc] [ 289.525296] [<0>] ptlrpc_main+0xbd9/0x15f0 [ptlrpc] [ 289.527063] [<0>] kthread+0xe4/0xf0 [ 289.528487] [<0>] ret_from_fork_nospec_begin+0x7/0x21 [ 289.530251] [<0>] 0xfffffffffffffffe [ 290.119756] Lustre: 7944:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713343622/real 1713343622] req@ffff88012ca276c0 x1796570747037184/t0(0) o104->lustre-MDT0000@192.168.201.9@tcp:15/16 lens 328/224 e 0 to 1 dl 1713343633 ref 1 fl Rpc:XQr/2/ffffffff rc 0/-1 job:'' [ 290.127228] Lustre: 7944:0:(client.c:2295:ptlrpc_expire_one_request()) Skipped 7 previous similar messages [ 312.129742] Lustre: 7944:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713343644/real 1713343644] req@ffff88012ca276c0 x1796570747037184/t0(0) o104->lustre-MDT0000@192.168.201.9@tcp:15/16 lens 328/224 e 0 to 1 dl 1713343655 ref 1 fl Rpc:XQr/2/ffffffff rc 0/-1 job:'' [ 312.140175] Lustre: 7944:0:(client.c:2295:ptlrpc_expire_one_request()) Skipped 12 previous similar messages [ 345.143810] Lustre: 7944:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713343677/real 1713343677] req@ffff88012ca276c0 x1796570747037184/t0(0) o104->lustre-MDT0000@192.168.201.9@tcp:15/16 lens 328/224 e 0 to 1 dl 1713343688 ref 1 fl Rpc:XQr/2/ffffffff rc 0/-1 job:'' [ 345.152891] Lustre: 7944:0:(client.c:2295:ptlrpc_expire_one_request()) Skipped 20 previous similar messages [ 354.350851] LustreError: 9272:0:(ldlm_lockd.c:721:ldlm_handle_ast_error()) ### client (nid 192.168.201.9@tcp) failed to reply to blocking AST (req@ffff880082f1e3c0 x1796570747038720 status 0 rc -110), evict it ns: filter-lustre-OST0001_UUID lock: ffff8800a56d8b40/0x53803acb2299c029 lrc: 4/0,0 mode: PW/PW res: [0x4:0x0:0x0].0x0 rrc: 3 type: EXT [0->18446744073709551615] (req 0->4095) gid 0 flags: 0x60000400030020 nid: 192.168.201.9@tcp remote: 0x7afd575d89c1a745 expref: 5 pid: 9272 timeout: 446 lvb_type: 0 [ 354.363204] LustreError: 
138-a: lustre-OST0001: A client on nid 192.168.201.9@tcp was evicted due to a lock blocking callback time out: rc -110 [ 354.363441] LustreError: 6838:0:(ldlm_lockd.c:261:expired_lock_main()) ### lock callback timer expired after 7s: evicting client at 192.168.201.9@tcp ns: filter-lustre-OST0000_UUID lock: ffff8800a56d9680/0x53803acb2299c092 lrc: 3/0,0 mode: PW/PW res: [0x5:0x0:0x0].0x0 rrc: 3 type: EXT [0->18446744073709551615] (req 0->4194303) gid 0 flags: 0x60000400030020 nid: 192.168.201.9@tcp remote: 0x7afd575d89c1a76f expref: 8 pid: 9271 timeout: 0 lvb_type: 0 [ 354.378377] LustreError: Skipped 2 previous similar messages [ 356.155894] LustreError: 7944:0:(ldlm_lockd.c:721:ldlm_handle_ast_error()) ### client (nid 192.168.201.9@tcp) failed to reply to blocking AST (req@ffff88012ca276c0 x1796570747037184 status 0 rc -110), evict it ns: mdt-lustre-MDT0000_UUID lock: ffff88012efd7180/0x53803acb2299c102 lrc: 4/0,0 mode: PR/PR res: [0x200000007:0x1:0x0].0x0 bits 0x13/0x0 rrc: 4 type: IBT gid 0 flags: 0x60200400000020 nid: 192.168.201.9@tcp remote: 0x7afd575d89c1a77d expref: 11 pid: 7944 timeout: 444 lvb_type: 0 [ 356.167149] LustreError: 7944:0:(ldlm_lockd.c:721:ldlm_handle_ast_error()) Skipped 2 previous similar messages [ 356.170019] LustreError: 138-a: lustre-MDT0000: A client on nid 192.168.201.9@tcp was evicted due to a lock blocking callback time out: rc -110 [ 356.173728] LustreError: 6838:0:(ldlm_lockd.c:261:expired_lock_main()) ### lock callback timer expired after 11s: evicting client at 192.168.201.9@tcp ns: mdt-lustre-MDT0000_UUID lock: ffff88012efd7180/0x53803acb2299c102 lrc: 3/0,0 mode: PR/PR res: [0x200000007:0x1:0x0].0x0 bits 0x13/0x0 rrc: 4 type: IBT gid 0 flags: 0x60200400000020 nid: 192.168.201.9@tcp remote: 0x7afd575d89c1a77d expref: 12 pid: 7944 timeout: 0 lvb_type: 0 [ 356.182955] LustreError: 6838:0:(ldlm_lockd.c:261:expired_lock_main()) Skipped 1 previous similar message [ 357.622956] Lustre: DEBUG MARKER: == recovery-small test 10b: re-send BL AST =============== 04:48:20 (1713343700) [ 370.133757] Lustre: DEBUG MARKER: == recovery-small test 10c: re-send BL AST vs reconnect race (LU-5569) ========================================================== 04:48:32 (1713343712) [ 371.187882] Lustre: lustre-MDT0000: Client 83c7955a-247c-4189-8c36-57924ee75526 (at 192.168.201.9@tcp) reconnecting [ 371.191084] Lustre: Skipped 1 previous similar message [ 372.679029] Lustre: DEBUG MARKER: == recovery-small test 10d: test failed blocking ast ===== 04:48:35 (1713343715) [ 374.168562] LustreError: 9271:0:(ldlm_lockd.c:721:ldlm_handle_ast_error()) ### client (nid 192.168.201.9@tcp) returned error from blocking AST (req@ffff880082f76880 x1796570747070592 status -71 rc -71), evict it ns: filter-lustre-OST0000_UUID lock: ffff8800a56d8d80/0x53803acb2299c4f9 lrc: 4/0,0 mode: PW/PW res: [0x7:0x0:0x0].0x0 rrc: 3 type: EXT [0->18446744073709551615] (req 0->18446744073709551615) gid 0 flags: 0x60000480000020 nid: 192.168.201.9@tcp remote: 0x7afd575d89c1a991 expref: 5 pid: 9271 timeout: 473 lvb_type: 0 [ 374.183376] LustreError: 9271:0:(ldlm_lockd.c:721:ldlm_handle_ast_error()) Skipped 1 previous similar message [ 374.185697] LustreError: 138-a: lustre-OST0000: A client on nid 192.168.201.9@tcp was evicted due to a lock blocking callback time out: rc -71 [ 374.190284] LustreError: Skipped 1 previous similar message [ 374.192379] LustreError: 6838:0:(ldlm_lockd.c:261:expired_lock_main()) ### lock callback timer expired after 0s: evicting client at 192.168.201.9@tcp ns: 
filter-lustre-OST0000_UUID lock: ffff8800a56d8d80/0x53803acb2299c4f9 lrc: 3/0,0 mode: PW/PW res: [0x7:0x0:0x0].0x0 rrc: 3 type: EXT [0->18446744073709551615] (req 0->18446744073709551615) gid 0 flags: 0x60000480000020 nid: 192.168.201.9@tcp remote: 0x7afd575d89c1a991 expref: 6 pid: 9271 timeout: 0 lvb_type: 0 [ 376.006773] Lustre: DEBUG MARKER: == recovery-small test 10e: re-send BL AST vs reconnect race 2 ========================================================== 04:48:38 (1713343718) [ 376.326381] Lustre: DEBUG MARKER: SKIP: recovery-small test_10e need two clients [ 376.677562] Lustre: DEBUG MARKER: == recovery-small test 11: wake up a thread waiting for completion after eviction (b=2460) ========================================================== 04:48:39 (1713343719) [ 390.519063] Lustre: DEBUG MARKER: == recovery-small test 12: recover from timed out resend in ptlrpcd (b=2494) ========================================================== 04:48:53 (1713343733) [ 390.788553] Lustre: *** cfs_fail_loc=115, val=2147483648*** [ 390.790502] Lustre: Skipped 1 previous similar message [ 431.354717] Lustre: DEBUG MARKER: == recovery-small test 13: mdc_readpage restart test (bug 1138) ========================================================== 04:49:33 (1713343773) [ 431.620253] Lustre: *** cfs_fail_loc=104, val=2147483648*** [ 440.369593] Lustre: DEBUG MARKER: == recovery-small test 14: mdc_readpage resend test (bug 1138) ========================================================== 04:49:42 (1713343782) [ 440.644300] Lustre: *** cfs_fail_loc=106, val=0*** [ 442.334297] Lustre: DEBUG MARKER: == recovery-small test 15: failed open (-ENOMEM) ========= 04:49:44 (1713343784) [ 442.577760] Lustre: *** cfs_fail_loc=128, val=0*** [ 443.971272] Lustre: DEBUG MARKER: == recovery-small test 16: timeout bulk put, don't evict client (2732) ========================================================== 04:49:46 (1713343786) [ 444.323089] Lustre: *** cfs_fail_loc=504, val=0*** [ 444.324154] LustreError: 9277:0:(ldlm_lib.c:3559:target_bulk_io()) @@@ truncated bulk READ 0(102400) req@ffff880130d65a40 x1796570742844032/t0(0) o3->591586b5-12a0-4c75-b897-ddb9d5cfeb54@192.168.201.9@tcp:403/0 lens 488/440 e 0 to 0 dl 1713343798 ref 1 fl Interpret:/0/0 rc 0/0 job:'cmp.0' [ 444.333549] Lustre: lustre-OST0001: Bulk IO read error with 591586b5-12a0-4c75-b897-ddb9d5cfeb54 (at 192.168.201.9@tcp), client will retry: rc -110 [ 478.051398] Lustre: DEBUG MARKER: == recovery-small test 17a: timeout bulk get, don't evict client (2732) ========================================================== 04:50:20 (1713343820) [ 521.186195] Lustre: DEBUG MARKER: == recovery-small test 17b: timeout bulk get, don't evict client (3582) ========================================================== 04:51:03 (1713343863) [ 521.528994] Lustre: DEBUG MARKER: SKIP: recovery-small test_17b Needs multiple clients [ 521.911102] Lustre: DEBUG MARKER: == recovery-small test 18a: manual ost invalidate clears page cache immediately ========================================================== 04:51:04 (1713343864) [ 522.097985] Lustre: lustre-OST0001: Client 591586b5-12a0-4c75-b897-ddb9d5cfeb54 (at 192.168.201.9@tcp) reconnecting [ 522.101751] Lustre: Skipped 3 previous similar messages [ 523.523365] Lustre: DEBUG MARKER: == recovery-small test 18b: eviction and reconnect clears page cache (2766) ========================================================== 04:51:06 (1713343866) [ 523.928151] Lustre: 23065:0:(genops.c:1710:obd_export_evict_by_uuid()) lustre-OST0000: evicting
591586b5-12a0-4c75-b897-ddb9d5cfeb54 at administrative request [ 547.377199] Lustre: DEBUG MARKER: == recovery-small test 18c: Dropped connect reply after eviction handling (14755) ========================================================== 04:51:29 (1713343889) [ 547.796899] Lustre: 23406:0:(genops.c:1710:obd_export_evict_by_uuid()) lustre-OST0000: evicting 591586b5-12a0-4c75-b897-ddb9d5cfeb54 at administrative request [ 549.060436] Lustre: *** cfs_fail_loc=225, val=0*** [ 558.568648] Lustre: DEBUG MARKER: == recovery-small test 19a: test expired_lock_main on mds (2867) ========================================================== 04:51:41 (1713343901) [ 559.047457] Lustre: *** cfs_fail_loc=304, val=0*** [ 559.048818] Lustre: Skipped 1 previous similar message [ 566.059916] Lustre: *** cfs_fail_loc=304, val=0*** [ 566.061828] Lustre: Skipped 1 previous similar message [ 573.061961] Lustre: *** cfs_fail_loc=304, val=0*** [ 573.063160] Lustre: Skipped 1 previous similar message [ 580.063948] Lustre: *** cfs_fail_loc=304, val=0*** [ 580.065716] Lustre: Skipped 1 previous similar message [ 587.065873] Lustre: *** cfs_fail_loc=304, val=0*** [ 587.068233] Lustre: Skipped 1 previous similar message [ 601.069783] Lustre: *** cfs_fail_loc=304, val=0*** [ 601.071550] Lustre: Skipped 3 previous similar messages [ 622.076114] Lustre: *** cfs_fail_loc=304, val=0*** [ 622.077641] Lustre: Skipped 5 previous similar messages [ 657.087457] Lustre: *** cfs_fail_loc=304, val=0*** [ 657.089632] Lustre: Skipped 9 previous similar messages [ 659.130749] LustreError: 6838:0:(ldlm_lockd.c:261:expired_lock_main()) ### lock callback timer expired after 100s: evicting client at 192.168.201.9@tcp ns: mdt-lustre-MDT0000_UUID lock: ffff88012efd5b00/0x53803acb2299c47b lrc: 3/0,0 mode: PR/PR res: [0x200000007:0x1:0x0].0x0 bits 0x11/0x0 rrc: 5 type: IBT gid 0 flags: 0x60200400000020 nid: 192.168.201.9@tcp remote: 0x7afd575d89c1a967 expref: 21 pid: 7944 timeout: 658 lvb_type: 0 [ 661.110516] Lustre: DEBUG MARKER: == recovery-small test 19b: test expired_lock_main on ost (2867) ========================================================== 04:53:23 (1713344003) [ 724.225371] Lustre: *** cfs_fail_loc=304, val=0*** [ 724.227017] Lustre: Skipped 19 previous similar messages [ 761.658801] LustreError: 6838:0:(ldlm_lockd.c:261:expired_lock_main()) ### lock callback timer expired after 100s: evicting client at 192.168.201.9@tcp ns: filter-lustre-OST0000_UUID lock: ffff8800a8fa8d80/0x53803acb2299d028 lrc: 3/0,0 mode: PW/PW res: [0xd:0x0:0x0].0x0 rrc: 3 type: EXT [0->18446744073709551615] (req 0->4095) gid 0 flags: 0x60000400000020 nid: 192.168.201.9@tcp remote: 0x7afd575d89c1adea expref: 6 pid: 18477 timeout: 760 lvb_type: 0 [ 770.036684] Lustre: DEBUG MARKER: == recovery-small test 19c: check reconnect and lock resend do not trigger expired_lock_main ========================================================== 04:55:12 (1713344112) [ 779.390928] Lustre: DEBUG MARKER: == recovery-small test 20a: ldlm_handle_enqueue error (should return error) ========================================================== 04:55:21 (1713344121) [ 781.769745] Lustre: DEBUG MARKER: == recovery-small test 20b: ldlm_handle_enqueue error (should return error) ========================================================== 04:55:24 (1713344124) [ 783.844586] Lustre: DEBUG MARKER: == recovery-small test 21a: drop close request while close and open are both in flight ========================================================== 04:55:26 (1713344126) [ 784.145862] LustreError: 
6847:0:(fail.c:138:__cfs_fail_timeout_set()) cfs_fail_timeout id 129 sleeping for 5000ms [ 785.449702] LustreError: 6847:0:(fail.c:144:__cfs_fail_timeout_set()) cfs_fail_timeout interrupted [ 785.676737] Lustre: *** cfs_fail_loc=115, val=2147483648*** [ 792.688650] Lustre: lustre-MDT0001: Client 591586b5-12a0-4c75-b897-ddb9d5cfeb54 (at 192.168.201.9@tcp) reconnecting [ 792.693346] Lustre: Skipped 43 previous similar messages [ 794.931351] Lustre: DEBUG MARKER: == recovery-small test 21b: drop open request while close and open are both in flight ========================================================== 04:55:37 (1713344137) [ 966.595698] Lustre: DEBUG MARKER: == recovery-small test 21c: drop both requests while close and open are both in flight ========================================================== 04:58:28 (1713344308) [ 1101.286369] Lustre: DEBUG MARKER: == recovery-small test 21d: drop close reply while close and open are both in flight ========================================================== 05:00:43 (1713344443) [ 1101.604569] LustreError: 6848:0:(fail.c:138:__cfs_fail_timeout_set()) cfs_fail_timeout id 129 sleeping for 5000ms [ 1102.908721] LustreError: 6848:0:(fail.c:144:__cfs_fail_timeout_set()) cfs_fail_timeout interrupted [ 1103.128540] Lustre: *** cfs_fail_loc=122, val=2147483648*** [ 1103.130365] LustreError: 6850:0:(ldlm_lib.c:3225:target_send_reply_msg()) @@@ dropping reply req@ffff88012cefed40 x1796570742935872/t4294967357(0) o35->591586b5-12a0-4c75-b897-ddb9d5cfeb54@192.168.201.9@tcp:302/0 lens 392/456 e 0 to 0 dl 1713344452 ref 1 fl Interpret:/0/0 rc 0/0 job:'multiop.0' [ 1110.131865] Lustre: 6850:0:(mdt_recovery.c:200:mdt_req_from_lrd()) @@@ restoring transno req@ffff88009f8e4050 x1796570742935872/t4294967357(0) o35->591586b5-12a0-4c75-b897-ddb9d5cfeb54@192.168.201.9@tcp:309/0 lens 392/456 e 0 to 0 dl 1713344459 ref 1 fl Interpret:/2/0 rc 0/0 job:'multiop.0' [ 1110.139785] Lustre: 6850:0:(mdt_recovery.c:200:mdt_req_from_lrd()) Skipped 1 previous similar message [ 1112.763250] Lustre: DEBUG MARKER: == recovery-small test 21e: drop open reply while close and open are both in flight ========================================================== 05:00:55 (1713344455) [ 1113.202593] LustreError: 8624:0:(ldlm_lib.c:3225:target_send_reply_msg()) @@@ dropping reply req@ffff880082f18000 x1796570742940416/t4294967521(0) o36->591586b5-12a0-4c75-b897-ddb9d5cfeb54@192.168.201.9@tcp:437/0 lens 488/456 e 0 to 0 dl 1713344587 ref 1 fl Interpret:/0/0 rc 0/0 job:'touch.0' [ 1245.209209] Lustre: 23847:0:(mdt_recovery.c:200:mdt_req_from_lrd()) @@@ restoring transno req@ffff8800872b9c80 x1796570742940416/t4294967521(0) o36->591586b5-12a0-4c75-b897-ddb9d5cfeb54@192.168.201.9@tcp:569/0 lens 488/3152 e 0 to 0 dl 1713344719 ref 1 fl Interpret:/2/0 rc 0/0 job:'touch.0' [ 1245.759810] Lustre: DEBUG MARKER: == recovery-small test 21f: drop both replies while close and open are both in flight ========================================================== 05:03:08 (1713344588) [ 1246.161406] Lustre: *** cfs_fail_loc=119, val=2147483648*** [ 1246.163824] Lustre: Skipped 1 previous similar message [ 1246.166723] LustreError: 8624:0:(ldlm_lib.c:3225:target_send_reply_msg()) @@@ dropping reply req@ffff8801299d8980 x1796570742952256/t4294967536(0) o36->591586b5-12a0-4c75-b897-ddb9d5cfeb54@192.168.201.9@tcp:445/0 lens 488/456 e 0 to 0 dl 1713344595 ref 1 fl Interpret:/0/0 rc 0/0 job:'touch.0' [ 1253.163430] Lustre: 6846:0:(mdt_recovery.c:200:mdt_req_from_lrd()) @@@ restoring transno req@ffff88012ce4f200 
x1796570742952256/t4294967536(0) o36->591586b5-12a0-4c75-b897-ddb9d5cfeb54@192.168.201.9@tcp:452/0 lens 488/3152 e 0 to 0 dl 1713344602 ref 1 fl Interpret:/2/0 rc 0/0 job:'touch.0' [ 1257.398238] Lustre: DEBUG MARKER: == recovery-small test 21g: drop open reply and close request while close and open are both in flight ========================================================== 05:03:19 (1713344599) [ 1259.350934] Lustre: *** cfs_fail_loc=115, val=2147483648*** [ 1259.352514] Lustre: Skipped 3 previous similar messages [ 1264.755291] Lustre: 18505:0:(mdt_recovery.c:200:mdt_req_from_lrd()) @@@ restoring transno req@ffff8800871a2140 x1796570742957312/t4294967549(0) o36->591586b5-12a0-4c75-b897-ddb9d5cfeb54@192.168.201.9@tcp:463/0 lens 488/3152 e 0 to 0 dl 1713344613 ref 1 fl Interpret:/2/0 rc 0/0 job:'touch.0' [ 1264.767200] Lustre: 18505:0:(mdt_recovery.c:200:mdt_req_from_lrd()) Skipped 1 previous similar message [ 1268.696246] Lustre: DEBUG MARKER: == recovery-small test 21h: drop open request and close reply while close and open are both in flight ========================================================== 05:03:31 (1713344611) [ 1270.844222] LustreError: 16020:0:(ldlm_lib.c:3225:target_send_reply_msg()) @@@ dropping reply req@ffff8801299df200 x1796570742962688/t4294967393(0) o35->591586b5-12a0-4c75-b897-ddb9d5cfeb54@192.168.201.9@tcp:469/0 lens 392/456 e 0 to 0 dl 1713344619 ref 1 fl Interpret:/0/0 rc 0/0 job:'multiop.0' [ 1270.858924] LustreError: 16020:0:(ldlm_lib.c:3225:target_send_reply_msg()) Skipped 2 previous similar messages [ 1280.080489] Lustre: DEBUG MARKER: == recovery-small test 22: drop close request and do mknod ========================================================== 05:03:42 (1713344622) [ 1289.151001] Lustre: DEBUG MARKER: == recovery-small test 23: client hang when closing a file after mds crash ========================================================== 05:03:51 (1713344631) [ 1295.226029] Lustre: Failing over lustre-MDT0000 [ 1295.262462] LustreError: 30585:0:(lprocfs_jobstats.c:137:job_stat_exit()) should not have any items [ 1295.296587] Lustre: server umount lustre-MDT0000 complete [ 1296.027399] Lustre: lustre-MDT0000-lwp-OST0001: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 1296.028095] LustreError: 137-5: lustre-MDT0000_UUID: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 1296.037161] Lustre: Skipped 2 previous similar messages [ 1297.436060] LustreError: 137-5: lustre-MDT0000_UUID: not available for connect from 192.168.201.9@tcp (no target). If you are running an HA pair check that the target is mounted on the other server. [ 1297.441738] LustreError: Skipped 3 previous similar messages [ 1301.035402] LustreError: 137-5: lustre-MDT0000_UUID: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. 
[ 1301.041765] LustreError: Skipped 3 previous similar messages [ 1303.051858] Lustre: 3327:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713344638/real 1713344638] req@ffff880082e7f200 x1796570747306048/t0(0) o400->MGC192.168.201.109@tcp@0@lo:26/25 lens 224/224 e 0 to 1 dl 1713344645 ref 1 fl Rpc:XNQr/0/ffffffff rc 0/-1 job:'kworker/u8:2.0' [ 1303.062750] Lustre: 3327:0:(client.c:2295:ptlrpc_expire_one_request()) Skipped 10 previous similar messages [ 1303.065252] LustreError: 166-1: MGC192.168.201.109@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 1303.070487] LustreError: 137-5: lustre-MDT0000_UUID: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 1303.078594] LustreError: Skipped 4 previous similar messages [ 1307.449328] LustreError: 137-5: lustre-MDT0000_UUID: not available for connect from 192.168.201.9@tcp (no target). If you are running an HA pair check that the target is mounted on the other server. [ 1307.454162] LustreError: Skipped 1 previous similar message [ 1307.852477] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 1309.078839] Lustre: Evicted from MGS (at 192.168.201.109@tcp) after server handle changed from 0x53803acb2299b612 to 0x53803acb2299e6be [ 1309.084263] Lustre: MGC192.168.201.109@tcp: Connection restored to (at 0@lo) [ 1309.167402] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 1309.181322] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 1309.190968] mount.lustre (31190) used greatest stack depth: 10264 bytes left [ 1309.453845] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 1310.319390] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 1314.175950] Lustre: lustre-MDT0000-lwp-OST0001: Connection restored to 192.168.201.109@tcp (at 0@lo) [ 1314.187220] Lustre: lustre-MDT0000: Recovery over after 0:05, of 2 clients 2 recovered and 0 were evicted. 
[ 1314.208114] Lustre: lustre-OST0000: deleting orphan objects from 0x0:20 to 0x0:65 [ 1314.209073] Lustre: lustre-OST0001: deleting orphan objects from 0x0:16 to 0x0:33 [ 1316.108065] Lustre: DEBUG MARKER: oleg109-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 1316.589324] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 1319.702407] Lustre: DEBUG MARKER: == recovery-small test 24a: fsync error (should return error) ========================================================== 05:04:22 (1713344662) [ 1320.192153] Lustre: 32332:0:(genops.c:1710:obd_export_evict_by_uuid()) lustre-OST0000: evicting 591586b5-12a0-4c75-b897-ddb9d5cfeb54 at administrative request [ 1322.133815] Lustre: DEBUG MARKER: == recovery-small test 24b: test dirty page discard due to client eviction ========================================================== 05:04:24 (1713344664) [ 1322.551421] Lustre: 32656:0:(genops.c:1710:obd_export_evict_by_uuid()) lustre-OST0000: evicting 591586b5-12a0-4c75-b897-ddb9d5cfeb54 at administrative request [ 1324.479944] Lustre: DEBUG MARKER: == recovery-small test 26a: evict dead exports =========== 05:04:26 (1713344666) [ 1325.168517] Lustre: DEBUG MARKER: SKIP: recovery-small test_26a mgs and ost1 are at the same node [ 1325.728899] Lustre: DEBUG MARKER: == recovery-small test 26b: evict dead exports =========== 05:04:28 (1713344668) [ 1326.255297] Lustre: DEBUG MARKER: SKIP: recovery-small test_26b mgs and ost1 are at the same node [ 1326.782065] Lustre: DEBUG MARKER: == recovery-small test 27: fail LOV while using OSCs ==== 05:04:29 (1713344669) [ 1328.442678] Lustre: Failing over lustre-MDT0000 [ 1328.497740] LustreError: 867:0:(lprocfs_jobstats.c:137:job_stat_exit()) should not have any items [ 1328.500610] LustreError: 867:0:(lprocfs_jobstats.c:137:job_stat_exit()) Skipped 13 previous similar messages [ 1328.532674] Lustre: server umount lustre-MDT0000 complete [ 1329.195974] Lustre: lustre-MDT0000-lwp-MDT0001: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 1329.196546] LustreError: 137-5: lustre-MDT0000_UUID: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 1329.196548] LustreError: Skipped 8 previous similar messages [ 1329.207274] Lustre: Skipped 4 previous similar messages [ 1337.106724] Lustre: 3329:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713344672/real 1713344672] req@ffff88012cefb900 x1796570747323456/t0(0) o400->MGC192.168.201.109@tcp@0@lo:26/25 lens 224/224 e 0 to 1 dl 1713344679 ref 1 fl Rpc:XNQr/0/ffffffff rc 0/-1 job:'kworker/u8:1.0' [ 1337.119012] LustreError: 166-1: MGC192.168.201.109@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 1341.280582] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. 
Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 1343.140830] Lustre: Evicted from MGS (at 192.168.201.109@tcp) after server handle changed from 0x53803acb2299e6be to 0x53803acb229a1d8a [ 1343.146402] Lustre: MGC192.168.201.109@tcp: Connection restored to 192.168.201.109@tcp (at 0@lo) [ 1343.151177] Lustre: Skipped 3 previous similar messages [ 1343.286183] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 1343.311375] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 1343.758114] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 1344.460025] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 1348.287308] Lustre: lustre-MDT0000-lwp-MDT0001: Connection restored to 192.168.201.109@tcp (at 0@lo) [ 1348.306494] Lustre: lustre-MDT0000: Recovery over after 0:04, of 2 clients 2 recovered and 0 were evicted. [ 1348.337260] Lustre: lustre-OST0000: deleting orphan objects from 0x0:158 to 0x0:193 [ 1437.135682] Lustre: Failing over lustre-MDT0000 [ 1437.185814] Lustre: lustre-MDT0000: Not available for connect from 192.168.201.9@tcp (stopping) [ 1437.193681] LustreError: 6759:0:(ldlm_resource.c:1126:ldlm_resource_complain()) mdt-lustre-MDT0000_UUID: namespace resource [0x200000405:0x1dfb:0x0].0x0 (ffff88009374fe00) refcount nonzero (1) after lock cleanup; forcing cleanup. [ 1437.218841] LustreError: 6759:0:(lprocfs_jobstats.c:137:job_stat_exit()) should not have any items [ 1437.222104] LustreError: 6759:0:(lprocfs_jobstats.c:137:job_stat_exit()) Skipped 6 previous similar messages [ 1437.260263] Lustre: server umount lustre-MDT0000 complete [ 1438.444273] Lustre: lustre-MDT0000-osp-MDT0001: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 1438.446481] LustreError: 137-5: lustre-MDT0000_UUID: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 1438.446483] LustreError: Skipped 24 previous similar messages [ 1438.461821] Lustre: Skipped 2 previous similar messages [ 1445.452743] Lustre: 3329:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713344781/real 1713344781] req@ffff880082e14c00 x1796570748142912/t0(0) o400->MGC192.168.201.109@tcp@0@lo:26/25 lens 224/224 e 0 to 1 dl 1713344788 ref 1 fl Rpc:XNQr/0/ffffffff rc 0/-1 job:'kworker/u8:1.0' [ 1445.464842] LustreError: 166-1: MGC192.168.201.109@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 1450.692999] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 1451.498039] Lustre: Evicted from MGS (at 192.168.201.109@tcp) after server handle changed from 0x53803acb229a1d8a to 0x53803acb22a2a2b2 [ 1451.506971] LustreError: 7399:0:(ldlm_resource.c:1126:ldlm_resource_complain()) MGC192.168.201.109@tcp: namespace resource [0x65727473756c:0x5:0x0].0x0 (ffff88012d496600) refcount nonzero (1) after lock cleanup; forcing cleanup. 
[ 1451.514080] LustreError: 7399:0:(ldlm_resource.c:1126:ldlm_resource_complain()) Skipped 67 previous similar messages [ 1451.518781] Lustre: MGC192.168.201.109@tcp: Connection restored to 192.168.201.109@tcp (at 0@lo) [ 1451.523642] Lustre: Skipped 3 previous similar messages [ 1451.643872] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 1451.669864] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 1451.732456] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 1453.074526] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 1456.668666] Lustre: lustre-MDT0000: Recovery over after 0:04, of 2 clients 2 recovered and 0 were evicted. [ 1456.678310] Lustre: 6847:0:(mdt_recovery.c:200:mdt_req_from_lrd()) @@@ restoring transno req@ffff8800ad4b2ac0 x1796570746572672/t12884924339(0) o101->591586b5-12a0-4c75-b897-ddb9d5cfeb54@192.168.201.9@tcp:692/0 lens 672/3424 e 0 to 0 dl 1713344842 ref 1 fl Interpret:/2/0 rc 0/0 job:'writemany.0' [ 1456.686170] Lustre: 6847:0:(mdt_recovery.c:200:mdt_req_from_lrd()) Skipped 1 previous similar message [ 1456.687412] Lustre: lustre-OST0001: deleting orphan objects from 0x0:124 to 0x0:161 [ 1456.687437] Lustre: lustre-OST0000: deleting orphan objects from 0x0:7679 to 0x0:7713 [ 1458.657768] Lustre: DEBUG MARKER: == recovery-small test 28: handle error adding new clients (bug 6086) ========================================================== 05:06:41 (1713344801) [ 1472.106900] Lustre: Failing over lustre-MDT0000 [ 1472.148256] LustreError: 8360:0:(lprocfs_jobstats.c:137:job_stat_exit()) should not have any items [ 1472.205111] Lustre: server umount lustre-MDT0000 complete [ 1476.683372] Lustre: lustre-MDT0000-osp-MDT0001: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 1476.684341] LustreError: 137-5: lustre-MDT0000_UUID: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 1476.684343] LustreError: Skipped 23 previous similar messages [ 1476.696166] Lustre: Skipped 4 previous similar messages [ 1483.338713] Lustre: 3330:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713344819/real 1713344819] req@ffff8800ad4b3440 x1796570748222592/t0(0) o400->MGC192.168.201.109@tcp@0@lo:26/25 lens 224/224 e 0 to 1 dl 1713344826 ref 1 fl Rpc:XNQr/0/ffffffff rc 0/-1 job:'kworker/u8:1.0' [ 1483.348371] Lustre: 3330:0:(client.c:2295:ptlrpc_expire_one_request()) Skipped 1 previous similar message [ 1483.350735] LustreError: 166-1: MGC192.168.201.109@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 1485.142304] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. 
Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 1489.368764] Lustre: Evicted from MGS (at 192.168.201.109@tcp) after server handle changed from 0x53803acb22a2a2b2 to 0x53803acb22a2b248 [ 1489.374635] Lustre: MGC192.168.201.109@tcp: Connection restored to 192.168.201.109@tcp (at 0@lo) [ 1489.378333] Lustre: Skipped 4 previous similar messages [ 1489.444067] Lustre: *** cfs_fail_loc=12f, val=0*** [ 1489.446986] LustreError: 26982:0:(tgt_lastrcvd.c:1028:tgt_client_new()) lustre-MDT0001: no room for 0 clients - fix LR_MAX_CLIENTS [ 1489.452888] LustreError: 11-0: lustre-MDT0001-osp-MDT0000: operation mds_connect to node 0@lo failed: rc = -75 [ 1489.496965] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 1489.515770] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 1489.813852] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 1490.668171] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 1494.513282] Lustre: lustre-MDT0000: Recovery over after 0:04, of 2 clients 2 recovered and 0 were evicted. [ 1494.533639] Lustre: lustre-OST0001: deleting orphan objects from 0x0:163 to 0x0:193 [ 1494.533884] Lustre: lustre-OST0000: deleting orphan objects from 0x0:7679 to 0x0:7745 [ 1496.317369] Lustre: DEBUG MARKER: oleg109-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 1496.807722] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 1499.975423] Lustre: DEBUG MARKER: == recovery-small test 29a: error adding new clients doesn't cause LBUG (bug 22273) ========================================================== 05:07:22 (1713344842) [ 1500.891649] Lustre: Failing over lustre-MDT0000 [ 1500.949359] LustreError: 10258:0:(lprocfs_jobstats.c:137:job_stat_exit()) should not have any items [ 1500.952539] LustreError: 10258:0:(lprocfs_jobstats.c:137:job_stat_exit()) Skipped 4 previous similar messages [ 1501.008283] Lustre: server umount lustre-MDT0000 complete [ 1503.767516] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. 
Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 1503.825205] Lustre: lustre-MDT0000-lwp-OST0001: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 1503.825383] LustreError: 11-0: MGC192.168.201.109@tcp: operation mgs_target_reg to node 0@lo failed: rc = -107 [ 1503.825393] LustreError: 166-1: MGC192.168.201.109@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 1503.826324] Lustre: Evicted from MGS (at 192.168.201.109@tcp) after server handle changed from 0x53803acb22a2b248 to 0x53803acb22a2b64d [ 1503.844132] Lustre: Skipped 2 previous similar messages [ 1503.846187] Lustre: lustre-MDT0000: Not available for connect from 0@lo (not set up) [ 1503.848835] Lustre: Skipped 1 previous similar message [ 1503.878245] Lustre: *** cfs_fail_loc=711, val=0*** [ 1503.904982] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 1503.919200] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 1503.919338] LustreError: 10866:0:(mdt_handler.c:7428:mdt_iocontrol()) lustre-MDT0000: Aborting recovery for device [ 1503.919341] LustreError: 10866:0:(ldlm_lib.c:2883:target_stop_recovery_thread()) lustre-MDT0000: Aborting recovery [ 1503.926987] Lustre: 10897:0:(ldlm_lib.c:2289:target_recovery_overseer()) recovery is aborted, evict exports in recovery [ 1508.909369] Lustre: lustre-MDT0000-lwp-OST0001: Connection restored to 192.168.201.109@tcp (at 0@lo) [ 1508.912214] Lustre: Skipped 5 previous similar messages [ 1508.919331] Lustre: lustre-MDT0000: disconnecting 1 stale clients [ 1508.922035] LustreError: 10897:0:(ldlm_lib.c:1824:abort_lock_replay_queue()) @@@ aborted: req@ffff8800997a2600 x1796570748239616/t0(0) o101->lustre-MDT0001-mdtlov_UUID@0@lo:707/0 lens 328/0 e 0 to 0 dl 1713344857 ref 1 fl Complete:/40/ffffffff rc 0/-1 job:'ldlm_lock_repla.0' [ 1508.931528] Lustre: 10897:0:(ldlm_lib.c:2289:target_recovery_overseer()) recovery is aborted, evict exports in recovery [ 1508.931653] LustreError: 11-0: lustre-MDT0000-osp-MDT0001: operation ldlm_enqueue to node 0@lo failed: rc = -107 [ 1508.932164] Lustre: lustre-MDT0000: Denying connection for new client lustre-MDT0001-mdtlov_UUID (at 0@lo), waiting for 2 known clients (1 recovered, 0 in progress, and 1 evicted) already passed deadline 25:08 [ 1508.963090] Lustre: lustre-OST0000: deleting orphan objects from 0x0:7679 to 0x0:7777 [ 1508.963861] Lustre: lustre-OST0001: deleting orphan objects from 0x0:163 to 0x0:225 [ 1510.047129] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 1513.916494] LustreError: 167-0: lustre-MDT0000-osp-MDT0001: This client was evicted by lustre-MDT0000; in progress operations using this service will fail. 
[ 1521.818513] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing wait_import_state FULL os[cp].lustre-OST0000-osc-MDT0000.ost_server_uuid 40 [ 1521.883608] Lustre: DEBUG MARKER: os[cp].lustre-OST0000-osc-MDT0000.ost_server_uuid in FULL state after 0 sec [ 1523.823186] Lustre: DEBUG MARKER: == recovery-small test 29b: error adding new clients doesn't cause LBUG (bug 22273) ========================================================== 05:07:46 (1713344866) [ 1524.851837] Lustre: Failing over lustre-OST0000 [ 1524.859378] LustreError: 12348:0:(lprocfs_jobstats.c:137:job_stat_exit()) should not have any items [ 1524.891109] Lustre: server umount lustre-OST0000 complete [ 1526.412649] LustreError: 11-0: lustre-OST0000-osc-MDT0001: operation ost_statfs to node 0@lo failed: rc = -107 [ 1526.416844] Lustre: lustre-OST0000-osc-MDT0001: Connection to lustre-OST0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 1526.421649] Lustre: Skipped 1 previous similar message [ 1527.818423] LDISKFS-fs (dm-2): file extents enabled, maximum tree depth=5 [ 1527.825045] LDISKFS-fs (dm-2): mounted filesystem with ordered data mode. Opts: errors=remount-ro,no_mbcache,nodelalloc [ 1527.915899] Lustre: lustre-OST0000: Imperative Recovery not enabled, recovery window 60-180 [ 1527.922869] Lustre: lustre-OST0000: in recovery but waiting for the first client to connect [ 1527.923082] LustreError: 12942:0:(ofd_obd.c:1263:ofd_iocontrol()) lustre-OST0000: aborting recovery [ 1527.923086] LustreError: 12942:0:(ldlm_lib.c:2883:target_stop_recovery_thread()) lustre-OST0000: Aborting recovery [ 1527.930966] Lustre: Skipped 2 previous similar messages [ 1527.932397] Lustre: 12955:0:(ldlm_lib.c:2289:target_recovery_overseer()) recovery is aborted, evict exports in recovery [ 1527.936095] Lustre: 12955:0:(ldlm_lib.c:2289:target_recovery_overseer()) Skipped 1 previous similar message [ 1527.938842] Lustre: lustre-OST0000: disconnecting 3 stale clients [ 1527.954670] mount.lustre (12942) used greatest stack depth: 10192 bytes left [ 1529.013126] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 1529.246901] Lustre: *** cfs_fail_loc=711, val=0*** [ 1529.489185] LustreError: 167-0: lustre-OST0000-osc-MDT0000: This client was evicted by lustre-OST0000; in progress operations using this service will fail. [ 1529.499430] Lustre: lustre-OST0000: deleting orphan objects from 0x0:7679 to 0x0:7809 [ 1529.504257] Lustre: lustre-OST0000: deleting orphan objects from 0x280000400:9 to 0x280000400:33 [ 1541.872577] Lustre: DEBUG MARKER: == recovery-small test 50: failover MDS under load ======= 05:08:04 (1713344884) [ 1552.692229] Lustre: Failing over lustre-MDT0000 [ 1552.736440] LustreError: 14010:0:(lprocfs_jobstats.c:137:job_stat_exit()) should not have any items [ 1552.740368] LustreError: 14010:0:(lprocfs_jobstats.c:137:job_stat_exit()) Skipped 4 previous similar messages [ 1552.784714] Lustre: server umount lustre-MDT0000 complete [ 1552.956057] Lustre: lustre-MDT0000-lwp-OST0000: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 1552.956727] LustreError: 137-5: lustre-MDT0000_UUID: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. 
[ 1552.956729] LustreError: Skipped 27 previous similar messages [ 1552.970410] Lustre: Skipped 3 previous similar messages [ 1559.962845] Lustre: 3330:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713344895/real 1713344895] req@ffff8801299e0980 x1796570748354944/t0(0) o400->MGC192.168.201.109@tcp@0@lo:26/25 lens 224/224 e 0 to 1 dl 1713344902 ref 1 fl Rpc:XNQr/0/ffffffff rc 0/-1 job:'kworker/u8:0.0' [ 1559.975731] LustreError: 166-1: MGC192.168.201.109@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 1565.817748] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 1565.991534] Lustre: Evicted from MGS (at 192.168.201.109@tcp) after server handle changed from 0x53803acb22a2b64d to 0x53803acb22a56821 [ 1566.004995] Lustre: MGC192.168.201.109@tcp: Connection restored to 192.168.201.109@tcp (at 0@lo) [ 1566.010755] Lustre: Skipped 5 previous similar messages [ 1566.156968] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 1566.185274] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 1566.188901] Lustre: Skipped 2 previous similar messages [ 1567.862242] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 1568.201944] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 1571.167783] Lustre: lustre-MDT0000: Recovery over after 0:03, of 2 clients 2 recovered and 0 were evicted. [ 1571.193253] Lustre: lustre-OST0000: deleting orphan objects from 0x0:7679 to 0x0:7841 [ 1571.193532] Lustre: lustre-OST0001: deleting orphan objects from 0x0:163 to 0x0:257 [ 1573.646461] Lustre: DEBUG MARKER: oleg109-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 1574.448184] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 1636.518695] Lustre: Failing over lustre-MDT0000 [ 1636.559837] LustreError: 15609:0:(lprocfs_jobstats.c:137:job_stat_exit()) should not have any items [ 1636.564848] LustreError: 15609:0:(lprocfs_jobstats.c:137:job_stat_exit()) Skipped 1 previous similar message [ 1636.607145] Lustre: server umount lustre-MDT0000 complete [ 1641.259284] Lustre: lustre-MDT0000-osp-MDT0001: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 1641.265582] Lustre: Skipped 3 previous similar messages [ 1648.484891] LustreError: 166-1: MGC192.168.201.109@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 1649.561464] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. 
Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 1654.499524] Lustre: Evicted from MGS (at 192.168.201.109@tcp) after server handle changed from 0x53803acb22a56821 to 0x53803acb22ae16a1 [ 1654.505235] Lustre: MGC192.168.201.109@tcp: Connection restored to 192.168.201.109@tcp (at 0@lo) [ 1654.508416] Lustre: Skipped 4 previous similar messages [ 1654.595886] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 1654.612601] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 1655.715942] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 1656.320567] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 1659.602630] Lustre: lustre-MDT0000: Recovery over after 0:03, of 2 clients 2 recovered and 0 were evicted. [ 1659.619020] Lustre: lustre-OST0000: deleting orphan objects from 0x0:7679 to 0x0:7873 [ 1659.619104] Lustre: lustre-OST0001: deleting orphan objects from 0x0:163 to 0x0:289 [ 1661.442748] Lustre: DEBUG MARKER: oleg109-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 1661.934320] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 1723.821847] Lustre: Failing over lustre-MDT0000 [ 1723.859696] LustreError: 17218:0:(lprocfs_jobstats.c:137:job_stat_exit()) should not have any items [ 1723.863439] LustreError: 17218:0:(lprocfs_jobstats.c:137:job_stat_exit()) Skipped 1 previous similar message [ 1723.904012] Lustre: server umount lustre-MDT0000 complete [ 1724.699142] Lustre: lustre-MDT0000-lwp-MDT0001: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 1724.700026] LustreError: 137-5: lustre-MDT0000_UUID: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 1724.700027] LustreError: Skipped 47 previous similar messages [ 1724.718400] Lustre: Skipped 4 previous similar messages [ 1736.514833] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 1736.570743] LustreError: 166-1: MGC192.168.201.109@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 1736.570918] LustreError: 11-0: MGC192.168.201.109@tcp: operation mgs_target_reg to node 0@lo failed: rc = -107 [ 1736.581890] Lustre: Evicted from MGS (at 192.168.201.109@tcp) after server handle changed from 0x53803acb22ae16a1 to 0x53803acb22b90ca5 [ 1736.672760] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 1736.686806] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 1737.970493] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 1738.889317] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 1741.683632] Lustre: lustre-MDT0000: Recovery over after 0:02, of 2 clients 2 recovered and 0 were evicted. 
[ 1741.701434] Lustre: lustre-OST0001: deleting orphan objects from 0x0:163 to 0x0:321 [ 1741.701522] Lustre: lustre-OST0000: deleting orphan objects from 0x0:7679 to 0x0:7905 [ 1743.430819] Lustre: DEBUG MARKER: oleg109-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 1743.897102] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 1766.849326] Lustre: DEBUG MARKER: == recovery-small test 51: failover MDS during recovery == 05:11:49 (1713345109) [ 1768.768761] Lustre: lustre-MDT0000: Not available for connect from 192.168.201.9@tcp (stopping) [ 1778.733718] Lustre: 3330:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713345114/real 1713345114] req@ffff880097dfb900 x1796570750804992/t0(0) o400->MGC192.168.201.109@tcp@0@lo:26/25 lens 224/224 e 0 to 1 dl 1713345121 ref 1 fl Rpc:XNQr/0/ffffffff rc 0/-1 job:'kworker/u8:0.0' [ 1778.745818] Lustre: 3330:0:(client.c:2295:ptlrpc_expire_one_request()) Skipped 1 previous similar message [ 1781.182896] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 1784.760549] Lustre: MGC192.168.201.109@tcp: Connection restored to 192.168.201.109@tcp (at 0@lo) [ 1784.763990] Lustre: Skipped 9 previous similar messages [ 1785.885917] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 1786.050371] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 1786.795977] Lustre: DEBUG MARKER: test_51: failover in 1 sec [ 1788.351368] LustreError: 20401:0:(ldlm_lib.c:2883:target_stop_recovery_thread()) lustre-MDT0000: Aborting recovery [ 1788.356716] Lustre: 19752:0:(ldlm_lib.c:2289:target_recovery_overseer()) recovery is aborted, evict exports in recovery [ 1788.361784] Lustre: 19752:0:(ldlm_lib.c:2289:target_recovery_overseer()) Skipped 2 previous similar messages [ 1788.365417] Lustre: lustre-MDT0000: Recovery over after 0:02, of 2 clients 0 recovered and 2 were evicted. [ 1788.380925] Lustre: 19752:0:(mdt_handler.c:7532:mdt_postrecov()) lustre-MDT0000: auto trigger paused LFSCK failed: rc = -6 [ 1801.045688] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. 
Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 1802.932990] Lustre: Evicted from MGS (at 192.168.201.109@tcp) after server handle changed from 0x53803acb22bd0bd2 to 0x53803acb22bd1254 [ 1802.935897] Lustre: Skipped 1 previous similar message [ 1803.985545] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 1804.939059] Lustre: DEBUG MARKER: test_51: failover in 5 sec [ 1810.514454] LustreError: 21689:0:(ldlm_lib.c:2883:target_stop_recovery_thread()) lustre-MDT0000: Aborting recovery [ 1810.519611] Lustre: 21037:0:(ldlm_lib.c:2289:target_recovery_overseer()) recovery is aborted, evict exports in recovery [ 1810.524615] Lustre: 21037:0:(ldlm_lib.c:2289:target_recovery_overseer()) Skipped 2 previous similar messages [ 1810.529331] LustreError: 21037:0:(ldlm_lib.c:1824:abort_lock_replay_queue()) @@@ aborted: req@ffff8800872bed40 x1796570750853440/t0(0) o101->lustre-MDT0001-mdtlov_UUID@0@lo:262/0 lens 328/0 e 1 to 0 dl 1713345167 ref 1 fl Complete:/40/ffffffff rc 0/-1 job:'ldlm_lock_repla.0' [ 1810.537743] LustreError: 11-0: lustre-MDT0000-osp-MDT0001: operation ldlm_enqueue to node 0@lo failed: rc = -19 [ 1810.541630] Lustre: lustre-MDT0000: Not available for connect from 0@lo (stopping) [ 1810.552439] Lustre: 21037:0:(mdt_handler.c:7532:mdt_postrecov()) lustre-MDT0000: auto trigger paused LFSCK failed: rc = -6 [ 1819.954801] LustreError: 166-1: MGC192.168.201.109@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 1819.959508] LustreError: Skipped 2 previous similar messages [ 1823.009903] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 1827.001530] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 1827.935291] Lustre: DEBUG MARKER: test_51: failover in 10 sec [ 1831.071484] Lustre: lustre-OST0000: deleting orphan objects from 0x0:8007 to 0x0:8033 [ 1831.074572] Lustre: lustre-OST0001: deleting orphan objects from 0x0:423 to 0x0:449 [ 1838.536888] Lustre: lustre-MDT0000: Not available for connect from 192.168.201.9@tcp (stopping) [ 1851.108596] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 1854.569423] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 1854.572840] Lustre: Skipped 2 previous similar messages [ 1855.148444] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 1856.068674] Lustre: DEBUG MARKER: test_51: failover in 20 sec [ 1859.046802] Lustre: lustre-MDT0000: Recovery over after 0:05, of 2 clients 2 recovered and 0 were evicted. 
[ 1859.051449] Lustre: Skipped 2 previous similar messages [ 1859.068698] Lustre: lustre-OST0001: deleting orphan objects from 0x0:907 to 0x0:929 [ 1859.068901] Lustre: lustre-OST0000: deleting orphan objects from 0x0:8490 to 0x0:8513 [ 1876.663203] Lustre: Failing over lustre-MDT0000 [ 1876.665418] Lustre: Skipped 4 previous similar messages [ 1876.745652] LustreError: 24287:0:(lprocfs_jobstats.c:137:job_stat_exit()) should not have any items [ 1876.748833] LustreError: 24287:0:(lprocfs_jobstats.c:137:job_stat_exit()) Skipped 4 previous similar messages [ 1876.781806] Lustre: server umount lustre-MDT0000 complete [ 1876.783735] Lustre: Skipped 4 previous similar messages [ 1879.067797] Lustre: lustre-MDT0000-lwp-OST0001: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 1879.073895] Lustre: Skipped 13 previous similar messages [ 1889.242390] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 1892.176178] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 1892.179926] Lustre: Skipped 4 previous similar messages [ 1892.197299] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 1892.200696] Lustre: Skipped 4 previous similar messages [ 1893.088680] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 1893.971247] Lustre: DEBUG MARKER: test_51: failover in 25 sec [ 1897.199046] Lustre: lustre-OST0001: deleting orphan objects from 0x0:2048 to 0x0:2081 [ 1897.199048] Lustre: lustre-OST0000: deleting orphan objects from 0x0:9632 to 0x0:9665 [ 1919.554653] Lustre: lustre-MDT0000: Not available for connect from 192.168.201.9@tcp (stopping) [ 1919.571987] LustreError: 25592:0:(ldlm_resource.c:1126:ldlm_resource_complain()) mdt-lustre-MDT0000_UUID: namespace resource [0x200000405:0x1e14:0x0].0x848107e8 (ffff88012cda5c00) refcount nonzero (2) after lock cleanup; forcing cleanup. [ 1931.866027] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 1935.236628] Lustre: Evicted from MGS (at 192.168.201.109@tcp) after server handle changed from 0x53803acb22c0ab67 to 0x53803acb22c42873 [ 1935.240672] Lustre: Skipped 3 previous similar messages [ 1936.433707] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 1937.357362] Lustre: DEBUG MARKER: test_51: failover in 30 sec [ 1940.353704] Lustre: lustre-OST0001: deleting orphan objects from 0x0:3610 to 0x0:3649 [ 1940.353723] Lustre: lustre-OST0000: deleting orphan objects from 0x0:11196 to 0x0:11233 [ 1967.969316] Lustre: lustre-MDT0000: Not available for connect from 192.168.201.9@tcp (stopping) [ 1970.395195] LustreError: 11-0: lustre-MDT0000-osp-MDT0001: operation mds_statfs to node 0@lo failed: rc = -107 [ 1980.810237] LustreError: 137-5: lustre-MDT0000_UUID: not available for connect from 192.168.201.9@tcp (no target). If you are running an HA pair check that the target is mounted on the other server. [ 1980.816940] LustreError: Skipped 163 previous similar messages [ 1982.402784] LustreError: 166-1: MGC192.168.201.109@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 1982.409107] LustreError: Skipped 3 previous similar messages [ 1986.393630] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. 
Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 1988.420638] LustreError: 27519:0:(ldlm_resource.c:1126:ldlm_resource_complain()) MGC192.168.201.109@tcp: namespace resource [0x65727473756c:0x5:0x0].0x0 (ffff880130a32200) refcount nonzero (1) after lock cleanup; forcing cleanup. [ 1988.426595] LustreError: 27519:0:(ldlm_resource.c:1126:ldlm_resource_complain()) Skipped 8 previous similar messages [ 1988.817839] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 1988.821885] Lustre: Skipped 2 previous similar messages [ 1989.573444] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 1993.523572] Lustre: lustre-MDT0000: Recovery over after 0:04, of 2 clients 2 recovered and 0 were evicted. [ 1993.527782] Lustre: Skipped 2 previous similar messages [ 1993.528741] Lustre: 16732:0:(mdt_recovery.c:200:mdt_req_from_lrd()) @@@ restoring transno req@ffff880097c52ac0 x1796570762240832/t55834586789(0) o101->591586b5-12a0-4c75-b897-ddb9d5cfeb54@192.168.201.9@tcp:473/0 lens 672/3424 e 0 to 0 dl 1713345378 ref 1 fl Interpret:/2/0 rc 0/0 job:'writemany.0' [ 1993.555395] Lustre: lustre-OST0001: deleting orphan objects from 0x0:5641 to 0x0:5665 [ 1993.555396] Lustre: lustre-OST0000: deleting orphan objects from 0x0:13224 to 0x0:13249 [ 2011.696376] Lustre: DEBUG MARKER: == recovery-small test 52: failover OST under load ======= 05:15:54 (1713345354) [ 2022.398022] Lustre: lustre-OST0000: Not available for connect from 192.168.201.9@tcp (stopping) [ 2022.402256] Lustre: Skipped 5 previous similar messages [ 2022.815348] LustreError: 11-0: lustre-OST0000-osc-MDT0000: operation ost_create to node 0@lo failed: rc = -107 [ 2022.819796] LustreError: 27528:0:(osp_precreate.c:677:osp_precreate_send()) lustre-OST0000-osc-MDT0000: can't precreate: rc = -107 [ 2036.700278] LDISKFS-fs (dm-2): file extents enabled, maximum tree depth=5 [ 2036.704200] LDISKFS-fs (dm-2): mounted filesystem with ordered data mode. 
Opts: errors=remount-ro,no_mbcache,nodelalloc [ 2037.791315] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 2038.652546] Lustre: lustre-OST0000: deleting orphan objects from 0x280000400:11459 to 0x280000400:11489 [ 2041.047046] Lustre: DEBUG MARKER: oleg109-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid [ 2041.491626] Lustre: DEBUG MARKER: osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec [ 2353.021708] Lustre: Failing over lustre-OST0000 [ 2353.023930] Lustre: Skipped 3 previous similar messages [ 2353.033739] LustreError: 30185:0:(lprocfs_jobstats.c:137:job_stat_exit()) should not have any items [ 2353.037130] LustreError: 30185:0:(lprocfs_jobstats.c:137:job_stat_exit()) Skipped 4 previous similar messages [ 2353.054710] Lustre: server umount lustre-OST0000 complete [ 2353.056397] Lustre: Skipped 3 previous similar messages [ 2353.509979] LustreError: 11-0: lustre-OST0000-osc-MDT0000: operation ost_create to node 0@lo failed: rc = -107 [ 2353.514229] Lustre: lustre-OST0000-osc-MDT0000: Connection to lustre-OST0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 2353.518508] Lustre: Skipped 11 previous similar messages [ 2353.520793] LustreError: 27528:0:(osp_precreate.c:677:osp_precreate_send()) lustre-OST0000-osc-MDT0000: can't precreate: rc = -107 [ 2365.400916] LDISKFS-fs (dm-2): file extents enabled, maximum tree depth=5 [ 2365.405342] LDISKFS-fs (dm-2): mounted filesystem with ordered data mode. Opts: errors=remount-ro,no_mbcache,nodelalloc [ 2365.558556] Lustre: lustre-OST0000: Imperative Recovery not enabled, recovery window 60-180 [ 2365.562200] Lustre: Skipped 3 previous similar messages [ 2365.570227] Lustre: lustre-OST0000: in recovery but waiting for the first client to connect [ 2365.573936] Lustre: Skipped 3 previous similar messages [ 2366.510201] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 2366.849985] Lustre: lustre-OST0000: Will be in recovery for at least 1:00, or until 3 clients reconnect [ 2366.852999] Lustre: Skipped 1 previous similar message [ 2367.414034] Lustre: lustre-OST0000-osc-MDT0000: Connection restored to 192.168.201.109@tcp (at 0@lo) [ 2367.414201] Lustre: lustre-OST0000: Recovery over after 0:01, of 3 clients 3 recovered and 0 were evicted. [ 2367.414203] Lustre: Skipped 1 previous similar message [ 2367.423640] Lustre: Skipped 31 previous similar messages [ 2367.428996] Lustre: lustre-OST0000: deleting orphan objects from 0x280000400:11459 to 0x280000400:11521 [ 2369.735220] Lustre: DEBUG MARKER: oleg109-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid [ 2370.199787] Lustre: DEBUG MARKER: osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec [ 2683.975158] LustreError: 137-5: lustre-OST0000_UUID: not available for connect from 192.168.201.9@tcp (no target). If you are running an HA pair check that the target is mounted on the other server. 
[ 2683.979326] LustreError: Skipped 30 previous similar messages [ 2684.004672] LustreError: 11-0: lustre-OST0000-osc-MDT0000: operation ost_create to node 0@lo failed: rc = -107 [ 2684.007775] LustreError: Skipped 1 previous similar message [ 2684.009759] LustreError: 27528:0:(osp_precreate.c:677:osp_precreate_send()) lustre-OST0000-osc-MDT0000: can't precreate: rc = -107 [ 2696.253486] LDISKFS-fs (dm-2): file extents enabled, maximum tree depth=5 [ 2696.257316] LDISKFS-fs (dm-2): mounted filesystem with ordered data mode. Opts: errors=remount-ro,no_mbcache,nodelalloc [ 2696.415545] Lustre: lustre-OST0000: Imperative Recovery enabled, recovery window shrunk from 60-180 down to 60-180 [ 2697.303202] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 2698.385550] Lustre: lustre-OST0000: deleting orphan objects from 0x280000400:11459 to 0x280000400:11553 [ 2700.488726] Lustre: DEBUG MARKER: oleg109-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid [ 2700.891547] Lustre: DEBUG MARKER: osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec [ 2975.687192] Lustre: DEBUG MARKER: == recovery-small test 53a: touch: drop rep ============== 05:31:58 (1713346318) [ 2976.030828] Lustre: *** cfs_fail_loc=157, val=2147483648*** [ 2976.032359] Lustre: Skipped 3 previous similar messages [ 2976.033628] LustreError: 8624:0:(ldlm_lib.c:3225:target_send_reply_msg()) @@@ dropping reply req@ffff880137125580 x1796570832160128/t60129992360(0) o101->591586b5-12a0-4c75-b897-ddb9d5cfeb54@192.168.201.9@tcp:664/0 lens 648/656 e 0 to 0 dl 1713346324 ref 1 fl Interpret:/0/0 rc 0/0 job:'openfile.0' [ 2983.041098] Lustre: lustre-MDT0000: Client 591586b5-12a0-4c75-b897-ddb9d5cfeb54 (at 192.168.201.9@tcp) reconnecting [ 2983.044155] Lustre: Skipped 12 previous similar messages [ 2983.047277] Lustre: 23847:0:(mdt_recovery.c:200:mdt_req_from_lrd()) @@@ restoring transno req@ffff880086437200 x1796570832160128/t60129992360(0) o101->591586b5-12a0-4c75-b897-ddb9d5cfeb54@192.168.201.9@tcp:671/0 lens 648/3424 e 0 to 0 dl 1713346331 ref 1 fl Interpret:H/2/0 rc 0/0 job:'openfile.0' [ 2984.648152] Lustre: DEBUG MARKER: == recovery-small test 53b: touch: drop rep ============== 05:32:07 (1713346327) [ 2984.988343] LustreError: 23847:0:(ldlm_lib.c:3225:target_send_reply_msg()) @@@ dropping reply req@ffff88012efdb440 x1796570832184448/t60129992365(0) o101->591586b5-12a0-4c75-b897-ddb9d5cfeb54@192.168.201.9@tcp:673/0 lens 648/656 e 0 to 0 dl 1713346333 ref 1 fl Interpret:/0/0 rc 0/0 job:'openfile.0' [ 2991.990335] Lustre: 7944:0:(mdt_recovery.c:200:mdt_req_from_lrd()) @@@ restoring transno req@ffff88012efdd580 x1796570832184448/t60129992365(0) o101->591586b5-12a0-4c75-b897-ddb9d5cfeb54@192.168.201.9@tcp:680/0 lens 648/3424 e 0 to 0 dl 1713346340 ref 1 fl Interpret:H/2/0 rc 0/0 job:'openfile.0' [ 2993.692573] Lustre: DEBUG MARKER: == recovery-small test 53c: touch: drop rep ============== 05:32:16 (1713346336) [ 2994.021161] LustreError: 6848:0:(ldlm_lib.c:3225:target_send_reply_msg()) @@@ dropping reply req@ffff88008720cc00 x1796570832185600/t60129992367(0) o101->591586b5-12a0-4c75-b897-ddb9d5cfeb54@192.168.201.9@tcp:682/0 lens 664/656 e 0 to 0 dl 1713346342 ref 1 fl Interpret:/0/0 rc 0/0 job:'openfile.0' [ 3002.777823] Lustre: DEBUG MARKER: == recovery-small test 54: back in time ================== 05:32:25 (1713346345) [ 3013.486445] Lustre: Failing over lustre-MDT0000 [ 3013.488287] Lustre: Skipped 1 previous similar 
message [ 3013.534601] LustreError: 2733:0:(lprocfs_jobstats.c:137:job_stat_exit()) should not have any items [ 3013.537449] LustreError: 2733:0:(lprocfs_jobstats.c:137:job_stat_exit()) Skipped 3 previous similar messages [ 3013.573019] Lustre: server umount lustre-MDT0000 complete [ 3013.574248] Lustre: Skipped 1 previous similar message [ 3015.179290] LustreError: 11-0: lustre-MDT0000-osp-MDT0001: operation mds_statfs to node 0@lo failed: rc = -107 [ 3015.181798] Lustre: lustre-MDT0000-osp-MDT0001: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 3015.185379] Lustre: Skipped 3 previous similar messages [ 3024.898669] Lustre: 3330:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713346359/real 1713346359] req@ffff8800a073b440 x1796570768360640/t0(0) o400->MGC192.168.201.109@tcp@0@lo:26/25 lens 224/224 e 0 to 1 dl 1713346366 ref 1 fl Rpc:XNQr/0/ffffffff rc 0/-1 job:'kworker/u8:0.0' [ 3024.905371] Lustre: 3330:0:(client.c:2295:ptlrpc_expire_one_request()) Skipped 6 previous similar messages [ 3024.908782] LustreError: 166-1: MGC192.168.201.109@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 3025.807429] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 3030.931537] Lustre: Evicted from MGS (at 192.168.201.109@tcp) after server handle changed from 0x53803acb22c8b6fd to 0x53803acb23734aa1 [ 3030.935908] Lustre: Skipped 1 previous similar message [ 3030.939329] Lustre: MGC192.168.201.109@tcp: Connection restored to 192.168.201.109@tcp (at 0@lo) [ 3030.941308] Lustre: Skipped 3 previous similar messages [ 3031.010360] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 3031.021100] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 3031.024807] Lustre: Skipped 1 previous similar message [ 3031.857119] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 3032.733933] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 3 clients reconnect [ 3032.736425] Lustre: Skipped 1 previous similar message [ 3036.017307] Lustre: lustre-MDT0000: Recovery over after 0:03, of 3 clients 3 recovered and 0 were evicted. [ 3036.020080] Lustre: Skipped 1 previous similar message [ 3036.053334] Lustre: lustre-OST0000: deleting orphan objects from 0x0:85671 to 0x0:85697 [ 3036.053335] Lustre: lustre-OST0001: deleting orphan objects from 0x0:83318 to 0x0:83361 [ 3037.354432] Lustre: DEBUG MARKER: oleg109-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 3037.710453] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 3040.410032] Lustre: DEBUG MARKER: == recovery-small test 55: ost_brw_read/write drops timed-out read/write request ========================================================== 05:33:02 (1713346382) [ 3045.212524] Lustre: *** cfs_fail_loc=21d, val=0*** [ 3045.213630] Lustre: Skipped 3 previous similar messages [ 3045.214635] LustreError: 9276:0:(tgt_handler.c:2738:tgt_brw_write()) lustre-OST0000: Dropping timed-out write from 12345-192.168.201.9@tcp because locking object 0x0:85699 took 0 seconds (limit was 6). 
[ 3045.218773] Lustre: lustre-OST0000: Bulk IO write error with 591586b5-12a0-4c75-b897-ddb9d5cfeb54 (at 192.168.201.9@tcp), client will retry: rc = -110
[ 3052.977685] Lustre: lustre-OST0000: Client 591586b5-12a0-4c75-b897-ddb9d5cfeb54 (at 192.168.201.9@tcp) reconnecting
[ 3052.980157] Lustre: Skipped 2 previous similar messages
[ 3052.982716] LustreError: 9276:0:(tgt_handler.c:2738:tgt_brw_write()) lustre-OST0000: Dropping timed-out write from 12345-192.168.201.9@tcp because locking object 0x0:85699 took 0 seconds (limit was 6).
[ 3052.982768] Lustre: lustre-OST0000: Bulk IO write error with 591586b5-12a0-4c75-b897-ddb9d5cfeb54 (at 192.168.201.9@tcp), client will retry: rc = -110
[ 3052.982769] Lustre: Skipped 8 previous similar messages
[ 3052.994102] LustreError: 9276:0:(tgt_handler.c:2738:tgt_brw_write()) Skipped 16 previous similar messages
[ 3060.011677] LustreError: 9276:0:(tgt_handler.c:2738:tgt_brw_write()) lustre-OST0000: Dropping timed-out write from 12345-192.168.201.9@tcp because locking object 0x0:85699 took 0 seconds (limit was 6).
[ 3060.011715] Lustre: lustre-OST0000: Bulk IO write error with 591586b5-12a0-4c75-b897-ddb9d5cfeb54 (at 192.168.201.9@tcp), client will retry: rc = -110
[ 3060.011716] Lustre: Skipped 8 previous similar messages
[ 3060.020305] LustreError: 9276:0:(tgt_handler.c:2738:tgt_brw_write()) Skipped 8 previous similar messages
[ 3067.015247] LustreError: 9275:0:(tgt_handler.c:2738:tgt_brw_write()) lustre-OST0000: Dropping timed-out write from 12345-192.168.201.9@tcp because locking object 0x0:85698 took 0 seconds (limit was 6).
[ 3067.015255] Lustre: lustre-OST0000: Bulk IO write error with 591586b5-12a0-4c75-b897-ddb9d5cfeb54 (at 192.168.201.9@tcp), client will retry: rc = -110
[ 3067.015257] Lustre: Skipped 8 previous similar messages
[ 3067.025330] LustreError: 9275:0:(tgt_handler.c:2738:tgt_brw_write()) Skipped 8 previous similar messages
[ 3074.017637] LustreError: 9277:0:(tgt_handler.c:2738:tgt_brw_write()) lustre-OST0000: Dropping timed-out write from 12345-192.168.201.9@tcp because locking object 0x0:85699 took 0 seconds (limit was 6).
[ 3074.017710] Lustre: lustre-OST0000: Bulk IO write error with 591586b5-12a0-4c75-b897-ddb9d5cfeb54 (at 192.168.201.9@tcp), client will retry: rc = -110
[ 3074.017711] Lustre: Skipped 8 previous similar messages
[ 3074.024566] LustreError: 9277:0:(tgt_handler.c:2738:tgt_brw_write()) Skipped 8 previous similar messages
[ 3088.024032] LustreError: 17082:0:(tgt_handler.c:2738:tgt_brw_write()) lustre-OST0000: Dropping timed-out write from 12345-192.168.201.9@tcp because locking object 0x0:85699 took 0 seconds (limit was 6).
[ 3088.024066] Lustre: lustre-OST0000: Bulk IO write error with 591586b5-12a0-4c75-b897-ddb9d5cfeb54 (at 192.168.201.9@tcp), client will retry: rc = -110
[ 3088.024067] Lustre: Skipped 17 previous similar messages
[ 3088.039228] LustreError: 17082:0:(tgt_handler.c:2738:tgt_brw_write()) Skipped 17 previous similar messages
[ 3108.441968] LustreError: 9275:0:(tgt_handler.c:2738:tgt_brw_write()) lustre-OST0000: Dropping timed-out write from 12345-192.168.201.9@tcp because locking object 0x0:85699 took 0 seconds (limit was 6).
[ 3108.441984] Lustre: lustre-OST0000: Bulk IO write error with 591586b5-12a0-4c75-b897-ddb9d5cfeb54 (at 192.168.201.9@tcp), client will retry: rc = -110
[ 3108.441986] Lustre: Skipped 27 previous similar messages
[ 3108.449140] LustreError: 9275:0:(tgt_handler.c:2738:tgt_brw_write()) Skipped 29 previous similar messages
[ 3120.771094] Lustre: DEBUG MARKER: == recovery-small test 56: do not fail on getattr resend ========================================================== 05:34:23 (1713346463)
[ 3121.024521] LustreError: 23847:0:(fail.c:138:__cfs_fail_timeout_set()) cfs_fail_timeout id 136 sleeping for 40000ms
[ 3161.026716] LustreError: 23847:0:(fail.c:149:__cfs_fail_timeout_set()) cfs_fail_timeout id 136 awake
[ 3162.677310] Lustre: DEBUG MARKER: == recovery-small test 57: read procfs entries causes kernel crash ========================================================== 05:35:05 (1713346505)
[ 3166.527334] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc
[ 3166.555483] LustreError: 166-1: MGC192.168.201.109@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail
[ 3166.559462] Lustre: Evicted from MGS (at 192.168.201.109@tcp) after server handle changed from 0x53803acb23734aa1 to 0x53803acb2373514d
[ 3166.636371] LustreError: 6043:0:(mdt_handler.c:7428:mdt_iocontrol()) lustre-MDT0000: Aborting recovery for device
[ 3166.638426] LustreError: 6043:0:(ldlm_lib.c:2883:target_stop_recovery_thread()) lustre-MDT0000: Aborting recovery
[ 3166.640566] Lustre: 6075:0:(ldlm_lib.c:2289:target_recovery_overseer()) recovery is aborted, evict exports in recovery
[ 3166.643463] Lustre: 6075:0:(ldlm_lib.c:2289:target_recovery_overseer()) Skipped 2 previous similar messages
[ 3166.645916] Lustre: lustre-MDT0000: disconnecting 1 stale clients
[ 3166.663877] Lustre: lustre-OST0000: deleting orphan objects from 0x0:85701 to 0x0:85729
[ 3166.663886] Lustre: lustre-OST0001: deleting orphan objects from 0x0:83318 to 0x0:83393
[ 3167.515341] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8
[ 3171.646780] LustreError: 167-0: lustre-MDT0000-osp-MDT0001: This client was evicted by lustre-MDT0000; in progress operations using this service will fail.
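The tgt_brw_write() entries above show the OST deliberately dropping bulk writes and answering rc = -110 so the client resends; in this run the drop is forced by cfs_fail_loc=21d, which is why it fires even though locking took 0 of the allowed 6 seconds. A minimal userspace sketch of that drop-and-retry decision; the struct and field names are simplified assumptions, not the real tgt_brw_write() interface:

/* Hedged sketch of the timed-out bulk-write drop seen above.
 * If taking the object lock exceeded the service budget, drop the
 * request with -ETIMEDOUT (-110) and let the client retry. */
#include <stdio.h>
#include <errno.h>

struct brw_req {
    const char *client_nid;        /* e.g. "12345-192.168.201.9@tcp" */
    unsigned long long objid;      /* object id, "0x0:<objid>" in the log */
    long lock_seconds;             /* time spent locking the object */
    long limit_seconds;            /* budget, "(limit was 6)" in the log */
};

static int brw_write_check(const struct brw_req *req)
{
    if (req->lock_seconds > req->limit_seconds) {
        fprintf(stderr,
                "Dropping timed-out write from %s because locking object "
                "0x0:%llu took %ld seconds (limit was %ld).\n",
                req->client_nid, req->objid,
                req->lock_seconds, req->limit_seconds);
        return -ETIMEDOUT;         /* -110: client will retry */
    }
    return 0;                      /* proceed with the bulk transfer */
}

int main(void)
{
    /* Illustrative values only; in the log the drop path is forced. */
    struct brw_req slow = { "12345-192.168.201.9@tcp", 85699, 7, 6 };
    printf("rc = %d\n", brw_write_check(&slow));
    return 0;
}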
[ 3171.650124] LustreError: Skipped 1 previous similar message
[ 3177.708769] Lustre: DEBUG MARKER: == recovery-small test 58: Eviction in the middle of open RPC reply processing ========================================================== 05:35:20 (1713346520)
[ 3189.791756] Lustre: 8624:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713346521/real 1713346521] req@ffff8800a0771300 x1796570768407808/t0(0) o104->lustre-MDT0000@192.168.201.9@tcp:15/16 lens 328/224 e 0 to 1 dl 1713346532 ref 1 fl Rpc:XQr/0/ffffffff rc 0/-1 job:''
[ 3191.305456] Lustre: DEBUG MARKER: == recovery-small test 59: Read cancel race on client eviction ========================================================== 05:35:33 (1713346533)
[ 3201.626104] LustreError: 13897:0:(ldlm_lockd.c:721:ldlm_handle_ast_error()) ### client (nid 192.168.201.9@tcp) returned error from blocking AST (req@ffff88009b8ced40 x1796570768412544 status -107 rc -107), evict it ns: filter-lustre-OST0001_UUID lock: ffff8800a8faa880/0x53803acb23735624 lrc: 4/0,0 mode: PW/PW res: [0x145c2:0x0:0x0].0x0 rrc: 3 type: EXT [0->18446744073709551615] (req 0->4095) gid 0 flags: 0x60000400000020 nid: 192.168.201.9@tcp remote: 0x7afd575d89e3413b expref: 5 pid: 13897 timeout: 3300 lvb_type: 0
[ 3201.636386] LustreError: 138-a: lustre-OST0001: A client on nid 192.168.201.9@tcp was evicted due to a lock blocking callback time out: rc -107
[ 3201.639667] LustreError: 6838:0:(ldlm_lockd.c:261:expired_lock_main()) ### lock callback timer expired after 0s: evicting client at 192.168.201.9@tcp ns: filter-lustre-OST0001_UUID lock: ffff8800a8faa880/0x53803acb23735624 lrc: 3/0,0 mode: PW/PW res: [0x145c2:0x0:0x0].0x0 rrc: 3 type: EXT [0->18446744073709551615] (req 0->4095) gid 0 flags: 0x60000400000020 nid: 192.168.201.9@tcp remote: 0x7afd575d89e3413b expref: 6 pid: 13897 timeout: 0 lvb_type: 0
[ 3201.647166] LustreError: 6838:0:(ldlm_lockd.c:261:expired_lock_main()) Skipped 1 previous similar message
[ 3203.064218] Lustre: DEBUG MARKER: == recovery-small test 60: Add Changelog entries during MDS failover ========================================================== 05:35:45 (1713346545)
[ 3203.099260] LustreError: 16732:0:(ldlm_lockd.c:721:ldlm_handle_ast_error()) ### client (nid 192.168.201.9@tcp) returned error from blocking AST (req@ffff88009b8cbdc0 x1796570768413568 status -107 rc -107), evict it ns: mdt-lustre-MDT0000_UUID lock: ffff880082230fc0/0x53803acb23735640 lrc: 4/0,0 mode: PR/PR res: [0x200000007:0x1:0x0].0x0 bits 0x13/0x0 rrc: 4 type: IBT gid 0 flags: 0x60200400000020 nid: 192.168.201.9@tcp remote: 0x7afd575d89e34149 expref: 6 pid: 23847 timeout: 3302 lvb_type: 0
[ 3203.108458] LustreError: 138-a: lustre-MDT0000: A client on nid 192.168.201.9@tcp was evicted due to a lock blocking callback time out: rc -107
[ 3203.111793] LustreError: 6838:0:(ldlm_lockd.c:261:expired_lock_main()) ### lock callback timer expired after 0s: evicting client at 192.168.201.9@tcp ns: mdt-lustre-MDT0000_UUID lock: ffff880082230fc0/0x53803acb23735640 lrc: 3/0,0 mode: PR/PR res: [0x200000007:0x1:0x0].0x0 bits 0x13/0x0 rrc: 4 type: IBT gid 0 flags: 0x60200400000020 nid: 192.168.201.9@tcp remote: 0x7afd575d89e34149 expref: 7 pid: 23847 timeout: 0 lvb_type: 0
[ 3203.826973] Lustre: lustre-MDD0000: changelog on
[ 3204.532061] Lustre: lustre-MDD0001: changelog on
[ 3220.483705] Lustre: lustre-MDT0000: Not available for connect from 192.168.201.9@tcp (stopping)
[ 3220.487184] Lustre: Skipped 3 previous similar messages
[ 3232.846814] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc
[ 3234.739256] Lustre: Evicted from MGS (at 192.168.201.109@tcp) after server handle changed from 0x53803acb2373514d to 0x53803acb23767fa7
[ 3234.852695] Lustre: lustre-MDD0000: changelog on
[ 3235.751020] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8
[ 3239.854325] Lustre: lustre-OST0000: deleting orphan objects from 0x0:87015 to 0x0:87041
[ 3239.854330] Lustre: lustre-OST0001: deleting orphan objects from 0x0:84679 to 0x0:84705
[ 3243.102494] Lustre: lustre-MDT0001: haven't heard from client 058761c1-046b-4295-aa5f-51ff0e4fd862 (at 192.168.201.9@tcp) in 52 seconds. I think it's dead, and I am evicting it. exp ffff880082634000, cur 1713346586 expire 1713346556 last 1713346534
[ 3244.250936] Lustre: lustre-OST0000: haven't heard from client 058761c1-046b-4295-aa5f-51ff0e4fd862 (at 192.168.201.9@tcp) in 53 seconds. I think it's dead, and I am evicting it. exp ffff88009fe1a800, cur 1713346587 expire 1713346557 last 1713346534
[ 3253.667122] Lustre: lustre-MDD0000: changelog off
[ 3254.544410] Lustre: lustre-MDD0001: changelog off
[ 3258.075707] Lustre: DEBUG MARKER: == recovery-small test 61: Verify to not reuse orphan objects - bug 17025 ========================================================== 05:36:40 (1713346600)
[ 3260.682026] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000
[ 3265.002829] LDISKFS-fs (dm-0): recovery complete
[ 3265.004208] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc
[ 3265.033686] LustreError: 166-1: MGC192.168.201.109@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail
[ 3265.036795] LustreError: Skipped 1 previous similar message
[ 3265.124790] LustreError: 12101:0:(mdt_handler.c:7428:mdt_iocontrol()) lustre-MDT0000: Aborting recovery for device
[ 3265.127279] LustreError: 12101:0:(ldlm_lib.c:2883:target_stop_recovery_thread()) lustre-MDT0000: Aborting recovery
[ 3265.129656] Lustre: 12131:0:(ldlm_lib.c:2289:target_recovery_overseer()) recovery is aborted, evict exports in recovery
[ 3265.132162] Lustre: 12131:0:(ldlm_lib.c:2289:target_recovery_overseer()) Skipped 2 previous similar messages
[ 3265.134443] Lustre: lustre-MDT0000: disconnecting 2 stale clients
[ 3265.150411] Lustre: lustre-OST0000: deleting orphan objects from 0x0:87015 to 0x0:87073
[ 3265.150412] Lustre: lustre-OST0001: deleting orphan objects from 0x0:84679 to 0x0:84737
[ 3266.070047] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8
[ 3270.108286] LustreError: 167-0: lustre-MDT0000-osp-MDT0001: This client was evicted by lustre-MDT0000; in progress operations using this service will fail.
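The two "haven't heard from client ... I think it's dead, and I am evicting it" entries above print the export's cur/expire/last triple before eviction. A small sketch of the staleness test those fields imply, using the values from the 3243.102494 entry; the struct layout and timeout handling are simplifications, not the actual obd_export:

/* Hedged sketch: an export is stale once "cur" passes its "expire"
 * deadline, which was derived from the last RPC heard ("last"). */
#include <stdio.h>
#include <time.h>
#include <stdbool.h>

struct export {
    const char *client_uuid;
    time_t last_request;   /* "last": last RPC heard from the client */
    time_t deadline;       /* "expire": last_request + server grace */
};

static bool export_is_stale(const struct export *exp, time_t now)
{
    return now > exp->deadline;    /* evict only past the deadline */
}

int main(void)
{
    /* Values copied from the log entry at 3243.102494 above. */
    struct export exp = { "058761c1-046b-4295-aa5f-51ff0e4fd862",
                          1713346534, 1713346556 };
    time_t cur = 1713346586;

    if (export_is_stale(&exp, cur))
        printf("haven't heard from client %s in %ld seconds; evicting "
               "(cur %ld expire %ld last %ld)\n",
               exp.client_uuid, (long)(cur - exp.last_request),
               (long)cur, (long)exp.deadline, (long)exp.last_request);
    return 0;
}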
[ 3276.198225] Lustre: DEBUG MARKER: == recovery-small test 65: lock enqueue for destroyed export ========================================================== 05:36:58 (1713346618)
[ 3276.568260] LustreError: 868:0:(fail.c:138:__cfs_fail_timeout_set()) cfs_fail_timeout id 31e sleeping for 6000ms
[ 3276.573386] Lustre: *** cfs_fail_loc=31e, val=0***
[ 3276.574534] Lustre: Skipped 12 previous similar messages
[ 3278.567423] LustreError: 18490:0:(fail.c:138:__cfs_fail_timeout_set()) cfs_fail_timeout id 31e sleeping for 6000ms
[ 3280.801712] Lustre: 13178:0:(genops.c:1710:obd_export_evict_by_uuid()) lustre-OST0000: evicting 070f00cc-2a1e-454b-8413-9f9941dd24a7 at administrative request
[ 3280.804955] LustreError: 9286:0:(fail.c:138:__cfs_fail_timeout_set()) cfs_fail_timeout id 31e sleeping for 4000ms
[ 3282.569707] LustreError: 868:0:(fail.c:149:__cfs_fail_timeout_set()) cfs_fail_timeout id 31e awake
[ 3282.571837] LustreError: 868:0:(ldlm_lockd.c:1427:ldlm_handle_enqueue0()) ### lock on destroyed export ffff8800b4047800 ns: filter-lustre-OST0000_UUID lock: ffff8800a8fab600/0x53803acb237998de lrc: 3/0,0 mode: --/PW res: [0x15423:0x0:0x0].0x0 rrc: 4 type: EXT [0->4095] (req 0->4095) gid 0 flags: 0x70000000020020 nid: 192.168.201.9@tcp remote: 0x7afd575d89e40374 expref: 3 pid: 868 timeout: 0 lvb_type: 0
[ 3283.070721] LustreError: 18490:0:(fail.c:144:__cfs_fail_timeout_set()) cfs_fail_timeout interrupted
[ 3285.278500] Lustre: lustre-MDT0000: Client 0b996b30-597f-4986-a768-64de21f59e6c (at 192.168.201.9@tcp) reconnecting
[ 3285.281333] Lustre: Skipped 14 previous similar messages
[ 3290.118003] Lustre: DEBUG MARKER: == recovery-small test 66: lock enqueue re-send vs client eviction ========================================================== 05:37:12 (1713346632)
[ 3290.501120] Lustre: *** cfs_fail_loc=157, val=2147483648***
[ 3290.502601] Lustre: Skipped 2 previous similar messages
[ 3290.503658] LustreError: 6847:0:(ldlm_lib.c:3225:target_send_reply_msg()) @@@ dropping reply req@ffff88009b84aac0 x1796570834329408/t0(0) o101->070f00cc-2a1e-454b-8413-9f9941dd24a7@192.168.201.9@tcp:273/0 lens 592/640 e 0 to 0 dl 1713346688 ref 1 fl Interpret:/0/0 rc 0/0 job:'stat.0'
[ 3292.466765] LustreError: 6847:0:(fail.c:138:__cfs_fail_timeout_set()) cfs_fail_timeout id 136 sleeping for 40000ms
[ 3294.703425] Lustre: 13704:0:(genops.c:1710:obd_export_evict_by_uuid()) lustre-MDT0000: evicting 070f00cc-2a1e-454b-8413-9f9941dd24a7 at administrative request
[ 3298.870662] LustreError: 6847:0:(fail.c:144:__cfs_fail_timeout_set()) cfs_fail_timeout interrupted
[ 3298.872456] LustreError: 6847:0:(fail.c:144:__cfs_fail_timeout_set()) Skipped 1 previous similar message
[ 3300.505358] Lustre: DEBUG MARKER: == recovery-small test 67: connect vs import invalidate race ========================================================== 05:37:22 (1713346642)
[ 3302.813433] Lustre: 14104:0:(genops.c:1710:obd_export_evict_by_uuid()) lustre-MDT0000: evicting 070f00cc-2a1e-454b-8413-9f9941dd24a7 at administrative request
[ 3315.705406] Lustre: DEBUG MARKER: == recovery-small test 100: IR: Make sure normal recovery still works w/o IR ========================================================== 05:37:38 (1713346658)
[ 3318.619066] LustreError: 137-5: lustre-OST0000_UUID: not available for connect from 192.168.201.9@tcp (no target). If you are running an HA pair check that the target is mounted on the other server.
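The 137-5 refusal that closes this block is the generic "no such target mounted" path: a connect names a target UUID, and if nothing by that name is currently mounted the server refuses and prints the HA hint. A hedged sketch of the lookup it implies, with a hypothetical in-memory registry standing in for the real mounted-target list:

/* Hedged sketch of the 137-5 "(no target)" refusal; the registry and
 * function names here are illustrative, not Lustre's internals. */
#include <stdio.h>
#include <string.h>
#include <errno.h>

static const char *mounted_targets[] = {
    "lustre-MDT0000_UUID", "lustre-OST0001_UUID",
};

static int target_connect(const char *uuid, const char *client_nid)
{
    for (size_t i = 0; i < sizeof(mounted_targets) / sizeof(*mounted_targets); i++)
        if (strcmp(mounted_targets[i], uuid) == 0)
            return 0;              /* target is mounted here: accept */
    fprintf(stderr, "137-5: %s: not available for connect from %s (no "
            "target). If you are running an HA pair check that the "
            "target is mounted on the other server.\n", uuid, client_nid);
    return -ENODEV;                /* refuse the connection */
}

int main(void)
{
    target_connect("lustre-OST0000_UUID", "192.168.201.9@tcp"); /* refused */
    target_connect("lustre-MDT0000_UUID", "192.168.201.9@tcp"); /* accepted */
    return 0;
}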
[ 3318.623341] LustreError: Skipped 75 previous similar messages
[ 3329.101872] LDISKFS-fs (dm-2): file extents enabled, maximum tree depth=5
[ 3329.105080] LDISKFS-fs (dm-2): mounted filesystem with ordered data mode. Opts: errors=remount-ro,no_mbcache,nodelalloc
[ 3330.014556] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8
[ 3334.176895] Lustre: lustre-OST0000: deleting orphan objects from 0x280000400:12770 to 0x280000400:12801
[ 3334.180638] Lustre: lustre-OST0000: deleting orphan objects from 0x0:87077 to 0x0:87105
[ 3335.471852] Lustre: DEBUG MARKER: oleg109-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid
[ 3335.854943] Lustre: DEBUG MARKER: osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec
[ 3338.964744] Lustre: DEBUG MARKER: == recovery-small test 101a: IR: Make sure IR works w/o normal recovery ========================================================== 05:38:01 (1713346681)
[ 3352.095119] LDISKFS-fs (dm-2): file extents enabled, maximum tree depth=5
[ 3352.098463] LDISKFS-fs (dm-2): mounted filesystem with ordered data mode. Opts: errors=remount-ro,no_mbcache,nodelalloc
[ 3352.151899] Lustre: lustre-OST0000: Imperative Recovery enabled, recovery window shrunk from 60-180 down to 60-180
[ 3352.987833] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8
[ 3353.382176] Lustre: lustre-OST0000: deleting orphan objects from 0x280000400:12770 to 0x280000400:12833
[ 3353.383453] Lustre: lustre-OST0000: deleting orphan objects from 0x0:87077 to 0x0:87137
[ 3355.988088] Lustre: DEBUG MARKER: oleg109-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid
[ 3356.359583] Lustre: DEBUG MARKER: osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec
[ 3359.565280] Lustre: DEBUG MARKER: == recovery-small test 101b: IR: Make sure IR works w/o normal recovery and proceed EAGAIN ========================================================== 05:38:22 (1713346702)
[ 3372.939970] LDISKFS-fs (dm-2): file extents enabled, maximum tree depth=5
[ 3372.944241] LDISKFS-fs (dm-2): mounted filesystem with ordered data mode. Opts: errors=remount-ro,no_mbcache,nodelalloc
[ 3372.995056] Lustre: lustre-OST0000: Imperative Recovery enabled, recovery window shrunk from 60-180 down to 60-180
[ 3373.000003] LustreError: 19452:0:(fail.c:138:__cfs_fail_timeout_set()) cfs_fail_timeout id 247 sleeping for 25000ms
[ 3398.001740] LustreError: 19452:0:(fail.c:149:__cfs_fail_timeout_set()) cfs_fail_timeout id 247 awake
[ 3398.751173] Lustre: lustre-OST0000: deleting orphan objects from 0x280000400:12770 to 0x280000400:12865
[ 3398.755087] Lustre: lustre-OST0000: deleting orphan objects from 0x0:87077 to 0x0:87169
[ 3398.898057] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8
[ 3402.015727] Lustre: DEBUG MARKER: oleg109-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid
[ 3402.382985] Lustre: DEBUG MARKER: osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec
[ 3405.027121] Lustre: DEBUG MARKER: == recovery-small test 102: IR: New client gets updated nidtbl after MGS restart ========================================================== 05:39:07 (1713346747)
[ 3418.187992] LDISKFS-fs (dm-2): file extents enabled, maximum tree depth=5
[ 3418.191087] LDISKFS-fs (dm-2): mounted filesystem with ordered data mode. Opts: errors=remount-ro,no_mbcache,nodelalloc
[ 3418.240033] Lustre: lustre-OST0000: Imperative Recovery enabled, recovery window shrunk from 60-180 down to 60-180
[ 3419.085014] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8
[ 3420.073271] Lustre: lustre-OST0000: deleting orphan objects from 0x280000400:12770 to 0x280000400:12897
[ 3420.077549] Lustre: lustre-OST0000: deleting orphan objects from 0x0:87077 to 0x0:87201
[ 3422.102462] Lustre: DEBUG MARKER: oleg109-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid
[ 3422.467953] Lustre: DEBUG MARKER: osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec
[ 3426.728061] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc
[ 3426.761704] LustreError: 166-1: MGC192.168.201.109@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail
[ 3426.768005] Lustre: Evicted from MGS (at 192.168.201.109@tcp) after server handle changed from 0x53803acb2379932e to 0x53803acb2379a499
[ 3426.770961] Lustre: Skipped 1 previous similar message
[ 3427.709681] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8
[ 3431.856527] Lustre: lustre-OST0001: deleting orphan objects from 0x0:84739 to 0x0:84769
[ 3440.913179] LDISKFS-fs (dm-2): file extents enabled, maximum tree depth=5
[ 3440.916750] LDISKFS-fs (dm-2): mounted filesystem with ordered data mode. Opts: errors=remount-ro,no_mbcache,nodelalloc
[ 3441.867186] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8
[ 3442.679983] Lustre: lustre-OST0000: deleting orphan objects from 0x0:87077 to 0x0:87233
[ 3442.682673] Lustre: lustre-OST0000: deleting orphan objects from 0x280000400:12770 to 0x280000400:12929
[ 3444.947360] Lustre: DEBUG MARKER: oleg109-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid
[ 3447.829511] Lustre: DEBUG MARKER: == recovery-small test 103: IR: MDS can start w/o MGS and get updated nidtbl later ========================================================== 05:39:50 (1713346790)
[ 3448.417459] Lustre: DEBUG MARKER: SKIP: recovery-small test_103 needs separate mgs and mds
[ 3448.797385] Lustre: DEBUG MARKER: == recovery-small test 104: IR: ost can disable IR voluntarily ========================================================== 05:39:51 (1713346791)
[ 3451.912181] LDISKFS-fs (dm-2): file extents enabled, maximum tree depth=5
[ 3451.915430] LDISKFS-fs (dm-2): mounted filesystem with ordered data mode. Opts: errors=remount-ro,no_mbcache,nodelalloc
[ 3452.855074] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8
[ 3453.677879] Lustre: lustre-OST0000: deleting orphan objects from 0x0:87077 to 0x0:87265
[ 3453.678493] Lustre: lustre-OST0000: deleting orphan objects from 0x280000400:12770 to 0x280000400:12961
[ 3456.544509] Lustre: DEBUG MARKER: == recovery-small test 105: IR: NON IR clients support === 05:39:59 (1713346799)
[ 3456.912168] Lustre: DEBUG MARKER: SKIP: recovery-small test_105 Needs multiple clients
[ 3457.326028] Lustre: DEBUG MARKER: == recovery-small test 106: lightweight connection support ========================================================== 05:39:59 (1713346799)
[ 3459.889592] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000
[ 3469.002741] Lustre: 3329:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713346804/real 1713346804] req@ffff88011e415a40 x1796570768862080/t0(0) o400->MGC192.168.201.109@tcp@0@lo:26/25 lens 224/224 e 0 to 1 dl 1713346811 ref 1 fl Rpc:XNQr/0/ffffffff rc 0/-1 job:'kworker/u8:2.0'
[ 3469.013632] Lustre: 3329:0:(client.c:2295:ptlrpc_expire_one_request()) Skipped 1 previous similar message
[ 3473.730569] LDISKFS-fs (dm-0): recovery complete
[ 3473.732264] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc
[ 3475.978156] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8
[ 3480.108042] LustreError: 28877:0:(ldlm_lockd.c:906:ldlm_server_blocking_ast()) ### BUG 6063: lock collide during recovery ns: mdt-lustre-MDT0000_UUID lock: ffff88009698a400/0x53803acb2379af9e lrc: 3/0,0 mode: PR/PR res: [0x200000007:0x1:0x0].0x0 bits 0x13/0x0 rrc: 3 type: IBT gid 0 flags: 0x40200000000020 nid: 192.168.201.9@tcp remote: 0x7afd575d89e406d8 expref: 7 pid: 16732 timeout: 0 lvb_type: 0
[ 3480.134986] Lustre: lustre-OST0001: deleting orphan objects from 0x0:84771 to 0x0:84801
[ 3480.135006] Lustre: lustre-OST0000: deleting orphan objects from 0x0:87077 to 0x0:87297
[ 3482.064898] Lustre: DEBUG MARKER: == recovery-small test 107: drop reint reply, then restart MDT ========================================================== 05:40:24 (1713346824)
[ 3482.316678] Lustre: *** cfs_fail_loc=119, val=2147483648***
[ 3482.318260] LustreError: 6846:0:(ldlm_lib.c:3225:target_send_reply_msg()) @@@ dropping reply req@ffff880095116d40 x1796570834362048/t85899345924(0) o36->ec3e33ef-1eca-4217-9415-31841d735390@192.168.201.9@tcp:465/0 lens 504/448 e 0 to 0 dl 1713346880 ref 1 fl Interpret:/0/0 rc 0/0 job:'mkdir.0'
[ 3495.291026] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc
[ 3499.113751] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8
[ 3503.233401] Lustre: 6846:0:(mdt_recovery.c:200:mdt_req_from_lrd()) @@@ restoring transno req@ffff880129d62140 x1796570834362048/t85899345924(0) o36->ec3e33ef-1eca-4217-9415-31841d735390@192.168.201.9@tcp:486/0 lens 504/448 e 0 to 0 dl 1713346901 ref 1 fl Interpret:/2/0 rc 0/0 job:'mkdir.0'
[ 3503.241037] Lustre: 6846:0:(mdt_recovery.c:200:mdt_req_from_lrd()) Skipped 1 previous similar message
[ 3503.245326] Lustre: lustre-OST0001: deleting orphan objects from 0x0:84771 to 0x0:84833
[ 3503.245808] Lustre: lustre-OST0000: deleting orphan objects from 0x0:87077 to 0x0:87329
[ 3504.534793] Lustre: DEBUG MARKER: oleg109-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid
[ 3504.896764] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec
[ 3507.612517] Lustre: DEBUG MARKER: == recovery-small test 108: client eviction don't crash == 05:40:50 (1713346850)
[ 3507.989911] Lustre: 31664:0:(genops.c:1710:obd_export_evict_by_uuid()) lustre-OST0000: evicting ec3e33ef-1eca-4217-9415-31841d735390 at administrative request
[ 3516.478551] Lustre: DEBUG MARKER: == recovery-small test 110a: create remote directory: drop client req ========================================================== 05:40:59 (1713346859)
[ 3517.185250] Lustre: *** cfs_fail_loc=123, val=2147483648***
[ 3517.187446] Lustre: Skipped 92 previous similar messages
[ 3573.196325] Lustre: lustre-MDT0000: Client ec3e33ef-1eca-4217-9415-31841d735390 (at 192.168.201.9@tcp) reconnecting
[ 3573.199241] Lustre: Skipped 2 previous similar messages
[ 3574.983408] Lustre: DEBUG MARKER: == recovery-small test 110b: create remote directory: drop Master rep ========================================================== 05:41:57 (1713346917)
[ 3575.255252] LustreError: 16732:0:(ldlm_lib.c:3225:target_send_reply_msg()) @@@ dropping reply req@ffff8800a073c280 x1796570834376384/t4295043263(0) o36->ec3e33ef-1eca-4217-9415-31841d735390@192.168.201.9@tcp:558/0 lens 560/448 e 0 to 0 dl 1713346973 ref 1 fl Interpret:/0/0 rc 0/0 job:'lfs.0'
[ 3631.248767] Lustre: 6846:0:(mdt_recovery.c:200:mdt_req_from_lrd()) @@@ restoring transno req@ffff8800820f84c0 x1796570834376384/t4295043263(0) o36->ec3e33ef-1eca-4217-9415-31841d735390@192.168.201.9@tcp:614/0 lens 560/448 e 0 to 0 dl 1713347029 ref 1 fl Interpret:/2/0 rc 0/0 job:'lfs.0'
[ 3633.029849] Lustre: DEBUG MARKER: == recovery-small test 110c: create remote directory: drop update rep on slave MDT ========================================================== 05:42:55 (1713346975)
[ 3633.303958] Lustre: *** cfs_fail_loc=1701, val=2147483648***
[ 3633.305661] Lustre: Skipped 1 previous similar message
[ 3640.302797] Lustre: 8071:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713346976/real 1713346976] req@ffff880129d617c0 x1796570768915712/t0(0) o1000->lustre-MDT0000-osp-MDT0001@0@lo:24/4 lens 4144/4320 e 0 to 1 dl 1713346983 ref 2 fl Rpc:XQr/0/ffffffff rc 0/-1 job:'osp_up0-1.0'
[ 3640.314507] Lustre: 8071:0:(client.c:2295:ptlrpc_expire_one_request()) Skipped 1 previous similar message
[ 3640.318291] Lustre: lustre-MDT0000-osp-MDT0001: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete
[ 3640.324892] Lustre: Skipped 39 previous similar messages
[ 3640.328534] Lustre: lustre-MDT0000: Received new MDS connection from 0@lo, keep former export from same NID
[ 3640.332980] Lustre: lustre-MDT0000-osp-MDT0001: Connection restored to 192.168.201.109@tcp (at 0@lo)
[ 3640.335797] Lustre: Skipped 45 previous similar messages
[ 3642.466161] Lustre: DEBUG MARKER: == recovery-small test 110d: remove remote directory: drop client req ========================================================== 05:43:04 (1713346984)
[ 3700.537428] Lustre: DEBUG MARKER: == recovery-small test 110e: remove remote directory: drop master rep ========================================================== 05:44:03 (1713347043)
[ 3756.848746] Lustre: 6846:0:(mdt_recovery.c:200:mdt_req_from_lrd()) @@@ restoring transno req@ffff880082e8da40 x1796570834391488/t4295043282(0) o36->ec3e33ef-1eca-4217-9415-31841d735390@192.168.201.9@tcp:739/0 lens 496/2888 e 0 to 0 dl 1713347154 ref 1 fl Interpret:/2/0 rc 0/0 job:'rm.0'
[ 3759.067801] Lustre: DEBUG MARKER: == recovery-small test 110f: remove remote directory: drop slave rep ========================================================== 05:45:01 (1713347101)
[ 3759.525464] LustreError: 26982:0:(ldlm_lib.c:3225:target_send_reply_msg()) @@@ dropping reply req@ffff880092493dc0 x1796570768953088/t90194313285(0) o1000->lustre-MDT0001-mdtlov_UUID@0@lo:693/0 lens 488/4320 e 0 to 0 dl 1713347108 ref 1 fl Interpret:/0/0 rc 0/0 job:'osp_up0-1.0'
[ 3759.537266] LustreError: 26982:0:(ldlm_lib.c:3225:target_send_reply_msg()) Skipped 2 previous similar messages
[ 3766.525818] Lustre: lustre-MDT0000: Received new MDS connection from 0@lo, keep former export from same NID
[ 3768.527260] Lustre: DEBUG MARKER: == recovery-small test 110g: drop reply during migration ========================================================== 05:45:11 (1713347111)
[ 3826.348532] Lustre: DEBUG MARKER: == recovery-small test 110h: drop update reply during cross-MDT file rename ========================================================== 05:46:08 (1713347168)
[ 3828.559552] Lustre: DEBUG MARKER: == recovery-small test 110i: drop update reply during cross-MDT dir rename ========================================================== 05:46:10 (1713347170)
[ 3833.698807] Lustre: lustre-MDT0001: Received new MDS connection from 0@lo, keep former export from same NID
[ 3840.990340] Lustre: lustre-MDT0000: Received new MDS connection from 0@lo, keep former export from same NID
[ 3843.102960] Lustre: DEBUG MARKER: == recovery-small test 110j: drop update reply during cross-MDT ln ========================================================== 05:46:25 (1713347185)
[ 3850.604365] Lustre: lustre-MDT0000: Received new MDS connection from 0@lo, keep former export from same NID
[ 3852.367276] Lustre: DEBUG MARKER: == recovery-small test 110k: FID_QUERY failed during recovery ========================================================== 05:46:34 (1713347194)
[ 3853.114245] Lustre: Failing over lustre-MDT0001
[ 3853.115771] Lustre: Skipped 12 previous similar messages
[ 3853.153387] LustreError: 3923:0:(lprocfs_jobstats.c:137:job_stat_exit()) should not have any items
[ 3853.155270] LustreError: 3923:0:(lprocfs_jobstats.c:137:job_stat_exit()) Skipped 70 previous similar messages
[ 3853.189326] Lustre: server umount lustre-MDT0001 complete
[ 3853.191030] Lustre: Skipped 12 previous similar messages
[ 3853.739116] LustreError: 11-0: lustre-MDT0001-osp-MDT0000: operation mds_statfs to node 0@lo failed: rc = -107
[ 3853.741113] LustreError: Skipped 9 previous similar messages
[ 3855.769801] LDISKFS-fs (dm-1): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc
[ 3855.903209] Lustre: lustre-MDT0001: Imperative Recovery not enabled, recovery window 60-180
[ 3855.907811] Lustre: Skipped 9 previous similar messages
[ 3855.915098] Lustre: *** cfs_fail_loc=1103, val=0***
[ 3855.920811] Lustre: lustre-MDT0001: in recovery but waiting for the first client to connect
[ 3855.920989] LustreError: 4592:0:(mdt_handler.c:7428:mdt_iocontrol()) lustre-MDT0001: Aborting recovery for device
[ 3855.920993] LustreError: 4592:0:(ldlm_lib.c:2883:target_stop_recovery_thread()) lustre-MDT0001: Aborting recovery
[ 3855.933146] Lustre: Skipped 16 previous similar messages
[ 3855.934966] Lustre: 4614:0:(ldlm_lib.c:2289:target_recovery_overseer()) recovery is aborted, evict exports in recovery
[ 3855.938339] Lustre: 4614:0:(ldlm_lib.c:2289:target_recovery_overseer()) Skipped 2 previous similar messages
[ 3857.926596] LustreError: 4613:0:(lod_dev.c:424:lod_sub_recovery_thread()) lustre-MDT0000-osp-MDT0001 get update log failed: rc = -108
[ 3857.929069] Lustre: lustre-MDT0001: disconnecting 1 stale clients
[ 3857.945055] Lustre: lustre-OST0001: deleting orphan objects from 0x2c0000400:12649 to 0x2c0000400:12673
[ 3857.945082] Lustre: lustre-OST0000: deleting orphan objects from 0x280000400:12963 to 0x280000400:12993
[ 3858.847140] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8
[ 3862.217376] LDISKFS-fs (dm-1): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc
[ 3862.339184] Lustre: lustre-OST0000: deleting orphan objects from 0x280000400:12963 to 0x280000400:13025
[ 3862.339189] Lustre: lustre-OST0001: deleting orphan objects from 0x2c0000400:12649 to 0x2c0000400:12705
[ 3863.342484] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8
[ 3867.341884] LustreError: 167-0: lustre-MDT0001-osp-MDT0000: This client was evicted by lustre-MDT0001; in progress operations using this service will fail.
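Many entries in this section come in cfs_fail_timeout "sleeping for Nms" / "awake" pairs (ids 136, 247, 31e, and others), which is the test harness's fault-injection delay: when the armed fail id matches the one at the check point, the thread sleeps for the requested interval and logs both transitions. A userspace analogue of that check-and-sleep pattern; the fail_loc handling here is an assumption for illustration, not the libcfs implementation:

/* Hedged sketch of the cfs_fail_timeout check-and-sleep pattern. */
#include <stdio.h>
#include <time.h>

static unsigned int fail_loc = 0x136;    /* armed fault id (assumed) */

static int fail_timeout(unsigned int id, unsigned int ms)
{
    struct timespec ts = { ms / 1000, (long)(ms % 1000) * 1000000L };

    if (fail_loc != id)
        return 0;                        /* fault not armed: no-op */
    fprintf(stderr, "cfs_fail_timeout id %x sleeping for %ums\n", id, ms);
    nanosleep(&ts, NULL);                /* the injected delay */
    fprintf(stderr, "cfs_fail_timeout id %x awake\n", id);
    return 1;
}

int main(void)
{
    fail_timeout(0x136, 1500);           /* matches: sleeps, then "awake" */
    fail_timeout(0x160, 1500);           /* no match: returns immediately */
    return 0;
}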
[ 3872.726211] Lustre: DEBUG MARKER: == recovery-small test 110m: update resent vs original RPC race ========================================================== 05:46:55 (1713347215)
[ 3873.294176] LustreError: 8073:0:(libcfs_fail.h:169:cfs_race()) cfs_race id 525 sleeping
[ 3876.953598] Lustre: lustre-MDT0000: Received new MDS connection from 0@lo, keep former export from same NID
[ 3876.958086] LustreError: 26982:0:(libcfs_fail.h:180:cfs_race()) cfs_fail_race id 525 waking
[ 3876.961067] LustreError: 8073:0:(libcfs_fail.h:178:cfs_race()) cfs_fail_race id 525 awake: rc=1335
[ 3876.967011] LustreError: 8073:0:(libcfs_fail.h:180:cfs_race()) cfs_fail_race id 525 waking
[ 3878.528829] Lustre: DEBUG MARKER: == recovery-small test 111: mdd setup fail should not cause umount oops ========================================================== 05:47:01 (1713347221)
[ 3883.720669] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc
[ 3883.757779] LustreError: 166-1: MGC192.168.201.109@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail
[ 3883.763146] LustreError: Skipped 2 previous similar messages
[ 3883.766416] Lustre: Evicted from MGS (at 192.168.201.109@tcp) after server handle changed from 0x53803acb2379b309 to 0x53803acb2379d323
[ 3883.768943] Lustre: Skipped 2 previous similar messages
[ 3883.846861] Lustre: *** cfs_fail_loc=151, val=0***
[ 3883.847868] LustreError: 8187:0:(mdd_device.c:663:mdd_changelog_init()) lustre-MDD0000: changelog setup during init failed: rc = -5
[ 3883.850189] LustreError: 8187:0:(mdd_device.c:1383:mdd_prepare()) lustre-MDD0000: failed to initialize changelog: rc = -5
[ 3883.852535] LustreError: 8187:0:(obd_mount_server.c:2027:server_fill_super()) Unable to start targets: -5
[ 3883.856199] LustreError: 8217:0:(llog_osd.c:264:llog_osd_read_header()) lustre-MDT0001-osp-MDT0000: bad log [0x240000401:0x1:0x0] header magic: 0x0 (expected 0x10645539)
[ 3883.859830] LustreError: 8217:0:(lod_dev.c:424:lod_sub_recovery_thread()) lustre-MDT0001-osp-MDT0000 get update log failed: rc = -5
[ 3883.915708] LustreError: 8187:0:(super25.c:183:lustre_fill_super()) llite: Unable to mount : rc = -5
[ 3885.845537] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc
[ 3885.878308] LustreError: 6845:0:(mgc_request.c:612:do_requeue()) failed processing log: -5
[ 3886.823749] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8
[ 3888.777469] Lustre: DEBUG MARKER: == recovery-small test 112a: bulk resend while original request is in progress ========================================================== 05:47:11 (1713347231)
[ 3888.969462] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect
[ 3888.971372] Lustre: Skipped 10 previous similar messages
[ 3890.961833] Lustre: lustre-MDT0000: Recovery over after 0:02, of 2 clients 2 recovered and 0 were evicted.
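The cfs_race id 525 "sleeping / waking / awake" triple above (and again under test 137 below) is the two-sided form of the same fault framework: the first thread through the race point blocks until its partner releases it, which forces a deterministic interleaving of the resent and original RPC. A simplified pthreads sketch of that handshake; the state variables are illustrative, not the libcfs_fail.h internals:

/* Hedged sketch of the cfs_race sleep/wake handshake. */
#include <pthread.h>
#include <stdio.h>
#include <stdbool.h>

static pthread_mutex_t lock = PTHREAD_MUTEX_INITIALIZER;
static pthread_cond_t  cond = PTHREAD_COND_INITIALIZER;
static bool armed = true;                /* stands in for the armed fail_loc */
static bool sleeper_waiting = false;

static void cfs_race(unsigned int id)
{
    pthread_mutex_lock(&lock);
    if (armed && !sleeper_waiting) {
        sleeper_waiting = true;          /* first arrival: block here */
        fprintf(stderr, "cfs_race id %x sleeping\n", id);
        while (sleeper_waiting)
            pthread_cond_wait(&cond, &lock);
        fprintf(stderr, "cfs_fail_race id %x awake\n", id);
    } else if (armed) {
        sleeper_waiting = false;         /* second arrival: release partner */
        fprintf(stderr, "cfs_fail_race id %x waking\n", id);
        pthread_cond_broadcast(&cond);
    }
    pthread_mutex_unlock(&lock);
}

static void *thread_fn(void *arg)
{
    (void)arg;
    cfs_race(0x525);
    return NULL;
}

int main(void)
{
    pthread_t a, b;
    pthread_create(&a, NULL, thread_fn, NULL);
    pthread_create(&b, NULL, thread_fn, NULL);
    pthread_join(a, NULL);
    pthread_join(b, NULL);
    return 0;
}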
[ 3890.964028] Lustre: Skipped 10 previous similar messages [ 3890.977629] Lustre: lustre-OST0001: deleting orphan objects from 0x0:84835 to 0x0:84865 [ 3890.977653] Lustre: lustre-OST0000: deleting orphan objects from 0x0:87333 to 0x0:87361 [ 3891.022999] LustreError: 5082:0:(fail.c:138:__cfs_fail_timeout_set()) cfs_fail_timeout id 214 sleeping for 20000ms [ 3911.024658] LustreError: 5082:0:(fail.c:149:__cfs_fail_timeout_set()) cfs_fail_timeout id 214 awake [ 3912.758131] Lustre: DEBUG MARKER: == recovery-small test 115a: read: late REQ MDunlink and no bulk ========================================================== 05:47:35 (1713347255) [ 3918.241212] Lustre: DEBUG MARKER: == recovery-small test 115b: write: late REQ MDunlink and no bulk ========================================================== 05:47:40 (1713347260) [ 3922.297350] Lustre: *** cfs_fail_loc=215, val=0*** [ 3922.298583] Lustre: Skipped 1 previous similar message [ 3923.703224] Lustre: DEBUG MARKER: == recovery-small test 115c: read: late Reply MDunlink and no bulk ========================================================== 05:47:46 (1713347266) [ 3926.426119] Lustre: DEBUG MARKER: == recovery-small test 115d: write: late Reply MDunlink and no bulk ========================================================== 05:47:48 (1713347268) [ 3929.187043] Lustre: DEBUG MARKER: == recovery-small test 115e: read: late Bulk MDunlink and no reply ========================================================== 05:47:51 (1713347271) [ 3931.926024] Lustre: DEBUG MARKER: == recovery-small test 115f: read: late REQ MDunlink and no reply ========================================================== 05:47:54 (1713347274) [ 3934.041403] Lustre: *** cfs_fail_loc=211, val=2147483648*** [ 3934.043255] Lustre: Skipped 6 previous similar messages [ 3936.374726] LustreError: 26982:0:(sec.c:2543:sptlrpc_svc_unwrap_bulk()) @@@ truncated bulk GET 0(32768) req@ffff88008274cc00 x1796570769012224/t0(0) o1000->lustre-MDT0000-mdtlov_UUID@0@lo:99/0 lens 336/33016 e 0 to 0 dl 1713347269 ref 1 fl Interpret:/0/0 rc 0/0 job:'lod0000_rec0001.0' [ 3936.379943] Lustre: 26982:0:(service.c:2333:ptlrpc_server_handle_request()) @@@ Request took longer than estimated (43/10s); client may timeout req@ffff88008274cc00 x1796570769012224/t0(0) o1000->lustre-MDT0000-mdtlov_UUID@0@lo:99/0 lens 336/33016 e 0 to 0 dl 1713347269 ref 1 fl Complete:/0/0 rc -110/-110 job:'lod0000_rec0001.0' [ 3937.434120] Lustre: DEBUG MARKER: == recovery-small test 115g: read: late REQ MDunlink and Reply MDunlink ========================================================== 05:47:59 (1713347279) [ 3993.546280] Lustre: DEBUG MARKER: == recovery-small test 120: flock race: completion vs. 
evict ========================================================== 05:48:56 (1713347336) [ 3995.834177] Lustre: 12309:0:(genops.c:1710:obd_export_evict_by_uuid()) lustre-MDT0000: evicting f9cb0360-1950-4579-8bb1-0dea4a1974b7 at adminstrative request [ 4009.845488] Lustre: 12448:0:(genops.c:1710:obd_export_evict_by_uuid()) lustre-MDT0000: evicting f9cb0360-1950-4579-8bb1-0dea4a1974b7 at adminstrative request [ 4009.850366] Lustre: 12448:0:(genops.c:1710:obd_export_evict_by_uuid()) Skipped 1 previous similar message [ 4030.311712] Lustre: 12658:0:(genops.c:1710:obd_export_evict_by_uuid()) lustre-MDT0000: evicting f9cb0360-1950-4579-8bb1-0dea4a1974b7 at adminstrative request [ 4030.314884] Lustre: 12658:0:(genops.c:1710:obd_export_evict_by_uuid()) Skipped 2 previous similar messages [ 4054.377426] Lustre: DEBUG MARKER: == recovery-small test 113: ldlm enqueue dropped reply should not cause deadlocks ========================================================== 05:49:56 (1713347396) [ 4054.687815] LustreError: 6847:0:(ldlm_lib.c:3225:target_send_reply_msg()) @@@ dropping reply req@ffff880085c71c80 x1796570834472384/t0(0) o101->f9cb0360-1950-4579-8bb1-0dea4a1974b7@192.168.201.9@tcp:282/0 lens 592/640 e 0 to 0 dl 1713347452 ref 1 fl Interpret:/0/0 rc 0/0 job:'stat.0' [ 4054.696002] LustreError: 6847:0:(ldlm_lib.c:3225:target_send_reply_msg()) Skipped 5 previous similar messages [ 4110.687418] Lustre: lustre-MDT0000: Client f9cb0360-1950-4579-8bb1-0dea4a1974b7 (at 192.168.201.9@tcp) reconnecting [ 4110.689905] Lustre: Skipped 6 previous similar messages [ 4116.506642] Lustre: DEBUG MARKER: == recovery-small test 130a: enqueue resend on not existing file ========================================================== 05:50:59 (1713347459) [ 4116.955648] LustreError: 6848:0:(fail.c:138:__cfs_fail_timeout_set()) cfs_fail_timeout id 160 sleeping for 10000ms [ 4126.957832] LustreError: 6848:0:(fail.c:149:__cfs_fail_timeout_set()) cfs_fail_timeout id 160 awake [ 4174.537125] Lustre: DEBUG MARKER: == recovery-small test 130b: enqueue resend on a stale inode ========================================================== 05:51:57 (1713347517) [ 4185.038723] LustreError: 18505:0:(fail.c:149:__cfs_fail_timeout_set()) cfs_fail_timeout id 160 awake [ 4230.962793] Lustre: *** cfs_fail_loc=217, val=0*** [ 4233.015933] Lustre: DEBUG MARKER: == recovery-small test 130c: layout intent resend on a stale inode ========================================================== 05:52:55 (1713347575) [ 4235.536780] LustreError: 23847:0:(fail.c:138:__cfs_fail_timeout_set()) cfs_fail_timeout id 160 sleeping for 10000ms [ 4235.541272] LustreError: 23847:0:(fail.c:138:__cfs_fail_timeout_set()) Skipped 1 previous similar message [ 4245.544799] LustreError: 23847:0:(fail.c:149:__cfs_fail_timeout_set()) cfs_fail_timeout id 160 awake [ 4258.258934] Lustre: DEBUG MARKER: == recovery-small test 132: long punch =================== 05:53:20 (1713347600) [ 4330.682841] Lustre: ll_ost_io00_004: service thread pid 5082 was inactive for 72.058 seconds. The thread might be hung, or it might only be slow and will resume later. 
Dumping the stack trace for debugging purposes: [ 4330.691503] Pid: 5082, comm: ll_ost_io00_004 3.10.0-7.9-debug #1 SMP Sat Mar 26 23:28:42 EDT 2022 [ 4330.695540] Call Trace: [ 4330.697155] [<0>] __cfs_fail_timeout_set+0xe1/0x200 [libcfs] [ 4330.700104] [<0>] ofd_punch_hdl+0x11a/0xaf0 [ofd] [ 4330.702575] [<0>] tgt_request_handle+0x93a/0x19c0 [ptlrpc] [ 4330.705357] [<0>] ptlrpc_server_handle_request+0x250/0xc30 [ptlrpc] [ 4330.708562] [<0>] ptlrpc_main+0xbd9/0x15f0 [ptlrpc] [ 4330.711049] [<0>] kthread+0xe4/0xf0 [ 4330.712807] [<0>] ret_from_fork_nospec_begin+0x7/0x21 [ 4330.715335] [<0>] 0xfffffffffffffffe [ 4378.723691] LustreError: 5082:0:(fail.c:149:__cfs_fail_timeout_set()) cfs_fail_timeout id 236 awake [ 4380.471834] Lustre: DEBUG MARKER: == recovery-small test 131: IO vs evict results to IO under staled lock ========================================================== 05:55:22 (1713347722) [ 4382.121578] Lustre: 15532:0:(genops.c:1710:obd_export_evict_by_uuid()) lustre-OST0000: evicting f9cb0360-1950-4579-8bb1-0dea4a1974b7 at adminstrative request [ 4382.124655] Lustre: 15532:0:(genops.c:1710:obd_export_evict_by_uuid()) Skipped 3 previous similar messages [ 4382.126845] LustreError: 8067:0:(fail.c:138:__cfs_fail_timeout_set()) cfs_fail_timeout id 31e sleeping for 4000ms [ 4382.130388] LustreError: 8067:0:(fail.c:138:__cfs_fail_timeout_set()) Skipped 1 previous similar message [ 4385.032774] LustreError: 8067:0:(fail.c:144:__cfs_fail_timeout_set()) cfs_fail_timeout interrupted [ 4386.085927] Lustre: DEBUG MARKER: == recovery-small test 133: don't fail on flock resend === 05:55:28 (1713347728) [ 4445.195917] Lustre: DEBUG MARKER: == recovery-small test 134: race between failover and search for reply data free slot ========================================================== 05:56:27 (1713347787) [ 4445.620665] Lustre: DEBUG MARKER: SKIP: recovery-small test_134 Need 2+ clients, have 1 [ 4446.029542] Lustre: DEBUG MARKER: == recovery-small test 135: DOM: open/create resend to return size ========================================================== 05:56:28 (1713347788) [ 4446.373705] Lustre: *** cfs_fail_loc=157, val=2147483648*** [ 4446.375160] Lustre: Skipped 4 previous similar messages [ 4464.374820] Lustre: 8624:0:(mdt_recovery.c:200:mdt_req_from_lrd()) @@@ restoring transno req@ffff88009cbed0c0 x1796570834516544/t94489280668(0) o101->f9cb0360-1950-4579-8bb1-0dea4a1974b7@192.168.201.9@tcp:654/0 lens 648/3424 e 0 to 0 dl 1713347824 ref 1 fl Interpret:H/2/0 rc 0/0 job:'openfile.0' [ 4464.385497] Lustre: 8624:0:(mdt_recovery.c:200:mdt_req_from_lrd()) Skipped 1 previous similar message [ 4465.819973] Lustre: DEBUG MARKER: SKIP: recovery-small test_136 skipping excluded test 136 [ 4466.203924] Lustre: DEBUG MARKER: == recovery-small test 137: late resend must be skipped if already applied ========================================================== 05:56:48 (1713347808) [ 4467.524826] LustreError: 6846:0:(libcfs_fail.h:169:cfs_race()) cfs_race id 525 sleeping [ 4472.525772] LustreError: 6846:0:(libcfs_fail.h:178:cfs_race()) cfs_fail_race id 525 awake: rc=0 [ 4472.541753] LustreError: 6846:0:(libcfs_fail.h:180:cfs_race()) cfs_fail_race id 525 waking [ 4484.896909] Lustre: DEBUG MARKER: == recovery-small test 138: Umount MDT during recovery === 05:57:07 (1713347827) [ 4485.849109] Lustre: Failing over lustre-MDT0000 [ 4485.850913] Lustre: Skipped 3 previous similar messages [ 4486.907348] LustreError: 11-0: lustre-MDT0000-osp-MDT0001: operation mds_statfs to node 0@lo failed: rc = -107 [ 
4486.907789] Lustre: lustre-MDT0000-lwp-OST0000: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 4486.907791] Lustre: Skipped 12 previous similar messages [ 4486.908478] Lustre: lustre-MDT0000: Not available for connect from 0@lo (stopping) [ 4486.908483] Lustre: Skipped 1 previous similar message [ 4486.918382] LustreError: Skipped 3 previous similar messages [ 4491.915087] Lustre: lustre-MDT0000: Not available for connect from 0@lo (stopping) [ 4491.917800] Lustre: Skipped 5 previous similar messages [ 4495.957705] LustreError: 17233:0:(fail.c:149:__cfs_fail_timeout_set()) cfs_fail_timeout id 724 awake [ 4495.985407] LustreError: 17233:0:(lprocfs_jobstats.c:137:job_stat_exit()) should not have any items [ 4495.988052] LustreError: 17233:0:(lprocfs_jobstats.c:137:job_stat_exit()) Skipped 20 previous similar messages [ 4496.020968] Lustre: server umount lustre-MDT0000 complete [ 4496.022589] Lustre: Skipped 3 previous similar messages [ 4496.923394] LustreError: 137-5: lustre-MDT0000_UUID: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 4496.927098] LustreError: Skipped 115 previous similar messages [ 4503.922718] Lustre: 3330:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713347839/real 1713347839] req@ffff880082e8d580 x1796570769163712/t0(0) o400->MGC192.168.201.109@tcp@0@lo:26/25 lens 224/224 e 0 to 1 dl 1713347846 ref 1 fl Rpc:XNQr/0/ffffffff rc 0/-1 job:'kworker/u8:2.0' [ 4503.929849] Lustre: 3330:0:(client.c:2295:ptlrpc_expire_one_request()) Skipped 4 previous similar messages [ 4503.932630] LustreError: 166-1: MGC192.168.201.109@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 4503.935884] LustreError: Skipped 1 previous similar message [ 4508.237571] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 4509.957195] Lustre: Evicted from MGS (at 192.168.201.109@tcp) after server handle changed from 0x53803acb2379d457 to 0x53803acb2379edb0 [ 4509.962006] Lustre: Skipped 1 previous similar message [ 4509.965041] Lustre: MGC192.168.201.109@tcp: Connection restored to 192.168.201.109@tcp (at 0@lo) [ 4509.967101] Lustre: Skipped 14 previous similar messages [ 4510.032055] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 4510.034162] Lustre: Skipped 3 previous similar messages [ 4510.044778] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 4510.046539] Lustre: Skipped 3 previous similar messages [ 4510.902221] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 4570.067973] Lustre: 17874:0:(mdt_handler.c:7532:mdt_postrecov()) lustre-MDT0000: auto trigger paused LFSCK failed: rc = -6 [ 4570.124280] Lustre: lustre-MDT0000: Not available for connect from 0@lo (stopping) [ 4570.127398] Lustre: Skipped 3 previous similar messages [ 4570.838713] LustreError: 17873:0:(lod_dev.c:424:lod_sub_recovery_thread()) lustre-MDT0001-osp-MDT0000 get update log failed: rc = -5 [ 4575.131763] Lustre: lustre-MDT0000: Not available for connect from 0@lo (stopping) [ 4575.134800] Lustre: Skipped 3 previous similar messages [ 4579.471026] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. 
Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 4580.417634] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 4580.971881] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 1 client reconnects [ 4580.974336] Lustre: lustre-MDT0000: Denying connection for new client 3a48193c-7d94-4148-ae05-6412d6067f0b (at 192.168.201.9@tcp), waiting for 1 known clients (0 recovered, 0 in progress, and 0 evicted) to recover in 0:59 [ 4584.581749] Lustre: lustre-MDT0000: Recovery over after 0:03, of 1 clients 1 recovered and 0 were evicted. [ 4584.609733] Lustre: lustre-OST0001: deleting orphan objects from 0x0:84878 to 0x0:84897 [ 4584.613086] Lustre: lustre-OST0000: deleting orphan objects from 0x0:87376 to 0x0:87393 [ 4588.055247] Lustre: DEBUG MARKER: == recovery-small test 139: corrupted catid won't cause crash ========================================================== 05:58:50 (1713347930) [ 4592.208707] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 4592.324626] Lustre: *** cfs_fail_loc=2106, val=0*** [ 4592.325863] LustreError: 20723:0:(osp_sync.c:1438:osp_sync_llog_init()) lustre-OST0000-osc-MDT0000: the catid [0x0:0x68:0x0] for init llog 0 is bad [ 4593.478725] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 4595.739835] Lustre: DEBUG MARKER: == recovery-small test 140a: local mount is flagged properly ========================================================== 05:58:58 (1713347938) [ 4596.942842] Lustre: lustre-MDT0000: Denying connection for new client 6a6131ff-a490-4b5f-b305-5482d70dd6c6 (at 0@lo), waiting for 2 known clients (1 recovered, 0 in progress, and 0 evicted) to recover in 1:14 [ 4596.943394] Lustre: lustre-MDT0001: local client 6a6131ff-a490-4b5f-b305-5482d70dd6c6 w/o recovery [ 4596.978098] Lustre: lustre-OST0001: deleting orphan objects from 0x0:84878 to 0x0:84929 [ 4596.978166] Lustre: lustre-OST0000: deleting orphan objects from 0x0:87376 to 0x0:87425 [ 4601.964120] Lustre: lustre-MDT0000: local client 6a6131ff-a490-4b5f-b305-5482d70dd6c6 w/o recovery [ 4601.969622] Lustre: Mounted lustre-client [ 4602.578930] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 4603.811701] Lustre: Unmounted lustre-client [ 4604.913094] Lustre: Mounted lustre-client [ 4605.562119] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 4606.650561] Lustre: Unmounted lustre-client [ 4608.599258] Lustre: DEBUG MARKER: == recovery-small test 140b: local mount is excluded from recovery ========================================================== 05:59:11 (1713347951) [ 4609.535233] Lustre: lustre-MDT0000: local client 9fe96651-62d5-4427-ae29-8f900340e4b1 w/o recovery [ 4609.537145] Lustre: Skipped 1 previous similar message [ 4609.540990] Lustre: Mounted lustre-client [ 4610.153802] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 4612.236097] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 4613.192734] Lustre: Unmounted lustre-client [ 4627.879948] LDISKFS-fs (dm-0): recovery complete [ 4627.881383] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. 
Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 4633.623343] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 4637.734486] Lustre: lustre-OST0000: deleting orphan objects from 0x0:87376 to 0x0:87457 [ 4637.734577] Lustre: lustre-OST0001: deleting orphan objects from 0x0:84878 to 0x0:84961 [ 4639.487922] Lustre: DEBUG MARKER: oleg109-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 4640.021709] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 4644.201210] Lustre: DEBUG MARKER: == recovery-small test 141: do not lose locks on MGS restart ========================================================== 05:59:46 (1713347986) [ 4645.061946] Lustre: DEBUG MARKER: SKIP: recovery-small test_141 cannot run in local mode or from build tree [ 4645.699140] Lustre: DEBUG MARKER: == recovery-small test 142: orphan name stub can be cleaned up in startup ========================================================== 05:59:48 (1713347988) [ 4646.077175] Lustre: *** cfs_fail_loc=165, val=0*** [ 4649.827811] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 4651.156572] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 4653.687338] Lustre: DEBUG MARKER: == recovery-small test 143: orphan cleanup thread shouldn't be blocked even delete failed ========================================================== 05:59:56 (1713347996) [ 4654.263996] LustreError: 28758:0:(ldlm_lib.c:2883:target_stop_recovery_thread()) lustre-MDT0000: Aborting recovery [ 4654.269379] Lustre: 27849:0:(ldlm_lib.c:2289:target_recovery_overseer()) recovery is aborted, evict exports in recovery [ 4654.273373] Lustre: 27849:0:(ldlm_lib.c:2289:target_recovery_overseer()) Skipped 2 previous similar messages [ 4654.289025] Lustre: 27849:0:(mdt_handler.c:7532:mdt_postrecov()) lustre-MDT0000: auto trigger paused LFSCK failed: rc = -6 [ 4656.021292] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: (null) [ 4658.228486] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 4659.231345] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 4660.627293] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing _wait_recovery_complete *.lustre-MDT0000.recovery_status 1475 [ 4672.147305] Lustre: lustre-OST0001: deleting orphan objects from 0x0:84963 to 0x0:84993 [ 4672.147445] Lustre: lustre-OST0000: deleting orphan objects from 0x0:87376 to 0x0:87489 [ 4672.150505] LustreError: 30562:0:(osd_handler.c:281:osd_idc_find_or_init()) can't lookup: rc = -2 [ 4677.904807] Lustre: DEBUG MARKER: == recovery-small test 144a: MDT failover should stop precreation threads ========================================================== 06:00:20 (1713348020) [ 4679.767813] LustreError: 29828:0:(osp_precreate.c:677:osp_precreate_send()) lustre-OST0000-osc-MDT0000: can't precreate: rc = -19 [ 4679.771945] LustreError: 29828:0:(osp_precreate.c:1340:osp_precreate_thread()) lustre-OST0000-osc-MDT0000: cannot precreate objects: rc = -19 [ 4679.792890] Lustre: lustre-OST0000: Not available for connect from 192.168.201.9@tcp (stopping) [ 4692.059438] LDISKFS-fs (dm-2): file extents enabled, maximum tree depth=5 [ 4692.062755] LDISKFS-fs (dm-2): mounted filesystem with ordered data mode. 
Opts: errors=remount-ro,no_mbcache,nodelalloc [ 4692.965068] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 4693.912631] Lustre: lustre-OST0000: deleting orphan objects from 0x280000400:12963 to 0x280000400:13057 [ 4696.172221] Lustre: DEBUG MARKER: oleg109-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid [ 4696.599762] Lustre: DEBUG MARKER: osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec [ 4758.976893] LustreError: 6848:0:(osp_precreate.c:1377:osp_precreate_ready_condition()) lustre-OST0000-osc-MDT0000: precreate failed opd_pre_status -108 [ 4758.986984] LustreError: 6848:0:(lod_qos.c:115:lod_statfs_and_check()) lustre-MDT0000-mdtlov: statfs: rc = -108 [ 4758.991454] LustreError: 29830:0:(osp_precreate.c:677:osp_precreate_send()) lustre-OST0001-osc-MDT0000: can't precreate: rc = -5 [ 4759.001185] LustreError: 29830:0:(osp_precreate.c:1340:osp_precreate_thread()) lustre-OST0001-osc-MDT0000: cannot precreate objects: rc = -5 [ 4759.005491] LustreError: 6848:0:(lod_qos.c:1362:lod_ost_alloc_specific()) can't lstripe objid [0x20000d6f1:0xa:0x0]: have 184 want 1000 [ 4759.013663] Lustre: lustre-MDT0000: Not available for connect from 192.168.201.9@tcp (stopping) [ 4772.233237] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 4776.925980] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8 [ 4777.041393] Lustre: 8624:0:(ldlm_lib.c:1971:extend_recovery_timer()) lustre-MDT0000: extended recovery timer reached hard limit: 180, extend: 0 [ 4781.138934] Lustre: lustre-OST0001: deleting orphan objects from 0x0:87478 to 0x0:87649 [ 4781.139431] Lustre: lustre-OST0000: deleting orphan objects from 0x0:90006 to 0x0:90049 [ 4782.779133] Lustre: DEBUG MARKER: oleg109-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 4783.339583] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 4798.914746] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. 
[ 4800.083661] Lustre: 7944:0:(ldlm_lib.c:1971:extend_recovery_timer()) lustre-MDT0000: extended recovery timer reached hard limit: 180, extend: 0
[ 4800.495879] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8
[ 4804.259124] Lustre: lustre-OST0000: deleting orphan objects from 0x0:107011 to 0x0:107041
[ 4804.259266] Lustre: lustre-OST0001: deleting orphan objects from 0x0:104689 to 0x0:104705
[ 4806.097276] Lustre: DEBUG MARKER: oleg109-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid
[ 4806.554375] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec
[ 4809.922072] Lustre: 16732:0:(osd_handler.c:1948:osd_trans_start()) lustre-MDT0000: credits 64189 > trans_max 3200
[ 4809.925548] Lustre: 16732:0:(osd_handler.c:1877:osd_trans_dump_creds()) create: 1000/4000/0, destroy: 1/4/0
[ 4809.928662] Lustre: 16732:0:(osd_handler.c:1884:osd_trans_dump_creds()) attr_set: 3/3/0, xattr_set: 1004/148/0
[ 4809.933585] Lustre: 16732:0:(osd_handler.c:1894:osd_trans_dump_creds()) write: 5001/43010/0, punch: 0/0/0, quota 0/0/0
[ 4809.937504] Lustre: 16732:0:(osd_handler.c:1901:osd_trans_dump_creds()) insert: 1001/17016/0, delete: 2/5/0
[ 4809.941471] Lustre: 16732:0:(osd_handler.c:1908:osd_trans_dump_creds()) ref_add: 1/1/0, ref_del: 2/2/0
[ 4809.945029] Pid: 16732, comm: mdt00_007 3.10.0-7.9-debug #1 SMP Sat Mar 26 23:28:42 EDT 2022
[ 4809.948404] Call Trace:
[ 4809.949500] [<0>] libcfs_call_trace+0x90/0xf0 [libcfs]
[ 4809.951414] [<0>] libcfs_debug_dumpstack+0x26/0x30 [libcfs]
[ 4809.953220] [<0>] osd_trans_start+0x5ba/0x5e0 [osd_ldiskfs]
[ 4809.956456] [<0>] top_trans_start+0x763/0xa10 [ptlrpc]
[ 4809.958046] [<0>] lod_trans_start+0x34/0x40 [lod]
[ 4809.960523] [<0>] mdd_trans_start+0x14/0x20 [mdd]
[ 4809.962150] [<0>] mdd_unlink+0x5e3/0xdb0 [mdd]
[ 4809.966167] [<0>] mdt_reint_unlink+0xe32/0x1df0 [mdt]
[ 4809.968173] [<0>] mdt_reint_rec+0x87/0x240 [mdt]
[ 4809.970019] [<0>] mdt_reint_internal+0x76c/0xb50 [mdt]
[ 4809.971973] [<0>] mdt_reint+0x67/0x150 [mdt]
[ 4809.973919] [<0>] tgt_request_handle+0x93a/0x19c0 [ptlrpc]
[ 4809.975539] [<0>] ptlrpc_server_handle_request+0x250/0xc30 [ptlrpc]
[ 4809.978015] [<0>] ptlrpc_main+0xbd9/0x15f0 [ptlrpc]
[ 4809.979526] [<0>] kthread+0xe4/0xf0
[ 4809.980540] [<0>] ret_from_fork_nospec_begin+0x7/0x21
[ 4809.982681] [<0>] 0xfffffffffffffffe
[ 4809.984044] Lustre: 16732:0:(osd_internal.h:1333:osd_trans_exec_op()) lustre-MDT0000: opcode 7: before 3200 < left 43010, rollback = 7
[ 4810.507544] Lustre: 16732:0:(osd_internal.h:1333:osd_trans_exec_op()) lustre-MDT0000: opcode 7: before 3202 < left 43010, rollback = 7
[ 4810.510727] Lustre: 16732:0:(osd_internal.h:1333:osd_trans_exec_op()) Skipped 8001 previous similar messages
[ 4810.513977] Lustre: 16732:0:(osd_handler.c:1877:osd_trans_dump_creds()) create: 1000/4000/0, destroy: 1/4/0
[ 4810.517998] Lustre: 16732:0:(osd_handler.c:1877:osd_trans_dump_creds()) Skipped 8002 previous similar messages
[ 4810.522585] Lustre: 16732:0:(osd_handler.c:1884:osd_trans_dump_creds()) attr_set: 3/3/0, xattr_set: 1004/148/0
[ 4810.527414] Lustre: 16732:0:(osd_handler.c:1884:osd_trans_dump_creds()) Skipped 8002 previous similar messages
[ 4810.530225] Lustre: 16732:0:(osd_handler.c:1894:osd_trans_dump_creds()) write: 5001/43010/0, punch: 0/0/0, quota 1/3/0
[ 4810.533164] Lustre: 16732:0:(osd_handler.c:1894:osd_trans_dump_creds()) Skipped 8002 previous similar messages
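Note: in the osd_trans_dump_creds() lines above each triple appears to read declared operations / declared credits / credits used. The declared-credit column accounts exactly for the oversized transaction: 4000 (create) + 4 (destroy) + 3 (attr_set) + 148 (xattr_set) + 43010 (write) + 0 (punch) + 0 (quota) + 17016 (insert) + 5 (delete) + 1 (ref_add) + 2 (ref_del) = 64189 credits, the total that osd_trans_start() reported against trans_max 3200.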
[ 4810.536623] Lustre: 16732:0:(osd_handler.c:1901:osd_trans_dump_creds()) insert: 1001/17016/0, delete: 2/5/0
[ 4810.539723] Lustre: 16732:0:(osd_handler.c:1901:osd_trans_dump_creds()) Skipped 8002 previous similar messages
[ 4810.542797] Lustre: 16732:0:(osd_handler.c:1908:osd_trans_dump_creds()) ref_add: 1/1/0, ref_del: 2/2/1
[ 4810.545966] Lustre: 16732:0:(osd_handler.c:1908:osd_trans_dump_creds()) Skipped 8002 previous similar messages
[ 4811.527195] Lustre: 6847:0:(osd_internal.h:1333:osd_trans_exec_op()) lustre-MDT0000: opcode 7: before 3203 < left 43010, rollback = 7
[ 4811.532642] Lustre: 6847:0:(osd_internal.h:1333:osd_trans_exec_op()) Skipped 16003 previous similar messages
[ 4811.536853] Lustre: 6847:0:(osd_handler.c:1877:osd_trans_dump_creds()) create: 1000/4000/0, destroy: 1/4/0
[ 4811.541252] Lustre: 6847:0:(osd_handler.c:1877:osd_trans_dump_creds()) Skipped 16003 previous similar messages
[ 4811.546012] Lustre: 6847:0:(osd_handler.c:1884:osd_trans_dump_creds()) attr_set: 3/3/0, xattr_set: 1004/148/0
[ 4811.550901] Lustre: 6847:0:(osd_handler.c:1884:osd_trans_dump_creds()) Skipped 16003 previous similar messages
[ 4811.554349] Lustre: 6847:0:(osd_handler.c:1894:osd_trans_dump_creds()) write: 5001/43010/0, punch: 0/0/0, quota 1/3/0
[ 4811.558175] Lustre: 6847:0:(osd_handler.c:1894:osd_trans_dump_creds()) Skipped 16003 previous similar messages
[ 4811.561438] Lustre: 6847:0:(osd_handler.c:1901:osd_trans_dump_creds()) insert: 1001/17016/0, delete: 2/5/0
[ 4811.565402] Lustre: 6847:0:(osd_handler.c:1901:osd_trans_dump_creds()) Skipped 16003 previous similar messages
[ 4811.569550] Lustre: 6847:0:(osd_handler.c:1908:osd_trans_dump_creds()) ref_add: 1/1/0, ref_del: 2/2/0
[ 4811.572909] Lustre: 6847:0:(osd_handler.c:1908:osd_trans_dump_creds()) Skipped 16003 previous similar messages
[ 4813.691470] Lustre: 6847:0:(osd_internal.h:1333:osd_trans_exec_op()) lustre-MDT0000: opcode 7: before 3203 < left 43010, rollback = 7
[ 4813.693598] Lustre: 6847:0:(osd_internal.h:1333:osd_trans_exec_op()) Skipped 36008 previous similar messages
[ 4813.695628] Lustre: 6847:0:(osd_handler.c:1877:osd_trans_dump_creds()) create: 1000/4000/0, destroy: 1/4/0
[ 4813.698635] Lustre: 6847:0:(osd_handler.c:1877:osd_trans_dump_creds()) Skipped 36008 previous similar messages
[ 4813.702290] Lustre: 6847:0:(osd_handler.c:1884:osd_trans_dump_creds()) attr_set: 3/3/0, xattr_set: 1004/148/0
[ 4813.705751] Lustre: 6847:0:(osd_handler.c:1884:osd_trans_dump_creds()) Skipped 36008 previous similar messages
[ 4813.709014] Lustre: 6847:0:(osd_handler.c:1894:osd_trans_dump_creds()) write: 5001/43010/0, punch: 0/0/0, quota 1/3/0
[ 4813.711870] Lustre: 6847:0:(osd_handler.c:1894:osd_trans_dump_creds()) Skipped 36008 previous similar messages
[ 4813.714510] Lustre: 6847:0:(osd_handler.c:1901:osd_trans_dump_creds()) insert: 1001/17016/0, delete: 2/5/0
[ 4813.717167] Lustre: 6847:0:(osd_handler.c:1901:osd_trans_dump_creds()) Skipped 36008 previous similar messages
[ 4813.719865] Lustre: 6847:0:(osd_handler.c:1908:osd_trans_dump_creds()) ref_add: 1/1/0, ref_del: 2/2/0
[ 4813.722429] Lustre: 6847:0:(osd_handler.c:1908:osd_trans_dump_creds()) Skipped 36008 previous similar messages
[ 4817.703912] Lustre: 6847:0:(osd_internal.h:1333:osd_trans_exec_op()) lustre-MDT0000: opcode 7: before 3202 < left 43010, rollback = 7
[ 4817.708209] Lustre: 6847:0:(osd_internal.h:1333:osd_trans_exec_op()) Skipped 56013 previous similar messages
[ 4817.711088] Lustre: 6847:0:(osd_handler.c:1877:osd_trans_dump_creds()) create: 1000/4000/0, destroy: 1/4/0
[ 4817.716734] Lustre: 6847:0:(osd_handler.c:1877:osd_trans_dump_creds()) Skipped 56013 previous similar messages
[ 4817.720669] Lustre: 6847:0:(osd_handler.c:1884:osd_trans_dump_creds()) attr_set: 3/3/0, xattr_set: 1004/148/0
[ 4817.725366] Lustre: 6847:0:(osd_handler.c:1884:osd_trans_dump_creds()) Skipped 56013 previous similar messages
[ 4817.728653] Lustre: 6847:0:(osd_handler.c:1894:osd_trans_dump_creds()) write: 5001/43010/0, punch: 0/0/0, quota 1/3/0
[ 4817.732544] Lustre: 6847:0:(osd_handler.c:1894:osd_trans_dump_creds()) Skipped 56013 previous similar messages
[ 4817.737076] Lustre: 6847:0:(osd_handler.c:1901:osd_trans_dump_creds()) insert: 1001/17016/0, delete: 2/5/0
[ 4817.739770] Lustre: 6847:0:(osd_handler.c:1901:osd_trans_dump_creds()) Skipped 56013 previous similar messages
[ 4817.742823] Lustre: 6847:0:(osd_handler.c:1908:osd_trans_dump_creds()) ref_add: 1/1/0, ref_del: 2/2/1
[ 4817.746143] Lustre: 6847:0:(osd_handler.c:1908:osd_trans_dump_creds()) Skipped 56013 previous similar messages
[ 4822.965313] Lustre: DEBUG MARKER: == recovery-small test 145: connect mdtlovs and process update logs after recovery expire ========================================================== 06:02:45 (1713348165)
[ 4823.316475] Lustre: DEBUG MARKER: SKIP: recovery-small test_145 needs >= 3 MDTs
[ 4823.733986] Lustre: DEBUG MARKER: == recovery-small test 147: Check client reconnect ======= 06:02:46 (1713348166)
[ 4824.261387] Lustre: 3956:0:(genops.c:1710:obd_export_evict_by_uuid()) lustre-OST0000: evicting 3a48193c-7d94-4148-ae05-6412d6067f0b at administrative request
[ 4824.340411] Lustre: *** cfs_fail_loc=225, val=0***
[ 4835.003156] hrtimer: interrupt took 5630448 ns
[ 4835.341001] Lustre: lustre-OST0000: Client 3a48193c-7d94-4148-ae05-6412d6067f0b (at 192.168.201.9@tcp) reconnecting
[ 4835.344701] Lustre: Skipped 5 previous similar messages
[ 4872.349187] Lustre: *** cfs_fail_loc=225, val=0***
[ 4872.351709] Lustre: Skipped 2 previous similar messages
[ 4965.355367] Lustre: *** cfs_fail_loc=225, val=0***
[ 4965.357141] Lustre: Skipped 2 previous similar messages
[ 4999.404560] Lustre: lustre-OST0000: haven't heard from client 3a48193c-7d94-4148-ae05-6412d6067f0b (at 192.168.201.9@tcp) in 175 seconds. I think it's dead, and I am evicting it. exp ffff880082642000, cur 1713348342 expire 1713348312 last 1713348167
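Note: the eviction at 4824.261387 is requested administratively (obd_export_evict_by_uuid()), which test 147 drives before arming fail_loc=225 to block the client's reconnect attempts. A minimal sketch of how an administrator can trigger the same eviction, assuming the evict_client parameter commonly exposed by server targets (the exact parameter path here is an assumption):

    # Evict a client by UUID on the OST; MDTs expose an analogous
    # mdt.<target>.evict_client parameter.
    lctl set_param obdfilter.lustre-OST0000.evict_client=3a48193c-7d94-4148-ae05-6412d6067f0b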
[ 5006.671625] Lustre: DEBUG MARKER: == recovery-small test 148: data corruption through resend ========================================================== 06:05:49 (1713348349)
[ 5007.953706] ofd: 'obdfilter.*.writethrough_cache_enable' is deprecated, use 'osd-*.*.writethrough_cache_enable' instead
[ 5008.575201] LustreError: 739:0:(fail.c:138:__cfs_fail_timeout_set()) cfs_fail_timeout id 227 sleeping for 22000ms
[ 5008.579169] LustreError: 739:0:(fail.c:138:__cfs_fail_timeout_set()) Skipped 14 previous similar messages
[ 5030.585740] LustreError: 739:0:(fail.c:149:__cfs_fail_timeout_set()) cfs_fail_timeout id 227 awake
[ 5030.589299] LustreError: 739:0:(fail.c:149:__cfs_fail_timeout_set()) Skipped 13 previous similar messages
[ 5035.127433] Lustre: DEBUG MARKER: == recovery-small test 149: skip orphan removal at umount ========================================================== 06:06:17 (1713348377)
[ 5038.219804] Lustre: lustre-MDT0001: Not available for connect from 192.168.201.9@tcp (stopping)
[ 5038.222608] Lustre: Skipped 1 previous similar message
[ 5049.329997] mdt00_005 (18505) used greatest stack depth: 9944 bytes left
[ 5051.884772] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc
[ 5052.066383] Lustre: lustre-OST0001: deleting orphan objects from 0x0:110222 to 0x0:110241
[ 5052.066385] Lustre: lustre-OST0000: deleting orphan objects from 0x0:112529 to 0x0:112545
[ 5052.992439] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8
[ 5055.532434] LDISKFS-fs (dm-1): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc
[ 5055.683187] Lustre: lustre-OST0001: deleting orphan objects from 0x2c0000400:12649 to 0x2c0000400:12737
[ 5055.683218] Lustre: lustre-OST0000: deleting orphan objects from 0x280000400:12963 to 0x280000400:13089
[ 5056.641301] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug -1 all 8
[ 5066.437283] Lustre: DEBUG MARKER: == recovery-small test 152: QoS object allocation could be awakened in case of OST failover ========================================================== 06:06:48 (1713348408)
[ 5067.557552] Lustre: DEBUG MARKER: SKIP: recovery-small test_152 MDS Linux kernel does not support killable semaphore
[ 5068.136786] Lustre: DEBUG MARKER: == recovery-small test complete, duration 4963 sec ======= 06:06:50 (1713348410)
[ 5074.446022] LustreError: 17076:0:(ldlm_lockd.c:2521:ldlm_cancel_handler()) ldlm_cancel from 0@lo arrived at 1713348417 with bad export cookie 6016873746315313339
[ 5074.450674] LustreError: 17076:0:(ldlm_lockd.c:2521:ldlm_cancel_handler()) Skipped 5 previous similar messages
[ 5082.698856] Lustre: 3328:0:(client.c:2295:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1713348418/real 1713348418] req@ffff880085c73dc0 x1796570772632064/t0(0) o400->MGC192.168.201.109@tcp@0@lo:26/25 lens 224/224 e 0 to 1 dl 1713348425 ref 1 fl Rpc:XNQr/0/ffffffff rc 0/-1 job:'kworker/u8:1.0'
[ 5082.711823] Lustre: 3328:0:(client.c:2295:ptlrpc_expire_one_request()) Skipped 3 previous similar messages
[ 5086.085900] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc
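Note: the ofd deprecation warning at 5007.953706 points at a renamed tunable; the osd-* form replaces the old obdfilter one. A minimal sketch, with the value chosen only for illustration:

    # Deprecated spelling:
    #   lctl set_param obdfilter.lustre-OST0000.writethrough_cache_enable=1
    # Spelling suggested by the warning:
    lctl set_param osd-ldiskfs.lustre-OST0000.writethrough_cache_enable=1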
[ 5089.891722] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all 8
[ 5093.848081] Lustre: lustre-OST0001: deleting orphan objects from 0x0:110222 to 0x0:110273
[ 5093.848470] Lustre: lustre-OST0000: deleting orphan objects from 0x0:112529 to 0x0:112577
[ 5095.636489] Lustre: DEBUG MARKER: oleg109-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid
[ 5096.170573] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec
[ 5098.827272] Lustre: lustre-MDT0000-osp-MDT0001: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete
[ 5098.833306] Lustre: Skipped 39 previous similar messages
[ 5100.412331] LustreError: 10444:0:(lprocfs_jobstats.c:137:job_stat_exit()) should not have any items
[ 5100.416015] LustreError: 10444:0:(lprocfs_jobstats.c:137:job_stat_exit()) Skipped 61 previous similar messages
[ 5100.452152] Lustre: server umount lustre-MDT0000 complete
[ 5100.454357] Lustre: Skipped 11 previous similar messages
[ 5103.052268] LustreError: 6835:0:(ldlm_lockd.c:2521:ldlm_cancel_handler()) ldlm_cancel from 0@lo arrived at 1713348445 with bad export cookie 6016873746315320955
[ 5103.057336] LustreError: 6835:0:(ldlm_lockd.c:2521:ldlm_cancel_handler()) Skipped 4 previous similar messages
[ 5116.252792] device-mapper: core: cleaned up
[ 5119.060698] Lustre: DEBUG MARKER: oleg109-server.virtnet: executing unload_modules_local
[ 5119.634317] Key type lgssc unregistered
[ 5119.699000] LNet: 13168:0:(lib-ptl.c:958:lnet_clear_lazy_portal()) Active lazy portal 0 on exit
[ 5119.702103] LNet: Removed LNI 192.168.201.109@tcp