[Ocfs2-users] Another node is heartbeating in ourslot! errorswith LUN removal/addition

Sunil Mushran sunil.mushran at oracle.com
Mon Dec 22 18:24:13 PST 2008


Please file a bug. http://oss.oracle.com/bugzilla

This needs to be looked into.

Brian Kroth wrote:
> Thanks for the info.  I've just now finally gotten around to trying
> this.  Here's my setup:
>
> A three node cluster and a separate backup node.  The SAN is configured
> to give the cluster access to just the live volume and the backup node
> access to just the snapshots.  The backup node is also given a different
> OCFS2 cluster.conf file so that it doesn't try to communicate with the
> other nodes.  The kernel is based off of Debian's 2.6.26 kernel.  FS
> version is listed as 1.5.0 and the tools are 1.4.1.
>
>
> 1) Snapshot the volume on the SAN (EqualLogic).
> 2) Replay the journal
> # fsck.ocfs2 -y $snapshot_dev
> 3) Clear the dirty bits
> mount $snapshot_dev /snapshots/$snapshot_datetime 
> sleep 300 # let the fs sit for a bit
> umount $snapshot_dev
> 4) Change the fs uuid, label, and type
> yes | tunefs.ocfs2 -U -M local -L $snapshot_datetime $snapshot_dev
> 5) On the SAN change the snapshot to readonly.
>
> Now, if I try to mount that snapshot with "-o ro", the volume mounts,
> but results in strange behavior and an angry kernel.  Some example
> output is below.  
>
> If I leave off step 5 and mount it ro, everything seems to be ok, but
> I'd rather have the SAN enforce readonly access.  Is there anything to
> be done about this?
>
> Thanks,
> Brian
>
> # ls -la /snapshots/$snapshot_datetime/
> ls: cannot access lost+found: Read-only file system
> ls: cannot access mail: Read-only file system
> ls: cannot access .shareddiskfs-ha_monitor-only.mercury: Read-only file
> system
> ls: cannot access .shareddiskfs-ha_monitor-only.iris: Read-only file
> system
> ls: cannot access .shareddiskfs-ha_monitor-only.hermes: Read-only file
> system
> ls: cannot access tmp: Read-only file system
> total 8
> drwxr-xr-x 5 root root 4096 2008-12-11 10:29 ./
> drwxr-xr-x 3 root root 4096 2008-12-21 14:07 ../
> d????????? ? ?    ?       ?                ? lost+found/
> d????????? ? ?    ?       ?                ? mail/
> -????????? ? ?    ?       ?                ?
> .shareddiskfs-ha_monitor-only.hermes
> -????????? ? ?    ?       ?                ?
> .shareddiskfs-ha_monitor-only.iris
> -????????? ? ?    ?       ?                ?
> .shareddiskfs-ha_monitor-only.mercury
> d????????? ? ?    ?       ?                ? tmp/
>
> Some output from dmesg:
>
> [  747.120562] sd 1:0:0:0: [sdb] Attached SCSI disk
> [  796.331467] OCFS2 1.5.0
> [  796.381325] Readonly device detected. No cluster services will be
> utilized for this mount. Recovery will be skipped.
> [  796.381325] ocfs2: Mounting device (8,17) on (node local, slot -1)
> with ordered data mode.
> [  814.890455] (2720,0):ocfs2_dentry_attach_lock:290 ERROR: status = -30
> [  814.890455] (2720,0):ocfs2_lookup:168 ERROR: status = -30
> [  814.940692] (2720,0):ocfs2_dentry_attach_lock:290 ERROR: status = -30
> [  814.940745] (2720,0):ocfs2_lookup:168 ERROR: status = -30
> [  814.951784] (2720,0):ocfs2_dentry_attach_lock:290 ERROR: status = -30
> [  814.951784] (2720,0):ocfs2_lookup:168 ERROR: status = -30
> [  814.982553] (2720,0):ocfs2_dentry_attach_lock:290 ERROR: status = -30
> [  814.982610] (2720,0):ocfs2_lookup:168 ERROR: status = -30
> [  814.997255] (2720,0):ocfs2_dentry_attach_lock:290 ERROR: status = -30
> [  814.997308] (2720,0):ocfs2_lookup:168 ERROR: status = -30
> [  815.014511] (2720,0):ocfs2_dentry_attach_lock:290 ERROR: status = -30
> [  815.014575] (2720,0):ocfs2_lookup:168 ERROR: status = -30
> [  818.490805] BUG: unable to handle kernel NULL pointer dereference at
> 00000018
> [  818.490805] IP: [<e0c98a04>] :ocfs2:ocfs2_statfs+0x1a5/0x2d3
> [  818.490805] *pde = 00000000 
> [  818.490805] Oops: 0000 [#1] SMP 
> [  818.490805] Modules linked in: ocfs2 crc32c libcrc32c ocfs2_dlmfs
> ocfs2_stack_o2cb ocfs2_dlm ocfs2_nodemanager ocfs2_stackglue configfs
> ib_iser rdma_cm ib_cm iw_cm ib_sa ib_mad ib_core ib_addr iscsi_tcp
> libiscsi scsi_transport_iscsi ipv6 dm_snapshot dm_mirror dm_log dm_mod
> usbhid hid ff_memless uhci_hcd ohci_hcd ehci_hcd usbcore snd_pcm
> snd_timer snd soundcore snd_page_alloc serio_raw parport_pc i2c_piix4
> intel_agp shpchp pci_hotplug button ac container parport pcspkr psmouse
> i2c_core agpgart evdev ext3 jbd mbcache sd_mod ide_cd_mod cdrom
> ata_generic libata dock floppy pcnet32 mii mptspi mptscsih mptbase
> scsi_transport_spi scsi_mod piix ide_pci_generic ide_core thermal
> processor fan thermal_sys
> [  818.490805] 
> [  818.490805] Pid: 2408, comm: collectd Not tainted
> (2.6.26.vmwareguest-smp.081125 #1)
> [  818.490805] EIP: 0060:[<e0c98a04>] EFLAGS: 00010246 CPU: 0
> [  818.490805] EIP is at ocfs2_statfs+0x1a5/0x2d3 [ocfs2]
> [  818.490805] EAX: 00000000 EBX: df2d14b0 ECX: 00000000 EDX: 00004848
> [  818.490805] ESI: 00000000 EDI: 00000000 EBP: dee1e000 ESP: dee1fe68
> [  818.490805]  DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068
> [  818.490805] Process collectd (pid: 2408, ti=dee1e000 task=df570040
> task.ti=dee1e000)
> [  818.490805] Stack: 00000001 b7f6f000 00000000 dee1fea8 dea41000
> df2d5790 00000000 dee1fea8 
> [  818.490805]        df2d14b0 dee1fefc dee1e000 c016f36c 00000000
> dee1fea8 dee1ff08 c016f394 
> [  818.490805]        00000000 00000000 00000000 00000000 00000000
> 00000000 00000000 00000000 
> [  818.490805] Call Trace:
> [  818.490805]  [<c016f36c>] vfs_statfs+0x43/0x5b
> [  818.490805]  [<c016f394>] vfs_statfs64+0x10/0x21
> [  818.490805]  [<c016f449>] sys_statfs64+0x44/0x7b
> [  818.490805]  [<c0161ac8>] vma_adjust+0x310/0x397
> [  818.490805]  [<c01604dc>] free_pgtables+0x84/0x93
> [  818.490805]  [<c010edf8>] flush_tlb_mm+0x39/0x5d
> [  818.490805]  [<c0161da3>] do_munmap+0x181/0x19b
> [  818.490805]  [<c0161df6>] sys_munmap+0x39/0x3e
> [  818.490805]  [<c010313f>] sysenter_past_esp+0x78/0xb1
> [  818.490805]  =======================
> [  818.490805] Code: 68 40 04 00 00 68 b1 e2 c9 e0 50 ff b3 0c 01 00 00
> 68 2b 67 ca e0 e8 9a 88 48 df 83 c4 1c e9 21 01 00 00 8b 44 24 18 89 44
> 24 08 <8b> 40 18 8b 88 bc 00 00 00 89 cf 2b b8 b8 00 00 00 8b 44 24 0c 
> [  818.490805] EIP: [<e0c98a04>] ocfs2_statfs+0x1a5/0x2d3 [ocfs2] SS:ESP
> 0068:dee1fe68
> [  818.490805] ---[ end trace ae323790ea69e92a ]---
>   



More information about the Ocfs2-users mailing list