[Ocfs2-users] one node rejects connection from new node

Sunil Mushran sunil.mushran at oracle.com
Sat Jan 31 07:59:26 PST 2009


Nodes can be added to an online cluster. The instructions are listed  
in the user's guide.

On Jan 31, 2009, at 7:53 AM, Carl Benson <cbenson at fhcrc.org> wrote:

> Sunil,
>
> Thank you for responding. I will try o2cb_ctl on Monday, when I have
> physical access to hit Reset in case one or more nodes lock up.
>
> If there really is a requirement to restart the cluster on wilson1  
> every time
> I add a new node (and I have five or six more nodes to add), that is  
> too
> bad. Wilson1 is a 24x7 production system.
>
> --Carl Benson
>
> Sunil Mushran wrote:
>> Could be that the cluster was already online on wilson1 when you
>> propagated the cluster.conf to all nodes. If so, restart the cluster
>> on that node.
>>
>> To add a node to an online cluster, you need to use the o2cb_ctl
>> command. Details are in the 1.4 user's guide.
>>
>>
>> Carl J. Benson wrote:
>>
>>> Hello.
>>>
>>> I have three systems that share an ocfs2 filesystem, and I'm
>>> trying to add a fourth system.
>>>
>>> These are all openSUSE 11.1, x86_64, kernel 2.6.27.7-9-default.
>>> All have RPMs ocfs2-tools-1.4.1-6.9 and ocfs2console-1.4.1-6.9
>>>
>>> cluster.conf looks like this:
>>> node:
>>>        ip_port = 7777
>>>        ip_address = 140.107.170.116
>>>        number = 0
>>>        name = merlot1
>>>        cluster = ocfs2
>>>
>>> node:
>>>        ip_port = 7777
>>>        ip_address = 140.107.158.54
>>>        number = 1
>>>        name = merlot2
>>>        cluster = ocfs2
>>>
>>> node:
>>>        ip_port = 7777
>>>        ip_address = 140.107.158.82
>>>        number = 2
>>>        name = wilson1
>>>        cluster = ocfs2
>>>
>>> node:
>>>        ip_port = 7778
>>>        ip_address = 140.107.170.108
>>>        number = 3
>>>        name = gladstone
>>>        cluster = ocfs2
>>>
>>> cluster:
>>>        node_count = 4
>>>        name = ocfs2
>>>
>>> gladstone is the new node.
>>>
>>> I edited the cluster.conf on wilson1 using ocfs2console, and
>>> propagated it to the other systems from there.
>>>
>>> When I try to bring my ocfs2 online with /etc/init.d/o2cb online  
>>> ocfs2,
>>> merlot1 accepts the connection from gladstone, as does merlot2.
>>> However, wilson1 rejects it as an unknown node! For example:
>>>
>>> Jan 30 14:11:46 wilson1 kernel: (4447,3):o2net_accept_one:1795  
>>> attempt
>>> to connect from unknown node at 140.107.170.108:37795
>>>
>>> Why would this happen?
>>>
>>>
>>
>>
>> _______________________________________________
>> Ocfs2-users mailing list
>> Ocfs2-users at oss.oracle.com
>> http://oss.oracle.com/mailman/listinfo/ocfs2-users
>>
>



More information about the Ocfs2-users mailing list