[Ocfs2-devel] [PATCH] ocfs2: Optimize the reading of heartbeat data

Changwei Ge ge.changwei at h3c.com
Tue Oct 23 21:56:00 PDT 2018


Hi Jia,

On 2018/10/24 10:49, Jia Guo wrote:
> Hi Changwei:
> 
> On 2018/10/24 9:21, Changwei Ge wrote:
>> Hi Jia,
>>
>> Your patch was not made properly and can't be applied to my work tree. :-(
>> Please check it.
>>
> I'm very sorry that I chose a wrong code path when executing 'git init'. :-(
> PATCH V2 will fix it.
>> Moreover, I *disagree* with your way trying to boost HB performance.
>> Actually, ocfs2 o2cb/heartbeat mechanism reads and writes region according slot number rather
>> than cluster node number.(Perhaps I missed something or your patch relies on your other patches)
>> And for slot assignment algorithm applied in ocfs2/mounting, the lowest slot number is always the first choice.
>> So your patch should not work...
>>
> I'm afraid you misunderstood the concept of slot number and heartbeat region. The heartbeat region corresponds to
> the cluster node number while the slot number assignment is what you describe.

Oops, you are right. I made mistake here. :-)

Thanks,
Changwei

> Thanks,
> Jia
>> Thanks,
>> Changwei
>>
>> On 2018/10/23 20:33, Jia Guo wrote:
>>> Reading heartbeat data from lowest node rather than from zero, in
>>> cases where the node is not defined from zero, can reduce the number
>>> of sectors read.
>>>
>>> Here is a simple test data obtained with 'iostat -dmx dm-5 2', with
>>> two nodes in the cluster, node number 10, 20, respectively.
>>> Before optimization:
>>> Device:         rrqm/s   wrqm/s     r/s     w/s    rMB/s    wMB/s avgrq-sz avgqu-sz   await r_await w_await  svctm  %util
>>> dm-5              0.00     0.00    0.50    0.50     0.01     0.00    11.00     0.00    1.00    1.00    1.00   1.50   0.15
>>> After the optimization:
>>> Device:         rrqm/s   wrqm/s     r/s     w/s    rMB/s    wMB/s avgrq-sz avgqu-sz   await r_await w_await  svctm  %util
>>> dm-5              0.00     0.00    0.50    0.50     0.00     0.00     6.00     0.00    0.50    1.00    0.00   0.50   0.05
>>>
>>> Signed-off-by: Jia Guo <guojia12 at huawei.com>
>>> Reviewed-by: Jun Piao <piaojun at huawei.com>
>>> Reviewed-by: Yiwen Jiang <jiangyiwen at huawei.com>
>>> ---
>>>    cluster/heartbeat.c | 23 +++++++++++++++--------
>>>    1 file changed, 15 insertions(+), 8 deletions(-)
>>>
>>> diff --git a/cluster/heartbeat.c b/cluster/heartbeat.c
>>> index 9b2ed62..a9cf984 100644
>>> --- a/cluster/heartbeat.c
>>> +++ b/cluster/heartbeat.c
>>> @@ -582,17 +582,18 @@ bail:
>>>    }
>>>
>>>    static int o2hb_read_slots(struct o2hb_region *reg,
>>> -			   unsigned int max_slots)
>>> +			   unsigned int min_slot,
>>> +			   unsigned int max_slot)
>>>    {
>>> -	unsigned int current_slot=0;
>>> +	unsigned int current_slot = min_slot;
>>>    	int status;
>>>    	struct o2hb_bio_wait_ctxt wc;
>>>    	struct bio *bio;
>>>
>>>    	o2hb_bio_wait_init(&wc);
>>>
>>> -	while(current_slot < max_slots) {
>>> -		bio = o2hb_setup_one_bio(reg, &wc, &current_slot, max_slots,
>>> +	while(current_slot < max_slot) {
>>> +		bio = o2hb_setup_one_bio(reg, &wc, &current_slot, max_slot,
>>>    					 REQ_OP_READ, 0);
>>>    		if (IS_ERR(bio)) {
>>>    			status = PTR_ERR(bio);
>>> @@ -1093,9 +1094,14 @@ static int o2hb_highest_node(unsigned long *nodes, int numbits)
>>>    	return find_last_bit(nodes, numbits);
>>>    }
>>>
>>> +static int o2hb_lowest_node(unsigned long *nodes, int numbits)
>>> +{
>>> +	return find_first_bit(nodes, numbits);
>>> +}
>>> +
>>>    static int o2hb_do_disk_heartbeat(struct o2hb_region *reg)
>>>    {
>>> -	int i, ret, highest_node;
>>> +	int i, ret, highest_node, lowest_node;
>>>    	int membership_change = 0, own_slot_ok = 0;
>>>    	unsigned long configured_nodes[BITS_TO_LONGS(O2NM_MAX_NODES)];
>>>    	unsigned long live_node_bitmap[BITS_TO_LONGS(O2NM_MAX_NODES)];
>>> @@ -1120,7 +1126,8 @@ static int o2hb_do_disk_heartbeat(struct o2hb_region *reg)
>>>    	}
>>>
>>>    	highest_node = o2hb_highest_node(configured_nodes, O2NM_MAX_NODES);
>>> -	if (highest_node >= O2NM_MAX_NODES) {
>>> +	lowest_node = o2hb_lowest_node(configured_nodes, O2NM_MAX_NODES);
>>> +	if (highest_node >= O2NM_MAX_NODES || lowest_node >= O2NM_MAX_NODES) {
>>>    		mlog(ML_NOTICE, "o2hb: No configured nodes found!\n");
>>>    		ret = -EINVAL;
>>>    		goto bail;
>>> @@ -1130,7 +1137,7 @@ static int o2hb_do_disk_heartbeat(struct o2hb_region *reg)
>>>    	 * yet. Of course, if the node definitions have holes in them
>>>    	 * then we're reading an empty slot anyway... Consider this
>>>    	 * best-effort. */
>>> -	ret = o2hb_read_slots(reg, highest_node + 1);
>>> +	ret = o2hb_read_slots(reg, lowest_node, highest_node + 1);
>>>    	if (ret < 0) {
>>>    		mlog_errno(ret);
>>>    		goto bail;
>>> @@ -1801,7 +1808,7 @@ static int o2hb_populate_slot_data(struct o2hb_region *reg)
>>>    	struct o2hb_disk_slot *slot;
>>>    	struct o2hb_disk_heartbeat_block *hb_block;
>>>
>>> -	ret = o2hb_read_slots(reg, reg->hr_blocks);
>>> +	ret = o2hb_read_slots(reg, 0, reg->hr_blocks);
>>>    	if (ret)
>>>    		goto out;
>>>
>> .
>>
> 



More information about the Ocfs2-devel mailing list