[Ocfs2-devel] [patch 4/5] ocfs2/dlm: solve a BUG when deref failed in dlm_drop_lockres_ref
Mark Fasheh
mfasheh at suse.de
Fri Jul 29 10:47:13 PDT 2016
On Fri, Jul 29, 2016 at 11:08:50AM +0800, piaojun wrote:
> Hello Mark,
>
> On 2016-7-29 6:12, Mark Fasheh wrote:
> > On Thu, Jul 28, 2016 at 02:06:05PM -0700, Andrew Morton wrote:
> >> From: piaojun <piaojun at huawei.com>
> >> Subject: ocfs2/dlm: solve a BUG when deref failed in dlm_drop_lockres_ref
> >>
> >> We found a BUG situation that lockres is migrated during deref described
> >> below. To solve the BUG, we could purge lockres directly when other node
> >> says I did not have a ref. Additionally, we'd better purge lockres if
> >> master goes down, as no one will response deref done.
> >>
> >> Node 1 Node 2(old master) Node3(new master)
> >> dlm_purge_lockres
> >> send deref to N2
> >>
> >> leave domain
> >> migrate lockres to N3
> >> finish migration
> >> send do assert
> >> master to N1
> >>
> >> receive do assert msg
> >> form N3, but can not
> >> find lockres because
> >> DROPPING_REF is set,
> >> so the owner is still
> >> N2.
> >>
> >> receive deref from N1
> >> and response -EINVAL
> >> because lockres is migrated
> >>
> >> BUG when receive -EINVAL
> >> in dlm_drop_lockres_ref
> >>
> >> Fixes: 842b90b62461d ("ocfs2/dlm: return in progress if master can not clear the refmap bit right now")
> >>
> >> Link: http://lkml.kernel.org/r/57845103.3070406@huawei.com
> >> Signed-off-by: Jun Piao <piaojun at huawei.com>
> >> Reviewed-by: Joseph Qi <joseph.qi at huawei.com>
> >> Reviewed-by: Jiufei Xue <xuejiufei at huawei.com>
> >
> > Reviewed-by: Mark Fasheh <mfasheh at suse.de>
> >
> > The only thing is I wonder if those ML_NOTICE messages in this patch and
> > the previous one will cause unnecessary end-user concern.
> >
> > The fixes though look good, thanks for those.
> > --Mark
> >
> >
> Those ML_NOTICE log just server as reminders for developer, I think
> end-user usually care about ML_NOTICE log.
Ok, I had different experiences but it's not a big deal one way or the
other. If it helps you guys track what's going on then it's probably worth
it :)
--Mark
--
Mark Fasheh
More information about the Ocfs2-devel
mailing list