[Ocfs2-devel] [RFC] Doubt about dlm_worker

Sunil Mushran sunil.mushran at gmail.com
Thu Sep 10 12:18:35 PDT 2015


Sure. It will need to be tested appropriately.

On Thu, Sep 10, 2015 at 4:49 AM, Joseph Qi <joseph.qi at huawei.com> wrote:

> Hi Junxiao & Sunil,
> Your comments would be appreciated.
>
> Thanks,
> Joseph
>
> On 2015/9/6 21:11, Joseph Qi wrote:
> > Comments for dlm_dispatch_work is described below:
> > /* Worker function used during recovery. */
> >
> > But actually dlm_worker is used by 4 types of dlm message workers:
> >       dlm_assert_master_worker
> >       dlm_deref_lockres_worker
> >       dlm_request_all_locks_worker
> >       dlm_mig_lockres_worker
> >
> > And the first 2 are not dlm recovery related. Moreover, it will send
> > DLM_ASSERT_MASTER_MSG to all other nodes in dlm_assert_master_worker.
> > And it may do a lot of assert master during recovery. In our scenario,
> > it is tens of thousands.
> > This will delay the recovery because dlm_worker is a single thread
> > workqueue and cluster is hanging during dlm recovery.
> > So I doubt if we can move the assert master to a new workqueue or just
> > use a system workqueue.
> > Any suggestions?
> >
> >
> > _______________________________________________
> > Ocfs2-devel mailing list
> > Ocfs2-devel at oss.oracle.com
> > https://oss.oracle.com/mailman/listinfo/ocfs2-devel
> >
> >
>
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://oss.oracle.com/pipermail/ocfs2-devel/attachments/20150910/b4bff62e/attachment.html 


More information about the Ocfs2-devel mailing list