[Ocfs2-devel] [PATCH] ocfs2/dlm: remove unreasonable BUG_ON()

Wengang Wang wen.gang.wang at oracle.com
Tue May 25 03:57:34 PDT 2010


A node "down" can happen at any time. When that happens, all locres' owned
by the "down" node are move to recovery list. They also are marked as in
recovery and the owner are set to "unknown" under dlm->spinlock and
res->spinlock.

Any place shouldn't believe the owner is not changed to "unknown" after
dropping the lockres and retaking them.

Signed-off-by: Wengang Wang <wen.gang.wang at oracle.com>
---
 fs/ocfs2/dlm/dlmthread.c |   17 +++++++++--------
 1 files changed, 9 insertions(+), 8 deletions(-)

diff --git a/fs/ocfs2/dlm/dlmthread.c b/fs/ocfs2/dlm/dlmthread.c
index 11a6d1f..cf86b43 100644
--- a/fs/ocfs2/dlm/dlmthread.c
+++ b/fs/ocfs2/dlm/dlmthread.c
@@ -675,15 +675,16 @@ static int dlm_thread(void *data)
 			 * dirty_list in this gap, but that is ok */
 
 			spin_lock(&res->spinlock);
-			if (res->owner != dlm->node_num) {
-				__dlm_print_one_lock_resource(res);
-				mlog(ML_ERROR, "inprog:%s, mig:%s, reco:%s, dirty:%s\n",
-				     res->state & DLM_LOCK_RES_IN_PROGRESS ? "yes" : "no",
-				     res->state & DLM_LOCK_RES_MIGRATING ? "yes" : "no",
-				     res->state & DLM_LOCK_RES_RECOVERING ? "yes" : "no",
-				     res->state & DLM_LOCK_RES_DIRTY ? "yes" : "no");
+			if (unlikely(res->owner != dlm->node_num)) {
+				/*
+				 * there is chance that the lockres is marked
+				 * as in recovery if a node down happened.
+				 */
+				if (!(res->state & DLM_LOCK_RES_RECOVERING)) {
+					__dlm_print_one_lock_resource(res);
+					BUG();
+				}
 			}
-			BUG_ON(res->owner != dlm->node_num);
 
 			/* it is now ok to move lockreses in these states
 			 * to the dirty list, assuming that they will only be
-- 
1.6.6.1




More information about the Ocfs2-devel mailing list