[Ocfs2-users] OCFS2 performance and debug help
Jay V
jvasaoo at gmail.com
Mon Apr 9 10:22:30 PDT 2012
On Mon, Apr 9, 2012 at 9:38 AM, Joel Becker <jlbec at evilplan.org> wrote:
> On Sun, Apr 08, 2012 at 10:54:52AM -0700, Jay V wrote:
>> I have a similar blocking/hanging/stall issue. On Oracle 6.2/x64. We
>> are running OCFS2 on 3 partitions about 14-15TB each. One of the
>> partitions has been running extremely slowly too. They are running on
>> the same hardware-- LSI HW Raid Cards and Enterprise Drives. I am
>> running over drbd and nfsd.
>
> How do drbd and nfsd fit in your scheme? nfsd is on top of
> ocfs2, right? You're exporting ocfs2 volumes via NFS? Can you think of
> any differences between the slow partition and the fast ones?
>
Oh I miss wrote. It's a basic setup. I'm exporting NFS to the clients.
There are no differences I can think of of this partition that the
other 2, except it is more full at 83% vs the others at 76% and 74%.
>> I am getting really slow performance in writes. The process
>> [jbd2-drbd-18] seems to be stuck for a long length of time (about 2
>> minutes) before it finally commits. This jbd2-drbd2 prevents any other
>> writes from happening which stalls the system.
>
> I'm worried that drbd is doing things slowly and getting in your
> way. But given the slowdown you are replying to, that might not be the
> only issue.
>
It's not DRBD. Even when I disconnect DRBD, so it's running on just 1
node, it has the performance issues.
>> This would be from "ps auxr"
>> USER PID %CPU %MEM VSZ RSS TTY STAT START TIME COMMAND
>> root 6374 0.0 0.0 0 0 ? D Apr03 0:28 [jbd2/drbd2-18]
>> root 6876 0.0 0.0 0 0 ? D Apr06 0:11 [nfsd]
>> root 6884 0.0 0.0 0 0 ? D Apr06 0:09 [nfsd]
>> root 6999 0.0 0.0 0 0 ? D Apr06 0:45 [nfsd]
>> root 7046 0.0 0.0 0 0 ? D Apr06 0:09 [nfsd]
>> root 7053 0.0 0.0 0 0 ? D Apr06 0:09 [nfsd]
>> root 7054 0.0 0.0 0 0 ? D Apr06 0:08 [nfsd]
>>
>> Running "scan_locks2" shows nothing. Nothing is held up locking wise.
>> It seems to happen more with files copying about 1MB or larger. It
>> only happens for me on my second partition, but not the other 2. It
>> seems to super slow in writes. Reads are fast.
>>
>> I hope to find a solution quickly too. I wonder it is because we have
>> very large partitions.
>
> It shouldn't be because of large partitions. How long have your
> partitions been around? How full are they?
>
The partitions have been around about 1 1/2 years. It has been running
well for so long. It's not the hard drives too. Everything is optimal,
just slow on writes. Reads are fast.
I opened up bug 1349, it gives more information:
http://oss.oracle.com/bugzilla/show_bug.cgi?id=1349
Thanks for your help,
Jay
> Joel
>
> --
>
> One look at the From:
> understanding has blossomed
> .procmailrc grows
> - Alexander Viro
>
> http://www.jlbec.org/
> jlbec at evilplan.org
More information about the Ocfs2-users
mailing list