[Ocfs2-devel] [PATCH] Track negative dentries v2
Goldwyn Rodrigues
rgoldwyn at gmail.com
Fri Jun 25 09:13:56 PDT 2010
Thanks Mark for reviewing.
On Thu, Jun 24, 2010 at 6:03 PM, Mark Fasheh <mfasheh at suse.com> wrote:
> Hi Goldwyn,
>
> The patch looks pretty good so far - thanks for taking on this task!
> My comments are inlined.
>
> On Wed, Jun 23, 2010 at 08:45:10PM -0500, Goldwyn Rodrigues wrote:
>> Track negative dentries by recording the generation number of the parent
>> directory in d_fsdata. The generation number for the parent directory is
>> recorded in the inode_info, which increments every time the lock on the
>> directory is dropped.
>>
>> If the generation number of the parent directory and the negative dentry
>> matches, there is no need to perform the revalidate, else a revalidate
>> is forced. This improves performance in situations where nodes look for
>> the same non-existent file multiple times.
>>
>> Thanks Mark for explaining the DLM sequence.
>>
>> Signed-off-by: Goldwyn Rodrigues <rgoldwyn at suse.de>
>> ---
>> diff --git a/fs/ocfs2/dcache.c b/fs/ocfs2/dcache.c
>> index b4957c7..6db61a3 100644
>> --- a/fs/ocfs2/dcache.c
>> +++ b/fs/ocfs2/dcache.c
>> @@ -40,6 +40,16 @@
>> #include "inode.h"
>> #include "super.h"
>>
>> +/* Attach a generation number to a negative dentry */
>> +void ocfs2_dentry_attach_gen(struct dentry *dentry)
>> +{
>> + unsigned long gen = OCFS2_I(dentry->d_parent->d_inode)->ip_generation;
>> + /* Specifically for negative dentries */
>
>> + if (dentry->d_inode)
>> + BUG();
>
> You should turn those two lines into:
> BUG_ON(dentry->d_inode);
>
Yes. Incorporated.
>
>> + dentry->d_fsdata = (void *)gen;
>> +}
>> +
>>
>> static int ocfs2_dentry_revalidate(struct dentry *dentry,
>> struct nameidata *nd)
>> @@ -51,11 +61,22 @@ static int ocfs2_dentry_revalidate(struct dentry *dentry,
>> mlog_entry("(0x%p, '%.*s')\n", dentry,
>> dentry->d_name.len, dentry->d_name.name);
>>
>> - /* Never trust a negative dentry - force a new lookup. */
>
> Btw, it occurs to me that we could also optimize away local mounts in a
> seperate patch by always trusting the dentry passed in that case.
>
>
>> + /* For a negative dentry -
>> + check the generation number of the parent and compare with the
>> + one stored in the inode.
>> + */
>
> The preferred style for long (multi-line) comments is:
>
> /*
> * line one
> * line two
> */
>
>
Incorporated.
>> if (inode == NULL) {
>> - mlog(0, "negative dentry: %.*s\n", dentry->d_name.len,
>> - dentry->d_name.name);
>> - goto bail;
>> + unsigned long gen = (unsigned long) dentry->d_fsdata;
>> + unsigned long pgen =
>> + OCFS2_I(dentry->d_parent->d_inode)->ip_generation;
>> + mlog(0, "negative dentry: %.*s parent gen: %lu "
>> + "dentry gen: %lu\n",
>> + dentry->d_name.len, dentry->d_name.name, pgen, gen);
>> + if (gen != pgen) {
>> + dentry->d_fsdata = (void *) pgen;
>> + goto bail;
>> + }
>
> What are you setting the dentry generation here? We should only be
> doing that when under cluster lock when we are guaranteed that the dentry is
> negative.
>
Hm, I was thinking in terms to avoid the validation on the next
lookup. However, I did not consider the failure path of the
revalidation/lookup/locking operations. Will remove this.
>
>> + goto valid;
>> }
>>
>> BUG_ON(!osb);
>> @@ -96,6 +117,7 @@ static int ocfs2_dentry_revalidate(struct dentry *dentry,
>> goto bail;
>> }
>>
>> +valid:
>> ret = 1;
>>
>> bail:
>> @@ -227,6 +249,12 @@ int ocfs2_dentry_attach_lock(struct dentry *dentry,
>> if (!inode)
>> return 0;
>>
>> + if (!dentry->d_inode && dentry->d_fsdata) {
>> + /* Converting a negative dentry to positive
>> + Clear dentry->d_fsdata */
>> + dentry->d_fsdata = dl = NULL;
>> + }
>> +
>> if (dl) {
>> mlog_bug_on_msg(dl->dl_parent_blkno != parent_blkno,
>> " \"%.*s\": old parent: %llu, new: %llu\n",
>> @@ -451,6 +479,7 @@ static void ocfs2_dentry_iput(struct dentry
>> *dentry, struct inode *inode)
>> ocfs2_dentry_lock_put(OCFS2_SB(dentry->d_sb), dl);
>>
>> out:
>> + ocfs2_dentry_attach_gen(dentry);
>
> Why the call to ocfs2_dentry_attach_gen() here?
>
This is called when a positive dentry is converted into negative.
However, this would trigger the BUG_ON in ocfs2_dentry_attach_gen().
Will find a better way to do this, perhaps NULL'ing the d_inode after
iput.
>
>> iput(inode);
>> }
>>
>> diff --git a/fs/ocfs2/dcache.h b/fs/ocfs2/dcache.h
>> index f5dd178..b79eff7 100644
>> --- a/fs/ocfs2/dcache.h
>> +++ b/fs/ocfs2/dcache.h
>> @@ -64,5 +64,6 @@ void ocfs2_dentry_move(struct dentry *dentry, struct
>> dentry *target,
>> struct inode *old_dir, struct inode *new_dir);
>>
>> extern spinlock_t dentry_attach_lock;
>> +void ocfs2_dentry_attach_gen(struct dentry *dentry);
>>
>> #endif /* OCFS2_DCACHE_H */
>> diff --git a/fs/ocfs2/dlmglue.c b/fs/ocfs2/dlmglue.c
>> index 39eb16a..bf79dcf 100644
>> --- a/fs/ocfs2/dlmglue.c
>> +++ b/fs/ocfs2/dlmglue.c
>> @@ -3635,10 +3635,18 @@ static int ocfs2_data_convert_worker(struct
>> ocfs2_lock_res *lockres,
>> {
>> struct inode *inode;
>> struct address_space *mapping;
>> + struct ocfs2_inode_info *oi;
>>
>> inode = ocfs2_lock_res_inode(lockres);
>> mapping = inode->i_mapping;
>>
>> + if (S_ISDIR(inode->i_mode)) {
>> + oi = OCFS2_I(inode);
>> + oi->ip_generation++;
>> + mlog(0, "generation: %u\n", oi->ip_generation);
>> + goto out;
>> + }
>> +
>> if (!S_ISREG(inode->i_mode))
>> goto out;
>>
>> diff --git a/fs/ocfs2/inode.h b/fs/ocfs2/inode.h
>> index 9f5f5fc..34bad2c 100644
>> --- a/fs/ocfs2/inode.h
>> +++ b/fs/ocfs2/inode.h
>> @@ -70,6 +70,7 @@ struct ocfs2_inode_info
>> /* Only valid if the inode is the dir. */
>> u32 ip_last_used_slot;
>> u64 ip_last_used_group;
>> + u32 ip_generation; /*for -ve dentry */
>
> Comment doesn't make sense :) Also, can you pick something we're less
> likely to confuse with i_generation? Maybe ip_dir_lock_gen?
>
Ok.
> Also, it should be initialized to a nonzero value in when we're initializing
> the directory inode under cluster lock in ocfs2_read_locked_inode().
Yes, you're right. Both may start from zero, making it avoid the
revalidation when it shouldn't.
Thanks,
--
Goldwyn
More information about the Ocfs2-devel
mailing list