[Gluster-devel] Operation reordering and data corruption

gluster-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Gluster-devel] Operation reordering and data corruption

From:	Emmanuel Dreyfus
Subject:	[Gluster-devel] Operation reordering and data corruption
Date:	Thu, 8 Sep 2011 15:04:57 +0200
User-agent:	MacSOUP/2.7 (unregistered for 1692 days)

Hi

I experience a nasty data corruption bug that I can track down to a bad
interraction between how the NetBSD kernel resize files and an operation
reordering ability glusterfs seems to have.

Consider the exchanges below that I logged at the FUSE level:

>> SETATTR size = 12178 -> 32394
>> WRITE size = 12178
<< WRITE size = 12178 -> 32394
>> GETATTR size = 32394
<< GETATTR size = 32394 -> 12178
<< SETATTR size = 32394 -> 32394

SETATTR si sent before WRITE, but WRITE completes first. Then GETATTR
kicks in and it seems SETATTR is not yet completed, since the old size
is obtained.

This is a real problem, because the NetBSD kernel will detect a file
shrink in the GETATTR vnode method, and it will truncate the file to
match the new size. The result is that the WRITE that was just done is
discarded, and data is replaced by a chunk of zeroes.

In what conditions glusterfs performs operation reordering? Is it a bug
or a feature? Linux does not truncate in the same situation?

-- 
Emmanuel Dreyfus
http://hcpnet.free.fr/pubz
address@hidden

[Prev in Thread]

Current Thread

[Next in Thread]

[Gluster-devel] Operation reordering and data corruption, Emmanuel Dreyfus <=
- Re: [Gluster-devel] Operation reordering and data corruption, Anand Avati, 2011/09/08

Prev by Date: [Gluster-devel] Announcing: International GlusterFS Scale-out Community Contest
Next by Date: Re: [Gluster-devel] Operation reordering and data corruption
Previous by thread: [Gluster-devel] Announcing: International GlusterFS Scale-out Community Contest
Next by thread: Re: [Gluster-devel] Operation reordering and data corruption
Index(es):
- Date
- Thread