Re: [Gluster-devel] Client side AFR race conditions?

gluster-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Gluster-devel] Client side AFR race conditions?

From:	Derek Price
Subject:	Re: [Gluster-devel] Client side AFR race conditions?
Date:	Wed, 07 May 2008 12:40:06 -0400
User-agent:	Thunderbird 2.0.0.14 (Windows/20080421)

address@hidden wrote:

On Wed, 7 May 2008, Anand Avati wrote:
The only way I see to ensure data integrity is to have some arbiter vet
all writes.  You can try to make that arbiter redundant, but good luck
making it actually distributed.
I've seen the distributed arbiter done in proprietary software, so it
must be possible.  The design is pretty clear to me, but I have no idea
where to start integrating the idea into glusterfs, though gluster's the
closest thing to what I need that I've seen in open source.
Can you give some details/links? We would be interested to learn aboutit.
I suspect what was referred to was a system where the locks are notifiedto every host, not an actually load sharing system. DLM (RHCS/GFS) doesit by multicasting, presumably with acknowledgements being returned fromeach connected node. I've not looked at the DLM protocol in greatdetail, so I don't know what the details are.

Actually, I was thinking of WANdisco's Multi-site CVS/SVN/MySQLmirroring software. It's not generalized to the point of being a diskload sharing system, exactly, but I think the concept and the problemsare the same. They use a quorum locking model and basically journal thetransaction with whichever server they are wrapping for later replay onthe other servers.

There used to be a white-paper on WANdisco's protocol online (I haven'tlooked recently). I didn't know much about DLM (and, after reading whatdocumentation I could find online just now, I don't feel like I knowmuch more), but it sounds like DLM uses a similar quorum model for locking.

As for the versioning (and perhaps this is relevant to the discussiontaking place in another thread), I don't see how this can be donewithout meta-data journaling, so why not make things even simpler andshare a unique version number between all entities changed in atransaction? So, for any server to acquire an implicit write lock, thequorum must agree to increment a global transaction ID (which could alsobe attached in the FS as a directory and/or file's version number).Then, as long as any given system knew that its journal/replay wasup-to-date with the latest transaction ID according to the quorum, thenit could trust a file's content without consulting a file-specificrevision number.

If a server was not completely up-to-date, then it would at least haveto synchronize the meta-data journal and consult it to find if arequested file had any pending writes and decide whether it needed tosynchronize the file before serving it.


Regards,

Derek
--
Derek R. Price
Solutions Architect
Ximbiot, LLC <http://ximbiot.com>
Get CVS and Subversion Support from Ximbiot!

v: +1 248.835.1260
f: +1 248.246.1176

[Prev in Thread]

Current Thread

[Next in Thread]

Re: [Gluster-devel] Client side AFR race conditions?, (continued)

Prev by Date: Re: [Gluster-devel] trusted.glusterfs.version xattr
Next by Date: Re: [Gluster-devel] Has anyone... pure nfs replacement
Previous by thread: Re: [Gluster-devel] Client side AFR race conditions?
Next by thread: [Gluster-devel] GlusterFS configuration scripts
Index(es):
- Date
- Thread