Re: [Gluster-devel] Client side AFR race conditions?

gluster-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Gluster-devel] Client side AFR race conditions?

From:	Kevan Benson
Subject:	Re: [Gluster-devel] Client side AFR race conditions?
Date:	Tue, 06 May 2008 14:47:34 -0700
User-agent:	Thunderbird 1.5.0.12 (X11/20071220)

Derek Price wrote:

Kevan Benson wrote:
I'm not saying I don't want to see a more robust solution for clientside AFR, just that each configuration has it's place, and client sideAFR isn't currently (and may never be) capable of serving a share thatrequires high data integrity.
If you think fixing this current issue will solve your problems, maybeyou haven't considered the implications of connectivity problemsbetween some clients and some (not all) servers... Add in someclients with slightly off timestamps and you might have some majorproblems WITHOUT any reboots.
Am I getting this straight? Even with server-side AFR, you get mirrors,but if all the clients aren't talking to the same server then there isno forced synchronization going on? How hard would it be to implementsome sort of synchronization/locking layer over AFR such that reads andwrites could still go to the nearest (read: fastest) possible server yetstill be guaranteed to be in sync?

Server side AFR should be susceptible to the same problems as clientside AFR in clients can use arbitrary servers. E.g. Client A writes toServer A for file X at the same time Client B writes to Server B forfile X. Server A and B are essentially "clients" to the AFR, so thesame race condition should exist. Possibly even exacerbated due to thespeed difference in local verses remote AFR sub-volumes.

In other words, the majority of servers would know of new versionnumbers being written anywhere and yet reads would always serve localcopies (potentially after waiting for synchronization). The applicationI'm thinking of is virtualized read/write storage. For example, say youwant to share some sort of data repository with offices in Europe,India, and the U.S. and you only have slow links connecting the variousoffices. You would want all client access to happen against a localmirror, and you would want to restrict traffic between the mirrors tothat absolutely required for locking and data synchronization.
The only thing I'm not quite sure of in this model is what to do if theserver processing a write operation crashes before the write finishes. Iwouldn't want reads against the other mirrors to have to waitindefinitely for the crashed server to return, so the best I can come upwith is that "write locks" for any files that hadn't been mirrored to atleast one available server before a crash would need to be revoked onthe first subsequent attempted access of the unsynchronized file. Thenwhen the crashed server came back up and tried to synchronize, it wouldfind that its file wasn't the current version and sync in the otherdirection.

I would think a specialized translator would work great for this.Something optimized for the server, where it intercepts writes andcreates binary diffs for syncing instead of copying the whole file. Inessence, trade computing power for bandwidth. That doesn't help rightnow though, and it doesn't address locking.

The only way I see to ensure data integrity is to have some arbiter vetall writes. You can try to make that arbiter redundant, but good luckmaking it actually distributed.



--

-Kevan Benson
-A-1 Networks

[Prev in Thread]

Current Thread

[Next in Thread]

Re: [Gluster-devel] Client side AFR race conditions?, (continued)

Prev by Date: Re: [Gluster-devel] Client side AFR race conditions?
Next by Date: Re: [Gluster-devel] Client side AFR race conditions?
Previous by thread: Re: [Gluster-devel] Client side AFR race conditions?
Next by thread: Re: [Gluster-devel] Client side AFR race conditions?
Index(es):
- Date
- Thread