Re: Fwd: [Gluster-devel] proposals to afr

gluster-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Fwd: [Gluster-devel] proposals to afr

From:	Kevan Benson
Subject:	Re: Fwd: [Gluster-devel] proposals to afr
Date:	Tue, 23 Oct 2007 09:48:54 -0700
User-agent:	Thunderbird 2.0.0.6 (X11/20070728)

Alexey Filin wrote:

correctly, if afr xlator node doesn't crash during writing. If it crashes
the close() is not issued at all and version attribute is not updated
(according to your description). If children update version without
assistance as a workaround after afr xlator node crash, the new versions of
replicas are equal but data can be different (because operations are queued
in interconnect and issued sequentially as datagrams come out). Such a
situation can't occur if every operation is atomic relative to the version
attribute i.e. the attribute is updated instantly after every operation.

I'll be happy if don't know something what helps to handle the situation
correctly in current implementation.

Actually, I just thought of a major problem with this. I think theextended attributes need to be set as atomic operations. Imagine thecase where two processes are writing the file at the same time, the opcounters could get very messed up.

Another solution comes to mind. Just set another extended attributedenoting that the file is being written to currently (and unset itafterwards). If the AFR subvolume notices that the file islisted asbeing written to but no clients have it open (I hope this is easilydeterminable) a flag is returned for the file. If all subvolumes returnthis flag for the file in the AFR (and all the trusted_afr_versions arethe same), choose one version of the file (for example from the firstAFR subvolume) as the legit copy and copy it to the other AFR nodes. Itdoesn't matter which version is the most up to date, they will all befairly close, and since this is from a failed write operation there wasno guarantee the file was in a valid state after the write. it'sdoesn't matter which copy you get, as long as it's consistent across AFRmembers.


P.S.

For those still unsure what we are referring to, it's the case where awrite to an AFR fails, so no AFR subvolume finishes and calls close().In this case the trusted_afr_version hasn't been incremented, but theactual data in the files may not be consistent across AFR subvolumes.As I've seen in prior testing, subsequent operations on the file willhappen independently on each subvolume, and the files may continue tostay out of sync. The data in the file may not be entirely trusted doto the failed write, but it should at least be consistent across AFRsubvolumes. An AFR subvolume failure should not change what data isreturned.


--

-Kevan Benson
-A-1 Networks

[Prev in Thread]

Current Thread

[Next in Thread]

[Gluster-devel] proposals to afr, Alexey Filin, 2007/10/21
- Re: [Gluster-devel] proposals to afr, Kevan Benson, 2007/10/21
  - Re: [Gluster-devel] proposals to afr, Alexey Filin, 2007/10/22
    - Re: [Gluster-devel] proposals to afr, Kevan Benson, 2007/10/22
  - Re: [Gluster-devel] proposals to afr, Krishna Srinivas, 2007/10/23
    - Message not available
    - Fwd: [Gluster-devel] proposals to afr, Alexey Filin, 2007/10/23
    - Re: Fwd: [Gluster-devel] proposals to afr, Kevan Benson <=
    - Re: Fwd: [Gluster-devel] proposals to afr, Alexey Filin, 2007/10/24
    - Re: Fwd: [Gluster-devel] proposals to afr, Kevan Benson, 2007/10/24
    - Re: Fwd: [Gluster-devel] proposals to afr, Alexey Filin, 2007/10/25
    - Re: Fwd: [Gluster-devel] proposals to afr, Alexey Filin, 2007/10/25
    - Re: Fwd: [Gluster-devel] proposals to afr, Krishna Srinivas, 2007/10/25
    - Re: Fwd: [Gluster-devel] proposals to afr, Chris Johnson, 2007/10/25
    - Message not available
    - Re: Fwd: [Gluster-devel] proposals to afr, Chris Johnson, 2007/10/25
    - Re: [Gluster-devel] option client-volume-filename (was) Re: Fwd: [Gluster-devel] proposals to afr, Matt Paine, 2007/10/25
    - Re: [Gluster-devel] option client-volume-filename (was) Re: Fwd: [Gluster-devel] proposals to afr, Chris Johnson, 2007/10/26
    - Re: Fwd: [Gluster-devel] proposals to afr, Krishna Srinivas, 2007/10/25

Prev by Date: [Gluster-devel] performance improvements
Next by Date: Re: [Gluster-devel] performance improvements
Previous by thread: Fwd: [Gluster-devel] proposals to afr
Next by thread: Re: Fwd: [Gluster-devel] proposals to afr
Index(es):
- Date
- Thread