gluster-devel

Re: [Gluster-devel] afr logic


From: Kevan Benson
Subject: Re: [Gluster-devel] afr logic
Date: Wed, 17 Oct 2007 09:38:25 -0700
User-agent: Thunderbird 2.0.0.6 (X11/20070728)

Alexey Filin wrote:
Hi Kevan,

Consistency of afr'ed files matters with respect to backend-fs failures too. AFR is a remedy for node failures, not backend-fs failures (at least not directly): after a hw/sw failure, fsck can change files "legally" behind glusterfs's back, and those changes have to be handled on the corrupted replica, or else reads of the same file can return different data (especially with the forthcoming load-balanced reads across replicas). Fortunately, rsync'ing from the original should still produce a consistent replica in that case (provided cluster/stripe under afr lays data out identically on all replicas); unfortunately, rsync does not copy extended attributes (I tested it), which may be required during repair.

It seems glusterfs could try to handle hw/sw failures in the backend fs with checksums kept in extended attributes. The checksums should be computed per file chunk, because a single whole-file checksum has to be fully recomputed after appending or changing even one byte of a gigabyte file. In that case glusterfs would either have to recompute the checksums of every file on a corrupted fs (which may take far too long, just as with rsync'ing) or obtain a list of corrupted files from the backend fs in some way (e.g. via a flag set by fsck in extended attributes). Perhaps some kind of distributed RAID is a better solution; a first step in that direction was already taken by cluster/stripe (unfortunately one implementation, DDRaid, http://sources.redhat.com/cluster/ddraid/ by Daniel Phillips, seems to be suspended). Then again, that may be too computationally and network intensive, and RAID underneath the backend fs may be the best solution after all, even taking the disk-space overhead into account.
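The per-chunk checksum idea above could be sketched roughly as follows (the chunk size and hash choice are invented for illustration; the mail only proposes the concept, and where the digests would be stored — e.g. an extended attribute — is left open):

```python
import hashlib

CHUNK = 128 * 1024  # illustrative chunk size, not anything glusterfs uses


def chunk_sums(path, chunk=CHUNK):
    """Return one SHA-1 digest per fixed-size chunk of the file.

    Appending a byte only dirties the final chunk, so only that
    chunk's digest needs recomputing -- which is the point made
    above about avoiding a full recalculation on a gigabyte file.
    The resulting list could, in principle, be serialized into an
    extended attribute on the file.
    """
    sums = []
    with open(path, "rb") as f:
        while True:
            data = f.read(chunk)
            if not data:
                break
            sums.append(hashlib.sha1(data).hexdigest())
    return sums
```

A repair pass would then only need to re-hash chunks whose stored digest no longer matches, rather than rescanning whole files.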

I'm very interested to hear the glusterfs developers' thoughts on this, to clear up my misunderstanding.

The rsync case can probably be handled with a separate pass: find the files carrying the appropriate attributes on the source and set them on the target. A simple bash/perl script could handle this in a few lines.
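A minimal sketch of such a pass (in Python rather than bash/perl; the idea of walking the source tree and mirroring xattrs is from the mail, but every detail here — and the Linux-only os.listxattr/os.getxattr/os.setxattr calls — is illustrative):

```python
#!/usr/bin/env python3
"""Copy extended attributes that rsync left behind (sketch only)."""
import os
import sys


def copy_xattrs(src_root, dst_root,
                list_xattr=None, get_xattr=None, set_xattr=None):
    """Mirror xattrs of every file under src_root onto dst_root.

    The xattr calls are injectable so the logic can be exercised
    without a filesystem that supports extended attributes; by
    default the Linux-only os.*xattr functions are used.
    """
    list_xattr = list_xattr or os.listxattr
    get_xattr = get_xattr or os.getxattr
    set_xattr = set_xattr or os.setxattr
    for dirpath, _dirs, files in os.walk(src_root):
        for name in files:
            src = os.path.join(dirpath, name)
            dst = os.path.join(dst_root, os.path.relpath(src, src_root))
            if not os.path.exists(dst):
                continue  # rsync did not copy this file; skip it
            for attr in list_xattr(src):
                set_xattr(dst, attr, get_xattr(src, attr))


if __name__ == "__main__" and len(sys.argv) == 3:
    copy_xattrs(sys.argv[1], sys.argv[2])
```

Run after rsync completes, e.g. `copy_xattrs.py /export/src /export/dst`; note that copying trusted.* attributes requires root.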

The fsck case is more interesting, but if you could get fsck to report the names of files/directories that have problems without fixing them, it's easy to pipe that list to a script that removes the trusted.afr.version attribute on those files, and AFR will then heal itself.
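Assuming fsck could be made to emit one affected path per line, the attribute-strip step might look like this (the xattr name trusted.afr.version is from the mail; the Linux-only os.removexattr call, the one-path-per-line input format, and everything else here are assumptions):

```python
import os
import sys

AFR_ATTR = "trusted.afr.version"  # attribute name taken from the mail


def strip_afr_version(paths, remove_xattr=None):
    """Drop the AFR version xattr so self-heal re-syncs each file.

    remove_xattr is injectable for testing; by default the
    Linux-only os.removexattr is used. Returns the paths that
    were actually cleaned.
    """
    remove_xattr = remove_xattr or os.removexattr
    cleaned = []
    for path in paths:
        try:
            remove_xattr(path, AFR_ATTR)
            cleaned.append(path)
        except OSError:
            pass  # attribute absent or path gone; nothing to do
    return cleaned


if __name__ == "__main__" and len(sys.argv) == 2:
    # e.g.: strip_afr.py /tmp/fsck-report.txt (hypothetical report file)
    with open(sys.argv[1]) as report:
        strip_afr_version(line.strip() for line in report)
```

Removing trusted.* attributes requires root on the backend nodes; on the next access AFR should then treat the file as stale and re-sync it from a good replica.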

Checksums would of course give you much better tracking of corrupted files, but I imagine the CPU strain and speed decrease would make them infeasible.

--

-Kevan Benson
-A-1 Networks



