Re: [Gluster-devel] Re; Load balancing ...


From: gordan
Subject: Re: [Gluster-devel] Re; Load balancing ...
Date: Fri, 25 Apr 2008 16:29:15 +0100 (BST)
User-agent: Alpine 1.10 (LRH 962 2008-03-14)

On Fri, 25 Apr 2008, Gareth Bult wrote:

Well, here's the thing. I've tried to apply Gluster in 8 different "real world" scenarios, and each time I've failed, either because of bugs or because "this simply isn't what GlusterFS is designed for".

[...]

Suggesting that I'm either not tuning it properly or should be using an alternative filesystem is, I'm afraid, a bit of a cop-out. There are real problems here, and saying "yes, but Gluster is only designed to work in specific instances" is frankly a bit daft. If that were the case, then instead of a heavy sales pitch on the website along the lines of "Gluster is wonderful and does everything", it should say "Gluster will do x, y and z, only."

The impression I got from the site is that it isn't yet very mature, but is usable. IMO, it stops way short of the "Gluster is wonderful and does everything" claim.

Now, Zope is a long-standing web-based application server that I've been using for nearly 10 years; telling me it's "excessive" really doesn't fly. Trying to back up a Gluster AFR volume with rsync runs into similar problems when you have lots of small files: it takes far longer than it should.

How many nodes have you got? Have you tried running it with RHCS+GFS in an otherwise similar setup? If so, how did the performance compare?

Moving to the other end of the scale, AFR can't cope with large files either: handling of sparse files doesn't work properly, and self-heal has no concept of repairing part of a file. So sticking a 20GB file on a GlusterFS volume is just asking for trouble, as every time you restart a Gluster server (or every time one crashes) it'll crucify your network.

I've thought about this, and there isn't really a way to do anything about it unless you relax the constraints. You could do an rsync-style rolling-checksum block sync, but that would both take up more CPU time and leave theoretical scope for the file not being the same on both ends. Whether that minute possibility of corruption that the hashing algorithm doesn't pick up is a reasonable trade-off, I don't know. Perhaps if such a thing were implemented it should be made optional.
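
For what it's worth, here is a rough sketch of the kind of thing I mean, in Python rather than anything GlusterFS-specific; the block size and the choice of Adler-32/MD5 are arbitrary assumptions for illustration. Self-heal would then copy only the differing blocks (and truncate or extend the file), and the residual corruption window is a changed block whose weak and strong checksums both happen to collide.

import hashlib
import zlib

BLOCK = 128 * 1024   # assumed block size, purely for illustration

def block_signatures(path):
    """Per-block (weak, strong) checksums: Adler-32 as a cheap stand-in for
    rsync's rolling checksum, MD5 to confirm apparent matches."""
    sigs = []
    with open(path, "rb") as f:
        while True:
            block = f.read(BLOCK)
            if not block:
                break
            sigs.append((zlib.adler32(block), hashlib.md5(block).hexdigest()))
    return sigs

def blocks_to_heal(stale_sigs, good_sigs):
    """Indexes of blocks that differ and would need copying from the good copy."""
    return [i for i, good in enumerate(good_sigs)
            if i >= len(stale_sigs) or stale_sigs[i] != good]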

Now, a couple of points:

a. With regards to metadata, given two volumes mirrored via AFR, please can you
   explain to me why it's OK to do a data read operation against one node only,
   but not a metadata read operation, and what would break if you read metadata
   from only one volume?

The fact that the file may have been deleted or modified by the time you try to open it. A file's content is a feature of the file; whether the file is there and/or up to date is a feature of the metadata of the file and its parent directory. If you start loosening this, you might as well disconnect the nodes, run them in a deliberately split-brained state, and resync periodically, with all the conflict and data loss that entails.

b. Looking back through the list, Gluster's non-caching mechanism for
   acquiring file-system information seems to be at the root of many of
   its performance issues. Is there no mileage in trying to address
   this issue?

How would you propose to obtain full POSIX locking/consistency without this? Look at the similar alternatives, like DRBD + [GFS | OCFS2]. They require either shared storage (SAN) or a block-level replicated device (DRBD). Split-braining in those cases is a non-option, and you need 100% functional fencing to forcefully disable the failed node, or you risk extensive corruption. GlusterFS, being file-based, works around the risk of trashing the entire FS on the block device. Having a shared/replicated block device works around part of the problem because all the underlying data is replicated, but you'll find that GFS and OCFS2 also suffer similar performance penalties with lots of small files due to locking, especially at the directory level. If anything, the design of GlusterFS is better for that scenario.

Since in GFS there is no scope for split-brain operation, you can guarantee that everything that was written is what is accessible; thus the main source of contention is the write locks. In GlusterFS the split-brain requirement is relaxed, but to compensate for this and still maintain FS consistency, the metadata has to be checked each time. If you need this relaxed further, then you have to move away from the POSIX locking requirements, which puts you outside the realm of GlusterFS use-cases and into a more WAN-oriented FS like Coda.
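
To illustrate the cost (purely conceptual Python with a made-up Replica class; the version numbers stand in for whatever change-tracking the real AFR translator uses, so this is not actual GlusterFS code): before anything can be opened, every replica has to answer a metadata lookup, and only the data read itself can then be served from a single node.

# Conceptual sketch only -- hypothetical Replica objects, not the real AFR translator.

class Replica:
    """Stand-in for one AFR subvolume: holds (version, data) per path."""
    def __init__(self, files):
        self.files = files                      # {path: (version, data)}

    def lookup(self, path):
        return self.files.get(path)             # one network round-trip in reality

def afr_open(path, replicas):
    """Open a replicated file: every replica's metadata is consulted first."""
    entries = [r.lookup(path) for r in replicas]
    live = [e for e in entries if e is not None]
    if not live:
        raise FileNotFoundError(path)           # deleted everywhere
    version, data = max(live)                   # newest copy wins
    needs_heal = any(e is None or e[0] < version for e in entries)
    # Data reads can then be served from one up-to-date replica; the per-open
    # cost is the metadata round-trip to all of them.
    return data, needs_heal

# Example: one replica lags behind after a missed write.
r1 = Replica({"/etc/motd": (2, b"new contents")})
r2 = Replica({"/etc/motd": (1, b"old contents")})
print(afr_open("/etc/motd", [r1, r2]))          # (b'new contents', True)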

c. If I stop one of my two servers, AFR suddenly speeds up "a lot"!
   Would it be so bad if there were an additional option, "subvolume-read-meta"?
   This would probably involve only a handful of additional lines of code, if
   that?

How are your clients and servers organized? Are you using server-server based AFR? Or do you have clients doing the AFR-ing? Do you have more clients than servers? Have you tried adjusting the timeout options to glusterfs (-a, -e)?
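
For comparison, this is roughly what I mean by clients doing the AFR-ing, sketched in GlusterFS 1.3-style volfile syntax with made-up names (server1, server2, brick): each client talks to both servers and does the mirroring itself, as opposed to the servers replicating between themselves.

# client.vol -- sketch only, hypothetical hosts and volume names
volume remote1
  type protocol/client
  option transport-type tcp/client
  option remote-host server1
  option remote-subvolume brick
end-volume

volume remote2
  type protocol/client
  option transport-type tcp/client
  option remote-host server2
  option remote-subvolume brick
end-volume

volume mirror
  type cluster/afr
  subvolumes remote1 remote2
end-volume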

Gordan



