Re: [Gluster-devel] Performance Translators' Stability and Usefulness


From: Gordan Bobic
Subject: Re: [Gluster-devel] Performance Translators' Stability and Usefulness
Date: Sun, 05 Jul 2009 02:19:10 +0100
User-agent: Thunderbird 2.0.0.22 (X11/20090625)

Geoff Kassel wrote:

Sounds like a lot of effort and micro-downtime compared to a migration
to something else. Have you explored other options like PeerFS, GFS and
SeznamFS? Or NFS exports with failover rather than Gluster clients, with
Gluster only server-to-server?

These options are not production ready for what I need (as I believe has already been pointed out to the list);

What is production-unready about PeerFS or SeznamFS (any more so than Gluster)?

or in the case of NFS, defeating the point of redundancy in the first place.

You can fail over NFS servers. If the servers themselves are mirrored (DRBD) and/or share a file system, NFS should be able to handle the IP being migrated between servers. I've found this tends to work better with NFS over UDP, provided you have a network that doesn't normally suffer packet loss.
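
For illustration, here is a minimal sketch of the sort of floating-IP takeover I have in mind. The addresses, interface and the ping/arping liveness check are my own placeholder choices, and in practice you would let heartbeat/Pacemaker manage this rather than a hand-rolled script:

#!/usr/bin/env python
# Minimal sketch of floating-IP failover between two NFS servers.
# The virtual IP, interface and peer address are placeholders; a real
# setup would normally use heartbeat/Pacemaker instead of this script.
import subprocess
import time

VIRTUAL_IP = "192.168.1.100/24"   # address the NFS clients mount from
INTERFACE  = "eth0"
PEER       = "192.168.1.11"       # the other (DRBD-mirrored) NFS server

def peer_alive():
    # Crude liveness check: a single ICMP ping to the peer.
    return subprocess.call(["ping", "-c", "1", "-W", "2", PEER]) == 0

def take_over_ip():
    # Bring the virtual IP up locally and announce it via gratuitous ARP,
    # so clients re-learn the MAC. With NFS over UDP the clients just
    # retry against whichever server now holds the IP.
    subprocess.call(["ip", "addr", "add", VIRTUAL_IP, "dev", INTERFACE])
    subprocess.call(["arping", "-U", "-c", "3", "-I", INTERFACE,
                     VIRTUAL_IP.split("/")[0]])

if __name__ == "__main__":
    while True:
        if not peer_alive():
            take_over_ip()
            break
        time.sleep(5)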

(GFS is also not compatible with the kernel patchset I need to use.)

How do you mean? GFS1 has been in the vanilla kernel for a while.

I have tried AFR on the server side and the client side. Both display similar issues.

An older version of GlusterFS - as buggy as it is for me - is unfortunately still the best option.

Out of interest, what was the last version of Gluster you deemed completely stable?

(That doesn't mean I can't complain about the lack of progress towards stability and reliability, though :)

Heh - and would you believe I just rebooted one of my root-on-glusterfs nodes and it came up OK, without the manual-intervention bail-out caused by the bug where the first access after mounting fails before things have settled.

One of the problems is that some tests in this case are impossible to
carry out without having multiple nodes up and running, as a number of
bugs have been arising in cases where nodes join/leave or cause race
conditions. It would require a distributed test harness, which would be
difficult to implement in a way that runs on any client that builds the
binaries. Just because the test harness doesn't ship with the sources
doesn't mean it doesn't exist on a test rig the developers use.
Okay, so what about the volume of test cases that can be tested without a distributed test harness? I don't see any sign of testing mechanisms for that.

That point is hard to argue against. :)
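
Even a basic single-node smoke test would be a start. Something along these lines, say - the mount point is just a placeholder, and this is obviously a sketch rather than a real test suite:

#!/usr/bin/env python
# Trivial single-node smoke test: write a file through the GlusterFS
# mount, read it back and compare. The mount point is a placeholder.
import os

MOUNT_POINT = "/mnt/glusterfs"
TEST_FILE = os.path.join(MOUNT_POINT, "smoke_test.tmp")
PAYLOAD = os.urandom(1024 * 1024)   # 1 MiB of random data

def run():
    with open(TEST_FILE, "wb") as f:
        f.write(PAYLOAD)
        f.flush()
        os.fsync(f.fileno())
    with open(TEST_FILE, "rb") as f:
        read_back = f.read()
    os.unlink(TEST_FILE)
    assert read_back == PAYLOAD, "data read back does not match what was written"

if __name__ == "__main__":
    run()
    print("single-node write/read-back test passed")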

And wouldn't it be prudent anyway - given how often the GlusterFS devs do not have access to the platform with the reported problem - to provide this harness so that people can generate the appropriate test results the devs need for themselves? (Giving a complete stranger from overseas root access is a legal minefield to those who have to work with data held in confidence.)

Indeed. And shifting test-case VM images tends to be impractical (even though I have provided both to the gluster developers in the past for specific error-case analysis).

It's been my impression, though, that the relevant bugs are not heisenbugs or race conditions.

I don't agree on that particular point, since the last outstanding bug I'm seeing with any significant frequency in my use case is the one where you have to wait a few seconds for the FS to settle after mounting before doing anything, or the operation fails. And to top it off, I've just had it succeed without the wait. That seems quite heisenbuggy/racey to me. :)
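
The workaround I've ended up with is simply to retry the first access for a few seconds after mounting, roughly along these lines (the mount point and timeout are my own choices, nothing official):

#!/usr/bin/env python
# Workaround sketch for the first-access-after-mount failure: keep
# retrying a directory listing on the mount point until the FS has
# settled. Mount point and timeout are placeholder choices.
import os
import time

MOUNT_POINT = "/mnt/glusterfs"
SETTLE_TIMEOUT = 10   # seconds

def wait_for_settle():
    deadline = time.time() + SETTLE_TIMEOUT
    while time.time() < deadline:
        try:
            os.listdir(MOUNT_POINT)   # the first access that tends to fail
            return True
        except OSError:
            time.sleep(0.5)
    return False

if __name__ == "__main__":
    if not wait_for_settle():
        raise SystemExit("mount did not settle within %d seconds" % SETTLE_TIMEOUT)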

(I'm judging that on the speed of the follow-up patch, by the way - race conditions can notoriously take a long time to track down.)

That doesn't help - the first-access-settle-time bug has been around for a very long time. ;)

Gordan



