Re: [Gluster-devel] [RFC] A new caching/synchronization mechanism to sp

gluster-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Gluster-devel] [RFC] A new caching/synchronization mechanism to sp

From:	Xavier Hernandez
Subject:	Re: [Gluster-devel] [RFC] A new caching/synchronization mechanism to speed up gluster
Date:	Wed, 05 Feb 2014 20:27:34 +0100
User-agent:	Roundcube Webmail/0.7.1

On 04.02.2014 17:18, Jeff Darcy wrote:

The only synchronization point needed is to make sure that allbricksagree on the inode state and which client owns it. This can beachievedwithout locking using a method similar to what I implemented in theDFCtranslator. Besides the lock-less architecture, the main advantageis
that much more aggressive caching strategies can be implemented very
near to the final user, increasing considerably the throughput ofthefile system. Special care has to be taken with things than can failonbackground writes (basically brick space and user access rights).Thoseshould be handled appropiately on the client side to guaranteefuture
success of writes. Of course this is only a high level overview. A
deeper analysis should be done to see what to do on each specialcase.
What do you think ?
I think this is a great idea for where we can go - and need to go -in the
long term. However, it's important to recognize that it *is* the long
term. We had to solve almost exactly the same problems in MPFS longago.Whether the synchronization uses locks or not *locally* ismeaningless,
because all of the difficult problems have to do with recovering the
*distributed* state. What happens when a brick fails while holding an
inode in any state but I? How do we recognize it, what do we do aboutit,how do we handle the case where it comes back and needs to re-acquireitsprevious state? How do we make sure that a brick can successfullyflush
everything it needs to before it yields a lock/lease/whatever? That's
going to require some kind of flow control, which is itself a prettybigproject. It's not impossible, but it took multiple people some yearsfor
MPFS, and ditto for every other project (e.g. Ceph or XtreemFS) which
adopted similar approaches. GlusterFS's historical avoidance of this
complexity certainly has some drawbacks, but it has also been key tous
making far more progress in other areas.

Well, it's true that there will be a lot of tricky cases that will need

to be handled to be sure that data integrity and system responsivenessis

guaranteed, however I think that they are not more difficult than what
can happen currently if a client dies or loses communication while it
holds a lock on a file.

Anyway I think there is a great potential with this mechanism becauseit

can allow the implementation of powefull caches, even based on SSD that
could improve the performance a lot.

Of course there is a lot of work solving all potential failures and
designing the right thing. An important consideration is that all
these methods try to solve a problem that is seldom found (i.e. having
more than one client modifying the same file at the same time). So a
solution that has almost 0 overhead for the normal case and allows the
implementation of aggressive caching mechanisms seems a big win.

To move forward on this, I think we need a *much* more detailed ideaof
how we're going to handle the nasty cases. Would some sort of online
collaboration - e.g. Hangouts - make more sense than continuing via
email?

Of course, we can talk on irc or another place if you prefer

Xavi

[Prev in Thread]

Current Thread

[Next in Thread]

[Gluster-devel] [RFC] A new caching/synchronization mechanism to speed up gluster, Xavier Hernandez, 2014/02/04
- Re: [Gluster-devel] [RFC] A new caching/synchronization mechanism to speed up gluster, Jeff Darcy, 2014/02/04
  - Re: [Gluster-devel] [RFC] A new caching/synchronization mechanism to speed up gluster, Xavier Hernandez <=
    - Re: [Gluster-devel] [RFC] A new caching/synchronization mechanism to speed up gluster, Anand Avati, 2014/02/05
    - Re: [Gluster-devel] [RFC] A new caching/synchronization mechanism to speed up gluster, Ira Cooper, 2014/02/05
    - Re: [Gluster-devel] [RFC] A new caching/synchronization mechanism to speed up gluster, Vijay Bellur, 2014/02/06
    - Re: [Gluster-devel] [RFC] A new caching/synchronization mechanism to speed up gluster, Xavier Hernandez, 2014/02/06
    - Re: [Gluster-devel] [RFC] A new caching/synchronization mechanism to speed up gluster, Xavier Hernandez, 2014/02/06
    - Re: [Gluster-devel] [RFC] A new caching/synchronization mechanism to speed up gluster, Xavier Hernandez, 2014/02/10
- Re: [Gluster-devel] [RFC] A new caching/synchronization mechanism to speed up gluster, Niels de Vos, 2014/02/10
  - Re: [Gluster-devel] [RFC] A new caching/synchronization mechanism to speed up gluster, Xavier Hernandez, 2014/02/10

Prev by Date: Re: [Gluster-devel] Agenda for Community meeting today
Next by Date: Re: [Gluster-devel] [RFC] A new caching/synchronization mechanism to speed up gluster
Previous by thread: Re: [Gluster-devel] [RFC] A new caching/synchronization mechanism to speed up gluster
Next by thread: Re: [Gluster-devel] [RFC] A new caching/synchronization mechanism to speed up gluster
Index(es):
- Date
- Thread