Re: [Gluster-devel] NFS reexport works, still stat-prefetch issues, -s problem


From: Brent A Nelson
Subject: Re: [Gluster-devel] NFS reexport works, still stat-prefetch issues, -s problem
Date: Thu, 10 May 2007 19:36:24 -0400 (EDT)

It looks like the glusterfs crash in the slow NFS-client case may be caused by read-ahead.

I was able to get this backtrace:
Program terminated with signal 11, Segmentation fault.
#0  0xb756246d in ra_frame_return ()
   from /usr/lib/glusterfs/1.3.0-pre3/xlator/performance/read-ahead.so
(gdb) bt
#0  0xb756246d in ra_frame_return ()
   from /usr/lib/glusterfs/1.3.0-pre3/xlator/performance/read-ahead.so
#1  0xb7562587 in ra_page_error ()
   from /usr/lib/glusterfs/1.3.0-pre3/xlator/performance/read-ahead.so
#2  0xb7562cf0 in ?? ()
   from /usr/lib/glusterfs/1.3.0-pre3/xlator/performance/read-ahead.so
#3  0x12b66f20 in ?? ()
#4  0xffffffff in ?? ()
#5  0x0000004d in ?? ()
#6  0x00000020 in ?? ()
#7  0x00000000 in ?? ()

Removing read-ahead from my config, I was able to do my 10GB file copy without a crash. A bonus was that the copy was much faster without read-ahead (3.7 MB/s vs. 2.2 MB/s), although I suspect that speedup is why the copy actually completed successfully.
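
For context, the read-ahead layer I dropped is just one translator section in the client spec; in 1.3-era syntax it looks roughly like the fragment below (the volume names, the option values, and its exact position in the stack are illustrative, not copied from my actual spec):

# illustrative read-ahead section for a 1.3-era client spec (example values only)
volume readahead
  type performance/read-ahead
  option page-size 65536        # size of each read-ahead block, example value
  option page-count 16          # blocks kept per file, example value
  subvolumes writebehind        # shown here above write-behind; real ordering may differ
end-volume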

Even without read-ahead, I still get a very large glusterfs process, so it appears that read-ahead is not the memory leak culprit.

If I also remove write-behind (making the copy horribly slow), my copy still fails eventually, but glusterfs doesn't crash and the filesystem is still available. Errors logged:

[May 10 18:14:18] [ERROR/common-utils.c:55/full_rw()] libglusterfs:full_rw: 0 bytes r/w instead of 113 (errno=115)
[May 10 18:14:18] [CRITICAL/tcp.c:81/tcp_disconnect()] transport/tcp:share4-1: connection to server disconnected
[May 10 18:14:18] [CRITICAL/client-protocol.c:218/call_bail()] client/protocol:bailing transport
[May 10 18:14:18] [ERROR/common-utils.c:55/full_rw()] libglusterfs:full_rw: 0 bytes r/w instead of 113 (errno=9)
[May 10 18:14:18] [CRITICAL/tcp.c:81/tcp_disconnect()] transport/tcp:share4-0: connection to server disconnected
[May 10 18:14:18] [ERROR/client-protocol.c:204/client_protocol_xfer()] protocol/client:transport_submit failed
[May 10 18:14:18] [ERROR/client-protocol.c:204/client_protocol_xfer()] protocol/client:transport_submit failed
[May 10 18:14:19] [CRITICAL/client-protocol.c:218/call_bail()] client/protocol:bailing transport
[May 10 18:14:19] [ERROR/common-utils.c:55/full_rw()] libglusterfs:full_rw: 0 bytes r/w instead of 113 (errno=115)
[May 10 18:14:19] [CRITICAL/tcp.c:81/tcp_disconnect()] transport/tcp:share4-0: connection to server disconnected
[May 10 18:14:19] [ERROR/client-protocol.c:204/client_protocol_xfer()] protocol/client:transport_submit failed

I've seen the "0 bytes r/w instead of 113" message plenty of times in the past (with older GlusterFS versions), although it was apparently harmless before. It looks like the code now treats this as a disconnection and tries to reconnect; for some reason, even when it does manage to reconnect, the result is still an I/O error. I wonder if this relates to a previous issue I mentioned with real disconnects (a node dies or glusterfsd is restarted), where the first access after a failure (at least for ls or df) results in an error but the next attempt succeeds. It seems like an issue with the reconnection logic (and some sort of glitch masquerading as a disconnect in the first place). This is probably the real problem triggering the read-ahead crash; in other words, the read-ahead crash would not be triggered in my test case if it weren't for this issue.

Finally, glusterfs still grows even in this case, so that would leave afr, unify, protocol/client, or glusterfs itself as possible leakers.

Thanks,

Brent


On Thu, 10 May 2007, Brent A Nelson wrote:

The -s issue was completely eliminated with the recent patch.

GlusterFS is looking quite solid now, but I can still kill it with an NFS reexport to a slow client (100Mbps, while the servers and reexport node are 1000Mbps) and a 10GB file copy via NFS from the GlusterFS filesystem to the GlusterFS filesystem.

The glusterfs process slowly consumes more and more memory (many tens of MB to several hundred MB) and eventually dies sometime before the copy completes (well before it would run out of memory, however). The copy does work for quite a while before glusterfs suddenly dies. See the attached -LDEBUG output from the glusterfs process.

The glusterfs client is using client, afr, unify, read-ahead, and write-behind (with aggregation of 0). glusterfsd runs with server, storage/posix, and posix locks (although nothing in my test should invoke locking). The glusterfsd processes survive the test just fine and don't require a restart.
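
In case the exact layering matters for debugging, the two specs are along the lines of the sketch below; the hostnames, directories, option values, and the ordering of the two performance translators are placeholders from memory rather than exact copies of my files:

# client spec sketch (placeholders, not my exact file)
volume share4-0
  type protocol/client
  option transport-type tcp/client
  option remote-host server-a          # placeholder hostname
  option remote-subvolume brick
end-volume

volume share4-1
  type protocol/client
  option transport-type tcp/client
  option remote-host server-b          # placeholder hostname
  option remote-subvolume brick
end-volume

volume afr0
  type cluster/afr
  option replicate *:2                 # mirror every file across both bricks
  subvolumes share4-0 share4-1
end-volume

volume unify0
  type cluster/unify
  option scheduler rr                  # scheduler choice is a placeholder
  subvolumes afr0                      # the real setup unifies more afr pairs
end-volume

volume writebehind
  type performance/write-behind
  option aggregate-size 0              # the "aggregation of 0" mentioned above
  subvolumes unify0
end-volume

volume readahead
  type performance/read-ahead
  subvolumes writebehind               # shown on top here; ordering may differ
end-volume

# server spec sketch (placeholders, not my exact file)
volume posix
  type storage/posix
  option directory /export/share4      # placeholder export path
end-volume

volume brick
  type features/posix-locks            # present but unused by this test
  subvolumes posix
end-volume

volume server
  type protocol/server
  option transport-type tcp/server
  option auth.ip.brick.allow *         # wide-open auth, placeholder for testing
  subvolumes brick
end-volume

The server exports the posix-locks volume, which is why the clients point remote-subvolume at it.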

Thanks,

Brent

On Tue, 8 May 2007, Anand Avati wrote:

does the log say "connection on <socket> still in progress - try
later" when run with -LDEBUG?

avati

2007/5/8, Brent A Nelson <address@hidden>:
On Sun, 6 May 2007, Anand Avati wrote:

>> 3) When doing glusterfs -s to a different machine to retrieve the spec
>> file, it now fails.  A glusterfs -s to the local machine succeeds.  It
>> looks like a small buglet was introduced in the -s support.
>
> this is fixed now, it was an unrelated change triggered by the new way -s
> works.
>

Hmm, my -s issue still seems to be there: a client can only retrieve its
spec file from a local glusterfsd. Was the -s fix applied to the tla
repository?

address@hidden:~# glusterfs -s jupiter01 /backup
glusterfs: could not open specfile
address@hidden:~# glusterfs -s jupiter02 /backup
address@hidden:~#

The reverse on jupiter01 behaves the same way (can retrieve from itself,
not from jupiter02).
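
For reference, the spec file being fetched here should be whatever the remote glusterfsd serves via protocol/server's client-volume-filename option (if I'm remembering the option name right); mine is set along the lines of this placeholder fragment:

volume server
  type protocol/server
  option transport-type tcp/server
  # placeholder path; this is the spec handed out to "glusterfs -s <host>"
  option client-volume-filename /etc/glusterfs/glusterfs-client.vol
  option auth.ip.brick.allow *
  subvolumes brick
end-volume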

The big glitch that I thought might be related (client could only mount a
GlusterFS if it was also a server of that GlusterFS) WAS fixed after a
tla update and recompile following your email, however.

Thanks,

Brent



--
Anand V. Avati




