bug-glibc
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

intermittent hangs with threads (clone() bug?/linuxthreads bug?)


From: Ed Connell
Subject: intermittent hangs with threads (clone() bug?/linuxthreads bug?)
Date: Tue, 19 Jun 2001 17:12:13 -0400

Hi,

I am experiencing intermittent hangs running LinuxThreads programs from a shell 
script.  This happens with any combination of stock RH 7.1 SMP kernel (2.4.2) 
or my own 2.4.5 kernel and stock RH 7.1 libc (2.2.2), redhat rawhide libc 
(2.2.3) and my own 2.2.3 libc.

If I run, for example, linuxthreads/Examples/ex1 (one thread prints 'a', one 
prints 'b') it will run fine.  If I run it from a shell script (bash or ksh) 
with 
   exec ex1
it almost always hangs.  When I do a "ps" I see the original "ex1" process plus 
another defunct "ex1" process  with a higher pid.  This defunct process was 
supposed to be the LinuxThreads manager thread but it seems that clone() is 
silently failing to create a valid thread.  Attaching a debugger to the 
original "ex1" and putting print statements in libc/linuxthreads and the kernel 
(do_fork() and friends) all indicate things are proceeding normally.  Yet the 
manager thread/process is never scheduled to run...it is immediately defunct.

My hardware is an 8-way i686.  I tried removing CPU's but the problem remains 
until I get down to a single CPU (or boot a uniprocessor kernel).  When I got 
down to 2 CPU's I noticed that running my script from console almost always 
produced a hang while running from an xterm always worked.  Obviously a timing 
issue.  Also things run fine on other SMP hardware I have access to.

If anyone has any idea what is going on, I would love to hear it.  If you need 
more information please let me know, as well.

Thanks
Ed Connell

Healthy traceback from original "ex1" process where it is waiting for the 
manager thread to take over.
(gdb) where
#0  0x40067ff5 in __sigsuspend (set=0xbffff2e8)
    at ../sysdeps/unix/sysv/linux/sigsuspend.c:45
#1  0x4002d25f in __pthread_wait_for_restart_signal (self=0x40036300)
    at pthread.c:958
#2  0x4002cbc4 in __pthread_create_2_1 (thread=0xbffff454, attr=0x0, 
    start_routine=0x8048580 <process>, arg=0x804871d) at restart.h:34
#3  0x080485e7 in main () at ex1.c:29
#4  0x400565e7 in __libc_start_main (main=0x80485cc <main>, argc=1, 
    ubp_av=0xbffff4c4, init=0x80483d0 <_init>, fini=0x80486e0 <_fini>, 
    rtld_fini=0x4000e154 <_dl_fini>, stack_end=0xbffff4bc)
    at ../sysdeps/generic/libc-start.c:129




reply via email to

[Prev in Thread] Current Thread [Next in Thread]