climateprediction.net (CPDN) home page
Thread 'some zombiprocesses ...whats going wrong?'

Thread 'some zombiprocesses ...whats going wrong?'

Questions and Answers : Unix/Linux : some zombiprocesses ...whats going wrong?
Message board moderation

To post messages, you must log in.

AuthorMessage
old_user18015

Send message
Joined: 14 Sep 04
Posts: 3
Credit: 21,040
RAC: 0
Message 4537 - Posted: 23 Sep 2004, 16:05:18 UTC

Hi,

I\'m new to cpdn and see here somme zombi\'s with \'cp\'.
Any hints?

~/boinc> top

22150 benn 27 2 54684 45m 7084 R 83.6 4.5 1142:47 hadsm3um_4.04_i

22148 benn 16 0 2696 1540 2280 S 0.0 0.1 0:00.24 boinc

22901 benn 27 2 0 0 0 Z 0.0 0.0 0:00.10 cp
23368 benn 27 2 0 0 0 Z 0.0 0.0 0:00.11 cp
23406 benn 27 2 0 0 0 Z 0.0 0.0 0:00.11 cp
23580 benn 27 2 0 0 0 Z 0.0 0.0 0:00.11 cp

~/boinc> pstree

init─┬─boinc───hadsm3_4.04_i68───hadsm3um_4.04_i───7*[cp]


Benn
ID: 4537 · Report as offensive     Reply Quote
old_user1
Avatar

Send message
Joined: 5 Aug 04
Posts: 907
Credit: 299,864
RAC: 0
Message 4538 - Posted: 23 Sep 2004, 16:10:35 UTC

I guess it's crashing before cpdn gets to do a "wait" on the process and remove it?
ID: 4538 · Report as offensive     Reply Quote
old_user18015

Send message
Joined: 14 Sep 04
Posts: 3
Credit: 21,040
RAC: 0
Message 4544 - Posted: 23 Sep 2004, 16:50:39 UTC - in response to Message 4538.  

> I guess it's crashing before cpdn gets to do a "wait" on the process and
> remove it?

Yes, but whats a 'normal run'? What should be copyed?
I'm seeing here a growing up directory.

~/boinc/projects/climateprediction.net/2jb9_100139539/dataout> l


drwxr-xr-x 2 benn users 2696 2004-09-23 18:23 ./
drwxr-xr-x 7 benn users 272 2004-09-15 00:48 ../
-rw-r--r-- 1 benn users 6999424 2004-09-19 03:08 2jb9aa.da12c40
-rw-r--r-- 1 benn users 6999424 2004-09-19 05:54 2jb9aa.da12cs0
-rw-r--r-- 1 benn users 6999424 2004-09-19 06:15 2jb9aa.da13110
-rw-r--r-- 1 benn users 6999424 2004-09-19 10:51 2jb9aa.da132a0
-rw-r--r-- 1 benn users 6999424 2004-09-19 15:50 2jb9aa.da133g0
-rw-r--r-- 1 benn users 6999424 2004-09-19 19:09 2jb9aa.da134a0
-rw-r--r-- 1 benn users 6999424 2004-09-19 19:32 2jb9aa.da134d0
-rw-r--r-- 1 benn users 6999424 2004-09-19 22:18 2jb9aa.da13510
-rw-r--r-- 1 benn users 6999424 2004-09-20 01:29 2jb9aa.da135p0
-rw-r--r-- 1 benn users 6999424 2004-09-20 09:55 2jb9aa.da13870
-rw-r--r-- 1 benn users 6999424 2004-09-20 11:41 2jb9aa.da138m0
-rw-r--r-- 1 benn users 6999424 2004-09-20 14:15 2jb9aa.da139a0
-rw-r--r-- 1 benn users 6999424 2004-09-20 19:32 2jb9aa.da13ad0
-rw-r--r-- 1 benn users 6999424 2004-09-21 00:57 2jb9aa.da13bj0
.....

Can i delete all the files?

The Tricklets are counted well in my cpdn-account.

I've no problem with the zombi's, but i don't want provide useless work for the
cpdn-project.

with regards
Benn

ID: 4544 · Report as offensive     Reply Quote
old_user18015

Send message
Joined: 14 Sep 04
Posts: 3
Credit: 21,040
RAC: 0
Message 4763 - Posted: 28 Sep 2004, 16:27:09 UTC

Hello,

i've solved my problem with these "cp"-zombi's.

I've downgraded to the last official kernelversion of my linux-distribution and ther'e no zombi's. :-)

Distribution: SuSE 9.1 - 2.6.5-7.108-default #1 Wed Aug 25 13:34:40 UTC 2004 i686 athlon i386 GNU/Linux

boinc: boinc_4.09_i686-pc-linux-gnu
client: hadsm3_4.04_i686-pc-linux-gnu hadsm3um_4.04_i686-pc-linux-gnu

hardware: asus a7v with amd-athlon 800 + 1Gbyte Ram

Benn


ID: 4763 · Report as offensive     Reply Quote
old_user1
Avatar

Send message
Joined: 5 Aug 04
Posts: 907
Credit: 299,864
RAC: 0
Message 4848 - Posted: 30 Sep 2004, 11:06:03 UTC - in response to Message 4763.  

that's good, the only thing I can't figure out is you said it was trickling but you still had zombies? because I've only seen zombies on "chronically failed" machines, where it crashes somehow before even getting to the waitpid() to clean up the zombie. but glad to see it's working now.
ID: 4848 · Report as offensive     Reply Quote

Questions and Answers : Unix/Linux : some zombiprocesses ...whats going wrong?

©2024 cpdn.org