climateprediction.net (CPDN) home page
Thread 'No Progress being saved?'

EclipseHA

Joined: 28 Aug 04
Posts: 42
Credit: 1,443,857
RAC: 0
Message 6860 - Posted: 11 Dec 2004, 15:17:48 UTC
Last modified: 11 Dec 2004, 15:47:50 UTC

About 2-3 hours ago, I completed a WU and downloaded a new one.

The new one is now at Ph1 TS 2650, but an interesting thing is occurring.

Both boincview and a home-grown app that reads client_state.xml directly are showing no progress and no CPU time!

The other machines I have running CP are running fine (including the boincview info), and this machine (Red Hat 9, BOINC 4.13, hadsm3 4.04) has worked fine for the last few CP WUs.

Any ideas? To me it looks like it's not checkpointing for some reason.

Update: I noticed that the SETI WU on the same machine showed 2:30:00 of CPU time but only 0.35% completion (the WU should take about 4h to complete on this machine).

I rebooted, and CP started about where it left off, and details in client_state.xml are being updated correctly...
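
For anyone who wants to run the same kind of check, here is a minimal sketch of a script that reads client_state.xml directly and flags a task whose reported progress lags far behind its CPU time. The element names (active_task, result_name, fraction_done, current_cpu_time, checkpoint_cpu_time) are assumptions about the file layout and may differ between BOINC versions; the sample data and the 50% tolerance are hypothetical and only mirror the SETI numbers above (2:30:00 of CPU time, 0.35% done, about 4h expected).

```python
import xml.etree.ElementTree as ET

# Hypothetical sample mirroring the SETI numbers from the post.
# Element names are assumed, not taken from a real client_state.xml.
SAMPLE = """
<client_state>
  <active_task_set>
    <active_task>
      <result_name>seti_example_wu_0</result_name>
      <checkpoint_cpu_time>0.0</checkpoint_cpu_time>
      <fraction_done>0.0035</fraction_done>
      <current_cpu_time>9000.0</current_cpu_time>
    </active_task>
  </active_task_set>
</client_state>
"""


def _num(elem, tag):
    """Read a child element as a float; missing or empty values become 0.0."""
    text = (elem.findtext(tag) or "").strip()
    return float(text) if text else 0.0


def read_active_tasks(xml_text):
    """Return one dict per <active_task> element found in the XML text."""
    root = ET.fromstring(xml_text)
    tasks = []
    for task in root.iter("active_task"):
        tasks.append({
            "result_name": task.findtext("result_name", default="?"),
            "fraction_done": _num(task, "fraction_done"),
            "cpu_time": _num(task, "current_cpu_time"),
            "checkpoint_cpu_time": _num(task, "checkpoint_cpu_time"),
        })
    return tasks


def looks_stalled(task, expected_total_cpu_s, tolerance=0.5):
    """True if fraction_done is below `tolerance` times the fraction
    that the CPU time alone would suggest."""
    expected_fraction = min(task["cpu_time"] / expected_total_cpu_s, 1.0)
    return task["fraction_done"] < expected_fraction * tolerance


if __name__ == "__main__":
    for t in read_active_tasks(SAMPLE):
        # 4 hours is the expected run time quoted in the post.
        status = "STALLED?" if looks_stalled(t, 4 * 3600) else "ok"
        print(t["result_name"], t["fraction_done"], t["cpu_time"], status)
```

To check a live machine, read the real file with open("client_state.xml").read() in place of SAMPLE. A checkpoint_cpu_time that stays at zero while the CPU time keeps growing would point the same way as the missing progress.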
Les Bayliss
Volunteer moderator

Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 6875 - Posted: 12 Dec 2004, 1:17:08 UTC

This may have nothing to do with your problem, but:

I'm on dial-up, and I keep 'disable BOINC network access' ticked because of
the bug in BOINC. When I allow a trickle or, as recently, the upload of a
completed WU, and then forget to disable network access again, BOINC logs on
at the next trickle time, trickles, and logs off. Then everything stops, even
though it still shows "running", and I don't find out until a few hours later.
(mumble, mutter).

The only cure is a reboot, after saving the 4 std files.

Les

Backups: Here
