climateprediction.net (CPDN) home page
Thread 'repaired XP CPDN restarted.'

Thread 'repaired XP CPDN restarted.'

Message boards : Number crunching : repaired XP CPDN restarted.
Message board moderation

To post messages, you must log in.

AuthorMessage
old_user5738

Send message
Joined: 31 Aug 04
Posts: 14
Credit: 113,008
RAC: 0
Message 10538 - Posted: 8 Mar 2005, 1:11:07 UTC

I had some major system problems with my xp machine this last weekend. When all was said and done BOINC started up and ran CPDN since I was out of seti WU's. THen I noticed that I had 220+ hours into the WU and it restarted the WU from the beginning. Did CPDN miss something? did I get a data file corrupted? I also noticed that I am not receiving credit for the reworked portion of the WU which is ok since it would be like getting double Pts for the WU. Is there a file to change to make this WU see where it is supposed to be at?

The littlewhitedog your friends on the net and in space.
Talk to the LWD at Littleblackdog
ID: 10538 · Report as offensive     Reply Quote
ProfileastroWX
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1496
Credit: 95,522,203
RAC: 0
Message 10539 - Posted: 8 Mar 2005, 1:47:05 UTC

Without knowing the specifics it is imposible to know exactly what happened. However, when boinc/CPDN is knocked-down other than the usual Suspend/wait/Exit routine, one or more of the many open files can be out of sync with the others. I suspect that is what happened during your system (HD?) problem.

If that is the case, CPDN start-up finds things not as they should be and the whale turns belly-up.

Did it re-start with the same W/U (1uh4_...) or a new one?

"We have met the enemy and he is us." -- Pogo
Greetings from coastal Washington state, the scenic US Pacific Northwest.
ID: 10539 · Report as offensive     Reply Quote
old_user5738

Send message
Joined: 31 Aug 04
Posts: 14
Credit: 113,008
RAC: 0
Message 10542 - Posted: 8 Mar 2005, 4:32:37 UTC

yes I restarted with the same WU. It showed that I had completed about 250+ hours of work but had 0.00% completed. currently it has wworked its way back up to around 5%. When it restarted it showed a completion time of aroun 20,000 hours. I attributed this to me having 0% done and 250 hours of time worked. it has come down greatly since that time. I can onnly assume that some file associated with the trickles or the WU were altered showing that I had not completed any work.

The littlewhitedog your friends on the net and in space.
Talk to the LWD at Littleblackdog
ID: 10542 · Report as offensive     Reply Quote
old_user2147

Send message
Joined: 27 Aug 04
Posts: 55
Credit: 1,106,201
RAC: 0
Message 10547 - Posted: 8 Mar 2005, 6:44:42 UTC - in response to Message 10542.  

> yes I restarted with the same WU. It showed that I had completed about 250+
> hours of work but had 0.00% completed. currently it has wworked its way back
> up to around 5%. When it restarted it showed a completion time of aroun
> 20,000 hours. I attributed this to me having 0% done and 250 hours of time
> worked. it has come down greatly since that time. I can onnly assume that
> some file associated with the trickles or the WU were altered showing that I
> had not completed any work.
>

FWIW -

When you hit the point in the model where the crash occured, you will once again start receiving additional credit for your trickles (i.e., ~250+ machine hours into the model).

Strat
ID: 10547 · Report as offensive     Reply Quote
old_user5738

Send message
Joined: 31 Aug 04
Posts: 14
Credit: 113,008
RAC: 0
Message 10566 - Posted: 8 Mar 2005, 12:43:31 UTC

thats what I assumed. Seems a shame that all that data got lost

The littlewhitedog your friends on the net and in space.
Talk to the LWD at Littleblackdog
ID: 10566 · Report as offensive     Reply Quote
crandles
Volunteer moderator

Send message
Joined: 16 Oct 04
Posts: 692
Credit: 277,679
RAC: 0
Message 10567 - Posted: 8 Mar 2005, 12:53:33 UTC

>thats what I assumed. Seems a shame that all that data got lost

Do you have a backup of the boinc folder from before CP went back to beginning?
If so, copying this back could save crunching the same thing again.
Visit BOINC WIKI for help

And join BOINC Synergy for all the news in one place.
ID: 10567 · Report as offensive     Reply Quote
old_user5738

Send message
Joined: 31 Aug 04
Posts: 14
Credit: 113,008
RAC: 0
Message 10591 - Posted: 8 Mar 2005, 23:50:30 UTC

unfortuntely saving the CPDN data files weren't top on my list as the system started to crash

The littlewhitedog your friends on the net and in space.
Talk to the LWD at Littleblackdog
ID: 10591 · Report as offensive     Reply Quote

Message boards : Number crunching : repaired XP CPDN restarted.

©2024 cpdn.org