climateprediction.net (CPDN) home page
Thread 'TASK LOOPING?'

Thread 'TASK LOOPING?'

Message boards : Number crunching : TASK LOOPING?
Message board moderation

To post messages, you must log in.

AuthorMessage
ProfileJIM

Send message
Joined: 31 Dec 07
Posts: 1152
Credit: 22,363,583
RAC: 5,022
Message 53184 - Posted: 1 Jan 2016, 13:21:19 UTC

I�m wondering what happened to task Wah2_eu25_e0hl_194012_12_010200909_0. It seems to have regressed to 2.7% from more than 90% complete. It has been running 24/7 since Dec. 18 and according to the results page has sent 11 trickles. Elapsed time is shown as 366 hours. Did it loop back to the beginning and start over? Should I abort?

ID: 53184 · Report as offensive     Reply Quote
WB8ILI

Send message
Joined: 1 Sep 04
Posts: 161
Credit: 81,522,141
RAC: 1,164
Message 53185 - Posted: 1 Jan 2016, 15:51:03 UTC
Last modified: 1 Jan 2016, 15:51:46 UTC

If you look at the elapsed time and time remaining what is YOUR calculation of the %done (Progress)?

Is the time remaining decreasing about 1 second every second? If so, and there is a large amount of time remaining, it might have regressed to an earlier time.

Did you re-boot your PC or shutdown the BOINC Manager? If so it might have regressed to an earlier time.

Look at the stderr.txt file in the appropriate slot folder (0,1,2, etc.) for a clue.
ID: 53185 · Report as offensive     Reply Quote
MyLittleBoinc

Send message
Joined: 31 Mar 13
Posts: 44
Credit: 6,950,896
RAC: 0
Message 53186 - Posted: 1 Jan 2016, 21:16:13 UTC

ID: 53186 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 53187 - Posted: 1 Jan 2016, 21:37:45 UTC - in response to Message 53184.  

Do those models have graphics?
If so, you can see what year/month it's up to.

If it IS back at the start, and appears to be running through the whole model again, then best to abort.
The reason being, you'll get NO credits for any trickles (based on Timesteps), that have already been reported, and it may well loop again when it gets to between the 11th & 12th zip/trickle.

ID: 53187 · Report as offensive     Reply Quote
ProfileJIM

Send message
Joined: 31 Dec 07
Posts: 1152
Credit: 22,363,583
RAC: 5,022
Message 53188 - Posted: 2 Jan 2016, 17:04:57 UTC

Thanks for the advice. I have aborted the task. Weather@home tasks don�t have any graphic so it is not see what the model date is. The time remaining estimate had gone back to 1335 hours!

I don�t make routine daily backups anymore like I did years ago on single core machines. What is really needed is a practical way to restore one model from a backup without turning back all the other tasks running on other cores. Maybe I should post that on the Wish List.

ID: 53188 · Report as offensive     Reply Quote
ProfileastroWX
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1496
Credit: 95,522,203
RAC: 0
Message 53189 - Posted: 2 Jan 2016, 21:39:01 UTC - in response to Message 53188.  

Such a procedure exists but is not easy nor is it for the faint of heart.

I used it a few times but quit after deciding it wasn't worth the trouble; that was before tasks were broken into pieces -- and long before advent of these shorter regional tasks.

"We have met the enemy and he is us." -- Pogo
Greetings from coastal Washington state, the scenic US Pacific Northwest.
ID: 53189 · Report as offensive     Reply Quote
ProfileJIM

Send message
Joined: 31 Dec 07
Posts: 1152
Credit: 22,363,583
RAC: 5,022
Message 53190 - Posted: 3 Jan 2016, 5:38:35 UTC - in response to Message 53189.  

I Know what you mean. I remember running the 160 year models on a single core 1.2 GHz processor with 256 MB�s or RAM. It took about 8 months to finish one model.

ID: 53190 · Report as offensive     Reply Quote

Message boards : Number crunching : TASK LOOPING?

©2024 cpdn.org