Message boards : Number crunching : TASK LOOPING?
Message board moderation
Author | Message |
---|---|
Send message Joined: 31 Dec 07 Posts: 1152 Credit: 22,363,583 RAC: 5,022 |
I�m wondering what happened to task Wah2_eu25_e0hl_194012_12_010200909_0. It seems to have regressed to 2.7% from more than 90% complete. It has been running 24/7 since Dec. 18 and according to the results page has sent 11 trickles. Elapsed time is shown as 366 hours. Did it loop back to the beginning and start over? Should I abort? |
Send message Joined: 1 Sep 04 Posts: 161 Credit: 81,522,141 RAC: 1,164 |
If you look at the elapsed time and time remaining what is YOUR calculation of the %done (Progress)? Is the time remaining decreasing about 1 second every second? If so, and there is a large amount of time remaining, it might have regressed to an earlier time. Did you re-boot your PC or shutdown the BOINC Manager? If so it might have regressed to an earlier time. Look at the stderr.txt file in the appropriate slot folder (0,1,2, etc.) for a clue. |
Send message Joined: 31 Mar 13 Posts: 44 Credit: 6,950,896 RAC: 0 |
|
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Do those models have graphics? If so, you can see what year/month it's up to. If it IS back at the start, and appears to be running through the whole model again, then best to abort. The reason being, you'll get NO credits for any trickles (based on Timesteps), that have already been reported, and it may well loop again when it gets to between the 11th & 12th zip/trickle. |
Send message Joined: 31 Dec 07 Posts: 1152 Credit: 22,363,583 RAC: 5,022 |
Thanks for the advice. I have aborted the task. Weather@home tasks don�t have any graphic so it is not see what the model date is. The time remaining estimate had gone back to 1335 hours! I don�t make routine daily backups anymore like I did years ago on single core machines. What is really needed is a practical way to restore one model from a backup without turning back all the other tasks running on other cores. Maybe I should post that on the Wish List. |
Send message Joined: 5 Aug 04 Posts: 1496 Credit: 95,522,203 RAC: 0 |
Such a procedure exists but is not easy nor is it for the faint of heart. I used it a few times but quit after deciding it wasn't worth the trouble; that was before tasks were broken into pieces -- and long before advent of these shorter regional tasks. "We have met the enemy and he is us." -- Pogo Greetings from coastal Washington state, the scenic US Pacific Northwest. |
Send message Joined: 31 Dec 07 Posts: 1152 Credit: 22,363,583 RAC: 5,022 |
I Know what you mean. I remember running the 160 year models on a single core 1.2 GHz processor with 256 MB�s or RAM. It took about 8 months to finish one model. |
©2024 cpdn.org