Message boards : Number crunching : Progress % restarting from 0 on linux.
Message board moderation
Author | Message |
---|---|
Send message Joined: 28 Dec 09 Posts: 2 Credit: 189,763 RAC: 0 |
Hi to everyone I've been trying to search an answer for my question on this board, but i didn't find it. Sorry if it was already posted. I'm running an hadm3p task on my laptop; task no. 14376140, OS linux mint 10. After some day the progress % resetted to 0, it was around 75%, don't show me any error, and it's keeping the spent hours, is just about the progress. http://imageshack.us/f/812/screenshotqto.png/ I'd like to know what's going on, if the whole project restarted because of some backup failure or what else. I've been reading about experiencing some trouble when the project is backing up, but i should find some error on my log, right? Thanks in advance! gian82 [BOINC.Italy] |
Send message Joined: 5 Aug 04 Posts: 1496 Credit: 95,522,203 RAC: 0 |
No specific answer because the server is non-responsive when trying to retrieve details. 14376140 8015569 7 Apr 2012 10:53:42 UTC 20 Mar 2013 16:13:42 UTC In progress --- --- 1,790.21 1,790.21 UK Met Office HADAM3P European Region v6.09 It's curious that credit was awarded with no CPU time shown, while showing "In progress." Perhaps we'll get a look at "stderr" later. "We have met the enemy and he is us." -- Pogo Greetings from coastal Washington state, the scenic US Pacific Northwest. |
Send message Joined: 28 Dec 09 Posts: 2 Credit: 189,763 RAC: 0 |
39 hours running since the % resetting, and there isn't any output. Should I abort it? Latest Trickles Received Time Sent (UTC) Host ID Result ID Result Name Phase Timestep CPU Time (sec) Average (sec/TS) 26 Apr 2012 23:40:21 1204156 14376140 hadam3p_eu_9zk3_1981_1_007860457_1 1 103,776 588,958 5.6753 26 Apr 2012 00:19:42 1204156 14376140 hadam3p_eu_9zk3_1981_1_007860457_1 1 92,256 513,756 5.5688 25 Apr 2012 01:23:54 1204156 14376140 hadam3p_eu_9zk3_1981_1_007860457_1 1 80,736 447,450 5.5421 24 Apr 2012 05:26:46 1204156 14376140 hadam3p_eu_9zk3_1981_1_007860457_1 1 69,216 381,368 5.5098 23 Apr 2012 09:49:19 1204156 14376140 hadam3p_eu_9zk3_1981_1_007860457_1 1 57,696 315,055 5.4606 22 Apr 2012 10:22:35 1204156 14376140 hadam3p_eu_9zk3_1981_1_007860457_1 1 46,176 249,414 5.4014 21 Apr 2012 14:43:00 1204156 14376140 hadam3p_eu_9zk3_1981_1_007860457_1 1 34,656 186,872 5.3922 20 Apr 2012 19:07:21 1204156 14376140 hadam3p_eu_9zk3_1981_1_007860457_1 1 23,136 124,864 5.3970 19 Apr 2012 23:36:07 1204156 14376140 hadam3p_eu_9zk3_1981_1_007860457_1 1 11,616 62,216 5.3561 |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
These models do sometimes revert to the beginning if they get interrupted at a critical point. What will most likely happen if you continue, is that the model will keep going. However, your computer isn't running them very fast, so there's 2 things that you can do: 1) Let it run, and redo all of the calculations again, with the hope that it won't restart again. 2) Abort it, and get a new one. Backups: Here |
©2024 cpdn.org