climateprediction.net (CPDN) home page
Thread 'Progress % restarting from 0 on linux.'

Thread 'Progress % restarting from 0 on linux.'

Message boards : Number crunching : Progress % restarting from 0 on linux.
Message board moderation

To post messages, you must log in.

AuthorMessage
gian [Puglia]

Send message
Joined: 28 Dec 09
Posts: 2
Credit: 189,763
RAC: 0
Message 44130 - Posted: 1 May 2012, 14:52:53 UTC
Last modified: 1 May 2012, 14:54:24 UTC

Hi to everyone
I've been trying to search an answer for my question on this board, but i didn't find it. Sorry if it was already posted.

I'm running an hadm3p task on my laptop; task no. 14376140, OS linux mint 10.

After some day the progress % resetted to 0, it was around 75%, don't show me any error, and it's keeping the spent hours, is just about the progress.

http://imageshack.us/f/812/screenshotqto.png/

I'd like to know what's going on, if the whole project restarted because of some backup failure or what else. I've been reading about experiencing some trouble when the project is backing up, but i should find some error on my log, right?

Thanks in advance!

gian82 [BOINC.Italy]
ID: 44130 · Report as offensive     Reply Quote
ProfileastroWX
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1496
Credit: 95,522,203
RAC: 0
Message 44132 - Posted: 1 May 2012, 17:46:09 UTC

No specific answer because the server is non-responsive when trying to retrieve details.
14376140 8015569 7 Apr 2012 10:53:42 UTC 20 Mar 2013 16:13:42 UTC In progress --- --- 1,790.21 1,790.21 UK Met Office HADAM3P European Region v6.09

It's curious that credit was awarded with no CPU time shown, while showing "In progress." Perhaps we'll get a look at "stderr" later.
"We have met the enemy and he is us." -- Pogo
Greetings from coastal Washington state, the scenic US Pacific Northwest.
ID: 44132 · Report as offensive     Reply Quote
gian [Puglia]

Send message
Joined: 28 Dec 09
Posts: 2
Credit: 189,763
RAC: 0
Message 44144 - Posted: 2 May 2012, 21:24:03 UTC - in response to Message 44132.  

39 hours running since the % resetting, and there isn't any output. Should I abort it?


Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Phase Timestep CPU Time (sec) Average (sec/TS)
26 Apr 2012 23:40:21 1204156 14376140 hadam3p_eu_9zk3_1981_1_007860457_1 1 103,776 588,958 5.6753
26 Apr 2012 00:19:42 1204156 14376140 hadam3p_eu_9zk3_1981_1_007860457_1 1 92,256 513,756 5.5688
25 Apr 2012 01:23:54 1204156 14376140 hadam3p_eu_9zk3_1981_1_007860457_1 1 80,736 447,450 5.5421
24 Apr 2012 05:26:46 1204156 14376140 hadam3p_eu_9zk3_1981_1_007860457_1 1 69,216 381,368 5.5098
23 Apr 2012 09:49:19 1204156 14376140 hadam3p_eu_9zk3_1981_1_007860457_1 1 57,696 315,055 5.4606
22 Apr 2012 10:22:35 1204156 14376140 hadam3p_eu_9zk3_1981_1_007860457_1 1 46,176 249,414 5.4014
21 Apr 2012 14:43:00 1204156 14376140 hadam3p_eu_9zk3_1981_1_007860457_1 1 34,656 186,872 5.3922
20 Apr 2012 19:07:21 1204156 14376140 hadam3p_eu_9zk3_1981_1_007860457_1 1 23,136 124,864 5.3970
19 Apr 2012 23:36:07 1204156 14376140 hadam3p_eu_9zk3_1981_1_007860457_1 1 11,616 62,216 5.3561
ID: 44144 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 44145 - Posted: 2 May 2012, 21:35:09 UTC

These models do sometimes revert to the beginning if they get interrupted at a critical point.
What will most likely happen if you continue, is that the model will keep going. However, your computer isn't running them very fast, so there's 2 things that you can do:
1) Let it run, and redo all of the calculations again, with the hope that it won't restart again.
2) Abort it, and get a new one.


Backups: Here
ID: 44145 · Report as offensive     Reply Quote

Message boards : Number crunching : Progress % restarting from 0 on linux.

©2024 cpdn.org