Message boards : Number crunching : Should I continue crunching this work unit?
Message board moderation
Author | Message |
---|---|
Send message Joined: 5 Mar 06 Posts: 5 Credit: 405,068 RAC: 0 |
I started crunching \"hadsm3mh_kp2_006022174_2\" long time ago. Initially it indicated that crunching this work would last about 800 hours. But now, after 2481 hours of crunching, the progress accounts for only 4.451% and the time remained to complete this work becomes 3515 hours! (Let\'s see the below) --------------------------- Application: UK Met Office HADSM3 Mid-Holocene 6.02 Name: hadsm3mh_kp2_006022174_2 CPU time: 2481:27:53 Progress: 4.451% To complete: 3515:09:58 Report deadline: 2/12/2010 --------------------------- By the way, the progress remains around 4.451% while the time to complete is increasing all the time. Should I continue crunching this work? |
Send message Joined: 7 Aug 04 Posts: 2187 Credit: 64,822,615 RAC: 5,275 |
I would not continue crunching that work. I would abort it. It appears to be looping, i.e. continually restarting at the same point and never making any progress. |
Send message Joined: 5 Mar 06 Posts: 5 Credit: 405,068 RAC: 0 |
Thanks, I have aborted it |
Send message Joined: 29 Sep 04 Posts: 2363 Credit: 14,611,758 RAC: 0 |
|
Send message Joined: 5 Mar 06 Posts: 5 Credit: 405,068 RAC: 0 |
Hi, Perhaps I hit again another looping work unit. Initially it stated that crunching the unit \"hadsm3fub_k95o_006469046\" would last about 1000 hours (sorry, I don\'t remember the exact number). But now it seems to crunch indefinite time. Let\'s see 2 notes below (I registered 5 days ago and today): 1/ 18 march 2010 CPU time: 981:36:05 Progress: 21.252% To completion: 1078:17:30 2/ Today (23 march 2010) CPU time: 1061:35:01 Progress: 21.551% To completion: 1141:03:51 Is it really a looping work unit or not? |
Send message Joined: 3 Oct 06 Posts: 43 Credit: 8,017,057 RAC: 0 |
Is this the task you\'re talking about? http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=10336578 It trickled for the last time January 6th. It used to trickle about once a day. I would abort it. |
Send message Joined: 28 Nov 06 Posts: 89 Credit: 12,023,653 RAC: 4,025 |
Is it really a looping work unit or not? It may be a rewinding task, we are talking about such tasks here. Look to the current speed, if it is not slow, your task may be finished with \"Success\". |
Send message Joined: 29 Sep 04 Posts: 2363 Credit: 14,611,758 RAC: 0 |
Hi Vdquang You have an \'iceworld\'. The phenomenon is described here by Geophi. All computers on the same platform with the same operating system can be expected to hit this problem at the same point. That\'s what\'s happened with this workunit and you\'re not the only person with exactly the same iceworld problem at the same point. If you look at the model\'s graphics they will be monochrome showing only the default colour. The workunit is here; Vdquang\'s model is #4 in the list. One very fast computer is managing to send in a few trickles. But look at the sec/TS ie the speed (or, rather, how slow it is). If you restore a backup the same problem will happen again at the same point. Please abort the model. If you can please look quickly at your graphics for HadSM or HadSM MH models at least twice a week to check that you can see all the normal colours. Normal graphics indicate normal progress producing good data. Cpdn news |
Send message Joined: 5 Mar 06 Posts: 5 Credit: 405,068 RAC: 0 |
Reply to transient (message ID 39301): Is this the task you\'re talking about? http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=10336578 It trickled for the last time January 6th. It used to trickle about once a day. I would abort it. Oh, I have never paid my attention to tricle information. It is right that this work unit tricled for the last time on 6th January. ----------------- Reply to mo.v (message ID 39308): Hi Vdquang OK, I am going to abort it now. |
©2024 cpdn.org