Message boards : Number crunching : Task wont restart
Message board moderation
Author | Message |
---|---|
Send message Joined: 28 Jun 07 Posts: 6 Credit: 929,653 RAC: 20,169 |
Hi, I have this task running: http://climateapps2.oerc.ox.ac.uk/cpdnboinc/result.php?resultid=15813485 But it won't restart. I tried restarting my computer several times, but it is stuck at 25.097%. ps aux on my computers says _user_ 3540 0.2 0.1 9836 7144 ? SNl 09:58 0:00 ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu hadcm3n_o4ep_1940_40_008382057 ocean_o4ep_1940_40_008382057_0 atmos_o4ep_1940_40_008382057_0 spec3a_sw_3_asol2c_hadcm3 spec3a_lw_3_asol2c_hadcm3 waterfix.ancil.be.32 NAT_VOLC DMSSO2NH3_1900_RCP sulpc_oxidants_19_A2_1990f SPARC_O3_rebuild_1900 It isn't using CPU resources so it isn't really running, as you can see CPU time is zero. The last lines of stderr.txt (while still in 'running' state) in the slots directory are these [...] What should I do? Is there a way to start this one again? Should I abort? Is one of the devs interested in more information? |
Send message Joined: 13 Jan 06 Posts: 1498 Credit: 15,613,038 RAC: 0 |
There is a fragile point in the models at 25%, 50%, 75%, and 100% (when certain key output files are being generated). It is not uncommon for them to get stuck at this point. The only solution is either to revert to a backup, or to abort them. I'm a volunteer and my views are my own. News and Announcements and FAQ |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
The parts of interest are: 25.097% malloc and failed All of the 25% points are a very common place for this model type to fail. The malloc ... failed is a common failure message. The only cure is to abort it. edit Beaten to it. :) |
©2024 cpdn.org