Model startup failure: who killed hadsm3um?

Questions and Answers : Unix/Linux : Model startup failure: who killed hadsm3um?

old_user11305

Joined: 3 Sep 04
Posts: 2
Credit: 595
RAC: 0
Message 2890 - Posted: 3 Sep 2004, 16:18:52 UTC
Last modified: 3 Sep 2004, 16:25:43 UTC

Project subscription went OK and the files downloaded too, but after starting BOINC I get:

...
Copying files for startup...
Starting model ID 1u40_000106555 Phase 1
Stack size=4096.00 MB
Waiting for model startup, this may take a minute...
1u40_000106555 - PH 1 TS 000001 - 00/00/0000 00:00 - H:M:S=0000:00:00 AVG= 0.00 DLT= 0.00

No further output. The model never finishes starting up. I think it has something to do with this:

ps ax:
12728 pts/4 S+ 0:00 ./boinc_4.05_i686-pc-linux-gnu
12729 pts/4 SN+ 0:00 hadsm3_4.03_i686-pc-linux-gnu 1u40_000106555
12730 pts/4 ZN+ 0:00 [hadsm3um_4.03_i] defunct


BOINC client version 4.05 for i686-pc-linux-gnu
OS: Linux Debian Sarge

Best regards.


After a while:
Model timeout at 180.00 seconds
Model crashed...retrying...restart level 0
Preparing for restart...
Rewinding a model-day...
Starting model ID 1u40_000106555 Phase 1
Stack size=4096.00 MB
Waiting for model startup, this may take a minute...
1u40_000106555 - PH 1 TS 000001 - 00/00/0000 00:00 - H:M:S=0000:00:00 AVG= 0.00 DLT= 0.00

Unfortunately, hadsm3um_4.03_i is always defunct.
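For anyone who wants to check for the same symptom, here is a minimal Python sketch that scans `ps ax` output for zombie (defunct) processes. A state code beginning with `Z` marks a process that has exited but has not yet been reaped by its parent. The function name `find_defunct` is hypothetical, and the parser assumes the five-column layout shown in the listing above (PID, TTY, STAT, TIME, COMMAND); it is not part of BOINC or the climateprediction.net client.

```python
def find_defunct(ps_output):
    """Return (pid, command) pairs for zombie processes in `ps ax` output.

    Assumes the column layout shown in the post:
    PID, TTY, STAT, TIME, COMMAND.
    """
    zombies = []
    for line in ps_output.splitlines():
        # Split into at most five fields so the command keeps its spaces.
        fields = line.split(None, 4)
        if len(fields) < 5 or not fields[0].isdigit():
            continue  # skip the header line and anything malformed
        pid, _tty, stat, _time, command = fields
        if stat.startswith("Z"):  # Z = zombie: exited, not yet reaped
            zombies.append((int(pid), command))
    return zombies


# The three lines quoted in the post, used here as sample input.
sample = """\
12728 pts/4 S+ 0:00 ./boinc_4.05_i686-pc-linux-gnu
12729 pts/4 SN+ 0:00 hadsm3_4.03_i686-pc-linux-gnu 1u40_000106555
12730 pts/4 ZN+ 0:00 [hadsm3um_4.03_i] defunct
"""
print(find_defunct(sample))
```

On a live system the same function could be fed the output of `subprocess.run(["ps", "ax"], capture_output=True, text=True).stdout`. A zombie entry only shows that the child exited early; the cause still has to be found in the model's own output.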
Profile Thyme Lawn
Volunteer moderator

Joined: 5 Aug 04
Posts: 1283
Credit: 15,824,334
RAC: 0
Message 2903 - Posted: 3 Sep 2004, 17:13:28 UTC

Hi [Miles],

Your problem might be related to a Visual Fortran error that's been afflicting the Windows build recently. It seems that some workunits have gone out with a duff file.

Check out this thread on the phpBB forum: http://www.climateprediction.net/board/viewtopic.php?t=2296&p=20006#20006

And thanks to sjokela for doing the investigative work and UK_Nick for providing a link to the file that gives a workaround for the problem :)

old_user4563

Joined: 31 Aug 04
Posts: 3
Credit: 55,768
RAC: 0
Message 2908 - Posted: 3 Sep 2004, 18:29:08 UTC - in response to Message 2903.  

I'm having the exact same problem on three Linux machines; the distro makes no difference. However, I'm not having a problem under Windows XP. Go figure, eh?
old_user11305

Joined: 3 Sep 04
Posts: 2
Credit: 595
RAC: 0
Message 2928 - Posted: 3 Sep 2004, 22:27:11 UTC

I've replaced the spec3a_sw_3_asol2b_hadcm3 file and the client started working.
Should I remove the ClimatePrediction project from my BOINC client and add it again, hoping to get a correct new model, or can I finish the work with this one?

Thanks for your help
old_user1
Joined: 5 Aug 04
Posts: 907
Credit: 299,864
RAC: 0
Message 2968 - Posted: 4 Sep 2004, 9:16:54 UTC - in response to Message 2928.  

spec* files can be different for each experiment, so it is best to "Reset" and then reattach to get a new, correct workunit. If you still have a problem, please let us know, but it appears that regenerating the workunits last night fixed the problem in the spec* files.