climateprediction.net home page
*sniff* failed WU infos

Lucid
Joined: 18 Dec 05
Posts: 7
Credit: 64,098
RAC: 0
Message 20878 - Posted: 1 Mar 2006, 2:21:00 UTC

Hello,

After running for 388 hrs (almost 20% done), this WU has failed. Here's some of the info I got:

Unrecoverable error for result sulphur_gqex_000780729_0 | sulphur_gqex_000780729_0_2.zip | -161

-AND from yabsd.out file-

Model aborted with error code - 102 Routine and message:- INITDUMP: Wrong no of atmos prognostic fields

----------------------

The CPDN WU crashed along with a Rosetta WU; the Einstein screensaver was running, but there were no crashes there. Not sure if one caused the other... After reading many posts concerning failed WUs, I decided to pass this info on to the devs.

I haven't had any problems with my PC or installed new software lately.

*sniff* And I was almost into Phase 2...

Lucid
Joined: 18 Dec 05
Posts: 7
Credit: 64,098
RAC: 0
Message 20880 - Posted: 1 Mar 2006, 2:25:38 UTC
Last modified: 1 Mar 2006, 2:27:15 UTC

Oh yeah, here: WU

geophi
Volunteer moderator
Joined: 7 Aug 04
Posts: 2185
Credit: 64,822,615
RAC: 5,275
Message 20887 - Posted: 1 Mar 2006, 5:19:53 UTC

Hi Lucid,

Nearly all WUs created between December 9th and December 23rd errored in this manner. There was nothing wrong with the software or hardware on your PC.

If it is any consolation, the scientists said that the first phase results, which you completed and uploaded, are very important to the next part of the experiment. The results are being used in the generation of the coupled model work units for the BBC project.
Lucid
Joined: 18 Dec 05
Posts: 7
Credit: 64,098
RAC: 0
Message 20913 - Posted: 1 Mar 2006, 16:37:24 UTC

geophi,

Thanks for the response. I also have a second WU that is about 2% complete here. Should I abort that one and get a fresh one, or continue on? Either way I'll crunch away...:)

geophi
Volunteer moderator
Joined: 7 Aug 04
Posts: 2185
Credit: 64,822,615
RAC: 5,275
Message 20915 - Posted: 1 Mar 2006, 17:11:34 UTC

There's a 99% likelihood that it will error at the beginning of phase 2. Like I said, the first phase completion is valuable. But if you want to run the full model, go ahead and abort it and download a new one. It's up to you.
Lucid
Joined: 18 Dec 05
Posts: 7
Credit: 64,098
RAC: 0
Message 20954 - Posted: 2 Mar 2006, 3:57:30 UTC

Thanks for your help. I'll run it for now and send it in when x happens.

