Message boards : Number crunching : dammit
Author | Message |
---|---|
Send message Joined: 31 Aug 04 Posts: 13 Credit: 134,268 RAC: 0 |
So my workunit (754048) errored out with 3:43 left to go when someone printed to this computer. Oh well, only a couple hundred hours lost! Any idea why? For some reason it was removed from memory just before it completed. This has never been a problem before. Exit status -5 (0xfffffffb) <core_client_version>4.45</core_client_version> <message> - exit code -5 (0xfffffffb) </message> |
Send message Joined: 31 Aug 04 Posts: 13 Credit: 134,268 RAC: 0 |
Is there any way I can restart this result and try to let it finish? Sorry for the hassle, but I lost 18 days of compute time. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
You can only recover by deleting the BOINC folder and copying over a BOINC folder saved before the crash. Also, it's worth upgrading to the latest Version 5 from the download page. The model in question has 72 trickles and all the graph files, so it looks like it finished OK. And eventually you will get the last bit of credit for it. |
Send message Joined: 5 Aug 04 Posts: 1283 Credit: 15,824,334 RAC: 0 |
I remember another user having a similar problem once before. In that case the model crashed while generating one of the phase 3 rmts files. Can you check the structure of your projects/climateprediction.net/1hru_100090407 directory? As the model generated an error, I expect there will still be a dataout directory. What's the newest file in it? "The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer |
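
For anyone following the two diagnostic details in this thread, here is a minimal Python sketch, not anything shipped with BOINC or the project. The helper names `to_unsigned32` and `newest_file` are made up for illustration, and the directory to inspect is passed in by hand because the exact location of the dataout directory on a given machine is an assumption. The first helper shows why the log prints -5 and 0xfffffffb side by side: they are the same value, signed versus unsigned 32-bit. The second reports the most recently modified file in a directory.

```python
import sys
from pathlib import Path


def to_unsigned32(code: int) -> str:
    """Format a signed exit code as its unsigned 32-bit hex form."""
    # Masking with 0xFFFFFFFF gives the two's-complement bit pattern.
    return f"0x{code & 0xFFFFFFFF:08x}"


def newest_file(directory):
    """Return the most recently modified regular file, or None if there is none."""
    files = [p for p in Path(directory).iterdir() if p.is_file()]
    return max(files, key=lambda p: p.stat().st_mtime, default=None)


if __name__ == "__main__":
    # Same value as the "exit code -5 (0xfffffffb)" pair in the error message.
    print(to_unsigned32(-5))
    # Hypothetical usage: pass the path to your own dataout directory.
    if len(sys.argv) > 1:
        print(newest_file(sys.argv[1]))
```

Run it with the path to the dataout directory as the only argument. Only regular files directly inside that directory are considered, so anything in subdirectories is ignored.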
©2024 cpdn.org