Message boards : Number crunching : Every last HADAM3P European Region ends in computation error
Author | Message |
---|---|
Send message Joined: 16 Jul 05 Posts: 32 Credit: 10,513,155 RAC: 0 |
Hello, over the last few days I have encountered a lot of computation errors on HADAM3P European Region models. I don't think the problem is on my side, because the wingman also showed computation errors. For example: http://climateapps2.oerc.ox.ac.uk/cpdnboinc/workunit.php?wuid=7791693 Is there a bad batch of models running? Do you have any suggestions? Thanks, Nowi |
Send message Joined: 15 May 09 Posts: 4540 Credit: 19,011,472 RAC: 21,368 |
I know there is another thread that says something similar and it has been reported to the relevant team by one of the admins. Dave |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Nowi On the list that you provided: The first computer is failing because it's a Mac that's been upgraded without detaching/reattaching as per the sticky in the Mac section. (I'm about to report this.) The last one seems to be failing because of a computer problem. Which leaves yours. And I think that it's possible that it's your computer rather than the models. The most recent problem with these models was with download errors, not computation errors. And this has been fixed. |
Send message Joined: 16 Jul 05 Posts: 32 Credit: 10,513,155 RAC: 0 |
Thanks Les! Of course it is possible that the problem is my computer, especially with your extra information. I returned my last good result on 15.01.; after that every WU failed. I changed nothing in my configuration, only added Test4Theory again to my active project list... I will have a look at my computer and the CPDN tasks. I hope that the error will not persist. |
Send message Joined: 7 Aug 04 Posts: 2187 Credit: 64,822,615 RAC: 5,275 |
It looks like they are all failing with the same error, only EU tasks, and it started in November. My guess would be that one of the files the hadam3p EU model needs has become corrupted or gone missing. Perhaps set climateprediction.net to no new work, then do a project reset, then allow work again. That should clear out the files in the climateprediction.net directory and allow a fresh batch of files to download. |
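For anyone who prefers the command line to the BOINC Manager, the three steps above (no new work, project reset, allow work) could be scripted roughly as follows. This is only a sketch under assumptions: it presumes `boinccmd` is on the PATH, and the project URL below is a guess that should be checked against what `boinccmd --get_project_status` reports on your machine. The function names `reset_commands` and `run` are made up for this example. Note that a project reset discards any tasks currently in progress, so the script only prints the commands unless `dry_run` is turned off.

```python
import shlex
import subprocess

# Assumption: the project URL as registered in the local BOINC client.
# Verify it with `boinccmd --get_project_status` before running for real.
PROJECT_URL = "http://climateprediction.net/"


def reset_commands(project_url, boinccmd="boinccmd"):
    """Return the three boinccmd invocations for the procedure, in order."""
    # nomorework -> reset -> allowmorework mirrors the steps described above.
    return [
        [boinccmd, "--project", project_url, op]
        for op in ("nomorework", "reset", "allowmorework")
    ]


def run(project_url, dry_run=True):
    """Print the commands (dry run) or execute them one after another."""
    for cmd in reset_commands(project_url):
        if dry_run:
            print(shlex.join(cmd))
        else:
            # check=True stops the sequence if any step fails.
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    run(PROJECT_URL)  # dry run by default; pass dry_run=False to execute
```

Running it as-is just lists the three commands, which is a cheap way to confirm the URL and order before letting it touch the client.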
Send message Joined: 16 Jul 05 Posts: 32 Credit: 10,513,155 RAC: 0 |
Thanks Geophi! By now the last task is running fine (about 44 % completed). I will watch it and, if an error occurs, I will try your procedure. |
Send message Joined: 16 Jul 05 Posts: 32 Credit: 10,513,155 RAC: 0 |
I've got another one. All three exited with an error: http://climateapps2.oerc.ox.ac.uk/cpdnboinc/workunit.php?wuid=7807866 |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
|
Send message Joined: 5 Jun 06 Posts: 28 Credit: 2,790,048 RAC: 0 |
I had similar problems on two systems (Windows Server 2008 and Server 2003). On the 2003 server there was a popup error message:
Runtime Error! Program:E:\BOINC\projects... This application has requested the Runtime to terminate it in an unusual way. Please contact the application's support team for more information. [OK]
|
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
If BOINC is displaying 100%, then it has lost contact with the model, which usually means that the model has crashed. The only way to get it going again is to restore a backup made before the crash. It's possible that the temperature graph will be all blue, and the Hours Elapsed and the Timestep will not be advancing, or at best will advance only for a while before getting stuck in a constant loop. The runtime error is something that happens to some computers, sometimes. There's no known cure. |
Send message Joined: 5 Jun 06 Posts: 28 Credit: 2,790,048 RAC: 0 |
Thanks Les. I aborted the task sitting at 100%, and the error message disappeared. I closed and reopened BOINC, and the error message did not return. Might be worth knowing. |
©2024 cpdn.org