Model has crashed

pschoefer
Joined: 21 Dec 04
Posts: 4
Credit: 480,331
RAC: 0
Message 13281 - Posted: 9 Jun 2005, 11:35:21 UTC

I had a big problem with my CPDN model yesterday. The first phase had just finished when the HadSM application crashed ("hadsm3_4.12 caused a problem and was closed"). After that I got the following message in the BOINC_GUI:

2005-06-08 19:57:12 [climateprediction.net] Unrecoverable error for result 1m0w_000095972_1 ( - exit code -1073741819 (0xc0000005))

stdout_um.txt:
Starting HadSM3 model for ID# 1m0w_000095972...
Changing to slots directory C:\Programme\BOINC\slots\2
Model finished with 1161474.848813 CPU Time...
Detaching shared memory, closing model...

What does "exit code -1073741819" mean, and why was the model closed while the application was detaching shared memory?

My system:
1.5 GHz Intel Celeron
256 MB RAM (32 MB of it is shared memory)
BOINC 4.43

crandles
Volunteer moderator
Joined: 16 Oct 04
Posts: 692
Credit: 277,679
RAC: 0
Message 13288 - Posted: 9 Jun 2005, 14:28:27 UTC

Welcome to the boards.

Not sure about what -1073741819 means. The 0xc0000005 could be a Windows STOP message.

There are a few discussions around if you search for -1073741819.

For example, this thread:
http://climateapps2.oucs.ox.ac.uk/cpdnboinc/forum_thread.php?id=2616

geophi
Volunteer moderator
Joined: 7 Aug 04
Posts: 2184
Credit: 64,822,615
RAC: 5,275
Message 13292 - Posted: 9 Jun 2005, 14:44:36 UTC

Frustrating, isn't it? This seems to occur most frequently at the end of a phase, during periods of intense disk activity. I've lost one model at the end of phase 2 and one at the end of phase 3 with this error. Sorry, I don't have a ready remedy for this problem.

TulipVorlax
Joined: 17 Mar 05
Posts: 1
Credit: 342,401
RAC: 1,316
Message 13339 - Posted: 11 Jun 2005, 16:46:16 UTC


hadsm3_4.12 just keeps crashing for me. It runs well for a few minutes and then crashes. I don't think it's restarting from zero, though.

It has only happened since I upgraded BOINC to the latest version. Should I go back to the version I was using previously? I've kept all the setup files in a safe place.

In the meantime, I'll run a disk check.
See ya.

old_user248
Joined: 6 Aug 04
Posts: 65
Credit: 1,605,224
RAC: 0
Message 13361 - Posted: 12 Jun 2005, 11:27:26 UTC

It might also be a good idea to find out who manufactured your hard disk and download a utility from their web site to check the HD for problems. If this only happens during intense disk activity, then the problem probably lies somewhere at that end of the chain.

Also, what other programs are running when this happens that may be doing a fair bit of activity? Myself, I have my antivirus software set to exclude the directories under climate prediction, except when I run a weekly scan for problems.

On the 4.12 front, it seems that some people are having problems with this particular version, especially if running Win98 or Me. Just to be on the safe side, maybe try running the Prime95 torture test or another intensive stability checker in case something has changed within your system. I personally haven't tried all the latest versions of BOINC, so I can't comment on the BOINC version. I just run the ones currently listed on the CP web site.

geophi
Volunteer moderator
Joined: 7 Aug 04
Posts: 2184
Credit: 64,822,615
RAC: 5,275
Message 13362 - Posted: 12 Jun 2005, 13:18:09 UTC - in response to Message 13361.  

> It might also be a good idea to find out who manufactured your hard disk and
> download a utility from their web site to check the HD for problems. If this
> only happens during intense disk activity, then the problem probably lies
> somewhere at that end of the chain.
>
For me...
This I did, immediately after the first crash, and again after the second. No problems according to the diagnostic software.
>
> Also, what other programs are running when this happens that may be doing a
> fair bit of activity? Myself, I have my antivirus software set to exclude the
> directories under climate prediction, except when I run a weekly scan for
> problems.
>
My problems occurred on a dedicated crunching PC; no AV software is running on it.

old_user2098
Joined: 27 Aug 04
Posts: 23
Credit: 747,508
RAC: 0
Message 14308 - Posted: 12 Jul 2005, 19:50:40 UTC
Last modified: 12 Jul 2005, 19:52:49 UTC

I also got this error for workunit 542137 resultID 808917.

Here is the error output:

core_client_version=4.19
exit code= -1073741819 (0xc0000005)
active_task_state=1
signal=0


This is the first time I have had this error.

Andrew Hingston
Volunteer moderator
Joined: 17 Aug 04
Posts: 753
Credit: 9,804,700
RAC: 0
Message 14311 - Posted: 12 Jul 2005, 21:04:54 UTC - in response to Message 14308.  
Last modified: 12 Jul 2005, 21:06:20 UTC

> This is the first time I have had this error.

The mystery to me is why some people do not experience it, whilst others of us do so fairly regularly on different machines which, until v 4.1x of the client, were wholly reliable running CPDN.

old_user2098
Joined: 27 Aug 04
Posts: 23
Credit: 747,508
RAC: 0
Message 14358 - Posted: 14 Jul 2005, 19:08:32 UTC - in response to Message 14311.  
Last modified: 14 Jul 2005, 19:09:25 UTC

By the way, does anyone know what this error code means?
Is it a BOINC error code or a CPDN one?

Somewhere in the forum I found a reference to a BOINC error codes file:
http://boinc.berkeley.edu/error_numbers.h

I don't know whether it is complete (or even whether it is the actual BOINC error codes file!).

Also, does anyone have an equivalent for CPDN?



Andrew Hingston
Volunteer moderator
Joined: 17 Aug 04
Posts: 753
Credit: 9,804,700
RAC: 0
Message 14364 - Posted: 14 Jul 2005, 21:09:22 UTC - in response to Message 14358.  

> By the way, does anyone know what this error code means?

I don't claim to understand these things, but error -1073741819 is a Windows system error and seems to refer to an access violation relating to file handling, which would fit with it happening mainly at the end of a phase. It generates a Windows error message. There has been discussion in various threads, and if you enjoy these things, a Google search brings up lots of interesting material, but nothing that enables someone like me to identify why it is happening.

It seems to have occurred only after HadSM3 went to v4.10, and not to depend on the BOINC version.
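
For anyone wondering how the two numbers in the BOINC message relate: the decimal exit code and the hex value in parentheses appear to be the same 32-bit value, printed once as a signed integer and once as unsigned hex. The sketch below shows the conversion. The function names and the one-entry lookup table are made up for illustration and are not part of BOINC or CPDN; the only outside fact used is that Microsoft documents the NTSTATUS value 0xC0000005 as STATUS_ACCESS_VIOLATION, meaning the process touched memory it was not allowed to access. It says nothing about why HadSM3 triggers it.

```python
# Illustrative helpers (hypothetical names, not from BOINC or CPDN).

# Assumption: a tiny hand-written table; only this one NTSTATUS value
# is relevant to the thread.
KNOWN_NTSTATUS = {
    0xC0000005: "STATUS_ACCESS_VIOLATION",
}


def to_unsigned32(code: int) -> int:
    """Reinterpret a signed 32-bit exit code as an unsigned 32-bit value."""
    return code & 0xFFFFFFFF


def to_signed32(value: int) -> int:
    """Reinterpret an unsigned 32-bit value as a signed 32-bit integer."""
    value &= 0xFFFFFFFF
    return value - 0x100000000 if value >= 0x80000000 else value


def describe_exit_code(code: int) -> str:
    """Format an exit code the way the log shows it, plus a name if known."""
    unsigned = to_unsigned32(code)
    name = KNOWN_NTSTATUS.get(unsigned, "unknown")
    return f"0x{unsigned:08x} ({name})"


if __name__ == "__main__":
    # The exit code reported in this thread.
    print(describe_exit_code(-1073741819))
```

Going the other way, `to_signed32(0xC0000005)` gives back the negative decimal number from the log, which is why searching for either form turns up the same discussions.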