climateprediction.net (CPDN) home page
Thread 'exit code 193 (0xc1)'

Thread 'exit code 193 (0xc1)'

Message boards : Number crunching : exit code 193 (0xc1)
Message board moderation

To post messages, you must log in.

AuthorMessage
pe
Avatar

Send message
Joined: 9 Mar 05
Posts: 2
Credit: 1,984,293
RAC: 0
Message 43269 - Posted: 25 Oct 2011, 16:29:35 UTC

Hi there,

two of the models my computer was crunching crashed lately. The second is still not reported as being crashed due to the project being offline.
the first one:
name: hadcm3n_p3yz_1940_40_007420580_1
it states: exit code 193 (0xc1)
<stderr_txt>
etVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
.....

I did spot some strange behaviour prior to the models crashing: their status in boinc manager was 'waiting to run', but the times did still change and they still used half a core on my quad-core system. Then after rebooting the system (for some other reason), the workunits errored out. the first was something about 80% and the second about 60% through.
So I think there might be a problem with proper pausing and resuming the work-units. (they are left in memory while suspended)

anything known that causes these problems?

greetz, pe.

ID: 43269 · Report as offensive     Reply Quote
ProfileGreg van Paassen

Send message
Joined: 17 Nov 07
Posts: 142
Credit: 4,271,370
RAC: 0
Message 43280 - Posted: 26 Oct 2011, 1:01:49 UTC - in response to Message 43269.  

Hi pe,

it seems that since June the HadCM3N models have become increasingly prone to this code 193 error. At first I thought the problem occurred only on Linux computers, but that is not so. It's been discussed a bit here: http://climateapps2.oerc.ox.ac.uk/cpdnboinc/forum_thread.php?id=7306

The problem must be due to changes in configuration files in new series of HadCM3Ns, since the executable program for HadCM3N has not changed since the first models were released early in June.

The advice is to "avoid other activity on the computer", which does not explain why the early models were OK but later ones crash.
ID: 43280 · Report as offensive     Reply Quote
pe
Avatar

Send message
Joined: 9 Mar 05
Posts: 2
Credit: 1,984,293
RAC: 0
Message 43300 - Posted: 26 Oct 2011, 22:03:20 UTC - in response to Message 43280.  
Last modified: 26 Oct 2011, 22:03:32 UTC

Hi Greg,

thank you for your input.
I did read a bit on the link you gave me. It seems my mem and hds are ok.
these two wu's were the first in a long time to error out..

did you notice the lack of proper suspend too?

greetz, pe.
ID: 43300 · Report as offensive     Reply Quote

Message boards : Number crunching : exit code 193 (0xc1)

©2024 cpdn.org