Message boards : Number crunching : exit code 193 (0xc1)
Message board moderation
Author | Message |
---|---|
Send message Joined: 9 Mar 05 Posts: 2 Credit: 1,984,293 RAC: 0 |
Hi there, two of the models my computer was crunching crashed lately. The second is still not reported as being crashed due to the project being offline. the first one: name: hadcm3n_p3yz_1940_40_007420580_1 it states: exit code 193 (0xc1) <stderr_txt> etVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 ..... I did spot some strange behaviour prior to the models crashing: their status in boinc manager was 'waiting to run', but the times did still change and they still used half a core on my quad-core system. Then after rebooting the system (for some other reason), the workunits errored out. the first was something about 80% and the second about 60% through. So I think there might be a problem with proper pausing and resuming the work-units. (they are left in memory while suspended) anything known that causes these problems? greetz, pe. |
Send message Joined: 17 Nov 07 Posts: 142 Credit: 4,271,370 RAC: 0 |
Hi pe, it seems that since June the HadCM3N models have become increasingly prone to this code 193 error. At first I thought the problem occurred only on Linux computers, but that is not so. It's been discussed a bit here: http://climateapps2.oerc.ox.ac.uk/cpdnboinc/forum_thread.php?id=7306 The problem must be due to changes in configuration files in new series of HadCM3Ns, since the executable program for HadCM3N has not changed since the first models were released early in June. The advice is to "avoid other activity on the computer", which does not explain why the early models were OK but later ones crash. |
Send message Joined: 9 Mar 05 Posts: 2 Credit: 1,984,293 RAC: 0 |
Hi Greg, thank you for your input. I did read a bit on the link you gave me. It seems my mem and hds are ok. these two wu's were the first in a long time to error out.. did you notice the lack of proper suspend too? greetz, pe. |
©2024 cpdn.org