Message boards : Number crunching : HadCM3 short 8.34 workunits errors
Message board moderation
Author | Message |
---|---|
Send message Joined: 19 Aug 15 Posts: 5 Credit: 2,899,370 RAC: 0 |
Are there people who have also errors when workunits of HadCM3 short 8.34 start ? good luck with climateprediction |
Send message Joined: 15 May 09 Posts: 4541 Credit: 19,039,635 RAC: 18,944 |
Hi Christophe, can you say what the errors are please? The tasks of this type are still listed as running on your computer https://www.cpdn.org/cpdnboinc/results.php?hostid=1396550 so anyone looking is going to have to guess what the problem is. Searching a bit, at least one windows and one linux computer have had failures with this batch but both the ones I found have quite a high failure rate anyway. Perhaps if they have failed when they report it will be possible to work out a bit more or if anyone else with failures reports in. |
Send message Joined: 22 Feb 06 Posts: 491 Credit: 31,364,793 RAC: 15,575 |
I've had one fail with an invalid theta. Failed after only 42sec of CPU time. |
Send message Joined: 22 Feb 06 Posts: 491 Credit: 31,364,793 RAC: 15,575 |
Just had a second one fail after a few seconds with a visual fortran runtime error. Is there a problem with this batch? |
Send message Joined: 15 May 09 Posts: 4541 Credit: 19,039,635 RAC: 18,944 |
I notice Alan that yours are from batch 595 whereas Cristophe's are from 599. Certainly, scouting around I haven't found any completed from 599 yet. I will let project know. |
Send message Joined: 15 May 09 Posts: 4541 Credit: 19,039,635 RAC: 18,944 |
I have had a reply from David in Oxford. Hi Dave, I suspect the same is true of 595 which has the same design. - I hadn't actually looked through 595 tasks to see how many were failing hence the reply being only mentioning the successes from them. |
Send message Joined: 15 May 09 Posts: 4541 Credit: 19,039,635 RAC: 18,944 |
And from Sarah, Hi, |
Send message Joined: 31 Dec 07 Posts: 1152 Credit: 22,363,583 RAC: 5,022 |
I have a batch 595 WU running now. It is 85% complete after nearly 6 days. It should complete in 1 day. So not all fail early. |
Send message Joined: 22 Feb 06 Posts: 491 Credit: 31,364,793 RAC: 15,575 |
And a third. |
Send message Joined: 3 Sep 04 Posts: 126 Credit: 26,610,380 RAC: 3,377 |
I received 9 tasks from batch 595. 8 completed successfully, the 9th hasn't started yet. |
Send message Joined: 5 Aug 04 Posts: 1120 Credit: 17,202,915 RAC: 2,154 |
Are there people who have also errors when workunits of HadCM3 short 8.34 start ? hadcm3s_a092_203412_120_599_011122340_1 Workunit 11122340 Created 18 Jul 2017, 8:31:10 UTC Sent 18 Jul 2017, 8:31:22 UTC Report deadline 30 Jun 2018, 13:51:22 UTC Received 18 Jul 2017, 12:23:28 UTC Server state Over Outcome Computation error Client state Compute error Exit status 22 (0x16) Unknown error number https://www.cpdn.org/cpdnboinc/result.php?resultid=20552745 <core_client_version>7.2.33</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> forrtl: severe (24): end-of-file during read, unit 5, file /home/boinc/projects/climateprediction.net/hadcm3s_a092_203412_120_599_011122340/jobs/climate.cpdc, line 873, position 0 Image PC Routine Line Source hadcm3s_um_8.34_i 0848A415 Unknown Unknown Unknown hadcm3s_um_8.34_i 084AE5F7 Unknown Unknown Unknown hadcm3s_um_8.34_i 082C98AF Unknown Unknown Unknown hadcm3s_um_8.34_i 081C028B Unknown Unknown Unknown hadcm3s_um_8.34_i 081C1E0A Unknown Unknown Unknown hadcm3s_um_8.34_i 083F95C9 Unknown Unknown Unknown hadcm3s_um_8.34_i 083F867F Unknown Unknown Unknown hadcm3s_um_8.34_i 0840346D Unknown Unknown Unknown libc-2.12.so 00421D26 __libc_start_main Unknown Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=27249, iMonCtr=1 Model crash detected, will try to restart... forrtl: severe (24): end-of-file during read, unit 5, file /home/boinc/projects/climateprediction.net/hadcm3s_a092_203412_120_599_011122340/jobs/climate.cpdc, line 873, position 0 Image PC Routine Line Source hadcm3s_um_8.34_i 0848A415 Unknown Unknown Unknown hadcm3s_um_8.34_i 084AE5F7 Unknown Unknown Unknown hadcm3s_um_8.34_i 082C98AF Unknown Unknown Unknown hadcm3s_um_8.34_i 081C028B Unknown Unknown Unknown hadcm3s_um_8.34_i 081C1E0A Unknown Unknown Unknown hadcm3s_um_8.34_i 083F95C9 Unknown Unknown Unknown hadcm3s_um_8.34_i 083F867F Unknown Unknown Unknown hadcm3s_um_8.34_i 0840346D Unknown Unknown Unknown libc-2.12.so 0033ED26 __libc_start_main Unknown Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=27249, iMonCtr=1 Model crash detected, will try to restart... forrtl: severe (24): end-of-file during read, unit 5, file /home/boinc/projects/climateprediction.net/hadcm3s_a092_203412_120_599_011122340/jobs/climate.cpdc, line 873, position 0 Image PC Routine Line Source hadcm3s_um_8.34_i 0848A415 Unknown Unknown Unknown hadcm3s_um_8.34_i 084AE5F7 Unknown Unknown Unknown hadcm3s_um_8.34_i 082C98AF Unknown Unknown Unknown hadcm3s_um_8.34_i 081C028B Unknown Unknown Unknown hadcm3s_um_8.34_i 081C1E0A Unknown Unknown Unknown hadcm3s_um_8.34_i 083F95C9 Unknown Unknown Unknown hadcm3s_um_8.34_i 083F867F Unknown Unknown Unknown hadcm3s_um_8.34_i 0840346D Unknown Unknown Unknown libc-2.12.so 0030ED26 __libc_start_main Unknown Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=27249, iMonCtr=1 Model crash detected, will try to restart... forrtl: severe (24): end-of-file during read, unit 5, file /home/boinc/projects/climateprediction.net/hadcm3s_a092_203412_120_599_011122340/jobs/climate.cpdc, line 873, position 0 Image PC Routine Line Source hadcm3s_um_8.34_i 0848A415 Unknown Unknown Unknown hadcm3s_um_8.34_i 084AE5F7 Unknown Unknown Unknown hadcm3s_um_8.34_i 082C98AF Unknown Unknown Unknown hadcm3s_um_8.34_i 081C028B Unknown Unknown Unknown hadcm3s_um_8.34_i 081C1E0A Unknown Unknown Unknown hadcm3s_um_8.34_i 083F95C9 Unknown Unknown Unknown hadcm3s_um_8.34_i 083F867F Unknown Unknown Unknown hadcm3s_um_8.34_i 0840346D Unknown Unknown Unknown libc-2.12.so 00322D26 __libc_start_main Unknown Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=27249, iMonCtr=1 Model crash detected, will try to restart... forrtl: severe (24): end-of-file during read, unit 5, file /home/boinc/projects/climateprediction.net/hadcm3s_a092_203412_120_599_011122340/jobs/climate.cpdc, line 873, position 0 Image PC Routine Line Source hadcm3s_um_8.34_i 0848A415 Unknown Unknown Unknown hadcm3s_um_8.34_i 084AE5F7 Unknown Unknown Unknown hadcm3s_um_8.34_i 082C98AF Unknown Unknown Unknown hadcm3s_um_8.34_i 081C028B Unknown Unknown Unknown hadcm3s_um_8.34_i 081C1E0A Unknown Unknown Unknown hadcm3s_um_8.34_i 083F95C9 Unknown Unknown Unknown hadcm3s_um_8.34_i 083F867F Unknown Unknown Unknown hadcm3s_um_8.34_i 0840346D Unknown Unknown Unknown libc-2.12.so 0030ED26 __libc_start_main Unknown Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=27249, iMonCtr=1 Model crash detected, will try to restart... forrtl: severe (24): end-of-file during read, unit 5, file /home/boinc/projects/climateprediction.net/hadcm3s_a092_203412_120_599_011122340/jobs/climate.cpdc, line 873, position 0 Image PC Routine Line Source hadcm3s_um_8.34_i 0848A415 Unknown Unknown Unknown hadcm3s_um_8.34_i 084AE5F7 Unknown Unknown Unknown hadcm3s_um_8.34_i 082C98AF Unknown Unknown Unknown hadcm3s_um_8.34_i 081C028B Unknown Unknown Unknown hadcm3s_um_8.34_i 081C1E0A Unknown Unknown Unknown hadcm3s_um_8.34_i 083F95C9 Unknown Unknown Unknown hadcm3s_um_8.34_i 083F867F Unknown Unknown Unknown hadcm3s_um_8.34_i 0840346D Unknown Unknown Unknown libc-2.12.so 0030ED26 __libc_start_main Unknown Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=27249, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Calling boinc_finish...04:32:01 (27249): called boinc_finish(22) In boinc_exit called with status 22 Calloing set_signal_exit_code with status 22 </stderr_txt> ]]> |
Send message Joined: 15 Oct 05 Posts: 1 Credit: 25,226,348 RAC: 15,094 |
I'm also getting errors. All start with: 'forrtl:severe (24): end-of-file read. Last line is 'Stack trace terminated abnormally. |
Send message Joined: 19 Aug 15 Posts: 5 Credit: 2,899,370 RAC: 0 |
Hi Christophe, The requested errors were already mentioned in the meanwhile on this topic... errors mentioning line873 |
Send message Joined: 9 Feb 17 Posts: 4 Credit: 2,447,704 RAC: 0 |
I have had exactly the same issue. I just aborted the task. |
©2024 cpdn.org