Questions and Answers :
Unix/Linux :
Model crashing...is it me?
Message board moderation
Author | Message |
---|---|
Send message Joined: 5 Aug 04 Posts: 39 Credit: 14,887 RAC: 0 |
Hi I just installed the new BOINC 4.02 and registered for the project. Everything loaded up fine and the model started to crunch. However, after just a few steps it crashed. The next model did the same. Am I doing something wrong? After the second model crashed BOINC even crashed! (this is the first time I have ever seen this!) (Only difference is that I'm running as root, but I don't expect that to have any influence?) Complete log: 2004-08-05 18:59:54 [---] General prefs: from climateprediction.net (last modified 2004-08-05 18:53:04) 2004-08-05 18:59:54 [---] General prefs: no separate prefs for home; using your defaults 2004-08-05 18:59:54 [climateprediction.net] Project prefs: no separate prefs for home; using your defaults 2004-08-05 18:59:54 [climateprediction.net] Finished download of 006g_000025217.zip 2004-08-05 18:59:54 [climateprediction.net] Approximate throughput 17894.461215 bytes/sec 2004-08-05 18:59:54 [climateprediction.net] Starting computation for result 006g_000025217_0 using hadsm3 version 4.02 Starting model in /root/boinc/projects/climateprediction.net... Archive: hadsm3data_4.02_i686-pc-linux-gnu.zip creating: 006g_000025217/dataout/ inflating: 006g_000025217/dataout/thist creating: 006g_000025217/jobs/ inflating: 006g_000025217/jobs/control.stashc inflating: 006g_000025217/jobs/double.stashc inflating: 006g_000025217/jobs/Recona.12 inflating: 006g_000025217/jobs/Recona.13 inflating: 006g_000025217/jobs/spec3a_lw_3_asol2c_hadcm3 inflating: 006g_000025217/jobs/spec3a_sw_3_asol2b_hadcm3 inflating: 006g_000025217/jobs/spin.stashc inflating: 006g_000025217/jobs/yabsd.ihist inflating: 006g_000025217/jobs/yabsd.PRESM_A extracting: 006g_000025217/jobs/yabsd.PRESM_O extracting: 006g_000025217/jobs/yabsd.PRESM_S extracting: 006g_000025217/jobs/yabsd.PRESM_W creating: 006g_000025217/tmp/ inflating: 006g_000025217/tmp/cache2 inflating: 006g_000025217/tmp/cp.namelists extracting: 006g_000025217/tmp/pipe_dummy creating: 006g_000025217/viz/ inflating: 006g_000025217/viz/globe.rgb inflating: 006g_000025217/registration_license.txt creating: 006g_000025217/datain/ creating: 006g_000025217/datain/ancil/ creating: 006g_000025217/datain/ancil/ctldata/ creating: 006g_000025217/datain/ancil/ctldata/stasets/ inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01001218 inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01002207 inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01003236 inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01003237 extracting: 006g_000025217/datain/ancil/ctldata/stasets/X01003254 inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01003255 inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01003274 inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01003275 inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01003276 inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01003277 inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01003278 inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01003279 inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01003280 inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01003281 inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01003286 inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01005207 inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01005208 inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01005222 inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01005223 inflating: 006g_000025217/datain/ancil/ctldata/stasets/X01010206 creating: 006g_000025217/datain/ancil/ctldata/STASHmaster/ inflating: 006g_000025217/datain/ancil/ctldata/STASHmaster/STASHmaster_A inflating: 006g_000025217/datain/ancil/ctldata/STASHmaster/STASHmaster_O inflating: 006g_000025217/datain/ancil/ctldata/STASHmaster/STASHmaster_S inflating: 006g_000025217/datain/ancil/ctldata/STASHmaster/STASHmaster_W inflating: 006g_000025217/datain/ancil/qrclim.icedp.32 inflating: 006g_000025217/datain/ancil/qrclim.newsst5.32 inflating: 006g_000025217/datain/ancil/qrclim.ozone_preind_corr inflating: 006g_000025217/datain/ancil/qrclim.uvcurr.32 creating: 006g_000025217/datain/dumps/ inflating: 006g_000025217/datain/dumps/slab32_1810.start inflating: 006g_000025217/datain/lats inflating: 006g_000025217/datain/ppcodes Archive: 006g_000025217.zip inflating: 006g_000025217/jobs/climate.spin inflating: 006g_000025217/jobs/climate.cont inflating: 006g_000025217/jobs/climate.doub inflating: 006g_000025217/jobs/ncatts.cpdc Created shared memory region key = 24630 Env Used=LD_LIBRARY_PATH=/root/boinc/projects/climateprediction.net:/usr/local/lib:/usr/lib:/lib Copying files for startup... In pre_initialise_phase (part 1 of 3) In initialise_phase (part 2 of 3) In startup_phase (part 3 of 3) Starting model ID 006g_000025217 Phase 1 Stack size=48.00 MB Waiting for model startup, this may take a minute... 006g_000025217 - PH 1 TS 000001 - 00/00/0000 00:00 - H:M:S=0000:00:00 AVG= 0.00 DLT= 0.00 006g_000025217 - PH 1 TS 000003 - 01/12/1810 01:30 - H:M:S=0000:00:10 AVG= 3.50 DLT= 5.25 006g_000025217 - PH 1 TS 000004 - 01/12/1810 02:00 - H:M:S=0000:00:11 AVG= 2.88 DLT= 1.00 006g_000025217 - PH 1 TS 000005 - 01/12/1810 02:30 - H:M:S=0000:00:12 AVG= 2.50 DLT= 1.00 006g_000025217 - PH 1 TS 000007 - 01/12/1810 03:30 - H:M:S=0000:00:14 AVG= 2.07 DLT= 1.00 Model crashed...retrying... adding: ncatts.cpdc (deflated 72%) adding: climate.cont (deflated 79%) adding: climate.cpdc (deflated 79%) adding: climate.doub (deflated 79%) adding: climate.spin (deflated 79%) adding: 006g_000025217.xml (deflated 70%) adding: ncatts.cpdc (deflated 72%) adding: ncatts.cpdc (deflated 72%) adding: ncatts.cpdc (deflated 72%) adding: stderr_um.txt (deflated 74%) adding: yabsd.out (deflated 93%) adding: restart.day (deflated 43%) 2004-08-05 19:00:14 [climateprediction.net] Unrecoverable error for result 006g_000025217_0 (process exited with code 251 (0xfb)) 2004-08-05 19:00:14 [climateprediction.net] Unrecoverable error for result 006g_000025217_0 (process exited with code 251 (0xfb)) 2004-08-05 19:00:14 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds 2004-08-05 19:00:14 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds 2004-08-05 19:00:14 [climateprediction.net] Computation for result 006g_000025217 finished 2004-08-05 19:00:14 [climateprediction.net] Started upload of 006g_000025217_0_1.zip 2004-08-05 19:00:14 [climateprediction.net] Started upload of 006g_000025217_0_2.zip 2004-08-05 19:00:14 [climateprediction.net] Finished upload of 006g_000025217_0_1.zip 2004-08-05 19:00:14 [climateprediction.net] Approximate throughput 5931.765245 bytes/sec 2004-08-05 19:00:15 [climateprediction.net] Started upload of 006g_000025217_0_3.zip 2004-08-05 19:00:15 [climateprediction.net] Finished upload of 006g_000025217_0_2.zip 2004-08-05 19:00:15 [climateprediction.net] Approximate throughput 19671.998588 bytes/sec 2004-08-05 19:00:15 [climateprediction.net] Started upload of 006g_000025217_0_4.zip 2004-08-05 19:00:16 [climateprediction.net] Finished upload of 006g_000025217_0_3.zip 2004-08-05 19:00:16 [climateprediction.net] Approximate throughput 5908.106585 bytes/sec 2004-08-05 19:00:16 [climateprediction.net] Started upload of 006g_000025217_0_5.zip 2004-08-05 19:00:16 [climateprediction.net] Finished upload of 006g_000025217_0_4.zip 2004-08-05 19:00:16 [climateprediction.net] Approximate throughput 5754.167789 bytes/sec 2004-08-05 19:00:19 [climateprediction.net] Finished upload of 006g_000025217_0_5.zip 2004-08-05 19:00:19 [climateprediction.net] Approximate throughput 25372.677485 bytes/sec 2004-08-05 19:01:15 [---] CPU scheduler starvation imminent; requesting more work 2004-08-05 19:01:15 [climateprediction.net] Requesting 6399 seconds of work 2004-08-05 19:01:15 [climateprediction.net] Sending request to scheduler: http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi 2004-08-05 19:01:16 [climateprediction.net] Scheduler RPC to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi succeeded 2004-08-05 19:01:16 [climateprediction.net] Started download of 006q_000025227.zip 2004-08-05 19:01:16 [climateprediction.net] Finished download of 006q_000025227.zip 2004-08-05 19:01:16 [climateprediction.net] Approximate throughput 25149.116738 bytes/sec 2004-08-05 19:01:16 [climateprediction.net] Starting computation for result 006q_000025227_0 using hadsm3 version 4.02 Starting model in /root/boinc/projects/climateprediction.net... Archive: hadsm3data_4.02_i686-pc-linux-gnu.zip creating: 006q_000025227/dataout/ inflating: 006q_000025227/dataout/thist creating: 006q_000025227/jobs/ inflating: 006q_000025227/jobs/control.stashc inflating: 006q_000025227/jobs/double.stashc inflating: 006q_000025227/jobs/Recona.12 inflating: 006q_000025227/jobs/Recona.13 inflating: 006q_000025227/jobs/spec3a_lw_3_asol2c_hadcm3 inflating: 006q_000025227/jobs/spec3a_sw_3_asol2b_hadcm3 inflating: 006q_000025227/jobs/spin.stashc inflating: 006q_000025227/jobs/yabsd.ihist inflating: 006q_000025227/jobs/yabsd.PRESM_A extracting: 006q_000025227/jobs/yabsd.PRESM_O extracting: 006q_000025227/jobs/yabsd.PRESM_S extracting: 006q_000025227/jobs/yabsd.PRESM_W creating: 006q_000025227/tmp/ inflating: 006q_000025227/tmp/cache2 inflating: 006q_000025227/tmp/cp.namelists extracting: 006q_000025227/tmp/pipe_dummy creating: 006q_000025227/viz/ inflating: 006q_000025227/viz/globe.rgb inflating: 006q_000025227/registration_license.txt creating: 006q_000025227/datain/ creating: 006q_000025227/datain/ancil/ creating: 006q_000025227/datain/ancil/ctldata/ creating: 006q_000025227/datain/ancil/ctldata/stasets/ inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01001218 inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01002207 inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01003236 inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01003237 extracting: 006q_000025227/datain/ancil/ctldata/stasets/X01003254 inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01003255 inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01003274 inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01003275 inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01003276 inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01003277 inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01003278 inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01003279 inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01003280 inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01003281 inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01003286 inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01005207 inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01005208 inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01005222 inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01005223 inflating: 006q_000025227/datain/ancil/ctldata/stasets/X01010206 creating: 006q_000025227/datain/ancil/ctldata/STASHmaster/ inflating: 006q_000025227/datain/ancil/ctldata/STASHmaster/STASHmaster_A inflating: 006q_000025227/datain/ancil/ctldata/STASHmaster/STASHmaster_O inflating: 006q_000025227/datain/ancil/ctldata/STASHmaster/STASHmaster_S inflating: 006q_000025227/datain/ancil/ctldata/STASHmaster/STASHmaster_W inflating: 006q_000025227/datain/ancil/qrclim.icedp.32 inflating: 006q_000025227/datain/ancil/qrclim.newsst5.32 inflating: 006q_000025227/datain/ancil/qrclim.ozone_preind_corr inflating: 006q_000025227/datain/ancil/qrclim.uvcurr.32 creating: 006q_000025227/datain/dumps/ inflating: 006q_000025227/datain/dumps/slab32_1810.start inflating: 006q_000025227/datain/lats inflating: 006q_000025227/datain/ppcodes Archive: 006q_000025227.zip inflating: 006q_000025227/jobs/climate.spin inflating: 006q_000025227/jobs/climate.cont inflating: 006q_000025227/jobs/climate.doub inflating: 006q_000025227/jobs/ncatts.cpdc Created shared memory region key = 24840 Env Used=LD_LIBRARY_PATH=/root/boinc/projects/climateprediction.net:/usr/local/lib:/usr/lib:/lib Copying files for startup... In pre_initialise_phase (part 1 of 3) In initialise_phase (part 2 of 3) In startup_phase (part 3 of 3) Starting model ID 006q_000025227 Phase 1 Stack size=48.00 MB Waiting for model startup, this may take a minute... 006q_000025227 - PH 1 TS 000001 - 01/12/1810 00:30 - H:M:S=0000:00:00 AVG= 0.00 DLT= 0.00 006q_000025227 - PH 1 TS 000002 - 01/12/1810 01:00 - H:M:S=0000:01:14 AVG=37.30 DLT=74.61 Model crashed...retrying... adding: ncatts.cpdc (deflated 72%) adding: climate.cont (deflated 79%) adding: climate.cpdc (deflated 79%) adding: climate.doub (deflated 79%) adding: climate.spin (deflated 79%) adding: 006q_000025227.xml (deflated 70%) adding: ncatts.cpdc (deflated 72%) adding: ncatts.cpdc (deflated 72%) adding: ncatts.cpdc (deflated 72%) adding: stderr_um.txt (deflated 74%) adding: yabsd.out (deflated 100%) adding: restart.day (deflated 43%) 2004-08-05 19:02:37 [climateprediction.net] Unrecoverable error for result 006q_000025227_0 (process exited with code 251 (0xfb)) 2004-08-05 19:02:37 [climateprediction.net] Unrecoverable error for result 006q_000025227_0 (process exited with code 251 (0xfb)) 2004-08-05 19:02:37 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds 2004-08-05 19:02:37 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds 2004-08-05 19:02:37 [climateprediction.net] Computation for result 006q_000025227 finished 2004-08-05 19:02:37 [climateprediction.net] Started upload of 006q_000025227_0_1.zip 2004-08-05 19:02:37 [climateprediction.net] Started upload of 006q_000025227_0_2.zip 2004-08-05 19:02:38 [climateprediction.net] Error on file upload: invalid signature 2004-08-05 19:02:38 [climateprediction.net] Error on file upload: invalid signature 2004-08-05 19:02:38 [climateprediction.net] Permanently failed upload of 006q_000025227_0_1.zip 2004-08-05 19:02:38 [climateprediction.net] Giving up on upload of 006q_000025227_0_1.zip: server rejected file 2004-08-05 19:02:38 [climateprediction.net] Giving up on upload of 006q_000025227_0_1.zip: server rejected file SIGSEGV: segmentation violation Exiting... |
Send message Joined: 5 Aug 04 Posts: 84 Credit: 76,646 RAC: 0 |
PC overclocked? |
Send message Joined: 5 Aug 04 Posts: 39 Credit: 14,887 RAC: 0 |
No - and the error is reproducable - it does this every time. The upload error crashes the core client and every time I get a model downloaded it crashes after a few steps. ------------------------------ Run 2 ------------------------------ 2004-08-05 19:29:53 [climateprediction.net] Started upload of 006q_000025227_0_3.zip 2004-08-05 19:29:54 [climateprediction.net] Started upload of 006q_000025227_0_4.zip 2004-08-05 19:29:55 [climateprediction.net] Error on file upload: invalid signature 2004-08-05 19:29:55 [climateprediction.net] Error on file upload: invalid signature 2004-08-05 19:29:55 [climateprediction.net] Permanently failed upload of 006q_000025227_0_3.zip 2004-08-05 19:29:55 [climateprediction.net] Giving up on upload of 006q_000025227_0_3.zip: server rejected file 2004-08-05 19:29:55 [climateprediction.net] Giving up on upload of 006q_000025227_0_3.zip: server rejected file SIGSEGV: segmentation violation Exiting... ------------------------------ Run 3 ------------------------------ 2004-08-05 19:34:08 [climateprediction.net] Started upload of 006q_000025227_0_2.zip 2004-08-05 19:34:08 [climateprediction.net] Started upload of 006q_000025227_0_3.zip HTTP::init_post2: couldn't get file size 2004-08-05 19:34:09 [climateprediction.net] Giving up on upload of 006q_000025227_0_3.zip: File downloaded was not the correct file or was garbage from bad URL 2004-08-05 19:34:09 [climateprediction.net] Giving up on upload of 006q_000025227_0_3.zip: File downloaded was not the correct file or was garbage from bad URL 2004-08-05 19:34:09 [climateprediction.net] Started upload of 006q_000025227_0_4.zip 2004-08-05 19:34:10 [climateprediction.net] Error on file upload: invalid signature 2004-08-05 19:34:10 [climateprediction.net] Error on file upload: invalid signature 2004-08-05 19:34:10 [climateprediction.net] Permanently failed upload of 006q_000025227_0_2.zip 2004-08-05 19:34:10 [climateprediction.net] Giving up on upload of 006q_000025227_0_2.zip: server rejected file 2004-08-05 19:34:10 [climateprediction.net] Giving up on upload of 006q_000025227_0_2.zip: server rejected file SIGSEGV: segmentation violation Exiting... ------------------------------ Run ...6 ------------------------------ 2004-08-05 19:36:16 [climateprediction.net] Started download of 006x_000025234.zip HTTP::init_post2: couldn't get file size 2004-08-05 19:36:16 [climateprediction.net] Giving up on upload of 006q_000025227_0_5.zip: File downloaded was not the correct file or was garbage from bad URL 2004-08-05 19:36:16 [climateprediction.net] Giving up on upload of 006q_000025227_0_5.zip: File downloaded was not the correct file or was garbage from bad URL 2004-08-05 19:36:17 [climateprediction.net] Finished download of 006x_000025234.zip 2004-08-05 19:36:17 [climateprediction.net] Approximate throughput 7020.394246 bytes/sec 2004-08-05 19:36:17 [climateprediction.net] Starting computation for result 006x_000025234_0 using hadsm3 version 4.02 Starting model in /root/boinc/projects/climateprediction.net... Archive: hadsm3data_4.02_i686-pc-linux-gnu.zip creating: 006x_000025234/dataout/ inflating: 006x_000025234/dataout/thist creating: 006x_000025234/jobs/ inflating: 006x_000025234/jobs/control.stashc inflating: 006x_000025234/jobs/double.stashc inflating: 006x_000025234/jobs/Recona.12 inflating: 006x_000025234/jobs/Recona.13 inflating: 006x_000025234/jobs/spec3a_lw_3_asol2c_hadcm3 inflating: 006x_000025234/jobs/spec3a_sw_3_asol2b_hadcm3 inflating: 006x_000025234/jobs/spin.stashc inflating: 006x_000025234/jobs/yabsd.ihist inflating: 006x_000025234/jobs/yabsd.PRESM_A extracting: 006x_000025234/jobs/yabsd.PRESM_O extracting: 006x_000025234/jobs/yabsd.PRESM_S extracting: 006x_000025234/jobs/yabsd.PRESM_W creating: 006x_000025234/tmp/ inflating: 006x_000025234/tmp/cache2 inflating: 006x_000025234/tmp/cp.namelists extracting: 006x_000025234/tmp/pipe_dummy creating: 006x_000025234/viz/ inflating: 006x_000025234/viz/globe.rgb inflating: 006x_000025234/registration_license.txt creating: 006x_000025234/datain/ creating: 006x_000025234/datain/ancil/ creating: 006x_000025234/datain/ancil/ctldata/ creating: 006x_000025234/datain/ancil/ctldata/stasets/ inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01001218 inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01002207 inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01003236 inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01003237 extracting: 006x_000025234/datain/ancil/ctldata/stasets/X01003254 inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01003255 inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01003274 inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01003275 inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01003276 inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01003277 inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01003278 inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01003279 inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01003280 inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01003281 inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01003286 inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01005207 inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01005208 inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01005222 inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01005223 inflating: 006x_000025234/datain/ancil/ctldata/stasets/X01010206 creating: 006x_000025234/datain/ancil/ctldata/STASHmaster/ inflating: 006x_000025234/datain/ancil/ctldata/STASHmaster/STASHmaster_A inflating: 006x_000025234/datain/ancil/ctldata/STASHmaster/STASHmaster_O inflating: 006x_000025234/datain/ancil/ctldata/STASHmaster/STASHmaster_S inflating: 006x_000025234/datain/ancil/ctldata/STASHmaster/STASHmaster_W inflating: 006x_000025234/datain/ancil/qrclim.icedp.32 inflating: 006x_000025234/datain/ancil/qrclim.newsst5.32 inflating: 006x_000025234/datain/ancil/qrclim.ozone_preind_corr inflating: 006x_000025234/datain/ancil/qrclim.uvcurr.32 creating: 006x_000025234/datain/dumps/ inflating: 006x_000025234/datain/dumps/slab32_1810.start inflating: 006x_000025234/datain/lats inflating: 006x_000025234/datain/ppcodes Archive: 006x_000025234.zip inflating: 006x_000025234/jobs/climate.spin inflating: 006x_000025234/jobs/climate.cont inflating: 006x_000025234/jobs/climate.doub inflating: 006x_000025234/jobs/ncatts.cpdc Created shared memory region key = 24810 Env Used=LD_LIBRARY_PATH=/root/boinc/projects/climateprediction.net:/usr/local/lib:/usr/lib:/lib Copying files for startup... In pre_initialise_phase (part 1 of 3) In initialise_phase (part 2 of 3) In startup_phase (part 3 of 3) Starting model ID 006x_000025234 Phase 1 Stack size=48.00 MB Waiting for model startup, this may take a minute... 006x_000025234 - PH 1 TS 000001 - 01/12/1810 00:30 - H:M:S=0000:00:00 AVG= 0.00 DLT= 0.00 006x_000025234 - PH 1 TS 000002 - 01/12/1810 01:00 - H:M:S=0000:00:09 AVG= 5.00 DLT=10.00 Model crashed...retrying... adding: ncatts.cpdc (deflated 72%) adding: climate.cont (deflated 78%) adding: climate.cpdc (deflated 79%) adding: climate.doub (deflated 78%) adding: climate.spin (deflated 79%) adding: 006x_000025234.xml (deflated 70%) adding: ncatts.cpdc (deflated 72%) adding: ncatts.cpdc (deflated 72%) adding: ncatts.cpdc (deflated 72%) adding: stderr_um.txt (deflated 74%) adding: yabsd.out (deflated 93%) adding: restart.day (deflated 43%) 2004-08-05 19:36:34 [climateprediction.net] Unrecoverable error for result 006x_000025234_0 (process exited with code 251 (0xfb)) 2004-08-05 19:36:34 [climateprediction.net] Unrecoverable error for result 006x_000025234_0 (process exited with code 251 (0xfb)) |
Send message Joined: 5 Aug 04 Posts: 39 Credit: 14,887 RAC: 0 |
Even after a complete wipeout of the BOINC directory including client_state etc. did it do it again... Running Linux Gentoo on a 2.6.5r1 kernel P4 hyperthreaded 2.4Ghz but running only one simulation at a time. |
Send message Joined: 5 Aug 04 Posts: 1496 Credit: 95,522,203 RAC: 0 |
Hi, Janus, I haven't heard of anyone running it in root (sounds dangerous, security-wise). Might be worth a try in your /home directory -- if for no other reason than to eliminate a possible conflict associated with root privileges. Otherwise. it sounds like one for Carl. ________________________________________________ Washing one's hands of the conflict between the powerful and the powerless means to side with the powerful, not to be neutral. -- Paulo Freire (1921-1997), educator, author. |
Send message Joined: 5 Aug 04 Posts: 907 Credit: 299,864 RAC: 0 |
Hi Janus: when I get into the office today I will see if I can find your upload server, the interesting stuff should be in the "yabsd.out" which is sent up. My guess is the model can be flakey with overclocking, or perhaps there is another library that I forgot to compile into the model (i.e. I tried to statically link in everything so different Linux versions wouldn't cause problems with the "sensitive" model) |
Send message Joined: 5 Aug 04 Posts: 39 Credit: 14,887 RAC: 0 |
I have been looking into it a bit more and found two errors. I don't know anything about the sourcecode, so can't say if they are important or not: stderr_um.txt: forrtl: info: Fortran error message number is 63. forrtl: warning: Could not open message catalog: ifcore_msg.cat. forrtl: info: Check environment variable NLSPATH and protection of /usr/lib/ifcore_msg.cat. yabsd.out: Model completed with the following : Error Code : 1 Message : P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. And a list of a LOT of points with negative pressure. Also in another run these errors are preceeded by: DAS- K_WEAK reset: NaN NaN DAS- K_WEAK reset: NaN NaN DAS- K_WEAK reset: NaN NaN DAS- K_WEAK reset: NaN NaN DAS- LSP_FORM- QCL and DELTA not updated: NaN 1800.000 NaN I'm going to try to run a few models today again and see if the error is the same. [Update] I have now installed the client under another user on the same system and it fails the same way - runs a few steps and then dies and uploads. So it wasn't because I was running it as root. |
Send message Joined: 5 Aug 04 Posts: 907 Credit: 299,864 RAC: 0 |
OK from those error messages obviously the Fortran code is causing trouble. Do you have a /usr/lib/ifcore_msg.cat? I think it's an Intel Fortran library, so perhaps it's a conflict with other libraries you may have installed? |
Send message Joined: 5 Aug 04 Posts: 39 Credit: 14,887 RAC: 0 |
> OK from those error messages obviously the Fortran code is causing trouble. > Do you have a /usr/lib/ifcore_msg.cat? I think it's an Intel Fortran library, > so perhaps it's a conflict with other libraries you may have installed? No I haven't got that file - at least I couldn't find it in the specified location. |
Send message Joined: 5 Aug 04 Posts: 36 Credit: 2,559,795 RAC: 0 |
Nope, Not just you. I have the same specific problem on forge - a dual 3.06G HT xeon machine with no other system loading of importance. this box is running 2.6.7-latest.smp FC2 linux, no ifcore libs here either. This is **entire output** up to the failure: 2004-08-17 18:18:48 [---] Starting BOINC client version 4.02 for i686-pc-linux-gnu 2004-08-17 18:18:48 [climateprediction.net] Project prefs: using your defaults 2004-08-17 18:18:48 [climateprediction.net] Host ID not assigned yet 2004-08-17 18:18:48 [---] General prefs: from climateprediction.net (last modified 2004-08-15 16:37:46) 2004-08-17 18:18:48 [---] General prefs: using your defaults 2004-08-17 18:18:48 [---] Running CPU benchmarks 2004-08-17 18:18:48 [---] Suspending computation and network activity - running CPU benchmarks 2004-08-17 18:19:49 [---] Benchmark results: 2004-08-17 18:19:49 [---] Number of CPUs: 4 2004-08-17 18:19:49 [---] 653 double precision MIPS (Whetstone) per CPU 2004-08-17 18:19:49 [---] 1248 integer MIPS (Dhrystone) per CPU 2004-08-17 18:19:49 [---] Finished CPU benchmarks 2004-08-17 18:19:50 [---] Resuming computation and network activity 2004-08-17 18:19:50 [---] CPU scheduler starvation imminent; requesting more work 2004-08-17 18:19:51 [---] CPU scheduler starvation imminent; requesting more work 2004-08-17 18:19:51 [climateprediction.net] Requesting 691200 seconds of work 2004-08-17 18:19:51 [climateprediction.net] Sending request to scheduler: http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi 2004-08-17 18:19:51 [climateprediction.net] Scheduler RPC to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi succeeded 2004-08-17 18:19:51 [---] General prefs: from climateprediction.net (last modified 2004-08-15 16:37:46) 2004-08-17 18:19:51 [---] General prefs: no separate prefs for home; using your defaults 2004-08-17 18:19:51 [climateprediction.net] Project prefs: no separate prefs for home; using your defaults 2004-08-17 18:19:52 [climateprediction.net] Started download of hadsm3_4.02_i686-pc-linux-gnu 2004-08-17 18:19:52 [climateprediction.net] Started download of hadsm3se_4.02_i686-pc-linux-gnu.zip 2004-08-17 18:19:55 [---] CPU scheduler starvation imminent; requesting more work 2004-08-17 18:19:55 [climateprediction.net] Requesting 691200 seconds of work 2004-08-17 18:19:55 [climateprediction.net] Sending request to scheduler: http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi 2004-08-17 18:19:57 [climateprediction.net] Scheduler RPC to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi succeeded 2004-08-17 18:19:57 [---] CPU scheduler starvation imminent; requesting more work 2004-08-17 18:19:57 [climateprediction.net] Requesting 691200 seconds of work 2004-08-17 18:19:57 [climateprediction.net] Sending request to scheduler: http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi 2004-08-17 18:19:58 [climateprediction.net] Scheduler RPC to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi succeeded 2004-08-17 18:19:58 [---] CPU scheduler starvation imminent; requesting more work 2004-08-17 18:19:58 [climateprediction.net] Requesting 691200 seconds of work 2004-08-17 18:19:58 [climateprediction.net] Sending request to scheduler: http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi 2004-08-17 18:19:59 [climateprediction.net] Scheduler RPC to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi succeeded 2004-08-17 18:20:06 [climateprediction.net] Finished download of hadsm3_4.02_i686-pc-linux-gnu 2004-08-17 18:20:06 [climateprediction.net] Approximate throughput 80435.695686 bytes/sec 2004-08-17 18:20:06 [climateprediction.net] Started download of hadsm3um_4.02_i686-pc-linux-gnu.zip 2004-08-17 18:20:31 [climateprediction.net] Finished download of hadsm3se_4.02_i686-pc-linux-gnu.zip 2004-08-17 18:20:31 [climateprediction.net] Approximate throughput 97371.666877 bytes/sec 2004-08-17 18:20:31 [climateprediction.net] Started download of hadsm3data_4.02_i686-pc-linux-gnu.zip 2004-08-17 18:20:32 [climateprediction.net] Finished download of hadsm3um_4.02_i686-pc-linux-gnu.zip 2004-08-17 18:20:32 [climateprediction.net] Approximate throughput 100634.645973 bytes/sec 2004-08-17 18:20:32 [climateprediction.net] Started download of 03n2_000029703.zip 2004-08-17 18:20:32 [climateprediction.net] Finished download of 03n2_000029703.zip 2004-08-17 18:20:32 [climateprediction.net] Approximate throughput 24594.815698 bytes/sec 2004-08-17 18:20:32 [climateprediction.net] Started download of 005z_000025200.zip 2004-08-17 18:20:33 [climateprediction.net] Finished download of 005z_000025200.zip 2004-08-17 18:20:33 [climateprediction.net] Approximate throughput 23927.738069 bytes/sec 2004-08-17 18:20:33 [climateprediction.net] Started download of 0300_000028873.zip 2004-08-17 18:20:33 [climateprediction.net] Finished download of 0300_000028873.zip 2004-08-17 18:20:33 [climateprediction.net] Approximate throughput 19755.729853 bytes/sec 2004-08-17 18:20:33 [climateprediction.net] Started download of 0052_000025167.zip 2004-08-17 18:20:34 [climateprediction.net] Finished download of 0052_000025167.zip 2004-08-17 18:20:34 [climateprediction.net] Approximate throughput 18356.052990 bytes/sec 2004-08-17 18:20:58 [climateprediction.net] Finished download of hadsm3data_4.02_i686-pc-linux-gnu.zip 2004-08-17 18:20:58 [climateprediction.net] Approximate throughput 165114.078013 bytes/sec 2004-08-17 18:20:58 [climateprediction.net] Starting computation for result 03n2_000029703_1 using hadsm3 version 4.02 2004-08-17 18:20:58 [climateprediction.net] Starting computation for result 005z_000025200_1 using hadsm3 version 4.02 Starting model in /misc/boinc/projects/climateprediction.net... 2004-08-17 18:20:58 [climateprediction.net] Starting computation for result 0300_000028873_1 using hadsm3 version 4.02 2004-08-17 18:20:58 [climateprediction.net] Starting computation for result 0052_000025167_1 using hadsm3 version 4.02 Starting model in /misc/boinc/projects/climateprediction.net... Archive: hadsm3se_4.02_i686-pc-linux-gnu.zip inflating: ./hadsm3se_4.02_i686-pc-linux-gnu Archive: hadsm3se_4.02_i686-pc-linux-gnu.zip Starting model in /misc/boinc/projects/climateprediction.net... Archive: hadsm3um_4.02_i686-pc-linux-gnu.zip inflating: ./hadsm3um_4.02_i686-pc-linux-gnu Starting model in /misc/boinc/projects/climateprediction.net... Archive: hadsm3data_4.02_i686-pc-linux-gnu.zip creating: 005z_000025200/dataout/ inflating: 005z_000025200/dataout/thist creating: 005z_000025200/jobs/ inflating: 005z_000025200/jobs/control.stashc inflating: 005z_000025200/jobs/double.stashc inflating: 005z_000025200/jobs/Recona.12 inflating: 005z_000025200/jobs/Recona.13 inflating: 005z_000025200/jobs/spec3a_lw_3_asol2c_hadcm3 inflating: 005z_000025200/jobs/spec3a_sw_3_asol2b_hadcm3 inflating: 005z_000025200/jobs/spin.stashc inflating: 005z_000025200/jobs/yabsd.ihist inflating: 005z_000025200/jobs/yabsd.PRESM_A extracting: 005z_000025200/jobs/yabsd.PRESM_O extracting: 005z_000025200/jobs/yabsd.PRESM_S extracting: 005z_000025200/jobs/yabsd.PRESM_W creating: 005z_000025200/tmp/ inflating: 005z_000025200/tmp/cache2 inflating: ./hadsm3se_4.02_i686-pc-linux-gnu inflating: 005z_000025200/tmp/cp.namelists extracting: 005z_000025200/tmp/pipe_dummy creating: 005z_000025200/viz/ inflating: 005z_000025200/viz/globe.rgb inflating: 005z_000025200/registration_license.txt creating: 005z_000025200/datain/ creating: 005z_000025200/datain/ancil/ creating: 005z_000025200/datain/ancil/ctldata/ creating: 005z_000025200/datain/ancil/ctldata/stasets/ inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01001218 inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01002207 inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01003236 inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01003237 extracting: 005z_000025200/datain/ancil/ctldata/stasets/X01003254 inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01003255 inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01003274 inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01003275 inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01003276 inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01003277 inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01003278 inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01003279 inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01003280 inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01003281 inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01003286 inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01005207 inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01005208 inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01005222 inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01005223 inflating: 005z_000025200/datain/ancil/ctldata/stasets/X01010206 creating: 005z_000025200/datain/ancil/ctldata/STASHmaster/ inflating: 005z_000025200/datain/ancil/ctldata/STASHmaster/STASHmaster_A inflating: 005z_000025200/datain/ancil/ctldata/STASHmaster/STASHmaster_O inflating: 005z_000025200/datain/ancil/ctldata/STASHmaster/STASHmaster_S inflating: 005z_000025200/datain/ancil/ctldata/STASHmaster/STASHmaster_W inflating: 005z_000025200/datain/ancil/qrclim.icedp.32 inflating: 005z_000025200/datain/ancil/qrclim.newsst5.32 inflating: 005z_000025200/datain/ancil/qrclim.ozone_preind_corr inflating: 005z_000025200/datain/ancil/qrclim.uvcurr.32 inflating: ./viz inflating: ./libGL.so.1 creating: 005z_000025200/datain/dumps/ inflating: 005z_000025200/datain/dumps/slab32_1810.start inflating: ./viz inflating: ./libGL.so.1 Archive: hadsm3data_4.02_i686-pc-linux-gnu.zip creating: 0052_000025167/dataout/ inflating: 0052_000025167/dataout/thist creating: 0052_000025167/jobs/ inflating: 0052_000025167/jobs/control.stashc inflating: 0052_000025167/jobs/double.stashc inflating: 0052_000025167/jobs/Recona.12 inflating: 0052_000025167/jobs/Recona.13 inflating: 0052_000025167/jobs/spec3a_lw_3_asol2c_hadcm3 inflating: 0052_000025167/jobs/spec3a_sw_3_asol2b_hadcm3 inflating: 0052_000025167/jobs/spin.stashc inflating: 0052_000025167/jobs/yabsd.ihist inflating: 0052_000025167/jobs/yabsd.PRESM_A extracting: 0052_000025167/jobs/yabsd.PRESM_O extracting: 0052_000025167/jobs/yabsd.PRESM_S extracting: 0052_000025167/jobs/yabsd.PRESM_W creating: 0052_000025167/tmp/ inflating: 0052_000025167/tmp/cache2 inflating: ./libGLU.so.1 inflating: ./libglut.so.3 inflating: 0052_000025167/tmp/cp.namelists extracting: 0052_000025167/tmp/pipe_dummy inflating: ./libGLU.so.1 creating: 0052_000025167/viz/ inflating: 0052_000025167/viz/globe.rgb inflating: ./hadsm3viz_4.02_i686-pc-linux-gnu inflating: 005z_000025200/datain/lats inflating: 005z_000025200/datain/ppcodes Archive: 005z_000025200.zip inflating: 005z_000025200/jobs/climate.spin inflating: ./libglut.so.3 inflating: 005z_000025200/jobs/climate.cont inflating: 005z_000025200/jobs/climate.doub inflating: 005z_000025200/jobs/ncatts.cpdc Created shared memory region key = 24390 inflating: ./hadsm3viz_4.02_i686-pc-linux-gnu inflating: 0052_000025167/registration_license.txt creating: 0052_000025167/datain/ creating: 0052_000025167/datain/ancil/ creating: 0052_000025167/datain/ancil/ctldata/ creating: 0052_000025167/datain/ancil/ctldata/stasets/ inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01001218 inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01002207 inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01003236 inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01003237 extracting: 0052_000025167/datain/ancil/ctldata/stasets/X01003254 inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01003255 inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01003274 inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01003275 inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01003276 inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01003277 inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01003278 inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01003279 inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01003280 inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01003281 inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01003286 inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01005207 inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01005208 inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01005222 inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01005223 inflating: 0052_000025167/datain/ancil/ctldata/stasets/X01010206 creating: 0052_000025167/datain/ancil/ctldata/STASHmaster/ inflating: 0052_000025167/datain/ancil/ctldata/STASHmaster/STASHmaster_A Archive: hadsm3data_4.02_i686-pc-linux-gnu.zip creating: 0300_000028873/dataout/ inflating: 0300_000028873/dataout/thist creating: 0300_000028873/jobs/ inflating: 0300_000028873/jobs/control.stashc inflating: 0300_000028873/jobs/double.stashc inflating: 0052_000025167/datain/ancil/ctldata/STASHmaster/STASHmaster_O inflating: 0300_000028873/jobs/Recona.12 inflating: 0300_000028873/jobs/Recona.13 inflating: 0300_000028873/jobs/spec3a_lw_3_asol2c_hadcm3 inflating: 0052_000025167/datain/ancil/ctldata/STASHmaster/STASHmaster_S inflating: 0052_000025167/datain/ancil/ctldata/STASHmaster/STASHmaster_W inflating: 0052_000025167/datain/ancil/qrclim.icedp.32 inflating: 0300_000028873/jobs/spec3a_sw_3_asol2b_hadcm3 inflating: 0300_000028873/jobs/spin.stashc inflating: 0300_000028873/jobs/yabsd.ihist inflating: 0300_000028873/jobs/yabsd.PRESM_A extracting: 0300_000028873/jobs/yabsd.PRESM_O extracting: 0300_000028873/jobs/yabsd.PRESM_S extracting: 0300_000028873/jobs/yabsd.PRESM_W creating: 0300_000028873/tmp/ inflating: 0300_000028873/tmp/cache2 inflating: 0052_000025167/datain/ancil/qrclim.newsst5.32 Archive: hadsm3data_4.02_i686-pc-linux-gnu.zip creating: 03n2_000029703/dataout/ inflating: 03n2_000029703/dataout/thist creating: 03n2_000029703/jobs/ inflating: 03n2_000029703/jobs/control.stashc inflating: 03n2_000029703/jobs/double.stashc inflating: 03n2_000029703/jobs/Recona.12 inflating: 03n2_000029703/jobs/Recona.13 inflating: 0052_000025167/datain/ancil/qrclim.ozone_preind_corr inflating: 03n2_000029703/jobs/spec3a_lw_3_asol2c_hadcm3 inflating: 03n2_000029703/jobs/spec3a_sw_3_asol2b_hadcm3 inflating: 03n2_000029703/jobs/spin.stashc inflating: 03n2_000029703/jobs/yabsd.ihist inflating: 03n2_000029703/jobs/yabsd.PRESM_A extracting: 03n2_000029703/jobs/yabsd.PRESM_O extracting: 03n2_000029703/jobs/yabsd.PRESM_S extracting: 03n2_000029703/jobs/yabsd.PRESM_W creating: 03n2_000029703/tmp/ inflating: 03n2_000029703/tmp/cache2 inflating: 0052_000025167/datain/ancil/qrclim.uvcurr.32 creating: 0052_000025167/datain/dumps/ inflating: 0052_000025167/datain/dumps/slab32_1810.start inflating: 0300_000028873/tmp/cp.namelists extracting: 0300_000028873/tmp/pipe_dummy creating: 0300_000028873/viz/ inflating: 0300_000028873/viz/globe.rgb Env Used=LD_LIBRARY_PATH=/misc/boinc/projects/climateprediction.net:/usr/local/lib:/usr/lib:/lib Copying files for startup... In pre_initialise_phase (part 1 of 3) In initialise_phase (part 2 of 3) In startup_phase (part 3 of 3) inflating: 03n2_000029703/tmp/cp.namelists adding: ncatts.cpdc (deflated 72%) extracting: 03n2_000029703/tmp/pipe_dummy creating: 03n2_000029703/viz/ adding: climate.cont inflating: 03n2_000029703/viz/globe.rgb (deflated 79%) adding: climate.doub (deflated 79%) adding: climate.spin (deflated 79%) adding: 005z_000025200.xml (deflated 66%) adding: ncatts.cpdc (deflated 72%) adding: ncatts.cpdc (deflated 72%) adding: ncatts.cpdc (deflated 72%) inflating: 0300_000028873/registration_license.txt creating: 0300_000028873/datain/ creating: 0300_000028873/datain/ancil/ creating: 0300_000028873/datain/ancil/ctldata/ creating: 0300_000028873/datain/ancil/ctldata/stasets/ inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01001218 inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01002207 inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01003236 inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01003237 extracting: 0300_000028873/datain/ancil/ctldata/stasets/X01003254 inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01003255 inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01003274 inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01003275 inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01003276 inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01003277 inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01003278 inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01003279 inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01003280 inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01003281 inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01003286 inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01005207 inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01005208 inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01005222 inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01005223 inflating: 0300_000028873/datain/ancil/ctldata/stasets/X01010206 creating: 0300_000028873/datain/ancil/ctldata/STASHmaster/ inflating: 0300_000028873/datain/ancil/ctldata/STASHmaster/STASHmaster_A inflating: 0300_000028873/datain/ancil/ctldata/STASHmaster/STASHmaster_O inflating: 0300_000028873/datain/ancil/ctldata/STASHmaster/STASHmaster_S inflating: 03n2_000029703/registration_license.txt inflating: 0300_000028873/datain/ancil/ctldata/STASHmaster/STASHmaster_W creating: 03n2_000029703/datain/ creating: 03n2_000029703/datain/ancil/ creating: 03n2_000029703/datain/ancil/ctldata/ creating: 03n2_000029703/datain/ancil/ctldata/stasets/ inflating: 0300_000028873/datain/ancil/qrclim.icedp.32 inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01001218 inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01002207 inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01003236 inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01003237 extracting: 03n2_000029703/datain/ancil/ctldata/stasets/X01003254 inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01003255 inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01003274 inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01003275 inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01003276 inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01003277 inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01003278 inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01003279 inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01003280 inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01003281 inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01003286 inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01005207 inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01005208 inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01005222 inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01005223 inflating: 03n2_000029703/datain/ancil/ctldata/stasets/X01010206 creating: 03n2_000029703/datain/ancil/ctldata/STASHmaster/ inflating: 03n2_000029703/datain/ancil/ctldata/STASHmaster/STASHmaster_A inflating: 03n2_000029703/datain/ancil/ctldata/STASHmaster/STASHmaster_O inflating: 03n2_000029703/datain/ancil/ctldata/STASHmaster/STASHmaster_S inflating: 03n2_000029703/datain/ancil/ctldata/STASHmaster/STASHmaster_W inflating: 03n2_000029703/datain/ancil/qrclim.icedp.32 inflating: 0300_000028873/datain/ancil/qrclim.newsst5.32 inflating: 0300_000028873/datain/ancil/qrclim.ozone_preind_corr inflating: 0300_000028873/datain/ancil/qrclim.uvcurr.32 inflating: 03n2_000029703/datain/ancil/qrclim.newsst5.32 inflating: 0052_000025167/datain/lats inflating: 0052_000025167/datain/ppcodes Archive: 0052_000025167.zip inflating: 0052_000025167/jobs/climate.spin inflating: 0052_000025167/jobs/climate.cont inflating: 0052_000025167/jobs/climate.doub inflating: 0052_000025167/jobs/ncatts.cpdc Created shared memory region key = 24070 inflating: 03n2_000029703/datain/ancil/qrclim.ozone_preind_corr creating: 0300_000028873/datain/dumps/ inflating: 0300_000028873/datain/dumps/slab32_1810.start inflating: 03n2_000029703/datain/ancil/qrclim.uvcurr.32 creating: 03n2_000029703/datain/dumps/ inflating: 03n2_000029703/datain/dumps/slab32_1810.start inflating: 0300_000028873/datain/lats inflating: 0300_000028873/datain/ppcodes Archive: 0300_000028873.zip inflating: 0300_000028873/jobs/climate.spin inflating: 0300_000028873/jobs/climate.cont inflating: 0300_000028873/jobs/climate.doub inflating: 0300_000028873/jobs/ncatts.cpdc Created shared memory region key = 24340 Env Used=LD_LIBRARY_PATH=/misc/boinc/projects/climateprediction.net:/usr/local/lib:/usr/lib:/lib Copying files for startup... In pre_initialise_phase (part 1 of 3) In initialise_phase (part 2 of 3) In startup_phase (part 3 of 3) adding: ncatts.cpdc (deflated 72%) adding: climate.cont (deflated 79%) adding: climate.doub (deflated 79%) adding: climate.spin (deflated 79%) adding: 0052_000025167.xml inflating: 03n2_000029703/datain/lats (deflated 66%) adding: ncatts.cpdc inflating: 03n2_000029703/datain/ppcodes (deflated 72%) adding: ncatts.cpdc (deflated 72%) Archive: 03n2_000029703.zip inflating: 03n2_000029703/jobs/climate.spin adding: ncatts.cpdc inflating: 03n2_000029703/jobs/climate.cont (deflated 72%) inflating: 03n2_000029703/jobs/climate.doub inflating: 03n2_000029703/jobs/ncatts.cpdc Created shared memory region key = 24565 Env Used=LD_LIBRARY_PATH=/misc/boinc/projects/climateprediction.net:/usr/local/lib:/usr/lib:/lib Copying files for startup... In pre_initialise_phase (part 1 of 3) In initialise_phase (part 2 of 3) In startup_phase (part 3 of 3) adding: ncatts.cpdc (deflated 72%) adding: climate.cont (deflated 79%) adding: climate.doub (deflated 78%) adding: climate.spin (deflated 79%) adding: 0300_000028873.xml (deflated 66%) adding: ncatts.cpdc (deflated 72%) adding: ncatts.cpdcEnv Used=LD_LIBRARY_PATH=/misc/boinc/projects/climateprediction.net:/usr/local/lib:/usr/lib:/lib Copying files for startup... In pre_initialise_phase (part 1 of 3) In initialise_phase (part 2 of 3) In startup_phase (part 3 of 3) (deflated 72%) adding: ncatts.cpdc (deflated 72%) adding: ncatts.cpdc (deflated 72%) adding: climate.cont (deflated 79%) adding: climate.doub (deflated 79%) adding: climate.spin (deflated 79%) adding: 03n2_000029703.xml (deflated 66%) adding: ncatts.cpdc (deflated 72%) adding: ncatts.cpdc (deflated 72%) adding: ncatts.cpdc (deflated 72%) 2004-08-17 18:20:59 [climateprediction.net] Unrecoverable error for result 03n2_000029703_1 (process exited with code 251 (0xfb)) 2004-08-17 18:20:59 [climateprediction.net] Unrecoverable error for result 03n2_000029703_1 (process exited with code 251 (0xfb)) 2004-08-17 18:20:59 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds 2004-08-17 18:20:59 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds 2004-08-17 18:20:59 [climateprediction.net] Computation for result 03n2_000029703 finished 2004-08-17 18:20:59 [climateprediction.net] Unrecoverable error for result 005z_000025200_1 (process exited with code 251 (0xfb)) 2004-08-17 18:20:59 [climateprediction.net] Unrecoverable error for result 005z_000025200_1 (process exited with code 251 (0xfb)) 2004-08-17 18:20:59 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds 2004-08-17 18:20:59 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds 2004-08-17 18:20:59 [climateprediction.net] Started upload of 03n2_000029703_1_1.zip 2004-08-17 18:20:59 [climateprediction.net] Started upload of 03n2_000029703_1_2.zip 2004-08-17 18:20:59 [climateprediction.net] Computation for result 005z_000025200 finished 2004-08-17 18:20:59 [climateprediction.net] Unrecoverable error for result 0300_000028873_1 (process exited with code 251 (0xfb)) 2004-08-17 18:20:59 [climateprediction.net] Unrecoverable error for result 0300_000028873_1 (process exited with code 251 (0xfb)) 2004-08-17 18:20:59 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds 2004-08-17 18:20:59 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds 2004-08-17 18:20:59 [climateprediction.net] Computation for result 0300_000028873 finished 2004-08-17 18:20:59 [climateprediction.net] Unrecoverable error for result 0052_000025167_1 (process exited with code 251 (0xfb)) 2004-08-17 18:20:59 [climateprediction.net] Unrecoverable error for result 0052_000025167_1 (process exited with code 251 (0xfb)) 2004-08-17 18:20:59 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds 2004-08-17 18:20:59 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds 2004-08-17 18:20:59 [climateprediction.net] Computation for result 0052_000025167 finished 2004-08-17 18:20:59 [climateprediction.net] Finished upload of 03n2_000029703_1_1.zip 2004-08-17 18:20:59 [climateprediction.net] Approximate throughput 4598.970499 bytes/sec 2004-08-17 18:21:00 [climateprediction.net] Started upload of 03n2_000029703_1_3.zip 2004-08-17 18:21:00 [climateprediction.net] Finished upload of 03n2_000029703_1_2.zip 2004-08-17 18:21:00 [climateprediction.net] Approximate throughput 27627.143844 bytes/sec 2004-08-17 18:21:00 [climateprediction.net] Started upload of 03n2_000029703_1_4.zip 2004-08-17 18:21:00 [climateprediction.net] Finished upload of 03n2_000029703_1_3.zip 2004-08-17 18:21:00 [climateprediction.net] Approximate throughput 3673.207244 bytes/sec 2004-08-17 18:21:00 [climateprediction.net] Started upload of 03n2_000029703_1_5.zip 2004-08-17 18:21:00 [climateprediction.net] Finished upload of 03n2_000029703_1_4.zip 2004-08-17 18:21:00 [climateprediction.net] Approximate throughput 4316.120637 bytes/sec 2004-08-17 18:21:00 [climateprediction.net] Started upload of 005z_000025200_1_1.zip 2004-08-17 18:21:01 [climateprediction.net] Finished upload of 03n2_000029703_1_5.zip 2004-08-17 18:21:01 [climateprediction.net] Approximate throughput 4631.697766 bytes/sec 2004-08-17 18:21:01 [climateprediction.net] Started upload of 005z_000025200_1_2.zip 2004-08-17 18:21:01 [climateprediction.net] Finished upload of 005z_000025200_1_1.zip 2004-08-17 18:21:01 [climateprediction.net] Approximate throughput 4700.718916 bytes/sec 2004-08-17 18:21:01 [climateprediction.net] Started upload of 005z_000025200_1_3.zip 2004-08-17 18:21:01 [---] Received signal 2 2004-08-17 18:21:01 [---] Exit requested by user I'm planning a re-install of FC2 on this box tomorrow - the 3ware raid controller gave me some fits the first time, so this will be a more surgical install of the system - it is the only one of my "big iron" doing this. jsc (Xcamel) |
Send message Joined: 16 Aug 04 Posts: 156 Credit: 9,035,872 RAC: 2,928 |
I have problems on my both Linux boxes, RH8 and FC1, keeps crashing. They are downclocked now to moderate settings but still unstable in CPDN, hmm..? |
Send message Joined: 5 Aug 04 Posts: 39 Credit: 14,887 RAC: 0 |
> I have problems on my both Linux boxes, RH8 and FC1, keeps crashing. > They are downclocked now to moderate settings but still unstable in CPDN, > hmm..? Now this is weird since I tested it on RH9 and it worked fine... RH8 and FC1 shares quite a lot of code with RH9... |
Send message Joined: 5 Aug 04 Posts: 39 Credit: 14,887 RAC: 0 |
Ok, problem solved! It turned out to be a problem in the 2.6.5-gentoo-r1 kernel scheduler (I guess). Updating to the new 2.6.8-gentoo-r4 solved all problems! =) |
Send message Joined: 5 Aug 04 Posts: 1120 Credit: 17,193,804 RAC: 2,852 |
> OK from those error messages obviously the Fortran code is causing trouble. > Do you have a /usr/lib/ifcore_msg.cat? I think it's an Intel Fortran library, > so perhaps it's a conflict with other libraries you may have installed? > I DO NOT HAVE THIS PROBLEM. (In fact, ClimatePrediction seems to be running OK for me.) I run Red Hat Enterprise Linux 3 ES. But I am curious. I have no ifcore_msg.cat anywhere on my system. I hve no ifcore anywhere on my system. I have only the following .cat files on my system. $ locate .cat /homeB/jdbeyer/W95/quickenw/Intellic.cat /opt/IBM/db2/V8.1/msg/en_US.iso88591/db2icons.cat /opt/IBM/db2/V8.1/msg/en_US.iso88591/db2inst.cat /opt/IBM/db2/V8.1/msg/en_US.iso88591/db2install.cat /opt/IBM/db2/V8.1/msg/en_US.iso88591/db2istring.cat /usr/src/linux-2.4.21-20.EL/drivers/usb/.catc.o.flags /usr/src/linux-2.4.21-20.EL/fs/hfs/.catalog.o.flags /usr/src/linux-2.4.21-15.0.3.EL/drivers/usb/.catc.o.flags /usr/src/linux-2.4.21-15.0.3.EL/fs/hfs/.catalog.o.flags /usr/share/apps/ksgmltools2/docbook/xml-dtd-4.1.2/docbook.cat /usr/share/linuxdoc-tools/linuxdoc-tools.catalog /etc/sgml/sgml-docbook.cat /etc/sgml/xml-docbook.cat /etc/sgml/sgml-docbook-3.0-1.0-17.2.cat /etc/sgml/sgml-docbook-3.1-1.0-17.2.cat /etc/sgml/sgml-docbook-4.0-1.0-17.2.cat /etc/sgml/sgml-docbook-4.1-1.0-17.2.cat /etc/sgml/xml-docbook-4.1.2-1.0-17.2.cat /etc/sgml/sgml-docbook-4.2-1.0-17.2.cat /etc/sgml/xml-docbook-4.2-1.0-17.2.cat Can you really assume the existance of such files in all Linux distributions? My guess is that Red Hat Enterprise Linux is "fairly standard", whatever that may mean. Also, I am not sure what you mean by Intel Fortran Library since most Linux systems I know of run GNU compilation systems (e.g., gcc, g++, g77). I assume that by statically linking the client applications, you are evading this problem, but if so, why ask the O.P. about that particular library (if that is what it is)? I thought libraries ended in .a or .so... Perhaps your post came before you started statically linking. |
©2024 cpdn.org