Questions and Answers : Unix/Linux : Model finished but won\'t submit
Message board moderation
Author | Message |
---|---|
Send message Joined: 26 Aug 04 Posts: 13 Credit: 458,996 RAC: 0 |
model 25687 seems finished but the client seems hung and won\'t send it or request a new model. The boinc.log shows 2004-10-23 22:02:51 [---] Starting BOINC client version 4.05 for i686-pc-linux-gnu 2004-10-23 22:02:51 [climateprediction.net] Project prefs: no separate prefs for home; using your defaults 2004-10-23 22:02:51 [climateprediction.net] Host ID is 2649 2004-10-23 22:02:51 [---] General prefs: from climateprediction.net (last modified 2004-08-25 22:38:01) 2004-10-23 22:02:51 [---] General prefs: no separate prefs for home; using your defaults Nothing in stderr. boinc is running, but nothing else seems to happen. The directory is full of zip files. What\'s it waiting for. I have other clients that have sent and completed their jobs... No diff if I kill and restart the boinc process. Brian |
Send message Joined: 26 Aug 04 Posts: 13 Credit: 458,996 RAC: 0 |
Unfortunately there has been no reply to this message, nor has the client resumed any activity. What do I do? The model seems done, but it won't send it nor will it download a new one. Brian |
Send message Joined: 5 Aug 04 Posts: 1283 Credit: 15,824,334 RAC: 0 |
Hi Brian, <a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=25687">Your result</a> has uploaded its phase 3 mega trickle and has full credit, so it's definitely finished. But the status information says otherwise. Do you have files 00yy_300026243_0_[n].zip (where n is 1 to 5) in your climateprediction.net directory? These are the ones that should be uploaded by BOINC. Also, does your climateprediction.net/00yy_300026243 directory only contain 1 xml file and 348 zip files? Anything else would indicate that something's gone wrong in the post-processing of the model. If so, are there any stdout and stderr files in the directory (they might indicate what might have gone wrong) and are there any hadsm3* programs running? <br><a href="http://www.teampicard.net"><img src="http://www.teampicard.net/templates/fisubice/images/phpbb2_logo.jpg"></a><a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/team_display.php?teamid=3">Join us here</a> |
Send message Joined: 26 Aug 04 Posts: 13 Credit: 458,996 RAC: 0 |
> Hi Brian, > > <a> href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=25687">Your > result</a> has uploaded its phase 3 mega trickle and has full credit, so it's > definitely finished. But the status information says otherwise. > > Do you have files 00yy_300026243_0_[n].zip (where n is 1 to 5) in your > climateprediction.net directory? These are the ones that should be uploaded > by BOINC. No, I don't think so. The only zip files in that directory are -rw------- 1 brian brian 10547 Aug 26 17:25 00yy_300026243.zip -rw------- 1 brian brian 10573 Oct 18 22:00 2x0k_100157473.zip -rw------- 1 brian brian 4493630 Aug 26 17:25 hadsm3data_4.03_i686-pc-linux-gnu.zip -rw------- 1 brian brian 4493630 Oct 18 22:00 hadsm3data_4.04_i686-pc-linux-gnu.zip -rw------- 1 brian brian 3803382 Aug 26 17:25 hadsm3se_4.03_i686-pc-linux-gnu.zip -rw------- 1 brian brian 4010230 Aug 26 17:25 hadsm3um_4.03_i686-pc-linux-gnu.zip -rw------- 1 brian brian 4010230 Oct 18 21:57 hadsm3um_4.04_i686-pc-linux-gnu.zip Looks like the orginal model, plus the next one to start, but not the completed results of the first. > > Also, does your climateprediction.net/00yy_300026243 directory only contain 1 > xml file and 348 zip files? Close, I have 1 00yy_300026243.xml and 364 zip files. > > Anything else would indicate that something's gone wrong in the > post-processing of the model. If so, are there any stdout and stderr files in > the directory (they might indicate what might have gone wrong) and are there > any hadsm3* programs running? There stderr file is empty, and the log files show only what I posted, tho I may have accidently deleted the log file from the day it quit. The only boinc related processes running are boinc itself, no hadsm3 processes. Thanks for the reply, hope this helps diag the problem. Brian |
Send message Joined: 5 Aug 04 Posts: 1283 Credit: 15,824,334 RAC: 0 |
> No, I don't think so. The only zip files in that directory are > -rw------- 1 brian brian 10547 Aug 26 17:25 00yy_300026243.zip > -rw------- 1 brian brian 10573 Oct 18 22:00 2x0k_100157473.zip > -rw------- 1 brian brian 4493630 Aug 26 17:25 hadsm3data_4.03_i686-pc-linux-gnu.zip > -rw------- 1 brian brian 4493630 Oct 18 22:00 hadsm3data_4.04_i686-pc-linux-gnu.zip > -rw------- 1 brian brian 3803382 Aug 26 17:25 hadsm3se_4.03_i686-pc-linux-gnu.zip > -rw------- 1 brian brian 4010230 Aug 26 17:25 hadsm3um_4.03_i686-pc-linux-gnu.zip > -rw------- 1 brian brian 4010230 Oct 18 21:57 hadsm3um_4.04_i686-pc-linux-gnu.zip It looks like your problem lies here, Brian. The download for the new job is incomplete. You're missing hadsm3se_4.04_i686-pc-linux-gnu.zip, and the new model can't run without it. BOINC isn't very good a retrying download failures, and there are 2 ways you could try to get round it. First thing to try is stopping BOINC and copying the missing file from one of your other Linux boxes (if hadsm3_4.04_i686-pc-linux-gnu is also missing you'll need to copy that one too). The new model should start running when you restart BOINC. If that doesn't work your best option is to run BOINC with the option <b>-reset_project http://climateprediction.net</b>. That will force the download to be retried, but a new model will be downloaded. > Close, I have 1 00yy_300026243.xml and 364 zip files. Which means that all the post-processing completed. <br><a href="http://www.teampicard.net"><img src="http://www.teampicard.net/templates/fisubice/images/phpbb2_logo.jpg"></a><a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/team_display.php?teamid=3">Join us here</a> |
Send message Joined: 26 Aug 04 Posts: 13 Credit: 458,996 RAC: 0 |
> > Close, I have 1 00yy_300026243.xml and 364 zip files. > > Which means that all the post-processing completed. > <br><a href="http://www.teampicard.net"><img> |
Send message Joined: 5 Aug 04 Posts: 907 Credit: 299,864 RAC: 0 |
OK, it looks like your original model finished and uploaded the result files (there would be 00yy*_[1-5].zip files otherwise), but you just need to reset to complete download of the 4.04 CPDN files. |
Send message Joined: 5 Aug 04 Posts: 39 Credit: 87,633 RAC: 0 |
> OK, it looks like your original model finished and uploaded the result files > (there would be 00yy*_[1-5].zip files otherwise), but you just need to reset > to complete download of the 4.04 CPDN files. http://climateapps2.oucs.ox.ac.uk/cpdnboinc/workunit.php?wuid=15836 (the model where the result belongs to) still is in state "In Progress" with outcome "unknown" though |
Send message Joined: 26 Aug 04 Posts: 13 Credit: 458,996 RAC: 0 |
I agree, it doesn't look like it was completely uploaded. To get it going again, I had to reset the boinc client, so its off working on a new model. I still have the work dir and all the zip files, but I don't know how to submit them. Brian |
©2024 cpdn.org