climateprediction.net (CPDN) home page
Thread 'Model finished but won\'t submit'

Thread 'Model finished but won\'t submit'

Questions and Answers : Unix/Linux : Model finished but won\'t submit
Message board moderation

To post messages, you must log in.

AuthorMessage
old_user1265

Send message
Joined: 26 Aug 04
Posts: 13
Credit: 458,996
RAC: 0
Message 5595 - Posted: 24 Oct 2004, 3:49:00 UTC

model 25687 seems finished but the client
seems hung and won\'t send it or request
a new model.

The boinc.log shows
2004-10-23 22:02:51 [---] Starting BOINC client version 4.05 for i686-pc-linux-gnu
2004-10-23 22:02:51 [climateprediction.net] Project prefs: no separate prefs for home; using your defaults
2004-10-23 22:02:51 [climateprediction.net] Host ID is 2649
2004-10-23 22:02:51 [---] General prefs: from climateprediction.net (last modified 2004-08-25 22:38:01)
2004-10-23 22:02:51 [---] General prefs: no separate prefs for home; using your defaults

Nothing in stderr.

boinc is running, but nothing else seems to happen.
The directory is full of zip files. What\'s it
waiting for. I have other clients that have
sent and completed their jobs... No diff if I
kill and restart the boinc process.

Brian

ID: 5595 · Report as offensive     Reply Quote
old_user1265

Send message
Joined: 26 Aug 04
Posts: 13
Credit: 458,996
RAC: 0
Message 5685 - Posted: 27 Oct 2004, 4:01:10 UTC

Unfortunately there has been no reply to this message,
nor has the client resumed any activity.

What do I do? The model seems done, but it won't send
it nor will it download a new one.

Brian

ID: 5685 · Report as offensive     Reply Quote
ProfileThyme Lawn
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1283
Credit: 15,824,334
RAC: 0
Message 5693 - Posted: 27 Oct 2004, 12:21:13 UTC

Hi Brian,

<a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=25687">Your result</a> has uploaded its phase 3 mega trickle and has full credit, so it's definitely finished. But the status information says otherwise.

Do you have files 00yy_300026243_0_[n].zip (where n is 1 to 5) in your climateprediction.net directory? These are the ones that should be uploaded by BOINC.

Also, does your climateprediction.net/00yy_300026243 directory only contain 1 xml file and 348 zip files?

Anything else would indicate that something's gone wrong in the post-processing of the model. If so, are there any stdout and stderr files in the directory (they might indicate what might have gone wrong) and are there any hadsm3* programs running?
<br><a href="http://www.teampicard.net"><img src="http://www.teampicard.net/templates/fisubice/images/phpbb2_logo.jpg"></a><a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/team_display.php?teamid=3">Join us here</a>
ID: 5693 · Report as offensive     Reply Quote
old_user1265

Send message
Joined: 26 Aug 04
Posts: 13
Credit: 458,996
RAC: 0
Message 5704 - Posted: 28 Oct 2004, 0:46:15 UTC - in response to Message 5693.  

&gt; Hi Brian,
&gt;
&gt; <a> href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=25687"&gt;Your
&gt; result</a> has uploaded its phase 3 mega trickle and has full credit, so it's
&gt; definitely finished. But the status information says otherwise.
&gt;
&gt; Do you have files 00yy_300026243_0_[n].zip (where n is 1 to 5) in your
&gt; climateprediction.net directory? These are the ones that should be uploaded
&gt; by BOINC.

No, I don't think so. The only zip files in that directory are
-rw------- 1 brian brian 10547 Aug 26 17:25 00yy_300026243.zip
-rw------- 1 brian brian 10573 Oct 18 22:00 2x0k_100157473.zip
-rw------- 1 brian brian 4493630 Aug 26 17:25 hadsm3data_4.03_i686-pc-linux-gnu.zip
-rw------- 1 brian brian 4493630 Oct 18 22:00 hadsm3data_4.04_i686-pc-linux-gnu.zip
-rw------- 1 brian brian 3803382 Aug 26 17:25 hadsm3se_4.03_i686-pc-linux-gnu.zip
-rw------- 1 brian brian 4010230 Aug 26 17:25 hadsm3um_4.03_i686-pc-linux-gnu.zip
-rw------- 1 brian brian 4010230 Oct 18 21:57 hadsm3um_4.04_i686-pc-linux-gnu.zip

Looks like the orginal model, plus the next one to start,
but not the completed results of the first.

&gt;
&gt; Also, does your climateprediction.net/00yy_300026243 directory only contain 1
&gt; xml file and 348 zip files?

Close, I have 1
00yy_300026243.xml
and 364 zip files.

&gt;
&gt; Anything else would indicate that something's gone wrong in the
&gt; post-processing of the model. If so, are there any stdout and stderr files in
&gt; the directory (they might indicate what might have gone wrong) and are there
&gt; any hadsm3* programs running?

There stderr file is empty, and the log files show only what I posted, tho I may
have accidently deleted the log file from the day it quit.
The only boinc related processes running are boinc itself,
no hadsm3 processes.

Thanks for the reply, hope this helps diag the problem.

Brian

ID: 5704 · Report as offensive     Reply Quote
ProfileThyme Lawn
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1283
Credit: 15,824,334
RAC: 0
Message 5718 - Posted: 28 Oct 2004, 8:09:50 UTC - in response to Message 5704.  

&gt; No, I don't think so. The only zip files in that directory are
&gt; -rw------- 1 brian brian 10547 Aug 26 17:25 00yy_300026243.zip
&gt; -rw------- 1 brian brian 10573 Oct 18 22:00 2x0k_100157473.zip
&gt; -rw------- 1 brian brian 4493630 Aug 26 17:25 hadsm3data_4.03_i686-pc-linux-gnu.zip
&gt; -rw------- 1 brian brian 4493630 Oct 18 22:00 hadsm3data_4.04_i686-pc-linux-gnu.zip
&gt; -rw------- 1 brian brian 3803382 Aug 26 17:25 hadsm3se_4.03_i686-pc-linux-gnu.zip
&gt; -rw------- 1 brian brian 4010230 Aug 26 17:25 hadsm3um_4.03_i686-pc-linux-gnu.zip
&gt; -rw------- 1 brian brian 4010230 Oct 18 21:57 hadsm3um_4.04_i686-pc-linux-gnu.zip

It looks like your problem lies here, Brian. The download for the new job is incomplete. You're missing hadsm3se_4.04_i686-pc-linux-gnu.zip, and the new model can't run without it. BOINC isn't very good a retrying download failures, and there are 2 ways you could try to get round it.

First thing to try is stopping BOINC and copying the missing file from one of your other Linux boxes (if hadsm3_4.04_i686-pc-linux-gnu is also missing you'll need to copy that one too). The new model should start running when you restart BOINC.

If that doesn't work your best option is to run BOINC with the option <b>-reset_project http://climateprediction.net</b>. That will force the download to be retried, but a new model will be downloaded.

&gt; Close, I have 1 00yy_300026243.xml and 364 zip files.

Which means that all the post-processing completed.
<br><a href="http://www.teampicard.net"><img src="http://www.teampicard.net/templates/fisubice/images/phpbb2_logo.jpg"></a><a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/team_display.php?teamid=3">Join us here</a>
ID: 5718 · Report as offensive     Reply Quote
old_user1265

Send message
Joined: 26 Aug 04
Posts: 13
Credit: 458,996
RAC: 0
Message 5772 - Posted: 30 Oct 2004, 3:51:48 UTC - in response to Message 5718.  


&gt; &gt; Close, I have 1 00yy_300026243.xml and 364 zip files.
&gt;
&gt; Which means that all the post-processing completed.
&gt; <br><a href="http://www.teampicard.net"><img>
ID: 5772 · Report as offensive     Reply Quote
old_user1
Avatar

Send message
Joined: 5 Aug 04
Posts: 907
Credit: 299,864
RAC: 0
Message 5825 - Posted: 1 Nov 2004, 12:30:19 UTC

OK, it looks like your original model finished and uploaded the result files (there would be 00yy*_[1-5].zip files otherwise), but you just need to reset to complete download of the 4.04 CPDN files.

ID: 5825 · Report as offensive     Reply Quote
old_user169

Send message
Joined: 5 Aug 04
Posts: 39
Credit: 87,633
RAC: 0
Message 5827 - Posted: 1 Nov 2004, 14:32:02 UTC - in response to Message 5825.  

&gt; OK, it looks like your original model finished and uploaded the result files
&gt; (there would be 00yy*_[1-5].zip files otherwise), but you just need to reset
&gt; to complete download of the 4.04 CPDN files.


http://climateapps2.oucs.ox.ac.uk/cpdnboinc/workunit.php?wuid=15836
(the model where the result belongs to)

still is in state "In Progress" with outcome "unknown" though
ID: 5827 · Report as offensive     Reply Quote
old_user1265

Send message
Joined: 26 Aug 04
Posts: 13
Credit: 458,996
RAC: 0
Message 5886 - Posted: 3 Nov 2004, 16:33:46 UTC - in response to Message 5827.  

I agree, it doesn't look like it was completely uploaded.
To get it going again, I had to reset the boinc client, so
its off working on a new model.

I still have the work dir and all the zip files, but I don't
know how to submit them.

Brian

ID: 5886 · Report as offensive     Reply Quote

Questions and Answers : Unix/Linux : Model finished but won\'t submit

©2024 cpdn.org