Questions and Answers :
Windows :
Arrrggghhh Carl - error during final results upload, again... :-(
Message board moderation
Author | Message |
---|---|
Send message Joined: 5 Aug 04 Posts: 186 Credit: 1,612,182 RAC: 0 |
Close to 18 days crunching on <a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?field=Temperature&resultid=24713&phase=AT#graph">Result #24713</a> and the client throws an "upload error" during final results upload. :-( I wonder if this happened because I'm running BOINC Alpha v4.08 Carl..? Hmmnn, oh well, I guess I'll find out in an hour or so when 'Susan' completes her first BOINC model... Result #24713 <i>has</i> showed up in the 'Last 10 results returned' list - "Run Information Received: 12 Sep 2004 21:26:01 UTC." :-? <a href="http://www.nmvs.dsl.pipex.com/"><img src="http://boinc.mundayweb.com/cpdn/stats.php?userID=6&team=off&trans=off"></a> |
Send message Joined: 5 Aug 04 Posts: 390 Credit: 2,475,242 RAC: 0 |
What kind of error, Nick? Temporarily failed upload of ...? http://www.climateprediction.net/board/viewtopic.php?t=2321 |
Send message Joined: 5 Aug 04 Posts: 186 Credit: 1,612,182 RAC: 0 |
Susan's <a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=24658">Result #24658</a> threw the same error. :-( Honza: The error is listed at both the above links - I can't copy them here because it has XML tags & doesn't display properley. Basically, the CP-boinc servers refused to accept the "*_0_1.zip" file because "(Output file exceeded size limit)" both times. It lists as an "Unrecoverable error" although it is also shown as one of the "Last 10 results returned". :-? <a href="http://www.nmvs.dsl.pipex.com/"><img src="http://boinc.mundayweb.com/cpdn/stats.php?userID=6&team=off&trans=off"></a> |
Send message Joined: 5 Aug 04 Posts: 907 Credit: 299,864 RAC: 0 |
oh I know what that is, hopefully the upload will go through soon, give me an hour. I had made a lot of regional means added to the first file but the limit is 1MB for upload; I just have to recompile and distribute to the upload servers (about an hour or so). OK, I have just updated the upload servers to allow this slightly larger first file through, hopefully it will go through now? I'm not sure if BOINC "gives up" immediately on this error or the file is still there. If it gives up the uploading and the *_1.zip file is there can you just email it to me? Thanks! |
Send message Joined: 5 Aug 04 Posts: 186 Credit: 1,612,182 RAC: 0 |
> oh I know what that is, hopefully the upload will go through soon, give me an > hour. I had made a lot of regional means added to the first file but the > limit is 1MB for upload; I just have to recompile and distribute to the upload > servers (about an hour or so). > > OK, I have just updated the upload servers to allow this slightly larger first > file through, hopefully it will go through now? I'm not sure if BOINC "gives > up" immediately on this error or the file is still there. If it gives up the > uploading and the *_1.zip file is there can you just email it to me? Thanks! Nope, it's just gone I'm afraid - that wouldn't matter much for a SETI work_unit result but it's a <i>big</i> waste of resources for CP-boinc. :-( Pity it doesn't archive the file like classic CPDN does - oh well, good to know it's fixed now anyways... <a href="http://www.nmvs.dsl.pipex.com/"><img src="http://boinc.mundayweb.com/cpdn/stats.php?userID=6&team=off&trans=off"></a> |
Send message Joined: 5 Aug 04 Posts: 907 Credit: 299,864 RAC: 0 |
damn I wanted to see that too, I think yours was the first since I added the "regional means" from launch, a lot of fun stuff in there, so we get back averages on 29 regions on something like 100 fields. Oh well, that's what happened when I get rushed, they told me the "great idea" of adding regional means like a week before launch and I worked 7 days straight to get it out in the launch version. Sorry about that, but I guess it could have been worse (i.e. if I found out there will be 15K "smallexecs errors" on regional means, ack!) |
Send message Joined: 5 Aug 04 Posts: 186 Credit: 1,612,182 RAC: 0 |
> ...Sorry about that, but I guess it could have been worse. :Shrug: C'est la vie - just bad luck I had two machines finish a run at much the same time before you could fix it... > (i.e. if I found out there will be 15K "smallexecs errors" on regional means, ack!) Yeah, that would have been decidedly <i>ouch</i>..! ;-) <a href="http://www.nmvs.dsl.pipex.com/"><img src="http://boinc.mundayweb.com/cpdn/stats.php?userID=6&team=off&trans=off"></a> |
Send message Joined: 5 Aug 04 Posts: 186 Credit: 1,612,182 RAC: 0 |
Aaarrrrrggggghhhh, <i>again</i>; <img src="http://cpdn.tuxie.org/uk_nick/CP-boinc/Dilly_CP-boinc_error.png"> Dilly this time, with the <i>exact same error code</i> "-131" - <a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=24494">Result #24494</a> - it has also showed up okay in the "Last 10 Results Returned: " - That's three now from my machines. :-( Gah, I thought this was fixed - 'Alison' is due to finish her first CP-boinc run at 04:12 and 'Amanda' at 08:38 this morning - I hope they're not going to throw errors too..!?!? (Is that file size checked at this end too or only by the CP-boinc servers..?) Looking through the last 10 results returned I see that most are uploading okay but some others are throwing this same error. eg. <a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?field=Temperature&resultid=26620&phase=AT#graph">Result #26620</a>. (#26620 was run under BOINC v4.05, so it's not happening because I'm running BOINC v4.09) <a href="http://www.nmvs.dsl.pipex.com/"><img src="http://boinc.mundayweb.com/cpdn/stats.php?userID=6&team=off&trans=off"></a> PS: I have a backup of the whole CP-boinc folder from 40 minutes before results upload - any point in trying to run it through again Carl..? |
Send message Joined: 5 Aug 04 Posts: 186 Credit: 1,612,182 RAC: 0 |
And again, 'Alison' this time, same oversize file upload error "-131" for file "*_1.zip" - <a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?field=Temperature&resultid=26653&phase=AT#graph">Result #26653</a>. :-( I have temporarily disabled Amanda's network access so that she cannot attempt final results upload as yet... <a href="http://www.nmvs.dsl.pipex.com/"><img src="http://boinc.mundayweb.com/cpdn/stats.php?userID=6&team=off&trans=off"></a> |
Send message Joined: 5 Aug 04 Posts: 186 Credit: 1,612,182 RAC: 0 |
Amanda has also thrown the exact same error <i>whilst network access was disabled</i>, so it's <i>this</i> end, not the servers Carl. :? <img src="http://cpdn.tuxie.org/uk_nick/CP-boinc/Amanda_CP-boinc_error.png"> This is all that is in her "\climateprediction.net" folder: <img src="http://cpdn.tuxie.org/uk_nick/CP-boinc/Amanda_CP_folder_contents.png"> So "016j_300026516_0_1.zip" has already been deleted, even with network access disabled... <a href="http://www.nmvs.dsl.pipex.com/"><img src="http://boinc.mundayweb.com/cpdn/stats.php?userID=6&team=off&trans=off"></a> |
Send message Joined: 5 Aug 04 Posts: 186 Credit: 1,612,182 RAC: 0 |
And <a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?field=Temperature&resultid=30197&phase=AT#graph">another one</a> - this one was under BOINC v4.05, so it's not just me, nor the change to BOINC v4.08 ~ 4.09... (Heh, looks like a 'cold equator' too. ;-) <a href="http://www.nmvs.dsl.pipex.com/"><img src="http://boinc.mundayweb.com/cpdn/stats.php?userID=6&team=off&trans=off"></a> |
Send message Joined: 5 Aug 04 Posts: 1283 Credit: 15,824,334 RAC: 0 |
> Amanda has also thrown the exact same error <i>whilst network access was > disabled</i>, so it's <i>this</i> end, not the servers Carl. :? > > So "016j_300026516_0_1.zip" has already been deleted, even with network access > disabled... I've tracked this one down in the BOINC source code, and the problem seems to be in the client_state.xml file. There are a couple of {max_nbytes} entries in the file for each of the output files, and the values for the _1.zip result are 1000000.000000 and 1000000 in my files (set to 5000000.000000 and 5000000 for the other 4 result files). I would guess that it's possible to work-around the problem by stopping BOINC, manually editing the client_state files to increase the value of the field and restarting BOINC. But I couldn't possibly advise doing it unless you know exactly what you're doing ;-) I'm afraid it looks like there's another general workunit problem, Carl :( <a href="http://www.teampicard.net"><img src="http://www.teampicard.net/templates/fisubice/images/phpbb2_logo.jpg"></a><a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/team_display.php?teamid=3">Join us here</a> |
Send message Joined: 5 Aug 04 Posts: 390 Credit: 2,475,242 RAC: 0 |
Good tip, Thyme Lawn. Nick, you can also try max_nbytes 0.000000 as it states at apps files; means no limitation i guess. > I've tracked this one down in the BOINC source code, and the problem seems to > be in the client_state.xml file. There are a couple of {max_nbytes} entries in > the file for each of the output files, and the values for the _1.zip result > are 1000000.000000 and 1000000 in my files (set to 5000000.000000 and 5000000 > for the other 4 result files). > > I would guess that it's possible to work-around the problem by stopping BOINC, > manually editing the client_state files to increase the value of the field and > restarting BOINC. But I couldn't possibly advise doing it unless you know > exactly what you're doing ;-) > > I'm afraid it looks like there's another general workunit problem, Carl :( > > <a href="http://www.teampicard.net"><img> src="http://www.teampicard.net/templates/fisubice/images/phpbb2_logo.jpg"></a><a> href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/team_display.php?teamid=3">Join > us here</a> > |
Send message Joined: 5 Aug 04 Posts: 390 Credit: 2,475,242 RAC: 0 |
|
Send message Joined: 5 Aug 04 Posts: 186 Credit: 1,612,182 RAC: 0 |
Thyme Lawn wrote: > I've tracked this one down in the BOINC source code, and the problem seems to > be in the client_state.xml file. There are a couple of {max_nbytes} entries in > the file for each of the output files, and the values for the _1.zip result > are 1000000.000000 and 1000000 in my files (set to 5000000.000000 and 5000000 > for the other 4 result files). Okay, that looks like it - both 'Helen' and 'Tracy' also have the figure 1000000 instead of 5000000 so I guess I oughta edit them. > I would guess that it's possible to work-around the problem by stopping BOINC, > manually editing the client_state files to increase the value of the field and > restarting BOINC. But I couldn't possibly advise doing it unless you know > exactly what you're doing ;-) Hmmn, what text editor is going to alter those figures without screwing something else up, as 'notepad' is liable to do. :? > I'm afraid it looks like there's another general workunit problem, Carl :( This must have been present at a certain period only TL - all the models I've downloaded recently are okay. Carl: Is it worthwhile editing this figure in the backups I have from 'Dilly', 'Alison' & 'Amanda' then re-running them or should I just leave it..? <a href="http://www.nmvs.dsl.pipex.com/"><img src="http://boinc.mundayweb.com/cpdn/stats.php?userID=6&team=off&trans=off"></a> |
Send message Joined: 5 Aug 04 Posts: 907 Credit: 299,864 RAC: 0 |
I have changed the server side to accept up to 5MB for any CPDN/BOINC .zip file, but the remaining problem seems to be a batch of old workunits that I had the first file as 1MB upper limit. You can try the edit "1" to "5" for that _1.zip as Thyme Lawn pointed out, however don't do it on the bit as that will cause a validation error on the server (the servers are OK up to 5MB on all files, I've changed them all over). |
Send message Joined: 5 Aug 04 Posts: 186 Credit: 1,612,182 RAC: 0 |
> You can try the edit "1" to "5" for > that _1.zip as Thyme Lawn pointed out, however don't do it on the bit Carl: I don't understand what you mean by "don't do it on the bit"..? > as that > will cause a validation error on the server (the servers are OK up to 5MB on > all files, I've changed them all over). <a href="http://www.nmvs.dsl.pipex.com/"><img src="http://boinc.mundayweb.com/cpdn/stats.php?userID=6&team=off&trans=off"></a> |
Send message Joined: 5 Aug 04 Posts: 390 Credit: 2,475,242 RAC: 0 |
Carl, do you suggest that such value had been changed lately? Are those already uploaded models from beta? I'm quite confused there... > I have changed the server side to accept up to 5MB for any CPDN/BOINC .zip > file, but the remaining problem seems to be a batch of old workunits that I > had the first file as 1MB upper limit. You can try the edit "1" to "5" for > that _1.zip as Thyme Lawn pointed out, however don't do it on the bit as that > will cause a validation error on the server (the servers are OK up to 5MB on > all files, I've changed them all over). |
Send message Joined: 5 Aug 04 Posts: 186 Credit: 1,612,182 RAC: 0 |
> > You can try the edit "1" to "5" for > > that _1.zip as Thyme Lawn pointed out, however don't do it on the bit > > Carl: I don't understand what you mean by "don't do it on the bit"..? > > > as that > > will cause a validation error on the server (the servers are OK up to 5MB > on > > all files, I've changed them all over). Okay, - don't alter the figure in the 'signed xml' segment is what you meant... <a href="http://www.nmvs.dsl.pipex.com/"><img src="http://boinc.mundayweb.com/cpdn/stats.php?userID=6&team=off&trans=off"></a> |
Send message Joined: 5 Aug 04 Posts: 186 Credit: 1,612,182 RAC: 0 |
Carl: Okay, I've edited 'Helen' (23:01) and 'Tracy' (16:43) that are due to finish within 24 hours - all the rest already have 5000000 in there... Honza: All the models I've downloaded recently already have the 5000000 figure in there - it only seems to be from around the time of that 'fortran namelist' problem when I had to reset the project on a number of machines. <a href="http://www.nmvs.dsl.pipex.com/"><img src="http://boinc.mundayweb.com/cpdn/stats.php?userID=6&team=off&trans=off"></a> |
©2024 cpdn.org