Message boards : Number crunching : transient upload error
Author | Message |
---|---|
Send message Joined: 31 Aug 12 Posts: 40 Credit: 4,773 RAC: 0 |
I've been getting the same thing for what must be over a week now, and there aren't any problems with my internet connection. 10/2/2012 12:59:52 AM | climateprediction.net | [error] Error reported by file upload server: Server is out of disk space 10/2/2012 12:59:52 AM | climateprediction.net | Temporarily failed upload of hadam3p_eu_6avw_2007_1_008175177_0_7.zip: transient upload error 10/2/2012 12:59:52 AM | climateprediction.net | Backing off 5 hr 33 min 17 sec on upload of hadam3p_eu_6avw_2007_1_008175177_0_7.zip |
Send message Joined: 19 Aug 05 Posts: 104 Credit: 1,866,495 RAC: 0 |
Are you having problems with your mouse sending extra clicks? That is a known disk problem in the upload server; a replacement is being worked on. Keep on crunching Pizza@Home |
Send message Joined: 5 Aug 04 Posts: 1496 Credit: 95,522,203 RAC: 0 |
Eighteen redundant posts: 'Tis frustrating dealing with that, especially when the server is S-L-O-W. Edit: ... and 21 redundant posts in a separate thread. There is no point in posting in two places. 40 posts is overkill (to state the obvious). "We have met the enemy and he is us." -- Pogo Greetings from coastal Washington state, the scenic US Pacific Northwest. |
Send message Joined: 15 May 09 Posts: 4540 Credit: 19,013,957 RAC: 21,195 |
The transient upload error is back. Thu 11 Oct 2012 11:37:32 BST | climateprediction.net | Started upload of hadam3p_eu_w7zl_1992_1_006805705_2_3.zip Thu 11 Oct 2012 11:51:30 BST | climateprediction.net | [error] Error reported by file upload server: EOF on socket read : asked for 16382, got 14028 Thu 11 Oct 2012 11:51:30 BST | climateprediction.net | Temporarily failed upload of hadam3p_eu_w7zl_1992_1_006805705_2_3.zip: transient upload error Thu 11 Oct 2012 11:51:30 BST | climateprediction.net | Backing off 12 min 37 sec on upload of hadam3p_eu_w7zl_1992_1_006805705_2_3.zip Shortly before I looked at the messages, it was uploading, albeit at only 4.5KB/s, an order of magnitude slower than I normally get. I noticed the line "file upload server: EOF on socket read : asked for 16382, got 14028". Does this indicate a problem with the work unit? An hour or so earlier a zip file from the other task running on the machine went through OK, so it will be a few hours before I can check whether it is experiencing the same symptoms, and even longer till the next upload from my dual-core Atom machine! Edit: Watched the last try; the progress indicator on the upload gets to 100% before the error message. Do I abort the work unit? |
Send message Joined: 15 May 09 Posts: 4540 Credit: 19,013,957 RAC: 21,195 |
OK, panic over! It went through at the 7th attempt. I am left with a certain curiosity as to what the problem might have been, however, and if it is transferring 100% each time before telling me it has failed, that is a fair amount of bandwidth to waste. Will check when the next zip from that task goes through and report back. Dave |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Close examination of all the 'tell-tales' on my system (HD, the LAN socket LED, router, and modem) indicates that the "data transmission" isn't. It seems that BOINC has to go through the entire zip file to get to where it left off, and THEN it starts actually sending data to the internet. A bit like looking for where you're up to on a video tape that rewinds to the start each time, rather than a DVD, which can just jump to anywhere in a fraction of a second. Backups: Here |
Send message Joined: 15 May 09 Posts: 4540 Credit: 19,013,957 RAC: 21,195 |
Thanks Les. I am not so sure about the data not being transmitted, though, as my impression from the lights on the router was that data was being transmitted. Anyway, I will check in a few hours' time or tomorrow morning to see what happens with the next one. Dave |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
And another thing - clicking the Abort button doesn't just stop the current transfer, it causes BOINC to delete the zip file from the computer. Backups: Here |
Send message Joined: 15 May 09 Posts: 4540 Credit: 19,013,957 RAC: 21,195 |
Thanks for the reminder Les, I wasn't going to abort at least till I saw what happened with the next zip. If it happens even once I shall suspend network activity for a couple of days to see if things improve. Dave |
Send message Joined: 15 May 09 Posts: 4540 Credit: 19,013,957 RAC: 21,195 |
Fri 12 Oct 2012 13:33:06 BST | climateprediction.net | Temporarily failed upload of hadam3p_eu_w042_1961_1_006766698_2_6.zip: transient upload error Fri 12 Oct 2012 14:23:40 BST | climateprediction.net | Temporarily failed upload of hadam3p_eu_w7zl_1992_1_006805705_2_4.zip: transient upload error Got this along with the unexpected EOF on both of the latest zips, so I will try suspending internet activity for a while, though one did get through from my Atom box, albeit on the 2nd attempt. Dave |
Send message Joined: 25 Sep 12 Posts: 4 Credit: 32,662 RAC: 0 |
I am also getting an error; however, mine is slightly different. 10/22/2012 9:04:45 AM | climateprediction.net | Started upload of hadam3p_eu_2rhd_1982_1_008209863_0_11.zip 10/22/2012 9:04:45 AM | climateprediction.net | Started upload of hadam3p_eu_2kbg_1999_1_008210105_0_11.zip 10/22/2012 9:04:47 AM | climateprediction.net | Temporarily failed upload of hadam3p_eu_2rhd_1982_1_008209863_0_11.zip: transient HTTP error 10/22/2012 9:04:47 AM | climateprediction.net | Backing off 5 hr 6 min 30 sec on upload of hadam3p_eu_2rhd_1982_1_008209863_0_11.zip 10/22/2012 9:04:47 AM | climateprediction.net | Temporarily failed upload of hadam3p_eu_2kbg_1999_1_008210105_0_11.zip: transient HTTP error 10/22/2012 9:04:47 AM | climateprediction.net | Backing off 4 hr 19 min 15 sec on upload of hadam3p_eu_2kbg_1999_1_008210105_0_11.zip 10/22/2012 9:04:48 AM | climateprediction.net | Started upload of hadam3p_eu_2kbg_1999_1_008210105_0_12.zip 10/22/2012 9:04:49 AM | climateprediction.net | Temporarily failed upload of hadam3p_eu_2kbg_1999_1_008210105_0_12.zip: transient HTTP error 10/22/2012 9:04:49 AM | climateprediction.net | Backing off 4 hr 34 min 19 sec on upload of hadam3p_eu_2kbg_1999_1_008210105_0_12.zip This has been going on for over a week. I thought that it was because uploader1.atm was not running; however, after reading this I am not so sure... This is over 200 hours of work, and I really don't want to lose it.... |
Send message Joined: 15 Jan 11 Posts: 175 Credit: 6,242,691 RAC: 699 |
I might be wrong, but isn't a reasonable solution to suspend the project and wait until the server problems are sorted out? That way, network activity isn't affected on other projects. I did this a while ago when there were similar problems. I checked each day to see if it was worth a try. Worked for me and stopped possible loss of results. We can't get any more work at the moment anyway. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
New News post up at the top of this section. About the only thing that can be done on computers is to set the Network activity off. But then it needs to be manually watched by those people with multiple projects with short completion times. Perhaps once an hour switch it on briefly, and then off again when the other projects have uploaded. And Suspend cpdn from running in Tasks, so you don't get any more zips. Backups: Here |
Send message Joined: 25 Sep 12 Posts: 4 Credit: 32,662 RAC: 0 |
"Andy is looking into whether there is anything that can be done to cancel these work units." So what you're saying is that after spending 500 hours (the equivalent of ~21 days) crunching work for your project, your intention is to cancel the WU and give me no credit for 3 weeks of work? Am I reading this correctly? |
Send message Joined: 5 Aug 04 Posts: 1496 Credit: 95,522,203 RAC: 0 |
"Andy is looking into whether there is anything that can be done to cancel these work units." Hardly. To start with, a time-line analysis would be useful. Was your task in the lot Les mentioned? If its release can be squeezed into the recent few days, you might have a concern, but only for scientific relevance (identical tasks run twice). As has been mentioned on these boards, over and over and over..., credits are awarded for Trickles returned, not at the completion of the task. No credits withdrawn, no exceptions. "We have met the enemy and he is us." -- Pogo Greetings from coastal Washington state, the scenic US Pacific Northwest. |
Send message Joined: 6 Sep 12 Posts: 1 Credit: 11,715 RAC: 0 |
I am experiencing the same trouble as Dave Jackson. My hadam3p_eu files are not uploading. My user id is 684918. |
Send message Joined: 15 May 09 Posts: 4540 Credit: 19,013,957 RAC: 21,195 |
Though all mine have gone through and have been going through again for a while now. I am a day or so away from requesting any new work so with a bit of luck the problems on that score will be gone by the time I need some! |
Send message Joined: 25 Sep 12 Posts: 4 Credit: 32,662 RAC: 0 |
Firstly, thanks for responding, astro. I have looked at this thread and the News thread mentioned here, and I can find no reference as to how I can determine if my WUs are in a specific batch. I can tell you that they were both downloaded on 3 Oct 2012. As far as credits being awarded, I don't post or read the boards often, and considering how long this project has been around there must be tens of thousands of posts. So I do apologize for not knowing your credit system. I have run this project on two or three occasions and I have always had some kind of problem; there is a reason that out of over 250 million credits in 41 projects I have less than 12k in this one. It seems that I just have bad luck in Climate. I will just wait until y'all sort this one out. Thanks again for responding... |
Send message Joined: 25 Sep 12 Posts: 4 Credit: 32,662 RAC: 0 |
Woo Hoo, They are Gone!!! Good work Folks... |
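
For anyone who would rather tally the event-log messages quoted in this thread than eyeball them, here is a minimal Python sketch. It assumes the "timestamp | project | message" layout and the "Backing off ..." and "Temporarily failed upload of ..." wording exactly as pasted above; the function names (`split_log_line`, `parse_backoff`, `parse_failure`) are made up for illustration and are not part of BOINC.

```python
import re
from datetime import timedelta

# Matches the "Backing off ..." message text as quoted in this thread.
# Assumption: each of the hr / min / sec parts may be absent.
BACKOFF_RE = re.compile(
    r"Backing off"
    r"(?: (?P<hr>\d+) hr)?"
    r"(?: (?P<min>\d+) min)?"
    r"(?: (?P<sec>\d+) sec)?"
    r" on upload of (?P<file>\S+)"
)

# Matches "Temporarily failed upload of <file>: <reason>".
FAILED_RE = re.compile(
    r"Temporarily failed upload of (?P<file>\S+): (?P<reason>.+)"
)


def split_log_line(line):
    """Split 'timestamp | project | message' into three stripped fields.

    Raises ValueError if the line does not contain two ' | ' separators.
    """
    timestamp, project, message = (p.strip() for p in line.split(" | ", 2))
    return timestamp, project, message


def parse_backoff(message):
    """Return (filename, timedelta) for a back-off message, else None."""
    m = BACKOFF_RE.search(message)
    if m is None:
        return None
    delay = timedelta(
        hours=int(m["hr"] or 0),
        minutes=int(m["min"] or 0),
        seconds=int(m["sec"] or 0),
    )
    return m["file"], delay


def parse_failure(message):
    """Return (filename, reason) for a failed-upload message, else None."""
    m = FAILED_RE.search(message)
    if m is None:
        return None
    return m["file"], m["reason"].strip()


if __name__ == "__main__":
    sample = (
        "10/2/2012 12:59:52 AM | climateprediction.net | "
        "Backing off 5 hr 33 min 17 sec on upload of "
        "hadam3p_eu_6avw_2007_1_008175177_0_7.zip"
    )
    _, _, msg = split_log_line(sample)
    print(parse_backoff(msg))
```

Feeding each pasted line through `split_log_line` and then the two parsers makes it easy to see which zips keep failing and how long BOINC intends to wait before retrying each one.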
©2024 cpdn.org