Message boards : Number crunching : ANOTHER UPLOAD PROBLEM
Message board moderation
Previous · 1 . . . 12 · 13 · 14 · 15 · 16 · 17 · 18 . . . 33 · Next
Author | Message |
---|---|
Send message Joined: 28 Jul 11 Posts: 2 Credit: 61,196 RAC: 0 |
rapid-watch seems to be up, but with a big backlog. Like locally, here, almost 100 63GB uploads to go. Yep, you were right, thanks :) |
Send message Joined: 22 Feb 06 Posts: 491 Credit: 30,992,465 RAC: 14,585 |
Mine finally cleared betwen 7:00 and 8:30 (UK time) this morning. No new tasks though:-(( |
Send message Joined: 3 Nov 10 Posts: 39 Credit: 2,494,427 RAC: 0 |
...odd behavior of 3n work unit at 100%...elapsed clock still running, completion clock shows "---", status "Running", and no messages about upload attempt... would this indicate waiting for server or something else ??? frank |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
The only 3n that you seem to have, has the Time step of 1,036,800 as the last one received. This is the last trickle_up, so it looks like that one just doesn't know when to call it quits. You could try Exiting from BOINC and then restarting it, to see if that gets it going and gets you the Over Success Done set of messages, but otherwise just Abort it. |
Send message Joined: 3 Nov 10 Posts: 39 Credit: 2,494,427 RAC: 0 |
hello les did the exit/restart of BOINC as you suggested...status changed to "Computation error" and message file said that the 4.zip file could not be found...and that was the end...at 11,508 credits out of 12,440 or so... on the bright side, none of the wingmen on this task got past zero !!! frank |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Strange, but the only other option was to Abort. Credits are a different matter. The scripts get run occasionally now, as per the discussion in the Credits? thread. |
Send message Joined: 31 Dec 07 Posts: 1152 Credit: 22,363,583 RAC: 5,022 |
What do you know, it�s Friday night and we seem to have a upload problem. I presently have 2 zip files from 2 hadcm3s (1 from each) stuck in my transfer tab. I wonder how many I�ll have by Monday? |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Yes, it seems that the server at BADC has failed. Been reported. |
Send message Joined: 31 Aug 04 Posts: 391 Credit: 219,896,461 RAC: 649 |
Friday night server failures are us. :) Been happening for years. Hope the denier crew don't get a conspriancy theory out of this |
Send message Joined: 15 May 09 Posts: 4540 Credit: 19,020,584 RAC: 20,684 |
Just had one go through with no problems. :) Someone must have come in on overtime to kick the box. |
Send message Joined: 1 Jan 07 Posts: 1061 Credit: 36,708,278 RAC: 9,361 |
Just had one go through with no problems. :) Someone must have come in on overtime to kick the box. The BADC server is on an independent site - the British Atmospheric Data Centre - which I would expect would aim for 24/7 operation on normal time. It's currently showing "The CEDA site and web services have been fullly resolved following this morning's fault interupting services." (though that message is dated two days ago) |
Send message Joined: 31 Dec 07 Posts: 1152 Credit: 22,363,583 RAC: 5,022 |
It�s nice that the the CEDA says that it it fully up and running, but, I now have 4 hadcm3s zip files stuck in my transfer tab. Still getting the transient HTTP error. Messages follow: 10/18/2014 3:20:21 PM | climateprediction.net | Started upload of hadcm3s_3dtd_1993_2_009067812_1_1.zip 10/18/2014 3:20:21 PM | climateprediction.net | Started upload of hadcm3s_2wro_2003_2_009071879_0_1.zip 10/18/2014 3:23:23 PM | climateprediction.net | Temporarily failed upload of hadcm3s_3dtd_1993_2_009067812_1_1.zip: transient HTTP error 10/18/2014 3:23:23 PM | climateprediction.net | Backing off 00:08:24 on upload of hadcm3s_3dtd_1993_2_009067812_1_1.zip 10/18/2014 3:23:23 PM | climateprediction.net | Temporarily failed upload of hadcm3s_2wro_2003_2_009071879_0_1.zip: transient HTTP error 10/18/2014 3:23:23 PM | climateprediction.net | Backing off 00:06:22 on upload of hadcm3s_2wro_2003_2_009071879_0_1.zip 10/18/2014 3:23:26 PM | | Project communication failed: attempting access to reference site 10/18/2014 3:23:28 PM | | Internet access OK - project servers may be temporarily down. 10/18/2014 3:27:35 PM | climateprediction.net | Started upload of hadcm3s_2rwz_1981_2_009052675_1_2.zip 10/18/2014 3:27:58 PM | climateprediction.net | Temporarily failed upload of hadcm3s_2rwz_1981_2_009052675_1_2.zip: transient HTTP error 10/18/2014 3:27:58 PM | climateprediction.net | Backing off 03:32:46 on upload of hadcm3s_2rwz_1981_2_009052675_1_2.zip 10/18/2014 3:28:01 PM | | Project communication failed: attempting access to reference site 10/18/2014 3:28:03 PM | | Internet access OK - project servers may be temporarily down. 10/18/2014 3:29:46 PM | climateprediction.net | Started upload of hadcm3s_2wro_2003_2_009071879_0_1.zip 10/18/2014 3:30:09 PM | climateprediction.net | Temporarily failed upload of hadcm3s_2wro_2003_2_009071879_0_1.zip: transient HTTP error 10/18/2014 3:30:09 PM | climateprediction.net | Backing off 00:13:52 on upload of hadcm3s_2wro_2003_2_009071879_0_1.zip 10/18/2014 3:30:11 PM | | Project communication failed: attempting access to reference site 10/18/2014 3:30:13 PM | | Internet access OK - project servers may be temporarily down. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
That message just says that the data centre is back up. It says nothing about individual programs and services being run on the possibly many servers. And, as the Badwatch server that the BADC people use to store the data sent from modellers is far away from Oxford, and nothing to do with our Oxford people, our problem will have to wait until Jonathan sends them an email and the BADC IT people do something about it. |
Send message Joined: 15 May 09 Posts: 4540 Credit: 19,020,584 RAC: 20,684 |
Got another coming up this morning, will see what happens. Interestingly, the last one that made it took 15 minutes which which is several times longer than normal. I wonder if this indicates that the BADC server is choked at some point? Could be anywhere from data going in from the interweb to the actual server. |
Send message Joined: 15 May 09 Posts: 4540 Credit: 19,020,584 RAC: 20,684 |
Got another coming up this morning, will see what happens. I am now getting the transient http error as well. |
Send message Joined: 31 Dec 07 Posts: 1152 Credit: 22,363,583 RAC: 5,022 |
Looks like these people work a 5 day work week just the ones at Oxford.Still not movement on the uploads. And it seems that now Seti is down also. |
Send message Joined: 15 May 09 Posts: 4540 Credit: 19,020,584 RAC: 20,684 |
Two more zips gone to BADC server. Again, taking 16 minutes each about twice the time they normally do. |
Send message Joined: 1 Jan 07 Posts: 1061 Credit: 36,708,278 RAC: 9,361 |
Looks like these people work a 5 day work week just the ones at Oxford.Still not movement on the uploads. And it seems that now Seti is down also. SETI is back up, and all my CPDN uploads have cleared. CPDN upload speed (when I watched the last one uploading) was just about the maximum my ADSL line can sustain, given what else was going on at the time - I can get about 1 Mbit/sec for a single upload, but only 500 Kbit/sec each if there are two uploads active at the same time). |
Send message Joined: 31 Dec 07 Posts: 1152 Credit: 22,363,583 RAC: 5,022 |
All of mine have cleared also. |
Send message Joined: 31 Mar 05 Posts: 44 Credit: 234,235 RAC: 0 |
20/10/2014 10:46:13 AM | climateprediction.net | Task hadam3p_anz_ron8_2012_1_008958519_1 exited with zero status but no 'finished' file 20/10/2014 10:46:13 AM | climateprediction.net | If this happens repeatedly you may need to reset the project. 20/10/2014 10:46:13 AM | climateprediction.net | Task hadam3p_anz_ron6_2012_1_008958517_1 exited with zero status but no 'finished' file 20/10/2014 10:46:13 AM | climateprediction.net | If this happens repeatedly you may need to reset the project. Wasn't quite sure where to post this. Every time I restart my computer, I get the above message. I now have a total of 14 trickles, nothing gets updated and now I am beginning to wonder if I should reset as the message says? |
©2024 cpdn.org