Message boards : Number crunching : Uploads not working
Message board moderation
Previous · 1 · 2 · 3 · 4
Author | Message |
---|---|
Send message Joined: 17 Aug 04 Posts: 289 Credit: 44,103,664 RAC: 0 |
Just reporting some good news. zip files seem to be uploading. |
Send message Joined: 19 Aug 05 Posts: 104 Credit: 1,866,495 RAC: 0 |
For my units the PNW units are uploading good, the EU units have been just setting here. One system is working on it's last model, hope there is new work this week. Keep on crunching Pizza@Home |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
PNW goes directly to Uni of Oregon, USA, so don't count. New work won't even be considered until ALL of the server problems are sorted, which may be another week yet. Michaelmas Term starts in a weeks time. or thereabouts, so Long Vacation will finish in a few days, and all of the IT people who scarpered as soon as it started should be back soon, and dealing with problems in their various areas. Backups: Here |
Send message Joined: 21 Oct 10 Posts: 53 Credit: 2,101,753 RAC: 3,985 |
Just when I was about to write "it's stuck with me too" it started to upload again :) edit : well then it gets stuck again, then it restarts again... so I guess we'll have to wait for the return of the Jedi... |
Send message Joined: 31 Dec 07 Posts: 1152 Credit: 22,363,583 RAC: 5,022 |
I don�t believe that there is much that you can do to speed this up. The only real solution is to wait for the server problems to be fixed. You might suspend network activity so that the stuck zip file doesn�t keep trying to upload. If you are running other types of WU�s or other Boinc projects you can reenable network activity about once a day to let other types to upload and then resuspend. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
To: J. Patrick Malone I've hidden your post to stop spammers from getting your email address. As for mailing results back to the project, this isn't how BOINC projects work. You'll just have to wait patiently like all of us. If you read back through this thread, you'll find one of my earlier posts, where I listed the only steps that can be taken. Backups: Here |
Send message Joined: 8 Sep 10 Posts: 6 Credit: 1,475,984 RAC: 0 |
FWIW, my long queue of EU model uploads has decreased and some of the uploads are now getting through. |
Send message Joined: 16 May 07 Posts: 10 Credit: 2,368,487 RAC: 0 |
Here we go again :( 03/10/2012 12:01:08 | climateprediction.net | [error] Error reported by file upload server: can't write file /storage/incoming/uploader//hadam3p_eu_2r82_1972_1_008189180_0_7.zip: No space left on server 03/10/2012 12:01:08 | climateprediction.net | Temporarily failed upload of hadam3p_eu_2r82_1972_1_008189180_0_7.zip: transient upload error 03/10/2012 12:01:08 | climateprediction.net | Backing off 9 hr 1 min 29 sec on upload of hadam3p_eu_2r82_1972_1_008189180_0_7.zip |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
It's more a matter of "still" rather than "again". Have you read the News thread? It could be next week before the bulk of the uploads get through. Backups: Here |
Send message Joined: 16 May 07 Posts: 10 Credit: 2,368,487 RAC: 0 |
But all my uploads & of others went through so I thought the problem(s) were fixed that's why the "again". Never mind. And, of course, I've read both the News and Announcements plus the other threads (Uploads not working, Server out of disk space,...) not to mention my own topic "Permanent HTTP Error". |
Send message Joined: 15 May 09 Posts: 4540 Credit: 19,039,635 RAC: 18,944 |
If everyone were to pick a day of the week and a time to enable internet activity, it would reduce the load on the server after outages. Even if quite a few people chose the same day, it would reduce the hammering when first back on line. Perhaps the information where people sign up should suggest this? I know it is nice to look at stats and see how you are doing but I am sure most of us could cope with getting our fix once a week rather than several times a day?........... |
Send message Joined: 14 Apr 05 Posts: 31 Credit: 16,491,691 RAC: 0 |
Possibly, but the point is that this issue has been present for some time. I currently have 20 eu zip files unable to upload. Hopefully we will be told when this has been resolved, on the news thread - although they did say "next week" about a week ago... Brian |
Send message Joined: 17 Aug 04 Posts: 289 Credit: 44,103,664 RAC: 0 |
Just a few minuets ago while reading this thread, my 24 eu zip files are starting to upload. Yay! |
Send message Joined: 17 Aug 04 Posts: 289 Credit: 44,103,664 RAC: 0 |
Just reporting some good news. all my 24 eu zip files have now uploaded at 175 kbps. Well done to the team @ Oxford! and thank Jonathan Miller CPDN SysAdmin |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
I managed to get my remaining 16 EUs to upload 'overnight'. So it's not fixed yet, just "getting there". Data is still being moved off a couple of servers to storage, but more is coming in just as fast. I've been watching this in the messages on one of my computers, as 16 files slowly uploaded. According to the Status page yesterday, there were over 135,000 tasks running, and now it says 127,477, so it's coming down. Just thinking out loud, if only a quarter of those "running" were due to pending uploads, and each one only had a quarter of their files waiting, that's about 90,000 zips fighting each other for disk space. It must be somewhat like a person running an ultra-marathon through vast swarms of stampeding elephants, rinos and wildebeests, while juggling a dozen sharp knives. There's been more hardware and software failures since the weekend, but Jonathan and Andy have their eyes on things. Backups: Here |
Send message Joined: 17 Aug 04 Posts: 289 Credit: 44,103,664 RAC: 0 |
Les thank you for your post, like you say we're not out of woods yet, there could be more bumps in the road ahead. best wishes to the team @ Oxford! and thank you again to Andy and Jonathan, CPDN SysAdmins for a job well done! Byron |
Send message Joined: 3 Sep 08 Posts: 23 Credit: 42,176,240 RAC: 10,608 |
Yes, thank you Les and others who work tirelessly to keep everything up and running. I just wanted to add an 'FYI' that the uploader server issues again seem to be interfering with attempts to download work from the 'reference site' onto a windows machine which I've just set up for CPDN number crunching. Searching the forum archives, my symptoms are the same as those experienced and explained in message 44708 (http://climateapps2.oerc.ox.ac.uk/cpdnboinc/forum_thread.php?id=7442&nowrap=true#44708). Per the advice given there, I'll just remain patient while everything is returned to normal. Thanks again, -- Jim |
Send message Joined: 15 May 09 Posts: 4540 Credit: 19,039,635 RAC: 18,944 |
According to the Status page yesterday, there were over 135,000 tasks running, My main machine has two tasks running but four more in the queue. As these are listed as being in progress when i look at the computer's page, I presume that those 135,000 include those queued on machines but not yet started? A trivial point, I know given the problems with hardware and software etc but it piqued my curiosity and I wondered how many tasks are actually, "in progress" My other linux machine doesn't have any in the queue at the moment so my own average would be half of those listed. I will leave it to someone else with more machines to work out something with more statistical validity! |
Send message Joined: 31 Aug 04 Posts: 391 Credit: 219,896,461 RAC: 649 |
According to the Status page yesterday, there were over 135,000 tasks running, The 135,000 number is inaccurate for at least two reasons. First, as you noted, some indeterminate number of those are waiting "Ready to start" on somebody's host(s). Another indeterminate number have downloaded to hosts that will never finish the task(s). I have 6 machines running 24/7 - 3 of them are somewhat fast. There's very little in the queue "Ready to start" but a whole lot pending upload. I am only letting each machine go online once a day to update stats and trickle up and download my preferred wu's from other projects. I let only one of them at a time stay online for a day (intil its upload queue clears, then I leave it online) -- that means one is network enabled each day until its upload queue clears - it will take a few days to finish the 80+ uploads (per fast host) that are still pending. My slower hosts are all caught up and online for new work. Running on a slowish DSL. Expect uploads to catch up within a couple of days. Getting downloads from time to time (probably old wu's that timed out and got resubmitted automatically) Figuring an overall reduction factor to adjust the supposed 130,000 tasks "out there" would be real difficult. I have prefs set to start downloading when any task is withing 28 hours of completing. So there's less than 25% ratio "Ready to start" versus "Running" here. I still have my 3 faster hosts that have more tasks "uploading" than they have "running" It will be a while - and I'm not going to push my uploads because there's lots of other people with worse network than what I have and I'm not going to do anything to overload the fragile servers. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
OK, my mistake. The label is actually Tasks in progress This is an abbreviation for: I've sent this number of work units to client computers, and they aren't yet on my work list as being completed or failed. Therefore they're still out there somewhere.. As for new work that's occasionally being received, that's due to the resubmission script being fired up to slowly produce new data sets in the sequence of that past work that has been returned intact. Backups: Here |
©2024 cpdn.org