climateprediction.net (CPDN) home page
Thread 'ANOTHER UPLOAD PROBLEM'

Thread 'ANOTHER UPLOAD PROBLEM'

Message boards : Number crunching : ANOTHER UPLOAD PROBLEM
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 18 · 19 · 20 · 21 · 22 · 23 · 24 . . . 33 · Next

AuthorMessage
ProfileByron Leigh Hatch @ team Carl ...
Avatar

Send message
Joined: 17 Aug 04
Posts: 289
Credit: 44,103,664
RAC: 0
Message 52289 - Posted: 22 Jul 2015, 3:41:22 UTC
Last modified: 22 Jul 2015, 4:09:07 UTC

it looks like project servers for pnw may be temporarily down :(

7/21/2015 8:30:24 PM | climateprediction.net | Temporarily failed upload of hadam3p_pnw_pox0_2013_1_009979930_1_6.zip: transient HTTP error
7/21/2015 8:30:24 PM | climateprediction.net | Backing off 00:02:44 on upload of hadam3p_pnw_pox0_2013_1_009979930_1_6.zip
7/21/2015 8:30:26 PM | | Project communication failed: attempting access to reference site
7/21/2015 8:30:27 PM | | Internet access OK - project servers may be temporarily down.
7/21/2015 8:30:40 PM | climateprediction.net | Temporarily failed upload of hadam3p_pnw_prai_2013_1_009982837_1_9.zip: transient HTTP error
7/21/2015 8:30:40 PM | climateprediction.net | Backing off 00:02:16 on upload of hadam3p_pnw_prai_2013_1_009982837_1_9.zip
7/21/2015 8:30:41 PM | | Project communication failed: attempting access to reference site
7/21/2015 8:30:42 PM | | Internet access OK - project servers may be temporarily down.
ID: 52289 · Report as offensive     Reply Quote
ProfileBonsai911

Send message
Joined: 9 Sep 04
Posts: 228
Credit: 30,750,791
RAC: 3,898
Message 52292 - Posted: 22 Jul 2015, 4:16:02 UTC

same over here:

22.07.2015 05:36:38 | climateprediction.net | [fxd] starting upload, upload_offset -1
22.07.2015 05:36:38 | climateprediction.net | Started upload of hadam3p_pnw_ppwa_2013_1_009981129_0_8.zip
22.07.2015 05:36:38 | climateprediction.net | [file_xfer] URL: http://cpdn-upload2.oerc.ox.ac.uk/cgi-bin/file_upload_handler

22.07.2015 05:41:43 | climateprediction.net | [file_xfer] http op done; retval -184 (transient HTTP error)
22.07.2015 05:41:43 | climateprediction.net | [file_xfer] file transfer status -184 (transient HTTP error)
22.07.2015 05:41:43 | climateprediction.net | Temporarily failed upload of hadam3p_pnw_ppwa_2013_1_009981129_0_8.zip: transient HTTP error
22.07.2015 05:41:43 | climateprediction.net | [file_xfer] project-wide xfer delay for 666.026185 sec
22.07.2015 05:41:43 | climateprediction.net | Backing off 00:27:49 on upload of hadam3p_pnw_ppwa_2013_1_009981129_0_8.zip
22.07.2015 05:41:46 | | Project communication failed: attempting access to reference site
22.07.2015 05:41:47 | | Internet access OK - project servers may be temporarily down.

ID: 52292 · Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 1 Jan 07
Posts: 1061
Credit: 36,708,278
RAC: 9,361
Message 52296 - Posted: 22 Jul 2015, 12:31:40 UTC

Seems to be running now.
ID: 52296 · Report as offensive     Reply Quote
ProfileByron Leigh Hatch @ team Carl ...
Avatar

Send message
Joined: 17 Aug 04
Posts: 289
Credit: 44,103,664
RAC: 0
Message 52297 - Posted: 22 Jul 2015, 13:01:01 UTC

Yes, mine started to upload about an hour ago and the backlog has now cleared :)
ID: 52297 · Report as offensive     Reply Quote
AySz88

Send message
Joined: 18 Sep 04
Posts: 2
Credit: 4,476,216
RAC: 0
Message 52365 - Posted: 30 Jul 2015, 2:40:39 UTC

I've been getting these "transient HTTP error" messages here - on about 30 hadcm3s uploads pending, each upload failing at a different percentage through. (I think this is unrelated, but in case it matters: currently the pending uploads are consuming the whole quota of disk space.)

7/29/2015 10:32:27 PM | climateprediction.net | Temporarily failed upload of hadcm3s_9tgm_1986_2_009894563_1_2.zip: transient HTTP error
7/29/2015 10:32:27 PM | climateprediction.net | Backing off 03:34:52 on upload of hadcm3s_9tgm_1986_2_009894563_1_2.zip
7/29/2015 10:32:27 PM | climateprediction.net | Started upload of hadcm3s_a2fc_1996_2_009906181_0_1.zip
7/29/2015 10:34:40 PM | climateprediction.net | Temporarily failed upload of hadcm3s_a90p_1996_2_009914726_2_2.zip: transient HTTP error
7/29/2015 10:34:40 PM | climateprediction.net | Backing off 03:31:16 on upload of hadcm3s_a90p_1996_2_009914726_2_2.zip
7/29/2015 10:34:42 PM | | Project communication failed: attempting access to reference site
7/29/2015 10:34:43 PM | | Internet access OK - project servers may be temporarily down.

ID: 52365 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4540
Credit: 19,025,554
RAC: 20,468
Message 52367 - Posted: 30 Jul 2015, 6:51:18 UTC

currently the pending uploads are consuming the whole quota of disk space.


I see that your computer has had quite a few pnw tasks, possibly others as well fail. Often when tasks fail they don't delete the task folder in ...projects/clmateprediction.net It is likely that this is what is taking up lots of space rather than the failed uploads.

To reclaim the lost space go into wherever windows keeps your data for BOINC and you can safely delete folders for tasks that don't appear on your BOINC task list. This can be over 1GB/task.

Anyone running linux who hasn't noticed, the short tasks don't clean up even after completing successfully. I just reclaimed 20GB on one of my boxes yesterday!
ID: 52367 · Report as offensive     Reply Quote
AySz88

Send message
Joined: 18 Sep 04
Posts: 2
Credit: 4,476,216
RAC: 0
Message 52368 - Posted: 30 Jul 2015, 22:14:00 UTC - in response to Message 52367.  

I see that your computer has had quite a few pnw tasks, possibly others as well fail. Often when tasks fail they don't delete the task folder in ...projects/clmateprediction.net It is likely that this is what is taking up lots of space rather than the failed uploads.

To reclaim the lost space go into wherever windows keeps your data for BOINC and you can safely delete folders for tasks that don't appear on your BOINC task list. This can be over 1GB/task.


Thanks, that was the problem (for the disk space, anyway)!
ID: 52368 · Report as offensive     Reply Quote
bernard_ivo

Send message
Joined: 18 Jul 13
Posts: 438
Credit: 25,620,508
RAC: 4,981
Message 52800 - Posted: 6 Nov 2015, 17:32:50 UTC

It's been 4 days since my last successful upload of TRIFIDS .zips I constantly get the transient HTTP ERROR......and we are in the weekend...
ID: 52800 · Report as offensive     Reply Quote
ProfileJIM

Send message
Joined: 31 Dec 07
Posts: 1152
Credit: 22,363,583
RAC: 5,022
Message 52803 - Posted: 7 Nov 2015, 5:34:01 UTC

It appears that there is a problem with the upload of zip files from hadam3p_afr tasks. I now have all 13 zip from a finished stuck in my transfer tab.


ID: 52803 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4540
Credit: 19,025,554
RAC: 20,468
Message 52804 - Posted: 7 Nov 2015, 8:09:52 UTC
Last modified: 7 Nov 2015, 8:26:18 UTC

Don't expect any zips going to oxford to work before Monday now. With luck, the credit script will also be restarted then. Having only linux boxes I haven't kept track of which zips go where but if anyone has tasks left that send them to Oregon or anywhere except Oxford these should still work.

Edit: The credit script seems to have run.
ID: 52804 · Report as offensive     Reply Quote
jrapdx

Send message
Joined: 4 Jul 15
Posts: 63
Credit: 3,223,760
RAC: 0
Message 52816 - Posted: 9 Nov 2015, 1:17:38 UTC - in response to Message 52804.  

I hadn't checked my Linux box for a few days, and now I discover it has 31 uploads pending, about 1.9GB total. My Windows computer has 4 uploads pending, for 120MB. So I hope it does get cleared up tomorrow, but there may be quite a flood of data considering how many participants are are probably affected by the backlog.

Curiously, downloads of Linux tasks is not impeded. No problem running them, but communication being so unreliable leads me to think there's risk of losing the work that gets done.
ID: 52816 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 52817 - Posted: 9 Nov 2015, 2:31:42 UTC - in response to Message 52816.  

The IT people still haven't gotten all of the services running again after the shut down. That will start happening soon after start of Uni working hours, Monday.

And the upload servers where all of this is going may have run out of space.


ID: 52817 · Report as offensive     Reply Quote
ProfileIain Inglis
Volunteer moderator

Send message
Joined: 16 Jan 10
Posts: 1084
Credit: 7,815,352
RAC: 5,242
Message 52819 - Posted: 9 Nov 2015, 11:18:56 UTC

New message to me, understandable in the circumstances:

09/11/2015 11:11:51 | climateprediction.net | Sending scheduler request: To send trickle-up message.
09/11/2015 11:11:51 | climateprediction.net | Not requesting tasks: too many uploads in progress
09/11/2015 11:11:53 | climateprediction.net | Scheduler request completed

All my Windows work is now finished, so I'll dust off my copy of Virtual Box and get some Linux models.
ID: 52819 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 52822 - Posted: 9 Nov 2015, 12:53:47 UTC

I asked about the lack of new work, "just in case", and have just been informed that Windows tasks have been waiting on the return to work after the project's recent hospital visit. As I suspected was the case.

ID: 52822 · Report as offensive     Reply Quote
ProfileJIM

Send message
Joined: 31 Dec 07
Posts: 1152
Credit: 22,363,583
RAC: 5,022
Message 52824 - Posted: 9 Nov 2015, 18:27:23 UTC

At last! The hadam3p_afr zip files are starting to upload.
ID: 52824 · Report as offensive     Reply Quote
ATHANASIOS GKOLIARAS

Send message
Joined: 10 Dec 06
Posts: 1
Credit: 975,125
RAC: 0
Message 52827 - Posted: 9 Nov 2015, 20:47:24 UTC - in response to Message 52824.  
Last modified: 9 Nov 2015, 20:47:43 UTC

Mine too. They've finished now :)
ID: 52827 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 52921 - Posted: 20 Nov 2015, 19:09:15 UTC
Last modified: 20 Nov 2015, 19:09:34 UTC

Hello Misty

Those messages aren't error codes, just BOINC telling you that it can't find the last few zip files.

I posted an answer to a similar question a few days ago here.
ID: 52921 · Report as offensive     Reply Quote
WB8ILI

Send message
Joined: 1 Sep 04
Posts: 161
Credit: 81,522,141
RAC: 1,164
Message 52922 - Posted: 20 Nov 2015, 19:20:54 UTC
Last modified: 20 Nov 2015, 19:21:48 UTC

Misty -

I think you are being too concerned.

Sometimes the task tries to upload more zip files than were produced. This not a "failed" task. It is not abnormal.

Generally, if you don't see any "errors" in the Stderr text and the exit code is 0 (zero), you can be almost 100% sure everything is fine.

While I don't think you have any issues, you might want to change some settings to reduce the number of times the model gets interrupted (suspended). The task you pointed to in the previous post shows literally hundreds of suspensions. This is never good.
ID: 52922 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 54002 - Posted: 24 Apr 2016, 6:07:11 UTC
Last modified: 24 Apr 2016, 22:50:39 UTC

With the continuing problem of uploads getting stuck, I've found this old(ish) thread about uploads.

Please move to using this one, instead of the Credits thread.

Oxford will be starting Trinity term this Sunday, so hopefully the IT people from all over the city will be back from holidays this weekend.
As long as those that have been holding the fort don't suddenly take some leave. :)

Dates of Term
ID: 54002 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 54006 - Posted: 24 Apr 2016, 22:49:29 UTC

My eu25's are uploading OK to upload3, so it must just be upload2 that has a problem.

ID: 54006 · Report as offensive     Reply Quote
Previous · 1 . . . 18 · 19 · 20 · 21 · 22 · 23 · 24 . . . 33 · Next

Message boards : Number crunching : ANOTHER UPLOAD PROBLEM

©2024 cpdn.org