climateprediction.net (CPDN) home page
Thread 'ANOTHER UPLOAD PROBLEM'

Thread 'ANOTHER UPLOAD PROBLEM'

Message boards : Number crunching : ANOTHER UPLOAD PROBLEM
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 3 · 4 · 5 · 6 · 7 · 8 · 9 . . . 33 · Next

AuthorMessage
ProfileJIM

Send message
Joined: 31 Dec 07
Posts: 1152
Credit: 22,363,583
RAC: 5,022
Message 49335 - Posted: 11 Jun 2014, 4:17:13 UTC

I have 3 zip13 upload failure.
ID: 49335 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4540
Credit: 19,018,099
RAC: 20,856
Message 49336 - Posted: 11 Jun 2014, 8:57:21 UTC

If we know that the answer is, "42" why are we crunching?
ID: 49336 · Report as offensive     Reply Quote
Professor Desty Nova
Avatar

Send message
Joined: 19 Sep 04
Posts: 92
Credit: 2,011,637
RAC: 351
Message 49337 - Posted: 11 Jun 2014, 9:00:52 UTC - in response to Message 49336.  

If we know that the answer is, "42" why are we crunching?


To find the question :-P


Professor Desty Nova
Researching Karma the Hard Way
ID: 49337 · Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 1 Jan 07
Posts: 1061
Credit: 36,706,621
RAC: 9,524
Message 49338 - Posted: 11 Jun 2014, 9:01:42 UTC - in response to Message 49336.  

If we know that the answer is, "42" why are we crunching?

To find out what the question was, of course.
ID: 49338 · Report as offensive     Reply Quote
ProfileJIM

Send message
Joined: 31 Dec 07
Posts: 1152
Credit: 22,363,583
RAC: 5,022
Message 49339 - Posted: 11 Jun 2014, 13:51:16 UTC

Now that we know the answer is �42� and I believe that question was �What is the meaning of life, the Universe and everything� it is time to get back to the real purpose of this thread.

I have 3 13.zip files stuck in the transfer tab. All are for hadam3p_eu�s. Relevant error messages are as follows:

hadam3p_anz_r42c_2012_1_008734882_0_13.zip
6/11/2014 6:52:32 AM | climateprediction.net | [error] Error reported by file upload server: can't open file /storage/cpdn-restarts/incoming/uploader/hadam3p_anz_r42c_2012_1_008734882_0_13.zip: No such file or directory
6/11/2014 6:52:32 AM | climateprediction.net | Temporarily failed upload of hadam3p_anz_r42c_2012_1_008734882_0_13.zip: transient upload error
6/11/2014 6:52:32 AM | climateprediction.net | Backing off 04:03:44 on upload of hadam3p_anz_r42c_2012_1_008734882_0_13.zip
6/11/2014 8:30:14 AM | climateprediction.net | Sending scheduler request: To send trickle-up message.
6/11/2014 8:30:14 AM | climateprediction.net | Not requesting tasks: don't need
6/11/2014 8:30:16 AM | climateprediction.net | Scheduler request completed
6/11/2014 8:30:23 AM | climateprediction.net | Started upload of hadam3p_eu_r3a6_2013_1_008749140_0_1.zip
6/11/2014 8:35:04 AM | climateprediction.net | Finished upload of hadam3p_eu_r3a6_2013_1_008749140_0_1.zip
6/11/2014 9:14:34 AM | climateprediction.net | Started upload of hadam3p_anz_r3vp_2012_1_008734643_0_13.zip
6/11/2014 9:21:17 AM | climateprediction.net | [error] Error reported by file upload server: can't open file /storage/cpdn-restarts/incoming/uploader/hadam3p_anz_r3vp_2012_1_008734643_0_13.zip: No such file or directory
6/11/2014 9:21:17 AM | climateprediction.net | Temporarily failed upload of hadam3p_anz_r3vp_2012_1_008734643_0_13.zip: transient upload error
6/11/2014 9:21:17 AM | climateprediction.net | Backing off 04:24:15 on upload of hadam3p_anz_r3vp_2012_1_008734643_0_13.zip

It looks like it is time to remount the server again.

ID: 49339 · Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 1 Jan 07
Posts: 1061
Credit: 36,706,621
RAC: 9,524
Message 49340 - Posted: 11 Jun 2014, 14:05:24 UTC

Added to which, I have

11/06/2014 14:49:48 | climateprediction.net | Started upload of hadam3p_eu_r7bv_2013_1_008754385_0_13.zip
11/06/2014 14:55:58 | climateprediction.net | [error] Error reported by file upload server: can't open file /storage/cpdn-restarts/incoming/uploader/hadam3p_eu_r7bv_2013_1_008754385_0_13.zip: No such file or directory

Zip 13 for (all) HadAM3P models is uploaded to

cpdn-restarts.oerc.ox.ac.uk

whereas zips 1-12 are uploaded (for EU models) to

cpdn-upload2.oerc.ox.ac.uk

so my presumed answer is that server 'cpdn-restarts' is having problems with its file storage system (probably a remote mounted drive) during the back-end server work at Oxford.
ID: 49340 · Report as offensive     Reply Quote
ProfileastroWX
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1496
Credit: 95,522,203
RAC: 0
Message 49341 - Posted: 11 Jun 2014, 18:39:52 UTC

One of mine has the same diagnostic message Richard reported, except it's for ANZ. I'm not sweating it because a lot of server maintenance is underway this month. (See Jonathan's schedule in 'News and Announcements' Thread, at top of 'Number Crunching.')

Repetitious upload followed by failure is frustrating and I commiserate. However it isn't a terminal condition. It's a known issue at the head shed and will be sorted-out eventually.

"We have met the enemy and he is us." -- Pogo
Greetings from coastal Washington state, the scenic US Pacific Northwest.
ID: 49341 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4540
Credit: 19,018,099
RAC: 20,856
Message 49342 - Posted: 11 Jun 2014, 20:30:17 UTC

One of mine has the same diagnostic message Richard reported, except it's for ANZ.


Interesting, I wonder if there is a backlog causing the failures to be intermittent? One of my anz zips has just gone through with no problems and we are past normal working hours at Oxford.
ID: 49342 · Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 1 Jan 07
Posts: 1061
Credit: 36,706,621
RAC: 9,524
Message 49343 - Posted: 11 Jun 2014, 20:34:12 UTC - in response to Message 49342.  

One of mine has the same diagnostic message Richard reported, except it's for ANZ.


Interesting, I wonder if there is a backlog causing the failures to be intermittent? One of my anz zips has just gone through with no problems and we are past normal working hours at Oxford.

'one of'? Only zip 13 is affected - all the others go elsewhere.
ID: 49343 · Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 1 Jan 07
Posts: 1061
Credit: 36,706,621
RAC: 9,524
Message 49349 - Posted: 12 Jun 2014, 15:39:30 UTC - in response to Message 49343.  

My stalled uploads just cleared. Remember to allow for congestion, now that the flow has started again.
ID: 49349 · Report as offensive     Reply Quote
ProfileJIM

Send message
Joined: 31 Dec 07
Posts: 1152
Credit: 22,363,583
RAC: 5,022
Message 49350 - Posted: 13 Jun 2014, 0:33:38 UTC

Mine have cleared also.

ID: 49350 · Report as offensive     Reply Quote
Eirik Redd

Send message
Joined: 31 Aug 04
Posts: 391
Credit: 219,896,461
RAC: 649
Message 49358 - Posted: 14 Jun 2014, 8:27:57 UTC

Backlog here cleared also. But very slowly. Figure it is the re-configure for faster updated ser4vers.

Just guessing, but thinking that the announced server work, and the interim between the MOSES re-issue, and the new regional batches --
The next few weeks might not have much new work.

BUT when all the new work hits the new server structure -- in weeks or months --

Be ready.
ID: 49358 · Report as offensive     Reply Quote
Waldmeister

Send message
Joined: 13 Jun 11
Posts: 34
Credit: 1,415,036
RAC: 1,383
Message 49359 - Posted: 14 Jun 2014, 17:32:23 UTC

Ok, my task cleared too. Case (temporarily) closed.
ID: 49359 · Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 1 Jan 07
Posts: 1061
Credit: 36,706,621
RAC: 9,524
Message 49375 - Posted: 18 Jun 2014, 14:02:30 UTC

I've been getting

18/06/2014 15:00:46 | climateprediction.net | Temporarily failed upload of hadam3p_eu_r7ci_2013_1_008754408_0_13.zip: connect() failed

sporadically all day. At least it's a quick error.
ID: 49375 · Report as offensive     Reply Quote
Harri Liljeroos

Send message
Joined: 9 Dec 05
Posts: 116
Credit: 12,547,934
RAC: 2,738
Message 49381 - Posted: 19 Jun 2014, 7:08:59 UTC

I've been getting:

448 climateprediction.net 19.06.14 06:29:56 Started upload of hadam3p_eu_fab3_2013_1_008765155_0_6.zip
449 climateprediction.net 19.06.14 06:29:59 [error] Error reported by file upload server: Server is out of disk space
450 climateprediction.net 19.06.14 06:29:59 Temporarily failed upload of hadam3p_eu_fab3_2013_1_008765155_0_6.zip: transient upload error
451 climateprediction.net 19.06.14 06:29:59 Backing off 4 hr 59 min 47 sec on upload of hadam3p_eu_fab3_2013_1_008765155_0_6.zip

ID: 49381 · Report as offensive     Reply Quote
ed2353

Send message
Joined: 15 Feb 06
Posts: 137
Credit: 35,299,134
RAC: 13,109
Message 49382 - Posted: 19 Jun 2014, 8:09:40 UTC - in response to Message 49381.  

Les Bayliss reported this to the project people yesterday.
ID: 49382 · Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 1 Jan 07
Posts: 1061
Credit: 36,706,621
RAC: 9,524
Message 49383 - Posted: 19 Jun 2014, 8:57:57 UTC - in response to Message 49382.  

Les Bayliss reported this to the project people yesterday.

And Jonathan has just replied:

I am playing musical disks at the moment...

Uploads should be OK again for a few days.
ID: 49383 · Report as offensive     Reply Quote
ProfileIain Inglis
Volunteer moderator

Send message
Joined: 16 Jan 10
Posts: 1084
Credit: 7,808,726
RAC: 5,192
Message 49384 - Posted: 19 Jun 2014, 11:17:47 UTC

Has anyone had an ANZ Zip #13 upload recently? If so I'll wait for my stuck one to clear; otherwise I'll get the project people here to poke the project people there.
ID: 49384 · Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 1 Jan 07
Posts: 1061
Credit: 36,706,621
RAC: 9,524
Message 49385 - Posted: 19 Jun 2014, 11:32:53 UTC - in response to Message 49384.  

Has anyone had an ANZ Zip #13 upload recently? If so I'll wait for my stuck one to clear; otherwise I'll get the project people here to poke the project people there.

My EU zip #13 has cleared: Jonathan said it was actually another symptom of the same problem (different upload servers, but evidently uploading - or trying to - to the same storage area).
ID: 49385 · Report as offensive     Reply Quote
ProfileIain Inglis
Volunteer moderator

Send message
Joined: 16 Jan 10
Posts: 1084
Credit: 7,808,726
RAC: 5,192
Message 49386 - Posted: 19 Jun 2014, 13:30:32 UTC - in response to Message 49385.  

Has anyone had an ANZ Zip #13 upload recently? If so I'll wait for my stuck one to clear; otherwise I'll get the project people here to poke the project people there.

My EU zip #13 has cleared: Jonathan said it was actually another symptom of the same problem (different upload servers, but evidently uploading - or trying to - to the same storage area).

Gone now: thanks for the confirmation ...
ID: 49386 · Report as offensive     Reply Quote
Previous · 1 . . . 3 · 4 · 5 · 6 · 7 · 8 · 9 . . . 33 · Next

Message boards : Number crunching : ANOTHER UPLOAD PROBLEM

©2024 cpdn.org