climateprediction.net (CPDN) home page
Thread 'Servers OK But Uploads Constantly Fail - Why?'

Thread 'Servers OK But Uploads Constantly Fail - Why?'

Message boards : Number crunching : Servers OK But Uploads Constantly Fail - Why?
Message board moderation

To post messages, you must log in.

AuthorMessage
Profileex_brit
Avatar

Send message
Joined: 26 Aug 04
Posts: 84
Credit: 351,331
RAC: 0
Message 44258 - Posted: 29 May 2012, 11:46:13 UTC
Last modified: 29 May 2012, 12:01:11 UTC

I have two WU's that simply refuse to upload despite the servers now being 'Up'.

Here is a partial extract from the Event Log:

29/05/2012 7:27:07 AM | climateprediction.net | Started upload of hadam3p_pnw_bng9_1992_1_007916696_0_13.zip
29/05/2012 7:41:24 AM | climateprediction.net | [error] Error reported by file upload server: EOF on socket read : asked for 16382, got 14604
29/05/2012 7:41:24 AM | climateprediction.net | Temporarily failed upload of hadam3p_pnw_bng9_1992_1_007916696_0_13.zip: transient upload error
29/05/2012 7:41:24 AM | climateprediction.net | Backing off 3 hr 9 min 13 sec on upload of hadam3p_pnw_bng9_1992_1_007916696_0_13.zip

The reason for failure seems to change randomly.

Previously it was:

29/05/2012 6:35:29 AM | climateprediction.net | Started upload of hadam3p_pnw_bnga_1993_1_007916697_0_13.zip
29/05/2012 6:35:54 AM | climateprediction.net | Temporarily failed upload of hadam3p_pnw_bnga_1993_1_007916697_0_13.zip: transient HTTP error
29/05/2012 6:35:54 AM | climateprediction.net | Backing off 5 hr 12 min 14 sec on upload of hadam3p_pnw_bnga_1993_1_007916697_0_13.zip
29/05/2012 6:35:58 AM | | Project communication failed: attempting access to reference site
29/05/2012 6:36:00 AM | | Internet access OK - project servers may be temporarily down.
Peter
Toronto, Canada
ID: 44258 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4540
Credit: 19,039,635
RAC: 18,944
Message 44259 - Posted: 29 May 2012, 12:05:25 UTC - in response to Message 44258.  
Last modified: 29 May 2012, 12:06:32 UTC

There is a problem with a hard disk mounting on that server. Also discussed in this thread [/url] http://climateapps2.oerc.ox.ac.uk/cpdnboinc/forum_thread.php?id=7378
ID: 44259 · Report as offensive     Reply Quote
Profileex_brit
Avatar

Send message
Joined: 26 Aug 04
Posts: 84
Credit: 351,331
RAC: 0
Message 44261 - Posted: 29 May 2012, 13:00:11 UTC
Last modified: 29 May 2012, 13:22:15 UTC

They seemed to think it was solved....from what I read there in the penultimate post.
ID: 44261 · Report as offensive     Reply Quote
ProfileThyme Lawn
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1283
Credit: 15,824,334
RAC: 0
Message 44264 - Posted: 29 May 2012, 18:01:22 UTC

The upload server is currently being hammered by the backlog of uploads. Some connections will fail to be made until that backlog has been cleared. The "EOF on socket read" error was almost certainly due to packets being timed out when the network was overloaded.

Note that the server which receives the _13.zip files will be one of those affected by the essential maintenance planned for the storage infrastructure tomorrow morning.
"The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer
ID: 44264 · Report as offensive     Reply Quote
Profileex_brit
Avatar

Send message
Joined: 26 Aug 04
Posts: 84
Credit: 351,331
RAC: 0
Message 44265 - Posted: 29 May 2012, 18:38:05 UTC

Thanks.
ID: 44265 · Report as offensive     Reply Quote

Message boards : Number crunching : Servers OK But Uploads Constantly Fail - Why?

©2024 cpdn.org