Message boards : Number crunching : Servers OK But Uploads Constantly Fail - Why?
Message board moderation
Author | Message |
---|---|
Send message Joined: 26 Aug 04 Posts: 84 Credit: 351,331 RAC: 0 |
I have two WU's that simply refuse to upload despite the servers now being 'Up'. Here is a partial extract from the Event Log: 29/05/2012 7:27:07 AM | climateprediction.net | Started upload of hadam3p_pnw_bng9_1992_1_007916696_0_13.zip 29/05/2012 7:41:24 AM | climateprediction.net | [error] Error reported by file upload server: EOF on socket read : asked for 16382, got 14604 29/05/2012 7:41:24 AM | climateprediction.net | Temporarily failed upload of hadam3p_pnw_bng9_1992_1_007916696_0_13.zip: transient upload error 29/05/2012 7:41:24 AM | climateprediction.net | Backing off 3 hr 9 min 13 sec on upload of hadam3p_pnw_bng9_1992_1_007916696_0_13.zip The reason for failure seems to change randomly. Previously it was: 29/05/2012 6:35:29 AM | climateprediction.net | Started upload of hadam3p_pnw_bnga_1993_1_007916697_0_13.zip 29/05/2012 6:35:54 AM | climateprediction.net | Temporarily failed upload of hadam3p_pnw_bnga_1993_1_007916697_0_13.zip: transient HTTP error 29/05/2012 6:35:54 AM | climateprediction.net | Backing off 5 hr 12 min 14 sec on upload of hadam3p_pnw_bnga_1993_1_007916697_0_13.zip 29/05/2012 6:35:58 AM | | Project communication failed: attempting access to reference site 29/05/2012 6:36:00 AM | | Internet access OK - project servers may be temporarily down. Peter Toronto, Canada |
Send message Joined: 15 May 09 Posts: 4540 Credit: 19,039,635 RAC: 18,944 |
There is a problem with a hard disk mounting on that server. Also discussed in this thread [/url] http://climateapps2.oerc.ox.ac.uk/cpdnboinc/forum_thread.php?id=7378 |
Send message Joined: 26 Aug 04 Posts: 84 Credit: 351,331 RAC: 0 |
They seemed to think it was solved....from what I read there in the penultimate post. |
Send message Joined: 5 Aug 04 Posts: 1283 Credit: 15,824,334 RAC: 0 |
The upload server is currently being hammered by the backlog of uploads. Some connections will fail to be made until that backlog has been cleared. The "EOF on socket read" error was almost certainly due to packets being timed out when the network was overloaded. Note that the server which receives the _13.zip files will be one of those affected by the essential maintenance planned for the storage infrastructure tomorrow morning. "The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer |
Send message Joined: 26 Aug 04 Posts: 84 Credit: 351,331 RAC: 0 |
Thanks. |
©2024 cpdn.org