climateprediction.net (CPDN) home page
Thread 'failed upload: can't resolve hostname'

Thread 'failed upload: can't resolve hostname'

Message boards : Number crunching : failed upload: can't resolve hostname
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · Next

AuthorMessage
Richard Haselgrove

Send message
Joined: 1 Jan 07
Posts: 1061
Credit: 36,703,308
RAC: 9,860
Message 47257 - Posted: 8 Oct 2013, 13:23:15 UTC - in response to Message 47256.  



Sounds like one of the problems I am having, raised in this message thread.



I don't think it is the same thing. The uploads in this thread were failing because the name of the upload server was spelt wrong in the configuration file (apid-wattch), whereas your log file shows that the server name is spelt correctly (rapid-watch).

I can't see any obvious reason in your log files for it going wrong. What sort of filewall do you use? It may be worth taking a look at the firewall and antivirus logs to see if anything is appearing there (some security software blocks big zip files, for example, they may appear as a 'compression bomb' in the log).

He stated he has "another" problem after fixing the hosts file (the server name) issue. Thats the same error I get. Uploads start and fail part of the way through.

As for firewall and antivirus I don't have any errors being logged for either of them.

This is a case where enabling BOINC's http_debug logging may provide more details about the nature of that "transient upload error".
ID: 47257 · Report as offensive     Reply Quote
bernardinho

Send message
Joined: 4 Sep 13
Posts: 9
Credit: 672,309
RAC: 0
Message 47258 - Posted: 8 Oct 2013, 15:24:56 UTC - in response to Message 47257.  

I did not get the transient upload problem solved and decided to delete the corresponding units. Editing the hosts file did not solve the transient upload problem a couple of units had.
ID: 47258 · Report as offensive     Reply Quote
Helmer Bryd

Send message
Joined: 16 Aug 04
Posts: 156
Credit: 9,035,872
RAC: 2,928
Message 47265 - Posted: 9 Oct 2013, 18:21:46 UTC
Last modified: 9 Oct 2013, 19:17:46 UTC

I guess this is apid-wattch thing is a linux problem.
Saw the same thing back in february in cpdnbeta when attaching a new box with 3.5 linux kernel, upload servers stanford and cpdnbeta got misspelled as staanford and pdnbeta in client-state.xml, also got apid in classic.
Got tired of it and reinstalled old ubuntu10 with 2.6 kernel and all was fine.
Then in july tried lubuntu 13.04 with 3.8 kernel and it was 23% faster! in hadcmn.6.07 but with the same apid crap, then tried to edit the client-state.xml to rapid-watch on all four lines for every task and it works :-)
Me thought the speedup was due to the light desktop in lubuntu but then newly installed linuxmint-15 and it's the same nice speed, yuipee! On par with windows now?
This is with boinc 6.4-6.10.58, higher got problems with libraries, don't like symbolic links and i am lazy, heh

Edit: this is only one box i sit and edit the client-state.xml in and don't bother much but it surely should be fixed if others have problems too
ID: 47265 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4540
Credit: 19,009,815
RAC: 21,293
Message 47267 - Posted: 10 Oct 2013, 7:06:12 UTC - in response to Message 47265.  

Strange, I haven't had any problems since upgrading to the latest Ubuntu with the 3.8 kernel. All uploads have gone through normally. I have noticed faster boot times with it but haven't been paying attention to the speed of crunching.
ID: 47267 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4540
Credit: 19,009,815
RAC: 21,293
Message 47268 - Posted: 10 Oct 2013, 8:16:13 UTC - in response to Message 47267.  

Too late to edit - my memory is that this only happened with a particular lot of tasks which probably explains my not having suffered this time around.
ID: 47268 · Report as offensive     Reply Quote
bernardinho

Send message
Joined: 4 Sep 13
Posts: 9
Credit: 672,309
RAC: 0
Message 47269 - Posted: 10 Oct 2013, 16:52:49 UTC

Here we go again with the transient upload error... I can't really upgrade my kernel due to some other incompatibilities, but I am open for suggestions. Otherwise I guess I have go through my working units every now and then and delete the problematic ones. No need to upload 50MB if it does not go through anyway... :s

10-Oct-2013 12:37:18 [climateprediction.net] [fxd] starting upload, upload_offset -1
10-Oct-2013 12:37:18 [climateprediction.net] Started upload of hadcm3n_3bqz_2020_40_008389544_3_1.zip
10-Oct-2013 12:37:18 [climateprediction.net] [file_xfer_debug] URL: http://apid-wattch.badc.rl.ac.uk/cpdn_cgi/file_upload_handler
10-Oct-2013 12:37:20 [---] [http_debug] [ID#2] Info:  About to connect() to apid-wattch.badc.rl.ac.uk port 80 (#0)
10-Oct-2013 12:37:20 [---] [http_debug] [ID#2] Info:    Trying 130.246.191.84... 
10-Oct-2013 12:37:20 [---] [http_debug] [ID#2] Info:  Connected to apid-wattch.badc.rl.ac.uk (130.246.191.84) port 80 (#0)
10-Oct-2013 12:37:20 [---] [http_debug] [ID#2] Sent header to server: POST /cpdn_cgi/file_upload_handler HTTP/1.1
10-Oct-2013 12:37:20 [---] [http_debug] [ID#2] Sent header to server: User-Agent: BOINC client (x86_64-pc-linux-gnu 6.10.58)
10-Oct-2013 12:37:20 [---] [http_debug] [ID#2] Sent header to server: Host: apid-wattch.badc.rl.ac.uk
10-Oct-2013 12:37:20 [---] [http_debug] [ID#2] Sent header to server: Accept: */*
10-Oct-2013 12:37:20 [---] [http_debug] [ID#2] Sent header to server: Accept-Encoding: deflate, gzip
10-Oct-2013 12:37:20 [---] [http_debug] [ID#2] Sent header to server: Content-Type: application/x-www-form-urlencoded
10-Oct-2013 12:37:20 [---] [http_debug] [ID#2] Sent header to server: Content-Length: 292
10-Oct-2013 12:37:20 [---] [http_debug] [ID#2] Sent header to server: 
10-Oct-2013 12:37:20 [---] [http_debug] [ID#2] Received header from server: HTTP/1.1 200 OK
10-Oct-2013 12:37:20 [---] [http_debug] [ID#2] Received header from server: Date: Thu, 10 Oct 2013 16:37:20 GMT
10-Oct-2013 12:37:20 [---] [http_debug] [ID#2] Received header from server: Server: Apache/2.2.12 (Linux/SUSE)
10-Oct-2013 12:37:20 [---] [http_debug] [ID#2] Received header from server: Transfer-Encoding: chunked
10-Oct-2013 12:37:20 [---] [http_debug] [ID#2] Received header from server: Content-Type: text/plain
10-Oct-2013 12:37:20 [---] [http_debug] [ID#2] Received header from server: 
10-Oct-2013 12:37:20 [---] [http_debug] [ID#2] Info:  Connection #0 to host apid-wattch.badc.rl.ac.uk left intact
10-Oct-2013 12:37:21 [climateprediction.net] [file_xfer_debug] FILE_XFER_SET::poll(): http op done; retval 0
10-Oct-2013 12:37:21 [climateprediction.net] [file_xfer_debug] parsing upload response: <data_server_reply>
    <status>0</status>
    <file_size>0</file_size>
</data_server_reply>
10-Oct-2013 12:37:21 [climateprediction.net] [file_xfer_debug] parsing status: 0
10-Oct-2013 12:37:21 [climateprediction.net] [fxd] starting upload, upload_offset 0
10-Oct-2013 12:37:23 [---] [http_debug] [ID#2] Info:  Re-using existing connection! (#0) with host apid-wattch.badc.rl.ac.uk
10-Oct-2013 12:37:23 [---] [http_debug] [ID#2] Info:  Connected to apid-wattch.badc.rl.ac.uk (130.246.191.84) port 80 (#0)
10-Oct-2013 12:37:25 [---] [http_debug] [ID#2] Sent header to server: POST /cpdn_cgi/file_upload_handler HTTP/1.1
10-Oct-2013 12:37:25 [---] [http_debug] [ID#2] Sent header to server: User-Agent: BOINC client (x86_64-pc-linux-gnu 6.10.58)
10-Oct-2013 12:37:25 [---] [http_debug] [ID#2] Sent header to server: Host: apid-wattch.badc.rl.ac.uk
10-Oct-2013 12:37:25 [---] [http_debug] [ID#2] Sent header to server: Accept: */*
10-Oct-2013 12:37:25 [---] [http_debug] [ID#2] Sent header to server: Accept-Encoding: deflate, gzip
10-Oct-2013 12:37:25 [---] [http_debug] [ID#2] Sent header to server: Content-Type: application/x-www-form-urlencoded
10-Oct-2013 12:37:25 [---] [http_debug] [ID#2] Sent header to server: Content-Length: 54369772
10-Oct-2013 12:37:25 [---] [http_debug] [ID#2] Sent header to server: Expect: 100-continue
10-Oct-2013 12:37:25 [---] [http_debug] [ID#2] Sent header to server: 
10-Oct-2013 12:37:26 [---] [http_debug] [ID#2] Received header from server: HTTP/1.1 100 Continue
10-Oct-2013 12:37:26 [---] [http_debug] [ID#2] Info:  Expire cleared
10-Oct-2013 12:43:33 [---] [http_debug] [ID#2] Received header from server: HTTP/1.1 200 OK
10-Oct-2013 12:43:33 [---] [http_debug] [ID#2] Received header from server: Date: Thu, 10 Oct 2013 16:37:25 GMT
10-Oct-2013 12:43:33 [---] [http_debug] [ID#2] Received header from server: Server: Apache/2.2.12 (Linux/SUSE)
10-Oct-2013 12:43:33 [---] [http_debug] [ID#2] Received header from server: Transfer-Encoding: chunked
10-Oct-2013 12:43:33 [---] [http_debug] [ID#2] Received header from server: Content-Type: text/plain
10-Oct-2013 12:43:33 [---] [http_debug] [ID#2] Received header from server: 
10-Oct-2013 12:43:33 [---] [http_debug] [ID#2] Info:  Connection #0 to host apid-wattch.badc.rl.ac.uk left intact
10-Oct-2013 12:43:34 [climateprediction.net] [file_xfer_debug] FILE_XFER_SET::poll(): http op done; retval 0
10-Oct-2013 12:43:34 [climateprediction.net] [file_xfer_debug] parsing upload response: <data_server_reply>
    <status>0</status>
</data_server_reply>
10-Oct-2013 12:43:34 [climateprediction.net] [file_xfer_debug] parsing status: -127
10-Oct-2013 12:43:34 [climateprediction.net] [file_xfer_debug] file transfer status -127
10-Oct-2013 12:43:34 [climateprediction.net] Temporarily failed upload of hadcm3n_3bqz_2020_40_008389544_3_1.zip: transient upload error
10-Oct-2013 12:43:34 [climateprediction.net] Backing off 3 hr 32 min 42 sec on upload of hadcm3n_3bqz_2020_40_008389544_3_1.zip
ID: 47269 · Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 1 Jan 07
Posts: 1061
Credit: 36,703,308
RAC: 9,860
Message 47270 - Posted: 10 Oct 2013, 17:17:39 UTC - in response to Message 47269.  

It would be just as quick and easy - with BOINC shut down - to do a global search-and-replace on client_state.xml:

Find
http://apid-wattch.badc.rl.ac.uk

Replace with
http://rapid-watch.badc.rl.ac.uk
ID: 47270 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 47271 - Posted: 10 Oct 2013, 20:02:30 UTC - in response to Message 47269.  

Bernardinho

Your problem is the well known BOINC/Linux problem, where the data is correct when it leaves Oxford, but the urls get corrupted when they get to certain computers.

There are 2 solutions:
1) Don't run this project on a Linux computer.
2) Shut down BOINC, make a copy of client_state.xml and put it in a safe place, and then edit the original file with a plain text editor to correct the faulty bits. (As per Richard's post below this.)

Commiserations.

ID: 47271 · Report as offensive     Reply Quote
Helmer Bryd

Send message
Joined: 16 Aug 04
Posts: 156
Credit: 9,035,872
RAC: 2,928
Message 47272 - Posted: 10 Oct 2013, 21:59:46 UTC - in response to Message 47271.  
Last modified: 10 Oct 2013, 22:21:06 UTC

Linux works fine, skip Les :-)
ID: 47272 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 47273 - Posted: 10 Oct 2013, 22:08:39 UTC - in response to Message 47272.  

I know. PROVIDED you're prepared to patch the urls in each new download.

And it may only happen to 64 bit systems. My 32 bit box is OK.

ID: 47273 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4540
Credit: 19,009,815
RAC: 21,293
Message 47274 - Posted: 11 Oct 2013, 6:33:21 UTC

Or only some 64bit systems. So far I seem to have escaped this one. I have installed by unpacking the tar.gz file into it's own directory as opposed to using the one provided by kubuntu's package manager. Don't know how relevant or not that is.
ID: 47274 · Report as offensive     Reply Quote
Helmer Bryd

Send message
Joined: 16 Aug 04
Posts: 156
Credit: 9,035,872
RAC: 2,928
Message 47275 - Posted: 11 Oct 2013, 13:09:46 UTC

Would be great to hear if any windows users get this "apid-wattch" thing too
ID: 47275 · Report as offensive     Reply Quote
ProfileIain Inglis
Volunteer moderator

Send message
Joined: 16 Jan 10
Posts: 1084
Credit: 7,808,726
RAC: 5,192
Message 47276 - Posted: 11 Oct 2013, 13:12:24 UTC - in response to Message 47275.  

Would be great to hear if any windows users get this "apid-wattch" thing too

There have been occasions when an address has been entered wrongly into the system and that has propagated to all users, but the corruption of an apparently correct address during delivery is restricted to Linux as far as I recall.
ID: 47276 · Report as offensive     Reply Quote
bernardinho

Send message
Joined: 4 Sep 13
Posts: 9
Credit: 672,309
RAC: 0
Message 47277 - Posted: 11 Oct 2013, 16:34:22 UTC

I have the feeling that the transient upload error is not necessarily related to the wrong host name, it seems to be another problem.
ID: 47277 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 47278 - Posted: 11 Oct 2013, 19:49:26 UTC - in response to Message 47277.  
Last modified: 11 Oct 2013, 20:00:33 UTC

bernardinho

In your case, it IS the wrong host name. (Unless you've edited the client_state.xml file since you posted that list. In which case, you now have a different problem.)

Look near the top of the list that you posted earlier in this thread. You won't get anywhere sending it to that url.
ID: 47278 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 47279 - Posted: 11 Oct 2013, 20:01:32 UTC - in response to Message 47275.  

cwhyl

This was discussed extensively on the old php board 2-3 years back when it first started happening. It was also tested a fair bit.

The files were/are OK on the server.
They're OK when they arrive zipped up on the user's computer.
At some point after unzipping and moving to their various locations, the data in the client_state.xml file shows up corrupted, in a couple of different ways.

So it's most likely a subtle bug in BOINC for a particular variety of Linux.

ID: 47279 · Report as offensive     Reply Quote
Helmer Bryd

Send message
Joined: 16 Aug 04
Posts: 156
Credit: 9,035,872
RAC: 2,928
Message 47288 - Posted: 12 Oct 2013, 11:59:55 UTC

What?
You knew about this all the time without saying anything?
Fix a new moderator please.
ID: 47288 · Report as offensive     Reply Quote
Alex Plantema

Send message
Joined: 3 Sep 04
Posts: 126
Credit: 26,610,380
RAC: 3,377
Message 47289 - Posted: 12 Oct 2013, 12:33:05 UTC

Or reopen the php board for reading only.
ID: 47289 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 47296 - Posted: 12 Oct 2013, 21:18:53 UTC - in response to Message 47288.  

cwhyl

What?
You knew about this all the time without saying anything?
Fix a new moderator please.

The discussion on the php board was in a section open to the public, and numerous crunchers DID post there about it.
The person who tested the download on their Linux computer was a non moderator.

It was also posted about on this board, with links back to the php board for the detailed instructions on how to fix the problem.

You've been on this project for long enough to have known about the other board.
Anyone who joins a volunteer organisation and doesn't check the notice board each time they turn up at the meeting place to find out what's going on has only them self to blame for their ignorance.

ID: 47296 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 47297 - Posted: 12 Oct 2013, 21:23:42 UTC - in response to Message 47289.  

Alex

The php board is NEVER coming back, in any form whatsoever.

See the posts in the News and Announcements thread around the 22 March 2013 about why.

ID: 47297 · Report as offensive     Reply Quote
Previous · 1 · 2 · 3 · Next

Message boards : Number crunching : failed upload: can't resolve hostname

©2024 cpdn.org