climateprediction.net (CPDN) home page
Thread 'failed upload: can't resolve hostname'

Thread 'failed upload: can't resolve hostname'

Message boards : Number crunching : failed upload: can't resolve hostname
Message board moderation

To post messages, you must log in.

1 · 2 · 3 · Next

AuthorMessage
bernardinho

Send message
Joined: 4 Sep 13
Posts: 9
Credit: 672,309
RAC: 0
Message 47000 - Posted: 11 Sep 2013, 15:25:21 UTC

Hi, I just started calculating some days ago but have problems with the uploads. Does anyone have a clue how I can fix this? Thanks!

11-Sep-2013 11:17:37 [climateprediction.net] Started upload of hadcm3n_o5ss_1980_40_008385337_3_1.zip
11-Sep-2013 11:17:40 [---] [http_debug] [ID#2] Info:  Could not resolve host: apid-wattch.badc.rl.ac.uk (Domain name not found)
11-Sep-2013 11:17:40 [---] [http_debug] [ID#2] Info:  Expire cleared
11-Sep-2013 11:17:40 [---] [http_debug] [ID#2] Info:  Closing connection #0
11-Sep-2013 11:17:40 [---] [http_debug] HTTP error: Couldn't resolve host name
11-Sep-2013 11:17:41 [---] Project communication failed: attempting access to reference site
11-Sep-2013 11:17:41 [climateprediction.net] Temporarily failed upload of hadcm3n_o5ss_1980_40_008385337_3_1.zip: can't resolve hostname
11-Sep-2013 11:17:41 [climateprediction.net] Backing off 3 hr 29 min 24 sec on upload of hadcm3n_o5ss_1980_40_008385337_3_1.zip
ID: 47000 · Report as offensive     Reply Quote
ProfileGreg van Paassen

Send message
Joined: 17 Nov 07
Posts: 142
Credit: 4,271,370
RAC: 0
Message 47004 - Posted: 11 Sep 2013, 20:37:12 UTC - in response to Message 47000.  

Hi bernadinho,

I see your computer is running other CPDN models as well as that one. How did the uploads for those other models go?

If the other models have uploaded their ..._1.zip files OK, I would just 'abort' the one with the HTTP error.

If not, do they all specify the same host, apid-wattch.badc.rl.ac.uk?
ID: 47004 · Report as offensive     Reply Quote
bernardinho

Send message
Joined: 4 Sep 13
Posts: 9
Credit: 672,309
RAC: 0
Message 47017 - Posted: 12 Sep 2013, 12:13:42 UTC - in response to Message 47004.  

It looks to me that none of the uploads works, and yes, they all try to use the same server...
Could not resolve host: apid-wattch.badc.rl.ac.uk (Domain name not found)
ID: 47017 · Report as offensive     Reply Quote
ProfileMikeMarsUK
Volunteer moderator
Avatar

Send message
Joined: 13 Jan 06
Posts: 1498
Credit: 15,613,038
RAC: 0
Message 47018 - Posted: 12 Sep 2013, 13:17:49 UTC
Last modified: 12 Sep 2013, 13:52:23 UTC

I wonder if that is supposed to be 'rapid-watch' rather than 'apid-wattch'. I have sent a query off to the admins.

If it turns out to be a typo, then it may be possible for the Rutherford Appleton labs to set up a redirect on that address.


-- Edit:

Andy has confirmed - it is a typo in the model definition (it should be rapid-watch).
I'm a volunteer and my views are my own.
News and Announcements and FAQ
ID: 47018 · Report as offensive     Reply Quote
bernardinho

Send message
Joined: 4 Sep 13
Posts: 9
Credit: 672,309
RAC: 0
Message 47020 - Posted: 12 Sep 2013, 14:51:27 UTC - in response to Message 47018.  

Thanks, than it is clear why the upload does not work. Can I fix it on my end somehow?
ID: 47020 · Report as offensive     Reply Quote
ProfileMikeMarsUK
Volunteer moderator
Avatar

Send message
Joined: 13 Jan 06
Posts: 1498
Credit: 15,613,038
RAC: 0
Message 47021 - Posted: 12 Sep 2013, 15:13:03 UTC - in response to Message 47020.  
Last modified: 12 Sep 2013, 16:34:55 UTC



Thanks, than it is clear why the upload does not work. Can I fix it on my end somehow?



Yes ... you can map apid-wattch.badc.rl.ac.uk onto the IP address 130.246.191.84 (= rapid-watch.badc.rl.ac.uk)



First, I will try to explain how this works: Addresses on the internet are actually all numeric, even although we see textual names. When your computer wants to know what apid-wattch.badc.rl.ac.uk means, it will ask a 'DNS' (domain name server) to translate it into a numeric internet protocol address (IP address). In this case, the server will reply that the address is unknown.

However, if it had been supplied with rapid-watch.badc.rl.ac.uk instead, the DNS server would have replied with the magic number 130.246.191.84.

Prior to asking the DNS server, the computer actually first checks a local list of hostnames & their IP addresses. We can add apid-wattch to this list on your computer.


On Windows, this is done by finding the file 'hosts', and editing it. On my PC, it is in the location C:\WINDOWS\system32\drivers\etc

Add the follwing line to the end of the file:

130.246.191.84 apid-wattch.badc.rl.ac.uk # redirecting apid-wattch to rapid-watch for CPDN


(The bit after the # is just a comment).

Note that this 'hosts' file is a system file, and it may be hidden (depending on the options in your windows explorer). Therefore a firewall / antivirus may try to prevent you from changing it.


However... I think you have something like 2 weeks before the upload fails. So you can simply sit back & hopefully the project might make this same change at Rutherford.
I'm a volunteer and my views are my own.
News and Announcements and FAQ
ID: 47021 · Report as offensive     Reply Quote
bernardinho

Send message
Joined: 4 Sep 13
Posts: 9
Credit: 672,309
RAC: 0
Message 47022 - Posted: 12 Sep 2013, 17:01:33 UTC - in response to Message 47021.  

Thank you! I added it to my /etc/hosts file and it seems to be working. It should buy the project admins some time :)
ID: 47022 · Report as offensive     Reply Quote
bernardinho

Send message
Joined: 4 Sep 13
Posts: 9
Credit: 672,309
RAC: 0
Message 47024 - Posted: 12 Sep 2013, 18:50:05 UTC - in response to Message 47022.  

Mhhh, looks like there is another problem now. It is certainly uploading the data, but I get another error message:

12-Sep-2013 14:44:56 [climateprediction.net] Temporarily failed upload of hadcm3n_o5ss_1980_40_008385337_3_1.zip: transient upload error
12-Sep-2013 14:44:56 [climateprediction.net] Backing off 2 hr 1 min 41 sec on upload of hadcm3n_o5ss_1980_40_008385337_3_1.zip
ID: 47024 · Report as offensive     Reply Quote
ProfileMikeMarsUK
Volunteer moderator
Avatar

Send message
Joined: 13 Jan 06
Posts: 1498
Credit: 15,613,038
RAC: 0
Message 47026 - Posted: 12 Sep 2013, 19:15:40 UTC - in response to Message 47024.  
Last modified: 12 Sep 2013, 19:16:03 UTC




Well, lets see if the address is accessible for you. Try visiting both of these in turn from your PC ... in theory you should get the same (minimal) web page on both.

http://apid-wattch.badc.rl.ac.uk/
http://rapid-watch.badc.rl.ac.uk/

If I look at them, only the second will work (since I have not touched 'hosts' on my PC).
I'm a volunteer and my views are my own.
News and Announcements and FAQ
ID: 47026 · Report as offensive     Reply Quote
bernardinho

Send message
Joined: 4 Sep 13
Posts: 9
Credit: 672,309
RAC: 0
Message 47040 - Posted: 13 Sep 2013, 13:28:12 UTC - in response to Message 47026.  

These links work with my updated hosts file. I also see that the file is uploading since there is outgoing traffic, it is just that I get this 'transient upload error' now. Maybe I should just wait a bit until the whole project is finished...
ID: 47040 · Report as offensive     Reply Quote
ProfileMikeMarsUK
Volunteer moderator
Avatar

Send message
Joined: 13 Jan 06
Posts: 1498
Credit: 15,613,038
RAC: 0
Message 47041 - Posted: 13 Sep 2013, 13:29:36 UTC



Well, what I was worried about was if the server was checking the incoming host name somehow (some web servers do this).



I'm a volunteer and my views are my own.
News and Announcements and FAQ
ID: 47041 · Report as offensive     Reply Quote
Ingleside

Send message
Joined: 5 Aug 04
Posts: 126
Credit: 24,435,960
RAC: 23,907
Message 47111 - Posted: 18 Sep 2013, 22:15:04 UTC - in response to Message 47021.  

However... I think you have something like 2 weeks before the upload fails. So you can simply sit back & hopefully the project might make this same change at Rutherford.

It's 90 days, if you're not running an ancient BOINC-client like v6.2.xx or something even older.
ID: 47111 · Report as offensive     Reply Quote
ProfileJoe's Climate
Avatar

Send message
Joined: 10 Dec 11
Posts: 11
Credit: 253,758
RAC: 3
Message 47174 - Posted: 26 Sep 2013, 5:39:52 UTC - in response to Message 47111.  

if you're not running an ancient BOINC-client like v6.2.xx or something even older.


You mean ancient, as in something like 6.12.x ?
;-P

(...yes, I'll upgrade sometime...soon hopefully)
ID: 47174 · Report as offensive     Reply Quote
Antti Jarvinen

Send message
Joined: 17 Oct 08
Posts: 2
Credit: 509,405
RAC: 0
Message 47244 - Posted: 7 Oct 2013, 8:59:07 UTC - in response to Message 47021.  


Yes ... you can map apid-wattch.badc.rl.ac.uk onto the IP address 130.246.191.84 (= rapid-watch.badc.rl.ac.uk)



Could it be possible to add this "typo" also to DNS record of badc.rl.ac.uk so not every user would be required to edit her HOSTS file (and then forget the change in there and when then IP addr then some sunny day actually changes, leaving here wondering how come the client now suddenly stopped connecting..)
ID: 47244 · Report as offensive     Reply Quote
ProfileMikeMarsUK
Volunteer moderator
Avatar

Send message
Joined: 13 Jan 06
Posts: 1498
Credit: 15,613,038
RAC: 0
Message 47245 - Posted: 7 Oct 2013, 11:02:14 UTC
Last modified: 7 Oct 2013, 11:05:45 UTC

a) I did already ask for that (although there is no sign that it has happened),
and b) I am not sure yet whether this remapping works or not, note the following post from earlier in the thread:


Mhhh, looks like there is another problem now. It is certainly uploading the data, but I get another error message:

12-Sep-2013 14:44:56 [climateprediction.net] Temporarily failed upload of hadcm3n_o5ss_1980_40_008385337_3_1.zip: transient upload error
12-Sep-2013 14:44:56 [climateprediction.net] Backing off 2 hr 1 min 41 sec on upload of hadcm3n_o5ss_1980_40_008385337_3_1.zip


I was hoping that someone would come back & confirm whether it works (or doesn't work) before I re-raise this with the admins.
I'm a volunteer and my views are my own.
News and Announcements and FAQ
ID: 47245 · Report as offensive     Reply Quote
Antti Jarvinen

Send message
Joined: 17 Oct 08
Posts: 2
Credit: 509,405
RAC: 0
Message 47246 - Posted: 7 Oct 2013, 11:49:02 UTC - in response to Message 47245.  

a) I did already ask for that (although there is no sign that it has happened),
and b) I am not sure yet whether this remapping works or not, note the following post from earlier in the thread:


Hmm, right now looks like this:

$ nslookup rapid-watch.badc.rl.ac.uk 130.246.135.153
Server: 130.246.135.153
Address: 130.246.135.153#53

Name: rapid-watch.badc.rl.ac.uk
Address: 130.246.191.84

$ nslookup apid-watch.badc.rl.ac.uk 130.246.135.153
Server: 130.246.135.153
Address: 130.246.135.153#53

** server can't find apid-watch.badc.rl.ac.uk.katiska.org: REFUSED

$ nslookup apid-wattch.badc.rl.ac.uk 130.246.135.153
Server: 130.246.135.153
Address: 130.246.135.153#53

** server can't find apid-wattch.badc.rl.ac.uk.katiska.org: REFUSED

$

but is this right server I'm querying..
ID: 47246 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 47248 - Posted: 7 Oct 2013, 14:38:13 UTC - in response to Message 47245.  

Mike

I would think that all unsent data sets would have been fixed soon after this was discovered.

The posts earlier in this thread were 18th September and earlier.
In this thread, VANISHING WU'S, the date for the withdrawal of the large batch in question was 19th September.

The date of creation of Antti's latest model is 27th September, so it's one of the new batch, and shouldn't have a problem. My current 6 are from this batch and are uploading OK.
I've looked at client_state for "apid", and all of the zips, both sent and pending, say "rapid-watch".

ID: 47248 · Report as offensive     Reply Quote
MarkJ
Avatar

Send message
Joined: 28 Mar 09
Posts: 126
Credit: 9,825,980
RAC: 0
Message 47249 - Posted: 7 Oct 2013, 20:33:58 UTC - in response to Message 47024.  

Mhhh, looks like there is another problem now. It is certainly uploading the data, but I get another error message:

12-Sep-2013 14:44:56 [climateprediction.net] Temporarily failed upload of hadcm3n_o5ss_1980_40_008385337_3_1.zip: transient upload error
12-Sep-2013 14:44:56 [climateprediction.net] Backing off 2 hr 1 min 41 sec on upload of hadcm3n_o5ss_1980_40_008385337_3_1.zip

Sounds like one of the problems I am having, raised in this message thread.
BOINC blog
ID: 47249 · Report as offensive     Reply Quote
ProfileMikeMarsUK
Volunteer moderator
Avatar

Send message
Joined: 13 Jan 06
Posts: 1498
Credit: 15,613,038
RAC: 0
Message 47255 - Posted: 8 Oct 2013, 10:46:07 UTC - in response to Message 47249.  
Last modified: 8 Oct 2013, 10:55:13 UTC



Sounds like one of the problems I am having, raised in this message thread.



I don't think it is the same thing. The uploads in this thread were failing because the name of the upload server was spelt wrong in the configuration file (apid-wattch), whereas your log file shows that the server name is spelt correctly (rapid-watch).

I can't see any obvious reason in your log files for it going wrong. What sort of filewall do you use? It may be worth taking a look at the firewall and antivirus logs to see if anything is appearing there (some security software blocks big zip files, for example, they may appear as a 'compression bomb' in the log).
I'm a volunteer and my views are my own.
News and Announcements and FAQ
ID: 47255 · Report as offensive     Reply Quote
MarkJ
Avatar

Send message
Joined: 28 Mar 09
Posts: 126
Credit: 9,825,980
RAC: 0
Message 47256 - Posted: 8 Oct 2013, 11:49:34 UTC - in response to Message 47255.  
Last modified: 8 Oct 2013, 11:52:06 UTC



Sounds like one of the problems I am having, raised in this message thread.



I don't think it is the same thing. The uploads in this thread were failing because the name of the upload server was spelt wrong in the configuration file (apid-wattch), whereas your log file shows that the server name is spelt correctly (rapid-watch).

I can't see any obvious reason in your log files for it going wrong. What sort of filewall do you use? It may be worth taking a look at the firewall and antivirus logs to see if anything is appearing there (some security software blocks big zip files, for example, they may appear as a 'compression bomb' in the log).


He stated he has "another" problem after fixing the hosts file (the server name) issue. Thats the same error I get. Uploads start and fail part of the way through.

As for firewall and antivirus I don't have any errors being logged for either of them.
BOINC blog
ID: 47256 · Report as offensive     Reply Quote
1 · 2 · 3 · Next

Message boards : Number crunching : failed upload: can't resolve hostname

©2024 cpdn.org