Message boards : Number crunching : failed upload: can't resolve hostname
Message board moderation
Author | Message |
---|---|
Send message Joined: 4 Sep 13 Posts: 9 Credit: 672,309 RAC: 0 |
Hi, I just started calculating some days ago but have problems with the uploads. Does anyone have a clue how I can fix this? Thanks! 11-Sep-2013 11:17:37 [climateprediction.net] Started upload of hadcm3n_o5ss_1980_40_008385337_3_1.zip 11-Sep-2013 11:17:40 [---] [http_debug] [ID#2] Info: Could not resolve host: apid-wattch.badc.rl.ac.uk (Domain name not found) 11-Sep-2013 11:17:40 [---] [http_debug] [ID#2] Info: Expire cleared 11-Sep-2013 11:17:40 [---] [http_debug] [ID#2] Info: Closing connection #0 11-Sep-2013 11:17:40 [---] [http_debug] HTTP error: Couldn't resolve host name 11-Sep-2013 11:17:41 [---] Project communication failed: attempting access to reference site 11-Sep-2013 11:17:41 [climateprediction.net] Temporarily failed upload of hadcm3n_o5ss_1980_40_008385337_3_1.zip: can't resolve hostname 11-Sep-2013 11:17:41 [climateprediction.net] Backing off 3 hr 29 min 24 sec on upload of hadcm3n_o5ss_1980_40_008385337_3_1.zip |
Send message Joined: 17 Nov 07 Posts: 142 Credit: 4,271,370 RAC: 0 |
Hi bernadinho, I see your computer is running other CPDN models as well as that one. How did the uploads for those other models go? If the other models have uploaded their ..._1.zip files OK, I would just 'abort' the one with the HTTP error. If not, do they all specify the same host, apid-wattch.badc.rl.ac.uk? |
Send message Joined: 4 Sep 13 Posts: 9 Credit: 672,309 RAC: 0 |
It looks to me that none of the uploads works, and yes, they all try to use the same server... Could not resolve host: apid-wattch.badc.rl.ac.uk (Domain name not found) |
Send message Joined: 13 Jan 06 Posts: 1498 Credit: 15,613,038 RAC: 0 |
I wonder if that is supposed to be 'rapid-watch' rather than 'apid-wattch'. I have sent a query off to the admins. If it turns out to be a typo, then it may be possible for the Rutherford Appleton labs to set up a redirect on that address. -- Edit: Andy has confirmed - it is a typo in the model definition (it should be rapid-watch). I'm a volunteer and my views are my own. News and Announcements and FAQ |
Send message Joined: 4 Sep 13 Posts: 9 Credit: 672,309 RAC: 0 |
Thanks, than it is clear why the upload does not work. Can I fix it on my end somehow? |
Send message Joined: 13 Jan 06 Posts: 1498 Credit: 15,613,038 RAC: 0 |
Thanks, than it is clear why the upload does not work. Can I fix it on my end somehow? Yes ... you can map apid-wattch.badc.rl.ac.uk onto the IP address 130.246.191.84 (= rapid-watch.badc.rl.ac.uk) First, I will try to explain how this works: Addresses on the internet are actually all numeric, even although we see textual names. When your computer wants to know what apid-wattch.badc.rl.ac.uk means, it will ask a 'DNS' (domain name server) to translate it into a numeric internet protocol address (IP address). In this case, the server will reply that the address is unknown. However, if it had been supplied with rapid-watch.badc.rl.ac.uk instead, the DNS server would have replied with the magic number 130.246.191.84. Prior to asking the DNS server, the computer actually first checks a local list of hostnames & their IP addresses. We can add apid-wattch to this list on your computer. On Windows, this is done by finding the file 'hosts', and editing it. On my PC, it is in the location C:\WINDOWS\system32\drivers\etc Add the follwing line to the end of the file: 130.246.191.84 apid-wattch.badc.rl.ac.uk # redirecting apid-wattch to rapid-watch for CPDN (The bit after the # is just a comment). Note that this 'hosts' file is a system file, and it may be hidden (depending on the options in your windows explorer). Therefore a firewall / antivirus may try to prevent you from changing it. However... I think you have something like 2 weeks before the upload fails. So you can simply sit back & hopefully the project might make this same change at Rutherford. I'm a volunteer and my views are my own. News and Announcements and FAQ |
Send message Joined: 4 Sep 13 Posts: 9 Credit: 672,309 RAC: 0 |
Thank you! I added it to my /etc/hosts file and it seems to be working. It should buy the project admins some time :) |
Send message Joined: 4 Sep 13 Posts: 9 Credit: 672,309 RAC: 0 |
Mhhh, looks like there is another problem now. It is certainly uploading the data, but I get another error message: 12-Sep-2013 14:44:56 [climateprediction.net] Temporarily failed upload of hadcm3n_o5ss_1980_40_008385337_3_1.zip: transient upload error 12-Sep-2013 14:44:56 [climateprediction.net] Backing off 2 hr 1 min 41 sec on upload of hadcm3n_o5ss_1980_40_008385337_3_1.zip |
Send message Joined: 13 Jan 06 Posts: 1498 Credit: 15,613,038 RAC: 0 |
Well, lets see if the address is accessible for you. Try visiting both of these in turn from your PC ... in theory you should get the same (minimal) web page on both. http://apid-wattch.badc.rl.ac.uk/ http://rapid-watch.badc.rl.ac.uk/ If I look at them, only the second will work (since I have not touched 'hosts' on my PC). I'm a volunteer and my views are my own. News and Announcements and FAQ |
Send message Joined: 4 Sep 13 Posts: 9 Credit: 672,309 RAC: 0 |
These links work with my updated hosts file. I also see that the file is uploading since there is outgoing traffic, it is just that I get this 'transient upload error' now. Maybe I should just wait a bit until the whole project is finished... |
Send message Joined: 13 Jan 06 Posts: 1498 Credit: 15,613,038 RAC: 0 |
Well, what I was worried about was if the server was checking the incoming host name somehow (some web servers do this). I'm a volunteer and my views are my own. News and Announcements and FAQ |
Send message Joined: 5 Aug 04 Posts: 127 Credit: 24,498,085 RAC: 21,454 |
However... I think you have something like 2 weeks before the upload fails. So you can simply sit back & hopefully the project might make this same change at Rutherford. It's 90 days, if you're not running an ancient BOINC-client like v6.2.xx or something even older. |
Send message Joined: 10 Dec 11 Posts: 11 Credit: 253,758 RAC: 3 |
if you're not running an ancient BOINC-client like v6.2.xx or something even older. You mean ancient, as in something like 6.12.x ? ;-P (...yes, I'll upgrade sometime...soon hopefully) |
Send message Joined: 17 Oct 08 Posts: 2 Credit: 509,405 RAC: 0 |
Could it be possible to add this "typo" also to DNS record of badc.rl.ac.uk so not every user would be required to edit her HOSTS file (and then forget the change in there and when then IP addr then some sunny day actually changes, leaving here wondering how come the client now suddenly stopped connecting..) |
Send message Joined: 13 Jan 06 Posts: 1498 Credit: 15,613,038 RAC: 0 |
a) I did already ask for that (although there is no sign that it has happened), and b) I am not sure yet whether this remapping works or not, note the following post from earlier in the thread: Mhhh, looks like there is another problem now. It is certainly uploading the data, but I get another error message: I was hoping that someone would come back & confirm whether it works (or doesn't work) before I re-raise this with the admins. I'm a volunteer and my views are my own. News and Announcements and FAQ |
Send message Joined: 17 Oct 08 Posts: 2 Credit: 509,405 RAC: 0 |
a) I did already ask for that (although there is no sign that it has happened), Hmm, right now looks like this: $ nslookup rapid-watch.badc.rl.ac.uk 130.246.135.153 Server: 130.246.135.153 Address: 130.246.135.153#53 Name: rapid-watch.badc.rl.ac.uk Address: 130.246.191.84 $ nslookup apid-watch.badc.rl.ac.uk 130.246.135.153 Server: 130.246.135.153 Address: 130.246.135.153#53 ** server can't find apid-watch.badc.rl.ac.uk.katiska.org: REFUSED $ nslookup apid-wattch.badc.rl.ac.uk 130.246.135.153 Server: 130.246.135.153 Address: 130.246.135.153#53 ** server can't find apid-wattch.badc.rl.ac.uk.katiska.org: REFUSED $ but is this right server I'm querying.. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Mike I would think that all unsent data sets would have been fixed soon after this was discovered. The posts earlier in this thread were 18th September and earlier. In this thread, VANISHING WU'S, the date for the withdrawal of the large batch in question was 19th September. The date of creation of Antti's latest model is 27th September, so it's one of the new batch, and shouldn't have a problem. My current 6 are from this batch and are uploading OK. I've looked at client_state for "apid", and all of the zips, both sent and pending, say "rapid-watch". |
Send message Joined: 28 Mar 09 Posts: 126 Credit: 9,825,980 RAC: 0 |
Mhhh, looks like there is another problem now. It is certainly uploading the data, but I get another error message: Sounds like one of the problems I am having, raised in this message thread. BOINC blog |
Send message Joined: 13 Jan 06 Posts: 1498 Credit: 15,613,038 RAC: 0 |
Sounds like one of the problems I am having, raised in this message thread. I don't think it is the same thing. The uploads in this thread were failing because the name of the upload server was spelt wrong in the configuration file (apid-wattch), whereas your log file shows that the server name is spelt correctly (rapid-watch). I can't see any obvious reason in your log files for it going wrong. What sort of filewall do you use? It may be worth taking a look at the firewall and antivirus logs to see if anything is appearing there (some security software blocks big zip files, for example, they may appear as a 'compression bomb' in the log). I'm a volunteer and my views are my own. News and Announcements and FAQ |
Send message Joined: 28 Mar 09 Posts: 126 Credit: 9,825,980 RAC: 0 |
He stated he has "another" problem after fixing the hosts file (the server name) issue. Thats the same error I get. Uploads start and fail part of the way through. As for firewall and antivirus I don't have any errors being logged for either of them. BOINC blog |
©2024 cpdn.org