Message boards : Number crunching : Download stalled WAH2
Author | Message |
---|---|
Joined: 18 Jul 13 Posts: 438 Credit: 25,620,508 RAC: 4,981 |
Hi folks, I get a Transient HTTP error when trying to download the wah2_pnw25_zhlk_200312_24_406_010600701 model and its parts. BOINC reports "Internet access OK - project servers may be temporarily down", but the server status page looks OK. Is anyone else having download problems? Cheers |
Joined: 18 Jul 13 Posts: 438 Credit: 25,620,508 RAC: 4,981 |
A second WU is failing to download on another machine, so I have suspended those tasks and set BOINC to No New Tasks until this is resolved. The second machine is a Windows one, where the BOINC log is more informative: Failed to connect on port 80 of download.cpdn.org. |
Joined: 28 May 14 Posts: 34 Credit: 705,936 RAC: 0 |
Yeah, I'm having the same issue. I've got 2 WUs that won't download; they just keep backing off. It seems that we can't connect to the download server, but Server Status says it's 'running'. |
Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Server Status says that the computer is running, not that all of the daemons are. One or more of these must have failed. Email sent. |
Joined: 18 Jul 13 Posts: 438 Credit: 25,620,508 RAC: 4,981 |
Seems fixed. Units downloaded and now crunching. Thanks Les |
Joined: 28 May 14 Posts: 34 Credit: 705,936 RAC: 0 |
Yep, seems back on track. Thanks, Les. |
Joined: 6 Aug 04 Posts: 124 Credit: 9,195,838 RAC: 0 |
Why do I get no WUs? The log says: 04-Aug-2016 13:28:44 [climateprediction.net] No tasks are available for Weather At Home 2 (wah2). But server status shows 12700 available. Linux Users Everywhere @ BOINC |
Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
You may have to look at other message lines to work it out. However, one of your computers is crashing everything, and the other has crashed several. These 2 links may give you some ideas about your error messages: "finish file present too long" error finish file present too long |
Joined: 6 Aug 04 Posts: 124 Credit: 9,195,838 RAC: 0 |
There are no other messages. 06-Sep-2016 04:38:12 [climateprediction.net] Sending scheduler request: To fetch work. 06-Sep-2016 04:38:12 [climateprediction.net] Requesting new tasks for CPU 06-Sep-2016 04:38:14 [Einstein@Home] Started upload of LATeah0003L_656.0_0_0.0_7535850_1_0 06-Sep-2016 04:38:14 [Einstein@Home] Started upload of LATeah0003L_656.0_0_0.0_7535850_1_1 06-Sep-2016 04:38:15 [Einstein@Home] Finished upload of LATeah0003L_656.0_0_0.0_7535850_1_0 06-Sep-2016 04:38:15 [Einstein@Home] Finished upload of LATeah0003L_656.0_0_0.0_7535850_1_1 06-Sep-2016 04:38:15 [climateprediction.net] Scheduler request completed: got 0 new tasks 06-Sep-2016 04:38:15 [climateprediction.net] Project has no tasks available Linux Users Everywhere @ BOINC |
Joined: 15 May 09 Posts: 4540 Credit: 19,039,635 RAC: 18,944 |
"[climateprediction.net] Project has no tasks available" is consistent with the server status page, which shows no work at present. |
Joined: 27 Jan 05 Posts: 74 Credit: 1,047,809 RAC: 0 |
I do not recall having seen the bin empty before. Dave, yours is the only comment, and one would expect much more. I presume that the proper procedure now would be "No New Tasks", to allow orderly recovery, if one is planned. ????? |
Joined: 27 Jan 05 Posts: 74 Credit: 1,047,809 RAC: 0 |
Wrong! Desti and his reply to a previous message solves the problem. Daemons. |
Joined: 5 Aug 04 Posts: 1496 Credit: 95,522,203 RAC: 0 |
Hi folks, Reverting to Bernard's original problem: Two of my machines each have a wah2_cafr25 ... batch 468 task with five small download files hung -- swinging in the breeze for more than a day -- analogous to the situation Bernard describes. Anyone else have the problem? With batch 468 or any other tasks/batches? "We have met the enemy and he is us." -- Pogo Greetings from coastal Washington state, the scenic US Pacific Northwest. |
Joined: 18 Jul 13 Posts: 438 Credit: 25,620,508 RAC: 4,981 |
I now have wah2_sas50_cqx2_209112_13_432 hanging in download status. The project has backed off for several hours: Sun 27 Nov 2016 10:02:55 EET | climateprediction.net | Started download of wah2_sas50_cqx2_209112_13_432_010667068.zip Sun 27 Nov 2016 10:02:55 EET | climateprediction.net | Started download of restart_atmos_s005_1986-1201_rd0001.gz Sun 27 Nov 2016 10:02:57 EET | | Project communication failed: attempting access to reference site Sun 27 Nov 2016 10:02:57 EET | climateprediction.net | Temporarily failed download of wah2_sas50_cqx2_209112_13_432_010667068.zip: connect() failed Sun 27 Nov 2016 10:02:57 EET | climateprediction.net | Backing off 00:30:31 on download of wah2_sas50_cqx2_209112_13_432_010667068.zip Sun 27 Nov 2016 10:02:57 EET | climateprediction.net | Temporarily failed download of restart_atmos_s005_1986-1201_rd0001.gz: connect() failed Sun 27 Nov 2016 10:02:57 EET | climateprediction.net | Backing off 00:15:05 on download of restart_atmos_s005_1986-1201_rd0001.gz Sun 27 Nov 2016 10:02:59 EET | | Internet access OK - project servers may be temporarily down. ...and it is the weekend |
Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Perhaps it's to do with some parts being on the new servers, but links are pointing to the old servers. |
Joined: 22 Feb 06 Posts: 491 Credit: 31,033,903 RAC: 14,766 |
Looks like it. Can't ping the IP address from the log (126.67.195.140) and traceroute only gets as far as the Oxford ja.net address. Does this help? |
Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Traceroute and similar programs are blocked at the entry point to the JA network. My guess is that everyone will have to wait until sufficient parts of our system have been migrated to the new servers, etc. Which is why my computers are not only set for NNW, they're turned off. |
Joined: 18 Jul 13 Posts: 438 Credit: 25,620,508 RAC: 4,981 |
I'll wait, hoping BOINC (or some other power) won't eventually discard the WU that can't be downloaded while we wait for Oxford to bring the new servers online. I do not want to abort, as the WU may ultimately be lost due to the error limit. |
Joined: 22 Mar 06 Posts: 144 Credit: 24,695,428 RAC: 0 |
Downloads of 9 tasks have been stalled for 4 hours. I'll keep watching, but I can't see any server issues, e.g. 1/12/2016 1:33:12 PM | climateprediction.net | Temporarily failed download of wah2_pnw25_a64n_20399_16_478_010792165.zip: connect() failed 1/12/2016 1:33:13 PM | | Project communication failed: attempting access to reference site 1/12/2016 1:33:15 PM | | Internet access OK - project servers may be temporarily down. |
Joined: 18 Jul 13 Posts: 438 Credit: 25,620,508 RAC: 4,981 |
Mine is old and has been stuck for 5 days already; yours seem to be brand-new WUs, and if they are stalled then I hope fixing this will become a priority. |
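
Several posts in this thread turn on the difference between the server machine being up and its download service accepting connections (the "Failed to connect on port 80 of download.cpdn.org" and "connect() failed" log lines). A minimal Python sketch of how one might check that by hand is below. It is not part of BOINC; the function name `can_connect` and the 5-second timeout are illustrative assumptions, and only the hostname and port come from the log quoted above.

```python
import socket


def can_connect(host: str, port: int, timeout: float = 5.0) -> bool:
    """Return True if a TCP connection to host:port can be opened.

    Hypothetical helper for illustration; it only tests that something
    is listening, not that the service behind the port is healthy.
    """
    try:
        # create_connection resolves the name and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Refused connections, timeouts and DNS failures all land here.
        return False
```

Calling `can_connect("download.cpdn.org", 80)` from an affected machine would show whether the download host accepts connections at all. A result of `False` while other sites are reachable would match BOINC's "Internet access OK - project servers may be temporarily down" message.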
©2024 cpdn.org