Message boards : Number crunching : Download Failed
Author | Message |
---|---|
Joined: 15 Dec 06 Posts: 13 Credit: 2,539,487 RAC: 0 |
"Download Failed" error message, HADAM3P Pacific Northwest 6.09. WU: hadam30_pnw_8rsv_1199_1_8987_1. This is the second WU to fail within the last 1+ days. From the BOINC Event Log:
1/6/2013 7:18:04 PM | climateprediction.net | Sending scheduler request: To fetch work.
1/6/2013 7:18:04 PM | climateprediction.net | Requesting new tasks for CPU
1/6/2013 7:18:06 PM | climateprediction.net | Scheduler request completed: got 1 new tasks
1/6/2013 7:18:09 PM | climateprediction.net | Started download of hadam3p_pnw_8rsv_1999_1_007708987.zip
1/6/2013 7:18:09 PM | climateprediction.net | Started download of xaclfa.start.0000.gz
1/6/2013 7:18:11 PM | climateprediction.net | Giving up on download of hadam3p_pnw_8rsv_1999_1_007708987.zip: permanent HTTP error
1/6/2013 7:18:11 PM | climateprediction.net | Started download of so2dms_N96_1999_12_2001_02f.gz
1/6/2013 7:18:16 PM | climateprediction.net | Finished download of so2dms_N96_1999_12_2001_02f.gz
1/6/2013 7:18:16 PM | climateprediction.net | Started download of dchaba.start.pnw.b.0000.gz
1/6/2013 7:18:27 PM | climateprediction.net | Finished download of dchaba.start.pnw.b.0000.gz
1/6/2013 7:18:27 PM | climateprediction.net | Started download of HadISST_SI_N96_1999_12_2001_01f.gz
1/6/2013 7:18:29 PM | climateprediction.net | Finished download of HadISST_SI_N96_1999_12_2001_01f.gz
1/6/2013 7:18:29 PM | climateprediction.net | Started download of HadISST_SST_N96_1999_12_2001_01f.gz
1/6/2013 7:18:36 PM | climateprediction.net | Finished download of HadISST_SST_N96_1999_12_2001_01f.gz
1/6/2013 7:18:51 PM | climateprediction.net | Finished download of xaclfa.start.0000.gz
This problem is new; other WUs have been downloaded and completed successfully. Any suggestions? Thanks in advance. |
Joined: 10 Dec 11 Posts: 11 Credit: 253,758 RAC: 3 |
Check whether you have ample hard drive space. Some work units are very large. If you have BOINC running on vfat, you may want to consider running it on NTFS, since vfat is limited to 2 GB file sizes. In BOINC, there is an option to test/verify the integrity of your file downloads, since some ISPs may alter your downloads; you may want to turn that on. Make sure your computer is up to date with all patches and upgrades. If the WU fails to download, let BOINC kill the download and clean up after itself. I mention that because some projects tended to have a lot of failed downloads, and users got into the habit of killing them by hand. That sort of messed things up on the project's side, and some projects actually penalized your computer for it, making it harder to download a replacement. I don't know if any of the above will help, but hopefully it might. |
Joined: 31 Aug 04 Posts: 391 Credit: 219,896,461 RAC: 649 |
"Check if you have ample harddrive space. Some work units are very large." What you recommend sounds reasonable. There have been some broken WUs that fail to download because some files are missing on the server. This was supposed to have been fixed, but it is possible that some broken WUs are still out there on the server. Aside from that small possibility, do make sure you have space for downloads; some here on CPDN are rather large. Likewise, what Joe said about keeping the BOINC software up to date is also a good idea. |
Joined: 15 Dec 06 Posts: 13 Credit: 2,539,487 RAC: 0 |
Thank you, gentlemen. Although I think my computer (Sony VAIO, dual CPUs, Windows Vista Business) has enough disk space, I will continue to run WUs and monitor things. Some much earlier WUs were over 3,000 hrs long, with no disk space problems. Cheers, |
Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
|
Joined: 15 May 09 Posts: 4540 Credit: 19,039,635 RAC: 18,944 |
"Windows Vista Business) has enough disk space," Clicking on the Disk tab in BOINC Manager will confirm whether you have plenty of disk space. On this machine it tells me I have over 26GB free and available to BOINC. On my other machine, a netbook, it is 7.07GB free with 2.93GB used by BOINC. If the free space were to drop below 4GB on that machine, I would start looking to see whether crashed tasks were taking up space. I think 10GB available for BOINC should be fine for any dual-core machine. If I had 4 or more cores, I might want to up it a bit. |
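The rule of thumb in the post above (start investigating below 4GB free, about 10GB being comfortable for a dual-core machine) can be sketched as a small check. This is only an illustration: the function names and the "low"/"ok"/"plenty" labels are made up here, and `shutil.disk_usage` reports free space on the whole volume, which is not necessarily the figure BOINC Manager shows as available to BOINC under your disk preferences.

```python
import shutil

GB = 1024 ** 3

def classify_free_space(free_gb, warn_gb=4.0, comfortable_gb=10.0):
    """Label a free-space figure using the thresholds quoted in the thread.

    The labels are illustrative, not BOINC terminology.
    """
    if free_gb < warn_gb:
        return "low"      # time to look for crashed tasks taking up space
    if free_gb < comfortable_gb:
        return "ok"
    return "plenty"

def disk_status(path="."):
    """Return (free space in GB, label) for the volume containing `path`."""
    free_gb = shutil.disk_usage(path).free / GB
    return round(free_gb, 2), classify_free_space(free_gb)

# Check the volume holding the current directory; point this at the drive
# that holds your BOINC data directory instead.
print(disk_status("."))
```

On a machine with more cores, raising `comfortable_gb` would match the suggestion to "up it a bit".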
Joined: 15 Dec 06 Posts: 13 Credit: 2,539,487 RAC: 0 |
Dave -- 42.63 GB is available, per "Disk Space". So, the problems must reside on CPDN's side of things. No anxiety from my end, just a wait for the next WU. Cheers, to all. |
Joined: 21 Oct 10 Posts: 53 Credit: 2,101,753 RAC: 3,985 |
Same issue here. I got 2 WUs where it seems able to download all files except one, which ends in "permanent download failure", and the WU errors out... I know CPDN is going through a long period of almost no WUs, but it gives false hope when this happens. Luckily, last time I got 2 very long WUs that lasted 650 hours... hopefully I'll get others like those. |
Joined: 15 May 09 Posts: 4540 Credit: 19,039,635 RAC: 18,944 |
Check Les's last post in this thread. You might as well abort the unit; it won't ever download properly. It is a re-issue of one that has failed to download before. With a bit of luck, the stock of these is running out now, so there shouldn't be too many more of them. And as Les posted in another thread, the next batch of hadcm3n's is a few weeks away. He doesn't know about the hadam3p regional models. I have just re-enabled World Community Grid on my other machine, which had a free core. I am only allowing it to get a couple of days' work at a time in case there are some regional models coming up soon. |
Joined: 21 Oct 10 Posts: 53 Credit: 2,101,753 RAC: 3,985 |
Yes, I saw that. No problem for me; it's just that I can only run CPDN on that computer, where BOINC cannot have its own Internet access (corporate), so I move WUs with a USB key. I can only do this with long-running WUs, and only CPDN's are long enough to let me do that... so I'll wait :) |
Joined: 21 Oct 10 Posts: 53 Credit: 2,101,753 RAC: 3,985 |
I got 2 long ones! Happy :) |
Joined: 15 Dec 06 Posts: 13 Credit: 2,539,487 RAC: 0 |
Follow-up: Received a hadcm3n WU, with no problems after approx. 4% of the run. A good long one: about 2200 hrs. |
Joined: 31 Dec 07 Posts: 1152 Credit: 22,363,583 RAC: 5,022 |
2200 hours! What are you running it on, a pocket calculator? My computers are no speed demons, but even the 1.5 GHz machine finishes a hadcm3n WU in about 900 hours. |
Joined: 15 May 09 Posts: 4540 Credit: 19,039,635 RAC: 18,944 |
"2200 hours! What are you running it on, a pocket calculator?" Over 3000 hours on my dual-core Atom netbook. Less than a thousand on this machine. |
Joined: 15 May 09 Posts: 4540 Credit: 19,039,635 RAC: 18,944 |
Another download failure, in case it needs to be passed on to anyone at the project.
Sat 26 Jan 2013 12:01:02 GMT | climateprediction.net | Started download of hadcm3n_o6l4_2100_40_008239984.zip
Sat 26 Jan 2013 12:01:02 GMT | climateprediction.net | Started download of ocean_o6l4_2100_40_008239984_0.gz
Sat 26 Jan 2013 12:01:02 GMT | climateprediction.net | File SPARC_O3_rebuild_1900.gz exists already, skipping download
Sat 26 Jan 2013 12:01:03 GMT | climateprediction.net | Finished download of hadcm3n_o6l4_2100_40_008239984.zip
Sat 26 Jan 2013 12:01:03 GMT | climateprediction.net | Giving up on download of ocean_o6l4_2100_40_008239984_0.gz: permanent HTTP error
Sat 26 Jan 2013 12:01:03 GMT | climateprediction.net | Started download of atmos_o6l4_2100_40_008239984_0.gz
Sat 26 Jan 2013 12:01:03 GMT | climateprediction.net | Started download of DMSSO2NH3_1900_RCP.gz
Sat 26 Jan 2013 12:01:04 GMT | climateprediction.net | Giving up on download of atmos_o6l4_2100_40_008239984_0.gz: permanent HTTP error
Sat 26 Jan 2013 12:01:04 GMT | climateprediction.net | Finished download of DMSSO2NH3_1900_RCP.gz |
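When reporting failures like these to the project, it can help to pull just the abandoned files out of a longer Event Log excerpt. The sketch below assumes the log has been saved as text with one entry per line in the `timestamp | project | message` layout quoted in this thread; the function name `failed_downloads` is made up for illustration and is not part of BOINC.

```python
import re

# Matches the "Giving up on download of <file>: <reason>" message text
# that appears in the Event Log excerpts quoted in this thread.
GIVE_UP = re.compile(r"Giving up on download of (?P<file>\S+): (?P<reason>.+)$")

def failed_downloads(log_lines):
    """Return (timestamp, project, filename, reason) for each abandoned download."""
    failures = []
    for line in log_lines:
        # Split into at most three fields so the message text stays intact.
        parts = [part.strip() for part in line.split("|", 2)]
        if len(parts) != 3:
            continue  # not an Event Log entry in the expected layout
        timestamp, project, message = parts
        match = GIVE_UP.search(message)
        if match:
            failures.append(
                (timestamp, project, match.group("file"), match.group("reason"))
            )
    return failures

sample = [
    "Sat 26 Jan 2013 12:01:03 GMT | climateprediction.net | Finished download of hadcm3n_o6l4_2100_40_008239984.zip",
    "Sat 26 Jan 2013 12:01:03 GMT | climateprediction.net | Giving up on download of ocean_o6l4_2100_40_008239984_0.gz: permanent HTTP error",
    "Sat 26 Jan 2013 12:01:04 GMT | climateprediction.net | Giving up on download of atmos_o6l4_2100_40_008239984_0.gz: permanent HTTP error",
]

for failure in failed_downloads(sample):
    print(failure)
```

For the three sample lines, the function reports the `ocean_...` and `atmos_...` files and skips the successful `.zip` download.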
Joined: 6 Aug 04 Posts: 264 Credit: 965,476 RAC: 0 |
Me too. I have a download failure. What must I do? Tullio |
Joined: 5 Aug 04 Posts: 1496 Credit: 95,522,203 RAC: 0 |
Nothing you can do, tullio. You probably received a regenerated task from an earlier failed batch. See Les' post here: http://climateapps2.oerc.ox.ac.uk/cpdnboinc/forum_thread.php?id=7527&nowrap=true#45428 "We have met the enemy and he is us." -- Pogo Greetings from coastal Washington state, the scenic US Pacific Northwest. |
Joined: 15 Dec 06 Posts: 13 Credit: 2,539,487 RAC: 0 |
The 2200 hrs figure is the initial estimated time to completion. My dual-core Sony VAIO usually finishes before that max. figure. Some years ago, I had a WU of over 3300 hrs. (Those were the good old days.) It behooved a person to do periodic saves. . . |
Joined: 31 Dec 07 Posts: 1152 Credit: 22,363,583 RAC: 5,022 |
When running a model type that is going to take 1000+ hours, I find that making periodic backups is a must. Over a period of months something is bound to go wrong. Systems can lock up and require a cold reboot, power can fail, or system hardware can fail. Any of these can wipe out thousands of hours of work and kill a good model. Reboots after updating system software are a good time to make backups. You have to exit the model and shut down BOINC anyway, so why not make a backup at that point? |
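A backup along these lines can be as simple as archiving the BOINC data directory while BOINC is shut down. The sketch below is one way to do that under stated assumptions: the function name and the example paths are hypothetical, the location of the data directory differs between installations, and the archive is only consistent if the client and the model have been stopped first.

```python
import shutil
import time
from pathlib import Path

def backup_boinc_data(data_dir, backup_root):
    """Zip `data_dir` into a timestamped archive under `backup_root`.

    Run this only after BOINC has been shut down, so no files change
    while they are being copied. Returns the path of the new archive.
    """
    data_dir = Path(data_dir).resolve()
    backup_root = Path(backup_root).resolve()
    if not data_dir.is_dir():
        raise FileNotFoundError(f"no such data directory: {data_dir}")
    # Writing the archive inside the directory being archived would make
    # the backup try to include itself.
    if backup_root == data_dir or data_dir in backup_root.parents:
        raise ValueError("backup_root must be outside data_dir")
    backup_root.mkdir(parents=True, exist_ok=True)
    stamp = time.strftime("%Y%m%d-%H%M%S")
    base_name = backup_root / f"boinc-backup-{stamp}"
    archive = shutil.make_archive(str(base_name), "zip", root_dir=str(data_dir))
    return Path(archive)

# Example call with hypothetical paths; adjust both to your own setup:
# backup_boinc_data(r"C:\ProgramData\BOINC", r"D:\boinc-backups")
```

Restoring means unpacking the archive over the data directory, again with BOINC stopped. Keeping more than one dated archive guards against backing up a model that has already gone wrong.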
Joined: 31 Dec 07 Posts: 1152 Credit: 22,363,583 RAC: 5,022 |
It looks like we have more bad downloads. Hadam3p_pnw_2vjk_1970_1_008294027_1 is presently stuck in the transfer tab. Fortunately, I also received 2 usable downloads (a hadcm3n and a hadam3p_eu) that downloaded just fine, so the problem is not general. Maybe the PNW is from an old, flawed batch? |