Message boards : Number crunching : Download problems
Message board moderation
Author | Message |
---|---|
Send message Joined: 29 Dec 09 Posts: 34 Credit: 18,395,130 RAC: 0 |
CPDN stopped in the process of downloading HADAM3P-EU models few hours ago. All files stuck 100% in transfer. Should I abort or hope for the best? Thanks |
Send message Joined: 20 May 11 Posts: 1 Credit: 10,581 RAC: 0 |
Same issue here. Apparently this is a problem with uploader1.atm not responding. |
Send message Joined: 29 Apr 07 Posts: 5 Credit: 1,961,201 RAC: 0 |
Same here. Please inform us in case we shall abort the WU. |
Send message Joined: 6 Aug 04 Posts: 195 Credit: 28,405,498 RAC: 10,268 |
You should not need to abort models or transfers at this time. They will get completed once the server issues are resolved. “Patience is the companion of wisdom” Saint Augustine 354-430 |
Send message Joined: 7 Jan 09 Posts: 8 Credit: 177,252 RAC: 0 |
I have had a Famous model stuck downloading for about 3 days. Please advise is a my end or a project problem. I suspect that is on the project end. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
It's a project problem. The University of Oxford is 'mostly' closed for the weekend, which is a long one this time because 30th May is a "bank holiday". About 36 hours before it re-opens. Backups: Here |
Send message Joined: 7 Jan 09 Posts: 8 Credit: 177,252 RAC: 0 |
I gave up and killed the WU. Temps are spiking here and I have no AC right now so I had to stop all DC projects until later. I will be back as soon as I have AC or the temps drop. Have to report one result and I am all done with BOINC projects. Computing is now done for all projects just one more project to report; and it is down due to server problems so I can't report the work until later this week. |
Send message Joined: 3 Oct 06 Posts: 43 Credit: 8,017,057 RAC: 0 |
That seems a bit over-kill to me. Couldn't you simply have suspended all projects? |
Send message Joined: 7 Jan 09 Posts: 8 Credit: 177,252 RAC: 0 |
That seems a bit over-kill to me. Couldn't you simply have suspended all projects? I am totally offline now for BOINC looks like it could be over a month depending on weather until I restart BOINC. I have the program off so it would have been a long time until I got around to downloading the 3 missing files. I figured it would be faster/easier for the project to kill the one stuck WU. I reported the stuck waiting to report SETI beta result about 24 hours after I first posted and then killed the stuck downloading CPDN WU and shut it all down. With BOINC offline it is finally cool enough in the apartment to sleep again. Also with the long potential down time I set no new tasks reported all work before the shutdown and only then killed the stuck CPDN WU, it was only one WU. |
Send message Joined: 5 May 11 Posts: 9 Credit: 53,072 RAC: 0 |
Hey guys, are your download servers down?? Since hours I'm getting these status reports from BOINC and it would be nice, if I could start with that task, since it is a very long "UK Met Office Coupled Model Full Resolution Ocean v6.07" and is already calculated by BOINC to need >1000 hours! Is there maybe another way to download these files?: 02.06.2011 14:43:07 climateprediction.net Started download of hadcm3n_o17t_1940_40_007264856.zip 02.06.2011 14:43:07 climateprediction.net Started download of SPARC_O3_rebuild_1900.gz 02.06.2011 14:43:28 Project communication failed: attempting access to reference site 02.06.2011 14:43:28 climateprediction.net Temporarily failed download of hadcm3n_o17t_1940_40_007264856.zip: connect() failed 02.06.2011 14:43:28 climateprediction.net Backing off 2 hr 28 min 4 sec on download of hadcm3n_o17t_1940_40_007264856.zip 02.06.2011 14:43:28 climateprediction.net Temporarily failed download of SPARC_O3_rebuild_1900.gz: connect() failed 02.06.2011 14:43:28 climateprediction.net Backing off 3 hr 30 min 35 sec on download of SPARC_O3_rebuild_1900.gz 02.06.2011 14:43:28 climateprediction.net Started download of atmos_o17t_1940_40_007264856_0.gz 02.06.2011 14:43:28 climateprediction.net Started download of DMSSO2NH3_1900_RCP.gz 02.06.2011 14:43:29 Internet access OK - project servers may be temporarily down. 02.06.2011 14:43:50 Project communication failed: attempting access to reference site 02.06.2011 14:43:50 climateprediction.net Temporarily failed download of atmos_o17t_1940_40_007264856_0.gz: connect() failed 02.06.2011 14:43:50 climateprediction.net Backing off 3 hr 0 min 59 sec on download of atmos_o17t_1940_40_007264856_0.gz 02.06.2011 14:43:50 climateprediction.net Temporarily failed download of DMSSO2NH3_1900_RCP.gz: connect() failed 02.06.2011 14:43:50 climateprediction.net Backing off 2 hr 55 min 16 sec on download of DMSSO2NH3_1900_RCP.gz 02.06.2011 14:43:51 Internet access OK - project servers may be temporarily down. 02.06.2011 14:44:51 climateprediction.net Started download of sulpc_oxidants_19_A2_1990f.gz 02.06.2011 14:44:51 climateprediction.net Started download of spec3a_lw_3_asol2c_hadcm3.gz 02.06.2011 14:45:13 Project communication failed: attempting access to reference site 02.06.2011 14:45:13 climateprediction.net Temporarily failed download of sulpc_oxidants_19_A2_1990f.gz: connect() failed 02.06.2011 14:45:13 climateprediction.net Backing off 1 hr 1 min 26 sec on download of sulpc_oxidants_19_A2_1990f.gz 02.06.2011 14:45:13 climateprediction.net Temporarily failed download of spec3a_lw_3_asol2c_hadcm3.gz: connect() failed 02.06.2011 14:45:13 climateprediction.net Backing off 2 hr 6 min 30 sec on download of spec3a_lw_3_asol2c_hadcm3.gz 02.06.2011 14:45:14 Internet access OK - project servers may be temporarily down. |
Send message Joined: 23 Dec 06 Posts: 3 Credit: 704,502 RAC: 0 |
Seems to be a theme - I haven't been able to download work units for almost a week now. For the most part, I get the "Project has no jobs available", but on the odd occasion that I do get a unit, the download sits at 0.00% and/or fails completely. Not impressed. |
Send message Joined: 5 May 11 Posts: 9 Credit: 53,072 RAC: 0 |
It still didn't download anything here...now i got another one of these big tasks (so, 2x Full Resolution Ocean v6.07) and i can't get started with them, because the download doesn't work...what's wrong there, guys?? |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
A batch of RAPIT models are being created. However, each of these is being grabbed as soon as they come off the conveyor belt. With over 40,000 computers attached and most probably looking for work, the 'gimmee' messages from the computers are clogging up the Uni's network, (JANET), and also causing the servers to overload, which in turn is causing what downloads there are to fail. I'm going to suggest that computers are limited to one model at a time for a while, and that the data pool is kept blocked until the batch, (only a few thousand), is fully created. Backups: Here |
Send message Joined: 23 Dec 06 Posts: 3 Credit: 704,502 RAC: 0 |
I won't complain then - good to see that much interest in the project. As long as there's nothing wrong with Boinc or CPDN, I'm satisfied. I thought the newer builds of Boinc were the problem. I shall patiently wait for new jobs. Thanks! |
Send message Joined: 5 May 11 Posts: 9 Credit: 53,072 RAC: 0 |
And what shall I do now? Just wait or abort? Because the download of the two "Full Resolution Ocean v6.07" tasks still didn't work out... |
Send message Joined: 15 May 09 Posts: 4540 Credit: 19,039,635 RAC: 18,944 |
I have stopped requesting new work - I guess it will take a lot of people to do this to stop JANET being flooded. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
The project config has now been changed slightly to reduce what is virtually a "denial of service" attack on the servers by the huge number of computers wanting work. It should mean that less models are sent to each computer. Anyone aborting models stuck in downloads may not get even the start of one for a while afterwards. Backups: Here |
Send message Joined: 15 Jan 11 Posts: 175 Credit: 6,242,691 RAC: 699 |
Hi Everyone, I think that the problems here are to some extent 'good news'. It means that there are huge numbers of people wanting to contribute to this project. It's extremely rare (if not previously unknown) for any research project to have more resources that it can usefully utilise at a particular moment in time. I've done the same thing as Dave, suspended my requests and and am now running tasks from another project, (Malaria research in Switzerland) in order to usefully use my spare resources. I check on most days on the current state of events with this project so I can resume when necessary. Cheers to all involved in this project. David |
Send message Joined: 6 Aug 04 Posts: 264 Credit: 965,476 RAC: 0 |
I am running 6 BOINC projects with a very short cache (0.25 days) which means that I get a new WU only when the preceding one has been completed and uploaded. But I have several results in a pending state, especially in SETI@home, because people download too many WUs not to remain without supplies in lean times. Tullio |
Send message Joined: 5 May 11 Posts: 9 Credit: 53,072 RAC: 0 |
I still don't have any change...both of the "Full Resolution Ocean v6.07" tasks are still trying to be downloaded...is there any chance, that it will happen soon? I really would like to crunch them both! :) Cheers |
©2024 cpdn.org