Message boards : Number crunching : May Project reject hosts?
Message board moderation
Author | Message |
---|---|
Send message Joined: 28 Aug 04 Posts: 90 Credit: 2,736,552 RAC: 0 |
Hello, surfing through the cp sites I found one interesting host: <a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/results.php?hostid=79040">http://climateapps2.oucs.ox.ac.uk/cpdnboinc/results.php?hostid=79040</a> this host loads wus and is crashing them permanently. Is there any way to reject this host from getting new work from the project side? This may be an interesting point in cases of workunit shortage. ciao, tom The best source for asking problems with BOINC and the ClimatePrediction client |
Send message Joined: 5 Aug 04 Posts: 390 Credit: 2,475,242 RAC: 0 |
Hi tom, i'm afraid this host doesn't meet CPDN requirements; 350MHz Pentium is propably not up to CPDN project which is quite demanding. I would suggest to try another BOINC project with this machine. See Recommended System Requirements http://climateapps2.oucs.ox.ac.uk/cpdnboinc/tech_faq_boinc.php > Hello, > > surfing through the cp sites I found one interesting host: > > <a> href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/results.php?hostid=79040">http://climateapps2.oucs.ox.ac.uk/cpdnboinc/results.php?hostid=79040</a> > > this host loads wus and is crashing them permanently. Is there any way to > reject this host from getting new work from the project side? This may be an > interesting point in cases of workunit shortage. > > ciao, tom > |
Send message Joined: 28 Aug 04 Posts: 90 Credit: 2,736,552 RAC: 0 |
Hi Honza, for information: this host belongs not to me *g*. It belongs to user <a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/show_user.php?userid=33275">fenix</a>. ciao, tom The best source for asking problems with BOINC and the ClimatePrediction client |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Some sort of filter may need to be implemented for the coupled ocean model if people are going to ignore the minimum hardware requirements. In which case in would probably apply to all model types. 'fenix' appears to be running automaticly, without checking what's happening. For now, an email from the admins is the only answer. Les |
Send message Joined: 17 Aug 04 Posts: 753 Credit: 9,804,700 RAC: 0 |
I was initially opposed to enforcing a minimum requirement when BOINC was launched, because we have had a few people successfully completing runs in Classic with very slow computers. Carl was not sure whether the BOINC version would be more or less demanding, and part of the purpose of the project is to educate people about climate. None of wanted to unnecessarily exclude people just because they could not afford a new machine. It may be, though, that the team should do so now, but I don't know whether it is practicable for them to establish whether there are underpowered machines that are succeeding in running without problems. The other aspect is that there are many reasons why WUs might fail repeatedly, besides lack of cycles. I think this has arisen in SETI as well. A daily download limit does not screen these out and I wonder whether there might not be another solution. For example, flagging up those who had exceeded a limit over a longer period. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
<a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/show_user.php?userid=4309"> This user</a> has a similar problem on several computers. One very slow machine, but also others, including a 3.2G laptop. Are well. :( I'm not involved in the science side of this. Les |
Send message Joined: 1 Nov 04 Posts: 185 Credit: 4,166,063 RAC: 857 |
> I was initially opposed to enforcing a minimum requirement when BOINC was > launched, because we have had a few people successfully completing runs in > Classic with very slow computers. Carl was not sure whether the BOINC version > would be more or less demanding, and part of the purpose of the project is to > educate people about climate. None of wanted to unnecessarily exclude people > just because they could not afford a new machine. > It may be, though, that the team should do so now, but I don't know whether it > is practicable for them to establish whether there are underpowered machines > that are succeeding in running without problems. The other aspect is that > there are many reasons why WUs might fail repeatedly, besides lack of cycles. > I think this has arisen in SETI as well. A daily download limit does not > screen these out and I wonder whether there might not be another solution. For > example, flagging up those who had exceeded a limit over a longer period. They seem to have something like it over @Einstein. At least that's the way I read <a href="http://einstein.phys.uwm.edu/forum_thread.php?id=603">this thread.</a> Grüße vom Sänger |
Send message Joined: 17 Aug 04 Posts: 753 Credit: 9,804,700 RAC: 0 |
> They seem to have something like it over @Einstein. At least that's the way I read <a href="http://einstein.phys.uwm.edu/forum_thread.php?id=603">this thread.</a> > Not sure about that. That thread seems to be discussing the rejection of clients that will not complete work before a deadline. That is a very unreliable measure, and anyway the user may well respond by upgrading/leaving their machine on longer/give up when they realise how long a CPDN unit is likely to take. The server cannot have that conversation with the CPDN user! I seem to recall that there is the facility in BOINC for the server to refuse to allocate WUs to a client with less than the minimum CPU specification, just as it does now if there is insufficient disk space. My concern is whether the project has sufficient information on which to base that limit. Set it too low and it will do little good. Set it too high, and you will rightly get complaints from people who have spent xx months completing a run and who are then told that their help is no longer welcome. It would be sensible to set limits on participation in the coupled ocean or even sulphur experiments, as Les Bayliss suggested, but less powered machines would not be excluded by that from participating altogether. And we would still have the problems of the powerful machines that are still not able to run CPDN, as Les also points out. |
Send message Joined: 28 Aug 04 Posts: 90 Credit: 2,736,552 RAC: 0 |
> I seem to recall that there is the facility in BOINC for the server to refuse > to allocate WUs to a client with less than the minimum CPU specification, just > as it does now if there is insufficient disk space. My concern is whether the > project has sufficient information on which to base that limit. Set it too low > and it will do little good. Set it too high, and you will rightly get > complaints from people who have spent xx months completing a run and who are > then told that their help is no longer welcome. It would be sensible to set > limits on participation in the coupled ocean or even sulphur experiments, as > Les Bayliss suggested, but less powered machines would not be excluded by that > from participating altogether. And we would still have the problems of the > powerful machines that are still not able to run CPDN, as Les also points out. > So it seems for me that there exists only one way to solve this kind of problem: Hosts that causes wus to crash permanently and that never send back trickles are most likely to not fulfil the minimum hardware requirements and should therefore locked out at least temporary from getting further work units. But on thinking about this I'm asking is this really a serious problem? Perhaps, but perhaps not. ciao, tom The best source for asking problems with BOINC and the ClimatePrediction client |
Send message Joined: 5 Aug 04 Posts: 66 Credit: 2,146,056 RAC: 0 |
> > I seem to recall that there is the facility in BOINC for the server to > refuse > > to allocate WUs to a client with less than the minimum CPU specification, > just > > as it does now if there is insufficient disk space. My concern is whether > the > > project has sufficient information on which to base that limit. Set it > too low > > and it will do little good. Set it too high, and you will rightly get > > complaints from people who have spent xx months completing a run and who > are > > then told that their help is no longer welcome. It would be sensible to > set > > limits on participation in the coupled ocean or even sulphur experiments, > as > > Les Bayliss suggested, but less powered machines would not be excluded by > that > > from participating altogether. And we would still have the problems of > the > > powerful machines that are still not able to run CPDN, as Les also points > out. > > > > So it seems for me that there exists only one way to solve this kind of > problem: Hosts that causes wus to crash permanently and that never send back > trickles are most likely to not fulfil the minimum hardware requirements and > should therefore locked out at least temporary from getting further work > units. > > But on thinking about this I'm asking is this really a serious problem? > Perhaps, but perhaps not. > > ciao, tom > Boinc runs exponential backoff when clients fail to connect to servers to prevent a massive buildup of pent up comms demand swamping the servers when they come back up. Could the maximum number of WU also be made to back off exponentially? I.e. five today, only three tomorrow, just one the next day, another one two days later, and no more for a week, etc. This way, machines that simply chew up WUs would soon be timed out for quite long periods. On the other hand, machines with temporary problems that are quite quickly fixed would soon get back on track. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
That's an idea, KeeperC. I thought of something after last posting: Have access to the new models be through a notice page: First, give an example time for completion on a reasonably fast machine. Then say that overclocking is known to cause instability in processing, so don't do it. Recommend that the models be the only dc project running on that computer etc Finally, say that the 'standard' model could still be run by those with slow computers. And insert the button to access the download at the bottom of the page. That way, users would have little excuse to complain about crashes. Which would probably not stop some. :( Les |
©2024 cpdn.org