Message boards : Number crunching : Replication and error counts
Message board moderation
Author | Message |
---|---|
Send message Joined: 6 Apr 05 Posts: 17 Credit: 744,057 RAC: 0 |
No good deed goes unpunished. I was trying to give other folks a shot at some of the new regional models (since I already had as many as I thought I could handle), so I aborted some I had not started yet. Since then, I've adjusted my queue depth to something more reasonable. I was expecting that the WU's would be reassigned to other crunchers. Unfortunately, some of these WU's already had two other errors of various types from other machines. They are now unavailable to anyone because of "too many error results". I would like to suggest that "aborted by user" and "detached from project" not be counted against the WU as errors. These are not issues with either BOINC or the science, and CPDN is being hurt by it. =Mike |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
It's a problem with the way the BOINC server code works. This is a well know problem, and has been discussed behind the scenes for a year or so. Basically, the server issues ALL the models that are specified by the values for the 3 lines before the list right at the start, and then waits for results. Data sets are only re-issued if the failures don't exceed the limits, and aborting models will make it exceed these limits. Once people have a model that's more than they need, the best solution is to Suspend the excess for latter processing. Backups: Here |
Send message Joined: 6 Apr 05 Posts: 17 Credit: 744,057 RAC: 0 |
|
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Tolu has left the project, and Milo is on a long awaited and several times postponed holiday. So I'm thinking that the answer is: No. Backups: Here |
Send message Joined: 29 Sep 04 Posts: 2363 Credit: 14,611,758 RAC: 0 |
I agree with Les. It is nearly always better to suspend extra climate models and process them later rather than aborting them. If you have too many models and cannot complete some before their deadline, do not worry. The CPDN servers accept results uploaded after model deadlines and late results will be used by the researchers. (This is only true for the CPDN servers, not for other projects.) Don't give any importance to the Boinc message 'Too many error results' on workunit pages. It doesn't apply to CPDN and is there because other projects need it. Cpdn news |
©2024 cpdn.org