climateprediction.net (CPDN) home page
Thread 'Too many everything...'

Thread 'Too many everything...'

Message boards : Number crunching : Too many everything...
Message board moderation

To post messages, you must log in.

AuthorMessage
Robi

Send message
Joined: 30 Aug 04
Posts: 7
Credit: 236,201
RAC: 0
Message 35822 - Posted: 5 Jan 2009, 13:53:14 UTC

I don\'t understand this WU.
It says \"errors: Too many error results Too many total results\"
On the list I seem to be the only one out of 8 that is actually doing some work on it. Everybody else had either \"client error\", 1 \"client detached\" and 2 \"didn\'t need\"?
What does that mean: \"Too many error results\"?
What does: \"Too many total results\" mean?

I suppose my work in progress will still be useful, as it\'s still chugging silently along...
Robi
ID: 35822 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 35823 - Posted: 5 Jan 2009, 15:54:10 UTC

That part of the software is used by other projects, but here the info is fairly meaningless. Just ignore it. The only thing of use on that page is the list of other models running, if YOU start to have a problem with yours and you want to see how others are doing.

This matter has been mentioned a lot of times, and you may be able to find these posts by using the search option at the top of this page if you want more information.

ID: 35823 · Report as offensive     Reply Quote
Robi

Send message
Joined: 30 Aug 04
Posts: 7
Credit: 236,201
RAC: 0
Message 35824 - Posted: 5 Jan 2009, 17:31:33 UTC - in response to Message 35823.  

That part of the software is used by other projects, but here the info is fairly meaningless. Just ignore it. The only thing of use on that page is the list of other models running, if YOU start to have a problem with yours and you want to see how others are doing.

This matter has been mentioned a lot of times, and you may be able to find these posts by using the search option at the top of this page if you want more information.


Hi Les, thank you for the explanation.

BTW I did try the search before I posted :) I also looked through the FAQs, but couldn\'t find anything that would explain it, at least I didn\'t see anything on the \"too many\" search that I entered (right now only my post shows up if I search for \"too many\") ;)

...and as I mentioned, I seem to have the ONLY model running %-/
But again, thanks for the explanation. Now I know that all I need to do is just keep it up & running :)
Robi
ID: 35824 · Report as offensive     Reply Quote
Profilemo.v
Volunteer moderator
Avatar

Send message
Joined: 29 Sep 04
Posts: 2363
Credit: 14,611,758
RAC: 0
Message 35825 - Posted: 5 Jan 2009, 18:01:13 UTC

Before Christmas we were discussing with Milo, one of the CPDN programmers, whether it\'s possible to get rid of at least some of these phrases that mislead lots of our members. Now that the holidays are over we\'ll need to restart this discussion. The problem is that this stuff is built into the BOINC-provided web pages because most projects need it all.

To maximise your chances of completing the model, have a look at the CPDN README collections. There\'s a link in my signature. It\'s very useful to select a backup method from the collection so that if your model does crash you can restore and continue it.
Cpdn news
ID: 35825 · Report as offensive     Reply Quote
Robi

Send message
Joined: 30 Aug 04
Posts: 7
Credit: 236,201
RAC: 0
Message 35840 - Posted: 6 Jan 2009, 3:42:10 UTC - in response to Message 35825.  

Before Christmas we were discussing with Milo, one of the CPDN programmers, whether it\'s possible to get rid of at least some of these phrases that mislead lots of our members. Now that the holidays are over we\'ll need to restart this discussion. The problem is that this stuff is built into the BOINC-provided web pages because most projects need it all.

To maximise your chances of completing the model, have a look at the CPDN README collections. There\'s a link in my signature. It\'s very useful to select a backup method from the collection so that if your model does crash you can restore and continue it.


Hi mo.v,
if and when you restart those discussions, it would be grand to implement a CPDN own backup strategy and not rely on users who might or rather not do backups of

  • their systems
  • BOINC
  • CPDN


or the ones that do a backup don\'t stop CPDN... and then have a crash of the model...


Robi
ID: 35840 · Report as offensive     Reply Quote
Profilemo.v
Volunteer moderator
Avatar

Send message
Joined: 29 Sep 04
Posts: 2363
Credit: 14,611,758
RAC: 0
Message 35846 - Posted: 6 Jan 2009, 15:08:00 UTC
Last modified: 6 Jan 2009, 15:08:57 UTC

There are now some other projects with fairly long tasks that would also benefit. However, as far as I know nobody has yet thought of a way to make BOINC completely exit from itself (ie stop running completely) but still remain active to back itself up. It would have to be done by BOINC itself.
Cpdn news
ID: 35846 · Report as offensive     Reply Quote
Robi

Send message
Joined: 30 Aug 04
Posts: 7
Credit: 236,201
RAC: 0
Message 35850 - Posted: 6 Jan 2009, 18:14:46 UTC - in response to Message 35846.  

There are now some other projects with fairly long tasks that would also benefit. However, as far as I know nobody has yet thought of a way to make BOINC completely exit from itself (ie stop running completely) but still remain active to back itself up. It would have to be done by BOINC itself.


what I meant to say is, that CPDN should back up its own task(s) and upon error (if it\'s not computation but crash) restore to its last \"checkpoint\"?
Id est, let each project backup its own tasks.

yes, it would be nice if BOINC could implement an automatic backup strategy (especially for the service installation), but with my meager 3 projects, CPDN seems to be the only one with crashing problems (note: so far I haven\'t had a crash on CPDN with this system [knock on wood] although 4 systems have this far quit on me :(... )

Robi
ID: 35850 · Report as offensive     Reply Quote
Robi

Send message
Joined: 30 Aug 04
Posts: 7
Credit: 236,201
RAC: 0
Message 35853 - Posted: 6 Jan 2009, 19:32:17 UTC - in response to Message 35846.  

There are now some other projects with fairly long tasks that would also benefit. However, as far as I know nobody has yet thought of a way to make BOINC completely exit from itself (ie stop running completely) but still remain active to back itself up. It would have to be done by BOINC itself.


I think there are some ways for XP and Vista running as BOINC service:
[code]net stop BOINC
backup
net start BOINC[/code]

or
[code]sc stop BOINC
backup
sc start BOINC[/code]

should do the trick...
Robi
ID: 35853 · Report as offensive     Reply Quote
ProfileIain Inglis

Send message
Joined: 9 Jan 07
Posts: 467
Credit: 14,549,176
RAC: 317
Message 35855 - Posted: 6 Jan 2009, 23:03:13 UTC - in response to Message 35850.  

what I meant to say is, that CPDN should back up its own task(s) and upon error (if it\'s not computation but crash) restore to its last \"checkpoint\"

The CPDN science application does this already: it will rewind to the start of the day in which the crash occurs and try again - that\'s rather closer than a checkpoint. The checkpoint mechanism will also have saved many quick crashes, when the computer reboots and the science application just restarts from the last checkpoint. Lots of models that complete sucessfully will actually have been automatically recovered, possibly many times, without the user noticing.

The use of manual backups simply means that BOINC/CPDN is pessimistic relative to experienced users - i.e. more models can be restarted than BOINC/CPDN supposes. It may be that more completions could be achieved by making the joint BOINC/CPDN error handling more subtle and optimistic than by inventing an inevitably problematic backup mechanism.
ID: 35855 · Report as offensive     Reply Quote

Message boards : Number crunching : Too many everything...

©2024 cpdn.org