Thread 'HADAM3P - Maximum elapsed time exceeded'

old_user587059

Joined: 3 Sep 09
Posts: 5
Credit: 509,410
RAC: 0
Message 42818 - Posted: 26 Aug 2011, 22:10:23 UTC

All hadam3p WU's that I have received during the past week or so have shown an estimated run time between 500 hrs and 1200 hrs with a 'due date' approximately 3 months hence. The WU's run for several days with little or no decrease in remaining estimated run time, resulting in the WU running at high priority and blocking any other project from running on single core machines. Some have eventually settled down after a few days and started showing a systematic decrease in remaining run time, but others continued to run at high priority for many days until I finally aborted them. I have therefore suspended the project on my single core machines and only allow it to run on my three multicore machines (under two different user IDs), which can still run other projects while the climate model is running at high priority. Just thought someone might like to know...
ID: 42818
Greg van Paassen

Joined: 17 Nov 07
Posts: 142
Credit: 4,271,370
RAC: 0
Message 42819 - Posted: 27 Aug 2011, 0:39:13 UTC - in response to Message 42818.  
Last modified: 27 Aug 2011, 0:46:31 UTC

As I understand it, the BOINC time estimation algorithm works better the more models the PC has run to completion. So the more models you let run, the better the initial estimate will be.

During the course of a model run, BOINC seems to stick with its original estimate much longer than we humans do. In my experience it doesn't get really accurate till about 90% completed, or even later. There's nothing we can do about this.
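
One way to picture why the estimate clings to its starting value: suppose the client blends the initial estimate with a projection based on progress so far, trusting the projection more as the fraction done grows. The sketch below is a simplified assumption for illustration, not BOINC's actual code; the function name `blended_remaining_hours` and the linear weighting are hypothetical, and the numbers are made up.

```python
def blended_remaining_hours(initial_estimate, elapsed, fraction_done):
    """Estimate hours remaining by blending two guesses (illustrative only).

    initial_estimate: total run time guessed before the task started
    elapsed:          hours run so far
    fraction_done:    progress reported by the model, from 0.0 to 1.0
    """
    if fraction_done <= 0:
        # No progress information yet, so only the initial guess is available.
        return initial_estimate
    # Total run time implied by the progress observed so far.
    projected_total = elapsed / fraction_done
    # Hypothetical linear blend: early on the initial guess dominates.
    blended_total = ((1 - fraction_done) * initial_estimate
                     + fraction_done * projected_total)
    return max(blended_total - elapsed, 0.0)


# A task first estimated at 1200 hours that really needs about 300 hours.
for fraction in (0.1, 0.5, 0.9):
    elapsed_hours = 300 * fraction
    print(fraction, blended_remaining_hours(1200, elapsed_hours, fraction))
```

With these made-up numbers the estimate is still about 1080 hours at 10% done and about 120 hours at 90% done, when only 30 hours actually remain, which matches the feeling that the figure only becomes believable near the end.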

So it's a case of just letting BOINC do its thing, if you still want to contribute. In the long run, things will work out. (That is, after finishing the CPDN model, the PC will spend its time with other projects, to work off its "time deficit".)

Oh and yes, the CPDN models do take weeks (or months!) to run. That's normal.

Cheers.
ID: 42819
w1hue

Joined: 31 Aug 05
Posts: 20
Credit: 1,969,695
RAC: 0
Message 42820 - Posted: 27 Aug 2011, 4:36:35 UTC - in response to Message 42819.  
Last modified: 27 Aug 2011, 4:49:10 UTC

> Oh and yes, the CPDN models do take weeks (or months!) to run. That's normal.

And while it's running at high priority on a single core machine, nothing else gets done until one of the other WU's gets near its 'due date'. The problem would be resolved if the due date (of the HADAM3P WU's) was made much longer. Previous ClimatePrediction WU's typically had a due date about one year in the future -- in which case the run time estimate was always much less than the time until the due date.
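
The arithmetic behind this complaint can be sketched roughly as follows. This is a simplified assumption, not BOINC's real scheduler: the function `would_miss_deadline` and its `resource_share` parameter (the fraction of the machine's time the project normally gets) are hypothetical, and the numbers are only examples in the range mentioned in this thread.

```python
def would_miss_deadline(est_remaining_hours, hours_until_deadline,
                        resource_share=1.0):
    """Return True if the task cannot finish in time at its normal share.

    A True result is the situation in which a client would plausibly stop
    sharing the CPU and run the task at high priority.
    """
    if resource_share <= 0:
        # The task never gets CPU time, so it can never finish.
        return True
    # Wall-clock hours needed when the task only gets part of the CPU.
    wall_hours_needed = est_remaining_hours / resource_share
    return wall_hours_needed > hours_until_deadline


# 1200-hour estimate, CPU shared equally with one other project.
three_months = would_miss_deadline(1200, 90 * 24, resource_share=0.5)
one_year = would_miss_deadline(1200, 365 * 24, resource_share=0.5)
print(three_months, one_year)
```

Under these assumptions the 1200-hour task needs 2400 wall-clock hours at a half share, which overshoots a 90-day (2160-hour) deadline but fits easily inside a one-year one.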

EDIT: I just noticed that the offending WU's are HADCM3N's rather than HADAM3P's -- guess this is the wrong thread... :-[
ID: 42820
Les Bayliss
Volunteer moderator

Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 42821 - Posted: 27 Aug 2011, 4:48:16 UTC - in response to Message 42820.  

The problem with long "due dates" is that people treated this as "The project's not in a hurry to get any results".

Except that each of the research groups does want results fairly quickly, as the next lot of work is dependent on what's currently running.

And with the tight deadline for completion of the RAPIT project (it may even be past it, due to problems getting out the bugs at the start of the year), there's just no time to "be nice" to single core/multi project crunchers.

Welcome to the New Look cpdn.


ID: 42821
w1hue

Joined: 31 Aug 05
Posts: 20
Credit: 1,969,695
RAC: 0
Message 42822 - Posted: 27 Aug 2011, 4:52:22 UTC - in response to Message 42821.  

> And with the tight deadline for completion of the RAPIT project (it may even be past it, due to problems getting out the bugs at the start of the year), there's just no time to "be nice" to single core/multi project crunchers.


If that is the project's attitude, then I'll just donate my puny little computers' time elsewhere... :-(

ID: 42822
Iain Inglis
Volunteer moderator

Joined: 16 Jan 10
Posts: 1084
Credit: 7,842,730
RAC: 5,006
Message 42823 - Posted: 27 Aug 2011, 13:01:36 UTC - in response to Message 42822.  

> > And with the tight deadline for completion of the RAPIT project (it may even be past it, due to problems getting out the bugs at the start of the year), there's just no time to "be nice" to single core/multi project crunchers.
>
> If that is the project's attitude, then I'll just donate my puny little computers' time elsewhere... :-(

It isn't the project's attitude. CPDN models are usually a pretty relaxed affair, with results continuing to be accepted beyond any reasonable deadline. However, as Les says, the RAPIT sub-project is different - they do have a more constrained timeline. If that's a problem then it is always possible to deselect the HADCM3N models from your project preferences and select another model type.

I take the use of CPDN by other research groups as a credit to the project team and also to the prodigious efforts of the volunteers. It shouldn't be a surprise if different teams have different objectives - and we volunteers may need to adjust our contributions accordingly.
ID: 42823
w1hue

Joined: 31 Aug 05
Posts: 20
Credit: 1,969,695
RAC: 0
Message 42825 - Posted: 27 Aug 2011, 17:03:43 UTC - in response to Message 42823.  

> If that's a problem then it is always possible to deselect the HADCM3N models from your project preferences and select another model type.

Good point -- I'll deselect HADCM3N. But not much else seems to be available lately...

ID: 42825
JIM

Joined: 31 Dec 07
Posts: 1152
Credit: 22,363,583
RAC: 5,022
Message 42826 - Posted: 28 Aug 2011, 0:15:30 UTC - in response to Message 42825.  

> Good point -- I'll deselect HADCM3N. But not much else seems to be available lately.

HADAM3P models are still being produced in limited batches. They seem to be released at irregular intervals. The problem is that they go fast. It is not like in the past when slab models were available in seemingly endless numbers. If you keep connected 24/7 you should get 1 or 2 in a few days. Just make sure that your work buffer is set for 10 days.
ID: 42826