climateprediction.net (CPDN) home page
Thread 'Any idea when there will be new work?'


Message boards : Number crunching : Any idea when there will be new work?
MarkJ
Joined: 28 Mar 09
Posts: 126
Credit: 9,825,980
RAC: 0
Message 42827 - Posted: 28 Aug 2011, 3:52:17 UTC

Well, we're out of work again. According to the latest Server Status, there are about 275,000 out in the field and nothing to send.

Any idea when the next batch of work will be created and, if so, what type of WU?

I managed to pick up a resend of a HADAM3P_EU, but that's all I have managed to get.

Thanks,
MarkJ
ID: 42827
Les Bayliss
Volunteer moderator

Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 42829 - Posted: 28 Aug 2011, 8:27:27 UTC - in response to Message 42827.  

I've made a News item about this.


Backups: Here
ID: 42829
MarkJ
Joined: 28 Mar 09
Posts: 126
Credit: 9,825,980
RAC: 0
Message 42830 - Posted: 28 Aug 2011, 11:16:41 UTC
Last modified: 28 Aug 2011, 11:19:19 UTC

Thanks for the reply Les.

Do we know if they will wait for all 275,000 WU to finish before they produce another set? Presumably they will have an idea of how they are progressing, especially the RAPIT ones, seeing as they have a shorter deadline, to decide whether another set is required.
ID: 42830
Les Bayliss
Volunteer moderator

Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 42831 - Posted: 28 Aug 2011, 21:34:54 UTC - in response to Message 42830.  

There are 2 lots of info on the RAPIT experiment:
RAPID-RAPIT: What is the risk of thermohaline circulation collapse?
Welcome to the RAPIT experiment!

Information about the 3 Regional models is scattered a bit over the 2 boards.

No other information is available or expected.


Backups: Here
ID: 42831
JIM
Joined: 31 Dec 07
Posts: 1152
Credit: 22,363,583
RAC: 5,022
Message 42834 - Posted: 30 Aug 2011, 4:54:09 UTC - in response to Message 42830.  

Dear MarkJ

Most of those 275,000 WU are likely lost units that died in ways the server never heard about (like a hard disk crash or someone uninstalling BOINC with a running WU), and the server has just not given up on them yet.

It's going to be a long time before most of those WUs call home.

ID: 42834
DJStarfox

Joined: 27 Jan 07
Posts: 300
Credit: 3,288,263
RAC: 26,370
Message 42844 - Posted: 1 Sep 2011, 20:20:06 UTC

I seem to recall a script the project admins ran that detected idle workunits. If there is a workunit with "In Progress" tasks, it checks that at least one trickle has been received in the last 6 weeks or so. If not, they could cause those tasks to abort and resend.

I think running such a query this weekend would be insightful.
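
As a rough illustration of the check described above (not the admins' actual script, which isn't shown in this thread), here is a minimal Python sketch. The task records, field names (`state`, `sent`, `last_trickle`) and the fallback to the send date for tasks that never trickled are all assumptions made for the example; only the "In Progress" state and the roughly 6-week window come from the post.

```python
from datetime import datetime, timedelta

# "6 weeks or so" from the post; the exact cutoff is an assumption.
IDLE_CUTOFF = timedelta(weeks=6)


def find_idle_tasks(tasks, now, cutoff=IDLE_CUTOFF):
    """Return IDs of in-progress tasks with no trickle within `cutoff`.

    `tasks` is a list of dicts with hypothetical keys:
    id, state, sent (datetime), last_trickle (datetime or None).
    """
    idle = []
    for task in tasks:
        if task["state"] != "In Progress":
            continue  # finished or failed tasks are not candidates
        # Assumption: a task that never trickled is measured from its send date.
        reference = task["last_trickle"] or task["sent"]
        if now - reference > cutoff:
            idle.append(task["id"])
    return idle


if __name__ == "__main__":
    now = datetime(2011, 9, 1)
    sample = [
        {"id": 1, "state": "In Progress", "sent": datetime(2011, 5, 1),
         "last_trickle": datetime(2011, 8, 20)},
        {"id": 2, "state": "In Progress", "sent": datetime(2011, 4, 1),
         "last_trickle": datetime(2011, 6, 1)},
        {"id": 3, "state": "In Progress", "sent": datetime(2011, 3, 1),
         "last_trickle": None},
        {"id": 4, "state": "Completed", "sent": datetime(2010, 12, 1),
         "last_trickle": datetime(2011, 1, 1)},
    ]
    print(find_idle_tasks(sample, now))
```

The tasks this returns would be the candidates to abort and resend; how that is actually done on the server side is not covered here.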
ID: 42844
Eirik Redd

Joined: 31 Aug 04
Posts: 391
Credit: 219,896,461
RAC: 649
Message 42847 - Posted: 2 Sep 2011, 4:39:19 UTC
Last modified: 2 Sep 2011, 5:38:21 UTC

Right now, there's about a thousand hadcm3n available for download.
That changes every hour.
I'm only guessing, but I believe that the project admins are not using the BOINC infrastructure all that much -- not for timeout, reissue, verified -- what I guess is that when they get enough results for the models that interest the researchers, they issue another batch with parameters that will refine those results. Or, with hadcm3n, when there are enough results from one stage, they issue wu for the next stage -- or reissue the lost ones.
The BOINC infrastructure is not real well suited to the climateprediction project, but it's what's available, so, as has been posted here many times, some of the BOINC concepts just don't apply. CPDN has such __long-running__ models.

That 275,000 models thing is a BOINC artifact -- don't give it a moment's consideration.

Right now, the longest hadcm3n models take a month or so.

Me, a geezer, years back, ran a few that took 4 months.

I believe that the tech crew are doing a good job issuing high-priority wu as needed by the science people. It's just that it takes a long time to know when wu aren't going to finish.

Keep on crunching.

Eric
ID: 42847
MarkJ
Joined: 28 Mar 09
Posts: 126
Credit: 9,825,980
RAC: 0
Message 42850 - Posted: 2 Sep 2011, 13:46:57 UTC

It seems there has been some work generated since this message thread started.

I managed to pick up a bunch of regional models on 1 machine, and there are also 126 HADCM3Ns ready to go. Once I knock over the regional work units I'll see if I can grab a couple of them before they all disappear.
ID: 42850


©2024 cpdn.org