climateprediction.net (CPDN) home page
Thread 'Missed daily trickle?'

Thread 'Missed daily trickle?'

Message boards : Number crunching : Missed daily trickle?
Message board moderation

To post messages, you must log in.

1 · 2 · Next

AuthorMessage
BarryAZ

Send message
Joined: 13 Jul 05
Posts: 125
Credit: 11,778,421
RAC: 0
Message 30573 - Posted: 20 Sep 2007, 7:01:26 UTC

For the first time in a very long time, it looks like the daily trickle credits were missed. Didn\'t see any announcement (though typically in Climate that sort of thing, if it does appear shows up buried in a thread somewhere it seems).

Does anyone from the project side know what happened -- and if it is a harbinger of multiple days of missed trickle credits?

ID: 30573 · Report as offensive     Reply Quote
ProfileIain Inglis

Send message
Joined: 9 Jan 07
Posts: 467
Credit: 14,549,176
RAC: 317
Message 30575 - Posted: 20 Sep 2007, 9:41:59 UTC

If an event is known about it\'ll be added to the news thread here. It\'s a good idea to subscribe to that thread.

However, the failure of a trickle run is not normally a planned event and news of it will emerge as people spot something awry, as you have ...
ID: 30575 · Report as offensive     Reply Quote
Profilemo.v
Volunteer moderator
Avatar

Send message
Joined: 29 Sep 04
Posts: 2363
Credit: 14,611,758
RAC: 0
Message 30578 - Posted: 20 Sep 2007, 12:08:21 UTC

Yesterday 19 Sep the server web pages for cpdn were down again for a few hours. That\'s the \'data-driven web pages\' server. The boinc software was upgraded last Friday for all 4 cpdn projects and the job still isn\'t finished on the BBC project. However, uploads, downloads and trickles have not been affected by the upgrade.

I imagine that when the cpdn web pages server catches up on the backlog, you\'ll get the credits. In any case, every time a trickle is received, the server recalculates the credits for that model from when it began. This method should ensure that every trickle received gets its credits.

So I don\'t think your credits will have gone missing for ever.
Cpdn news
ID: 30578 · Report as offensive     Reply Quote
BarryAZ

Send message
Joined: 13 Jul 05
Posts: 125
Credit: 11,778,421
RAC: 0
Message 30583 - Posted: 20 Sep 2007, 17:44:34 UTC - in response to Message 30575.  

OK -- I\'ve subscribed to that thread. I figured to report what I observed in the message boards -- looks like that worked...


If an event is known about it\'ll be added to the news thread here. It\'s a good idea to subscribe to that thread.

However, the failure of a trickle run is not normally a planned event and news of it will emerge as people spot something awry, as you have ...


ID: 30583 · Report as offensive     Reply Quote
BarryAZ

Send message
Joined: 13 Jul 05
Posts: 125
Credit: 11,778,421
RAC: 0
Message 30584 - Posted: 20 Sep 2007, 17:46:44 UTC - in response to Message 30578.  

Thanks for the update. I figured the credits hadn\'t got lost, just that the updates were not reflected in the daily update to the statistics pages. I will be on the lookout this evening for the updates.... and will whine here if they don\'t show the updated trickle information (smile).


Yesterday 19 Sep the server web pages for cpdn were down again for a few hours. That\'s the \'data-driven web pages\' server. The boinc software was upgraded last Friday for all 4 cpdn projects and the job still isn\'t finished on the BBC project. However, uploads, downloads and trickles have not been affected by the upgrade.

I imagine that when the cpdn web pages server catches up on the backlog, you\'ll get the credits. In any case, every time a trickle is received, the server recalculates the credits for that model from when it began. This method should ensure that every trickle received gets its credits.

So I don\'t think your credits will have gone missing for ever.


ID: 30584 · Report as offensive     Reply Quote
BarryAZ

Send message
Joined: 13 Jul 05
Posts: 125
Credit: 11,778,421
RAC: 0
Message 30595 - Posted: 21 Sep 2007, 5:49:22 UTC

Looks like, notwithstanding that the outage was only a couple of hours yesterday, data the trickle update statistics outage is still in effect for a second day.

See, I told you I\'d be back this evening to whine. (smile).

ID: 30595 · Report as offensive     Reply Quote
old_user201554

Send message
Joined: 3 Oct 06
Posts: 12
Credit: 572,668
RAC: 0
Message 30596 - Posted: 21 Sep 2007, 7:58:38 UTC - in response to Message 30595.  

Looks like, notwithstanding that the outage was only a couple of hours yesterday, data the trickle update statistics outage is still in effect for a second day.

See, I told you I\'d be back this evening to whine. (smile).


And hey I have 3 kinds of cheese too!.. (VBG)

Anybody have decent crackers?

Seriously though, having been through this before I don\'t mind getting a bunch of credits in a lump sum. Just so long as the data doesn\'t get lost.

Arion

ID: 30596 · Report as offensive     Reply Quote
Profileold_user472680
Avatar

Send message
Joined: 14 Sep 07
Posts: 2
Credit: 263,586
RAC: 0
Message 30597 - Posted: 21 Sep 2007, 8:19:05 UTC - in response to Message 30596.  

i confirm my credits have not been updated for 2 days either.
ID: 30597 · Report as offensive     Reply Quote
Profilemo.v
Volunteer moderator
Avatar

Send message
Joined: 29 Sep 04
Posts: 2363
Credit: 14,611,758
RAC: 0
Message 30599 - Posted: 21 Sep 2007, 11:21:17 UTC
Last modified: 21 Sep 2007, 11:29:59 UTC

Brazzmonkey, your HADCM model hasn\'t produced enough trickles to check that all of them are there on your webpage, but your slab HADSM model

http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=6825916
shows all its trickles correctly recorded.

On Wednesday 19 Sep, the only server down was the one for data-driven web pages. This is the server that provides this forum and lets us see our account and results pages. Cpdn uploads, downloads and trickles were not affected at any time as far as I know.

Milo is still working on the servers. The BBC data-driven web pages server has been down for a whole week.

There is usually a delay of one day and sometimes two days between results (trickles and file uploads) being received by the cpdn server, the credits being awarded, and the stats sites seeing these credits and displaying them.

Nobody\'s trickles should be missing on their results pages. And as I said before, every time the server receives a trickle, it recalculates ALL the credits for ALL the trickles received for that model from when the model started. So if all your trickles are recorded, you will get all your credits.

We have (very very rarely) seen cases where trickles or file uploads have apparently been lost. But I don\'t think we\'ve ever seen a case where trickles received by the server have not generated the correct number of credits.

So we will need to be patient.

Barry, I couldn\'t check your current model trickles because there are so many computers and so many models..... And I couldn\'t whether your recent trickles are all present and correct, Arion, because your computers are hidden.

I\'ve been copying my News thread posts about server outages to Neil Munday\'s stats site forum and to the project news section of Willy\'s BoincStats forum.
Cpdn news
ID: 30599 · Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 1 Jan 07
Posts: 1061
Credit: 36,716,561
RAC: 8,355
Message 30600 - Posted: 21 Sep 2007, 11:32:18 UTC - in response to Message 30599.  
Last modified: 21 Sep 2007, 11:32:42 UTC

Mo,

I monitor and log activity on the single host I have connected to CPDN (host 518112).

I can confirm that all appropriate trickles have been uploaded and are correctly shown on the results pages - no problem there.

However, the BOINC credits are now two days out of date - the translation process from trickles to credits (which normally runs once every 24 hours, at around 4am BST) has failed to run on each of the last two nights. I\'m not worried - the science is there, and the credits will catch up eventually - but it is evidence that at least one CPDN server process has stalled, and perhaps needs attention alongside Milo\'s work on the BBC servers.
ID: 30600 · Report as offensive     Reply Quote
Profileold_user472680
Avatar

Send message
Joined: 14 Sep 07
Posts: 2
Credit: 263,586
RAC: 0
Message 30602 - Posted: 21 Sep 2007, 11:54:38 UTC - in response to Message 30599.  

@mo.v
thanks for making things clear. i don\'t care about credits anyhow...
ID: 30602 · Report as offensive     Reply Quote
Profileold_user211396
Avatar

Send message
Joined: 2 Dec 06
Posts: 3
Credit: 894,841
RAC: 0
Message 30603 - Posted: 21 Sep 2007, 11:57:48 UTC - in response to Message 30600.  

Mo,

I monitor and log activity on the single host I have connected to CPDN (host 518112).

I can confirm that all appropriate trickles have been uploaded and are correctly shown on the results pages - no problem there.

However, the BOINC credits are now two days out of date - the translation process from trickles to credits (which normally runs once every 24 hours, at around 4am BST) has failed to run on each of the last two nights. I\'m not worried - the science is there, and the credits will catch up eventually - but it is evidence that at least one CPDN server process has stalled, and perhaps needs attention alongside Milo\'s work on the BBC servers.



Hi,

I have the same effects on my host (host 506953). Five trickles and no credits ???

jan
ID: 30603 · Report as offensive     Reply Quote
Profilemo.v
Volunteer moderator
Avatar

Send message
Joined: 29 Sep 04
Posts: 2363
Credit: 14,611,758
RAC: 0
Message 30607 - Posted: 21 Sep 2007, 14:34:02 UTC

I\'ve just sent Milo a message. He\'s just now got the BBC data server and forum up and running again, so if something else is holding cpdn credits up, it has probably had to wait until the BBC crunchers got their results pages and forum back.

I\'ll post back as soon as possible.
Cpdn news
ID: 30607 · Report as offensive     Reply Quote
Profilemo.v
Volunteer moderator
Avatar

Send message
Joined: 29 Sep 04
Posts: 2363
Credit: 14,611,758
RAC: 0
Message 30613 - Posted: 21 Sep 2007, 16:42:22 UTC

I\'ve just posted this in the News thread (top of the Number Crunching section):

All ClimatePrediction servers for all projects are now running again.

However, Milo says there was a problem with the database yesterday 20 Sep - a mysterious crash the cause of which could not be determined. This was entirely separate from the outage to upgrade the web pages, forums &c. It is likely that the credit generation script didn\'t run for that reason.

Credits should be up again by tomorrow; the script will run overnight. It may be a bit slow to catch up, as usual after an outage. So members may not receive their outstanding credits until Sunday or Monday.

A new file server will be installed in Oxford soon, probably within a week or two, easing the pressure on disk space and of course on Milo and Tolu.
Cpdn news
ID: 30613 · Report as offensive     Reply Quote
BarryAZ

Send message
Joined: 13 Jul 05
Posts: 125
Credit: 11,778,421
RAC: 0
Message 30614 - Posted: 21 Sep 2007, 16:43:00 UTC - in response to Message 30607.  

OK -- for me, the \'non-posting\' was pretty obvious -- I\'d been running between 2K and 3K daily and then when I checked on Wednesday -- 0. I temporarily suspended climate on *some* of my farm of workstations, and when I checked on Thursday, there was still no change from Tuesday. What I have done since then as sort of a \'test case\' is \'unsuspend\' a few workstations (did that yesterday afternoon). I figure to wait to see a change in the credits (typically the daily update shows up in my time zone around 10PM). Once I see it is \'moving\' again, I\'ll bring my other Climate PC\'s back into the fray. During the interim, SETI, Spinhenge, Rosetta, World Grid, and Einstein get the extra cycles.



I\'ve just sent Milo a message. He\'s just now got the BBC data server and forum up and running again, so if something else is holding cpdn credits up, it has probably had to wait until the BBC crunchers got their results pages and forum back.

I\'ll post back as soon as possible.


ID: 30614 · Report as offensive     Reply Quote
Profileold_user211396
Avatar

Send message
Joined: 2 Dec 06
Posts: 3
Credit: 894,841
RAC: 0
Message 30628 - Posted: 21 Sep 2007, 23:02:07 UTC - in response to Message 30613.  

I\'ve just posted this in the News thread (top of the Number Crunching section):

All ClimatePrediction servers for all projects are now running again.

However, Milo says there was a problem with the database yesterday 20 Sep - a mysterious crash the cause of which could not be determined. This was entirely separate from the outage to upgrade the web pages, forums &c. It is likely that the credit generation script didn\'t run for that reason.

Credits should be up again by tomorrow; the script will run overnight. It may be a bit slow to catch up, as usual after an outage. So members may not receive their outstanding credits until Sunday or Monday.

A new file server will be installed in Oxford soon, probably within a week or two, easing the pressure on disk space and of course on Milo and Tolu.


Thanks mo and milo for your engagement.

jan
ID: 30628 · Report as offensive     Reply Quote
old_user201554

Send message
Joined: 3 Oct 06
Posts: 12
Credit: 572,668
RAC: 0
Message 30634 - Posted: 22 Sep 2007, 5:11:39 UTC - in response to Message 30599.  

Barry, I couldn\'t check your current model trickles because there are so many computers and so many models..... And I couldn\'t whether your recent trickles are all present and correct, Arion, because your computers are hidden.



I\'m not worried about my credits and I\'ve seen my trickles going up. As for my computers being hidden I have to do that right now because of some testing I\'m doing. I\'ll be releasing them as soon as I am able to. We went through this not long ago for about a week or so and got all our credits over a 2 day period. I\'m not concerned as long as the data is okay.

Arion

ID: 30634 · Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 1 Jan 07
Posts: 1061
Credit: 36,716,561
RAC: 8,355
Message 30636 - Posted: 22 Sep 2007, 8:54:13 UTC

Trickle-to-credit script seems to have run correctly this morning. Thanks, Milo.
ID: 30636 · Report as offensive     Reply Quote
Profilemo.v
Volunteer moderator
Avatar

Send message
Joined: 29 Sep 04
Posts: 2363
Credit: 14,611,758
RAC: 0
Message 30648 - Posted: 22 Sep 2007, 15:46:02 UTC
Last modified: 22 Sep 2007, 15:47:18 UTC

It was fortunate that you guys all reported your lack of recent credits because it was only when I told Milo about this that he realised the credits script must have been affected by the server crash. That\'s why none of this was explained in the news thread earlier - nobody had realised what had happened.


Cpdn news
ID: 30648 · Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 1 Jan 07
Posts: 1061
Credit: 36,716,561
RAC: 8,355
Message 30649 - Posted: 22 Sep 2007, 16:26:31 UTC - in response to Message 30648.  

It was fortunate that you guys all reported your lack of recent credits because it was only when I told Milo about this that he realised the credits script must have been affected by the server crash. That\'s why none of this was explained in the news thread earlier - nobody had realised what had happened.

In a discussion about a similar but unrelated problem in another project, I coined the phrase \"distributed wisdom\", as being something that project administrators get for free when they set up a \'distributed computing\' project. Sometimes we users can spot a problem quite early in its formation, and even analyse quite closely which part of the system needs attention. In the best of all possible worlds, that analysis happens by discussion among users in a different time-zone, and the solution is available on a plate before the project administrators even wake up in their morning!

It\'s up to project administrators how much advantage they want to take of the available \'distributed wisdom\'. Sometimes the signal-to-noise ratio is rather low: as Milo already knows, reading every thread in every bulletin board would waste more time than it saves.

CPDN is fortunate in that the forum moderators take an active interest in the technical aspects of the project, and have a direct line of communication with the administrative staff. Other project moderators are sometimes more involved with the \'social\' aspects of forum management, and don\'t always have such a close working relationship with the technical staff. Then, technical glitches don\'t always get attended to in as timely a fashion as this one has been.
ID: 30649 · Report as offensive     Reply Quote
1 · 2 · Next

Message boards : Number crunching : Missed daily trickle?

©2024 cpdn.org