Thread 'When my model crashes, what is on server?'

old_user92466

Joined: 12 Aug 05
Posts: 9
Credit: 1,164,760
RAC: 0
Message 25660 - Posted: 19 Dec 2006, 13:42:19 UTC

My model crashed. I then restored it from a backup and continued crunching. But the server still shows the result in an error state, even though points are being granted to me.
http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=6106654

Can this be changed, and if so, how?
MikeMarsUK
Volunteer moderator
Joined: 13 Jan 06
Posts: 1498
Credit: 15,613,038
RAC: 0
Message 25664 - Posted: 19 Dec 2006, 15:51:30 UTC


Hi,

The first reported error will stick on the website, but is ignored by the scientists, who are only interested in the uploaded data. So you can ignore it.
I'm a volunteer and my views are my own.
old_user92466

Joined: 12 Aug 05
Posts: 9
Credit: 1,164,760
RAC: 0
Message 25778 - Posted: 30 Dec 2006, 12:22:41 UTC - in response to Message 25664.  


Hi,

The first reported error will stick on the website, but is ignored by the scientists, who are only interested in the uploaded data. So you can ignore it.


Hmm. I've got one thought about it:
When my model crashes, I restore it from backup. If the computer is online (as most of my computers are, since they are in an office with a LAN), it tells the server that the model crashed and downloads a new model. In the morning I come to work and restore the model from the backup that I make every morning when I arrive. The server then gives this model to someone else, with the last number in its name increased by one. Likewise, I can receive a model whose name ends in a number other than 0, which may mean that someone's model crashed and was reported to the server, and that they then restored it from backup and continued crunching regardless.
So what is the idea behind several people crunching the same model simultaneously?

Is there any mechanism that catches this when it happens?

And one more question: as I understand it, if there have been no trickles for some days (45, AFAIK), the results are not accepted by the server. How much time has to elapse before a crashed model is reissued to a new person? Is it that same 45-day period?
astroWX
Volunteer moderator

Joined: 5 Aug 04
Posts: 1496
Credit: 95,522,203
RAC: 0
Message 25779 - Posted: 30 Dec 2006, 18:39:15 UTC

The way to get around that is to set "No new tasks" in the Projects tab and "Network activity suspended" in the Activity menu.

Servers do not refuse data unless the database is full or otherwise disabled. If they receive duplicates from a computer because it is running from a backup, no additional credit is awarded, and that continues until the restored Run passes the last previously received Trickle.

If duplication occurs because of the circumstance you describe, both Runs will be honored and the science team will compare them. That said, I suspect the situation seldom comes up; people who restore typically do it in short order -- and the server waits before reissue.
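
To illustrate the duplicate-Trickle rule described above, here is a minimal Python sketch. It is not the CPDN server code: the function name `credit_for_trickles`, the use of integer timestep numbers, and the flat `credit_per_trickle` value are all assumptions made for illustration.

```python
def credit_for_trickles(received, credit_per_trickle=1.0):
    """Return the total credit for a sequence of reported trickle timesteps.

    Hypothetical model: a trickle earns credit only if its timestep is
    beyond the highest timestep already credited, so trickles re-sent
    after restoring a backup earn nothing until the run passes that point.
    """
    highest_credited = 0
    total = 0.0
    for timestep in received:
        if timestep > highest_credited:
            total += credit_per_trickle
            highest_credited = timestep
        # Otherwise this is a duplicate from a restored run: accepted, no credit.
    return total


# Trickles 1..5 are sent, the model crashes, and a backup taken after
# timestep 2 is restored, so 3..5 are sent again before 6 and 7 arrive.
reports = [1, 2, 3, 4, 5, 3, 4, 5, 6, 7]
print(credit_for_trickles(reports))  # prints 7.0
```

In this example the three re-sent trickles earn nothing, and credit resumes once the restored run passes timestep 5.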

"We have met the enemy and he is us." -- Pogo
Greetings from coastal Washington state, the scenic US Pacific Northwest.
MikeMarsUK
Volunteer moderator
Joined: 13 Jan 06
Posts: 1498
Credit: 15,613,038
RAC: 0
Message 25789 - Posted: 31 Dec 2006, 12:27:01 UTC


A small number of duplicates is potentially quite useful for comparing results between different platforms. Too many would be a waste, of course. If there were none, the researchers would probably need to deliberately issue a few in order to do the comparisons.
I'm a volunteer and my views are my own.

