climateprediction.net (CPDN) home page
Thread 'HADSM3 V4.13 Models resetting back to 0% - Bye Bye CPDN'

Thread 'HADSM3 V4.13 Models resetting back to 0% - Bye Bye CPDN'

Message boards : Number crunching : HADSM3 V4.13 Models resetting back to 0% - Bye Bye CPDN
Message board moderation

To post messages, you must log in.

AuthorMessage
old_user3434
Avatar

Send message
Joined: 30 Aug 04
Posts: 77
Credit: 1,785,934
RAC: 0
Message 20490 - Posted: 19 Feb 2006, 14:08:19 UTC

*ugh*

I just lost (again) another ~500h CPU time, as I witnessed one of my machines suddenly showing its progress at 0.x % again...
Model : http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=812546

This wasn\'t the first time either, quite a number of Models took a 2nd start for no reason, causing enourmous amounts of CPU time to go down to the dumps :(

(BOINC V5.2.13)

Well, what can I say. That\'s it, I\'ve now set all Systems (that still do have Models \"attempting\" to finish) to deplete work; after they\'re done I\'ll unattach from CPDN.

Losing a bit here and there due to Software Bugs is not problematic, but CPDN has so far wasted no less than ten thousands of hours of CPU time on me.
I simply can\'t afford that to happen, in these numbers we begin talking about serious amounts of money keeping those machines in operation, which I won\'t accept seeing wasted to poor Software.
Scientific Network : 44800 MHz - 77824 MB - 1970 GB
ID: 20490 · Report as offensive     Reply Quote
ProfileastroWX
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1496
Credit: 95,522,203
RAC: 0
Message 20498 - Posted: 19 Feb 2006, 17:18:48 UTC

Overclocked machines?

With all that trouble, no backups?

You\'ve done a lot of work for the project; thanks for that. Sorry to lose you.
"We have met the enemy and he is us." -- Pogo
Greetings from coastal Washington state, the scenic US Pacific Northwest.
ID: 20498 · Report as offensive     Reply Quote
old_user3434
Avatar

Send message
Joined: 30 Aug 04
Posts: 77
Credit: 1,785,934
RAC: 0
Message 20513 - Posted: 19 Feb 2006, 21:18:37 UTC - in response to Message 20498.  
Last modified: 19 Feb 2006, 21:25:18 UTC

I hate to leave like that as well, since I always liked CPDN alot. Everybody around here has always been very friendly and helpful.

Backups on 24 machines \"on the fly\" are impossible without interfering with the running BOINC Clients (which reportedly can hate access to their Directory while running) are impossible I\'m afraid. And technically, it shouldn\'t be needed (except to protect against Harddrive failure or alike)

None of the machines are overclocked, all on stable performance settings and work 100% fine on all other Projects. With all the reports about CPDN bugs (and recent BBC CPDN Bugs as well), I\'m afraid it\'s in the Software with no solution I could go for to fix it.

If it wasn\'t for the tremendous amounts of lost CPU time, I would ignore it but with the inherent Model runtimes, it\'s just too much to justify :(

PS.
I\'ll periodically check the Progress around CPDN Client development and sure will be back, as soon as the Problems are solved. It\'s too good of a project to let go completely...
Scientific Network : 44800 MHz - 77824 MB - 1970 GB
ID: 20513 · Report as offensive     Reply Quote
old_user1098
Avatar

Send message
Joined: 25 Aug 04
Posts: 41
Credit: 34,555
RAC: 0
Message 22035 - Posted: 14 Apr 2006, 12:20:14 UTC
Last modified: 14 Apr 2006, 12:22:58 UTC

Something like that just happened to me.... I had just reattached, downloaded a BBC model v. 5.08, had got over 1% and suddenly this morning it went back to 0.64%. Pretty disappointed, I must say.... ;( No overclocking whatsoever, running on an HT machine. When I finish this WU (still 2848 hours to go!) unless it fails first, I think I will detach again..... as much as I like CPDN... :(
<br>***********************************<br>
WinXP Home, P4 3.2GHz HT, 512Mb RAM. Running SETI, CPDN, Predictor, LHC, Einstein, Orbit, Lattice and Folding
ID: 22035 · Report as offensive     Reply Quote
ProfileMikeMarsUK
Volunteer moderator
Avatar

Send message
Joined: 13 Jan 06
Posts: 1498
Credit: 15,613,038
RAC: 0
Message 22036 - Posted: 14 Apr 2006, 12:50:49 UTC

I think someone wrote an automated backup for the BBC model (RRodney on the BBC boards), you could try that?

http://bbc.cpdn.org/forum_thread.php?id=2748
I'm a volunteer and my views are my own.
News and Announcements and FAQ
ID: 22036 · Report as offensive     Reply Quote

Message boards : Number crunching : HADSM3 V4.13 Models resetting back to 0% - Bye Bye CPDN

©2024 cpdn.org