Message boards : Number crunching : Schedulers down too now..?
Message board moderation
Author | Message |
---|---|
Send message Joined: 5 Aug 04 Posts: 186 Credit: 1,612,182 RAC: 0 |
"Master File Fetch Failed" for all of my machines right now - looks like the CP scheduler went down at about 6:00am. I've disabled network access on all my machines for the moment, so that they don't back off too far... <a href="http://www.nmvs.dsl.pipex.com/"><img src="http://boinc.mundayweb.com/cpdn/stats.php?userID=6&team=off&trans=off"></a> <a href="http://www.nmvs.dsl.pipex.com/">Distributed Mania</a> |
Send message Joined: 20 Aug 04 Posts: 10 Credit: 132,163 RAC: 0 |
Same here ... > "Master File Fetch Failed" for all of my machines right now - looks like the > CP scheduler went down at about 6:00am. I've disabled network access on all my > machines for the moment, so that they don't back off too far... ALL GLORY TO THE HYPNOTOAD! Potrebujete pomoc? My Stats |
Send message Joined: 10 Aug 04 Posts: 94 Credit: 309,849 RAC: 0 |
Ditto, just in time for a ICE BALL to jam up Darwin. Lovely. <img src="http://boinc.mundayweb.com/cpdn/stats.php?userID=35&trans=off"> |
Send message Joined: 5 Aug 04 Posts: 390 Credit: 2,475,242 RAC: 0 |
My machines are not always lucky on 1st try with scheduler but second try will eventually go through.... |
Send message Joined: 5 Aug 04 Posts: 907 Credit: 299,864 RAC: 0 |
The schedulers seem to be up now, new users will want to attach directly to: http://climateapps2.oucs.ox.ac.uk/cpdnboinc (instead of the usual climateprediction.net of course) |
Send message Joined: 13 Sep 04 Posts: 161 Credit: 284,548 RAC: 0 |
> The schedulers seem to be up now, new users will want to attach directly to: > > http://climateapps2.oucs.ox.ac.uk/cpdnboinc > > (instead of the usual climateprediction.net of course) > > > I've now got ..master file parse failed....could not contact any schedulers.... and communication deferred for 14+ HOURS. ..... only now its gone up to 1 DAY 17 HRS+ Marj :((( _________________________________ |
Send message Joined: 5 Aug 04 Posts: 390 Credit: 2,475,242 RAC: 0 |
Checked with three machines there - everything's went fine on first try... > I've now got ..master file parse failed....could not contact any > schedulers.... and communication deferred for 14+ HOURS. > > ..... only now its gone up to 1 DAY 17 HRS+ > > Marj :((( > |
Send message Joined: 13 Sep 04 Posts: 161 Credit: 284,548 RAC: 0 |
> Checked with three machines there - everything's went fine on first try... > > I daren't check again it might go up even more!!! _________________________________ |
Send message Joined: 5 Aug 04 Posts: 186 Credit: 1,612,182 RAC: 0 |
I think there's some sort of router problem on 'Janet' - I see 'address unreachable' with a ping plotter traceroute & '100% loss' for a ping. Most of my machines managed to get through, including one final result upload, but two still cannot. I'm now seeing "Master File Parse Failed" on all machines... <a href="http://www.nmvs.dsl.pipex.com/"><img src="http://boinc.mundayweb.com/cpdn/stats.php?userID=6&team=off&trans=off"></a> <a href="http://www.nmvs.dsl.pipex.com/">Distributed Mania</a> |
Send message Joined: 5 Aug 04 Posts: 186 Credit: 1,612,182 RAC: 0 |
> > Checked with three machines there - everything's went fine on first > try... > > > I daren't check again it might go up even more!!! > Don't worry Marj, you can still force a manual update whenever they come back online properley & that resets the delay back to one minute. I've disabled all my machine's network access again for now though. <a href="http://www.nmvs.dsl.pipex.com/"><img src="http://boinc.mundayweb.com/cpdn/stats.php?userID=6&team=off&trans=off"></a> <a href="http://www.nmvs.dsl.pipex.com/">Distributed Mania</a> |
Send message Joined: 13 Sep 04 Posts: 161 Credit: 284,548 RAC: 0 |
> Don't worry Marj, you can still force a manual update whenever they come back > online properley & that resets the delay back to one minute. I've disabled > all my machine's network access again for now though. > I've got Einstein running as well and thats happily crunching away - if I stop access for cpdn it seems to stop both and they're only short WUs. I've changed the preferences so cpdn is only doing 1/5 hrs as its only got 20 hrs left on it. It keeps trying to get more work so if this model finishes before its fixed it won't be doing anything anyway. (I must say when you've only got a slow machine -it takes me 40ish days/model, it's amazing how paranoid you get about the final upload!) Marj _________________________________ |
Send message Joined: 5 Aug 04 Posts: 186 Credit: 1,612,182 RAC: 0 |
> I've got Einstein running as well and thats happily crunching away - if I stop > access for cpdn it seems to stop both and they're only short WUs. I've changed > the preferences so cpdn is only doing 1/5 hrs as its only got 20 hrs left on > it. It keeps trying to get more work so if this model finishes before its > fixed it won't be doing anything anyway. Luckily the machine that uploaded her final results earlier today already had a new model to crunch on - 'Jana', due to upload in 24 hours, already has a new model too - since I'm only crunching CP-boinc at the moment, I increased my queue to 2 days so a server outage <i>ought</i> to be fixed before they run out of work. > (I must say when you've only got a slow machine -it takes me 40ish days/model, > it's amazing how paranoid you get about the final upload!) Final results upload seemed to go okay for 'Alison' but the model still shows as 'ready to report' in the BOINC GUI - I guess that won't clear until she can get through to a scheduler. <a href="http://www.nmvs.dsl.pipex.com/"><img src="http://boinc.mundayweb.com/cpdn/stats.php?userID=6&team=off&trans=off"></a> <a href="http://www.nmvs.dsl.pipex.com/">Distributed Mania</a> |
Send message Joined: 30 Aug 04 Posts: 7 Credit: 1,554,414 RAC: 0 |
You've said that we can connect to the server at http://climateapps2.oucs.ox.ac.uk/cpdnboinc instead of the climatepredition.net. Can this be done while crunching or will it cause problems with the WUs? And how is this done? Thanks! |
Send message Joined: 5 Aug 04 Posts: 390 Credit: 2,475,242 RAC: 0 |
It works for attaching to climateapps2.oucs.ox.ac.uk/cpdnboinc instead of classic address. > You've said that we can connect to the server at > http://climateapps2.oucs.ox.ac.uk/cpdnboinc instead of the > climatepredition.net. Can this be done while crunching or will it cause > problems with the WUs? And how is this done? > > Thanks! > |
Send message Joined: 20 Aug 04 Posts: 10 Credit: 132,163 RAC: 0 |
As I see from our team, people sent results yesterday and today. But me NOT! How this is possible? I just don't have enough luck? I did not change anything, BOINC is running on this slow 2 x P3 800MHz server for a weeks. And I can't connect from my P4 workstation too ... so it is really strange. ALL GLORY TO THE HYPNOTOAD! Potrebujete pomoc? My Stats |
Send message Joined: 5 Aug 04 Posts: 390 Credit: 2,475,242 RAC: 0 |
This is what i ment earlier in another thread - only some machines/regions e.g. first floor vs. second floor in the same house :-) are affected. After a restart, my main machine has the Master file fetch error, other two are happy... > As I see from our team, people sent results yesterday and today. But me NOT! > How this is possible? I just don't have enough luck? I did not change > anything, BOINC is running on this slow 2 x P3 800MHz server for a weeks. > And I can't connect from my P4 workstation too ... so it is really strange. > |
Send message Joined: 27 Aug 04 Posts: 55 Credit: 1,106,201 RAC: 0 |
Add me to the list of those having the same exact symptoms as Honza. My symptoms started not long after a reboot, also. Now I'm wary of doing any power cycling on my other machines! :-o |
Send message Joined: 5 Aug 04 Posts: 66 Credit: 2,146,056 RAC: 0 |
> Add me to the list of those having the same exact symptoms as Honza. My > symptoms started not long after a reboot, also. Now I'm wary of doing any > power cycling on my other machines! :-o > I must be one of the lucky ones. My two machines continue to trickle without any problems. |
Send message Joined: 5 Aug 04 Posts: 173 Credit: 1,843,046 RAC: 0 |
The scheduler RPC mechanism should now be working as it should. You can force an update to update stats amongst other things and remove those annoying warning messages. |
Send message Joined: 20 Aug 04 Posts: 10 Credit: 132,163 RAC: 0 |
I still have the same error ... climateprediction.net - 2004-12-07 14:00:22 - Master file parse failed climateprediction.net - 2004-12-07 14:00:22 - Could not contact any schedulers for http://climateprediction.net/. climateprediction.net - 2004-12-07 14:00:22 - Could not contact any schedulers for http://climateprediction.net/. climateprediction.net - 2004-12-07 14:00:22 - Deferring communication with project for 1 weeks, 5 days, 13 hours, 53 minutes, and 9 seconds (times are in GMT+1) ALL GLORY TO THE HYPNOTOAD! Potrebujete pomoc? My Stats |
©2024 cpdn.org