Message boards : Number crunching : Problem uploading final results
Message board moderation
Previous · 1 · 2
Author | Message |
---|---|
Send message Joined: 5 Aug 04 Posts: 1283 Credit: 15,824,334 RAC: 0 |
> Found the trickle files. Took all but 30 of then into a new directory. How can > I "do an update"? Hope your travels were enjoyable Jose :) Go to the 'Projects' tab on the BOINC window and right click on the 'climateprediction.net' label. This will open up a menu which has 'Update' as its second item. To merge your computers go to <a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/hosts_user.php">your computer list</a>, and select your highest 'Comupter Id' number (I think it's 149014). Just above the trickle list at the bottom of the page you'll find a link 'merge this computer'. I think <a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/host_edit_form.php?hostid=149014">this</a> should give you a direct link to the computer's page. If you go to the bottom of the page there is a 'Select all' option, which makes it a lot easier than marking each host individually (particularly as you've now got 208 to merge with this one!). Then all you need to do is click on the 'Merge hosts' button. "The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer |
Send message Joined: 3 Sep 04 Posts: 268 Credit: 256,045 RAC: 0 |
Hi, I was wondering why and how so much computers were created for this user. On CPDN Alpha, a new computer was created in my account but the host ID was different in Boinc: result: impossible to contact the scheduler, the host ID was not recognised. I simply detached and lost the work done :o(. Is there a solution to resolve this kind of problem (edition of a xml file?) Cheers Arnaud |
Send message Joined: 5 Aug 04 Posts: 390 Credit: 2,475,242 RAC: 0 |
There is host id recorded in sched_request_*.xml ; not sure what will happen after editing this data. But as editing of host_venue has been suggested as solving problem of location: home/work, i asume editing host id is also possible. <i>phpBB forum for CPDN, all are </i><a href="http://www.climateprediction.net/board">invited</a> |
Send message Joined: 3 Sep 04 Posts: 268 Credit: 256,045 RAC: 0 |
Thanks Honza, I will try to edit sched_request.xml the next time I have this problem again :o) Arnaud |
Send message Joined: 5 Aug 04 Posts: 1283 Credit: 15,824,334 RAC: 0 |
> I was wondering why and how so much computers were created for this user. Me too, but my ramblings below might give an explanation ... > On CPDN Alpha, a new computer was created in my account but the host ID was > different in Boinc: result: impossible to contact the scheduler, the host ID > was not recognised. I simply detached and lost the work done :o(. I can't understand how your host id could have been mixed up like that Arnaud. I can only think of 2 circumstances when the scheduler could send a sched_response_*.xml file with a host id: 1) when you initially connect to the project. 2) when you restore client_state.xml from a backup the scheduler will detect if your scheduler request number is less than it was expecting and allocate you a new host number. I wonder if #2 (or something else I haven't thought of) might have happened and you didn't receive the scheduler response. You will continue to use the old host id for your scheduler requests. #2 will create a new host id until your request count increases to what the scheduler expects (anything else will just continue creating new host ids until a scheduler reply gets through with the new host id, and I suspect this might be what's happening in Jose's case). If you then merge your computers the scheduler will stop recognising all the merged host ids but your client host id will stay with an old value if it didn't receive the last scheduler response setting new host id. > Is there a solution to resolve this kind of problem (edition of a xml file?) The host id values in the sched_request_*.xml files are taken from the project sections in the client_state.xml file, and that's where you need to make any changes. You need to make sure you're making the change for the right project ;) "The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer |
Send message Joined: 3 Sep 04 Posts: 268 Credit: 256,045 RAC: 0 |
I think #2 is the right answer.:o) I don't remember having used a back-up, but it's possible (I do it sometimes when Boinc freeze without reason). As the network access is always disabled on this machine, I have nonetheless difficulties to understand how this new host was created (but i'm a little dumb sometimes :o)) From time to time I upgrade from CC4.19 to CC4.25 to meet the deadlines of certains Wus and I wonder if boinc 4.25 doesn't create a new host ID when connecting a project for the first time. As CPDN Alpha scheduler isn't working very well, it could have been a problem of communication between CC4.25 and the CPDN Alpha site. Unfortunately I remarked that the host ID in boinc and on my account page were different <b>after</b> I merged the computers: 121 in boinc and 313 on the web page. Next time, I'll check the number in BOINC and on the web site before merging. Arnaud |
Send message Joined: 23 Sep 04 Posts: 7 Credit: 341,690 RAC: 0 |
Thanks a lot Thyme. I think all the files have been sent, all the computers id merged and there's a new model calculating at the moment. (The "select all" option to merge the computers didn't work an i had to do it manually) |
Send message Joined: 5 Aug 04 Posts: 1283 Credit: 15,824,334 RAC: 0 |
> I think #2 is the right answer.:o) > I don't remember having used a back-up, but it's possible (I do it sometimes > when Boinc freeze without reason). > As the network access is always disabled on this machine, I have nonetheless > difficulties to understand how this new host was created (but i'm a little > dumb sometimes :o)) > > From time to time I upgrade from CC4.19 to CC4.25 to meet the deadlines of > certains Wus and I wonder if boinc 4.25 doesn't create a new host ID when > connecting a project for the first time. As CPDN Alpha scheduler isn't working > very well, it could have been a problem of communication between CC4.25 and > the CPDN Alpha site. > > Unfortunately I remarked that the host ID in boinc and on my account page were > different after I merged the computers: 121 in boinc and 313 on the web page. > Next time, I'll check the number in BOINC and on the web site before merging. Hi Arnaud, I posted the details of your problem to the BOINC development mailing list, including an idea for a server change to force the merged hostid down to the client. David Anderson has implemented a fix and deployed it on the BOINC alpha test and S@H projects for testing. If it works as expected the server and client hostid should be aligned very quickly after a host merge. Mind you, we will have to wait for the rebuild of the CPDN scheduler and webpages to reach the top of Tolu's workload ... "The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer |
Send message Joined: 3 Sep 04 Posts: 268 Credit: 256,045 RAC: 0 |
Hi Thyme Lawn I've seen the mail in the dev mailing list, but I didn't know it was you who had written it (what a nice coincidence, thought I :o)) I have seen that I was not the only user who had had this problem (E@H users too) and I thank you and David for the improvement of the code. As it seems there are still problems with 4.12 apps in Alpha CPDN, Tolu must be buried under tons of work: Hope he doesn't burn-out :o) Arnaud |
Send message Joined: 30 Jan 05 Posts: 16 Credit: 65,203 RAC: 0 |
File transfer problems showing up again. This time it's feeding me the sulfer model -- about 30+ Meg on a crappy dialup. I know no way of forcing the project to send me another 4.13 HadSM model instead, and not have to take this new software. I'm updating to BOINC 4.45 to see if it resolves some incompatability with my dial-up software -- I get no data transfer when BOINC is open, even in suspended or disable network mode -- and then when I exit BOINC, my connection works again for other apps. If the incompatability persists -- I guess I'm done. I've said that before, but this time the file transfer problems involve 30+ Meg instead of 10+, which generally consumes about 2-3 days to complete. That means a week or more with the damned sulfur model. Unless I can force a 4.13 download. Is there a way? |
Send message Joined: 31 Aug 04 Posts: 239 Credit: 2,933,299 RAC: 0 |
Aparrently not. It looks like the new sulphur model is all that is going to be issued, at least that is according to the news. I would trade shoes with you except I can't do a model at all... :( The OS-X application kept client erroring on me with 4.43 and 4.13 so, I could not complete models. Eventually I gave up. Now, it looks like the updated model is not going to be issued for OS-X ... :( very depressing ... <p> <a href="http://boinc-doc.net/boinc-wiki/index.php"><b>BOINC-Wiki</b></a> <img src="http://www.boincstats.com/stats/banner.php?cpid=a6477942e70ed39f669d1ff2ede05be8"> |
Send message Joined: 3 Sep 04 Posts: 268 Credit: 256,045 RAC: 0 |
Hi, Paul Tolu said on the sulphur site that a Mac app would be available soon (they're working on it, the apps should be available in a couple of weeks). ----------------------------------------------- <a href="http://boinc-doc.net/boinc-wiki/index.php?title=Main_Page">Boinc Wiki</a> <a href="http://forum.boinc.fr/">L'Alliance Francophone</a> |
Send message Joined: 31 Aug 04 Posts: 239 Credit: 2,933,299 RAC: 0 |
Praise be to heaven ... :) I feel naked not having CPDN. During the SETI@Home crisis it has been bad here. Especially as it seems to be taking Einstein@Home with it somehow ... So, I can only run Predictor@Home. Makes me feel undressed. :) <p> <a href="http://boinc-doc.net/boinc-wiki/index.php"><b>BOINC-Wiki</b></a> <img src="http://www.boincstats.com/stats/banner.php?cpid=a6477942e70ed39f669d1ff2ede05be8"> |
Send message Joined: 5 Aug 04 Posts: 907 Credit: 299,864 RAC: 0 |
I haven't quite figured out why BOINC is just sending out sulphur cycle projects; I have it in the config that if someone comes in with lower disk space than needed, they should get hadsm3. I will have to look at it next week. |
Send message Joined: 5 Aug 04 Posts: 1283 Credit: 15,824,334 RAC: 0 |
While you're at it, here's another one that needs to revert to hadsm3 Carl Message from server: No work sent (won't finish in time) Computer available 56.0% of time, this project gets 100.0% of that <br><a href="http://www.teampicard.net/"><img src="http://www.teampicard.net/images/picardmini.gif"></a><a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/team_display.php?teamid=3">Join us here</a> |
Send message Joined: 31 Aug 04 Posts: 239 Credit: 2,933,299 RAC: 0 |
I just installed 4.72 to see if that fixes the problem I am having getting Einstein@Home work ... at the moment I seem to be fighting the "ghost" issue problem with 4.43 and SETI@Home being down ... But, for CPDN, I am getting this: 2005-08-26 10:47:12 [climateprediction.net] Sending scheduler request to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi 2005-08-26 10:47:12 [climateprediction.net] Reason: Requested by user 2005-08-26 10:47:12 [climateprediction.net] Requesting 345600 seconds of work, returning 0 results 2005-08-26 10:47:13 [climateprediction.net] Scheduler request to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi succeeded 2005-08-26 10:47:13 [climateprediction.net] Message from server: No work sent 2005-08-26 10:47:13 [climateprediction.net] Message from server: (there was work for other platforms) which is interesting ... <p> <a href="http://boinc-doc.net/boinc-wiki/index.php"><b>BOINC-Wiki</b></a> <img src="http://www.boincstats.com/stats/banner.php?cpid=a6477942e70ed39f669d1ff2ede05be8"> |
Send message Joined: 30 Jan 05 Posts: 16 Credit: 65,203 RAC: 0 |
Well -- I still don't understand the nature of the network issues in the intermittent ability to download, amidst the more common shutdown of all network activity when I try to download from CPDN. That 23 Meg file keeps aborting and restarting from zero. The fourth time, I set "no new work" from CP... and will complete my last project about 3 weeks from now -- or 1 week should it prove unstable in Phase II, as the graphics seem to suggest already, half way into Phase II. I guess I'll see how proteins fold. I'm no big fan of Carl Sagen, and see SETI as essentially spinning wheels. With users dropping out from the sulfur model for various reasons, and each run taking 5x the computation time as HadSM3 -- then I hope to hell they've made good use of HadSM3 to cull unreasonable values from within model space -- like by 2 orders of magnitude, to be able to explore much of model space at all in the sulfur model. |
Send message Joined: 16 Oct 04 Posts: 692 Credit: 277,679 RAC: 0 |
It should be about 2.7 times longer rather than 5 times longer. >2 orders of magnitude Well I suppose keeping the reduction to 1 order of magnitude would be a tough target. With over 100,000 slab models that would require 10,000 sulphur models and 500? ocean spin ups by Jan 06 in order to be able to start the hindcast in Feb 2006 as planned. I expect it will be possible to complete sulphur models after the start of the hindcast and still be useful. It would be a shame if lack of computers meant the Feb 06 target for the hindcast was not possible. If it is not downloading the file, perhaps rebooting the computer might help or maybe someone will make it available elsewhere or start putting these files on CD for modem users... _______________________________ Visit <a href="http://boinc-doc.net/boinc-wiki/index.php?title=Climateprediction_FAQ">BOINC WIKI</a> for help And join <a href="http://www.boincsynergy.com/">BOINC Synergy</a> for all the news in one place. |
Send message Joined: 30 Jan 05 Posts: 16 Credit: 65,203 RAC: 0 |
I was hoping, Crandles, for some opinion of what to do with the last slab model I'm running, if Phase II shows unstable. Is anything of value to be gained by its completion? Is completion the only way to do the large trickle, and otherwise you're just left with summary data from the little trickles. In the 3 days preceeding giving up, I redial more than 100 times and rebooted more than a dozen trying to get a clean connection. Those I did get never amounted to more than 5 or 6 Meg worth of download of that 23 Meg file, and it kept resetting and starting again at Zero -- rather than resuming. It was time to quit. I guess Sulfur runs don't introduce new parameters, and work within the parameter space of the slab models. I had imagined some 3x3x3 matrix of parameters specific to aerosol physics, and thus 2.7e6 runs covering the same parameter space as before. Good luck with circulating ocean. It would be cool -- though frightening -- if circulation in shown to stop within a reasonable subset of parameter space. I think the instafreeze of the Dennis Quaid movie is a bit far-fetched, though, in imagining disaster scenarios. The plot of that movie has some holes you could move a battleship through. --- an afterthought -- you mention beginning a hindcast in 2006. Is that described anywhere, especially the suite of forcings involved? |
Send message Joined: 16 Oct 04 Posts: 692 Credit: 277,679 RAC: 0 |
Some very good questions Bob. I don't know the answers but I will see if I can attract Carl's attention to them. None of the parameters described at http://www.climateprediction.net/board/viewtopic.php?t=2022 seem to be specific to sulphur. There is also http://www.climateprediction.net/board/viewtopic.php?t=2031 regarding number of runs. On the hindcast, there is http://www.climateprediction.net/science/strategy.php and http://www.climateprediction.net/science/strategy_adv.php (the 2005 date on the advanced page is out of date.) But there is not much about the forcings involved. _______________________________ Visit <a href="http://boinc-doc.net/boinc-wiki/index.php?title=Climateprediction_FAQ">BOINC WIKI</a> for help And join <a href="http://www.boincsynergy.com/">BOINC Synergy</a> for all the news in one place. |
©2024 cpdn.org