climateprediction.net (CPDN) home page
Thread 'Problem uploading final results'

Thread 'Problem uploading final results'

Message boards : Number crunching : Problem uploading final results
Message board moderation

To post messages, you must log in.

Previous · 1 · 2

AuthorMessage
ProfileThyme Lawn
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1283
Credit: 15,824,334
RAC: 0
Message 11655 - Posted: 6 Apr 2005, 10:23:12 UTC - in response to Message 11654.  

> Found the trickle files. Took all but 30 of then into a new directory. How can
> I "do an update"?

Hope your travels were enjoyable Jose :)

Go to the 'Projects' tab on the BOINC window and right click on the 'climateprediction.net' label. This will open up a menu which has 'Update' as its second item.

To merge your computers go to <a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/hosts_user.php">your computer list</a>, and select your highest 'Comupter Id' number (I think it's 149014). Just above the trickle list at the bottom of the page you'll find a link 'merge this computer'. I think <a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/host_edit_form.php?hostid=149014">this</a> should give you a direct link to the computer's page.

If you go to the bottom of the page there is a 'Select all' option, which makes it a lot easier than marking each host individually (particularly as you've now got 208 to merge with this one!). Then all you need to do is click on the 'Merge hosts' button.
"The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer
ID: 11655 · Report as offensive     Reply Quote
Arnaud

Send message
Joined: 3 Sep 04
Posts: 268
Credit: 256,045
RAC: 0
Message 11658 - Posted: 6 Apr 2005, 15:02:38 UTC

Hi,
I was wondering why and how so much computers were created for this user.
On CPDN Alpha, a new computer was created in my account but the host ID was different in Boinc: result: impossible to contact the scheduler, the host ID was not recognised. I simply detached and lost the work done :o(.
Is there a solution to resolve this kind of problem (edition of a xml file?)
Cheers
Arnaud
ID: 11658 · Report as offensive     Reply Quote
ProfileHonza
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 390
Credit: 2,475,242
RAC: 0
Message 11659 - Posted: 6 Apr 2005, 15:26:06 UTC

There is host id recorded in sched_request_*.xml ; not sure what will happen after editing this data. But as editing of host_venue has been suggested as solving problem of location: home/work, i asume editing host id is also possible.
<i>phpBB forum for CPDN, all are </i><a href="http://www.climateprediction.net/board">invited</a>
ID: 11659 · Report as offensive     Reply Quote
Arnaud

Send message
Joined: 3 Sep 04
Posts: 268
Credit: 256,045
RAC: 0
Message 11660 - Posted: 6 Apr 2005, 15:58:01 UTC

Thanks Honza,
I will try to edit sched_request.xml the next time I have this problem again
:o)
Arnaud
ID: 11660 · Report as offensive     Reply Quote
ProfileThyme Lawn
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1283
Credit: 15,824,334
RAC: 0
Message 11662 - Posted: 6 Apr 2005, 17:58:43 UTC - in response to Message 11658.  

&gt; I was wondering why and how so much computers were created for this user.

Me too, but my ramblings below might give an explanation ...

&gt; On CPDN Alpha, a new computer was created in my account but the host ID was
&gt; different in Boinc: result: impossible to contact the scheduler, the host ID
&gt; was not recognised. I simply detached and lost the work done :o(.

I can't understand how your host id could have been mixed up like that Arnaud. I can only think of 2 circumstances when the scheduler could send a sched_response_*.xml file with a host id:

1) when you initially connect to the project.

2) when you restore client_state.xml from a backup the scheduler will detect if your scheduler request number is less than it was expecting and allocate you a new host number.

I wonder if #2 (or something else I haven't thought of) might have happened and you didn't receive the scheduler response. You will continue to use the old host id for your scheduler requests. #2 will create a new host id until your request count increases to what the scheduler expects (anything else will just continue creating new host ids until a scheduler reply gets through with the new host id, and I suspect this might be what's happening in Jose's case). If you then merge your computers the scheduler will stop recognising all the merged host ids but your client host id will stay with an old value if it didn't receive the last scheduler response setting new host id.

&gt; Is there a solution to resolve this kind of problem (edition of a xml file?)

The host id values in the sched_request_*.xml files are taken from the project sections in the client_state.xml file, and that's where you need to make any changes. You need to make sure you're making the change for the right project ;)
"The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer
ID: 11662 · Report as offensive     Reply Quote
Arnaud

Send message
Joined: 3 Sep 04
Posts: 268
Credit: 256,045
RAC: 0
Message 11666 - Posted: 6 Apr 2005, 19:04:29 UTC
Last modified: 6 Apr 2005, 19:29:54 UTC

I think #2 is the right answer.:o)
I don't remember having used a back-up, but it's possible (I do it sometimes when Boinc freeze without reason).
As the network access is always disabled on this machine, I have nonetheless difficulties to understand how this new host was created (but i'm a little dumb sometimes :o))

From time to time I upgrade from CC4.19 to CC4.25 to meet the deadlines of certains Wus and I wonder if boinc 4.25 doesn't create a new host ID when connecting a project for the first time. As CPDN Alpha scheduler isn't working very well, it could have been a problem of communication between CC4.25 and the CPDN Alpha site.

Unfortunately I remarked that the host ID in boinc and on my account page were different <b>after</b> I merged the computers: 121 in boinc and 313 on the web page. Next time, I'll check the number in BOINC and on the web site before merging.

Arnaud
ID: 11666 · Report as offensive     Reply Quote
old_user20331

Send message
Joined: 23 Sep 04
Posts: 7
Credit: 341,690
RAC: 0
Message 11680 - Posted: 7 Apr 2005, 7:40:29 UTC

Thanks a lot Thyme.
I think all the files have been sent, all the computers id merged and there's a new model calculating at the moment. (The "select all" option to merge the computers didn't work an i had to do it manually)
ID: 11680 · Report as offensive     Reply Quote
ProfileThyme Lawn
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1283
Credit: 15,824,334
RAC: 0
Message 11710 - Posted: 8 Apr 2005, 13:51:29 UTC - in response to Message 11666.  

&gt; I think #2 is the right answer.:o)
&gt; I don't remember having used a back-up, but it's possible (I do it sometimes
&gt; when Boinc freeze without reason).
&gt; As the network access is always disabled on this machine, I have nonetheless
&gt; difficulties to understand how this new host was created (but i'm a little
&gt; dumb sometimes :o))
&gt;
&gt; From time to time I upgrade from CC4.19 to CC4.25 to meet the deadlines of
&gt; certains Wus and I wonder if boinc 4.25 doesn't create a new host ID when
&gt; connecting a project for the first time. As CPDN Alpha scheduler isn't working
&gt; very well, it could have been a problem of communication between CC4.25 and
&gt; the CPDN Alpha site.
&gt;
&gt; Unfortunately I remarked that the host ID in boinc and on my account page were
&gt; different after I merged the computers: 121 in boinc and 313 on the web page.
&gt; Next time, I'll check the number in BOINC and on the web site before merging.

Hi Arnaud,

I posted the details of your problem to the BOINC development mailing list, including an idea for a server change to force the merged hostid down to the client. David Anderson has implemented a fix and deployed it on the BOINC alpha test and S@H projects for testing.

If it works as expected the server and client hostid should be aligned very quickly after a host merge. Mind you, we will have to wait for the rebuild of the CPDN scheduler and webpages to reach the top of Tolu's workload ...
"The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer
ID: 11710 · Report as offensive     Reply Quote
Arnaud

Send message
Joined: 3 Sep 04
Posts: 268
Credit: 256,045
RAC: 0
Message 11711 - Posted: 8 Apr 2005, 14:59:45 UTC
Last modified: 8 Apr 2005, 15:37:57 UTC

Hi Thyme Lawn
I've seen the mail in the dev mailing list, but I didn't know it was you who had written it (what a nice coincidence, thought I :o))
I have seen that I was not the only user who had had this problem (E@H users too) and I thank you and David for the improvement of the code.
As it seems there are still problems with 4.12 apps in Alpha CPDN, Tolu must be buried under tons of work: Hope he doesn't burn-out :o)
Arnaud
ID: 11711 · Report as offensive     Reply Quote
old_user46965

Send message
Joined: 30 Jan 05
Posts: 16
Credit: 65,203
RAC: 0
Message 15410 - Posted: 26 Aug 2005, 12:27:00 UTC

File transfer problems showing up again. This time it's feeding me the sulfer model -- about 30+ Meg on a crappy dialup. I know no way of forcing the project to send me another 4.13 HadSM model instead, and not have to take this new software.

I'm updating to BOINC 4.45 to see if it resolves some incompatability with my dial-up software -- I get no data transfer when BOINC is open, even in suspended or disable network mode -- and then when I exit BOINC, my connection works again for other apps.

If the incompatability persists -- I guess I'm done. I've said that before, but this time the file transfer problems involve 30+ Meg instead of 10+, which generally consumes about 2-3 days to complete. That means a week or more with the damned sulfur model.

Unless I can force a 4.13 download. Is there a way?
ID: 15410 · Report as offensive     Reply Quote
Profileold_user5994

Send message
Joined: 31 Aug 04
Posts: 239
Credit: 2,933,299
RAC: 0
Message 15415 - Posted: 26 Aug 2005, 15:29:41 UTC

Aparrently not. It looks like the new sulphur model is all that is going to be issued, at least that is according to the news. I would trade shoes with you except I can't do a model at all... :(

The OS-X application kept client erroring on me with 4.43 and 4.13 so, I could not complete models. Eventually I gave up. Now, it looks like the updated model is not going to be issued for OS-X ... :(

very depressing ...
<p>
<a href="http://boinc-doc.net/boinc-wiki/index.php"><b>BOINC-Wiki</b></a>
<img src="http://www.boincstats.com/stats/banner.php?cpid=a6477942e70ed39f669d1ff2ede05be8">
ID: 15415 · Report as offensive     Reply Quote
Arnaud

Send message
Joined: 3 Sep 04
Posts: 268
Credit: 256,045
RAC: 0
Message 15417 - Posted: 26 Aug 2005, 15:36:19 UTC
Last modified: 26 Aug 2005, 15:37:03 UTC

Hi, Paul
Tolu said on the sulphur site that a Mac app would be available soon (they're working on it, the apps should be available in a couple of weeks).

-----------------------------------------------
<a href="http://boinc-doc.net/boinc-wiki/index.php?title=Main_Page">Boinc Wiki</a>
<a href="http://forum.boinc.fr/">L'Alliance Francophone</a>
ID: 15417 · Report as offensive     Reply Quote
Profileold_user5994

Send message
Joined: 31 Aug 04
Posts: 239
Credit: 2,933,299
RAC: 0
Message 15418 - Posted: 26 Aug 2005, 15:42:38 UTC

Praise be to heaven ... :)

I feel naked not having CPDN. During the SETI@Home crisis it has been bad here. Especially as it seems to be taking Einstein@Home with it somehow ... So, I can only run Predictor@Home. Makes me feel undressed. :)
<p>
<a href="http://boinc-doc.net/boinc-wiki/index.php"><b>BOINC-Wiki</b></a>
<img src="http://www.boincstats.com/stats/banner.php?cpid=a6477942e70ed39f669d1ff2ede05be8">
ID: 15418 · Report as offensive     Reply Quote
old_user1
Avatar

Send message
Joined: 5 Aug 04
Posts: 907
Credit: 299,864
RAC: 0
Message 15424 - Posted: 26 Aug 2005, 16:11:31 UTC - in response to Message 15418.  

I haven't quite figured out why BOINC is just sending out sulphur cycle projects; I have it in the config that if someone comes in with lower disk space than needed, they should get hadsm3. I will have to look at it next week.
ID: 15424 · Report as offensive     Reply Quote
ProfileThyme Lawn
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1283
Credit: 15,824,334
RAC: 0
Message 15431 - Posted: 26 Aug 2005, 16:51:35 UTC
Last modified: 26 Aug 2005, 16:51:53 UTC

While you're at it, here's another one that needs to revert to hadsm3 Carl

Message from server: No work sent (won't finish in time) Computer available 56.0% of time, this project gets 100.0% of that
<br><a href="http://www.teampicard.net/"><img src="http://www.teampicard.net/images/picardmini.gif"></a><a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/team_display.php?teamid=3">Join us here</a>
ID: 15431 · Report as offensive     Reply Quote
Profileold_user5994

Send message
Joined: 31 Aug 04
Posts: 239
Credit: 2,933,299
RAC: 0
Message 15433 - Posted: 26 Aug 2005, 17:43:24 UTC

I just installed 4.72 to see if that fixes the problem I am having getting Einstein@Home work ... at the moment I seem to be fighting the "ghost" issue problem with 4.43 and SETI@Home being down ...

But, for CPDN, I am getting this:

2005-08-26 10:47:12 [climateprediction.net] Sending scheduler request to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi
2005-08-26 10:47:12 [climateprediction.net] Reason: Requested by user
2005-08-26 10:47:12 [climateprediction.net] Requesting 345600 seconds of work, returning 0 results
2005-08-26 10:47:13 [climateprediction.net] Scheduler request to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi succeeded
2005-08-26 10:47:13 [climateprediction.net] Message from server: No work sent
2005-08-26 10:47:13 [climateprediction.net] Message from server: (there was work for other platforms)

which is interesting ...
<p>
<a href="http://boinc-doc.net/boinc-wiki/index.php"><b>BOINC-Wiki</b></a>
<img src="http://www.boincstats.com/stats/banner.php?cpid=a6477942e70ed39f669d1ff2ede05be8">
ID: 15433 · Report as offensive     Reply Quote
old_user46965

Send message
Joined: 30 Jan 05
Posts: 16
Credit: 65,203
RAC: 0
Message 15447 - Posted: 27 Aug 2005, 2:26:45 UTC

Well -- I still don't understand the nature of the network issues in the intermittent ability to download, amidst the more common shutdown of all network activity when I try to download from CPDN. That 23 Meg file keeps aborting and restarting from zero. The fourth time, I set "no new work" from CP... and will complete my last project about 3 weeks from now -- or 1 week should it prove unstable in Phase II, as the graphics seem to suggest already, half way into Phase II.

I guess I'll see how proteins fold. I'm no big fan of Carl Sagen, and see SETI as essentially spinning wheels.

With users dropping out from the sulfur model for various reasons, and each run taking 5x the computation time as HadSM3 -- then I hope to hell they've made good use of HadSM3 to cull unreasonable values from within model space -- like by 2 orders of magnitude, to be able to explore much of model space at all in the sulfur model.
ID: 15447 · Report as offensive     Reply Quote
crandles
Volunteer moderator

Send message
Joined: 16 Oct 04
Posts: 692
Credit: 277,679
RAC: 0
Message 15453 - Posted: 27 Aug 2005, 10:44:45 UTC

It should be about 2.7 times longer rather than 5 times longer.

&gt;2 orders of magnitude

Well I suppose keeping the reduction to 1 order of magnitude would be a tough target. With over 100,000 slab models that would require 10,000 sulphur models and 500? ocean spin ups by Jan 06 in order to be able to start the hindcast in Feb 2006 as planned. I expect it will be possible to complete sulphur models after the start of the hindcast and still be useful.

It would be a shame if lack of computers meant the Feb 06 target for the hindcast was not possible.

If it is not downloading the file, perhaps rebooting the computer might help or maybe someone will make it available elsewhere or start putting these files on CD for modem users...
_______________________________
Visit <a href="http://boinc-doc.net/boinc-wiki/index.php?title=Climateprediction_FAQ">BOINC WIKI</a> for help

And join <a href="http://www.boincsynergy.com/">BOINC Synergy</a> for all the news in one place.
ID: 15453 · Report as offensive     Reply Quote
old_user46965

Send message
Joined: 30 Jan 05
Posts: 16
Credit: 65,203
RAC: 0
Message 15514 - Posted: 29 Aug 2005, 9:46:42 UTC
Last modified: 29 Aug 2005, 10:25:09 UTC

I was hoping, Crandles, for some opinion of what to do with the last slab model I'm running, if Phase II shows unstable. Is anything of value to be gained by its completion? Is completion the only way to do the large trickle, and otherwise you're just left with summary data from the little trickles.

In the 3 days preceeding giving up, I redial more than 100 times and rebooted more than a dozen trying to get a clean connection. Those I did get never amounted to more than 5 or 6 Meg worth of download of that 23 Meg file, and it kept resetting and starting again at Zero -- rather than resuming. It was time to quit.

I guess Sulfur runs don't introduce new parameters, and work within the parameter space of the slab models. I had imagined some 3x3x3 matrix of parameters specific to aerosol physics, and thus 2.7e6 runs covering the same parameter space as before. Good luck with circulating ocean. It would be cool -- though frightening -- if circulation in shown to stop within a reasonable subset of parameter space. I think the instafreeze of the Dennis Quaid movie is a bit far-fetched, though, in imagining disaster scenarios. The plot of that movie has some holes you could move a battleship through.

--- an afterthought -- you mention beginning a hindcast in 2006. Is that described anywhere, especially the suite of forcings involved?
ID: 15514 · Report as offensive     Reply Quote
crandles
Volunteer moderator

Send message
Joined: 16 Oct 04
Posts: 692
Credit: 277,679
RAC: 0
Message 15516 - Posted: 29 Aug 2005, 11:12:17 UTC

Some very good questions Bob.

I don't know the answers but I will see if I can attract Carl's attention to them.

None of the parameters described at http://www.climateprediction.net/board/viewtopic.php?t=2022
seem to be specific to sulphur.

There is also http://www.climateprediction.net/board/viewtopic.php?t=2031 regarding number of runs.

On the hindcast, there is
http://www.climateprediction.net/science/strategy.php and http://www.climateprediction.net/science/strategy_adv.php
(the 2005 date on the advanced page is out of date.)
But there is not much about the forcings involved.
_______________________________
Visit <a href="http://boinc-doc.net/boinc-wiki/index.php?title=Climateprediction_FAQ">BOINC WIKI</a> for help

And join <a href="http://www.boincsynergy.com/">BOINC Synergy</a> for all the news in one place.
ID: 15516 · Report as offensive     Reply Quote
Previous · 1 · 2

Message boards : Number crunching : Problem uploading final results

©2024 cpdn.org