climateprediction.net (CPDN) home page
Thread 'Valid or not valid?'

Thread 'Valid or not valid?'

Message boards : Number crunching : Valid or not valid?
Message board moderation

To post messages, you must log in.

AuthorMessage
Profile[AF>Libristes] erik

Send message
Joined: 6 Mar 08
Posts: 6
Credit: 836,193
RAC: 0
Message 51486 - Posted: 28 Feb 2015, 11:41:25 UTC

I have a problem with the Stderr with a UK Met Office HadCM3 short v7.24 task:

<core_client_version>7.0.27</core_client_version>
<![CDATA[
<stderr_txt>
from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
.../...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
01:43:38 (2968): called boinc_finish

</stderr_txt>
]]>

It's the task or my BOINC client? 7.0.27
ID: 51486 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4540
Credit: 19,039,635
RAC: 18,944
Message 51487 - Posted: 28 Feb 2015, 11:55:05 UTC

I have seen this before when a task falls over after successfully completing. I would say it is valid.

However looking at the number of suspend requests on this and another task that did fall over it may well be worth looking at this post from Les [url]http://climateapps2.oerc.ox.ac.uk/cpdnboinc/forum_thread.php?id=7931#50571
[/url]
I also notice your wingman on another work unit is crashing everything due to missing 32bit libs so will pass that on to get owner notified and tasks not sent to said computer till problem fixed.

ID: 51487 · Report as offensive     Reply Quote
Profile[AF>Libristes] erik

Send message
Joined: 6 Mar 08
Posts: 6
Credit: 836,193
RAC: 0
Message 51491 - Posted: 1 Mar 2015, 7:12:19 UTC

My computer sent another task with the same problem this night.
It seems this is due to BOINC manager CPU parameter not at 100% because heat problems.
I'll try another task with the new parameters.
Thanks for the link.
erik.
ID: 51491 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4540
Credit: 19,039,635
RAC: 18,944
Message 51492 - Posted: 1 Mar 2015, 8:00:09 UTC

If like most machines these days it is multicore then best way of reducing heat is to limit BOINC to using n-1 cores where n is the number of cores the machine has.
ID: 51492 · Report as offensive     Reply Quote
Profile[AF>Libristes] erik

Send message
Joined: 6 Mar 08
Posts: 6
Credit: 836,193
RAC: 0
Message 51494 - Posted: 1 Mar 2015, 19:11:21 UTC

I strictly followed all the parameters provided in the link you provided me. I still have a task to finish before loading a new one and test the new parameters.
Thank you for your help.
erik.
ID: 51494 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4540
Credit: 19,039,635
RAC: 18,944
Message 51495 - Posted: 1 Mar 2015, 21:35:53 UTC - in response to Message 51494.  

At least with Linux we still have a reasonable number of tasks sitting there waiting :)
ID: 51495 · Report as offensive     Reply Quote
ProfileJIM

Send message
Joined: 31 Dec 07
Posts: 1152
Credit: 22,363,583
RAC: 5,022
Message 51496 - Posted: 1 Mar 2015, 22:27:15 UTC

Yes, it would be nice is they could drop some more tasks in the hopper for Windows.

ID: 51496 · Report as offensive     Reply Quote

Message boards : Number crunching : Valid or not valid?

©2024 cpdn.org