climateprediction.net (CPDN) home page
Thread 'Why can\'t my pc crunch cpdn wus?'

Thread 'Why can\'t my pc crunch cpdn wus?'

Message boards : Number crunching : Why can\'t my pc crunch cpdn wus?
Message board moderation

To post messages, you must log in.

AuthorMessage
Kenneth Larsen

Send message
Joined: 26 Aug 04
Posts: 59
Credit: 438,133
RAC: 0
Message 7106 - Posted: 30 Dec 2004, 15:07:14 UTC

I seem to constantly loose work units due to computation errors or model crashes on one of my computers, an Athlon XP 3000+. At first I thought the problem was overheating, but since everything else runs stable I'm beginning to doubt that. I've checked the ram with Memtest86.
It has been running stable for the past months, but about one month ago it started crashing on cpdn. Now I almost can't run for an hour before the model crashes.
Is this to be expected with such a complex model, or is it just my pc that is weird?

Happy new year :-)
Proud owner of the CPDN Wow-Mug!
ID: 7106 · Report as offensive     Reply Quote
ProfileAndrew Hingston
Volunteer moderator

Send message
Joined: 17 Aug 04
Posts: 753
Credit: 9,804,700
RAC: 0
Message 7110 - Posted: 30 Dec 2004, 21:58:44 UTC - in response to Message 7106.  


> Is this to be expected with such a complex model, or is it just my pc that is
> weird?

There are other tests you can try - UK_Nick suggests some <a href="http://www.climateprediction.net/board/viewtopic.php?t=2126">here</a>. I would not suggest that it is 'to be expected'; I've been running an XP 3200+ without problems with CPDN in BOINC since August, and before that with the classic version. Plenty of others have been using Athlons for CPDN without difficulty, though some people have had problems with almost anything.

If the machine passes the other tests, then was there any pattern to the failure under CPDN? Were you using the graphics/screensaver?
ID: 7110 · Report as offensive     Reply Quote
Profilegeophi
Volunteer moderator

Send message
Joined: 7 Aug 04
Posts: 2187
Credit: 64,822,615
RAC: 5,275
Message 7115 - Posted: 31 Dec 2004, 0:02:53 UTC - in response to Message 7106.  

&gt; I seem to constantly loose work units due to computation errors or model
&gt; crashes on one of my computers, an Athlon XP 3000+. At first I thought the
&gt; problem was overheating, but since everything else runs stable I'm beginning
&gt; to doubt that. I've checked the ram with Memtest86.
&gt; It has been running stable for the past months, but about one month ago it
&gt; started crashing on cpdn. Now I almost can't run for an hour before the model
&gt; crashes.
&gt; Is this to be expected with such a complex model, or is it just my pc that is
&gt; weird?
&gt;
&gt; Happy new year :-)
&gt;
You also might want to try the basic hardware maintenance thread suggestions here

http://www.climateprediction.net/board/viewtopic.php?t=2124
ID: 7115 · Report as offensive     Reply Quote
Kenneth Larsen

Send message
Joined: 26 Aug 04
Posts: 59
Credit: 438,133
RAC: 0
Message 7121 - Posted: 31 Dec 2004, 12:33:03 UTC

I have followed all the suggestions from both tests, and have run Prime95 for a few hours without errors. I will tr and let it run for a full day soon and see if it changes anything.
I've not been able to see any pattern in the cpdn crashes, most of the day the pc runs with blank screensaver and when I check it, the model has crashed.

@Andrew Hingston, how warm is your cpu when running cpdn? Perhaps overheating is indeed the problem here, since it gets about 54C when running.
ID: 7121 · Report as offensive     Reply Quote
old_user29560

Send message
Joined: 12 Nov 04
Posts: 3
Credit: 3,374
RAC: 0
Message 7125 - Posted: 31 Dec 2004, 18:19:52 UTC

The only time i've had one model crash on me was when i was running a few other DC projects at the same time. Since then i run cpdn and 1 other and have had no problems.

AMD64@2.3
ID: 7125 · Report as offensive     Reply Quote
ProfileAndrew Hingston
Volunteer moderator

Send message
Joined: 17 Aug 04
Posts: 753
Credit: 9,804,700
RAC: 0
Message 7136 - Posted: 1 Jan 2005, 23:08:17 UTC - in response to Message 7121.  


&gt; @Andrew Hingston, how warm is your cpu when running cpdn? Perhaps overheating
&gt; is indeed the problem here, since it gets about 54C when running.
&gt;

This doesn't seem likely to be the problem. My Abit monitor is reporting a temp of 49 degrees with a big Coolermaster CPU cooler in place of the standard one. But these figures are not very reliable, so a difference of 5 degrees may be within the margin of error, and Athlons are reasonably tolerant of high temperatures.
ID: 7136 · Report as offensive     Reply Quote
Kenneth Larsen

Send message
Joined: 26 Aug 04
Posts: 59
Credit: 438,133
RAC: 0
Message 7139 - Posted: 2 Jan 2005, 8:47:57 UTC

This is indeed what I've been told too. Some people have their cpu (same type) running at close to 60C without problems. I'm beginning to think this is either a windows problem or a bad cpu...
Also, the computer has a bad habit of spontaneously restarting every 3-8 hours or so. I'm suspecting this is only happening when running cpdn, I'll have to investigate this further.
ID: 7139 · Report as offensive     Reply Quote
old_user23880
Volunteer tester

Send message
Joined: 10 Oct 04
Posts: 223
Credit: 4,664
RAC: 0
Message 7172 - Posted: 4 Jan 2005, 23:20:15 UTC

Hi Kenneth

Have a look at the boinc Questions and Problems message board, the Windows section. Go to the 'I've had it' thread, and also look at the other thread further down that is referred to.
__________________________________________________

ID: 7172 · Report as offensive     Reply Quote
Kenneth Larsen

Send message
Joined: 26 Aug 04
Posts: 59
Credit: 438,133
RAC: 0
Message 7179 - Posted: 5 Jan 2005, 12:45:21 UTC

After having run some tests and read the Questions and Problems board, I've come to the conclusion that the problem IS overheating. I've now had the pc running with the case plates off for several days without a single error or restart. But as soon as I put them on, it runs from 1 minute to a few hours and then crashes. Now I just have to find out how to get more airflow and still make it run silently (and not spend any money ;-) ).

Thanks for your help everyone!
ID: 7179 · Report as offensive     Reply Quote
old_user909

Send message
Joined: 17 Aug 04
Posts: 56
Credit: 63,814
RAC: 0
Message 7232 - Posted: 10 Jan 2005, 9:56:32 UTC

Then the question is what is overheating? It isn't neccessarily your CPU. It could be one of the chipset chips on the motherboard or your RAM. I was having similar problems to yours. I could run seti@home and LHC all I wanted with no ill effects but as soon as I started CPDN I had to severely underclock the system to keep it stable. In my case it seems to have turned out to be a bad stick of RAM. Both memtest86 and prime95 gave me clean bills of health but one day the stick of RAM finally gave up the ghost and refused to work at all. Since taking it out, the system has become more stable and a few other small quirks have gone away. Now if I can just get newegg to take back the RAM...
ID: 7232 · Report as offensive     Reply Quote
Kenneth Larsen

Send message
Joined: 26 Aug 04
Posts: 59
Credit: 438,133
RAC: 0
Message 7251 - Posted: 11 Jan 2005, 14:54:36 UTC

Today I bought some ram cooling sinks, I'll see if that changes anything.
ID: 7251 · Report as offensive     Reply Quote
Profilegeophi
Volunteer moderator

Send message
Joined: 7 Aug 04
Posts: 2187
Credit: 64,822,615
RAC: 5,275
Message 7252 - Posted: 11 Jan 2005, 16:05:09 UTC - in response to Message 7251.  

&gt; Today I bought some ram cooling sinks, I'll see if that changes anything.
&gt;
Maybe a few of the tips here could help keep your case cooler. There may be better articles out there on how to do it and you could find them with google.

http://www.endpcnoise.com/cgi-bin/e/computercooling.html
ID: 7252 · Report as offensive     Reply Quote
Kenneth Larsen

Send message
Joined: 26 Aug 04
Posts: 59
Credit: 438,133
RAC: 0
Message 7275 - Posted: 13 Jan 2005, 20:32:14 UTC

Finally my computer seems to be running stable and is happily crunching on a cpdn work unit, so far without any errors or crashes. The solution was to mount a cooling fan inside the case to maintain flow throughout, and perhaps also the ram cooling I bought.
To top it all, I just ordered some Arctic Silver thermal paste.

I thank you all for your suggestions and info, it was a great help! :-D
Proud owner of the CPDN Wow-Mug!
ID: 7275 · Report as offensive     Reply Quote
old_user4607

Send message
Joined: 31 Aug 04
Posts: 6
Credit: 229,431
RAC: 0
Message 7641 - Posted: 26 Jan 2005, 15:47:50 UTC
Last modified: 26 Jan 2005, 15:48:30 UTC

CPDN goes down as the toughest DC project on pc's I've run with the possible exception of Folding. CPU temps run 4 - 5C warmer than the easiest of the lot which seems to be SETI. It also begs to ask the question whether we're contributing to global warming :) Does the model account for it? :)
ID: 7641 · Report as offensive     Reply Quote

Message boards : Number crunching : Why can\'t my pc crunch cpdn wus?

©2024 cpdn.org