climateprediction.net (CPDN) home page
Thread 'New unit started without finishing old one.'

Thread 'New unit started without finishing old one.'

Questions and Answers : Windows : New unit started without finishing old one.
Message board moderation

To post messages, you must log in.

AuthorMessage
old_user89315

Send message
Joined: 22 Jul 05
Posts: 5
Credit: 17,368
RAC: 0
Message 14755 - Posted: 30 Jul 2005, 13:53:44 UTC

I got this in my Messages:

7/29/2005 9:10:13 AM|climateprediction.net|Started upload of 137o_100071351_1_2.zip
7/29/2005 9:10:16 AM|climateprediction.net|Finished upload of 137o_100071351_1_2.zip
7/29/2005 9:10:16 AM|climateprediction.net|Throughput 68224 bytes/sec
7/29/2005 9:10:21 AM|climateprediction.net|Deferring communication with project for 29 minutes and 42 seconds
7/29/2005 9:40:05 AM|climateprediction.net|Sending scheduler request to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi
7/29/2005 9:40:05 AM|climateprediction.net|Requesting 345600 seconds of work, returning 1 results
7/29/2005 9:40:08 AM|climateprediction.net|Scheduler request to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi succeeded
7/29/2005 9:40:09 AM|climateprediction.net|Started download of 14nd_200073230.zip
7/29/2005 9:40:10 AM|climateprediction.net|Finished download of 14nd_200073230.zip
7/29/2005 9:40:10 AM|climateprediction.net|Throughput 20120 bytes/sec


Now, there\'s no way I completed the 137o_... unit since I\'d only done about 20 hours of work on it. But that unit is no longer on my machine and it is now working on the 14nd_... unit.

Any idea why this would happen? The 137o_... unit was the first unit I\'ve worked on in this project. I\'d really like to provide complete results to the project since I\'m sure that\'s much more helpful.

Is there some reason this unit might have been removed from my queue? Maybe someone else had already completed it?

Both units would take about 550 hours of computation on my machine, and since I have 3 other projects, all with 100 share, it should take roughly 3 months for me to complete a unit.

Pete

Thanks.
ID: 14755 · Report as offensive     Reply Quote
old_user1
Avatar

Send message
Joined: 5 Aug 04
Posts: 907
Credit: 299,864
RAC: 0
Message 14758 - Posted: 30 Jul 2005, 14:28:59 UTC

it's just getting a new one to have "on hand" -- it's probably not actually started it. it's not like the "old" CPDN windows client which only downloads right at the end, a few trickles (or more) before the end it may get a new workunit just to have "ready."
ID: 14758 · Report as offensive     Reply Quote
crandles
Volunteer moderator

Send message
Joined: 16 Oct 04
Posts: 692
Credit: 277,679
RAC: 0
Message 14766 - Posted: 30 Jul 2005, 16:54:48 UTC

Afaid Carl is wrong here. It crashed with a -5 error and uploaded. That is a catch-all for computation error so it is not enormously helpful.

You may want to try some stability test or check whether your Antivirus software is known to cause problems.
_______________________________
Visit <a href="http://boinc-doc.net/boinc-wiki/index.php?title=Climateprediction_FAQ">BOINC WIKI</a> for help

And join <a href="http://www.boincsynergy.com/">BOINC Synergy</a> for all the news in one place.
ID: 14766 · Report as offensive     Reply Quote
old_user1
Avatar

Send message
Joined: 5 Aug 04
Posts: 907
Credit: 299,864
RAC: 0
Message 14767 - Posted: 30 Jul 2005, 17:13:34 UTC

oh OK, the old one crashed with error # -5, which is sort of a "catch all" for anything unusual. hopefully this one will be better; sometimes models crash for reasons outside of your control, parameter sets in the workunit that crashes the model, hardware troubles, "uncooperative" antivirus software, etc.

ID: 14767 · Report as offensive     Reply Quote
old_user89315

Send message
Joined: 22 Jul 05
Posts: 5
Credit: 17,368
RAC: 0
Message 14768 - Posted: 30 Jul 2005, 17:29:47 UTC - in response to Message 14766.  

First of all, where are you guys coming up with the "error# -5"? Maybe I'm blind, but I'm not seeing any error messages anywhere.

How might my antivirus software be interfering?

As for the "just getting a new one to have "on hand", no, it started working the new unit and the old unit is gone with probably a bit over 2% completed.


ID: 14768 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 14769 - Posted: 30 Jul 2005, 17:41:51 UTC
Last modified: 30 Jul 2005, 17:48:27 UTC

To see the errors:
Go to your account page, and click on Results.
Then click on the Result ID that you want to see. In this case, 1020931.
This will show you what happened with the model.

For a discussion about AV, see <a href="http://www.climateprediction.net/board/viewtopic.php?t=2895"> this</a> thread.
Basically, some AV programs seem to lock files while they scan them.
If it does this with some of cp's files when cp wants to write to them, cp crashes.
It's a bit like an untied shoelace; if you're running and tread on it, you crash.
Painfully.

ID: 14769 · Report as offensive     Reply Quote
Profilegeophi
Volunteer moderator

Send message
Joined: 7 Aug 04
Posts: 2187
Credit: 64,822,615
RAC: 5,275
Message 14771 - Posted: 30 Jul 2005, 17:58:55 UTC - in response to Message 14768.  

&gt; First of all, where are you guys coming up with the "error# -5"? Maybe I'm
&gt; blind, but I'm not seeing any error messages anywhere.
&gt;
&gt; How might my antivirus software be interfering?

You can see the error message for the lost work unit at:

<a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=1020931">http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=1020931</a>

and it was probably in the list of messages in the messages tab of the GUI after the error as well.

<a href="http://www.free-av.com/">AntiVir</a> has been implicated in the instability of models by a number of users. Other AV software may or may not cause problems given their option to scan every written file. Since CPDN writes a lot, any contention for file access may cause problems. That said...most people have no problems running CPDN with AV software running.
ID: 14771 · Report as offensive     Reply Quote
old_user89315

Send message
Joined: 22 Jul 05
Posts: 5
Credit: 17,368
RAC: 0
Message 14772 - Posted: 30 Jul 2005, 18:01:57 UTC - in response to Message 14769.  

Thank you. That's exactly what happened. My damn auto-virus checking. I've been meaning to disable it... Thanks.
ID: 14772 · Report as offensive     Reply Quote
old_user1
Avatar

Send message
Joined: 5 Aug 04
Posts: 907
Credit: 299,864
RAC: 0
Message 14773 - Posted: 30 Jul 2005, 18:24:53 UTC - in response to Message 14772.  

&gt; Thank you. That's exactly what happened. My damn auto-virus checking. I've
&gt; been meaning to disable it... Thanks.
&gt;

actually that's good to know about the anti-virus checking, as I just wrote a paper on CPDN and it seems we lose 30% of users on the first day! I bet it could be attributable to anti-virus software (which most people would run but we haven't really tested thoroughly).
ID: 14773 · Report as offensive     Reply Quote
old_user89315

Send message
Joined: 22 Jul 05
Posts: 5
Credit: 17,368
RAC: 0
Message 14776 - Posted: 30 Jul 2005, 20:12:32 UTC - in response to Message 14773.  

I'd suspect most people are lost because they think it's going to be several weeks to several months before they get credit for their units. It took a little digging to figure out that wasn't the case. Not a lot, but I'd probably make that aspect very prominent on the web site somewhere without going into the whole trickle explanation. Simply explain that credit is given for every 1/72nd of a unit and then maybe add a link to the longer explanation.

We live in the remote control generation. People want immediate gratification.

Anyway, maybe I'm wrong, but that would be my first inclination.

My anti-virus software only does a full check once a week, and that was the default. I didn't look real hard, but I couldn't figure out how to completely disable the once a week scan or even reschedule it, but I did at least find a place to exclude directories, so I just excluded the boinc directory and the sub directories. That's good enough for now.

Pete
ID: 14776 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 14791 - Posted: 30 Jul 2005, 22:29:56 UTC

Info on credits is in the FAQ <a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/quick_faq.php#l"> here.</a>
A lot of people just don't read FAQs or "About" files.

ID: 14791 · Report as offensive     Reply Quote
old_user89315

Send message
Joined: 22 Jul 05
Posts: 5
Credit: 17,368
RAC: 0
Message 14830 - Posted: 31 Jul 2005, 12:59:02 UTC - in response to Message 14791.  

I know WHERE it is. I've read it. My point is that just a quick explanation should be somewhere very prominent, perhaps on the sign-up page. I suspect a lot of people get it, see 600 hours to complete a unit and say, "Forget that," and never come back.

Anyway, again, just my opinion. I could be wrong.

ID: 14830 · Report as offensive     Reply Quote

Questions and Answers : Windows : New unit started without finishing old one.

©2025 cpdn.org