Message boards : Number crunching : Not complete - running but not using CPU
Message board moderation
Author | Message |
---|---|
Send message Joined: 31 Aug 04 Posts: 391 Credit: 219,896,461 RAC: 649 |
Had 3 models last 2 weeks that got near end and then stopped running -showed running - BOINC manager showed as running but task manager showed not using any CPU. Graphics display stayed stuck at same point for a day or two (real days -- model days never progressed.) After a day or two killed all 3. Two of them at 99% plus one in the 80% range. Seemed weird that tasks would show running in BOINC but graphics screen showed same point for days and system showed no CPU usage. Killed the jobs and going on - but strange strange strange. |
Send message Joined: 17 Nov 07 Posts: 142 Credit: 4,271,370 RAC: 0 |
I've had that too, with one model. A system reboot fixed it, allowed it to carry on and finish successfully. (I needed to apply some updates anyway...) |
Send message Joined: 31 Aug 04 Posts: 391 Credit: 219,896,461 RAC: 649 |
Unfortunately the ones I had were not helped by reboot. |
Send message Joined: 13 May 12 Posts: 2 Credit: 191,869 RAC: 0 |
I've got one I think is doing the same - stuck at the same 25.193% for about 10 days. After each reboot I lose "elapsed time"; last night it had done 156 hours and this morning it had done 152 hours. After reading this thread I see it doesn't show up in top, so I've killed the job. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Ah, the dreaded "Failure at zip time" problem. |
Send message Joined: 13 May 12 Posts: 2 Credit: 191,869 RAC: 0 |
Sorry Les, I'm not familiar with that. Could you send me a link to an explanation? |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
There are posts all through this board about how touchy this type of model is to being interrupted, especially at all of the 25% points. And then there's a few that complete, and then just run in a short loop just past the finish point. There's a short, fairly recent, thread here about the former, and one about the later here. |
©2024 cpdn.org