Message boards : Number crunching : Model stopped, no symptoms
Message board moderation
Author | Message |
---|---|
Send message Joined: 9 Oct 04 Posts: 3 Credit: 18,779,188 RAC: 0 |
A64 X2 4400+, WinXP, boinc 5.2.13 upgraded to 5.4.9 One Run had frequent Exit-zero-status messages. Upgraded boinc as possible solution. After restart, the other Run, at ~81% ran awhile, then stopped. Status shows \"Running\", but no CPU or TS time advancement. Restarted from an hour-old backup. Same symptoms. Suspended the dead-in-the-water Run and started Prime95 Torture test. Both cores fully engaged, one with CPDN, the other with Prime95, so it seems safe to dismiss CPU problems. Any ideas? [Written by astroWX] |
Send message Joined: 5 Aug 04 Posts: 1496 Credit: 95,522,203 RAC: 0 |
Follow-up on the problem. After more attempts, the ~81% Model decided to run. In circles. It was suspended because the download servers were empty and off the air. The ~70% Model continued its exited-zero-status failures then it, too, started running in circles. The machine was shut down. After discovering that Carl\'s dedication got things on track early (good job, Carl!), I called my friend, last evening Pacific Time, and talked her through \"Reset Project\". Her X2 4400+ is no longer cold. There is nothing about the circumstances surrounding the ~81% Model that I understand -- until it started off in circles. It\'s also curious that two 5.08 Models went loopy about the same time, though the Models were separated by ~11%. Further, only the ~70% Model was affected by the exited-zero status problem, which largely accounts for it\'s laggard position. When I\'ve experienced that on my machines, both Models were taken down and restarted at the previous Checkpoints. "We have met the enemy and he is us." -- Pogo Greetings from coastal Washington state, the scenic US Pacific Northwest. |
©2024 cpdn.org