Message boards : Number crunching : Stuck in 1940...
Author | Message |
---|---|
Joined: 2 May 06 Posts: 17 Credit: 505,526 RAC: 0 |
One of my computers appears to be continually working out the results for 1940. I wasn't watching it too closely, so I'm not 100% sure, but I expected it (on past performance) to be at ~1946-7 by now. It is producing trickle files (trickle_up_hadcm3lbm_bfio_25302595_0_11567….), which upload to the server, but they never appear on the trickle list. The last entry is 22-08-2006, but the computer transferred ~8 files on the 25th and a further 6 today! Can anyone advise? Should I just kill this model? The Scottish BOINC Team Forum |
Joined: 5 Feb 05 Posts: 465 Credit: 1,914,189 RAC: 0 |
Keep going. It looks like it's one of those that reports, but the credits get only posted every so often. If you look at the trickles of the model, several were posted on one day, then several more posted on another day. They will post, eventually. |
Joined: 2 May 06 Posts: 17 Credit: 505,526 RAC: 0 |
Keep going. It looks like it's one of those that reports, but the credits get only posted every so often. If you look at the trickles of the model, several were posted on one day, then several more posted on another day. They will post, eventually. Sorry, I probably should have said that this machine only gets connected every so often, normally fortnightly. All the trickles that are showing appeared within five minutes of being uploaded. My actual worry is that the last (successful) trickle is for 1939, and I'm still calculating 1940 now, a week later. Previously it was calculating just under one year per day... The Scottish BOINC Team Forum |
Joined: 5 Feb 05 Posts: 465 Credit: 1,914,189 RAC: 0 |
Sometimes the Trickle server gets behind. It's been out for several days at a time, a few times. Do not worry, just watch. It will all come through at some point. |
Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
The model may have hit a snag, and the program is doing a rewind to try to correct it. First a day, and retry. Then a month, and retry. And finally a year, and retry. If the problem is still there, then the model is supposed to abort, but there was a problem with some of them, and they didn't. They just keep going around the same months/year, endlessly. These are called "looping" models. Keep an eye on the date at close intervals, and if you see the same dates being repeated, then click the Abort button. And better luck with your next model. |
Joined: 2 May 06 Posts: 17 Credit: 505,526 RAC: 0 |
The model may have hit a snag, and the program is doing a rewind to try and correct it. Well, I watched. I watched it reset three times to Dec 1st, 1939 and consistently freeze on September 15th, 1940. The only way to get it to shift from 00:30 15:09:1940 was to exit and restart. :( The plug has now been pulled and the wu laid to rest. R.I.P. hadcm3lbm_bfio_25302595 The Scottish BOINC Team Forum |
Joined: 5 Feb 05 Posts: 465 Credit: 1,914,189 RAC: 0 |
Sorry to see you had one of the bad WU loops. It is OK, you got quite a lot of work done on that unit, and it will help the project. As long as 10 or more years are done, good information gets into the project. I hope this doesn't discourage you from doing more. They fixed the application (5.15), which will now detect that issue and automatically abort the model. The application you were using just looped, and locked as you saw. I am glad you have figured this out. Some people crunch and do not notice this phenomenon for a long time, and have just wasted CPU cycles for a while. Good job on it! |
Joined: 3 Sep 04 Posts: 126 Credit: 26,610,380 RAC: 3,377 |
My workunit got in a loop at 53%, so I aborted it. But now it looks like it will be sent to someone else: http://climateapps2.oucs.ox.ac.uk/cpdnboinc/workunit.php?wuid=5086830 |
Joined: 5 Feb 05 Posts: 465 Credit: 1,914,189 RAC: 0 |
My workunit got in a loop at 53%, so I aborted it. But now it looks like it will be sent to someone else: http://climateapps2.oucs.ox.ac.uk/cpdnboinc/workunit.php?wuid=5086830 With the new application, if it gets stuck for the new person, it will automatically abort and flag the unit. The new application is also better at not allowing them to loop. So, it is fine that it is going to another person to crunch. |
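
For anyone who would rather not stare at the screen: the advice in the thread (watch the model date, and abort if the same dates keep coming round) can be sketched in a few lines of Python. This is only an illustration, not how the CPDN application itself detects loops. The function name `looks_stuck`, the `max_rewinds` threshold of 3, and the idea of noting down model dates by hand are all assumptions made for the example; the dates used are the ones reported above.

```python
from datetime import date


def looks_stuck(observed, max_rewinds=3):
    """Return True if the model date jumped backwards max_rewinds times
    without ever getting past the furthest date seen so far.

    observed: model dates in the order they were noted down (hypothetical input).
    max_rewinds: assumed threshold, not a value taken from the CPDN application.
    """
    if not observed:
        return False
    furthest = observed[0]
    rewinds = 0
    for prev, cur in zip(observed, observed[1:]):
        if cur < prev:
            # The model date went backwards: count one rewind.
            rewinds += 1
            if rewinds >= max_rewinds:
                return True
        elif cur > furthest:
            # Genuine progress past the old high-water mark clears the count.
            furthest = cur
            rewinds = 0
    return False


# Dates as described in the thread: three resets to 1 Dec 1939,
# each time reaching 15 Sep 1940 and no further.
looping = [
    date(1940, 9, 15), date(1939, 12, 1),
    date(1940, 9, 15), date(1939, 12, 1),
    date(1940, 9, 15), date(1939, 12, 1),
]
healthy = [date(1939, 12, 1), date(1940, 6, 1), date(1941, 1, 1)]

print(looks_stuck(looping))
print(looks_stuck(healthy))
```

A single rewind followed by real progress does not trip the check, which matches the description of the normal day/month/year retry; only repeated rewinds with no new ground covered are flagged.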
©2024 cpdn.org