Message boards : Number crunching : 72 days for wah2_sam25?
Message board moderation
Author | Message |
---|---|
Send message Joined: 15 Jan 06 Posts: 637 Credit: 26,751,529 RAC: 653 |
I am running two wah2_sam25's that after 1 day are only about 1.36% complete on my i7-4790 (Win7 64-bit) machine. The estimated completion time is about 72 days. All my other work units are reasonable enough. Can this be correct? They have each errored out on other machines, all running Win10 after a short period of time. Note that they are running in separate BOINC instances, so have different machine numbers. But only six cores total are in use for BOINC, with another supporting a GPU on Folding, and another "free". https://www.cpdn.org/cpdnboinc/result.php?resultid=21383066 https://www.cpdn.org/cpdnboinc/result.php?resultid=21382971 I was assuming that these estimates would correct themselves, but they have not done so thus far. |
Send message Joined: 7 Aug 04 Posts: 2187 Credit: 64,822,615 RAC: 5,275 |
I have 4 running on my Ryzen and the manual estimates are for 25 to 35 days. BOINC estimated 45 to 60 days when the tasks began. |
Send message Joined: 15 Jan 06 Posts: 637 Credit: 26,751,529 RAC: 653 |
Calculating it manually only makes things worse. The longest running one for me is now at 28.5 hours, and 1.442% complete. That works out to 82 days. I expect it is slowing down and is going to get stuck. I will keep it around a while longer. EDIT: But the other one is now at 17.3 hours and 1.595% complete, which works out to 45 days. Who knows what is happening. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Here is one of mine. This was on Linux Mint, running Wine, running a Windows version of BOINC. Your problem is probably due to you having fiddled with your computer by putting 2 BOINCs on it. |
Send message Joined: 15 Jan 06 Posts: 637 Credit: 26,751,529 RAC: 653 |
Your problem is probably due to you having fiddled with your computer by putting 2 BOINCs on it. Maybe. But the wah2_global are estimating 12 to 14 days on either instance. ALSO: I have another machine, an i7-3770 also running Win7 64-bit with only a single BOINC instance. It has several wah2_global currently running (after 3 days) with a total time of around 12 days, as manually calculated. (BOINC estimates 6 days left, or 9 days total). But it also has two wah2_sam25 that have not started yet, but have an estimated time of 73 days 18 hours. There is something different about them. |
Send message Joined: 31 Dec 07 Posts: 1152 Credit: 22,363,583 RAC: 5,022 |
A wah2_sam25, 85 month Wu’s on one of my machines is predicted to take 102days 8 hours! I have 3 of these. That’s a lot of crunching. It’s almost like the old days when Wu’s took 6 – 8 month to complete. I-5 2.6 GHz processor with 4 gb of ram running Win7. My oldest, slowest machine. |
Send message Joined: 13 Jul 18 Posts: 38 Credit: 62,933,508 RAC: 84,702 |
On my slow computers the 85 month tasks (batches 762, 763, 764, 765) will take 80 to 160 days. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
The sam25's (South American), and the "Global" models are 2 totally different things. And the Global's have their own problems, but way off near the end. So much so, that in our talks with the project people, it's felt that they should be killed off, and that research re-thought. A couple of us would like to see them continue, as some may make it. IF we can get some storage space :( SO ... If anyone has any Global models, batches 766-770, that have only just started, or are still waiting, then I'd suggest that they be Aborted now, and save the electricity for the New Look Globals, Coming Soon to a Download Server near you. :) |
Send message Joined: 15 Jan 06 Posts: 637 Credit: 26,751,529 RAC: 653 |
SO ... I was hoping to hear something like that. It is getting near Thanksgiving here in the U.S., so I can get out my axe. |
Send message Joined: 18 Jul 13 Posts: 438 Credit: 25,805,483 RAC: 8,941 |
I've got two 766s at 11% in 2 days... so I guess the end should come rather planned and soon than in few days right? |
Send message Joined: 15 Jan 06 Posts: 637 Credit: 26,751,529 RAC: 653 |
I've got two 766s at 11% in 2 days... 2/0.11 = 18 days. That is not too bad, all things considered. I would hope that the scientists would let us know about unusual work units (size, length, etc.) so we will know that we don't have a machine problem of some sort or a bad work unit. Otherwise, they might get aborted unnecessarily. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
I've got two 766s at 11% in 2 days... so I guess the end should come rather planned and soon than in few days right? Yes, you may as well dump them. ***************** On the bright side, for me: I've just uploaded a set of zip 144 and the restarts, (which are a little over 40 Megs), so one set of zippity dodahs to go. Which should be in about an hour and a half from now. Now for the 2nd computer. |
Send message Joined: 31 Dec 07 Posts: 1152 Credit: 22,363,583 RAC: 5,022 |
My problem is that I have 4 globals on 2 different machines that are more than 11 days in and have 19 days to go. Abort or not, that is the question. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
I'd be inclined to hang in there. My 3 on this computer finished OK, and are currently uploading. This will be one of them in an hour or so, after the uploads and Reporting. About an hour under 16 days, but that's with no other projects. |
Send message Joined: 31 Dec 07 Posts: 1152 Credit: 22,363,583 RAC: 5,022 |
I think I will stick with them and hope. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
And now there's a new problem: 21/11/2018 12:39:49 PM | climateprediction.net | Server can't open database Hopefully that's the last of the problems. Except for repeating the whole cycle again in a few terabytes. |
Send message Joined: 31 Dec 07 Posts: 1152 Credit: 22,363,583 RAC: 5,022 |
[quote]SO ... I see that the global models from batches 766 – 770 are still in the hopper. I found that I had downloaded 4 more of them overnight. They were all _0 so they were first timers. They went to model heaven. |
Send message Joined: 18 Jul 13 Posts: 438 Credit: 25,805,483 RAC: 8,941 |
I got one 766 and it is at 30% after six days, created on 4 Nov and this is its 2nd attempt. I kind of want to give it a chance, though Les suggested to abort them. and I have two sam25s at 10% on 10th day. Expected to finish in 100 days ;) I've reduced the load to 2 full cores so let's see if I get some gain on the sam25s |
Send message Joined: 18 Jul 13 Posts: 438 Credit: 25,805,483 RAC: 8,941 |
and I have two sam25s at 10% on 10th day. Expected to finish in 100 days ;) I've reduced the load to 2 full cores so let's see if I get some gain on the sam25s No real difference btw 2 or 4 cores on my i5-2520M sec/Ts almost the same. And I still keep one global from 766 now at 61% after 14 days I will just leave it until it crashes as expected. |
Send message Joined: 31 Aug 04 Posts: 391 Credit: 219,896,461 RAC: 649 |
Yeah -- those long-running sam models look like taking 6-12 weeks on a typical machine running 24/7 per core but less problems than some other long-runners - Eh? I've got a few, will let them run until they die (don't think likely) and I have two sam25s at 10% on 10th day. Expected to finish in 100 days ;) I've reduced the load to 2 full cores so let's see if I get some gain on the sam25s |
©2024 cpdn.org