Message boards : Number crunching : Africa v7.22 Errors
Author | Message |
---|---|
Joined: 29 May 08 Posts: 128 Credit: 6,289,876 RAC: 0 |
Getting a few errors on the new Africa batch. Wingmen are having problems, too. Typical WU is 9428038. Stderr output: <core_client_version>7.2.42</core_client_version> <![CDATA[ <message> couldn't start app: CreateProcess() failed - The system cannot find the file specified. (0x2) </message> ]]> |
Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Finally got some processors free, and got 4 Africa's. 3 crashed at 6 seconds with "INITTIME: Atmosphere basis time mismatch", which is a data file mismatch. I've gone back to EUs for replacements. |
Joined: 29 May 08 Posts: 128 Credit: 6,289,876 RAC: 0 |
Les Bayliss wrote: Finally got some processors free, and got 4 Africa's. I haven't had any further problems with the AFRs I've got running now. Les Bayliss also wrote: ...I've gone back to EUs for replacements. Before I topped off with AFRs, I wasn't able to get any EUs (see my post here). Are those for Linux only? |
Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Are those for Linux only? I think that they may be. The applications page shows Windows and Mac as being version 6.09 from 23 Mar 2011, and Linux as 7.23 from 11 Dec 2014, and the ones that I have are version 7.23 |
Joined: 29 May 08 Posts: 128 Credit: 6,289,876 RAC: 0 |
Les Bayliss wrote: I think that they may be... Thanks, Les, that's what I was afraid of. I saw the new Linux app listed with the older Windows apps, but was hoping there might still be Windows tasks to run. |
Joined: 16 Aug 04 Posts: 156 Credit: 9,035,872 RAC: 2,928 |
These latest _afr_7.22 WUs have worked very well on my Linux box. But now it refuses to download any more, even though the server says it has 4,000+ available. 16-Jan-2015 12:29:43 [climateprediction.net] Sending scheduler request: To fetch work. Huh? |
Joined: 22 Feb 06 Posts: 491 Credit: 31,034,731 RAC: 14,558 |
The current issue looks as if it is Windows and Mac only. Check the applications in the sidebar. |
Joined: 16 Jan 10 Posts: 1084 Credit: 7,826,970 RAC: 5,066 |
The current issue looks as if it is Windows and Mac only. Check the applications in the sidebar. ... It looks like the AFR application was also removed to steer Linux users to the current EU model, which is Linux-only and high priority. |
Joined: 16 Aug 04 Posts: 156 Credit: 9,035,872 RAC: 2,928 |
That is about the silliest thing I've ever seen here. Most of my completed models are reruns of crashed Windows models, rescued, recovered or whatever. Well, there are other things to do, like Dnetc or Asteroids and... |
Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
A lot of the EU models are resends because they're being grabbed by the "set and forget" crowd of Linux users with missing 32-bit libs. Or, as I call them, "serial killers". Their computers are now being concentrated into one model type, and they are about to be targeted en masse for blocking. |
Joined: 16 Aug 04 Posts: 156 Credit: 9,035,872 RAC: 2,928 |
Well, that has nothing to do with _afr_7.22 |
Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
The Africa models have also been Linux only for at least a week now. So, same thing. |
Joined: 16 Aug 04 Posts: 156 Credit: 9,035,872 RAC: 2,928 |
I really wonder if you know what you are talking about Les. Bye bye, there is a big world outside.. |
Joined: 31 Aug 04 Posts: 391 Credit: 219,896,461 RAC: 649 |
Big world, getting warmer. Me, run the models here, despite the various tech problems, because -- Seems good to me to run the models -- Despite the many problems -- that's what science is about -- yeah? Models fail - whatever that means -- Takes time, especially with the Monte Carlo model, to get useful results. I keep on running the CPDN because I think (and have a clue about the statistics, and about the problems with Distributed Computing -- ) I think there's possible value here. repeat I think there's probable value here at CPDN. |
Joined: 31 Dec 07 Posts: 1152 Credit: 22,363,583 RAC: 5,022 |
What's with the sudden increase in batches of Linux only WU's? The researchers may like Linux, but most of the crunchers don't. There may be 30,000 computers attached to this project, but I would be willing to bet that about 25,000 of them are running some form of Windows. If they drive them away by not providing work for Windows users, they (the researchers) may find that they have large numbers of tasks and a very small number of Linux computers to run them on. That will slow down the work. |
Joined: 6 Aug 04 Posts: 264 Credit: 965,476 RAC: 0 |
The only solution is to adopt a Virtual Machine model, like CERN does. Its Scientific Linux programs can run on Windows, Mac OS X and other Linux distros using a "wrapper" built by CERN around VirtualBox. Tullio |
Joined: 16 Jan 10 Posts: 1084 Credit: 7,826,970 RAC: 5,066 |
What's with the sudden increase in batches of Linux only WU's? ... As far as I know, there isn't a change in policy, just some practical scheduling concerns. There are tens of thousands of models for Windows users and Linux users, so no great problem yet. |
Joined: 15 Feb 06 Posts: 137 Credit: 35,337,237 RAC: 12,975 |
Les Bayliss wrote: The Africa models have also been Linux only for at least a week now. So, same thing. So why are they still being downloaded to my Windows computer? However, the AFR models I downloaded at the beginning of January seem to mis-estimate the run time. About halfway through (about 6 trickles out of 12), the original estimated time has already elapsed and the Remaining time actually starts to increase! I've not seen that happen before. |
Joined: 31 Dec 07 Posts: 1152 Credit: 22,363,583 RAC: 5,022 |
However, the AFR models I downloaded at the beginning of January seem to mis-estimate the run time. About halfway through (about 6 trickles out of 12), the original estimated time has already elapsed and the Remaining time actually starts to increase! I've not seen that happen before. Errors in the time remaining counter aren't uncommon. They are also harmless and don't affect the outcome of the tasks. The hadam3p_afr currently running on my machine is going to take about 200 hours to finish. The time remaining counter reads 72 hours and the "elapsed" counter is at 28. It will probably reach "0" at the 50% point. |
Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
So why are they still being downloaded to my Windows computer? Because the available applications for the various models have been changed yet again, now that the newer Mac apps have been removed. |
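
The run-time figures quoted in the thread (28 elapsed, 72 shown as remaining, about 200 in total, all taken here to be hours) can be worked through in a short sketch. It assumes, purely for illustration, that the counter counts down from a fixed initial estimate; the thread does not say how the BOINC client actually computes remaining time, and the function names below are hypothetical.

```python
def countdown_remaining(initial_estimate_h, elapsed_h):
    """Remaining hours if the counter just counts down from a fixed
    initial estimate (an assumed model, not BOINC's documented one)."""
    return max(initial_estimate_h - elapsed_h, 0.0)


def progress_remaining(elapsed_h, fraction_done):
    """Remaining hours extrapolated from the progress made so far."""
    if not 0.0 < fraction_done <= 1.0:
        raise ValueError("fraction_done must be in (0, 1]")
    return elapsed_h / fraction_done - elapsed_h


# Figures quoted in the thread, taken to be hours.
elapsed_h = 28.0
counter_h = 72.0
actual_total_h = 200.0

# Under the countdown assumption, the initial estimate is elapsed + remaining.
implied_initial_h = elapsed_h + counter_h

# Fraction of the real run at which such a counter would hit zero.
zero_point = implied_initial_h / actual_total_h

# For comparison: remaining time extrapolated from actual progress.
fraction_done = elapsed_h / actual_total_h
extrapolated_h = progress_remaining(elapsed_h, fraction_done)

print(f"implied initial estimate: {implied_initial_h:.0f} h")
print(f"counter reaches zero at {zero_point:.0%} of the run")
print(f"progress-based remaining: {extrapolated_h:.0f} h")
```

Under that assumption the implied initial estimate is 100 hours, half the real run time, which is consistent with the counter reaching zero at the 50% point. The progress-based figure shows how far off the countdown is, without affecting the task itself.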
©2024 cpdn.org