Message boards : Number crunching : Iceworlds & Slowdowns hadsm3/mh - Closed - Discussion
Message board moderation
Previous · 1 . . . 10 · 11 · 12 · 13 · 14 · 15 · Next
Author | Message |
---|---|
Send message Joined: 6 Sep 04 Posts: 12 Credit: 1,767,663 RAC: 0 |
I think I have eight of these bad boys: 7714560 7714126 7703437 7703654 7703204 7703491 7714497 7703217 Should I abort them? |
Send message Joined: 6 Aug 04 Posts: 195 Credit: 28,405,498 RAC: 10,268 |
They all appear to be very slow processing, some for up to 3 months. Presumably they are also blue ice-worlds? Your computers have succesfully run other models. I\'d suggest you abort the slow running models. Good luck with the new ones. |
Send message Joined: 6 Sep 04 Posts: 12 Credit: 1,767,663 RAC: 0 |
They all appear to be very slow processing, some for up to 3 months. Presumably they are also blue ice-worlds? Your computers have succesfully run other models. I\'d suggest you abort the slow running models. Good luck with the new ones. They do all display totally Blue on graphics. |
Send message Joined: 16 Mar 06 Posts: 28 Credit: 3,219,100 RAC: 0 |
Greetings. I\'ve just spotted a couple of very slow-running models. I should probably abort - can someone please advise... 1. http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=7673807 Current timestep = 49417 of 259248 s/TS = 56.6 2. http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=7731951 Current timestep = 39775 of 259248 s/TS = 3.64 Neither are blue / ice worlds, but the to-completion time is rising, so maybe they\'re turning blue... It\'s an Intel machine runnig WinXP, not overclocked and has, prior to these two models, turned in a few complete runs. Thanks - Richard |
Send message Joined: 7 Aug 04 Posts: 2187 Credit: 64,822,615 RAC: 5,275 |
1. http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=7673807 #1 looks like it has gone back to the beginning of the model (without resetting the CPU time), so it\'s not an iceworld. Occasionally this happens if Windows is shutdown or rebooted while the model is running. But you won\'t get any more trickles or credits until it reaches the point it last trickled. It\'s up to you whether you want to abort it, or run it to the end. The workunit #2 is in has other models completed by other Windows/Intel computers, and the s/TS are not rising, so probably isn\'t an iceworld. I\'m not sure why it is not making progress, or not trickling. |
Send message Joined: 16 Mar 06 Posts: 28 Credit: 3,219,100 RAC: 0 |
Greetings, again. Further to the post just above, I suspended those two tasks, and the waiting task that started up is behaving the same... a rising to-completion number from the outset. Can someone help me fathom this out? Might it be because this m/c is running with quite a full hard drive - 10Gb free out of 70. This is a 2-cpu m/c and BOINC hasn\'t downloaded a second task to me. I found a BOINC discussion thread (http://einstein.phys.uwm.edu/forum_thread.php?id=6389) that suggested BOINC knows this m/c is struggling, but I\'m not sure how to go about trouble-shooting... Oh, I also upgraded from 6.4.5 to 6.4.7, so it should be a clean installation here. All help gratefully received - as ever! Many thanks - Richard |
Send message Joined: 9 Jan 07 Posts: 467 Credit: 14,549,176 RAC: 317 |
They all appear to be very slow processing, some for up to 3 months. Presumably they are also blue ice-worlds? Your computers have succesfully run other models. I\'d suggest you abort the slow running models. Good luck with the new ones. While what hagar says is right - there is no point continuing with a slow-processing iceworld - there may be more to it. Quite a few of the models you mention occur in work units where there are Windows/Intel finishers. That suggests to me that there may be machine problems of some sort, rather than an iceworld problem. Here is your list, linked to the output of a crude iceworld detector ... 7714560 7714126 7703437 7703654 7703204 7703491 7714497 7703217 ... there are a lot of singletons and other finishers. One model is a confirmed iceworld, as another model in the same work unit had the same problem (i.e. this one). You may want to try some of the stability tests in the Read Me, particularly the Prime95 test. |
Send message Joined: 3 Sep 08 Posts: 23 Credit: 42,176,240 RAC: 10,608 |
I\'ve recently had to abort two models (on two different computers) that had become iceworlds: WorkUnit 6278634, Task ID 8362432. and WorkUnit 6277746, Task ID 8354443. Both machines are Intel CPUs, Windows XP SP3. Temperature graphics on both were blue, with no clouds except around the Antarctic coast. Time to completion was rising instead of falling, and models were no longer uploading trickles. -- Jim |
Send message Joined: 31 Aug 04 Posts: 145 Credit: 2,080,724 RAC: 753 |
http://climateapps2.oucs.ox.ac.uk/cpdnboinc/workunit.php?wuid=6193392 149066 of 259248 Next, not sure, pressed z and cannot see what you want... what it says is... Atmos model time 13:00 Atmos model date 16:07/2059 Hours elapsed 0418:33:33 (2.26 s/TS) <-- maybe that! Timestep 149066 of 259248 - Phase 3 of 3 Progress 85.83% Not blue Intel Quad under Windows XP Not over clocked. Hope that helps, was running on a system not visited everyday so may have been like that for 4-5 days at least. Best wishes, Adrian. Wave upon wave of demented avengers march cheerfully out of obscurity into the dream. |
Send message Joined: 29 Sep 04 Posts: 2363 Credit: 14,611,758 RAC: 0 |
Jim\'s models both sound like typical iceworlds, so well done for noticing and diagnosing them. Hi Adrian Here\'s your model\'s workunit. Its sec/ts on its task page shows the first sign of the slowdown and it looks as if Eric\'s model from the same workunit, also with Intel & Windows, has suffered the same fate at the same point. So yes to the iceworld diagnosis. Re the graphics display, Z eliminates the overlay. 8 toggles the checkpoint countdown on and off. H gives the help menu. You may need to press T to show the temp view which is where the iceworld blue is apparent. P for pressure will also show a different monochrome colour in the case of an iceworld. Also re diagnosis, the sec/ts displayed in the graphics window is a cumulative average (ie the average since the model started). So in the case of a genuine iceworld that has almost stopped progressing, the sec/ts value shown is not the current very slow speed but it gradually gets worse and worse if the model is allowed to continue. Cpdn news |
Send message Joined: 6 Jul 06 Posts: 147 Credit: 3,615,496 RAC: 420 |
I see you've now aborted that iceworld, Mican. Its whole workunit #6278040 is rather interesting. A model on a machine with Linux crashed at about the same point. The model Hagar mentioned that seems to have speeded up was on a Mac. It crashed. G'Day again mo.v, Well it took a bit of time but it looks like Linux triumphs this time. I have finished the model to completion (took 547 hours). Time steps stayed the same right through and it shows 'Success', I just have to wait for the credits to be granted. Here is the WU 8357087 if you want to check it out. I can't read the data very well so really I have no idea if this was getting close to an iceworld or not, I don't have graphics on Linux enabled. |
Send message Joined: 13 Sep 04 Posts: 60 Credit: 1,124,063 RAC: 0 |
My first iceworld :) The globe is blue. http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=8423384. I'm at TS 253706 (99.47%), doing 1.23s/TS, though in reality for well over 24 hours now it has said 2 hours remaining. I'm using Windows XP SP2, C2D E6750, 2.66GHz, with 2GB of RAM. I don't have any higher-priority processes running; the work unit is taking 50% CPU i.e. one saturated core. Should I let it run (it'll take weeks) or abort? The science results matter more to me than any credits. I notice another user with the same work unit has already aborted at the same point. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
I've had several, and I don't mess around with them. If they go blue, I abort them and get another one. Backups: Here |
Send message Joined: 6 Jul 06 Posts: 147 Credit: 3,615,496 RAC: 420 |
I've had several, and I don't mess around with them. If they go blue, I abort them and get another one. G'Day Les, A question if you will, Why has my Work Unit (WU 8357087) which ran on Linux and completed successfully yet every Windows computer that ran this WU all failed and became Ice Worlds.???? |
Send message Joined: 29 Sep 04 Posts: 2363 Credit: 14,611,758 RAC: 0 |
In the few cases where the cause is computer instability you could correct the computer problem, restore a backup and expect the model to complete normally. In almost all cases the problem lies within the model itself on an Intel/Windows setup. Nothing would make these iceworlds become normal again. The moment the iceworld develops the model stops processing its data properly and there's no point in trying to continue. If another model from that WU running on Intel/Windows has developed an iceworld at the same point you know the fault lies in the model, not in your computer. Cpdn news |
Send message Joined: 29 Sep 04 Posts: 2363 Credit: 14,611,758 RAC: 0 |
My MH task 8424391 has become an iceworld in phase 4. Typical monochrome graphics, but it's atypical in that it's on my AMD and has speeded up by about 30%. Ananas has completed a model from the same WU on Intel/Windows. I'm not happy about this. I'll let the model run in its iceworld condition to complete in about 2 days' time to can see what graphs it produces. I may then restore a week-old backup and run it long enough to see whether it gets past the iceworld moment, which I haven't identified exactly, on a rerun. Cpdn news |
Send message Joined: 29 Sep 08 Posts: 5 Credit: 4,330,352 RAC: 0 |
http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=10450514 172,832 5.0911 Yes Intel, Windows No http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=9673466 140,426 4.3909 Yes Intel, Windows Yes http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=9629486 97,218 5.7759 Yes Intel, Windows Yes http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=9819246 248,446 4.0848 Yes Intel, Windows No |
Send message Joined: 27 Jan 05 Posts: 16 Credit: 790,158 RAC: 0 |
Ice World... http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=10377841 Timestep 149706 of 259248 - Phase 3 of 3 1.27 s/TS The globe graphic is blue. The graphic isn\'t stable though, it flashes triangles and other stuff on the globe. This happens all the time if I display graphics, other units have completed ok. Also won\'t close cleanly. Anything to do with dual processors? Dell Vostro 220, Intel(R) Corel(TM)2 Duo CPU E7300 @ 2.666GHz Microsoft Windows XP Pro Version 2002 Service Pack 3. |
Send message Joined: 6 Aug 04 Posts: 195 Credit: 28,405,498 RAC: 10,268 |
Ice World...No, not dual-processor related, it sounds like a slow-processing Intel/Windows iceworld. Iain Inglis is collecting iceworld data iceworld appeal if you have a backup. Otherwise abort it and get another one. Good luck. |
Send message Joined: 21 Oct 04 Posts: 24 Credit: 207,633 RAC: 0 |
Bad luck, I got an Iceworld after 1 Phase was completed: A link to the model/ResultID webpage - ResultID=10528029 A current timestep of that model (on the globe graphic) - 9799 Phase 2 of 4 (9800 TS in 10 hour ! after finish Phase 1) The s/TS value (on the globe graphic.) - 1.34 s/TS , before it was 1.31 Whether the temperature display of the globe graphic is blue. - yes , blue -42°C What your processor/CPU and Operating System is (i.e. Intel or AMD on Windows or Linux) - GenuineIntel Pentium(R) Dual-Core CPU T4300 @ 2.10GHz [Intel64 Family 6 Model 23 Stepping 10] - Microsoft Windows 7 Home Premium x64 Edition, (06.01.7600.00) Whether you are overclocking. - no I will abort this WU greetz littleBouncer |
©2024 cpdn.org