|
Questions and Answers : Windows : Comments for \'Generic solutions to models\' sticky
Message board moderation
Previous · 1 · 2 · 3 · 4 · 5 · 6 · Next
Author | Message |
---|---|
Send message Joined: 1 Sep 06 Posts: 11 Credit: 4,627 RAC: 0 |
Thanks for the reply. Yes, it locks solid ... no blue screen, just locks with all windows intact. At first the mouse pointer still moves, but no response even from taskbar and eventually mouse stops responding too. My CPU rarely goes above 50 deg C, so should not be a problem. I have 2Gb, so should not cause the problem, except for faults, but Prime95 runs without error. I may have too many processes/apps running sometimes and will look into this although I should have enough resources. |
![]() ![]() Send message Joined: 29 Sep 04 Posts: 2363 Credit: 14,611,758 RAC: 0 |
Tobie, you have the Intel which only has 256Mb of memory and last contacted the server on 22 Dec. Plus the AMD which has 2Gb memory but doesn\'t seem to have contacted the server since 22 Nov. I can\'t get to your trickle page because the model details are taking far too long to download from the server. Are both computers crunching a model? Do your boinc messages show any problems with trickles? Eg http errors. Are the freeze-ups happening on the AMD? Whichever machine\'s freezing up, I\'d recommend disabling the screensaver if you haven\'t already done so. Cpdn news |
![]() ![]() Send message Joined: 13 Jan 06 Posts: 1498 Credit: 15,613,038 RAC: 0 |
Well spotted, I hadn\'t seen the entry for the AMD since it had left the active hosts list... 1st Nov is the most recent trickle from the AMD, model hadcm3lbm_alxe_25264245_1. It\'s the 11th trickle received, so was sent when the model was in 1931 or 1932. The following is a link to the trickles alone without the other result stuff (graphs and whatnot), it may work quicker when the server is busy. http://climateapps2.oucs.ox.ac.uk/cpdnboinc/trickle.php?resultid=5746291 Could you open up your Boinc manager, and then copy-and-paste in the first (say) 20 or 30 lines into a reply here? It should tell us what model is running and also tell us if it\'s stuck there, or unable to communicate, or what. Also, if you could tell us the %complete from the second tab (work/tasks) that would be handy. I'm a volunteer and my views are my own. News and Announcements and FAQ |
Send message Joined: 1 Sep 06 Posts: 11 Credit: 4,627 RAC: 0 |
Are both computers crunching a model? Do your boinc messages show any problems with trickles? Eg http errors.mo.v, I only recently started CPDN on the Intel and it is all it does when running, so I\'m not sure if the memory will be a problem. I basically decided to start using it to see if I also get problems on the Intel. The AMD freezes up. How do I disable the screensaver? Could you open up your Boinc manager, and then copy-and-paste in the first (say) 20 or 30 lines into a reply here? It should tell us what model is running and also tell us if it\'s stuck there, or unable to communicate, or what. Also, if you could tell us the %complete from the second tab (work/tasks) that would be handy.Mike, I am pasting the messages in below. It is now on 9.420% Thanks for the replies. 10/01/07 13:48:47 PM||Starting BOINC client version 5.4.11 for windows_intelx86 10/01/07 13:48:47 PM||libcurl/7.15.3 OpenSSL/0.9.8a zlib/1.2.3 10/01/07 13:48:47 PM||Data directory: D:\\Program Files\\BOINC 10/01/07 13:48:47 PM||Processor: 2 AuthenticAMD AMD Athlon(tm) 64 X2 Dual Core Processor 4400+ 10/01/07 13:48:47 PM||Memory: 2.00 GB physical, 3.85 GB virtual 10/01/07 13:48:47 PM||Disk: 111.78 GB total, 3.20 GB free 10/01/07 13:48:47 PM|CPDN Seasonal Attribution Project|URL: http://attribution.cpdn.org/; Computer ID: 12096; location: Work; project prefs: default 10/01/07 13:48:47 PM|proteins@home|URL: http://biology.polytechnique.fr/proteinsathome/; Computer ID: 425; location: work; project prefs: work 10/01/07 13:48:47 PM|rosetta@home|URL: http://boinc.bakerlab.org/rosetta/; Computer ID: 307345; location: Work; project prefs: default 10/01/07 13:48:47 PM|boincsimap|URL: http://boinc.bio.wzw.tum.de/boincsimap/; Computer ID: 41629; location: Work; project prefs: default 10/01/07 13:48:47 PM|climateprediction.net|URL: http://climateprediction.net/; Computer ID: 480116; location: work; project prefs: default 10/01/07 13:48:47 PM|Rectilinear Crossing Numbers|URL: http://dist.ist.tugraz.at/cape5/; Computer ID: 6717; location: home; project prefs: default 10/01/07 13:48:47 PM|Docking@Home|URL: http://docking.utep.edu/; Computer ID: 355; location: Work; project prefs: default 10/01/07 13:48:47 PM|Einstein@Home|URL: http://einstein.phys.uwm.edu/; Computer ID: 740093; location: Work; project prefs: default 10/01/07 13:48:47 PM|Project TANPAKU|URL: http://issofty17.is.noda.tus.ac.jp/; Computer ID: 6792; location: Work; project prefs: default 10/01/07 13:48:47 PM|lhcathome|URL: http://lhcathome.cern.ch/; Computer ID: 1492589; location: work; project prefs: work 10/01/07 13:48:47 PM|Project Neuron|URL: http://neuron.mine.nu/neuron/; Computer ID: 695; location: Work; project prefs: default 10/01/07 13:48:47 PM|Pirates@Home|URL: http://pirates.spy-hill.net/; Computer ID: 25234; location: home; project prefs: default 10/01/07 13:48:47 PM|QMC@HOME|URL: http://qah.uni-muenster.de/; Computer ID: 22080; location: Work; project prefs: default 10/01/07 13:48:47 PM|ralph@home|URL: http://ralph.bakerlab.org/; Computer ID: 4629; location: Work; project prefs: default 10/01/07 13:48:47 PM|RenderFarm@Home|URL: http://server2.povaddict.com.ar/pov/; Computer ID: 768; location: ; project prefs: default 10/01/07 13:48:47 PM|SETI@home|URL: http://setiathome.berkeley.edu/; Computer ID: 2680765; location: Work; project prefs: default 10/01/07 13:48:47 PM|SETI@home Beta Test|URL: http://setiweb.ssl.berkeley.edu/beta/; Computer ID: 9829; location: ; project prefs: default 10/01/07 13:48:47 PM|Spinhenge@home|URL: http://spin.fh-bielefeld.de/; Computer ID: 4554; location: Work; project prefs: default 10/01/07 13:48:47 PM|SZTAKI Desktop Grid|URL: http://szdg.lpds.sztaki.hu/szdg/; Computer ID: not assigned yet; location: ; project prefs: default 10/01/07 13:48:47 PM|malariacontrol.net beta|URL: http://www.malariacontrol.net/; Computer ID: 11158; location: work; project prefs: work 10/01/07 13:48:47 PM|NanoHive@Home|URL: http://www.nanohive-1.org/atHome/; Computer ID: 541; location: work; project prefs: default 10/01/07 13:48:47 PM|uFluids|URL: http://www.ufluids.net/; Computer ID: 31526; location: Work; project prefs: default 10/01/07 13:48:47 PM|World Community Grid|URL: http://www.worldcommunitygrid.org/; Computer ID: 66744; location: work; project prefs: work 10/01/07 13:48:47 PM||General prefs: from Project TANPAKU (last modified 2007-01-06 09:36:33) 10/01/07 13:48:47 PM||General prefs: no separate prefs for Work; using your defaults 10/01/07 13:48:47 PM||Local control only allowed 10/01/07 13:48:47 PM||Listening on port 31416 10/01/07 13:48:47 PM||Suspending network activity - user request 10/01/07 13:48:59 PM|CPDN Seasonal Attribution Project|Deferring task hadam3h_a_075s41_2000_2000_0_0 10/01/07 13:48:59 PM|climateprediction.net|Deferring task hadcm3lbm_alxe_25264245_1 10/01/07 13:48:59 PM|proteins@home|Deferring task 1Ctf_7-106-76_1 10/01/07 13:48:59 PM|proteins@home|Deferring task 1Edk_7-26-11_0 10/01/07 13:48:59 PM|Project TANPAKU|Deferring task pep4_190_247_32_2_0 10/01/07 13:48:59 PM||Resuming computation 10/01/07 13:48:59 PM||Rescheduling CPU: Resuming computation 10/01/07 13:48:59 PM||Resuming network activity 10/01/07 13:48:59 PM||Contacting account manager at http://bam.boincstats.com/ 10/01/07 13:48:59 PM|climateprediction.net|Restarting task hadcm3lbm_alxe_25264245_1 using hadcm3lb version 515 10/01/07 13:48:59 PM|Project TANPAKU|Restarting task pep4_190_247_32_2_0 using BD version 508 10/01/07 13:49:02 PM||Account manager: BAM Host-ID: 8687 10/01/07 13:49:02 PM||Account manager contact succeeded |
![]() ![]() Send message Joined: 13 Jan 06 Posts: 1498 Credit: 15,613,038 RAC: 0 |
256MB is an extremely tight fit even if nothing else is running. For example, antivirus programmes can take 80-100MB... I'm a volunteer and my views are my own. News and Announcements and FAQ |
Send message Joined: 1 Sep 06 Posts: 11 Credit: 4,627 RAC: 0 |
My computer froze up again while running the WU and I noticed, and think I saw it before, the last message in Boinc Manager was: Contacting account manager at http://bam.boincstats.com/ I have now set the time on BAM to 48hours (was 3 hours) and will see if the WU runs smoothly until BAM contact again. |
Send message Joined: 1 Sep 06 Posts: 11 Credit: 4,627 RAC: 0 |
|
![]() Send message Joined: 5 Aug 04 Posts: 1496 Credit: 95,522,203 RAC: 0 |
10/01/07 13:48:47 PM||Disk: 111.78 GB total, 3.20 GB free Noticed that in the Messages you posted, Tobie. Windows is getting starved for HD space and it gets twitchy when that happens. If you aren\'t already getting low disk space messages, you might expect them whenever you start another program. (Nice machine -- but then, I\'m prejudiced.) "We have met the enemy and he is us." -- Pogo Greetings from coastal Washington state, the scenic US Pacific Northwest. |
![]() ![]() Send message Joined: 29 Sep 04 Posts: 2363 Credit: 14,611,758 RAC: 0 |
To disable the screensaver: Right-click on the desktop Click Properties Select Screensaver Select None Anyone finding that the model interferes with normal use of the computer should first disable the screensaver - it\'s easy to do and often helps. Some people have managed to run a model with only 256Mb RAM, but it\'s pushing things as others have said. In this situation I\'d advise frequent backups. Could you give this computer an extra 256? Minimum spec for cpdn is supposed to be 512. Cpdn news |
Send message Joined: 1 Sep 06 Posts: 11 Credit: 4,627 RAC: 0 |
10/01/07 13:48:47 PM||Disk: 111.78 GB total, 3.20 GB free I do get low on space on D (my Windows drive) from time to time and then move some files. Although my Temp dir and Pagefile is on different drives - E and F. In total I have more than 100Gb free. Prejudiced toward Intel (like them too) or Mac (never used them)? |
Send message Joined: 1 Sep 06 Posts: 11 Credit: 4,627 RAC: 0 |
To disable the screensaver: Ok, my screensaver is normally turned off, so I am ok there. My WU trickled last night and it still running. It now seems that the contact with BAM was causing the computer to freeze. Will be sure when it next contacts BAM. I\'ll see what I can do to increase the RAM. Thank you. |
![]() Send message Joined: 5 Aug 04 Posts: 1496 Credit: 95,522,203 RAC: 0 |
Prejudiced toward Intel (like them too) or Mac (never used them)? It was a comment on your A64 X2 4400+. I\'m prejudiced (in favor) because I have three of them, built specifically to run CPDN. (Also have three mixed-bag Intels, from P4 2.8 to Core 2 Duo E6600, also built to run CPDN.) "We have met the enemy and he is us." -- Pogo Greetings from coastal Washington state, the scenic US Pacific Northwest. |
Send message Joined: 1 Sep 06 Posts: 11 Credit: 4,627 RAC: 0 |
Prejudiced toward Intel (like them too) or Mac (never used them)? I see :) I had two Intel and two AMD\'s in turn and from now I\'ll stick to AMD as my main computer. I am now trying to OC ... taking 17 hours for one trickle is too slow. How much free space is minimum to be allowed for CPDN? |
![]() Send message Joined: 5 Aug 04 Posts: 1496 Credit: 95,522,203 RAC: 0 |
We\'re getting rather far OT, but as the question deals with potential errors, though not likely the errors in the Thread heading... I\'m not the best person to ask about that because, having been responsible for system performance in a couple of mainframe shops (decades ago), my tendency is to balance loads and, if that doesn\'t work, throw hardware at the problem --> and not to cut too near the bone. My machines all run two Models and have Folders in the 950 Meg range. (A single Model will be more than half that, even without residuals from errored Runs.) Even if nothing else was ever done in the partition, I\'d keep it above two Gig, just to keep Windoze from getting twitchy about space. (And, like Topsy, stdoutdae.txt grows and grows -- to about 2,049KB before changing its name to stdoutdae.OLD and starting afresh. [It\'s the historical archive of Manager\'s Messages.]) If you\'re asking about the HD parameters in Prefs, mine are set to take whatever is necessary (because I also run Alpha/Beta/Special versions when they come along). My prefs:
None of my machines has a 100 Gig Partition. In fact, two have 10,000 RPM HD and the entire disk is 72 Gig, -- partitioned C: & CPDN, in Windoze; plus Swap/Home/all else, in Linux. (Even so, that\'s excess space because the boxes exist only to run CPDN, though they\'re configured with OpenOffice suite, Firefox, and Thunderbird, so any of them could be used as a general-purpose box in either Windoze or Linux.) Again, I\'m the wrong one to ask about minimum free space, though I hope my rationale is useful. Others will weigh-in with something more specific, I think. Edit: Re: O/C -- some do it and have stable machines, others do it with disastrous results (literally, by cooking something). I\'ve played with it but only by using the percentage increments in the BIOS, never messing manually with voltages. Then, ran at least 24 hours of Prime-95 Torture Test. (Also, I put good memory in my machines and set it to factory-certified specs. Then, run hours of Memtest-86.) By the way, on AMD machines, if you can set RAM timing from 2T to 1T and still have a stable machine, that will help a lot. Only the P4 2.8 is O/C now, by 10%. It was rock-solid at 20% during its 200-year Spinup, but that was done only because time was critical for finishing the Spinups. (We use the Spinup results in these TCM, one of 60 perturbed, plus Arnaud\'s unperturbed Run). "We have met the enemy and he is us." -- Pogo Greetings from coastal Washington state, the scenic US Pacific Northwest. |
Send message Joined: 13 Oct 06 Posts: 60 Credit: 7,893 RAC: 0 |
Just have to get back here to give a big thumbs-up to the project staff. On this reasonably fast box, the model causes no problems and has been running fine since October (I\'m at 17% now, running other BOINC projects too). And I\'m not even especially careful... my model has so far survived everything from gaming nights (using about every hardware-hungry graphics option I could find and a 1600*1200 resolution) to a 20-minute power failure without ever showing signs of a problem. Not that I\'d recommend this kind of behavior, I\'m only using this as an example of the fact that the app copes with difficult circumstances really well in my case. Well, of course my computer is running rock solid for days if it has to and I make regular backups, but the smaller things usually work out okay ;-) Only \"problem\" I\'ve ever had was a few \"zero code\" errors which were easily traced back to time sync and caused no further problems. After having to use one of my backups for the first time (for the reason that my Windows died on me, not problems with CPDN) I can confirm that this is really easy and causes no problems aswell... I didn\'t even use the same BOINC version as before and the model is crunching away happily. So, keep up the great work, guys :-) then I\'m ready to do some more runs. |
![]() Send message Joined: 20 Mar 07 Posts: 8 Credit: 39,126 RAC: 0 |
I had my first crash today while in my first WU and only having 4.5% on the WU. It hapenned when another program froze up. I had to kill that program. As I did up until now I just suspended BOINC and reset the system. Turning it back on i saw 100% indication for the programm and because i was off line just left it like that. When i finally got a connection the program killed itself and in my account i saw the following result: Exit status 1073807364 (0x40010004) <core_client_version>5.8.11</core_client_version> <![CDATA[ <message> - exit code 1073807364 (0x40010004) </message> <stderr_txt> (null): cannot open input file dataout/atmos_restart.day (null): cannot open input file dataout/ocean_restart.day Not a JPEG file: starts with 0x01 0xda CPDN Monitor - Quit request from BOINC... Not a JPEG file: starts with 0x01 0xda Not a JPEG file: starts with 0x01 0xda CPDN Monitor - Quit request from BOINC... Not a JPEG file: starts with 0x01 0xda Not a JPEG file: starts with 0x01 0xda Not a JPEG file: starts with 0x01 0xda Not a JPEG file: starts with 0x01 0xda Not a JPEG file: starts with 0x01 0xda Not a JPEG file: starts with 0x01 0xda And then looped the above for 4-5 times. What kind of an error is this? I read at some point that the project is backing up by itself every 10 project years. But from what i read here this is not the case. Have i got it right or do i have to back up every now and then? And I final one. What can i do with the data of the crashed WU? How can i retieve them and store them (if i\'m asked to share it at some point)? |
![]() ![]() Send message Joined: 9 Jan 07 Posts: 497 Credit: 342,899 RAC: 0 |
Hello Giorgio, I\'m amused by the fact that you want to contribute your \"tickle\" because we in team Scotlad decided some time ago that, while others may trickle, our team tickles! I\'m sure someone who knows a lot more than I do will be along to advise you shortly but meantime, don\'t worry about storing the data from your crashed model (commiserations by the way :-(), these models trickle by sending info to Oxford every year, so the project already has the data from the work done on your model. And no they don\'t back themselves up, you should back up the whole folder at regular intervals - I usually do it weekly but many people do it daily, or even twice a day. You can do it manually or there\'s an autobackup program you can use. See this thread DON\'T USE THE RESET BUTTON unless you want to delete all your models! Good luck! Visit the Scotland team ![]() |
![]() Send message Joined: 5 Aug 04 Posts: 1496 Credit: 95,522,203 RAC: 0 |
Read the first post in this thread re. -107 errors. Also: Les\' comments for Exit Code -1 and -107... here: http://climateapps2.oucs.ox.ac.uk/cpdnboinc/forum_thread.php?id=4710#23372 Thyme Lawn\'s Testing Graphics Compatibility & driver update: http://bbc.cpdn.org/forum_thread.php?id=1038&nowrap=true#3977 It\'s not sufficient to merely suspend a Model before shutdown, it\'s necessary to suspend the Model, then Exit boinc. If suspended, pieces can be left in memory or swap space -- and CPDN files can be out of sync on restart. Your laptop is at the bottom of machines expected to be able to run CPDN. Also, the amount of memory listed suggests that it has an on-board graphics chip, not a graphics card. That could be problematic. Is the machine rather old? If so, it could be overheating from build-up of internal dust, if not recently cleaned out. "We have met the enemy and he is us." -- Pogo Greetings from coastal Washington state, the scenic US Pacific Northwest. |
![]() ![]() Send message Joined: 13 Jan 06 Posts: 1498 Credit: 15,613,038 RAC: 0 |
Hi, ... It hapenned when another program froze up. I had to kill that program. As I did up until now I just suspended BOINC and reset the system. Turning it back on i ... It\'s a good idea to suspend or exit before running something which is graphically intensive (such as a game, video encoding, a TV card...). If a graphical programme crashes it\'ll often kill the model as a side effect. ... The model uploads it\'s climate to the servers as it goes along (a summary every model year, a bigger upload every 10 years, and finally a restart dump every 40 years. But this can\'t be used by you to get the same model going again (the admins can do this from the restart dump, but it would be someone else running it). This is why many people keep a backup - so they themselves can get the model running again. ... Just delete it - anything of use has already been uploaded to the servers. I'm a volunteer and my views are my own. News and Announcements and FAQ |
![]() Send message Joined: 20 Mar 07 Posts: 8 Credit: 39,126 RAC: 0 |
To Strathpeffer Pavilion...
My english is getting worse by the day here in the no-other-language-but-italian land. I wanted to write trickle but tickles work just fine as well...:D astroWX my laptop is 2 years old, the thing is that it\'s been running non stop for 15 months now! (First ACER that survives that kind of beating I suppose) It has probably accumulated a little bit of dust. I will try that. Thanks. MikeMarsUK thanks for the info. It\'s all clear to me now. There is a lot of information circulating in the site and i didn\'t have the time to go through everything. Thanks again. I guess that everything that has to do with the former model and any reference to its name can be deleted. |
©2025 cpdn.org