Message boards : Number crunching : Climate prediction using twice CPU allocation
Message board moderation
Author | Message |
---|---|
Send message Joined: 11 Dec 05 Posts: 4 Credit: 383,991 RAC: 0 |
BOINC setting 50% Climate & 50% SETI But Climate running High Priority in both 50%s SETI has been stoped and waiting to run Problem has been around for over a month and new Climate jobs continue to displace SETI jobs How do I fix so there is proper sharing? |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Resource share is only an average over a long period of time, say 6 months. There can be several reasons why the climate models are constantly running in high priority, but the general reason is that BOINC currently considers that it's not going to have enough time to complete the 2 long models that you have before the deadline. (It doesn't know that there's no actual deadline with this project.) Some things to look at, are the settings that you have for the cache, and the amount of time that you run the computer. The cache settings are Computer is connected to the Internet about every, and Maintain enough work for an additional. These should be kept low for this project so that you don't get too many models downloaded. One quick fix may be to make sure that you have the project set to No new tasks in the Projects tab, and then to Suspend one of the models, and keep it for later. Backups: Here |
Send message Joined: 19 Apr 08 Posts: 179 Credit: 4,306,992 RAC: 0 |
Resource share is only an average over a long period of time, say 6 months. No, it won't take that long for two projects to reach equilibrium. Ed, the main reason you're seeing this behavior is you're running two hadcm3n's (coupled, full resolution ocean's) on a single-core, hyper-threaded Pentium 4. Your seven seconds per timestep is forcing BOINC to run them in high priority, in an effort to finish them before the four-month deadline. You could turn off hyper-threading, which should lower your s/TS by around 40%, but this would limit your processing to one task at a time and somewhat diminish the machine's multi-tasking performance with other applications. Probably the best solution is to avoid hadcm3n's, changing your climateprediction.net preferences to select only the hadam3p models, of which there are plenty right now :) edit: grammar |
Send message Joined: 19 Apr 08 Posts: 179 Credit: 4,306,992 RAC: 0 |
... with regard to your computation errors, you may want to exclude your anti-virus from scanning the BOINC data directory. This is just my stab in the dark, but it's a fairly common reason for errors mid-task on Windows machines. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
6 months isn't about reaching equilibrium, it's about the mental attitude that people need to have when talking about "resource share", when running a mix of WUs, ranging from those project's that have WUs of a few minute/hours, to the very long climate models that take months. Backups: Here |
Send message Joined: 19 Apr 08 Posts: 179 Credit: 4,306,992 RAC: 0 |
I'm running a 1000 hour hadcm3n alongside 8-12 hour WCG tasks on my dual-core laptop right now, and each project gets 50% of CPU time--a core each. There just isn't enough oomph in the OP's machine to crunch one long task requiring 2016 hours (7s/TS) within seventeen weeks of shared time (2856 hours * 50% = 1428 hours). Another solution would be to change the resource allocation to 25% SETI, 75% CPDN, then 2856 * 75% = 2142 would just make the squeeze, but the machine would need to be on 24/7. Edit: by the way my first solution of turning off hyper-threading will not produce the desired sharing, since that faster s/TS is now only given half the time. I wasn't thinking. |
Send message Joined: 19 Apr 08 Posts: 179 Credit: 4,306,992 RAC: 0 |
Hold on, I messed up. The hyper-threading of course allows you to run two tasks at the same time within BOINC so the 7s/TS does fit within the 2856 hours available. But you're downloading two hadcm3n at a time so that is what's cutting your available time in half Best solution still is to stick to the hadam3p's. Edited because it's Saturday. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
|
Send message Joined: 11 Dec 05 Posts: 4 Credit: 383,991 RAC: 0 |
Thanks for all your comments. Unfortunately I don't have enough background to understand many of them. I shall try the route of "no new jobs" and "suspend" one of the pair till the first is finished. Isn't automatic, but run time is long enough it isn't a burden. |
Send message Joined: 11 Dec 05 Posts: 4 Credit: 383,991 RAC: 0 |
Forgot to mention computer is running 24/7 |
Send message Joined: 19 Apr 08 Posts: 179 Credit: 4,306,992 RAC: 0 |
Hi Ed, the crux of the problem is the downloading of two hadcm3n's. It would be great if BOINC would download one task at a time, but this is a shortcoming with BOINC. Even if you set your resource share to 99/1% and your work buffer to 0.1 days, as soon as BOINC requests work for the 1% project it will download the same number of tasks as cores in your machine (real or virtual). BOINC was originally developed at a time when dual-core machines were a dream, and I think the difficulty lies not in rewriting the client, but rather the hundreds of custom server applications and related database interactions. A workaround is to manually request work: suspend network activity in BOINC, change the number of processors BOINC uses to one (50% for dual-core, 25% for quad-core, etc.), activate the network and update the project for which you need work, then suspend network activity again and set processors back to 100%. Yes, a pain. Selecting only hadam3p's is the long-term answer (they're about one-fifth as long), but short-term you could suspend SETI and slog through the two hadcm3n's. The aliens can wait ;) |
©2024 cpdn.org