Message boards : Number crunching : New work discussion - 2
Message board moderation
Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 . . . 42 · Next
Author | Message |
---|---|
Send message Joined: 1 Jan 07 Posts: 1061 Credit: 36,708,278 RAC: 9,361 |
Fair enough. It's useful to set the <cpu_sched> event log flag, so you can see exactly what, and how much, is being requested. |
Send message Joined: 29 Oct 17 Posts: 1049 Credit: 16,432,494 RAC: 17,331 |
I was playing with the event log options. <work_fetch_debug> gave alot more information about whether projects could or couldn't request work. I wonder whether one approach would be to run two clients on the same host. One solely for the CPDN projects and another for everything else. That way, the CPDN client would always be free. Might depend on whether the server was smart enough to spot that a single hostname & IP was running more than 1 client and still treat them as one. Still, might be worth a try and straightforward to do. Probably been thought of before. |
Send message Joined: 1 Jan 07 Posts: 1061 Credit: 36,708,278 RAC: 9,361 |
Probably been thought of before.Yes, it has. I have a working installation for reference and occasional testing, but that's under Windows. Trying to get two different Linux services running together would be a different kettle of fish. I can send you my sample files, and explain why the various elements are needed, if it would help. |
Send message Joined: 15 Jan 06 Posts: 637 Credit: 26,751,529 RAC: 653 |
Trying to get two different Linux services running together would be a different kettle of fish. I can send you my sample files, and explain why the various elements are needed, if it would help.Are you referring to two BOINC instances? I do it all the time on Ubuntu. I find it easier to set up than on Windows, or at least to start up both instances. https://www.overclock.net/threads/guide-setting-up-multiple-boinc-instances.1628924/ Then, I use BoincTasks to manage both instances (on different ports) on Ubuntu machines from my main Windows machine. I have used it when one BOINC instance is on CPDN, and the other instance on another project. I have not tried both instances on CPDN, but it probably will work. (I had a problem on one project a few years ago when both instances were on the same project, but not recently. I think the current BOINC server version works OK.) |
Send message Joined: 29 Oct 17 Posts: 1049 Credit: 16,432,494 RAC: 17,331 |
Are you referring to two BOINC instances? I do it all the time on Ubuntu. I find it easier to set up than on Windows, or at least to start up both instances.I see I'm playing catchup here, thanks for the input. It was straightforward to get two client instances running, one specifically for CPDN which will make it easier when debugging issues with OpenIFS. Getting correct systemctl startup files for both clients & boincmgr working took some time but it all works now. |
Send message Joined: 15 May 09 Posts: 4540 Credit: 19,024,725 RAC: 20,592 |
I would love to see the CPDN server logs. There were 25 linux tasks in the 'dev' queue yesterday, but despite suspending all my projects/tasks except cpdn on my linux box and pinging the server every minute, it refused to give me any. User geophi seemed the only one getting them. Then I booted up my WSL boinc instance which only has cpdn & cpdn-dev as projects and it sent me one. So I can't help but wonder if the server gets jealous of other projects :) Maybe someone who understands the server scheduling algorithm might know what's going on.If you are talking about the hadcm3s tasks, those were Mac only, the linux ones having been deprecated. |
Send message Joined: 29 Oct 17 Posts: 1049 Credit: 16,432,494 RAC: 17,331 |
They were HADsm, the slab ocean version which is Linux. I know the coupled model is Mac. |
Send message Joined: 9 Oct 20 Posts: 690 Credit: 4,391,754 RAC: 6,918 |
That's your problem. If it was the main CPDN project (not the dev site), the server asks you to wait for 1 hour plus 1% - 3636 seconds - between updates.Why the 1%? |
Send message Joined: 9 Oct 20 Posts: 690 Credit: 4,391,754 RAC: 6,918 |
The plan I believe is to run these experiments end Oct/start Nov. Which is when I hope I'll have some tests for the multi-core, high-resolution models ready to go too.Does that include virtualbox ones for Windows? |
Send message Joined: 1 Jan 07 Posts: 1061 Credit: 36,708,278 RAC: 9,361 |
Just in case there are minor discrepancies - e.g. rounding errors - between the clocks running on the server and the user's home computer. All clocks are slightly different, but are re-synced periodically to an authoritative time server. Which is itself periodically adjusted with leap seconds. You wouldn't want work fetch to be denied because of those discrepancies.That's your problem. If it was the main CPDN project (not the dev site), the server asks you to wait for 1 hour plus 1% - 3636 seconds - between updates.Why the 1%? |
Send message Joined: 1 Jan 07 Posts: 1061 Credit: 36,708,278 RAC: 9,361 |
BTW, while we're here - which exactly is the current dev site? Over the years, we've used several: I must have accounts on all of them... |
Send message Joined: 9 Oct 20 Posts: 690 Credit: 4,391,754 RAC: 6,918 |
Ah, I assumed "wait 1 hour" meant "wait 1 hour from your current time on your clock". If I told you to meet me in an hour, you'd meet me 1 hour later by your own watch, I haven't even told you what my watch reads.Just in case there are minor discrepancies - e.g. rounding errors - between the clocks running on the server and the user's home computer. All clocks are slightly different, but are re-synced periodically to an authoritative time server. Which is itself periodically adjusted with leap seconds. You wouldn't want work fetch to be denied because of those discrepancies.That's your problem. If it was the main CPDN project (not the dev site), the server asks you to wait for 1 hour plus 1% - 3636 seconds - between updates.Why the 1%? |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Richard Sent a PM. |
Send message Joined: 9 Oct 20 Posts: 690 Credit: 4,391,754 RAC: 6,918 |
RichardYou remind me of colleagues who would stop me in the corridor and ask if I'd got their email. Surely he'll see the PM just as likely as seeing this message? |
Send message Joined: 1 Jan 07 Posts: 1061 Credit: 36,708,278 RAC: 9,361 |
Yes, saw the email notification of the PM before I saw the reply here, but some projects have broken mail servers and the web notification is very discrete - I sometimes miss them. Anyway, I did already have an account, and I'm back in - ready for those IFS tasks. I've also upgraded the memory on my Linux machines to 64 GB / 32 GB. |
Send message Joined: 15 May 09 Posts: 4540 Credit: 19,024,725 RAC: 20,592 |
They were HADsm, the slab ocean version which is Linux. I know the coupled model is Mac. Was away so didn't even notice them and they must have been short enough that there was nothing about them under users in last 24 hours on server status page last night. |
Send message Joined: 6 Jul 06 Posts: 147 Credit: 3,615,496 RAC: 420 |
I would like to join the cpdnboinc-dev project to help out but it appears to need an invitation code and it's not mentioned anywhere that I could see. I think I tried a long while back but could not get a leg in, but memory a bit fuzzy about that. My Linux computer was upgraded a while back to 64 GB RAM (for other BOINC projects requiring 1 or more GB of RAM per work unit, with 32 GB and 24 threads I was maxing out my memory). An invitation would be nice but perhaps they might have enough testers? So I might still not get in. Thanks Conan |
Send message Joined: 9 Oct 20 Posts: 690 Credit: 4,391,754 RAC: 6,918 |
Me too, but I can only provide 8 Windows machines. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Sorry Conan, the Dev site has very restricted access. |
Send message Joined: 29 Oct 17 Posts: 1049 Credit: 16,432,494 RAC: 17,331 |
Sorry Conan, the Dev site has very restricted access.It's also quite oversubscribed, I rarely get dev tasks. There's also no credit and the risk of getting misconfigired workunits that can disrupt the client (eg wrong memory settings) |
©2024 cpdn.org