Questions and Answers :
Unix/Linux :
Running 32-bit MacOS Tasks on Linux with KVM
Message board moderation
Previous · 1 · 2 · 3 · 4 · Next
Author | Message |
---|---|
Send message Joined: 15 May 09 Posts: 4529 Credit: 18,661,594 RAC: 14,529 |
Well, that was discouraging. We had a power cut this morning. Bringing up my linux PC, then the MacOS VM, 3 of the 6 tasks that were running (and 13 model months along) immediately crashed. Still not got it running here and being out of work both with CPDN and WCG have switched to FAH. |
Send message Joined: 7 Sep 16 Posts: 262 Credit: 34,896,361 RAC: 16,952 |
Sounds about right, they're touchy on restore, like most of the rest. I've actually had better luck with just killing the VM entirely than trying to suspend tasks if I need to power cycle something - suspend/resume cycles are likely to kill 40-60% of the running tasks, and a power cut kills fewer. I'm trying not to do that, though - like all the CPDN tasks, they're best left running, non-stop, until completion. I have a bunch of solar powered systems that I just suspend at night (suspend the host, not the guest), and that works fine too - they've been CPDN crunch boxes for a while and are quite reliable in that way. |
Send message Joined: 7 Aug 04 Posts: 2183 Credit: 64,822,615 RAC: 5,275 |
Sounds about right, they're touchy on restore, like most of the rest. If you do do the suspend tasks thing and then exit boinc before shutting the vm down, check the associated slots directories. If there are files listed with a "finished" in the filenames in those directories, delete those files. When the boinc client starts up and sees those files, it will think the task is over and error out. Usually it will have some error listed in stderr like "finish file present too long" and have an exit status of "194 (0x000000C2) EXIT_ABORTED_BY_CLIENT". |
Send message Joined: 7 Sep 16 Posts: 262 Credit: 34,896,361 RAC: 16,952 |
The ones I had error were ones like this: https://www.cpdn.org/result.php?resultid=22194275 <core_client_version>7.16.11</core_client_version> <![CDATA[ <message> process exited with code 193 (0xc1, -63)</message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 3 received, exiting... 22:46:16 (397): called boinc_finish(193) </stderr_txt> ]]> This was with "suspend, shutdown VM, reboot, resume." If I just kill stuff, it doesn't seem to fail, though I've been trying to avoid any interruptions to compute that aren't sleep/resume (which doesn't bother them). |
Send message Joined: 7 Sep 16 Posts: 262 Credit: 34,896,361 RAC: 16,952 |
Well, my attempt to get it running on a 4440S I have laying around wouldn't get past the installer, so that system is still working Einstein units until something else comes along. I did set the "days of work to acquire" values up on some of my boxes, since they're actually completing WUs instead of spitting them out every hour with the CPU type errors, so... hopefully that doesn't bother anyone too much. They'll get chewed eventually, and I've figured out a way to keep my solar boxes running at night with the solar trailer hookup - assuming I've had some decent sun during the day. Should be a bit more steady, on top of the grid tied stuff in the house. Going to tweak one to let it use full hyperthreading as well, should improve throughput at the cost of latency. |
Send message Joined: 7 Jun 17 Posts: 23 Credit: 44,434,789 RAC: 2,600,991 |
Hello. Strange events: I've just set up a qemu/kvm instance of mojave, installed boinc manager, attached to the project and apparabntly successfully downloaded files, (there are four marked as in progress (computer ID: 1528682)). However, Boinc manager is empty and holds the message 'No work available to process'. I am used to managing boinc through boinctui and I wondered it there was a terminal interface available in the mac version so I can see what's going on. Many thanks leloft |
Send message Joined: 15 May 09 Posts: 4529 Credit: 18,661,594 RAC: 14,529 |
Hello. Strange events: I've just set up a qemu/kvm instance of mojave, installed boinc manager, attached to the project and apparabntly successfully downloaded files, (there are four marked as in progress (computer ID: 1528682)). However, Boinc manager is empty and holds the message 'No work available to process'. I am used to managing boinc through boinctui and I wondered it there was a terminal interface available in the mac version so I can see what's going on. boinccmd should work the same as in the Linux version Edit:There may or may not be permissions issues. I have never been close enough to a Mac for long enough but if there are any, anyone capable of getting the VM set up can probably navigate them. |
Send message Joined: 6 Aug 04 Posts: 195 Credit: 28,192,402 RAC: 10,436 |
I've actually had better luck with just killing the VM entirely than trying to suspend tasks if I need to power cycle something - suspend/resume cycles are likely to kill 40-60% of the running tasks, and a power cut kills fewer. I'm trying not to do that, though - like all the CPDN tasks, they're best left running, non-stop, until completion.Today, this Windows host (running the mac mojave VM) froze after excel threw a wobbly. Power cycling the host and restarting the mac VM brought back all four running tasks. That's 100% better result than carefully closing down BOINC and the VM. |
Send message Joined: 7 Sep 16 Posts: 262 Credit: 34,896,361 RAC: 16,952 |
Yeah... there are an awful lot of "suspending" style errors in the workunits I've poked around too. This binary doesn't seem to like suspend/resume, though that's no different than the rest of the CPDN binaries. Best handled by dedicated systems that sleep instead of shut down if needed for power reasons. Looks like the pool of WUs is drying up. I've got my machines stuffed to the gills with as many as they can fit (143 in progress at the moment on a total of 24 cores - I normally try not to hoard WUs, but my VMs are actually running them to completion and producing valid results, unlike quite a few of the machines chewing on them) and will let them run until their individual stockpiles are done, which should be in about a week and a half for the grid tied stuff and a bit longer for the solar stuff (I've got a power trailer right now that will run those machines overnight without too much trouble, so they're doing 24/7 work in my office if I have sufficient sun during the day to recharge the power trailer). It's been an interesting little diversion, for sure! I'll just shut the VMs down and see if there's anything else they can be useful for later - they're all set up and updated, won't be hard to get them back online if anything else comes down the pipeline. Otherwise, it'll be... well, I'm not actually sure what I'll point the cores at now, with WCG down for maintenance and CPDN emptying out yet again. |
Send message Joined: 22 Feb 06 Posts: 490 Credit: 30,766,944 RAC: 10,886 |
"with WCG down for maintenance and CPDN emptying out yet again." Milky Way is being flaky as well at the moment so I've added Einstein on my Linux box. |
Send message Joined: 7 Sep 16 Posts: 262 Credit: 34,896,361 RAC: 16,952 |
Rosetta doesn't have anything good either, something was faulty about their tasks and they're empty on anything that's not "python projects," which I think are VirtualBox only. I might mess with those if I'm bored... I'm more interested in the tasks that are solving practical, real world problems, as opposed to "Finding larger prime numbers" or "Doing some weird corner of math." I'm sure some people find that interesting, but I just can't get excited about them. |
Send message Joined: 5 Aug 04 Posts: 1120 Credit: 17,193,804 RAC: 2,852 |
I'm more interested in the tasks that are solving practical, real world problems, as opposed to "Finding larger prime numbers" or "Doing some weird corner of math." I'm sure some people find that interesting, but I just can't get excited about them. Me too. seti@home is gone, ... The only one I found that is possibly useful is universe@home and it seems to always have work, when their server is not down. On the other hand, they hand out way to much credit for the small amount of work they do. https://universeathome.pl/universe/home.php |
Send message Joined: 7 Jun 17 Posts: 23 Credit: 44,434,789 RAC: 2,600,991 |
boinccmd should work the same as in the Linux version Thanks for the edit. The 'mac' terminal doesn't seem to understand the command 'boinccmd' and I'm just using Boinc manager, but it's complaining that it needs to be reinstalled. I'm not satisfied that the VM route is anything other than a distraction (although an interesting one):, it's very volatile: the machine needed a kernel update and after the reboot, the VM dropped a couple of WU that had been downloaded but not started; they are no longer visible in Boinc manager. I'd like to finish the 8 WU under computation and then close the VM. So here's the questions: how in Boinc manager do I set the equivalent of 'No New Tasks' in boinctui ? and can I abort those two dropped WU from the cpdn site so that they get re-assigned promptly? Or, can anyone direct me to where they might be stored on the 'mac' and how to re-attach them in BM? Many thanks leloft |
Send message Joined: 15 May 09 Posts: 4529 Credit: 18,661,594 RAC: 14,529 |
So here's the questions: how in Boinc manager do I set the equivalent of 'No New Tasks' in boinctui ? In Manager use advanced view then click on ,projects. tab. highlight the project you want to set to no new tasks and click that from the options on the left. |
Send message Joined: 6 Oct 06 Posts: 204 Credit: 7,608,986 RAC: 0 |
Rosetta doesn't have anything good either, something was faulty about their tasks and they're empty on anything that's not "python projects," which I think are VirtualBox only. I might mess with those if I'm bored... I too. If an Asteroid has to hit Earth, then it has to hit. Just now I am running zero CPU tasks. All those WU's out there and only one seems to be answering back. I wish someone would re-issue them. If wishes were horses, then beggars would ride. |
Send message Joined: 7 Sep 16 Posts: 262 Credit: 34,896,361 RAC: 16,952 |
I'm not satisfied that the VM route is anything other than a distraction (although an interesting one):, it's very volatile: the machine needed a kernel update and after the reboot, the VM dropped a couple of WU that had been downloaded but not started; they are no longer visible in Boinc manager. I'm not sure it's fair to call it a distraction, as there are fewer and fewer Intel Macs on Mojave or earlier out there. I've been able to chew quite a few of the 32-bit WUs on my VMs, though I admit I have no idea why some other systems refuse to run them - hasn't been a priority. One of my machines, a 4440S, won't finish the install, so I just keep it doing other things. But as far as I can tell, by the time they drain, my VMs will have successfully chewed ~270 CM3 tasks with only a few errors (a few aborted tasks getting things working when I had colliding machine IDs, a few suspend/resume issues, and then some of the "legitimate errors" where planetary conditions go nonsensical and the model quits). I'd like to finish the 8 WU under computation and then close the VM. So here's the questions: how in Boinc manager do I set the equivalent of 'No New Tasks' in boinctui ? In the "Project" tab, click on the CPDN project and there should be a button on the left, "No New Tasks." That should do what you need. No idea about the missing WUs, I wouldn't worry about one or two weird ones given how many failed due to not being supported by the OS anymore. |
Send message Joined: 7 Aug 04 Posts: 2183 Credit: 64,822,615 RAC: 5,275 |
and can I abort those two dropped WU from the cpdn site so that they get re-assigned promptly? Once you are done with the tasks you can see in the task list, if you detach the computer from cpdn, the tasks should show up as abandoned in your task list, and should be available to download to other PCs. That is if these tasks didn't have task names with an _2 at the end, which means that they were the last of the 3 tasks issued from a work unit. |
Send message Joined: 7 Jun 17 Posts: 23 Credit: 44,434,789 RAC: 2,600,991 |
I'm not sure it's fair to call it a distraction, Haha, my bad. The distraction is mine: I am not used to having to be in front of any one machine to manage boinc. The machines are spread over two locations 15km apart, two labs, three classrooms and two offices, all of which have their own timetables. I manage everything over ssh via boinctui from wherever I happen to be working. Having boinc running as a service means that workunits are likely to survive a reboot on a remote host, even if with a small loss of work since the last checkpoint. I guess I'll just wait for the next batch of WU, they've got to be coming soon! |
Send message Joined: 7 Sep 16 Posts: 262 Credit: 34,896,361 RAC: 16,952 |
If that's your environment, then CPDN MacOS 32-bit tasks may not be for you. ;) All my compute boxes are quite local, and I enjoy the obscure challenges in computing stuff like this poses. Though I may just update one VM, archive it off, instead of keeping my whole mess of compute VMs around. As long as you let one update a few cycles before starting another one, or change the MAC address, they don't end up merging. Just don't start multiple VMs with identical MAC addresses at the same time. That does break things as they can't be distinguished from each other. |
Send message Joined: 5 Aug 04 Posts: 1120 Credit: 17,193,804 RAC: 2,852 |
Rosetta doesn't have anything good either, something was faulty about their tasks and they're empty on anything that's not "python projects," which I think are VirtualBox only. I might mess with those if I'm bored... Rosetta now has a huge number of non-Python jobs available. All of them crash in under a minute. This is true no matter what version of Linux or Window you may be running. I think that they have an administration-management-quality control problem. Sad that is when WCG is down and ClimatePrediction has no work except for a few MacIntosh ones. |
©2024 cpdn.org