Message boards : Number crunching : New work Discussion
Message board moderation
Previous · 1 . . . 51 · 52 · 53 · 54 · 55 · 56 · 57 . . . 91 · Next
Author | Message |
---|---|
Send message Joined: 28 Jul 19 Posts: 150 Credit: 12,830,559 RAC: 228 |
Now, obviously, I cannot check the server for an uncleared loch file but is there any way I can change my host id and is there any way I can resend my code sign key? I’ve tried that many times. The last thing I tried was to create a new user id and attach to that but still no joy :-( I’ll ask at Boinc and see what they say. |
Send message Joined: 11 Dec 19 Posts: 108 Credit: 3,012,142 RAC: 0 |
Bryn Mawr - Try: ls /var/run/lock |
Send message Joined: 28 Jul 19 Posts: 150 Credit: 12,830,559 RAC: 228 |
Bryn Mawr - This would be a lock file on the server set after it has looked up my user id and before it starts updating the dB. |
Send message Joined: 31 Dec 07 Posts: 1152 Credit: 22,363,583 RAC: 5,022 |
YES! For the first time in about 6 month I got 6 new windows tasks. Hopefully this marks the return of work for computers running Windows.. |
Send message Joined: 6 Oct 06 Posts: 204 Credit: 7,608,986 RAC: 0 |
Yes but I wish there was a way to attach hooters to CPDN. They should start hooting before work is sent by the server. Last night when new WU's were dispatched my machine was already happily munching on Linux WU's. L3 cache keeps appearing in various threads. L3 has a story, poor L3. I have an i7 well both are i7's. 9Mb L3 cache divided amongst six physical cores and six Hyper-Threaded Virtual cores, on top of which I had a VM running with three Linux tasks. Which makes fifteen WU's scrambling after 9Mb. (On one very stable machine, I have switched off Hyper-Threading so six physical cores to fight over 9Mb L3). Last night work landed on the un-switched off machine. By the time the propeller sound switched to the turbo-prop mode and warned me about the shenanigans going on, sad but two WU's had errored out :'(. Any ideas as to how to fit air-raid sirens which should scream when CPDN is in the mood? |
Send message Joined: 28 Jul 19 Posts: 150 Credit: 12,830,559 RAC: 228 |
Now, obviously, I cannot check the server for an uncleared loch file but is there any way I can change my host id and is there any way I can resend my code sign key? I might, just might, have resolved this. I found that I had no alt platform set in the cc config file. I removed that and re-read config and it made no never mind but 3 days later I rebooted and promptly started receiving work. Later this morning I’ll try rebooting the other machine and see if that one wakes up as well. |
Send message Joined: 28 Jul 19 Posts: 150 Credit: 12,830,559 RAC: 228 |
I think that confirms it, within an hour of rebooting I have work. So, changing no_alt_platform from 1 to 0 followed by a reboot appears to have fixed the fault. |
Send message Joined: 15 May 09 Posts: 4540 Credit: 19,019,755 RAC: 20,934 |
Well done! Next question is why do some clean installs put it in and others not or were these all carrying over old cc_config.xml files? Edit:Only just noticed that this was a carry over from running WCG. Please do not private message myself or other moderators for help. This limits the number of people who are able to help and deprives others who may benefit from the answer. |
Send message Joined: 28 Jul 19 Posts: 150 Credit: 12,830,559 RAC: 228 |
Well done! All we need now is to see whether others suffering the same problem are using the same setting. Les Bayliss is one possibility :-) |
Send message Joined: 17 Jan 09 Posts: 124 Credit: 2,030,323 RAC: 2,771 |
Can this be shared on the BOINC message boards... It sounds like something that both Development and perhaps WCG might want to look at. Bill F In October 1969 I took an oath to support and defend the Constitution of the United States against all enemies, foreign and domestic; There was no expiration date. |
Send message Joined: 15 May 09 Posts: 4540 Credit: 19,019,755 RAC: 20,934 |
Can this be shared on the BOINC message boards... It sounds like something that both Development and perhaps WCG might want to look at. It is here on the BOINC forums. Please do not private message myself or other moderators for help. This limits the number of people who are able to help and deprives others who may benefit from the answer. |
Send message Joined: 28 Jul 19 Posts: 150 Credit: 12,830,559 RAC: 228 |
Can this be shared on the BOINC message boards... It sounds like something that both Development and perhaps WCG might want to look at. Aye, it would be very helpful if Boinc put out a message giving the reason for no task sent, even if only as part of the debug stream. |
Send message Joined: 28 Dec 17 Posts: 18 Credit: 1,097,261 RAC: 147 |
I managed to snag four WUs of the new SAFR50s from HAPPI, but sadly two of them so far got computational errors (signal 11, segment stuff yet again). I would really like to help with this project given its importance, but I think the other two will eventually fail, too. My computer isn't overtaxed with processes, but it is old. Not sure what you would advise. Hopefully the people who take over my WUs have success. :( |
Send message Joined: 28 Jul 19 Posts: 150 Credit: 12,830,559 RAC: 228 |
I managed to snag four WUs of the new SAFR50s from HAPPI, but sadly two of them so far got computational errors (signal 11, segment stuff yet again). I would really like to help with this project given its importance, but I think the other two will eventually fail, too. My computer isn't overtaxed with processes, but it is old. Not sure what you would advise. Hopefully the people who take over my WUs have success. It looks like your computer is suspending processing quite frequently. This has been known to cause computation errors. Try setting you preferences to not suspend and to keep work in memory if it does suspend. |
Send message Joined: 15 May 09 Posts: 4540 Credit: 19,019,755 RAC: 20,934 |
I managed to snag four WUs of the new SAFR50s from HAPPI, but sadly two of them so far got computational errors (signal 11, segment stuff yet again). I would really like to help with this project given its importance, but I think the other two will eventually fail, too. My computer isn't overtaxed with processes, but it is old. Not sure what you would advise. Hopefully the people who take over my WUs have success. Last time I looked, 14 had completed successfully, I think the number that had hard failed, i.e. all three attempts had failed was 15. It was certainly more than the successes. That suggests the failures have little to do with the computer in question but more to do with the particular tasks. This issue has been seen by one of the moderators who has reported it back to the project. Please do not private message myself or other moderators for help. This limits the number of people who are able to help and deprives others who may benefit from the answer. |
Send message Joined: 16 Jan 10 Posts: 1084 Credit: 7,808,726 RAC: 5,192 |
I managed to snag four WUs of the new SAFR50s from HAPPI, but sadly two of them so far got computational errors (signal 11, segment stuff yet again). I would really like to help with this project given its importance, but I think the other two will eventually fail, too. My computer isn't overtaxed with processes, but it is old. Not sure what you would advise. Hopefully the people who take over my WUs have success. This batch, 890, has a very high error rate, so I wouldn’t worry about the machine. Some models in the batch have finished, so please persist with any model that is still running — but don’t be surprised if the model doesn’t finish. Mine all crashed on all my machines. |
Send message Joined: 6 Oct 06 Posts: 204 Credit: 7,608,986 RAC: 0 |
Could someone please tell me after reading my results page, why have 100% of my WU's errored out? This is embarrassing plus I do not see any reason behind the why? |
Send message Joined: 15 May 09 Posts: 4540 Credit: 19,019,755 RAC: 20,934 |
Could someone please tell me after reading my results page, why have 100% of my WU's errored out? This is embarrassing plus I do not see any reason behind the why? If you are talking about the latest safr batch, it is as noted in other posts in this thread it is a problem with the batch. I just looked, Only 20 have so far succeeded and well over that have hard failed which means all three computers they have run on have failed them. Please do not private message myself or other moderators for help. This limits the number of people who are able to help and deprives others who may benefit from the answer. |
Send message Joined: 6 Oct 06 Posts: 204 Credit: 7,608,986 RAC: 0 |
Yes and thank you. Any reason why? |
Send message Joined: 11 Dec 19 Posts: 108 Credit: 3,012,142 RAC: 0 |
Yes and thank you. Any reason why? If I had to guess (and yes, I am guessing) I would say it's because Unix used UDP 42 for the Host Name Server Protocol where Windows uses if for all *.dll inter-process communication. Just a guess. |
©2024 cpdn.org