Thread 'Misconfiguration e-mail'

Milo Thurston
Volunteer moderator
Volunteer developer
Joined: 2 Mar 06
Posts: 253
Credit: 363,646
RAC: 0
Message 39494 - Posted: 7 Apr 2010, 15:23:50 UTC - in response to Message 39493.  

ok!
lib32 installed
done.


Thanks - I've now reactivated your host.
ID: 39494

old_user609160
Joined: 5 Jan 10
Posts: 3
Credit: 2,143,399
RAC: 0
Message 39498 - Posted: 7 Apr 2010, 21:02:32 UTC

Hello, I got a misconfiguration message too. I had a bunch of pending system updates, so I applied them and got updated BOINC client pieces. Projects like Milkyway restarted fresh.

I also manually added the gcc lib32 runtime libraries, but I'm not sure what else might still need to be added for CPDN.
My stderrdae.txt file was cleaned up on the reboot, so I'm not sure what the exact problem was at this point. And I don't see it on this website.

Running Ubuntu x64 server.

Dear rvireday
Your computer (host # 1043879) described below appears to have a misconfigured BOINC
installation and is crashing models.
ID: 39498

geophi
Volunteer moderator
Joined: 7 Aug 04
Posts: 2187
Credit: 64,822,615
RAC: 5,275
Message 39501 - Posted: 7 Apr 2010, 22:51:36 UTC - in response to Message 39498.  
Last modified: 7 Apr 2010, 23:11:23 UTC

Hello, I got a misconfiguration message too. I had a bunch of pending system updates, so I applied them and got updated BOINC client pieces. Projects like Milkyway restarted fresh.

I also manually added the gcc lib32 runtime libraries, but I'm not sure what else might still need to be added for CPDN.
My stderrdae.txt file was cleaned up on the reboot, so I'm not sure what the exact problem was at this point. And I don't see it on this website.

Running Ubuntu x64 server.

Here is a list of tasks for that computer.

Here is one of the crashed tasks. If you click on the "+" symbol next to the stderr out label, you will see the message "execv: No such file or directory", which usually indicates that the 32-bit compatibility libraries are missing. This entry in the BOINC wiki discusses why some applications need these 32-bit libraries and which files may be needed.
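As a rough way to check for the situation described above, the sketch below reads the ELF header of an application binary to see whether it is 32-bit, then checks whether a 32-bit dynamic loader is present. The loader path `/lib/ld-linux.so.2` is the conventional x86 location and is an assumption about the host. The helper names are illustrative and are not part of BOINC or CPDN.

```python
import os

ELF_MAGIC = b"\x7fELF"
# EI_CLASS byte (offset 4 in the ELF header): 1 = 32-bit, 2 = 64-bit.
ELF_CLASSES = {1: 32, 2: 64}

# Assumption: conventional path of the 32-bit x86 dynamic loader on Linux.
LOADER_32 = "/lib/ld-linux.so.2"


def elf_bits(path):
    """Return 32 or 64 for an ELF file, or None if the file is not ELF."""
    with open(path, "rb") as f:
        header = f.read(5)
    if len(header) < 5 or header[:4] != ELF_MAGIC:
        return None
    return ELF_CLASSES.get(header[4])


def missing_32bit_loader(binary_path, loader_path=LOADER_32):
    """True if the binary is 32-bit ELF but no 32-bit loader is installed."""
    return elf_bits(binary_path) == 32 and not os.path.exists(loader_path)
```

If `missing_32bit_loader` returns True for a dynamically linked project executable, starting it would be expected to fail with "No such file or directory" even though the executable itself exists, because the loader it depends on cannot be found. This only checks for the loader; individual 32-bit libraries may still be missing.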
ID: 39501

old_user28483
Joined: 4 Nov 04
Posts: 1
Credit: 90,875
RAC: 0
Message 39502 - Posted: 7 Apr 2010, 23:05:53 UTC

I've updated the BOINC Client but have no idea how to troubleshoot otherwise. Any assistance would be greatly appreciated.

ID: 1040561
Created: 26 Dec 2009 5:44:19 UTC
Venue: home
Total credit: 0
Average credit: 0
Average update time: 7 Apr 2010 2:18:41 UTC
IP address: 192.168.1.101 (same the last 528 times)
Domain name: Danny-PC
Local Time = UTC -4 hours
Number of CPUs: 4
CPU: AuthenticAMD AMD Phenom(tm) 9600 Quad-Core Processor [x86 Family 16 Model 2 Stepping 2]
FP ops/sec: 2181187431.18974
Int ops/sec: 4776483720.40348
memory bandwidth: 250000000
Operating System: Microsoft Windows 7 Ultimate x86 Edition, (06.01.7600.00)
Memory: 3071.11 MB
Cache: 512 KB
Swap Space: 6140.5 MB
Total Disk Space: 170.1 GB
Free Disk Space: 161.82 GB
Avg network bandwidth (upstream): 26098.595049 bytes/sec
Avg network bandwidth (downstream): 5330141.838379 bytes/sec
Average turnaround: 0 days
Number of RPCs: 452
Last RPC: 6 Apr 2010 23:44:53 UTC
% of time client on: 99.3995 %
% of time host connected: 99.8751 %
% of time user active: 88.3322 %
# of results today: 4

ID: 39502

old_user609160
Joined: 5 Jan 10
Posts: 3
Credit: 2,143,399
RAC: 0
Message 39503 - Posted: 8 Apr 2010, 0:00:58 UTC - in response to Message 39501.  

Okay, I have all the libs installed now. The machine is stable and running.
ID: 39503

Milo Thurston
Volunteer moderator
Volunteer developer
Joined: 2 Mar 06
Posts: 253
Credit: 363,646
RAC: 0
Message 39505 - Posted: 8 Apr 2010, 7:42:18 UTC - in response to Message 39503.  

Okay, I have all the libs installed now. The machine is stable and running.


Thanks - it has now been reactivated.
ID: 39505

mo.v
Volunteer moderator
Joined: 29 Sep 04
Posts: 2363
Credit: 14,611,758
RAC: 0
Message 39506 - Posted: 8 Apr 2010, 9:34:22 UTC
Last modified: 8 Apr 2010, 9:44:56 UTC

rvireday

Could you please go into your account (click on "Taking part in CPDN" in the blue menu on the left here and you'll see a submenu with your account)? In the ClimatePrediction preferences of your account, could you unhide your computer(s), at least for a week or two? This will allow us to check easily that the problem computer can now start and run models.
Cpdn news
ID: 39506

mo.v
Volunteer moderator
Joined: 29 Sep 04
Posts: 2363
Credit: 14,611,758
RAC: 0
Message 39507 - Posted: 8 Apr 2010, 16:31:44 UTC

Hi Danny

Thank you for posting.

You have two computers listed and I think they both have the same problem. Models cannot start on these computers.

On computer 1067547 a typical task is model 11296994. Click on + beside stderr out to see the messages. It failed with exit code -185 and 'CreateProcess() failed - Access is denied. (0x5)'.

If you look at computer 1040561, then click on its tasks and select any of them, you see the same exit code and messages.

In the Boinc FAQs Jorden explains a fix for this error here. Could you please try his suggestions on both computers. If this works on the computer that can still receive tasks and a new model can start, I am sure that Milo will then allow tasks for the second computer.

Please post back and tell us when you have done this.
Cpdn news
ID: 39507

mo.v
Volunteer moderator
Joined: 29 Sep 04
Posts: 2363
Credit: 14,611,758
RAC: 0
Message 39508 - Posted: 8 Apr 2010, 16:54:39 UTC
Last modified: 8 Apr 2010, 16:55:25 UTC

REMINDER

If you have received the email we can only help you to solve the problem if you post.

Most of the computers that have crashed a lot of climate models are very nice! Most of the solutions to problems are not very complicated!

Just click on the 'Post to thread' button at the bottom. You may need to log in before you can post. Say, for example, 'I have received the email about computer number [give us its number, which is in the email]. What should I do?'

When you think you have corrected the problem please post again to tell us. Milo will then allow your computer to download models again.

Cpdn news
ID: 39508

Doug Schwerin
Joined: 20 Mar 05
Posts: 2
Credit: 2,123,420
RAC: 0
Message 39526 - Posted: 10 Apr 2010, 5:59:07 UTC

http://climateapps2.oucs.ox.ac.uk/cpdnboinc/show_host_detail.php?hostid=1029100
I reset the project. All other projects on the computer seem to be doing fine. The error seems to be a failure to create a file.
ID: 39526

Les Bayliss
Volunteer moderator
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 39527 - Posted: 10 Apr 2010, 6:30:51 UTC

Hi Doug
Sounds like a permissions problem.
You're using a very old version of BOINC, but back when that version was 'in vogue', this post mentioned a cure.
It\'s in the middle of a longish thread, so it may take a while to jump to the post in question.


Backups: Here
ID: 39527

Doug Schwerin
Joined: 20 Mar 05
Posts: 2
Credit: 2,123,420
RAC: 0
Message 39531 - Posted: 11 Apr 2010, 5:02:45 UTC - in response to Message 39527.  

Hi Doug
Sounds like a permissions problem.
You're using a very old version of BOINC, but back when that version was 'in vogue', this post mentioned a cure.
It\'s in the middle of a longish thread, so it may take a while to jump to the post in question.


I have upgraded the computer and some others to version 6.10.43. Let's see what that will do.
ID: 39531

Milo Thurston
Volunteer moderator
Volunteer developer
Joined: 2 Mar 06
Posts: 253
Credit: 363,646
RAC: 0
Message 39532 - Posted: 11 Apr 2010, 10:34:51 UTC - in response to Message 39531.  


I have upgraded the computer and some others to version 6.10.43. Let's see what that will do.


I've set it to 1 task per day to see if it now works.
ID: 39532

mo.v
Volunteer moderator
Joined: 29 Sep 04
Posts: 2363
Credit: 14,611,758
RAC: 0
Message 39537 - Posted: 12 Apr 2010, 21:45:02 UTC

The owner of computer 750658 has said in an RT email that he's installed a new version of BOINC (it had a 5* version).

Milo, could you please reset the quota for this computer so we can see whether it now works?
Cpdn news
ID: 39537

Nylanfs
Joined: 17 Feb 06
Posts: 1
Credit: 947,631
RAC: 0
Message 39540 - Posted: 13 Apr 2010, 3:33:05 UTC
Last modified: 13 Apr 2010, 3:52:12 UTC

Responding to email received on 04/07

Dear Nylanfs
Your computer (host # 971800) described below appears to have a misconfigured BOINC
installation and is crashing models. Would you please have a look at it?

If you need assistance, please post in this thread on our BOINC forums and we will suggest a way to fix the problem. You may post in any language:

http://climateapps2.oucs.ox.ac.uk/cpdnboinc/forum_thread.php?id=6880

When you have applied the fix please post to say so. Until the problem is fixed no more work will be sent to your computer.

ID: 971800
Created: 7 May 2009 3:53:01 UTC
Venue: home
Total credit: 0
Average credit: 0
Average update time: 7 Apr 2010 2:18:41 UTC
IP address: 192.168.1.102 (same the last 626 times)
Domain name: owner-c7c912008
Local Time = UTC -4 hours
Number of CPUs: 1
CPU: AuthenticAMD AMD Athlon(tm) XP [x86 Family 6 Model 10 Stepping 0]
FP ops/sec: 1472300469.48357
Int ops/sec: 2465812817.88225
memory bandwidth: 1000000000
Operating System: Microsoft Windows XP Home x86 Edition, Service Pack 3, (05.01.2600.00)
Memory: 2047.48 MB
Cache: 512 KB
Swap Space: 3433.54 MB
Total Disk Space: 149.05 GB
Free Disk Space: 16.42 GB
Avg network bandwidth (upstream): 13428.791377 bytes/sec
Avg network bandwidth (downstream): 116946.168397 bytes/sec
Average turnaround: 0 days
Number of RPCs: 989
Last RPC: 7 Apr 2010 1:59:34 UTC
% of time client on: 94.9928 %
% of time host connected: 91.6345 %
% of time user active: 91.6277 %
# of results today: 1


I just downloaded and uninstalled then re-installed BOINC 6.10.42. So hopefully this will be fixed.

EDIT: Or I just found this which might be the problem. I have set my preferences to use HADSM3, HADCM3 & HADSM3MH.
ID: 39540

geophi
Volunteer moderator
Joined: 7 Aug 04
Posts: 2187
Credit: 64,822,615
RAC: 5,275
Message 39541 - Posted: 13 Apr 2010, 4:42:03 UTC - in response to Message 39540.  

EDIT: Or I just found this which might be the problem. I have set my preferences to use HADSM3, HADCM3 & HADSM3MH.

That should fix it. Milo will have to reenable downloads for that PC in order for it to start picking up more work.
ID: 39541

Milo Thurston
Volunteer moderator
Volunteer developer
Joined: 2 Mar 06
Posts: 253
Credit: 363,646
RAC: 0
Message 39543 - Posted: 13 Apr 2010, 7:52:08 UTC

I've re-enabled 971800 and 750658.
ID: 39543

Dave512
Joined: 11 Oct 07
Posts: 2
Credit: 92,002
RAC: 0
Message 39581 - Posted: 18 Apr 2010, 12:26:47 UTC - in response to Message 39186.  

G'day, I got the e-mail about a misconfigured computer.

Here is the link quoted for my computer

http://climateapps2.oucs.ox.ac.uk/cpdnboinc/show_host_detail.php?hostid=834157


I am not really sure what the problem is, so I don't know where to start looking. I upgraded BOINC and moved the data files to a drive with more space (it was preventing other projects from loading).

When I upgraded BOINC, it created a new host on all projects. It immediately tried to start this project again, so I have suspended it for now.

This is the new ID for this cpu

http://climateapps2.oucs.ox.ac.uk/cpdnboinc/show_host_detail.php?hostid=1069861


Sorry about the delayed response to your mail, I don't check it often.

Thanks
-Dave
ID: 39581

geophi
Volunteer moderator
Joined: 7 Aug 04
Posts: 2187
Credit: 64,822,615
RAC: 5,275
Message 39583 - Posted: 18 Apr 2010, 15:00:37 UTC
Last modified: 18 Apr 2010, 15:02:28 UTC

Dave512,

The first error you received was on a task downloaded on Jan 1 that crashed on Feb 1. It crashed with exit code -185, and all tasks since then have crashed immediately with that code. Within the stderr listing on those result pages, it says "Can't get shared memory segment name: shmget() failed". The BOINC FAQ for that exit code says:
ERR_RESULT_START -185

This is an error that will occur:
- if BOINC couldn't start the application
- if files are missing
- upon catch of other error returns
- on nonzero exit or signal
- if exceeded resource limit
- as catch-all for resume/start errors


The timing of your first problem is close to coincident with when some antivirus software (AVG among them) misidentified one of the climateprediction.net applications as possible malware, and quarantined it. What antivirus software is installed on that PC? Updates to AVG appear to have fixed that problem. With a reinstallation of BOINC, new cpdn applications would have been downloaded and if your antivirus software is playing well, it could start working again. I would let the model you downloaded run and see what happens.
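
Since exit code -185 acts as a catch-all, the stderr text is what distinguishes the cases seen in this thread. The sketch below is an illustrative lookup only: the marker strings are the stderr messages quoted in the posts above, and the hints paraphrase what the moderators suggested. The names `HINTS` and `suggest_cause` are hypothetical and not part of BOINC.

```python
# Marker substrings are the stderr messages quoted in this thread; the hints
# paraphrase the moderators' suggestions and are not an official diagnosis.
HINTS = [
    ("execv: No such file or directory",
     "Linux x64 host may lack the 32-bit compatibility libraries"),
    ("CreateProcess() failed - Access is denied",
     "Windows could not launch the application; see the BOINC FAQ fix "
     "referenced in the thread"),
    ("shmget() failed",
     "application did not start; antivirus quarantine of a CPDN "
     "application was suggested as one possible cause"),
]


def suggest_cause(stderr_text):
    """Return the hints whose marker string appears in the stderr text."""
    return [hint for marker, hint in HINTS if marker in stderr_text]
```

For example, passing the stderr listing copied from a result page to `suggest_cause` returns the matching hints, or an empty list if none of the known messages appear.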
ID: 39583

Dave512
Joined: 11 Oct 07
Posts: 2
Credit: 92,002
RAC: 0
Message 39589 - Posted: 19 Apr 2010, 7:58:54 UTC - in response to Message 39583.  

I have AVG running, so that could be it.

I think I disrupted the file transfer when I suspended the task. I have reset the project, so I will probably have a failed task already. I have also requested no new tasks for now, so that if this one fails it won't clog up your system.

Thanks for the reply
-Dave
ID: 39589