climateprediction.net (CPDN) home page
Thread 'No work being downloaded on Linux host'

Thread 'No work being downloaded on Linux host'

Questions and Answers : Unix/Linux : No work being downloaded on Linux host
Message board moderation

To post messages, you must log in.

1 · 2 · 3 · Next

AuthorMessage
Hal Bregg

Send message
Joined: 20 Nov 18
Posts: 20
Credit: 816,342
RAC: 1,139
Message 61413 - Posted: 27 Oct 2019, 20:15:02 UTC

I am trying to run project on 64-bit Linux host with 32-bit libraries installed as per this thread
https://www.cpdn.org/forum_thread.php?id=8008#54197

but no tasks are being downloaded, despite server status showing over 2000 of them ready to send.
Any particular reason why client is not downloading?

I was able to download one task on 32-bit host installed in VM but running time is rather appalling.
ID: 61413 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4540
Credit: 19,039,635
RAC: 18,944
Message 61414 - Posted: 27 Oct 2019, 20:27:23 UTC - in response to Message 61413.  

I am trying to run project on 64-bit Linux host with 32-bit libraries installed as per this thread
https://www.cpdn.org/forum_thread.php?id=8008#54197

but no tasks are being downloaded, despite server status showing over 2000 of them ready to send.
Any particular reason why client is not downloading?

I was able to download one task on 32-bit host installed in VM but running time is rather appalling.


With regards to running time, yes these are long tasks

I don't know why you aren't getting any on the 64 bit host though. If you try and download from it before the one hour time out after the last request by hitting the update button, the one hour just starts again. As your computers are hidden, I can't seen any clues there.
ID: 61414 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 61415 - Posted: 27 Oct 2019, 20:44:13 UTC - in response to Message 61413.  

What are the messages saying (in the Event Log) ?
ID: 61415 · Report as offensive     Reply Quote
Hal Bregg

Send message
Joined: 20 Nov 18
Posts: 20
Credit: 816,342
RAC: 1,139
Message 61430 - Posted: 30 Oct 2019, 20:31:30 UTC - in response to Message 61415.  

What are the messages saying (in the Event Log) ?


Just standard message like this
Tue 29 Oct 2019 20:21:37 GMT | climateprediction.net | Scheduler request completed: got 0 new tasks


However not long after I created this topic I got one task and 2 days later another one. I suspect that server is sometimes picky when it comes to sending new work.[/url]
ID: 61430 · Report as offensive     Reply Quote
Bryn Mawr

Send message
Joined: 28 Jul 19
Posts: 150
Credit: 12,830,559
RAC: 228
Message 61646 - Posted: 10 Dec 2019, 13:59:22 UTC

I’m having the same problem, I have a Ryzen 5 2600 set up (host 1489273) that’s processing quite happily but I set up a second system (host 1495523) that I set up at the end of November with identical software and nothing I’ve tried will persuade it to download any cpdn work units.

Any suggestions?
ID: 61646 · Report as offensive     Reply Quote
Jim1348

Send message
Joined: 15 Jan 06
Posts: 637
Credit: 26,751,529
RAC: 653
Message 61647 - Posted: 10 Dec 2019, 14:16:42 UTC - in response to Message 61646.  
Last modified: 10 Dec 2019, 15:00:32 UTC

I’m having the same problem, I have a Ryzen 5 2600 set up (host 1489273) that’s processing quite happily but I set up a second system (host 1495523) that I set up at the end of November with identical software and nothing I’ve tried will persuade it to download any cpdn work units.

Any suggestions?

I had an anomaly recently. Without going into the details, try rebooting and then manually forcing an update. It might work.

EDIT: And I see that you are on BOINC 7.9.3. Version 7.16.3 is better (though not perfect).
You can get it by adding this repository: https://launchpad.net/~costamagnagianfranco/+archive/ubuntu/boinc
ID: 61647 · Report as offensive     Reply Quote
Profilegeophi
Volunteer moderator

Send message
Joined: 7 Aug 04
Posts: 2187
Credit: 64,822,615
RAC: 5,275
Message 61648 - Posted: 10 Dec 2019, 16:20:01 UTC - in response to Message 61646.  
Last modified: 10 Dec 2019, 16:44:20 UTC

I’m having the same problem, I have a Ryzen 5 2600 set up (host 1489273) that’s processing quite happily but I set up a second system (host 1495523) that I set up at the end of November with identical software and nothing I’ve tried will persuade it to download any cpdn work units.

Any suggestions?


I would try again to make sure all the 32bit libraries are installed. For your version of Ubuntu...

sudo apt-get install lib32ncurses5 lib32z1 lib32stdc++-6-dev

If that doesn't do it, post back here with the boinc manager message log messages from when it sends out a request for work and any messages that are a response to that request.
ID: 61648 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4540
Credit: 19,039,635
RAC: 18,944
Message 61649 - Posted: 10 Dec 2019, 16:32:32 UTC

Strange, I had expected one or two messages with regards to emails going out to serial crashers who don't have the 32bit libs. Maybe all my fellow Mods are getting PMs?
ID: 61649 · Report as offensive     Reply Quote
Profilegeophi
Volunteer moderator

Send message
Joined: 7 Aug 04
Posts: 2187
Credit: 64,822,615
RAC: 5,275
Message 61650 - Posted: 10 Dec 2019, 16:43:21 UTC - in response to Message 61649.  

Strange, I had expected one or two messages with regards to emails going out to serial crashers who don't have the 32bit libs. Maybe all my fellow Mods are getting PMs?

I've gotten a couple from people who followed the instructions, PM'd me, and I notified Andy who re-enabled their hosts. It might have been they PM'd me since my name's on the posts with the proper instructions for popular distributions.
ID: 61650 · Report as offensive     Reply Quote
Bryn Mawr

Send message
Joined: 28 Jul 19
Posts: 150
Credit: 12,830,559
RAC: 228
Message 61651 - Posted: 10 Dec 2019, 17:49:33 UTC

I’ve checked that the libs specified are loaded which they are but I’m having difficulty adding the repository to update versions, I’ll try again after tea.
ID: 61651 · Report as offensive     Reply Quote
Bryn Mawr

Send message
Joined: 28 Jul 19
Posts: 150
Credit: 12,830,559
RAC: 228
Message 61652 - Posted: 10 Dec 2019, 18:39:27 UTC

Ok, repository added, 7.16.3 installed, system rebooted and a manual update request performed resulting in one work unit downloaded.

Than you for all the assistance - I’ll update the main system in a couple of days when this has bedded in.
ID: 61652 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 61654 - Posted: 10 Dec 2019, 19:52:24 UTC

It would also be worth a look at the special threads in Number crunching for the HADAM4 models, especially the N216 model.
They require LOTS of memory.
ID: 61654 · Report as offensive     Reply Quote
ProfileAlan K

Send message
Joined: 22 Feb 06
Posts: 491
Credit: 31,135,131
RAC: 15,406
Message 61656 - Posted: 10 Dec 2019, 23:11:52 UTC - in response to Message 61654.  

They can also take 20days + to run!
ID: 61656 · Report as offensive     Reply Quote
Bryn Mawr

Send message
Joined: 28 Jul 19
Posts: 150
Credit: 12,830,559
RAC: 228
Message 61660 - Posted: 11 Dec 2019, 12:41:48 UTC - in response to Message 61654.  

Yes, I now restrict the N216 WUs to one at a time on my main machine and that has 16gb.

I’ll have to look at the second machine, it has 8gb loaded but my account here is only reporting 6gb, could be a problem there.
ID: 61660 · Report as offensive     Reply Quote
Jim1348

Send message
Joined: 15 Jan 06
Posts: 637
Credit: 26,751,529
RAC: 653
Message 61664 - Posted: 11 Dec 2019, 16:36:04 UTC - in response to Message 61660.  
Last modified: 11 Dec 2019, 16:39:06 UTC

Yes, I now restrict the N216 WUs to one at a time on my main machine and that has 16gb.

That is plenty of memory for the N216 (they take less than 1.5 GB each).
The real limitation for running multiple work units is the cache on the CPU.
But your Ryzen 2600 is very good at that; I run four at a time on mine without a significant slowdown.

The real question is the OpenIFS, when they come along.
ID: 61664 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4540
Credit: 19,039,635
RAC: 18,944
Message 61666 - Posted: 11 Dec 2019, 16:59:07 UTC - in response to Message 61664.  

The real question is the OpenIFS, when they come along.


The most demanding of these took a bit over 5GB/core peak in testing. On a machine with 8GB RAM running two at once was OK as they rarely if ever peaked at the same time. Running three or four there was a significant hit because of swapping to disk though total throughput still increased. My ageing desktop with only 4GB of RAM was recognised as not having enough by BOINC and wouldn't download them.
ID: 61666 · Report as offensive     Reply Quote
Jim1348

Send message
Joined: 15 Jan 06
Posts: 637
Credit: 26,751,529
RAC: 653
Message 61667 - Posted: 11 Dec 2019, 17:22:22 UTC - in response to Message 61666.  

My ageing desktop with only 4GB of RAM was recognised as not having enough by BOINC and wouldn't download them.

I was wondering how they handled insufficient memory. With a little luck, they won't cause problems, though I will be running them only on machines with 32 GB.
I am ready when they are, I think.
ID: 61667 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 61669 - Posted: 11 Dec 2019, 19:14:46 UTC - in response to Message 61667.  

Which, apparently, won't be well into next year. :(
But you know what "they're" like with new work. :)
ID: 61669 · Report as offensive     Reply Quote
Bryn Mawr

Send message
Joined: 28 Jul 19
Posts: 150
Credit: 12,830,559
RAC: 228
Message 61672 - Posted: 11 Dec 2019, 22:28:12 UTC - in response to Message 61664.  

Yes, I now restrict the N216 WUs to one at a time on my main machine and that has 16gb.

That is plenty of memory for the N216 (they take less than 1.5 GB each).
The real limitation for running multiple work units is the cache on the CPU.
But your Ryzen 2600 is very good at that; I run four at a time on mine without a significant slowdown.

The real question is the OpenIFS, when they come along.


Alongside the cpdn WUs I’m running Rosetta and WCG WUs on all cores, again I restrict MIP to one thread at a time.

When OpenIFS comes along I think I’ll need to double the memory to cope.
ID: 61672 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4540
Credit: 19,039,635
RAC: 18,944
Message 61673 - Posted: 12 Dec 2019, 7:11:32 UTC - in response to Message 61672.  

When OpenIFS comes along I think I’ll need to double the memory to cope.


Probably worth waiting till the next batch or two gets through testing at least whenever that is. I think the later ones peaked about 3.6GB/task but I didn't get any of them so can't be sure.
ID: 61673 · Report as offensive     Reply Quote
1 · 2 · 3 · Next

Questions and Answers : Unix/Linux : No work being downloaded on Linux host

©2024 cpdn.org