climateprediction.net (CPDN) home page
Thread 'What Happened ???'

Thread 'What Happened ???'

Message boards : Number crunching : What Happened ???
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 · 6 . . . 7 · Next

AuthorMessage
ProfileIain Inglis
Volunteer moderator

Send message
Joined: 16 Jan 10
Posts: 1084
Credit: 7,829,455
RAC: 5,056
Message 58137 - Posted: 22 Apr 2018, 23:23:50 UTC - in response to Message 58133.  

[Vicki wrote:]... 13+ hours trying to download 4 tasks is making dial up look fast. ...

Your machines are hidden so I can't see the tasks that are taking so long to download, but I've had a group of very old task reissues do that today (HADAM3P, batches 469/470): if so, they're from 2016 and will never download - they can safely be aborted, or at least that's what I did to mine.
ID: 58137 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4540
Credit: 19,039,635
RAC: 18,944
Message 58141 - Posted: 23 Apr 2018, 8:55:14 UTC
Last modified: 23 Apr 2018, 8:58:44 UTC

I too was a bit surprised to see a task from batch 470 trying to download. Some of the files did but the rest all went to backoff. I too have aborted.

Edit: On looking at my account page, that is the only tasks listed as in progress on that machine, the four actually running aren't listed.
ID: 58141 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4540
Credit: 19,039,635
RAC: 18,944
Message 58143 - Posted: 23 Apr 2018, 13:44:24 UTC

if so, they're from 2016 and will never download


Except, I have a 470 that has just downloaded and is now running.
ID: 58143 · Report as offensive     Reply Quote
flashawk

Send message
Joined: 29 Jun 12
Posts: 31
Credit: 1,438,478
RAC: 0
Message 58144 - Posted: 23 Apr 2018, 15:02:46 UTC

I have 8 out of 14 that are from 2016 that unstuck and downloaded, I'm currently crunching 3 of them. They went in to project backoff for over 24 hours, everything has cleared now.
ID: 58144 · Report as offensive     Reply Quote
ProfileSaenger
Avatar

Send message
Joined: 1 Nov 04
Posts: 185
Credit: 4,166,063
RAC: 857
Message 58145 - Posted: 23 Apr 2018, 17:12:18 UTC

The two on my computer currently running, but not in my task list, are named:



Anyone any explanation where and why they have venished from the database?


Grüße vom Sänger
ID: 58145 · Report as offensive     Reply Quote
ProfileVicki

Send message
Joined: 28 Nov 15
Posts: 50
Credit: 4,099,809
RAC: 0
Message 58146 - Posted: 23 Apr 2018, 19:30:45 UTC - in response to Message 58137.  

My computers are now visible. 1 computer has completed downloading, the other is labeled vicki-ace & is still stuck. A mix of sam25, wah2_global and wah2_ea50
ID: 58146 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 58147 - Posted: 23 Apr 2018, 19:47:17 UTC - in response to Message 58145.  

The two on my computer currently running, but not in my task list, are named:



Anyone any explanation where and why they have venished from the database?




Because we're still running on the slave database while Andy works on the main server.

Nothing that is seen on any lists can be assumed to be real.
ID: 58147 · Report as offensive     Reply Quote
DJStarfox

Send message
Joined: 27 Jan 07
Posts: 300
Credit: 3,288,263
RAC: 26,370
Message 58148 - Posted: 24 Apr 2018, 15:54:59 UTC - in response to Message 58089.  

When this site shuts down, people switch to the BOINC site, in particular the Projects section, top post, which is: News on Project Outages, where a message, usually from Andy, is posted.

In this case, a new thread was also created: CPDN project going offline this afternoon, which is full of posts from people, including one that I put there near the end of the thread, which I felt explains things well enough.


First of all, I (and likely many others) did not know there was a specific forum on some other site, somewhere, where a cryptic message about some maintenance was posted.... Furthermore, the site has been down for over a month! That sounds much worse than some routine maintenance, so I consider that post deceptive.

I'm glad the site is back, and I hope such a long outage does not happen again. In the future, a post on the HOME PAGE of climateprediction.net would be much more appropriate and visible to the community.
ID: 58148 · Report as offensive     Reply Quote
Alex Plantema

Send message
Joined: 3 Sep 04
Posts: 126
Credit: 26,610,380
RAC: 3,377
Message 58149 - Posted: 24 Apr 2018, 16:24:23 UTC - in response to Message 58148.  

In the future, a post on the HOME PAGE of climateprediction.net would be much more appropriate and visible to the community.

There was a post on the homepage about the problems.
ID: 58149 · Report as offensive     Reply Quote
ProfileJIM

Send message
Joined: 31 Dec 07
Posts: 1152
Credit: 22,363,583
RAC: 5,022
Message 58150 - Posted: 24 Apr 2018, 22:08:10 UTC

What I don’t understand is why when the Project Admins need to communicate with the Crunchers they don’t post in the notices tab if the Boinc Manager? Presumably, that’s what it’s there for. Even if it’s only to tell us that more info is available at another website. Other projects make extensive use of it. Seti posts there all the time.
ID: 58150 · Report as offensive     Reply Quote
DJStarfox

Send message
Joined: 27 Jan 07
Posts: 300
Credit: 3,288,263
RAC: 26,370
Message 58153 - Posted: 25 Apr 2018, 12:22:43 UTC

The entire cpdn.org website was down for an extended time. I visited climateprediction.net several times but did not see any notice about downtime.
ID: 58153 · Report as offensive     Reply Quote
ProfileVicki

Send message
Joined: 28 Nov 15
Posts: 50
Credit: 4,099,809
RAC: 0
Message 58154 - Posted: 25 Apr 2018, 21:04:27 UTC - in response to Message 58153.  

The last stuck file just sprinted to its finish line; the fastest download speed I have seen on that desktop in ages, esp from cpdn. 3+ days to download. I will stick to my plan of catch up football on other projects, which will give them more time to fix up the server properly. My lappy has about a weeks worth of Einstein to crunch on in the meantime & the desktop crunching on mixture of Seti & Einstein. The last month I have finally crossed over the 800k mark in total Seti credit, long overdue given that it was the first project I joined, way back before Bonic changed into its current form.
ID: 58154 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4540
Credit: 19,039,635
RAC: 18,944
Message 58162 - Posted: 3 May 2018, 17:40:27 UTC
Last modified: 3 May 2018, 17:44:28 UTC

Latest update I saw after the project went off line.

The CPDN project is now offline. I will be taking a database dump of the database in order to resurrect the master-slave relationship on the two database servers in the project. In order to do this I need a database where no transactions are taking place. Once this is complete, it is likely we will start the project from the backup project server, rather than the main project server, due to ongoing instabilities in the main GPFS infrastructure.


Project updates seem to be working again though no tasks available. I haven't looked to see if the information on my account pages is more accurate than it was yet.

Edit: Information on tasks running etc. still wildly out on my account so I imagine the same is still true for others. Will wait till things improve before checking out some other issues I have had.
ID: 58162 · Report as offensive     Reply Quote
Jim1348

Send message
Joined: 15 Jan 06
Posts: 637
Credit: 26,751,529
RAC: 653
Message 58163 - Posted: 3 May 2018, 19:13:52 UTC - in response to Message 58162.  

I am not getting any new tasks either. But both of my work units that finished today are showing properly as completed on my tasks page. So things are a bit better.
ID: 58163 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 58164 - Posted: 3 May 2018, 19:30:46 UTC

Hi All,

Just to let you know that the project is now back online, rather than running from the backup project server, we are running from the main project server. The infrastructure still remains at risk due to ongoing instabilities in the main VMware/GPFS infrastructure.

Best regards,

Andy


So don't keep poking it, or you might break it again. :)
ID: 58164 · Report as offensive     Reply Quote
Alex Plantema

Send message
Joined: 3 Sep 04
Posts: 126
Credit: 26,610,380
RAC: 3,377
Message 58165 - Posted: 3 May 2018, 19:58:32 UTC

Last week I downloaded three tasks (20878730, 20878130 and 20876968) on https://www.cpdn.org/cpdnboinc/results.php?hostid=1425854 which I already downloaded last November and returned successfully.
Task 20425445 was listed as downloaded in May 2017 but wasn't actually received. It was also downloaded last week and returned successfully.
The original tasks aren't shown in the list anymore.
So if you find these low task numbers you might have processed them already.
ID: 58165 · Report as offensive     Reply Quote
flashawk

Send message
Joined: 29 Jun 12
Posts: 31
Credit: 1,438,478
RAC: 0
Message 58166 - Posted: 4 May 2018, 0:54:56 UTC

Server page is out dated, are there any tasks or are we out right now?
ID: 58166 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 58167 - Posted: 4 May 2018, 2:57:34 UTC - in response to Message 58166.  

The first part of this has been known for some time. There are no answers for the other two questions.

However, occasionally someone posts about getting tasks. Whether or not these tasks are valid, or some that the BOINC server code has regurgitated, isn't known.
ID: 58167 · Report as offensive     Reply Quote
CJ Xuereb

Send message
Joined: 24 Oct 16
Posts: 6
Credit: 1,866,525
RAC: 1,022
Message 58168 - Posted: 4 May 2018, 9:49:25 UTC

What happened ???

The expected happened.

Climateprediction.net must hold the honour of probably being the worst maintained project in BOINC history.
ID: 58168 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4540
Credit: 19,039,635
RAC: 18,944
Message 58169 - Posted: 4 May 2018, 11:15:26 UTC

Climateprediction.net must hold the honour of probably being the worst maintained project in BOINC history.


There is another climate science project that goes for even longer periods with apparently nothing happening. I have run it sometimes when this project is down/has no work. I am sure that a substantial increase in funding would make a difference.

As Les says, things have improved and I expect they will continue to improve as Andy continues to work on rebuilding everything. However work on the script to provide daily credit updates is clearly going to be on the back burner for a while.
ID: 58169 · Report as offensive     Reply Quote
Previous · 1 · 2 · 3 · 4 · 5 · 6 . . . 7 · Next

Message boards : Number crunching : What Happened ???

©2024 cpdn.org