Message boards : Number crunching : News and Announcements
Message board moderation
Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 . . . 11 · Next
Author | Message |
---|---|
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Just for completeness in this News thread. Posted Friday June 3, by Jonathan, the projects Sysadmin: Bad news here - there has been a water leak in the offices, which means that the electricity is to be turned off for some hours at 1 pm GMT. So that's it for the day and the week. :( Backups: Here |
Send message Joined: 5 Aug 04 Posts: 1283 Credit: 15,824,334 RAC: 0 |
CPDN Main Project Jonathan posted the following update on the phpBB forum: The firewall alterations mentioned above were made at 11:15 BST today, so downloads from manticore should be taking place now. Some users might find that downloads continue to fail until they restart BOINC (that was certainly the case for me). "The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Two new batches of the hadcm3N were created in the last few days. One of these was a control set, the other included forcings. One of the forcings was a bit overly enthusiastic, and has/is causing models to fail after a few seconds. The project people know about this, and there's no need to report these failures. More thumb twiddling time. :) Backups: Here |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
A new batch of 3,000 RAPID-RAPIT models are currently being created and released. This is for a short term project, and the results are needed soonest. The models are 40 years long, (if they complete :) ). If your computer hasn't run a long climate model before, BOINC may be shocked into 'panic mode'. Don't worry; the time-to-completion will drop fairly fast as BOINC learns. DON'T abort one of them just because you think that it'll take too long. Just keep calm and carry on. Or, at least, let BOINC carry on. :) Some that I picked up overnight are saying 970-980 hours. This is about 40 days, which is about right for my machines for these long models. Backups: Here |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
A new problem has appeared for the latest batch of hadcm3N models, which may fail at about 13 hours in. :( More details when details are known. No need to start posting about it. :) |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
The next phase of the RAPIT/RAPID models are now being auto-generated from the completed models from the first phase. Note 1: They're being grabbed as soon as they appear, and the one hour back off still applies to prevent a few computers from hogging all of the work. Note 2: Attempts to use the Update button to speed things up will have the opposite effect - the timer will be reset to 1 hour (+/-), and you'll be back to square one. ************************ More of the Regional models will be available Real Soon Now, for people who prefer shorter models. Backups: Here |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
To make it a bit more "official", I'm re-posting part of one of my posts from another thread:
Basically, there's little to no work from this project at the moment. Backups: Here |
Send message Joined: 5 Aug 04 Posts: 1283 Credit: 15,824,334 RAC: 0 |
CPDN Main Project After a number of false starts with faulty greenhouse gas (GHG) forcing parameters the second phase of the RAPIT project has now started in earnest. HadCM3N tasks have names in the format hadcm3n_{umid}_{start year}_40_* (where 40 is the number of model years the task runs for). {umid} is a 4 character universal model identity, with the first character being the main indicator of the type of model being run as follows:
"The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
|
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
The recently released regional models were of two types: Some new ones, which are failing for reasons as yet unknown, and Some auto-regen models, which are apparently running OK. This is confirmed by a few reports. Backups: Here |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
A new batch of regional models were created over the weekend, all of them regens from previously completed work, so they should be OK. Backups: Here |
Send message Joined: 5 Aug 04 Posts: 1283 Credit: 15,824,334 RAC: 0 |
Mac users on CPDN Main and Beta Projects We have evidence from a number of users that all CPDN applications can start failing immediately after an upgrade to BOINC 6.12.26. This appears to be related to a permissions change which makes it impossible for the controller process to launch the worker. Resetting the project (or detaching and reattaching) should fix the problem. See here for further details. "The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Climateapps2 has been (mostly) restored, with an upgrade to both the OS and the BOINC server software. It was complicated by the server being located in a room of a different department, and needing the IT person from there to do the work. Hopefully things will be more stable for a while. :) Backups: Here |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
|
Send message Joined: 31 Oct 04 Posts: 336 Credit: 3,316,482 RAC: 0 |
The BOINC server IP seems to have changed, if you still experience problems connecting to the scheduler, you might need to restart your BOINC client. Allem Anschein nach hat sich die IP des BOINC-Servers geaendert, wenn weiterhin Probleme beim Schedulerkontakt auftreten, muesst Ihr evtl. Euren BOINC-Client einmal durchstarten. edit : This fixes only the "Couldn't connect to server" error, server side errors cannot be fixed that easily. Das behebt nur den Fehler "Couldn't connect to server", Fehler auf Serverseite kann man leider nicht so leicht beheben. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
The recent 2 batches of regional models, (pnw and then eu), have a problem with trying to return the final file, zip13. This is the restart file, and is intended to go to climateapps1.oucs. This is one of the servers that was moved from the oerc department to the oucs department. Redirects were put in place for the present, but the one to climateaps1 isn't working. This is resulting in HTTP errors when attempting to upload zip13 files. To fix this, an edit of client_state.xml is necessary. ======================== Suspend BOINC in the manager, and then exit from both the manager and the client parts. With a plain text editor, e.g. Notepad, open client_state.xml. Locate the uploader section of the pnw/eu model. Keep "locating" until you reach the one for zip13. It's in the <file_info> section for each model, just after <upload_when_present/>. DON'T touch the second one! It's in a signed (security) section! Change the 4 characters in the string uploader.oerc.ox.ac.uk from oerc to oucs Do this for all pnw/eu models THAT ARE RUNNING. Save the file Restart BOINC. The files should now upload. (Been there, done that, as they say. ) ======================== Don't bother with pnw models that haven't started yet. They have another problem, and should be aborted. (They're going to be regenerated.) (The project may use a Killer trickle on these unstarted models.) Also abort them if they've been started, but you haven't applied the fpops fix to them. This is getting complicated, so feel free to post. The big problem is the people who don't read either of the boards. Backups: Here |
Send message Joined: 5 Aug 04 Posts: 1283 Credit: 15,824,334 RAC: 0 |
Change the 4 characters in the string uploader.oerc.ox.ac.uk from oerc to oucs A DNS redirection is now in place and the final upload for HadAM3P regional models (the *_13.zip file) should now work without this change. If you continue to have problems uploading the final upload file please let us know. "The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
I've just finished uploading the last file (zip13), for an eu model, so the DNS redirect is working. Backups: Here |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Posts are starting to appear asking about the next lot of work, so this may be best answered as a news item. At present there are only 2 groups of researchers, the RAPIT group, and the dual resolution (regional models) group. The original models for both groups were created and sent out ages ago, and, as they get returned, the next step in the series is automatically created by an 'auto-regen' program. Only those models that make it all the way to the end will be continued to the next stage. Those that become unstable and fail, and those that are abandoned or aborted won't go any further. Occasionally, the researchers may ask for more new series to be created, if it looks like there won't be enough making it to form a good sample. With only a few thousand models, and 35,000+ computers connected, you just need to be patient if you're after work. Backups: Here |
Send message Joined: 29 Sep 04 Posts: 2363 Credit: 14,611,758 RAC: 0 |
Some model files have recently failed to upload when the CPDN upload server rejected them because of an invalid signature. This has happened since Boinc changed the procedure regarding the way files are (or are not) signed. CPDN complied with the requested change but did not realise at first that the change applies to every upload server. All files from all model types should now be accepted by all the upload servers. Cpdn news |
©2025 cpdn.org