Notice: Problems with PNW 'd' series Weather at Home models issued on Feb 22
Author | Message |
---|---|
Send message Joined: 17 Nov 07 Posts: 142 Credit: 4,271,370 RAC: 0 |
These models appear to have multiple issues: missing download files, and missing files within the zip files that are present and do get downloaded. See this thread in the phpBB forums. |
Send message Joined: 28 Mar 09 Posts: 126 Credit: 9,825,980 RAC: 0 |
I can second that. I received 10 of them on the 22nd of Feb, and all failed. Most of my wingmen have also errored out or haven't reported yet. I can't seem to look at the task details on the website for any of them. I am clicking on the TaskId links, but the website simply displays the CPDN logo at the top and the spinning circle indicating it's waiting on the website. Using IE9 under Win7. Links to some WUs: One Two Three. Looks like the whole batch is stuffed. I wonder if they could check 1 or 2 in a batch before they send them out? Maybe they could generate one, see if that works, and then generate the rest once it's successful. BOINC blog |
Send message Joined: 5 Sep 04 Posts: 21 Credit: 2,494,378 RAC: 2,175 |
I'm getting similar problems: 3 WUs all have computation errors within 1-3 minutes of starting. "All man born has a right to life and no man born has the right to take that life" |
Send message Joined: 15 May 09 Posts: 4540 Credit: 19,009,815 RAC: 21,293 |
The links are now opening. Looking at the first link, the tasks have two different errors. The machine running Darwin has a segmentation violation; the other two both have something pretty similar. This is the first one: Signal 11 received, exiting... |
Send message Joined: 13 Jan 07 Posts: 195 Credit: 10,581,566 RAC: 0 |
I've had 6 PNWs fail this morning. Running on Windows 7 and Intel. |
Send message Joined: 29 Sep 04 Posts: 2363 Credit: 14,611,758 RAC: 0 |
I've just had two PNW failures on Win7 and geophi's had several, probably on Linux. There's clearly something wrong with this batch. At least they don't spend much time crunching. Cpdn news |
Send message Joined: 17 Aug 04 Posts: 289 Credit: 44,103,664 RAC: 0 |
I've also had one PNW fail this morning. It failed within 1 min of starting, running on Windows 7 and Intel: hadam3p_pnw_df38_2046_1_008313166 |
Send message Joined: 29 Sep 04 Posts: 2363 Credit: 14,611,758 RAC: 0 |
I've just had another PNW crash. I knew it would probably crash at about 25sec so I started the graphics to try to see what was happening. Just a completely black window - the model didn't appear to have started crunching. Cpdn news |
Send message Joined: 15 Nov 10 Posts: 43 Credit: 6,118,949 RAC: 0 |
I had 3 that failed on one computer and 1 that failed on another, in all cases a few hours after starting. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Occasionally, some ranges of models get released with the wrong values in some of the supporting files. This leads to the model(s) in question failing when it/they get to the incorrect part. Backups: Here |
Send message Joined: 30 Jan 12 Posts: 38 Credit: 10,197,388 RAC: 0 |
I had all 25 fail on my machines. Are they going to rework these WUs and send them back out? |
Send message Joined: 29 Sep 04 Posts: 2363 Credit: 14,611,758 RAC: 0 |
ccandido, I can't see your models because your computers are hidden. Flashawk, your crashed PNW models all seem to be from the problematic batch created on 22 February. This is a nuisance, but they do crash very quickly after starting and don't use much processing time. I expect this batch of models will indeed be reworked and reissued as this is usually done when a batch doesn't run successfully as expected. Cpdn news |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
The usual way of 'reworking' a large batch of faulty WUs is to inform the external suppliers of the data in question and leave it up to them to sort it out. This can take time. In this case, they're from the University of Oregon in the USA. Backups: Here |
Send message Joined: 30 Jan 12 Posts: 38 Credit: 10,197,388 RAC: 0 |
Thanks guys, that's where my youngest son goes to school. |
Send message Joined: 15 Nov 10 Posts: 43 Credit: 6,118,949 RAC: 0 |
Currently I have 14 WUs running. Some have reached more than 20% completion, but several others failed during download or in the first hours. Let's see how these 14 will do... |
©2024 cpdn.org