Message boards : Number crunching : New work Discussion
Message board moderation
Previous · 1 . . . 9 · 10 · 11 · 12 · 13 · 14 · 15 . . . 91 · Next
Author | Message |
---|---|
Send message Joined: 15 May 09 Posts: 4535 Credit: 18,989,107 RAC: 21,788 |
Some more seem to have been put out that will run on Linux from batch 599 (Hadcm3s) but all six I have received crashed. 5 at 19 seconds across two machines and one managed a whole two minutes something! |
Send message Joined: 15 May 09 Posts: 4535 Credit: 18,989,107 RAC: 21,788 |
And all gone now. I hope some had better luck than I in running them. |
Send message Joined: 5 Aug 04 Posts: 1120 Credit: 17,202,915 RAC: 2,154 |
I got two work units a few days ago. They are running and have not crashed. They each have over 56 hours of Elapsed time on them. 11-Oct-2017 17:08:04 Starting task hadcm3s_5045_200012_168_671_011310038_0 using hadcm3s version 834 in slot 5 11-Oct-2017 17:08:04 Starting task hadcm3s_504w_200012_168_671_011310065_0 using hadcm3s version 834 in slot 6 They seem to be uploading trickles. 12-Oct-2017 16:31:07 Started upload of hadcm3s_504w_200012_168_671_011310065_0_r1419425307_1.zip 12-Oct-2017 16:31:21 Finished upload of hadcm3s_504w_200012_168_671_011310065_0_r1419425307_1.zip 12-Oct-2017 16:31:46 Started upload of hadcm3s_5045_200012_168_671_011310038_0_r830022721_1.zip 12-Oct-2017 16:31:56 Finished upload of hadcm3s_5045_200012_168_671_011310038_0_r830022721_1.zip 13-Oct-2017 15:41:18 Started upload of hadcm3s_504w_200012_168_671_011310065_0_r1419425307_2.zip 13-Oct-2017 15:42:07 Finished upload of hadcm3s_504w_200012_168_671_011310065_0_r1419425307_2.zip 13-Oct-2017 15:43:05 Started upload of hadcm3s_5045_200012_168_671_011310038_0_r830022721_2.zip 13-Oct-2017 15:43:40 Finished upload of hadcm3s_5045_200012_168_671_011310038_0_r830022721_2.zip |
Send message Joined: 16 Jan 10 Posts: 1084 Credit: 7,803,756 RAC: 5,187 |
There are 2 x 2,370 13-month ANZ at 50 km in batch #672 and batch #673 (batch list). |
Send message Joined: 16 Jan 10 Posts: 1084 Credit: 7,803,756 RAC: 5,187 |
Some 13-month Africa models at 50 km resolution have been added to the queue - 3,900 batch #674, 3,375 batch #675 and 3,375 batch #676; there are also 200 HADCM3S models (batch list). |
Send message Joined: 15 May 09 Posts: 4535 Credit: 18,989,107 RAC: 21,788 |
A couple of hundred hadcm3s tasks have been released. May be more and the rest are still being loaded into the queue. Not showing up on server stats yet and if it is only a couple of hundred they might never do so. Anyway, good news for two machines I have that have been without work since the WAH2's were withdrawn from Linux. Perhaps not such good news. 2 batch 602 on one machine crashed just under 3 minutes in. Now waiting to see what happens on other box....Three on another slightly faster machine now 7minutes in so may be OK. Will check in morning. |
Send message Joined: 7 Aug 04 Posts: 2186 Credit: 64,822,615 RAC: 5,275 |
My Phenom II picked up 5 of these new tasks from batch 602. All crashed immediately with segmentation violations. |
Send message Joined: 15 May 09 Posts: 4535 Credit: 18,989,107 RAC: 21,788 |
I picked up six across two machines. On one two crashed at once. On the other three from batch 602 and one from batch 618 are still running and past five hours in. Nothing to do with machines as last time around the machine that has 4 running crashed all it got with segmentation faults. |
Send message Joined: 15 May 09 Posts: 4535 Credit: 18,989,107 RAC: 21,788 |
One of the four has crashed, Interestingly with an invalid theta just before the sigseg fault. This a bit over 17 hours in. I don't remember seeing the two together before, though am pretty certain I do remember seeing tasks that have crashed with invalid theta on windows machines crashing with sigseg fault on Linux ones before. |
Send message Joined: 16 Jan 10 Posts: 1084 Credit: 7,803,756 RAC: 5,187 |
A small batch of 160 60-month Pacific North-West models at 25 km has been added - batch #678 (batch list). [Edit: Plus 50 x PNW25/49 in batch #679, and 400 x PNW25/60 as an extension of batch #665.] |
Send message Joined: 16 Jan 10 Posts: 1084 Credit: 7,803,756 RAC: 5,187 |
A batch of 372 21-month Pacific North-West models at 25 km has been added - batch #680 (batch list). [Edit: ... and 7,200 SAS50/3.] |
Send message Joined: 16 Jan 10 Posts: 1084 Credit: 7,803,756 RAC: 5,187 |
A batch of 3,600 3-month South Asia at 50 km has been added - batch #682 (batch list). |
Send message Joined: 15 May 09 Posts: 4535 Credit: 18,989,107 RAC: 21,788 |
And now some PNW tasks batch 683. Still none for us penguin types though. |
Send message Joined: 5 Aug 04 Posts: 1120 Credit: 17,202,915 RAC: 2,154 |
Still none for us penguin types though. I tried to set up my BOINC client to run 50% of my machine's spare time on Climate prediction. But since I run Linux, I seldom run any climate prediction at all these days, but not for lack of trying. I used to run three Climate prediction tasks at a time, month-in, month-out, but not in a long time now. |
Send message Joined: 16 Jan 10 Posts: 1084 Credit: 7,803,756 RAC: 5,187 |
A varied selection of batches has been released in the last few days: 10 x WUS25/120, 1650 x NAM50/13 and the return of East Asia with 500 x EAS50/12 (batch list). |
Send message Joined: 4 Oct 13 Posts: 27 Credit: 2,301,681 RAC: 7,632 |
Three of my most recently downloaded batch of 11 models have crashed. These models also crashed for my "wingmen" if that is an accurate term to use at CPDN. Task 11361810 - wah2 Signal 11 received: Segment violation Task 11339935 - pnw25 Unknown error Task 11278191 - wah2 Signal 4 received: Illegal instruction - invalid function image Signal 4 received: Floating point exception Signal 4 received: Segment violation Not sure what is going on. There have been no interruptions in processing at all (i.e., suspends, reboots, etc.). |
Send message Joined: 15 May 09 Posts: 4535 Credit: 18,989,107 RAC: 21,788 |
At least two different problems here, one from batch 683 is a create thread error which I think is a dodgy line in one of the files for the task. The segmentation error in batch 686 I don't think the root has been found of that one, I did a quick root around but it may be too early to see if any of these tasks are completing. |
Send message Joined: 16 Jan 10 Posts: 1084 Credit: 7,803,756 RAC: 5,187 |
Batch #687 has just been added and has 1,600 Central America 13-month models at 50 km resolution - i.e. CAM50/13 (batch list). |
Send message Joined: 15 May 09 Posts: 4535 Credit: 18,989,107 RAC: 21,788 |
A few thousand pnw tasks released, batch 688. Thinking about going back to WINE :( |
Send message Joined: 16 Jan 10 Posts: 1084 Credit: 7,803,756 RAC: 5,187 |
A new format, Central America at 25 km for 18 months - batch #689, 325 off (batch list). [Edit: plus 780 x EAS50/12.] |
©2024 cpdn.org