climateprediction.net (CPDN) home page
Thread 'Problem at upload?'

Thread 'Problem at upload?'

Message boards : Number crunching : Problem at upload?
Message board moderation

To post messages, you must log in.

AuthorMessage
ProfileJIM

Send message
Joined: 31 Dec 07
Posts: 1152
Credit: 22,363,583
RAC: 5,022
Message 38035 - Posted: 27 Sep 2009, 4:41:18 UTC

I am not sure whether I have a problem or not. I finished the WU hadsm3mh_kpbr_006311059_2. At the end the “to completeion” froze at 1:43 and the “status“ now reads “computation error.” However, the timestep count (in graphics) went all the way to 259248 and went into “post processing,” The messages (shown below) seem to show that the zip file uploaded successfully. Did the server get the results of the WU? If it didn’t I still have a backup copy that I can restore and run it to the end again.

9/26/2009 10:58:26 PM climateprediction.net Started upload of hadsm3mh_kpbr_006311059_2_4.zip
9/26/2009 10:58:26 PM climateprediction.net Sending scheduler request: To send trickle-up message.
9/26/2009 10:58:26 PM climateprediction.net Not reporting or requesting tasks
9/26/2009 10:58:31 PM climateprediction.net Scheduler request completed
9/26/2009 10:58:47 PM climateprediction.net Finished upload of hadsm3mh_kpbr_006311059_2_4.zip
9/26/2009 11:02:45 PM climateprediction.net Computation for task hadsm3mh_kpbr_006311059_2 finished
9/26/2009 11:02:45 PM climateprediction.net Restarting task hadsm3fub_jrtr_006402201_0 using hadsm3 version 607


ID: 38035 · Report as offensive     Reply Quote
Profilemo.v
Volunteer moderator
Avatar

Send message
Joined: 29 Sep 04
Posts: 2363
Credit: 14,611,758
RAC: 0
Message 38037 - Posted: 27 Sep 2009, 9:23:45 UTC
Last modified: 27 Sep 2009, 9:24:01 UTC

The task is here. It looks good to me, all timesteps present and correct plus all graphs which must mean that all the files were received and valid.

I think the application version and stderrout should now show and don\'t. Even if they don\'t this is a Boinc thing and not the first time we\'ve seen strange things on a model\'s web page. Maybe they\'ll catch up later. IIRC the model\'s also still classified as \'New\' but I wouldn\'t worry about that because we\'ve seen truly crashed models classified as successes. Well, if a model\'s still supposedly new the app version and stderr out can\'t show.

It looks to me as if everything the researchers need is there.
Cpdn news
ID: 38037 · Report as offensive     Reply Quote
ProfileJIM

Send message
Joined: 31 Dec 07
Posts: 1152
Credit: 22,363,583
RAC: 5,022
Message 38040 - Posted: 28 Sep 2009, 1:00:40 UTC - in response to Message 38037.  

The task is here. It looks good to me, all timesteps present and correct plus all graphs which must mean that all the files were received and valid.

I think the application version and stderrout should now show and don\'t. Even if they don\'t this is a Boinc thing and not the first time we\'ve seen strange things on a model\'s web page. Maybe they\'ll catch up later. IIRC the model\'s also still classified as \'New\' but I wouldn\'t worry about that because we\'ve seen truly crashed models classified as successes. Well, if a model\'s still supposedly new the app version and stderr out can\'t show.

It looks to me as if everything the researchers need is there.


Dear Mo:


Thanks for the help. It is good to know that the upload seems to have been excepted. I think that I may have found the reason for the “computation error” status of the WU. Below find details from my account page. It seem to say that there are “errors Too many total results.” Would this cause the problem.

Workunit details
application UK Met Office HADSM3 Mid-Holocene
created 17 Dec 1973 22:42:41 UTC
name hadsm3mh_kpbr_006311059
minimum quorum 1
initial replication 9
max # of error/total/success tasks 2, 5, 5
errors Too many total results
validation Pending
Task ID
click for details Computer Sent Time reported or deadline
explain Server state
explain Outcome
explain Client state


ID: 38040 · Report as offensive     Reply Quote
wateroakley

Send message
Joined: 6 Aug 04
Posts: 195
Credit: 28,402,184
RAC: 10,199
Message 38041 - Posted: 28 Sep 2009, 8:25:20 UTC

Jim,
There is a lot of post-phase processing in this model type. Notwithstanding the error messages in stderr, the graphs show a classic result. All 4 phase graphs are there and the credit is right. You can also see the middleware affecting the results by a small amount when comparing precipitation and temperature graphs of your task 9331754 AMD/XP and the other completed task 9331752 Intel/Linux.
ID: 38041 · Report as offensive     Reply Quote
Profilemo.v
Volunteer moderator
Avatar

Send message
Joined: 29 Sep 04
Posts: 2363
Credit: 14,611,758
RAC: 0
Message 38042 - Posted: 28 Sep 2009, 8:53:15 UTC

\'Too many total results\' doesn\'t affect in any way the models already sent out.
Cpdn news
ID: 38042 · Report as offensive     Reply Quote
ProfileJIM

Send message
Joined: 31 Dec 07
Posts: 1152
Credit: 22,363,583
RAC: 5,022
Message 39204 - Posted: 9 Mar 2010, 18:34:57 UTC

I don’t know if I have a problem or not? I had 2 AM3P’s finish at the same time and try to upload simultaneously. I think all of the zip files uploaded properly for both WU’s, but, am not sure. I have included the messages below:

3/9/2010 11:48:51 AM climateprediction.net Computation for task hadam3p_mzq5_1982_2_1006538159_4 finished
3/9/2010 11:48:51 AM climateprediction.net Restarting task hadam3p_mtoe_1986_2_1006530320_5 using hadam3p version 614
3/9/2010 11:48:54 AM climateprediction.net Started upload of hadam3p_mzq5_1982_2_1006538159_4_1.zip
3/9/2010 11:48:54 AM climateprediction.net Started upload of hadam3p_mzq5_1982_2_1006538159_4_2.zip
3/9/2010 11:49:34 AM climateprediction.net Sending scheduler request: To send trickle-up message.
3/9/2010 11:49:34 AM climateprediction.net Not reporting or requesting tasks
3/9/2010 11:49:39 AM climateprediction.net Scheduler request completed
3/9/2010 11:49:45 AM climateprediction.net Computation for task hadam3p_n1sr_1982_2_1006540845_4 finished
3/9/2010 11:49:45 AM climateprediction.net Restarting task hadam3p_mvoq_1964_2_1006532924_5 using hadam3p version 614
3/9/2010 11:51:32 AM climateprediction.net Finished upload of hadam3p_mzq5_1982_2_1006538159_4_2.zip
3/9/2010 11:51:32 AM climateprediction.net Started upload of hadam3p_mzq5_1982_2_1006538159_4_3.zip
3/9/2010 11:51:36 AM climateprediction.net Finished upload of hadam3p_mzq5_1982_2_1006538159_4_3.zip
3/9/2010 11:51:36 AM climateprediction.net Started upload of hadam3p_n1sr_1982_2_1006540845_4_1.zip
3/9/2010 11:52:20 AM climateprediction.net Finished upload of hadam3p_mzq5_1982_2_1006538159_4_1.zip
3/9/2010 11:52:20 AM climateprediction.net Started upload of hadam3p_n1sr_1982_2_1006540845_4_2.zip
3/9/2010 11:54:55 AM climateprediction.net Finished upload of hadam3p_n1sr_1982_2_1006540845_4_1.zip
3/9/2010 11:54:55 AM climateprediction.net Started upload of hadam3p_n1sr_1982_2_1006540845_4_3.zip
3/9/2010 11:54:59 AM climateprediction.net Finished upload of hadam3p_n1sr_1982_2_1006540845_4_3.zip
3/9/2010 11:55:02 AM climateprediction.net Finished upload of hadam3p_n1sr_1982_2_1006540845_4_2.zip
3/9/2010 12:29:50 PM climateprediction.net task hadam3p_mtoe_1986_2_1006530320_5 suspended by user
3/9/2010 12:29:57 PM climateprediction.net task hadam3p_mtnw_1981_2_1006530302_4 resumed by user
3/9/2010 12:29:57 PM climateprediction.net Restarting task hadam3p_mtnw_1981_2_1006530302_4 using hadam3p version 614

Did all 6 zip file (3 per WU) upload properly? If not I still have a backup that I could run to the end again.
ID: 39204 · Report as offensive     Reply Quote
ProfileThyme Lawn
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1283
Credit: 15,824,334
RAC: 0
Message 39205 - Posted: 9 Mar 2010, 19:01:26 UTC - in response to Message 39204.  
Last modified: 9 Mar 2010, 19:02:15 UTC

Did all 6 zip file (3 per WU) upload properly? If not I still have a backup that I could run to the end again.

Yes, they all uploaded with no problems Jim. Here they are with the start and finish messages paired off:

3/9/2010 11:48:54 AM climateprediction.net Started upload of hadam3p_mzq5_1982_2_1006538159_4_1.zip
3/9/2010 11:52:20 AM climateprediction.net Finished upload of hadam3p_mzq5_1982_2_1006538159_4_1.zip

3/9/2010 11:48:54 AM climateprediction.net Started upload of hadam3p_mzq5_1982_2_1006538159_4_2.zip
3/9/2010 11:51:32 AM climateprediction.net Finished upload of hadam3p_mzq5_1982_2_1006538159_4_2.zip

3/9/2010 11:51:32 AM climateprediction.net Started upload of hadam3p_mzq5_1982_2_1006538159_4_3.zip
3/9/2010 11:51:36 AM climateprediction.net Finished upload of hadam3p_mzq5_1982_2_1006538159_4_3.zip

3/9/2010 11:51:36 AM climateprediction.net Started upload of hadam3p_n1sr_1982_2_1006540845_4_1.zip
3/9/2010 11:54:55 AM climateprediction.net Finished upload of hadam3p_n1sr_1982_2_1006540845_4_1.zip

3/9/2010 11:52:20 AM climateprediction.net Started upload of hadam3p_n1sr_1982_2_1006540845_4_2.zip
3/9/2010 11:55:02 AM climateprediction.net Finished upload of hadam3p_n1sr_1982_2_1006540845_4_2.zip

3/9/2010 11:54:55 AM climateprediction.net Started upload of hadam3p_n1sr_1982_2_1006540845_4_3.zip
3/9/2010 11:54:59 AM climateprediction.net Finished upload of hadam3p_n1sr_1982_2_1006540845_4_3.zip
"The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer
ID: 39205 · Report as offensive     Reply Quote
ProfileJIM

Send message
Joined: 31 Dec 07
Posts: 1152
Credit: 22,363,583
RAC: 5,022
Message 39206 - Posted: 9 Mar 2010, 19:16:40 UTC - in response to Message 39205.  

Did all 6 zip file (3 per WU) upload properly? If not I still have a backup that I could run to the end again.

Yes, they all uploaded with no problems Jim. Here they are with the start and finish messages paired off:

3/9/2010 11:48:54 AM climateprediction.net Started upload of hadam3p_mzq5_1982_2_1006538159_4_1.zip
3/9/2010 11:52:20 AM climateprediction.net Finished upload of hadam3p_mzq5_1982_2_1006538159_4_1.zip

3/9/2010 11:48:54 AM climateprediction.net Started upload of hadam3p_mzq5_1982_2_1006538159_4_2.zip
3/9/2010 11:51:32 AM climateprediction.net Finished upload of hadam3p_mzq5_1982_2_1006538159_4_2.zip

3/9/2010 11:51:32 AM climateprediction.net Started upload of hadam3p_mzq5_1982_2_1006538159_4_3.zip
3/9/2010 11:51:36 AM climateprediction.net Finished upload of hadam3p_mzq5_1982_2_1006538159_4_3.zip

3/9/2010 11:51:36 AM climateprediction.net Started upload of hadam3p_n1sr_1982_2_1006540845_4_1.zip
3/9/2010 11:54:55 AM climateprediction.net Finished upload of hadam3p_n1sr_1982_2_1006540845_4_1.zip

3/9/2010 11:52:20 AM climateprediction.net Started upload of hadam3p_n1sr_1982_2_1006540845_4_2.zip
3/9/2010 11:55:02 AM climateprediction.net Finished upload of hadam3p_n1sr_1982_2_1006540845_4_2.zip

3/9/2010 11:54:55 AM climateprediction.net Started upload of hadam3p_n1sr_1982_2_1006540845_4_3.zip
3/9/2010 11:54:59 AM climateprediction.net Finished upload of hadam3p_n1sr_1982_2_1006540845_4_3.zip


thank you! It is nice to know that after all the crunching the results are getting back to the servers properly.
ID: 39206 · Report as offensive     Reply Quote
ProfileJIM

Send message
Joined: 31 Dec 07
Posts: 1152
Credit: 22,363,583
RAC: 5,022
Message 39454 - Posted: 2 Apr 2010, 15:35:19 UTC

Is there a problem with the servers? I just finished an AM3P and all three zip files are presently stuck in the “transfer” tab. The on “status” on zip file 1 and 2 reads “uploading”. I also can’t download new work. I have the “FAMOUS” model selected and the “server status” indicates that there are over 100 of them available, but, messages tells me none are available and to make another selection.

ID: 39454 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 39455 - Posted: 2 Apr 2010, 15:49:04 UTC
Last modified: 2 Apr 2010, 15:54:37 UTC

The servers may have filled up again, in which case the programs will shut down,
and Milo will get emails about the problem.
\'Murphy\' always likes to cause this to happen on public holidays. :)

As for FAMOUS models, you also need to look at the Applications page, which shows that the program associated with that model type is no longer there.

Because of the mutterings about the stability / reliability of the FAMOUS models, the application has been withdrawn, and will be replaced by the next version. This now won\'t happen until after Easter, when some more testing will also have been done.
Backups: Here
ID: 39455 · Report as offensive     Reply Quote
Profilemo.v
Volunteer moderator
Avatar

Send message
Joined: 29 Sep 04
Posts: 2363
Credit: 14,611,758
RAC: 0
Message 39456 - Posted: 2 Apr 2010, 18:55:01 UTC

Jim, Milo says the two servers that HadAM3P uploads files to both appear to be working normally.
Cpdn news
ID: 39456 · Report as offensive     Reply Quote
ProfileJIM

Send message
Joined: 31 Dec 07
Posts: 1152
Credit: 22,363,583
RAC: 5,022
Message 39458 - Posted: 2 Apr 2010, 20:10:50 UTC

Dear Mo:

Sorry for the above post. The problem seems to have been a connectivity problem at my end. Rebooting modem and router seems to have fixed the problem. Mea Culpa.

ID: 39458 · Report as offensive     Reply Quote

Message boards : Number crunching : Problem at upload?

©2024 cpdn.org