climateprediction.net home page
Uploading files fails

Uploading files fails

Message boards : Number crunching : Uploading files fails
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · Next

AuthorMessage
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 62499 - Posted: 25 May 2020, 11:53:31 UTC

They're working on it.
Things may progress faster when the May Bank Holiday is over, and Andy is back.

The 3 new batches, 870, 871, & 872 are still open.
ID: 62499 · Report as offensive     Reply Quote
Fons

Send message
Joined: 25 Aug 18
Posts: 4
Credit: 160,583
RAC: 0
Message 62500 - Posted: 25 May 2020, 13:10:39 UTC - in response to Message 62474.  

The "BOINC deadline" is long to prevent cpdn from hogging the processors, when other very much shorter projects are run at the same time.
This has been posted about for years.

The "deadline" here is, ASAP, so the researchers can do their analysis of the data.
And when they get back enough results, the batch is usually closed to free up server space.

Anyone who hasn't returned their results in a timely manner just misses out.


Thanks for the answer
As my computer isnt on on a daily basis, i am concluding this is not the project for me. And ist saddens me to have to have to delete my data. That it is posted on a forum that computingtime is a.s.a.p. doesnt make it very clear to users like me. Thank you for resurgsing the climate. It is needed, sadly i cant help.

with regards,
Fons
ID: 62500 · Report as offensive     Reply Quote
Profile JIM

Send message
Joined: 31 Dec 07
Posts: 1152
Credit: 22,363,583
RAC: 5,022
Message 62501 - Posted: 25 May 2020, 18:57:06 UTC - in response to Message 62500.  

[quote]The "BOINC deadline" is long to prevent cpdn from hogging the processors, when other very much shorter projects are run at the same time.
This has been posted about for years.

A shorter deadline, something on the order of 4 to 5 months, would seem to be in order as that seems to be how long the Researchers are willing to wait before they close the batches. There is no sense in running models that no one is ever going to look at because the batch was closed by the time you finished it.
ID: 62501 · Report as offensive     Reply Quote
Jim1348

Send message
Joined: 15 Jan 06
Posts: 637
Credit: 26,751,529
RAC: 653
Message 62502 - Posted: 25 May 2020, 19:49:05 UTC - in response to Message 62501.  

A shorter deadline, something on the order of 4 to 5 months, would seem to be in order as that seems to be how long the Researchers are willing to wait before they close the batches. There is no sense in running models that no one is ever going to look at because the batch was closed by the time you finished it.

Yes, that is my point too. The one-year deadline is just a waste of resources.
I think they do things just because that is the way they have always been done. For a science project, it is a bit curious.
ID: 62502 · Report as offensive     Reply Quote
The Real Weasle

Send message
Joined: 27 Aug 04
Posts: 16
Credit: 5,367,970
RAC: 3,235
Message 62503 - Posted: 25 May 2020, 21:01:00 UTC

Not sure what, but something still not working. Fresh W/Us, PCs on 24/7... and nothing. Can't upload the trickles. Sometimes they get to 100% but then... nothing.


25/05/2020 18:45:58 | climateprediction.net | [fxd] starting upload, upload_offset -1
25/05/2020 18:45:58 | climateprediction.net | Started upload of wah2_anz50_11ky_209412_32_870_012020738_0_r2127971084_7.zip
25/05/2020 18:45:58 | climateprediction.net | [file_xfer] URL: http://upload4.cpdn.org/cgi-bin/file_upload_handler
25/05/2020 18:47:24 | climateprediction.net | [file_xfer] http op done; retval 0 (Success)
25/05/2020 18:47:24 | climateprediction.net | [file_xfer] parsing upload response: <data_server_reply> <status>0</status> <file_size>0</file_size></data_server_reply>
25/05/2020 18:47:24 | climateprediction.net | [file_xfer] parsing status: 0
25/05/2020 18:47:24 | climateprediction.net | [fxd] starting upload, upload_offset 0
25/05/2020 18:58:05 | climateprediction.net | [file_xfer] http op done; retval 0 (Success)
25/05/2020 18:58:05 | climateprediction.net | [error] Error reported by file upload server: EOF on socket read : asked for 262144, got 159136
25/05/2020 18:58:05 | climateprediction.net | [file_xfer] parsing upload response: <data_server_reply> <status>1</status> <message>EOF on socket read : asked for 262144, got 159136</message></data_server_reply>
25/05/2020 18:58:05 | climateprediction.net | [file_xfer] parsing status: -127
25/05/2020 18:58:05 | climateprediction.net | [file_xfer] file transfer status -127 (transient upload error)
25/05/2020 18:58:05 | climateprediction.net | Temporarily failed upload of wah2_anz50_11ky_209412_32_870_012020738_0_r2127971084_7.zip: transient upload error
25/05/2020 18:58:05 | climateprediction.net | [file_xfer] project-wide xfer delay for 14418.677328 sec
25/05/2020 18:58:05 | climateprediction.net | Backing off 00:02:03 on upload of wah2_anz50_11ky_209412_32_870_012020738_0_r2127971084_7.zip
25/05/2020 18:59:39 | climateprediction.net | [fxd] starting upload, upload_offset -1
25/05/2020 18:59:39 | climateprediction.net | Started upload of wah2_anz50_10qp_209112_32_870_012019649_2_r2062308319_7.zip
25/05/2020 18:59:39 | climateprediction.net | [file_xfer] URL: http://upload4.cpdn.org/cgi-bin/file_upload_handler
25/05/2020 19:04:46 | climateprediction.net | [file_xfer] http op done; retval -184 (transient HTTP error)
25/05/2020 19:04:46 | climateprediction.net | [file_xfer] file transfer status -184 (transient HTTP error)
25/05/2020 19:04:46 | climateprediction.net | Temporarily failed upload of wah2_anz50_10qp_209112_32_870_012019649_2_r2062308319_7.zip: transient HTTP error
25/05/2020 19:04:46 | climateprediction.net | [file_xfer] project-wide xfer delay for 14346.494949 sec
25/05/2020 19:04:46 | climateprediction.net | Backing off 00:02:49 on upload of wah2_anz50_10qp_209112_32_870_012019649_2_r2062308319_7.zip
25/05/2020 19:04:47 | | Project communication failed: attempting access to reference site
25/05/2020 19:04:48 | | Internet access OK - project servers may be temporarily down.
ID: 62503 · Report as offensive     Reply Quote
Dr Who Fan
Avatar

Send message
Joined: 30 May 05
Posts: 12
Credit: 689,799
RAC: 282
Message 62504 - Posted: 25 May 2020, 22:59:53 UTC - in response to Message 62503.  

Not sure what, but something still not working. Fresh W/Us, PCs on 24/7... and nothing. Can't upload the trickles. Sometimes they get to 100% but then... nothing.

Did you read what Les Bayliss / Volunteer moderator posted several posts prior above that this has been addressed and will be looked into after the responsibile people return on Tuesday.

ID: 62504 · Report as offensive     Reply Quote
Profile JIM

Send message
Joined: 31 Dec 07
Posts: 1152
Credit: 22,363,583
RAC: 5,022
Message 62506 - Posted: 26 May 2020, 1:39:51 UTC - in response to Message 62502.  

The year long deadline may be partly historical. Back when this project started the slow computers (1.2 GHz) single core machines took 8 or 9 months (running 24/7) to complete the very long 160 year models. They needed the long deadlines. Now, not so much.
ID: 62506 · Report as offensive     Reply Quote
DerManiak

Send message
Joined: 25 Sep 17
Posts: 1
Credit: 212,512
RAC: 0
Message 62508 - Posted: 26 May 2020, 6:55:04 UTC
Last modified: 26 May 2020, 6:55:56 UTC

Just to add to the observations, in case this could be helpful. I have 5 zip files 33 MB in size of wah2 (I think what is called 'trickles' above?) that keep failing to upload the past few days.
From the log:

26/05/2020 08:35:29 | climateprediction.net | [fxd] starting upload, upload_offset -1
26/05/2020 08:35:29 | climateprediction.net | Started upload of wah2_anz50_307u_208912_32_872_012025270_0_r2132121344_1.zip
26/05/2020 08:35:29 | climateprediction.net | [file_xfer] URL: http://upload4.cpdn.org/cgi-bin/file_upload_handler
26/05/2020 08:35:29 | climateprediction.net | [fxd] starting upload, upload_offset -1
26/05/2020 08:35:29 | climateprediction.net | Started upload of wah2_anz50_307u_208912_32_872_012025270_0_r2132121344_2.zip
26/05/2020 08:35:29 | climateprediction.net | [file_xfer] URL: http://upload4.cpdn.org/cgi-bin/file_upload_handler
26/05/2020 08:36:00 | | Suspending computation - CPU is busy
26/05/2020 08:36:10 | | Resuming computation
26/05/2020 08:40:56 | climateprediction.net | [file_xfer] http op done; retval -184 (transient HTTP error)
26/05/2020 08:40:56 | climateprediction.net | [file_xfer] http op done; retval -184 (transient HTTP error)
26/05/2020 08:40:56 | climateprediction.net | [file_xfer] file transfer status -184 (transient HTTP error)
26/05/2020 08:40:56 | climateprediction.net | Temporarily failed upload of wah2_anz50_307u_208912_32_872_012025270_0_r2132121344_1.zip: transient HTTP error
26/05/2020 08:40:56 | climateprediction.net | Backing off 05:42:41 on upload of wah2_anz50_307u_208912_32_872_012025270_0_r2132121344_1.zip
26/05/2020 08:40:56 | climateprediction.net | [file_xfer] file transfer status -184 (transient HTTP error)
26/05/2020 08:40:56 | climateprediction.net | Temporarily failed upload of wah2_anz50_307u_208912_32_872_012025270_0_r2132121344_2.zip: transient HTTP error
26/05/2020 08:40:56 | climateprediction.net | Backing off 02:45:29 on upload of wah2_anz50_307u_208912_32_872_012025270_0_r2132121344_2.zip
26/05/2020 08:40:56 | climateprediction.net | [fxd] starting upload, upload_offset -1
26/05/2020 08:40:56 | climateprediction.net | Started upload of wah2_anz50_307u_208912_32_872_012025270_0_r2132121344_3.zip
26/05/2020 08:40:56 | climateprediction.net | [file_xfer] URL: http://upload4.cpdn.org/cgi-bin/file_upload_handler
26/05/2020 08:40:56 | climateprediction.net | [fxd] starting upload, upload_offset -1
26/05/2020 08:40:56 | climateprediction.net | Started upload of wah2_anz50_307u_208912_32_872_012025270_0_r2132121344_4.zip
26/05/2020 08:40:56 | climateprediction.net | [file_xfer] URL: http://upload4.cpdn.org/cgi-bin/file_upload_handler
26/05/2020 08:40:57 | | Project communication failed: attempting access to reference site
26/05/2020 08:40:58 | | Internet access OK - project servers may be temporarily down.
26/05/2020 08:42:11 | | Project communication failed: attempting access to reference site
26/05/2020 08:42:11 | climateprediction.net | [file_xfer] http op done; retval -184 (transient HTTP error)
26/05/2020 08:42:11 | climateprediction.net | [file_xfer] http op done; retval -184 (transient HTTP error)
26/05/2020 08:42:11 | climateprediction.net | [file_xfer] file transfer status -184 (transient HTTP error)
26/05/2020 08:42:11 | climateprediction.net | Temporarily failed upload of wah2_anz50_307u_208912_32_872_012025270_0_r2132121344_3.zip: transient HTTP error
26/05/2020 08:42:11 | climateprediction.net | [file_xfer] project-wide xfer delay for 905.008087 sec
26/05/2020 08:42:11 | climateprediction.net | Backing off 00:24:20 on upload of wah2_anz50_307u_208912_32_872_012025270_0_r2132121344_3.zip
26/05/2020 08:42:11 | climateprediction.net | [file_xfer] file transfer status -184 (transient HTTP error)
26/05/2020 08:42:11 | climateprediction.net | Temporarily failed upload of wah2_anz50_307u_208912_32_872_012025270_0_r2132121344_4.zip: transient HTTP error
26/05/2020 08:42:11 | climateprediction.net | [file_xfer] project-wide xfer delay for 1721.800592 sec
26/05/2020 08:42:11 | climateprediction.net | Backing off 00:13:22 on upload of wah2_anz50_307u_208912_32_872_012025270_0_r2132121344_4.zip
26/05/2020 08:42:12 | | Internet access OK - project servers may be temporarily down.
ID: 62508 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 62512 - Posted: 26 May 2020, 11:51:45 UTC

The server down at the bottom of the planet is being attacked by thousands of Windows computers, and is getting a bit battered and bruised.
It may take some time to recovery, so Patience.

Ommmmmm ...
ID: 62512 · Report as offensive     Reply Quote
Profile JIM

Send message
Joined: 31 Dec 07
Posts: 1152
Credit: 22,363,583
RAC: 5,022
Message 62514 - Posted: 26 May 2020, 14:33:23 UTC - in response to Message 62512.  

The server down at the bottom of the planet is being attacked by thousands of Windows computers, and is getting a bit battered and bruised.
It may take some time to recovery, so Patience.

Ommmmmm ...


it might be a good idea if everyone reading this was to suspend uploads for a day or so, so as to take some of the pressure off the server while it gets caught up. Nothing will be lost.
ID: 62514 · Report as offensive     Reply Quote
Jim1348

Send message
Joined: 15 Jan 06
Posts: 637
Credit: 26,751,529
RAC: 653
Message 62515 - Posted: 26 May 2020, 15:02:37 UTC - in response to Message 62514.  

I expect that the server uploads at some fixed rate. Whether that is a few people fast or a lot slowly probably won't affect the total time much.
They can tell us otherwise if they want to.
ID: 62515 · Report as offensive     Reply Quote
Profile Dave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4535
Credit: 18,976,682
RAC: 21,948
Message 62516 - Posted: 27 May 2020, 7:12:01 UTC - in response to Message 62512.  

The server down at the bottom of the planet is being attacked by thousands of Windows computers, and is getting a bit battered and bruised.
It may take some time to recovery, so Patience.

Ommmmmm ...


May be a bit more than the server getting a bit battered and bruised. None of #870 the first of these to go out have been reported as completed yet.
ID: 62516 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 62517 - Posted: 27 May 2020, 8:06:45 UTC

Plenty of time yet.
They've only been running long enough for "weather", not for "climate." :)
ID: 62517 · Report as offensive     Reply Quote
Profile Dave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4535
Credit: 18,976,682
RAC: 21,948
Message 62518 - Posted: 27 May 2020, 9:14:44 UTC - in response to Message 62517.  

Plenty of time yet.
They've only been running long enough for "weather", not for "climate." :)


OK, got an idea of length of these now. 32 months, one i5 at 3.3GHz has returned six out of 32 trickle ups so at least a couple of days till the fastest machines complete I would guestimate.
ID: 62518 · Report as offensive     Reply Quote
Profile Iain Inglis
Volunteer moderator

Send message
Joined: 16 Jan 10
Posts: 1084
Credit: 7,799,614
RAC: 5,163
Message 62519 - Posted: 27 May 2020, 10:24:50 UTC - in response to Message 62518.  
Last modified: 27 May 2020, 10:26:08 UTC

Plenty of time yet.
They've only been running long enough for "weather", not for "climate." :)


OK, got an idea of length of these now. 32 months, one i5 at 3.3GHz has returned six out of 32 trickle ups so at least a couple of days till the fastest machines complete I would guestimate.

My first model has finished but not reported. The first two zips uploaded and cleared; the remainder, including the restart and 'out' file, occasionally upload to 100% and then revert to 0%.

My remote machines are the same - a couple of successful uploads and the remaining Zips retrying.
ID: 62519 · Report as offensive     Reply Quote
Profile geophi
Volunteer moderator

Send message
Joined: 7 Aug 04
Posts: 2185
Credit: 64,822,615
RAC: 5,275
Message 62521 - Posted: 27 May 2020, 15:28:27 UTC - in response to Message 62519.  

Similar to Iain, two tasks have finished on my i7 4770. However, none of the files have uploaded. Some will look to be 100% in the transfers tab for awhile, but won't complete. Sixty eight files waiting to upload.

I reported such to the appropriate people on the project.
ID: 62521 · Report as offensive     Reply Quote
Tomcat

Send message
Joined: 29 May 15
Posts: 17
Credit: 717,192
RAC: 12,206
Message 62522 - Posted: 27 May 2020, 17:12:25 UTC - in response to Message 62518.  
Last modified: 27 May 2020, 17:14:02 UTC

My 4 870 tasks are at 93% completion already. None of the trickles are being successfully uploaded. They may get to 100%, but never actually finish uploading.
ID: 62522 · Report as offensive     Reply Quote
Profile JIM

Send message
Joined: 31 Dec 07
Posts: 1152
Credit: 22,363,583
RAC: 5,022
Message 62523 - Posted: 27 May 2020, 17:40:22 UTC

I now have about 50 zip files that I can’t upload. I am suspending the 8 CP Wu’s running on my 2 machines until they get the server problem sorted out. Back to Rosetta and WCG.
ID: 62523 · Report as offensive     Reply Quote
Jim1348

Send message
Joined: 15 Jan 06
Posts: 637
Credit: 26,751,529
RAC: 653
Message 62524 - Posted: 27 May 2020, 18:22:47 UTC - in response to Message 62523.  

It is fun commiserating when you can't do anything about it. It is like the virus. Neither will be going anywhere for a while.
ID: 62524 · Report as offensive     Reply Quote
Snowpaw

Send message
Joined: 27 Feb 06
Posts: 2
Credit: 4,178,977
RAC: 0
Message 62525 - Posted: 27 May 2020, 20:21:53 UTC

Is an update on this issue coming? Situation has not resolved for several days.
ID: 62525 · Report as offensive     Reply Quote
Previous · 1 · 2 · 3 · 4 · Next

Message boards : Number crunching : Uploading files fails

©2024 cpdn.org