Message boards : Number crunching : Disk Quota exeded.
Message board moderation
Author | Message |
---|---|
Send message Joined: 15 May 09 Posts: 4541 Credit: 19,039,635 RAC: 18,944 |
Getting this on all uploads at the moment, project informed but it is 5.00pm on a Friday so space may not be freed up till Monday. - Project has been informed. Someone must be sending too much work out! |
Send message Joined: 24 Dec 16 Posts: 15 Credit: 1,564,952 RAC: 0 |
Getting this on all uploads at the moment, project informed but it is 5.00pm on a Friday so space may not be freed up till Monday. - Project has been informed. Having the same problem... have 17 WUs trying to upload and other adding every 15 mins! |
Send message Joined: 15 May 09 Posts: 4541 Credit: 19,039,635 RAC: 18,944 |
Sarah has responded to my email. So they will be moving some data off the disks involved to longer term storage. Still not sure it will happen before Monday though which means that there will be a lot of machines trying to upload seeing as virtually everyone will have all the work they can cope with at the moment. |
Send message Joined: 16 Jan 10 Posts: 1084 Credit: 7,884,997 RAC: 4,577 |
... this appears to be just the final restart upload for my machines. |
Send message Joined: 16 Oct 11 Posts: 254 Credit: 15,954,577 RAC: 0 |
Yes...the normal zip trickles are uploading OK but not the final restart upload for finished WUs. Getting the following error: 6/9/2017 1:30:55 PM | climateprediction.net | [error] Error reported by file upload server: can't write file wah2_global_b2kx_199812_25_577_011039931_1_r275968347_restart.zip: Disk quota exceeded Art |
Send message Joined: 16 Oct 11 Posts: 254 Credit: 15,954,577 RAC: 0 |
I take it back. Perhaps this is function of the WU type, but on another machine all my zip trickles are now failing to upload as well with the following error: 6/9/2017 3:32:21 PM | climateprediction.net | Temporarily failed upload of wah2_global_b0py_199812_25_578_011047520_0_r952609853_23.zip: transient upload error |
Send message Joined: 18 Jul 13 Posts: 438 Credit: 25,714,303 RAC: 6,015 |
6/9/2017 1:30:55 PM | climateprediction.net | [error] Error reported by file upload server: can't write file wah2_global_b2kx_199812_25_577_011039931_1_r275968347_restart.zip: Disk quota exceeded I got the same for a WU from 578 batch, but it passed through after the second attempt and it was successfully reported as completed. Another WU from 577 batch did not have any upload errors 2h earlier, but after completion it is still "in progress" and I may list in the dedicated thread. |
Send message Joined: 22 Feb 06 Posts: 491 Credit: 31,372,248 RAC: 15,514 |
I've got one from a different batch: 09/06/2017 23:01:01 | | [http_xfer] [ID#1630] HTTP: wrote 184 bytes 09/06/2017 23:01:01 | climateprediction.net | [http] [ID#1630] Info: Connection #4319 to host upload2.cpdn.org left intact 09/06/2017 23:01:02 | climateprediction.net | [error] Error reported by file upload server: can't write file hadcm3s_825r_201412_120_564_011005106_1_r869911686_10.zip: Disk quota exceeded 09/06/2017 23:01:02 | climateprediction.net | Temporarily failed upload of hadcm3s_825r_201412_120_564_011005106_1_r869911686_10.zip: transient upload error 09/06/2017 23:01:02 | climateprediction.net | Backing off 01:37:27 on upload of hadcm3s_825r_201412_120_564_011005106_1_r869911686_10.zip |
Send message Joined: 15 May 09 Posts: 4541 Credit: 19,039,635 RAC: 18,944 |
Unless things have changed, some of the zips go directly to wherever in the world the scientists who give Oxford the work are - various universities either always or on the whole. This would explain zips for some tasks going through and not others. My guess is First thing (start of working day) Monday they will start moving the data. Meanwhile, a back of envelope calculation making a few assumptions about number of cpus per machine and speed etc. would suggest at least 500GB/day of data is building up between all the machines out there crunching so when Oxford does start accepting the data again the servers are going to get hammered. Many will get transient http errors as a result of this till things calm down. |
Send message Joined: 22 Feb 06 Posts: 491 Credit: 31,372,248 RAC: 15,514 |
I've suspended network activity for the moment until things get sorted. |
Send message Joined: 6 Aug 04 Posts: 124 Credit: 9,195,838 RAC: 0 |
The wah2 583 batch gets the trickles uploaded with no problems. 11-Jun-2017 15:01:45 [climateprediction.net] Started upload of wah2_wus25_thkh_199809_25_583_011070077_1_r95718452_1.zip 11-Jun-2017 15:01:52 [climateprediction.net] Finished upload of wah2_wus25_thkh_199809_25_583_011070077_1_r95718452_1.zip Linux Users Everywhere @ BOINC |
Send message Joined: 27 Jan 07 Posts: 300 Credit: 3,288,263 RAC: 26,370 |
Got a dozen or so stuck file transfers. This hurts my other projects, but I've suspended all network activity until this is resolved. Even if they get the file move started on Monday, I expect the transfer to take many hours. So, don't expect much free space until mid-day Tuesday. BTW, why don't the programmers know how much data each model needs to upload? They should tell the sysadmin this information #models x upload size x #uploads. That way, the sysadmin can check if they'll run out of space before the tasks are finished. |
Send message Joined: 22 Feb 06 Posts: 491 Credit: 31,372,248 RAC: 15,514 |
wus25 goes to upload5. It is upload2 that has the problems. |
Send message Joined: 16 Jan 10 Posts: 1084 Credit: 7,884,997 RAC: 4,577 |
My blocked uploads have started clearing now ... |
Send message Joined: 15 May 09 Posts: 4541 Credit: 19,039,635 RAC: 18,944 |
Mine too. Disk quota has been increased to give some headroom. |
Send message Joined: 27 Jan 07 Posts: 300 Credit: 3,288,263 RAC: 26,370 |
I had about 1 GB upload this morning. :) |
©2024 cpdn.org