Message boards : Number crunching : A different upload problem -- out of disk space on rapid-watch.badc.rl.ac.uk
Message board moderation
Previous · 1 · 2 · 3 · 4 · Next
Author | Message |
---|---|
Send message Joined: 1 Jan 07 Posts: 1061 Credit: 36,718,239 RAC: 8,054 |
I've had uploads complete since you first reported this problem. What happens if you paste http://rapid-watch.badc.rl.ac.uk/cpdn_cgi/file_upload_handler into a browser address bar? I get <data_server_reply> <status>1</status> <message>no command</message> </data_server_reply> suggesting that the server is alive and waiting for me. |
Send message Joined: 12 Mar 12 Posts: 29 Credit: 666,199 RAC: 0 |
I've had uploads complete since you first reported this problem. thanks for clue I get this from different computers - tested 3 of them <data_server_reply> <status>1</status> <message>no command</message> </data_server_reply> |
Send message Joined: 28 Mar 09 Posts: 126 Credit: 9,825,980 RAC: 0 |
I've had uploads complete since you first reported this problem. I haven't tried but then all the uploads start, they simply fail with the transient http error. I did post the http debug output in the other message thread about Rapit tasks. I have one upload in particular that regularly makes it to 91% before failing. I have also tried using uk-based proxy servers. I would hate to waste the 350 days worth of CPU time because I can't upload them. BOINC blog |
Send message Joined: 3 Sep 04 Posts: 126 Credit: 26,610,380 RAC: 3,377 |
The disk is full again: 07-11-2013 05:55:18 | climateprediction.net | Started upload of hadcm3n_7zdj_1980_40_008457130_1_2.zip 07-11-2013 05:55:21 | climateprediction.net | [error] Error reported by file upload server: Server is out of disk space 07-11-2013 05:55:21 | climateprediction.net | Temporarily failed upload of hadcm3n_7zdj_1980_40_008457130_1_2.zip: transient upload error |
Send message Joined: 4 Jan 07 Posts: 9 Credit: 575,832 RAC: 0 |
Getting "Server is out of disk space" messages: Thu 07 Nov 2013 02:41:52 GMT | climateprediction.net | Sending scheduler request: To send trickle-up message. Thu 07 Nov 2013 02:41:52 GMT | climateprediction.net | Not requesting tasks: project is not highest priority Thu 07 Nov 2013 02:41:57 GMT | climateprediction.net | Scheduler request completed Thu 07 Nov 2013 02:42:16 GMT | climateprediction.net | Started upload of hadcm3n_o93c_1900_40_008466907_0_4.zip Thu 07 Nov 2013 02:42:17 GMT | climateprediction.net | [error] Error reported by file upload server: Server is out of disk space Thu 07 Nov 2013 02:42:17 GMT | climateprediction.net | Temporarily failed upload of hadcm3n_o93c_1900_40_008466907_0_4.zip: transient upload error Thu 07 Nov 2013 02:42:17 GMT | climateprediction.net | Backing off 2 min 33 sec on upload of hadcm3n_o93c_1900_40_008466907_0_4.zip Thu 07 Nov 2013 02:44:51 GMT | climateprediction.net | Started upload of hadcm3n_o93c_1900_40_008466907_0_4.zip Thu 07 Nov 2013 02:44:52 GMT | climateprediction.net | [error] Error reported by file upload server: Server is out of disk space Thu 07 Nov 2013 02:44:52 GMT | climateprediction.net | Temporarily failed upload of hadcm3n_o93c_1900_40_008466907_0_4.zip: transient upload error Thu 07 Nov 2013 02:44:52 GMT | climateprediction.net | Backing off 4 min 47 sec on upload of hadcm3n_o93c_1900_40_008466907_0_4.zip Thu 07 Nov 2013 02:49:40 GMT | climateprediction.net | Started upload of hadcm3n_o93c_1900_40_008466907_0_4.zip Thu 07 Nov 2013 02:49:41 GMT | climateprediction.net | [error] Error reported by file upload server: Server is out of disk space Thu 07 Nov 2013 02:49:41 GMT | climateprediction.net | Temporarily failed upload of hadcm3n_o93c_1900_40_008466907_0_4.zip: transient upload error Thu 07 Nov 2013 02:49:41 GMT | climateprediction.net | Backing off 8 min 11 sec on upload of hadcm3n_o93c_1900_40_008466907_0_4.zip Thu 07 Nov 2013 02:55:22 GMT | climateprediction.net | Computation for task hadcm3n_o93c_1900_40_008466907_0 finished Thu 07 Nov 2013 02:59:36 GMT | climateprediction.net | Started upload of hadcm3n_o93c_1900_40_008466907_0_4.zip Thu 07 Nov 2013 02:59:38 GMT | climateprediction.net | [error] Error reported by file upload server: Server is out of disk space Thu 07 Nov 2013 02:59:38 GMT | climateprediction.net | Temporarily failed upload of hadcm3n_o93c_1900_40_008466907_0_4.zip: transient upload error Thu 07 Nov 2013 02:59:38 GMT | climateprediction.net | Backing off 26 min 8 sec on upload of hadcm3n_o93c_1900_40_008466907_0_4.zip Thu 07 Nov 2013 03:25:46 GMT | climateprediction.net | Started upload of hadcm3n_o93c_1900_40_008466907_0_4.zip Thu 07 Nov 2013 03:25:48 GMT | climateprediction.net | [error] Error reported by file upload server: Server is out of disk space Thu 07 Nov 2013 03:25:48 GMT | climateprediction.net | Temporarily failed upload of hadcm3n_o93c_1900_40_008466907_0_4.zip: transient upload error Thu 07 Nov 2013 03:25:48 GMT | climateprediction.net | Backing off 59 min 48 sec on upload of hadcm3n_o93c_1900_40_008466907_0_4.zip Thu 07 Nov 2013 04:25:38 GMT | climateprediction.net | Started upload of hadcm3n_o93c_1900_40_008466907_0_4.zip Thu 07 Nov 2013 04:25:39 GMT | climateprediction.net | [error] Error reported by file upload server: Server is out of disk space Thu 07 Nov 2013 04:25:39 GMT | climateprediction.net | Temporarily failed upload of hadcm3n_o93c_1900_40_008466907_0_4.zip: transient upload error Thu 07 Nov 2013 04:25:39 GMT | climateprediction.net | Backing off 1 hr 51 min 26 sec on upload of hadcm3n_o93c_1900_40_008466907_0_4.zip Thu 07 Nov 2013 06:17:06 GMT | climateprediction.net | Started upload of hadcm3n_o93c_1900_40_008466907_0_4.zip Thu 07 Nov 2013 06:17:07 GMT | climateprediction.net | [error] Error reported by file upload server: Server is out of disk space Thu 07 Nov 2013 06:17:07 GMT | climateprediction.net | Temporarily failed upload of hadcm3n_o93c_1900_40_008466907_0_4.zip: transient upload error Thu 07 Nov 2013 06:17:07 GMT | climateprediction.net | Backing off 4 hr 3 min 34 sec on upload of hadcm3n_o93c_1900_40_008466907_0_4.zip And from client_state.xml <file> <name>hadcm3n_o93c_1900_40_008466907_0_4.zip</name> <nbytes>54706631.000000</nbytes> <max_nbytes>188743680.000000</max_nbytes> <md5_cksum>b30d6cab259e190cc09b6d0bed81eed4</md5_cksum> <status>1</status> <upload_url>http://rapid-watch.badc.rl.ac.uk/cpdn_cgi/file_upload_handler</upload_url> <persistent_file_xfer> <num_retries>7</num_retries> <first_request_time>1383792135.482826</first_request_time> <next_request_time>1383819641.528833</next_request_time> <time_so_far>9.032011</time_so_far> <last_bytes_xferred>115.000000</last_bytes_xferred> <is_upload>1</is_upload> </persistent_file_xfer> </file> http://rapid-watch.badc.rl.ac.uk/cpdn_cgi/file_upload_handler gives me: <data_server_reply> <status>1</status> <message>no command</message> </data_server_reply> |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
OK, I'll email them. |
Send message Joined: 5 Sep 04 Posts: 21 Credit: 2,501,833 RAC: 2,156 |
I too have this error see below: 07/11/2013 15:47:53 | climateprediction.net | Started upload of hadcm3n_823a_1980_40_008460649_1_3.zip 07/11/2013 15:47:54 | climateprediction.net | [error] Error reported by file upload server: Server is out of disk space 07/11/2013 15:47:54 | climateprediction.net | Temporarily failed upload of hadcm3n_823a_1980_40_008460649_1_3.zip: transient upload error 07/11/2013 15:47:54 | climateprediction.net | Backing off 3 hr 30 min 52 sec on upload of hadcm3n_823a_1980_40_008460649_1_3.zip Regards Jurgen "All man born has a right to life and no man born has the right to take that life" |
Send message Joined: 22 Feb 06 Posts: 491 Credit: 31,036,409 RAC: 14,604 |
I've got this as well. I guess the best thing to do is suspend network activity until it gets sorted. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
The server in question is external to Oxford Uni. The relevant people have been asked to urgently increase storage space. |
Send message Joined: 17 Aug 04 Posts: 289 Credit: 44,103,664 RAC: 0 |
Thank you Les. |
Send message Joined: 3 Sep 04 Posts: 126 Credit: 26,610,380 RAC: 3,377 |
Now something different happens. The file is uploaded until it reaches 100%, then the no space message follows: 09-11-2013 01:20:15 | climateprediction.net | Started upload of hadcm3n_7zdj_1980_40_008457130_1_2.zip 09-11-2013 01:42:14 | climateprediction.net | [error] Error reported by file upload server: can't write file /storage/incoming/uploader//hadcm3n_7zdj_1980_40_008457130_1_2.zip: No space left on server 09-11-2013 01:42:14 | climateprediction.net | Temporarily failed upload of hadcm3n_7zdj_1980_40_008457130_1_2.zip: transient upload error |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
We're on it. It's the same problem. They need a seriously bigger hard disk. I don't know which of the several research centres is storing the data, or if someone there works weekends, so it may take a few days to sort out. In the meantime, there's mowing the lawn, reading a few books, starting a jogging / running program, or just relaxing and going Ooooom a lot. |
Send message Joined: 16 Oct 11 Posts: 254 Credit: 15,954,577 RAC: 0 |
I have two 52MB files that have been failing with this for about a week. Should we just continue to wait for a fix, or do I need to delete/cancel the jobs? |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
This problem with disk space only occurred 3 days ago, so you may have a different problem. And I've already said that patience is needed. |
Send message Joined: 28 Mar 13 Posts: 16 Credit: 5,383,625 RAC: 0 |
Okay, patience is needed. So, the problem started around 11/6/2013. The main question I have is: How long will BOINC allow a download to fail until BOINC does something crazy like abort it? My wife has her first completed task queued up, and it took 640 hours (~27 days) to complete. I don't want to see all that work wasted. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
It used to be 14 days, but for late versions of BOINC 6.* onwards, it's 90 days. |
Send message Joined: 28 Mar 13 Posts: 16 Credit: 5,383,625 RAC: 0 |
Thank you, and thanks for making sure someone is trying to fix it. I will continue to monitor the upload every few days, and hopefully it gets resolved, especially before 2/4/2014. |
Send message Joined: 15 May 09 Posts: 4540 Credit: 19,039,635 RAC: 18,944 |
How long will BOINC allow a download to fail until BOINC does something crazy like abort it? Quite a while, certainly months rather than days. It is worth turning off network activity in BOINC manager to avoid repeated attempts at uploading and keeping an eye on this thread. I think that the number of upload attempts before it gives up has been increased but I can't remember what to. I have never had one fail due to this even when I haven't disabled network activity but it saves bandwidth if you do this. If running another project and you want to download work for it, just turn it on again when downloading work. |
Send message Joined: 28 Mar 13 Posts: 16 Credit: 5,383,625 RAC: 0 |
Recommending to turn off network activity is silly! Don't do that! BOINC intelligently backs off the upload-retry-intervals automatically. I'm attached to 26 other projects, doing work for them just fine, and I (obviously) require network activity to both download new tasks and upload completed results. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
It's NOT silly. For a lot of people it makes sense; e.g. those of us that are only running cpdn. Including me, where "network off" is the normal condition, only allowing uploads when I want, so that I can check for errors, as well as other reasons. |
©2024 cpdn.org