Message boards : Number crunching : Upload Problem ( Windows)
Message board moderation
Author | Message |
---|---|
Send message Joined: 27 Dec 05 Posts: 20 Credit: 236,510 RAC: 0 |
Hallo, i have a rar upload problem with CPN workunits, but i think its a problem with Boinc-manager, not with CPN-server. I try to explain it in English :) Upload starts, but after some minutes Boinc manager think the upload is broken and says: try again over X minutes. But the upload isnt broken ! It stil goes on, i see this on my router and network traffic monitor. If i wait to the "real" end of the upload and manually say Boinc manager try again now, the upload starts again an d after one second its reportet as finish. If i don't do that and Boinc manager try again it self after X minutes the upload starts new and the same problem comes. I think there is a problem with the feedback massages to Boinc manager that the upload still is in progres ?! Is there any resoluton ? |
Send message Joined: 17 Nov 07 Posts: 142 Credit: 4,271,370 RAC: 0 |
Thanks for reporting this, Chris. It does look like a problem with the latest version of Boinc, not CPDN. It might be worth reporting it to the Boinc Project. I think the Q&A discussion board is the place, but I am not sure. But before you go to Boinc, a few questions. The Boinc people will ask them anyway, so it's best to have the answers ready. ;-) Have you set the file_xfer_debug flag in cc_config.xml? What are the messages (Advanced view -> Messages tab) after you do that? Also, do uploads to other projects work OK? |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Two more questions: Are there uploads waiting for other projects as well as this one? How many uploads are waiting? (Lots of zips waiting, especially from multiple projects, is a know problem.) Backups: Here |
Send message Joined: 5 Aug 04 Posts: 1283 Credit: 15,824,334 RAC: 0 |
That sounds very like the problem I described (and gave a workaround for) here Chris. "The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer |
Send message Joined: 28 Mar 09 Posts: 126 Credit: 9,825,980 RAC: 0 |
I too have upload problems. The difference is I have an _13 (final) file from a hadam3p eu model and the upload url looks okay as "http://climateapps1.oucs.ox.ac.uk/cgi-bin/file_upload_handler". The upload doesn't start. Its only 35Mb :-) I had another _13 file that successfully uploaded yesterday after a week or retries, so presume this is the usual space problem. BOINC blog |
Send message Joined: 5 Aug 04 Posts: 1496 Credit: 95,522,203 RAC: 0 |
Hi, Mark, I confess to not having any Regional tasks upload #13 files to Oxford in the last few days but my last ones went home okay. Perhaps after the long Royal Wedding holiday ... (Some uploads now generate a meaningless [to us] red failure message. Please ignore it.) "ONLY" 35MB ...! By the way, you have two i7 920 box listings which might be the same machine. If so, they can be "merged" --> in "Your Account", click on Computers ... and scroll to the bottom and click on "Merge computers by name". (That's an impressive set of boxes you have.) May neither of us ever record a significant event for QCN, eh? "We have met the enemy and he is us." -- Pogo Greetings from coastal Washington state, the scenic US Pacific Northwest. |
Send message Joined: 28 Mar 09 Posts: 126 Credit: 9,825,980 RAC: 0 |
By the way, you have two i7 920 box listings which might be the same machine. If so, they can be "merged" --> in "Your Account", click on Computers ... and scroll to the bottom and click on "Merge computers by name". (That's an impressive set of boxes you have.) Actually I have 5 i7's. Three are 920's (one each currently running the full res ocean), so they will need to be running for a while to finish in time. May neither of us ever record a significant event for QCN, eh? I hope not. Cheers |
Send message Joined: 27 Dec 05 Posts: 20 Credit: 236,510 RAC: 0 |
But before you go to Boinc, a few questions. The Boinc people will ask them anyway, so it's best to have the answers ready. ;-) Have you set the file_xfer_debug flag in cc_config.xml? What are the messages (Advanced view -> Messages tab) after you do that? Also, do uploads to other projects work OK? I don't have problems with other project uploads... cc_config.xml ... i have this saved some month ago: <cc_config> <options> <max_file_xfers>1</max_file_xfers> <max_file_xfers_per_project>1</max_file_xfers_per_project> </options> </cc_config> Are there uploads waiting for other projects as well as this one? No others waiting.... |
Send message Joined: 17 Nov 07 Posts: 142 Credit: 4,271,370 RAC: 0 |
OK, Chris, try this: 1. Set cc_config.xml as follows: <cc_config> <log_flags> <file_xfer_debug>1</file_xfer_debug> <http_xfer_debug>1</http_xfer_debug> </log_flags> <options> <http_1_0>1</http_1_0> <max_file_xfers>1</max_file_xfers> <max_file_xfers_per_project>1</max_file_xfers_per_project> </options> </cc_config> 2. In Advanced View, select menu Advanced -> Read Config file http://boinc.berkeley.edu/wiki/Advanced_view#BOINC_Manager_Menus. 3. Re-try uploads, and then look at the messages in the Advanced View's Messages tab. Look for messages that begin: [file_xfer] ... or [http_xfer] ... |
Send message Joined: 27 Dec 05 Posts: 20 Credit: 236,510 RAC: 0 |
OK Greg, i'll try, but answer will take some time... at the moment i have a famous unit running... this takes 150 more hours to finish. With famous units i don't have this problem.. so i think the next trickle from a HDAM3P comes next week :) thx so far. |
Send message Joined: 17 Nov 07 Posts: 142 Credit: 4,271,370 RAC: 0 |
No problem. Good luck! |
Send message Joined: 28 Mar 09 Posts: 126 Credit: 9,825,980 RAC: 0 |
Still haven't managed to get my _13 file up. I have uploaded other CPDN files from this machine (just not the final file). Uploads for other projects are also working. Url is "http://climateapps1.oucs.ox.ac.uk/cgi-bin/file_upload_handler" and its on its 22nd retry. It hasn't managed to transfer any of it yet. Debug messages: Note that I have replaced the proxy server name/IP address for security reasons 4/05/2011 8:54:00 PM | | [http] HTTP_OP::libcurl_exec(): ca-bundle 'C:\Program Files\BOINC\ca-bundle.crt' 4/05/2011 8:54:00 PM | | [http] HTTP_OP::libcurl_exec(): ca-bundle set 4/05/2011 8:54:00 PM | climateprediction.net | Started upload of hadam3p_eu_2lou_1974_1_007161606_1_13.zip 4/05/2011 8:54:00 PM | | [http] [ID#5] Info: timeout on name lookup is not supported 4/05/2011 8:54:00 PM | | [http] [ID#5] Info: About to connect() to proxy xxx port xxx (#0) 4/05/2011 8:54:00 PM | | [http] [ID#5] Info: Trying a.b.c.d... 4/05/2011 8:54:00 PM | | [http] [ID#5] Info: Connected to xxx (a.b.c.d) port xxx (#0) 4/05/2011 8:54:00 PM | | [http] [ID#5] Sent header to server: POST http://climateapps1.oucs.ox.ac.uk/cgi-bin/file_upload_handler HTTP/1.0 4/05/2011 8:54:00 PM | | [http] [ID#5] Sent header to server: User-Agent: BOINC client (windows_x86_64 6.12.26) 4/05/2011 8:54:00 PM | | [http] [ID#5] Sent header to server: Host: climateapps1.oucs.ox.ac.uk 4/05/2011 8:54:00 PM | | [http] [ID#5] Sent header to server: Accept: */* 4/05/2011 8:54:00 PM | | [http] [ID#5] Sent header to server: Accept-Encoding: deflate, gzip 4/05/2011 8:54:00 PM | | [http] [ID#5] Sent header to server: Proxy-Connection: Keep-Alive 4/05/2011 8:54:00 PM | | [http] [ID#5] Sent header to server: Content-Type: application/x-www-form-urlencoded 4/05/2011 8:54:00 PM | | [http] [ID#5] Sent header to server: Content-Length: 295 4/05/2011 8:54:00 PM | | [http] [ID#5] Sent header to server: 4/05/2011 8:59:01 PM | | [http] [ID#5] Info: Operation too slow. Less than 10 bytes/sec transfered the last 300 seconds 4/05/2011 8:59:01 PM | | [http] [ID#5] Info: Expire cleared 4/05/2011 8:59:01 PM | | [http] [ID#5] Info: Closing connection #0 4/05/2011 8:59:01 PM | | [http] HTTP error: Timeout was reached 4/05/2011 8:59:02 PM | | Project communication failed: attempting access to reference site 4/05/2011 8:59:02 PM | | [http] HTTP_OP::init_get(): http://www.google.com/ 4/05/2011 8:59:02 PM | | [http] HTTP_OP::libcurl_exec(): ca-bundle 'C:\Program Files\BOINC\ca-bundle.crt' 4/05/2011 8:59:02 PM | | [http] HTTP_OP::libcurl_exec(): ca-bundle set 4/05/2011 8:59:02 PM | climateprediction.net | Temporarily failed upload of hadam3p_eu_2lou_1974_1_007161606_1_13.zip: HTTP error BOINC blog |
Send message Joined: 5 Aug 04 Posts: 1283 Credit: 15,824,334 RAC: 0 |
That makes things a lot clearer Mark. It looks like your proxy server has allowed BOINC (strictly speaking libcurl) to send it more data than it can pass on to the upload server within BOINC's 5 minute upload inactivity timeout (if BOINC doesn't receive an acknowledgement for the data it has passed to the proxy server before the inactivity timeout expires it classes the connection as failed). Is there any way you can reduce the proxy server's cache size (or disable caching) to test this out? "The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer |
Send message Joined: 28 Mar 09 Posts: 126 Credit: 9,825,980 RAC: 0 |
That makes things a lot clearer Mark. It looks like your proxy server has allowed BOINC (strictly speaking libcurl) to send it more data than it can pass on to the upload server within BOINC's 5 minute upload inactivity timeout (if BOINC doesn't receive an acknowledgement for the data it has passed to the proxy server before the inactivity timeout expires it classes the connection as failed). I don't think its the proxy server as its working for other projects and other files for CPDN. I have disabled it in BOINC. As you can see from the log below its still getting the time out (only now its got 2 completed eu work unit to upload instead of just 1). 5/05/2011 9:06:55 PM | climateprediction.net | Starting task hadam3p_eu_2mu6_1990_1_007231768_1 using hadam3p_eu version 609 5/05/2011 9:08:39 PM | | Using proxy info from GUI 5/05/2011 9:08:39 PM | | Not using a proxy 5/05/2011 9:08:47 PM | | [http] HTTP_OP::libcurl_exec(): ca-bundle 'C:\Program Files\BOINC\ca-bundle.crt' 5/05/2011 9:08:47 PM | | [http] HTTP_OP::libcurl_exec(): ca-bundle set 5/05/2011 9:08:47 PM | climateprediction.net | Started upload of hadam3p_eu_2lou_1974_1_007161606_1_13.zip 5/05/2011 9:08:47 PM | | [http] HTTP_OP::libcurl_exec(): ca-bundle 'C:\Program Files\BOINC\ca-bundle.crt' 5/05/2011 9:08:47 PM | | [http] HTTP_OP::libcurl_exec(): ca-bundle set 5/05/2011 9:08:47 PM | climateprediction.net | Started upload of hadam3p_eu_2mwh_2000_1_007231748_1_13.zip 5/05/2011 9:08:47 PM | | [http] [ID#59] Info: timeout on name lookup is not supported 5/05/2011 9:08:47 PM | | [http] [ID#59] Info: About to connect() to climateapps1.oucs.ox.ac.uk port 80 (#0) 5/05/2011 9:08:47 PM | | [http] [ID#59] Info: Trying 163.1.13.16... 5/05/2011 9:08:47 PM | | [http] [ID#60] Info: timeout on name lookup is not supported 5/05/2011 9:08:47 PM | | [http] [ID#60] Info: About to connect() to climateapps1.oucs.ox.ac.uk port 80 (#1) 5/05/2011 9:08:47 PM | | [http] [ID#60] Info: Trying 163.1.13.16... 5/05/2011 9:08:48 PM | | [http] [ID#59] Info: Connected to climateapps1.oucs.ox.ac.uk (163.1.13.16) port 80 (#0) 5/05/2011 9:08:48 PM | | [http] [ID#60] Info: Connected to climateapps1.oucs.ox.ac.uk (163.1.13.16) port 80 (#1) 5/05/2011 9:08:48 PM | | [http] [ID#59] Sent header to server: POST /cgi-bin/file_upload_handler HTTP/1.0 5/05/2011 9:08:48 PM | | [http] [ID#59] Sent header to server: User-Agent: BOINC client (windows_x86_64 6.12.26) 5/05/2011 9:08:48 PM | | [http] [ID#59] Sent header to server: Host: climateapps1.oucs.ox.ac.uk 5/05/2011 9:08:48 PM | | [http] [ID#59] Sent header to server: Accept: */* 5/05/2011 9:08:48 PM | | [http] [ID#59] Sent header to server: Accept-Encoding: deflate, gzip 5/05/2011 9:08:48 PM | | [http] [ID#59] Sent header to server: Content-Type: application/x-www-form-urlencoded 5/05/2011 9:08:48 PM | | [http] [ID#59] Sent header to server: Content-Length: 295 5/05/2011 9:08:48 PM | | [http] [ID#59] Sent header to server: 5/05/2011 9:08:48 PM | | [http] [ID#60] Sent header to server: POST /cgi-bin/file_upload_handler HTTP/1.0 5/05/2011 9:08:48 PM | | [http] [ID#60] Sent header to server: User-Agent: BOINC client (windows_x86_64 6.12.26) 5/05/2011 9:08:48 PM | | [http] [ID#60] Sent header to server: Host: climateapps1.oucs.ox.ac.uk 5/05/2011 9:08:48 PM | | [http] [ID#60] Sent header to server: Accept: */* 5/05/2011 9:08:48 PM | | [http] [ID#60] Sent header to server: Accept-Encoding: deflate, gzip 5/05/2011 9:08:48 PM | | [http] [ID#60] Sent header to server: Content-Type: application/x-www-form-urlencoded 5/05/2011 9:08:48 PM | | [http] [ID#60] Sent header to server: Content-Length: 295 5/05/2011 9:08:48 PM | | [http] [ID#60] Sent header to server: 5/05/2011 9:13:49 PM | | [http] [ID#59] Info: Operation too slow. Less than 10 bytes/sec transfered the last 300 seconds 5/05/2011 9:13:49 PM | | [http] [ID#59] Info: Expire cleared 5/05/2011 9:13:49 PM | | [http] [ID#59] Info: Closing connection #0 5/05/2011 9:13:49 PM | | [http] [ID#60] Info: Operation too slow. Less than 10 bytes/sec transfered the last 300 seconds 5/05/2011 9:13:49 PM | | [http] [ID#60] Info: Expire cleared 5/05/2011 9:13:49 PM | | [http] [ID#60] Info: Closing connection #1 5/05/2011 9:13:49 PM | | [http] HTTP error: Timeout was reached 5/05/2011 9:13:49 PM | | [http] HTTP error: Timeout was reached 5/05/2011 9:13:50 PM | | Project communication failed: attempting access to reference site 5/05/2011 9:13:50 PM | | [http] HTTP_OP::init_get(): http://www.google.com/ 5/05/2011 9:13:50 PM | | [http] HTTP_OP::libcurl_exec(): ca-bundle set 5/05/2011 9:13:50 PM | climateprediction.net | Temporarily failed upload of hadam3p_eu_2lou_1974_1_007161606_1_13.zip: HTTP error BOINC blog |
Send message Joined: 28 Mar 09 Posts: 126 Credit: 9,825,980 RAC: 0 |
Update to the last post. After shutting down BOINC and deleting a few files (mostly HTTP_temp and the job_log files) and then starting it up again it seems happy to upload. Its using the proxy server at the moment. I have managed to upload both _13 files. There is a _12 file that belongs with one wu that is stuck (it goes to retry immediately so looks like a different issue). Maybe the project developers made some room, or maybe it was the BOINC client or libcurl, either way its an improvement. BOINC blog |
Send message Joined: 27 Dec 05 Posts: 20 Credit: 236,510 RAC: 0 |
Hmmm, i think the last 4 trickles gone away without problems.... 14.05.2011 02:41:58 climateprediction.net [fxd] starting upload, upload_offset -1 14.05.2011 02:41:58 climateprediction.net Started upload of hadam3p_pnw_ywhm_1969_1_006897170_0_4.zip 14.05.2011 02:41:58 climateprediction.net [file_xfer_debug] URL: http://boinc1.coas.oregonstate.edu/cpdn_cgi_main/file_upload_handler 14.05.2011 02:42:00 climateprediction.net Sending scheduler request: To send trickle-up message. 14.05.2011 02:42:00 climateprediction.net Not reporting or requesting tasks 14.05.2011 02:42:01 [http_xfer_debug] [ID#56] HTTP: wrote 93 bytes 14.05.2011 02:42:01 climateprediction.net [file_xfer_debug] FILE_XFER_SET::poll(): http op done; retval 0 14.05.2011 02:42:01 climateprediction.net [file_xfer_debug] parsing upload response: <data_server_reply> <status>0</status> <file_size>0</file_size></data_server_reply> 14.05.2011 02:42:01 climateprediction.net [file_xfer_debug] parsing status: 0 14.05.2011 02:42:01 climateprediction.net [fxd] starting upload, upload_offset 0 14.05.2011 02:42:01 [http_xfer_debug] [ID#1] HTTP: wrote 4683 bytes 14.05.2011 02:42:01 climateprediction.net Scheduler request completed 14.05.2011 02:42:11 [http_xfer_debug] [ID#1] HTTP: wrote 1243 bytes 14.05.2011 02:42:11 [http_xfer_debug] [ID#1] HTTP: wrote 355 bytes 14.05.2011 02:43:37 [http_xfer_debug] [ID#1] HTTP: wrote 6062 bytes 14.05.2011 02:44:31 [http_xfer_debug] [ID#56] HTTP: wrote 64 bytes 14.05.2011 02:44:31 climateprediction.net [file_xfer_debug] FILE_XFER_SET::poll(): http op done; retval 0 14.05.2011 02:44:31 climateprediction.net [file_xfer_debug] parsing upload response: <data_server_reply> <status>0</status></data_server_reply> 14.05.2011 02:44:31 climateprediction.net [file_xfer_debug] parsing status: 0 14.05.2011 02:44:31 climateprediction.net [file_xfer_debug] file transfer status 0 14.05.2011 02:44:31 climateprediction.net Finished upload of hadam3p_pnw_ywhm_1969_1_006897170_0_4.zip 14.05.2011 02:44:31 climateprediction.net [file_xfer_debug] Throughput 40315 bytes/sec |
Send message Joined: 27 Dec 05 Posts: 20 Credit: 236,510 RAC: 0 |
Hi, problem is back... unit try 3 days to upload himself... no success. Then i help by my "hand method" 25.05.2011 18:23:41 climateprediction.net [fxd] starting upload, upload_offset -1 25.05.2011 18:23:41 climateprediction.net Started upload of hadam3p_eu_41w7_2001_1_007254816_0_5.zip 25.05.2011 18:23:41 climateprediction.net [file_xfer_debug] URL: http://cpdn-upload1.comlab.ox.ac.uk/cgi-bin/file_upload_handler 25.05.2011 18:23:41 [http_xfer_debug] [ID#293] HTTP: wrote 93 bytes 25.05.2011 18:23:42 climateprediction.net [file_xfer_debug] FILE_XFER_SET::poll(): http op done; retval 0 25.05.2011 18:23:42 climateprediction.net [file_xfer_debug] parsing upload response: <data_server_reply> <status>0</status> <file_size>0</file_size></data_server_reply> 25.05.2011 18:23:42 climateprediction.net [file_xfer_debug] parsing status: 0 25.05.2011 18:23:42 climateprediction.net [fxd] starting upload, upload_offset 0 25.05.2011 18:28:50 Project communication failed: attempting access to reference site 25.05.2011 18:28:50 climateprediction.net [file_xfer_debug] FILE_XFER_SET::poll(): http op done; retval -184 25.05.2011 18:28:50 climateprediction.net [file_xfer_debug] file transfer status -184 25.05.2011 18:28:50 climateprediction.net Temporarily failed upload of hadam3p_eu_41w7_2001_1_007254816_0_5.zip: HTTP error 25.05.2011 18:28:50 climateprediction.net [file_xfer_debug] project-wide xfer delay for 10496.292001 sec 25.05.2011 18:28:50 climateprediction.net Backing off 1 hr 35 min 38 sec on upload of hadam3p_eu_41w7_2001_1_007254816_0_5.zip 25.05.2011 18:29:01 [http_xfer_debug] [ID#0] HTTP: wrote 9934 bytes 25.05.2011 18:29:02 Internet access OK - project servers may be temporarily down. ***** try again...clicked by me ***** 25.05.2011 18:29:23 climateprediction.net [fxd] starting upload, upload_offset -1 25.05.2011 18:29:23 climateprediction.net Started upload of hadam3p_eu_41w7_2001_1_007254816_0_5.zip 25.05.2011 18:29:23 climateprediction.net [file_xfer_debug] URL: http://cpdn-upload1.comlab.ox.ac.uk/cgi-bin/file_upload_handler 25.05.2011 18:29:23 [http_xfer_debug] [ID#294] HTTP: wrote 100 bytes 25.05.2011 18:29:24 climateprediction.net [file_xfer_debug] FILE_XFER_SET::poll(): http op done; retval 0 25.05.2011 18:29:24 climateprediction.net [file_xfer_debug] parsing upload response: <data_server_reply> <status>0</status> <file_size>13732652</file_size></data_server_reply> 25.05.2011 18:29:24 climateprediction.net [file_xfer_debug] parsing status: 0 25.05.2011 18:29:24 climateprediction.net [file_xfer_debug] file transfer status 0 25.05.2011 18:29:24 climateprediction.net Finished upload of hadam3p_eu_41w7_2001_1_007254816_0_5.zip 25.05.2011 18:29:24 climateprediction.net [file_xfer_debug] Throughput 128 bytes/sec 25.05.2011 18:29:24 climateprediction.net [fxd] starting upload, upload_offset -1 |
Send message Joined: 31 Dec 07 Posts: 1152 Credit: 22,363,583 RAC: 5,022 |
Does anyone have any ideas when uploader1.atm will be back up? I presently have an a Hadam3p with zip files 3, 6, 9, 12 and 13 stuck in the transfer tab. |
Send message Joined: 1 Jan 07 Posts: 1061 Credit: 36,718,239 RAC: 8,054 |
Mine have started to go through this very afternoon - first day back at work after a long holiday weekend in the UK. But I had a big backlog - it'll take a while to everyone's to get through. |
Send message Joined: 31 Dec 07 Posts: 1152 Credit: 22,363,583 RAC: 5,022 |
Mine have started uploading also. Zip files 9, 12, and 13 have gone and only 3 and 6 remain. This is what is really important. There is no reason to crunch the WU’s if we can’t upload the results. |
©2024 cpdn.org