climateprediction.net (CPDN) home page
Thread 'Site problems'

Thread 'Site problems'

Message boards : Number crunching : Site problems
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4

AuthorMessage
Aurum
Avatar

Send message
Joined: 15 Jul 17
Posts: 99
Credit: 18,701,746
RAC: 318
Message 64659 - Posted: 20 Oct 2021, 13:00:35 UTC - in response to Message 64654.  

Check the batch number. If it's closed, then that's the reason.
Those that know how to properly run a BOINC server system would issue a Server Abort signal and that would never happen.
ID: 64659 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4538
Credit: 19,005,674
RAC: 21,647
Message 64660 - Posted: 20 Oct 2021, 13:07:42 UTC - in response to Message 64659.  
Last modified: 20 Oct 2021, 13:10:45 UTC

Those that know how to properly run a BOINC server system would issue a Server Abort signal and that would never happen.


However, a transient http error is not because a batch has been closed and #88 has not been closed.
ID: 64660 · Report as offensive     Reply Quote
Harri Liljeroos

Send message
Joined: 9 Dec 05
Posts: 116
Credit: 12,547,934
RAC: 2,738
Message 64661 - Posted: 20 Oct 2021, 14:01:59 UTC - in response to Message 64659.  

Those that know how to properly run a BOINC server system would issue a Server Abort signal and that would never happen.

I think that server abort signal works only for tasks that has not been started crunching yet. Or is there a separate signal for 'Forced Abort'?
ID: 64661 · Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 1 Jan 07
Posts: 1061
Credit: 36,702,480
RAC: 9,812
Message 64662 - Posted: 20 Oct 2021, 14:26:36 UTC - in response to Message 64661.  

'Not started by deadline' is a client-enforced abort. Administrative (server side) aborts are usually unconditional, "started or not" - documented at https://boinc.berkeley.edu/trac/wiki/CancelJobs
ID: 64662 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4538
Credit: 19,005,674
RAC: 21,647
Message 64663 - Posted: 20 Oct 2021, 14:33:39 UTC

I think that server abort signal works only for tasks that has not been started crunching yet. Or is there a separate signal for 'Forced Abort'?


Not sure about that, I had assumed any tasks could be aborted from the server assuming the client contacts the server but, never having gotten further than building the server from source code out of interest, I couldn't be sure and couldn't find the answer quickly in the documentation I have looked at.

I see that Richard has beaten me to it with a definitive answer.
ID: 64663 · Report as offensive     Reply Quote
ProfileAlan K

Send message
Joined: 22 Feb 06
Posts: 491
Credit: 30,975,898
RAC: 14,500
Message 64665 - Posted: 20 Oct 2021, 18:34:36 UTC - in response to Message 64656.  

Does this help:-

Wed 20 Oct 2021 12:58:16 BST | climateprediction.net | [fxd] starting upload, upload_offset 0
Wed 20 Oct 2021 12:58:16 BST | climateprediction.net | Started upload of hadam4h_h0ye_201505_5_901_012076497_3_r1813679968_restart.zip
Wed 20 Oct 2021 12:58:16 BST | climateprediction.net | [file_xfer] URL: http://upload11.cpdn.org/cgi-bin/file_upload_handler
Wed 20 Oct 2021 12:58:16 BST | climateprediction.net | [fxd] starting upload, upload_offset -1
Wed 20 Oct 2021 12:58:16 BST | climateprediction.net | Started upload of hadam4h_h0ye_201505_5_901_012076497_3_r1813679968_5.zip
Wed 20 Oct 2021 12:58:16 BST | climateprediction.net | [file_xfer] URL: http://upload11.cpdn.org/cgi-bin/file_upload_handler
Wed 20 Oct 2021 12:58:17 BST | climateprediction.net | [http] [ID#14] Info: Too old connection (1502 seconds), disconnect it
Wed 20 Oct 2021 12:58:17 BST | climateprediction.net | [http] [ID#14] Info: Connection 39 seems to be dead!
Wed 20 Oct 2021 12:58:17 BST | climateprediction.net | [http] [ID#14] Info: Closing connection 39
Wed 20 Oct 2021 12:58:17 BST | climateprediction.net | [http] [ID#14] Info: TLSv1.3 (OUT), TLS alert, close notify (256):
Wed 20 Oct 2021 12:58:17 BST | climateprediction.net | [http] [ID#15] Info: Found bundle for host upload11.cpdn.org: 0x55cd914c5e80 [serially]
Wed 20 Oct 2021 12:58:17 BST | climateprediction.net | [http] [ID#15] Info: Server doesn't support multiplex (yet)
Wed 20 Oct 2021 12:58:17 BST | climateprediction.net | [http] [ID#14] Info: Trying 192.171.139.103:80...
Wed 20 Oct 2021 12:58:17 BST | climateprediction.net | [http] [ID#14] Info: TCP_NODELAY set
Wed 20 Oct 2021 12:58:17 BST | climateprediction.net | [http] [ID#15] Info: Hostname 'upload11.cpdn.org' was found in DNS cache
Wed 20 Oct 2021 12:58:17 BST | climateprediction.net | [http] [ID#15] Info: Trying 192.171.139.103:80...
Wed 20 Oct 2021 12:58:17 BST | climateprediction.net | [http] [ID#15] Info: TCP_NODELAY set
Wed 20 Oct 2021 13:00:17 BST | climateprediction.net | [http] [ID#14] Info: Connection timed out after 120131 milliseconds
Wed 20 Oct 2021 13:00:17 BST | climateprediction.net | [http] [ID#14] Info: Closing connection 40
Wed 20 Oct 2021 13:00:17 BST | climateprediction.net | [http] [ID#15] Info: Connection timed out after 120122 milliseconds
Wed 20 Oct 2021 13:00:17 BST | climateprediction.net | [http] [ID#15] Info: Closing connection 41
Wed 20 Oct 2021 13:00:17 BST | climateprediction.net | [http] HTTP error: Timeout was reached
Wed 20 Oct 2021 13:00:17 BST | climateprediction.net | [http] HTTP error: Timeout was reached
Wed 20 Oct 2021 13:00:18 BST | | Project communication failed: attempting access to reference site
Wed 20 Oct 2021 13:00:18 BST | | [http] HTTP_OP::init_get(): https://www.google.com/
Wed 20 Oct 2021 13:00:18 BST | climateprediction.net | [file_xfer] http op done; retval -184 (transient HTTP error)
Wed 20 Oct 2021 13:00:18 BST | climateprediction.net | [file_xfer] http op done; retval -184 (transient HTTP error)
Wed 20 Oct 2021 13:00:18 BST | climateprediction.net | [file_xfer] file transfer status -184 (transient HTTP error)
Wed 20 Oct 2021 13:00:18 BST | climateprediction.net | Temporarily failed upload of hadam4h_h0ye_201505_5_901_012076497_3_r1813679968_restart.zip: transient HTTP error
Wed 20 Oct 2021 13:00:18 BST | climateprediction.net | [file_xfer] project-wide xfer delay for 3235.318955 sec
Wed 20 Oct 2021 13:00:18 BST | climateprediction.net | Backing off 00:09:36 on upload of hadam4h_h0ye_201505_5_901_012076497_3_r1813679968_restart.zip
Wed 20 Oct 2021 13:00:18 BST | climateprediction.net | [file_xfer] file transfer status -184 (transient HTTP error)
Wed 20 Oct 2021 13:00:18 BST | climateprediction.net | Temporarily failed upload of hadam4h_h0ye_201505_5_901_012076497_3_r1813679968_5.zip: transient HTTP error
ID: 64665 · Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 1 Jan 07
Posts: 1061
Credit: 36,702,480
RAC: 9,812
Message 64666 - Posted: 20 Oct 2021, 19:08:50 UTC - in response to Message 64665.  
Last modified: 20 Oct 2021, 19:19:21 UTC

Yes, it proves that your particular problem is nothing to do with the expired certificate problem - you're not even trying to use https!

I get the same IP address for upload11.cpdn.org, but I don't get a ping response (mind you, many BOINC servers are set not to respond to pings). I don't think even the mods have access to a full current server status board, so this is probably one for the mods to ask the staff about - that server might have got caught up in the networking problems of a few weeks ago.

Edit - your task is a hadam4h, batch 901. I got a couple of hadam4h, batch 920 today - the first in a while. I've checked, and they're also set to send files to upload11.cpdn.org - so it shouldn't be a retired server.
ID: 64666 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 64667 - Posted: 20 Oct 2021, 19:28:28 UTC

OK, I've just emailed Andy about this.
And my last running task, a batch 901, failed to upload zip 2 overnight. :(
So, where's the queue for the coffee. :)
ID: 64667 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4538
Credit: 19,005,674
RAC: 21,647
Message 64668 - Posted: 20 Oct 2021, 20:31:42 UTC

HI,

both upload 3 and upload11 are the same system under the hood. I just restarted the upload handler and lots of httpd processes have appeared so should be fine now

Kind Regards


David


From Moderators email list.
ID: 64668 · Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 1 Jan 07
Posts: 1061
Credit: 36,702,480
RAC: 9,812
Message 64669 - Posted: 20 Oct 2021, 20:49:43 UTC

Sigh. It just goes to prove that a BOINC project cannot run without an active, involved community base. It's a bit scary that it needs an associate professor to count how many servers are currently out of action. That's a real blocker for the Science United model of anonymous, contact-free, volunteer scientists.
ID: 64669 · Report as offensive     Reply Quote
ProfileAlan K

Send message
Joined: 22 Feb 06
Posts: 491
Credit: 30,975,898
RAC: 14,500
Message 64670 - Posted: 21 Oct 2021, 7:17:18 UTC

Stuck uploads went OK after a manual retry transfer command. Wierdly zip file 5 from the finished task had uploaded earlier and been registered on the database but obviously the out and retsart zips had got stuck. All now OK (including another task downloaded).
ID: 64670 · Report as offensive     Reply Quote
ProfileBill F

Send message
Joined: 17 Jan 09
Posts: 124
Credit: 2,027,010
RAC: 2,694
Message 64690 - Posted: 24 Oct 2021, 3:13:08 UTC

Certificate file problem update.

For any 64 Bit Windows users that did not correct their own crt file BOINC Berkeley has released an official updated BOINC Version dated 17 Oct 2021. This version contains the corrected file.

7.16.20 can be found here

https://boinc.berkeley.edu/download.php

Bill F
ID: 64690 · Report as offensive     Reply Quote
Aurum
Avatar

Send message
Joined: 15 Jul 17
Posts: 99
Credit: 18,701,746
RAC: 318
Message 64691 - Posted: 25 Oct 2021, 13:33:59 UTC

Is the upload speed normally capped at 21 kBps?
ID: 64691 · Report as offensive     Reply Quote
Jean-David Beyer

Send message
Joined: 5 Aug 04
Posts: 1120
Credit: 17,202,915
RAC: 2,154
Message 64692 - Posted: 25 Oct 2021, 14:05:42 UTC - in response to Message 64691.  

Is the upload speed normally capped at 21 kBps?


Average upload rate 1506.49 KB/sec
Average download rate 15655.55 KB/sec
ID: 64692 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4538
Credit: 19,005,674
RAC: 21,647
Message 64694 - Posted: 25 Oct 2021, 16:02:53 UTC - in response to Message 64691.  

Is the upload speed normally capped at 21 kBps?

I get about 90KB/s on my bored band. Anything from that up to 200KB if tethering with my phone which is 4G. Load balancing using both gives me up to 250KB total upload speed when I have 2 or more uploads going at once.
ID: 64694 · Report as offensive     Reply Quote
Aurum
Avatar

Send message
Joined: 15 Jul 17
Posts: 99
Credit: 18,701,746
RAC: 318
Message 64695 - Posted: 25 Oct 2021, 17:47:19 UTC

I think it's a problem in the western US from this big storm that just hammered us. The ULs keep moving, that's the important thing.
ID: 64695 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4538
Credit: 19,005,674
RAC: 21,647
Message 64696 - Posted: 25 Oct 2021, 20:24:55 UTC - in response to Message 64695.  

I think it's a problem in the western US from this big storm that just hammered us. The ULs keep moving, that's the important thing.


Makes sense, I am afraid unless it really causes a lot of damage/loss of life, weather events where you are don't make the news here.
ID: 64696 · Report as offensive     Reply Quote
Jim1348

Send message
Joined: 15 Jan 06
Posts: 637
Credit: 26,751,529
RAC: 653
Message 64697 - Posted: 25 Oct 2021, 21:01:03 UTC - in response to Message 64696.  

We don't see much evidence of it in the eastern U.S., just some normal rain, though it is making the news.
https://news.yahoo.com/record-breaking-california-bomb-cyclone-linked-to-climate-change-183607985.html

(Probably every topic on the CPDN forums will have to be sub-titled "Climate Change in the News" from now on.)
ID: 64697 · Report as offensive     Reply Quote
Jean-David Beyer

Send message
Joined: 5 Aug 04
Posts: 1120
Credit: 17,202,915
RAC: 2,154
Message 64698 - Posted: 25 Oct 2021, 21:04:29 UTC - in response to Message 64694.  

On my 75 Megabit Verizon FiOS connection, I get (right now)
           Timestamp 	          Download    Upload 	    Test Server	
10/25/2021 16:52:39              75.81 Mbps   60.33 Mbps    New York City, NY


Note that these are in Megabits per second, whereas the CPDN web site gives Kilobytes per second.
ID: 64698 · Report as offensive     Reply Quote
Previous · 1 · 2 · 3 · 4

Message boards : Number crunching : Site problems

©2024 cpdn.org