climateprediction.net (CPDN) home page
Thread 'CPDN Monitor got quit request...'

Thread 'CPDN Monitor got quit request...'

Questions and Answers : Unix/Linux : CPDN Monitor got quit request...
Message board moderation

To post messages, you must log in.

AuthorMessage
ProfileastroWX
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1496
Credit: 95,522,203
RAC: 0
Message 1190 - Posted: 18 Aug 2004, 16:20:19 UTC
Last modified: 18 Aug 2004, 16:32:10 UTC

... seemingly from nowhere.
From slots/stderr.txt; same in both models (Bbox/SuSE 9.0/P4 3.0):
No heartbeat from core client - exiting

stderr_um.txt empty; yabsd.error empty; yabsd.out appear normal in both runs.

003u_000025123 - PH 1 TS 066598 - 08/10/1814 11:00 - H:M:S=0068:02:02 AVG= 3.68 DLT= 1.97
003u_000025123 - PH 1 TS 066599 - 08/10/1814 11:30 - H:M:S=0068:02:03 AVG= 3.68 DLT= 0.99
01su_100027319 - PH 1 TS 064814 - 01/09/1814 07:00 - H:M:S=0063:25:20 AVG= 3.52 DLT=14.69
003u_000025123 - PH 1 TS 066600 - 08/10/1814 12:00 - H:M:S=0068:02:05 AVG= 3.68 DLT= 1.98
01su_100027319 - PH 1 TS 064815 - 01/09/1814 07:30 - H:M:S=0063:25:21 AVG= 3.52 DLT= 0.95
003u_000025123 - PH 1 TS 066601 - 08/10/1814 12:30 - H:M:S=0068:02:06 AVG= 3.68 DLT= 0.99
01su_100027319 - PH 1 TS 064816 - 01/09/1814 08:00 - H:M:S=0063:25:23 AVG= 3.52 DLT= 1.94
01su_100027319 - PH 1 TS 064817 - 01/09/1814 08:30 - H:M:S=0063:25:24 AVG= 3.52 DLT= 1.00
01su_100027319 - PH 1 TS 064818 - 01/09/1814 09:00 - H:M:S=0063:25:26 AVG= 3.52 DLT= 1.94
01su_100027319 - PH 1 TS 064819 - 01/09/1814 09:30 - H:M:S=0063:25:27 AVG= 3.52 DLT= 1.00
CPDN Monitor got quit request...
Detaching shared memory...
CPDN Monitor got quit request...
Detaching shared memory...
2004-08-18 04:45:01 [---] Can't resolve hostname climateapps2.oucs.ox.ac.uk (host not found or server failure)
2004-08-18 04:45:01 [---] Can't resolve hostname climateapps2.oucs.ox.ac.uk (host not found or server failure)
2004-08-18 04:45:01 [climateprediction.net] scheduler init_op_project to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi failed, error -113
2004-08-18 04:45:01 [climateprediction.net] scheduler init_op_project to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi failed, error -113
2004-08-18 04:45:01 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds
2004-08-18 04:45:01 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds
2004-08-18 04:46:01 [climateprediction.net] Sending request to scheduler: http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi
2004-08-18 04:46:41 [---] Can't resolve hostname climateapps2.oucs.ox.ac.uk (host not found or server failure)
2004-08-18 04:46:41 [---] Can't resolve hostname climateapps2.oucs.ox.ac.uk (host not found or server failure)
2004-08-18 04:46:41 [climateprediction.net] scheduler init_op_project to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi failed, error -113
2004-08-18 04:46:41 [climateprediction.net] scheduler init_op_project to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi failed, error -113
2004-08-18 04:46:41 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds
2004-08-18 04:46:41 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds
2004-08-18 04:47:41 [climateprediction.net] Sending request to scheduler: http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi
2004-08-18 04:48:21 [---] Can't resolve hostname climateapps2.oucs.ox.ac.uk (host not found or server failure)
2004-08-18 04:48:21 [---] Can't resolve hostname climateapps2.oucs.ox.ac.uk (host not found or server failure)
2004-08-18 04:48:21 [climateprediction.net] scheduler init_op_project to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi failed, error -113
2004-08-18 04:48:21 [climateprediction.net] scheduler init_op_project to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi failed, error -113
2004-08-18 04:48:21 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds
2004-08-18 04:48:21 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds
2004-08-18 04:49:21 [climateprediction.net] Sending request to scheduler: http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi
2004-08-18 04:50:01 [---] Can't resolve hostname climateapps2.oucs.ox.ac.uk (host not found or server failure)
2004-08-18 04:50:01 [---] Can't resolve hostname climateapps2.oucs.ox.ac.uk (host not found or server failure)
2004-08-18 04:50:01 [climateprediction.net] scheduler init_op_project to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi failed, error -113
2004-08-18 04:50:01 [climateprediction.net] scheduler init_op_project to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi failed, error -113
2004-08-18 04:50:01 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds
2004-08-18 04:50:01 [climateprediction.net] Deferring communication with project for 1 minutes and 0 seconds
2004-08-18 04:51:01 [climateprediction.net] Sending request to scheduler: http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi
2004-08-18 04:51:41 [---] Can't resolve hostname climateapps2.oucs.ox.ac.uk (host not found or server failure)
2004-08-18 04:51:41 [---] Can't resolve hostname climateapps2.oucs.ox.ac.uk (host not found or server failure)
2004-08-18 04:51:41 [climateprediction.net] scheduler init_op_project to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi failed, error -113
2004-08-18 04:51:41 [climateprediction.net] scheduler init_op_project to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi failed, error -113
2004-08-18 04:51:41 [climateprediction.net] Deferring communication with project for 1 minutes and 38 seconds
2004-08-18 04:51:41 [climateprediction.net] Deferring communication with project for 1 minutes and 38 seconds
2004-08-18 04:53:19 [climateprediction.net] Sending request to scheduler: http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi
2004-08-18 04:53:59 [---] Can't resolve hostname climateapps2.oucs.ox.ac.uk (host not found or server failure)
2004-08-18 04:53:59 [---] Can't resolve hostname climateapps2.oucs.ox.ac.uk (host not found or server failure)
2004-08-18 04:53:59 [climateprediction.net] scheduler init_op_project to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi failed, error -113
2004-08-18 04:53:59 [climateprediction.net] scheduler init_op_project to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi failed, error -113
2004-08-18 04:53:59 [climateprediction.net] Deferring communication with project for 3 minutes and 53 seconds
2004-08-18 04:53:59 [climateprediction.net] Deferring communication with project for 3 minutes and 53 seconds
2004-08-18 04:57:52 [climateprediction.net] Sending request to scheduler: http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi
2004-08-18 04:58:32 [---] Can't resolve hostname climateapps2.oucs.ox.ac.uk (host not found or server failure)
2004-08-18 04:58:32 [---] Can't resolve hostname climateapps2.oucs.ox.ac.uk (host not found or server failure)
2004-08-18 04:58:32 [climateprediction.net] scheduler init_op_project to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi failed, error -113
2004-08-18 04:58:32 [climateprediction.net] scheduler init_op_project to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi failed, error -113
2004-08-18 04:58:32 [climateprediction.net] Deferring communication with project for 14 minutes and 3 seconds
2004-08-18 04:58:32 [climateprediction.net] Deferring communication with project for 14 minutes and 3 seconds
2004-08-18 05:12:35 [climateprediction.net] Sending request to scheduler: http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi
2004-08-18 05:13:15 [---] Can't resolve hostname climateapps2.oucs.ox.ac.uk (host not found or server failure)
2004-08-18 05:13:15 [---] Can't resolve hostname climateapps2.oucs.ox.ac.uk (host not found or server failure)
2004-08-18 05:13:15 [climateprediction.net] scheduler init_op_project to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi failed, error -113
2004-08-18 05:13:15 [climateprediction.net] scheduler init_op_project to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi failed, error -113
2004-08-18 05:13:15 [climateprediction.net] Deferring communication with project for 39 minutes and 15 seconds
2004-08-18 05:13:15 [climateprediction.net] Deferring communication with project for 39 minutes and 15 seconds
2004-08-18 05:52:30 [climateprediction.net] Sending request to scheduler: http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi
2004-08-18 05:53:10 [---] Can't resolve hostname climateapps2.oucs.ox.ac.uk (host not found or server failure)
2004-08-18 05:53:10 [---] Can't resolve hostname climateapps2.oucs.ox.ac.uk (host not found or server failure)
2004-08-18 05:53:10 [climateprediction.net] scheduler init_op_project to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi failed, error -113
2004-08-18 05:53:10 [climateprediction.net] scheduler init_op_project to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi failed, error -113
2004-08-18 05:53:10 [climateprediction.net] Deferring communication with project for 16 minutes and 9 seconds
2004-08-18 05:53:10 [climateprediction.net] Deferring communication with project for 16 minutes and 9 seconds
2004-08-18 06:09:19 [climateprediction.net] Sending request to scheduler: http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi
2004-08-18 06:09:19 [climateprediction.net] Scheduler RPC to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi succeeded


Re. the ongoing Sched. requests -- even without a 'heartbeat', boinc apparently continues to play, even with the Models shut down.

Restarted okay.

*Edit* Dbox (WinXP, P4 3.0, in Beta) also had 'no response' messages overnight. (For the first time in several days, I can't blame my ISP because the upstream vendor finally found a hardware router|TCP/IP problem and resolved it.)

Abox (SuSE Linux 9.0, P4 2.8, in Beta) ran without issues overnight (as did Cbox with a THC run).

________________________________________________
It is impossible to enjoy idling thoroughly unless one has plenty of work to do.
-- Jerome K. Jerome (1859, 1927)
ID: 1190 · Report as offensive     Reply Quote
old_user1
Avatar

Send message
Joined: 5 Aug 04
Posts: 907
Credit: 299,864
RAC: 0
Message 1293 - Posted: 20 Aug 2004, 1:17:35 UTC

I've seen this pop up, it may be that a client upgrade is needed (when we get the official 4.03 or 4.04 "launch version" from Berkeley).
ID: 1293 · Report as offensive     Reply Quote
ProfileastroWX
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1496
Credit: 95,522,203
RAC: 0
Message 1305 - Posted: 20 Aug 2004, 2:44:17 UTC - in response to Message 1293.  

> I've seen this pop up, it may be that a client upgrade is needed (when we get
> the official 4.03 or 4.04 "launch version" from Berkeley).
>

Thanks, Carl.

So, what is this CC 'heartbeat'? (Or is that what goes away in BOINC 4.03/4?)

Cheers,
Jim
________________________________________________
Video meliora, proboque; Deteriora sequor
I see the better way, and approve it; I follow the worse
-- Ovid (43BC-17AD)
ID: 1305 · Report as offensive     Reply Quote
old_user1
Avatar

Send message
Joined: 5 Aug 04
Posts: 907
Credit: 299,864
RAC: 0
Message 1429 - Posted: 21 Aug 2004, 22:56:39 UTC - in response to Message 1305.  

it's actually a new feature of 4.x! :-)
the core client basically sends a "heartbeat" every 5 seconds, if apps don't receive it they can assume the boinc core client crashed and should shut down (so as not to leave "hanging & hidden" models running etc).

but I've done a lot of last minute debugging today so hopefully these things go away. I've also found some really big Linux compiler options/speed improvements I hope to get in by launch, it was speeding things up about 20% (3 sec/TS to 2.5 sec/TS)
ID: 1429 · Report as offensive     Reply Quote
Desti

Send message
Joined: 6 Aug 04
Posts: 124
Credit: 9,195,838
RAC: 0
Message 1430 - Posted: 21 Aug 2004, 23:49:02 UTC - in response to Message 1429.  


>
> but I've done a lot of last minute debugging today so hopefully these things
> go away. I've also found some really big Linux compiler options/speed
> improvements I hope to get in by launch, it was speeding things up about 20%
> (3 sec/TS to 2.5 sec/TS)
>
>

That would be really nice ;-)
_____
<a href="http://climateapps2.oucs.ox.ac.uk/cpdnboinc/team_display.php?teamid=43">Linux Users Everywhere @ climateprediction.net</a>
ID: 1430 · Report as offensive     Reply Quote

Questions and Answers : Unix/Linux : CPDN Monitor got quit request...

©2024 cpdn.org