climateprediction.net (CPDN) home page
Thread 'Scheduler process down?'

Thread 'Scheduler process down?'

Message boards : Number crunching : Scheduler process down?
Message board moderation

To post messages, you must log in.

1 · 2 · Next

AuthorMessage
Jesse Viviano

Send message
Joined: 20 Dec 14
Posts: 23
Credit: 2,450,095
RAC: 296
Message 51141 - Posted: 5 Jan 2015, 18:43:51 UTC

My BOINC client is trying to send a trickle-up message, and keeps getting HTTP errors. Second, the server status page returns a blank page as of this writing. (It has the correct HTML formatting when I viewed its source, but nothing else.) Has the scheduler process failed?
ID: 51141 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 51143 - Posted: 5 Jan 2015, 19:12:04 UTC - in response to Message 51141.  

The project people are working on the front end server problem, so anything could happen from time to time.
Expect turbulence. :)

ID: 51143 · Report as offensive     Reply Quote
ProfileAlan K

Send message
Joined: 22 Feb 06
Posts: 491
Credit: 31,083,753
RAC: 15,077
Message 51144 - Posted: 5 Jan 2015, 23:11:01 UTC - in response to Message 51143.  

That would explain the HTTP errors I'm getting trying to report and Afr model.
ID: 51144 · Report as offensive     Reply Quote
ProfileDave Jackson
Volunteer moderator

Send message
Joined: 15 May 09
Posts: 4540
Credit: 19,039,635
RAC: 18,944
Message 51145 - Posted: 6 Jan 2015, 10:08:35 UTC - in response to Message 51144.  

Server Status page is back up, front page still down though.
ID: 51145 · Report as offensive     Reply Quote
Jesse Viviano

Send message
Joined: 20 Dec 14
Posts: 23
Credit: 2,450,095
RAC: 296
Message 51146 - Posted: 6 Jan 2015, 16:45:17 UTC

The server status page is up and shows the scheduler as running, but I still get HTTP errors when BOINC tries to make a scheduler request. Is it overloaded, or is something else keeping scheduler requests from succeeding?
ID: 51146 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 51147 - Posted: 6 Jan 2015, 17:04:19 UTC - in response to Message 51146.  

The answer depends on the exact string of words in the error message.
If that is for a zip file, then the server to which the zip is going may have a problem. e.g. the ANZ zips go to a server in Hobart Tasmania.

ID: 51147 · Report as offensive     Reply Quote
Jesse Viviano

Send message
Joined: 20 Dec 14
Posts: 23
Credit: 2,450,095
RAC: 296
Message 51150 - Posted: 6 Jan 2015, 17:12:03 UTC - in response to Message 51147.  

Here is the event log in regards to my attempting to report the trickle up message and the completed work unit.
1/6/2015 8:57:31 AM |  | cc_config.xml not found - using defaults
1/6/2015 8:57:31 AM |  | Starting BOINC client version 7.4.36 for windows_x86_64
1/6/2015 8:57:31 AM |  | log flags: file_xfer, sched_ops, task
1/6/2015 8:57:31 AM |  | Libraries: libcurl/7.39.0 OpenSSL/1.0.1j zlib/1.2.8
1/6/2015 8:57:31 AM |  | Data directory: C:\ProgramData\BOINC
1/6/2015 8:57:31 AM |  | Running under account Jesse Viviano
1/6/2015 8:57:31 AM |  | CUDA: NVIDIA GPU 0: GeForce GTX 580 (driver version 347.09, CUDA version 7.0, compute capability 2.0, 3072MB, 2933MB available, 1843 GFLOPS peak)
1/6/2015 8:57:31 AM |  | OpenCL: NVIDIA GPU 0: GeForce GTX 580 (driver version 347.09, device version OpenCL 1.1 CUDA, 3072MB, 2933MB available, 1843 GFLOPS peak)
1/6/2015 8:57:31 AM |  | Host name: JesseViviano-PC
1/6/2015 8:57:31 AM |  | Processor: 12 GenuineIntel Intel(R) Core(TM) i7 CPU       X 980  @ 3.33GHz [Family 6 Model 44 Stepping 2]
1/6/2015 8:57:31 AM |  | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 cx16 sse4_1 sse4_2 popcnt aes syscall nx lm vmx tm2 pbe
1/6/2015 8:57:31 AM |  | OS: Microsoft Windows 7: Ultimate x64 Edition, Service Pack 1, (06.01.7601.00)
1/6/2015 8:57:31 AM |  | Memory: 11.99 GB physical, 23.98 GB virtual
1/6/2015 8:57:31 AM |  | Disk: 223.47 GB total, 123.24 GB free
1/6/2015 8:57:31 AM |  | Local time is UTC -5 hours
1/6/2015 8:57:31 AM |  | VirtualBox version: 4.3.20
1/6/2015 8:57:31 AM | climateprediction.net | URL http://climateprediction.net/; Computer ID 1350458; resource share 100
1/6/2015 8:57:31 AM |  | Preferences:
1/6/2015 8:57:31 AM |  | max memory usage when active: 9208.27MB
1/6/2015 8:57:31 AM |  | max memory usage when idle: 11049.92MB
1/6/2015 8:57:31 AM |  | max disk usage: 111.73GB
1/6/2015 8:57:31 AM |  | (to change preferences, visit a project web site or select Preferences in the Manager)
1/6/2015 8:57:31 AM |  | Resetting file projects/pogs.theskynet.org_pogs/pogs_image01.png: md5 checksum failed for file
1/6/2015 8:57:31 AM |  | Not using a proxy
1/6/2015 8:57:32 AM | climateprediction.net | Sending scheduler request: To send trickle-up message.
1/6/2015 8:57:32 AM | climateprediction.net | Not requesting tasks: don't need (CPU: not highest priority project; NVIDIA GPU: )
1/6/2015 8:57:36 AM | climateprediction.net | Scheduler request failed: HTTP internal server error
1/6/2015 9:16:22 AM | climateprediction.net | Sending scheduler request: To send trickle-up message.
1/6/2015 9:16:22 AM | climateprediction.net | Not requesting tasks: don't need (CPU: not highest priority project; NVIDIA GPU: )
1/6/2015 9:16:25 AM | climateprediction.net | Scheduler request failed: HTTP internal server error
1/6/2015 10:08:07 AM | climateprediction.net | Sending scheduler request: To send trickle-up message.
1/6/2015 10:08:07 AM | climateprediction.net | Not requesting tasks: don't need (CPU: not highest priority project; NVIDIA GPU: job cache full)
1/6/2015 10:08:09 AM | climateprediction.net | Scheduler request failed: HTTP internal server error
1/6/2015 10:33:43 AM | climateprediction.net | Started upload of hadam3p_anz_m8ia_2012_1_009308664_0_12.zip
1/6/2015 10:36:09 AM | climateprediction.net | Finished upload of hadam3p_anz_m8ia_2012_1_009308664_0_12.zip
1/6/2015 10:42:17 AM | climateprediction.net | Started upload of hadam3p_anz_m8ia_2012_1_009308664_0_13.zip
1/6/2015 10:42:19 AM | climateprediction.net | Computation for task hadam3p_anz_m8ia_2012_1_009308664_0 finished
1/6/2015 10:50:37 AM | climateprediction.net | Finished upload of hadam3p_anz_m8ia_2012_1_009308664_0_13.zip
1/6/2015 11:41:31 AM | climateprediction.net | update requested by user
1/6/2015 11:41:38 AM | climateprediction.net | Sending scheduler request: Requested by user.
1/6/2015 11:41:38 AM | climateprediction.net | Sending trickle-up message
1/6/2015 11:41:38 AM | climateprediction.net | Reporting 1 completed tasks
1/6/2015 11:41:38 AM | climateprediction.net | Not requesting tasks: don't need (CPU: not highest priority project; NVIDIA GPU: not highest priority project)
1/6/2015 11:41:39 AM | climateprediction.net | Scheduler request failed: HTTP internal server error
ID: 51150 · Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 1 Jan 07
Posts: 1061
Credit: 36,730,664
RAC: 6,969
Message 51151 - Posted: 6 Jan 2015, 17:48:10 UTC - in response to Message 51147.  

Confirming - I've been getting

06/01/2015 17:45:48 | climateprediction.net | Sending scheduler request: Requested by user.
06/01/2015 17:45:48 | climateprediction.net | Requesting new tasks for CPU and NVIDIA GPU
06/01/2015 17:45:49 | climateprediction.net | Scheduler request failed: HTTP internal server error

every time I've tried to fetch new work, over the last six hours (trying to confirm the ANZ file size issue we discussed a few days ago).
ID: 51151 · Report as offensive     Reply Quote
Jim1348

Send message
Joined: 15 Jan 06
Posts: 637
Credit: 26,751,529
RAC: 653
Message 51153 - Posted: 6 Jan 2015, 18:30:17 UTC - in response to Message 51151.  

Same for me. I am setting up the account again, and I can now attach to the project, but that is as far as it goes.
1604			1/6/2015 1:14:27 PM	Fetching configuration file from http://climateprediction.net/get_project_config.php	
1606	climateprediction.net	1/6/2015 1:14:48 PM	Master file download succeeded	
1607	climateprediction.net	1/6/2015 1:14:53 PM	Sending scheduler request: Project initialization.	
1608	climateprediction.net	1/6/2015 1:14:53 PM	Requesting new tasks for CPU and NVIDIA GPU	
1609	climateprediction.net	1/6/2015 1:14:55 PM	Scheduler request failed: HTTP internal server error	
ID: 51153 · Report as offensive     Reply Quote
DJStarfox

Send message
Joined: 27 Jan 07
Posts: 300
Credit: 3,288,263
RAC: 26,370
Message 51154 - Posted: 6 Jan 2015, 19:35:09 UTC

I'm getting the same HTTP internal server error. Looks like something didn't quite recover from the previous outage.
ID: 51154 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 51155 - Posted: 6 Jan 2015, 21:27:29 UTC

Email sent.

ID: 51155 · Report as offensive     Reply Quote
Jesse Viviano

Send message
Joined: 20 Dec 14
Posts: 23
Credit: 2,450,095
RAC: 296
Message 51157 - Posted: 6 Jan 2015, 21:40:13 UTC - in response to Message 51155.  

Thanks!
ID: 51157 · Report as offensive     Reply Quote
Jonathan Miller

Send message
Joined: 27 Jul 12
Posts: 21
Credit: 269,602
RAC: 0
Message 51160 - Posted: 7 Jan 2015, 10:02:41 UTC - in response to Message 51154.  

Hi Chaps, sorry for this, it is my fault.
I have upgraded the database libraries, but forgot to tell the webserver.

I am fixing it now.

Jonathan
Jonathan Miller
CPDN SysAdmin
ID: 51160 · Report as offensive     Reply Quote
ProfileIain Inglis
Volunteer moderator

Send message
Joined: 16 Jan 10
Posts: 1084
Credit: 7,841,902
RAC: 5,047
Message 51161 - Posted: 7 Jan 2015, 12:09:49 UTC

Mine now clear.
ID: 51161 · Report as offensive     Reply Quote
ProfileAlan K

Send message
Joined: 22 Feb 06
Posts: 491
Credit: 31,083,753
RAC: 15,077
Message 51162 - Posted: 7 Jan 2015, 14:06:14 UTC - in response to Message 51161.  

Same here. Thanks Jonathon.
ID: 51162 · Report as offensive     Reply Quote
Jesse Viviano

Send message
Joined: 20 Dec 14
Posts: 23
Credit: 2,450,095
RAC: 296
Message 51163 - Posted: 7 Jan 2015, 16:15:28 UTC
Last modified: 7 Jan 2015, 16:15:45 UTC

My finished work unit result now has been reported. However, trickles that are newer than the scheduler outage are not showing up in the work unit result logs, as written about in this thread.
ID: 51163 · Report as offensive     Reply Quote
Jonathan Miller

Send message
Joined: 27 Jul 12
Posts: 21
Credit: 269,602
RAC: 0
Message 51167 - Posted: 8 Jan 2015, 16:36:58 UTC - in response to Message 51163.  

...fixed this too.

Apologies
ID: 51167 · Report as offensive     Reply Quote
Profile[AF>Le_Pommier] Jerome_C2005

Send message
Joined: 21 Oct 10
Posts: 53
Credit: 2,101,753
RAC: 3,985
Message 51171 - Posted: 10 Jan 2015, 10:15:40 UTC

Hi

Mine cannot download one WU for several days, it reported all OK and says it has one WU but can't download it, and when I update the project it won't ask for new work when I should have 2 WU on my dualcore (none left at the moment, only the one trying to download)

10/01/2015 10:40:21 Visit http://boinc.berkeley.edu/download.php to download it
10/01/2015 10:40:22 climateprediction.net Temporarily failed download of hadam3p_afr_7.22_windows_intelx86.exe: HTTP error
10/01/2015 10:40:22 climateprediction.net Backing off 1 hr 29 min 23 sec on download of hadam3p_afr_7.22_windows_intelx86.exe
10/01/2015 10:40:23 WUProp@Home Scheduler request completed
10/01/2015 10:40:43 climateprediction.net update requested by user
10/01/2015 10:40:43 Project communication failed: attempting access to reference site
10/01/2015 10:40:45 climateprediction.net Sending scheduler request: Requested by user.
10/01/2015 10:40:45 climateprediction.net Not reporting or requesting tasks
10/01/2015 10:40:46 Internet access OK - project servers may be temporarily down.
10/01/2015 10:40:46 climateprediction.net Scheduler request completed
10/01/2015 11:12:52 climateprediction.net update requested by user
10/01/2015 11:12:56 climateprediction.net Sending scheduler request: Requested by user.
10/01/2015 11:12:56 climateprediction.net Not reporting or requesting tasks
10/01/2015 11:12:58 climateprediction.net Scheduler request completed

Was it the same problem ?

Thanks
ID: 51171 · Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 1 Jan 07
Posts: 1061
Credit: 36,730,664
RAC: 6,969
Message 51172 - Posted: 10 Jan 2015, 10:45:19 UTC - in response to Message 51171.  

Climate Prediction workunits require the download of many, many files. If just one single file fails to download, it's usually a local problem on your computer.

Especially if that single file is an executable program, as it is in this case. You can download the file yourself from this link:

http://climateapps2.oerc.ox.ac.uk/cpdnboinc/download/mirror.php?file=/hadam3p_afr_7.22_windows_intelx86.exe

but when I tried it just now, it triggered a false virus detection from my 'Avast' anti-virus program.



If you temporarily disable antivirus scanning, you should be able to complete the download - but do remember to restore the AV service afterwards.
ID: 51172 · Report as offensive     Reply Quote
Profile[AF>Le_Pommier] Jerome_C2005

Send message
Joined: 21 Oct 10
Posts: 53
Credit: 2,101,753
RAC: 3,985
Message 51175 - Posted: 10 Jan 2015, 17:57:51 UTC

Wow ! I just disabled both file and web agent and tried to restart download and network from boinc manager and that was it ! Thanks for the quick help !

The strangest thing is that I had long ago defined an exception for file check on the whole boinc directory in avast, in the file agent setup, but now I think I should also exclude the CPDN URL in the web agent...

I'll try this later, thanks again (after finishing the download I was able to update and got 2 new WUs, yes sir !)
ID: 51175 · Report as offensive     Reply Quote
1 · 2 · Next

Message boards : Number crunching : Scheduler process down?

©2024 cpdn.org