Message boards : Number crunching : Scheduler process down?
Message board moderation
Author | Message |
---|---|
Send message Joined: 20 Dec 14 Posts: 23 Credit: 2,450,095 RAC: 296 |
My BOINC client is trying to send a trickle-up message, and keeps getting HTTP errors. Second, the server status page returns a blank page as of this writing. (It has the correct HTML formatting when I viewed its source, but nothing else.) Has the scheduler process failed? |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
The project people are working on the front end server problem, so anything could happen from time to time. Expect turbulence. :) |
Send message Joined: 22 Feb 06 Posts: 491 Credit: 31,035,559 RAC: 14,581 |
That would explain the HTTP errors I'm getting trying to report and Afr model. |
Send message Joined: 15 May 09 Posts: 4540 Credit: 19,039,635 RAC: 18,944 |
Server Status page is back up, front page still down though. |
Send message Joined: 20 Dec 14 Posts: 23 Credit: 2,450,095 RAC: 296 |
The server status page is up and shows the scheduler as running, but I still get HTTP errors when BOINC tries to make a scheduler request. Is it overloaded, or is something else keeping scheduler requests from succeeding? |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
The answer depends on the exact string of words in the error message. If that is for a zip file, then the server to which the zip is going may have a problem. e.g. the ANZ zips go to a server in Hobart Tasmania. |
Send message Joined: 20 Dec 14 Posts: 23 Credit: 2,450,095 RAC: 296 |
Here is the event log in regards to my attempting to report the trickle up message and the completed work unit. 1/6/2015 8:57:31 AM | | cc_config.xml not found - using defaults 1/6/2015 8:57:31 AM | | Starting BOINC client version 7.4.36 for windows_x86_64 1/6/2015 8:57:31 AM | | log flags: file_xfer, sched_ops, task 1/6/2015 8:57:31 AM | | Libraries: libcurl/7.39.0 OpenSSL/1.0.1j zlib/1.2.8 1/6/2015 8:57:31 AM | | Data directory: C:\ProgramData\BOINC 1/6/2015 8:57:31 AM | | Running under account Jesse Viviano 1/6/2015 8:57:31 AM | | CUDA: NVIDIA GPU 0: GeForce GTX 580 (driver version 347.09, CUDA version 7.0, compute capability 2.0, 3072MB, 2933MB available, 1843 GFLOPS peak) 1/6/2015 8:57:31 AM | | OpenCL: NVIDIA GPU 0: GeForce GTX 580 (driver version 347.09, device version OpenCL 1.1 CUDA, 3072MB, 2933MB available, 1843 GFLOPS peak) 1/6/2015 8:57:31 AM | | Host name: JesseViviano-PC 1/6/2015 8:57:31 AM | | Processor: 12 GenuineIntel Intel(R) Core(TM) i7 CPU X 980 @ 3.33GHz [Family 6 Model 44 Stepping 2] 1/6/2015 8:57:31 AM | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 cx16 sse4_1 sse4_2 popcnt aes syscall nx lm vmx tm2 pbe 1/6/2015 8:57:31 AM | | OS: Microsoft Windows 7: Ultimate x64 Edition, Service Pack 1, (06.01.7601.00) 1/6/2015 8:57:31 AM | | Memory: 11.99 GB physical, 23.98 GB virtual 1/6/2015 8:57:31 AM | | Disk: 223.47 GB total, 123.24 GB free 1/6/2015 8:57:31 AM | | Local time is UTC -5 hours 1/6/2015 8:57:31 AM | | VirtualBox version: 4.3.20 1/6/2015 8:57:31 AM | climateprediction.net | URL http://climateprediction.net/; Computer ID 1350458; resource share 100 1/6/2015 8:57:31 AM | | Preferences: 1/6/2015 8:57:31 AM | | max memory usage when active: 9208.27MB 1/6/2015 8:57:31 AM | | max memory usage when idle: 11049.92MB 1/6/2015 8:57:31 AM | | max disk usage: 111.73GB 1/6/2015 8:57:31 AM | | (to change preferences, visit a project web site or select Preferences in the Manager) 1/6/2015 8:57:31 AM | | Resetting file projects/pogs.theskynet.org_pogs/pogs_image01.png: md5 checksum failed for file 1/6/2015 8:57:31 AM | | Not using a proxy 1/6/2015 8:57:32 AM | climateprediction.net | Sending scheduler request: To send trickle-up message. 1/6/2015 8:57:32 AM | climateprediction.net | Not requesting tasks: don't need (CPU: not highest priority project; NVIDIA GPU: ) 1/6/2015 8:57:36 AM | climateprediction.net | Scheduler request failed: HTTP internal server error 1/6/2015 9:16:22 AM | climateprediction.net | Sending scheduler request: To send trickle-up message. 1/6/2015 9:16:22 AM | climateprediction.net | Not requesting tasks: don't need (CPU: not highest priority project; NVIDIA GPU: ) 1/6/2015 9:16:25 AM | climateprediction.net | Scheduler request failed: HTTP internal server error 1/6/2015 10:08:07 AM | climateprediction.net | Sending scheduler request: To send trickle-up message. 1/6/2015 10:08:07 AM | climateprediction.net | Not requesting tasks: don't need (CPU: not highest priority project; NVIDIA GPU: job cache full) 1/6/2015 10:08:09 AM | climateprediction.net | Scheduler request failed: HTTP internal server error 1/6/2015 10:33:43 AM | climateprediction.net | Started upload of hadam3p_anz_m8ia_2012_1_009308664_0_12.zip 1/6/2015 10:36:09 AM | climateprediction.net | Finished upload of hadam3p_anz_m8ia_2012_1_009308664_0_12.zip 1/6/2015 10:42:17 AM | climateprediction.net | Started upload of hadam3p_anz_m8ia_2012_1_009308664_0_13.zip 1/6/2015 10:42:19 AM | climateprediction.net | Computation for task hadam3p_anz_m8ia_2012_1_009308664_0 finished 1/6/2015 10:50:37 AM | climateprediction.net | Finished upload of hadam3p_anz_m8ia_2012_1_009308664_0_13.zip 1/6/2015 11:41:31 AM | climateprediction.net | update requested by user 1/6/2015 11:41:38 AM | climateprediction.net | Sending scheduler request: Requested by user. 1/6/2015 11:41:38 AM | climateprediction.net | Sending trickle-up message 1/6/2015 11:41:38 AM | climateprediction.net | Reporting 1 completed tasks 1/6/2015 11:41:38 AM | climateprediction.net | Not requesting tasks: don't need (CPU: not highest priority project; NVIDIA GPU: not highest priority project) 1/6/2015 11:41:39 AM | climateprediction.net | Scheduler request failed: HTTP internal server error |
Send message Joined: 1 Jan 07 Posts: 1061 Credit: 36,718,239 RAC: 8,054 |
Confirming - I've been getting 06/01/2015 17:45:48 | climateprediction.net | Sending scheduler request: Requested by user. 06/01/2015 17:45:48 | climateprediction.net | Requesting new tasks for CPU and NVIDIA GPU 06/01/2015 17:45:49 | climateprediction.net | Scheduler request failed: HTTP internal server error every time I've tried to fetch new work, over the last six hours (trying to confirm the ANZ file size issue we discussed a few days ago). |
Send message Joined: 15 Jan 06 Posts: 637 Credit: 26,751,529 RAC: 653 |
Same for me. I am setting up the account again, and I can now attach to the project, but that is as far as it goes. 1604 1/6/2015 1:14:27 PM Fetching configuration file from http://climateprediction.net/get_project_config.php 1606 climateprediction.net 1/6/2015 1:14:48 PM Master file download succeeded 1607 climateprediction.net 1/6/2015 1:14:53 PM Sending scheduler request: Project initialization. 1608 climateprediction.net 1/6/2015 1:14:53 PM Requesting new tasks for CPU and NVIDIA GPU 1609 climateprediction.net 1/6/2015 1:14:55 PM Scheduler request failed: HTTP internal server error |
Send message Joined: 27 Jan 07 Posts: 300 Credit: 3,288,263 RAC: 26,370 |
I'm getting the same HTTP internal server error. Looks like something didn't quite recover from the previous outage. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Email sent. |
Send message Joined: 20 Dec 14 Posts: 23 Credit: 2,450,095 RAC: 296 |
Thanks! |
Send message Joined: 27 Jul 12 Posts: 21 Credit: 269,602 RAC: 0 |
Hi Chaps, sorry for this, it is my fault. I have upgraded the database libraries, but forgot to tell the webserver. I am fixing it now. Jonathan Jonathan Miller CPDN SysAdmin |
Send message Joined: 16 Jan 10 Posts: 1084 Credit: 7,826,970 RAC: 5,066 |
Mine now clear. |
Send message Joined: 22 Feb 06 Posts: 491 Credit: 31,035,559 RAC: 14,581 |
Same here. Thanks Jonathon. |
Send message Joined: 20 Dec 14 Posts: 23 Credit: 2,450,095 RAC: 296 |
My finished work unit result now has been reported. However, trickles that are newer than the scheduler outage are not showing up in the work unit result logs, as written about in this thread. |
Send message Joined: 27 Jul 12 Posts: 21 Credit: 269,602 RAC: 0 |
...fixed this too. Apologies |
Send message Joined: 21 Oct 10 Posts: 53 Credit: 2,101,753 RAC: 3,985 |
Hi Mine cannot download one WU for several days, it reported all OK and says it has one WU but can't download it, and when I update the project it won't ask for new work when I should have 2 WU on my dualcore (none left at the moment, only the one trying to download) 10/01/2015 10:40:21 Visit http://boinc.berkeley.edu/download.php to download it 10/01/2015 10:40:22 climateprediction.net Temporarily failed download of hadam3p_afr_7.22_windows_intelx86.exe: HTTP error 10/01/2015 10:40:22 climateprediction.net Backing off 1 hr 29 min 23 sec on download of hadam3p_afr_7.22_windows_intelx86.exe 10/01/2015 10:40:23 WUProp@Home Scheduler request completed 10/01/2015 10:40:43 climateprediction.net update requested by user 10/01/2015 10:40:43 Project communication failed: attempting access to reference site 10/01/2015 10:40:45 climateprediction.net Sending scheduler request: Requested by user. 10/01/2015 10:40:45 climateprediction.net Not reporting or requesting tasks 10/01/2015 10:40:46 Internet access OK - project servers may be temporarily down. 10/01/2015 10:40:46 climateprediction.net Scheduler request completed 10/01/2015 11:12:52 climateprediction.net update requested by user 10/01/2015 11:12:56 climateprediction.net Sending scheduler request: Requested by user. 10/01/2015 11:12:56 climateprediction.net Not reporting or requesting tasks 10/01/2015 11:12:58 climateprediction.net Scheduler request completed Was it the same problem ? Thanks |
Send message Joined: 1 Jan 07 Posts: 1061 Credit: 36,718,239 RAC: 8,054 |
Climate Prediction workunits require the download of many, many files. If just one single file fails to download, it's usually a local problem on your computer. Especially if that single file is an executable program, as it is in this case. You can download the file yourself from this link: http://climateapps2.oerc.ox.ac.uk/cpdnboinc/download/mirror.php?file=/hadam3p_afr_7.22_windows_intelx86.exe but when I tried it just now, it triggered a false virus detection from my 'Avast' anti-virus program. If you temporarily disable antivirus scanning, you should be able to complete the download - but do remember to restore the AV service afterwards. |
Send message Joined: 21 Oct 10 Posts: 53 Credit: 2,101,753 RAC: 3,985 |
Wow ! I just disabled both file and web agent and tried to restart download and network from boinc manager and that was it ! Thanks for the quick help ! The strangest thing is that I had long ago defined an exception for file check on the whole boinc directory in avast, in the file agent setup, but now I think I should also exclude the CPDN URL in the web agent... I'll try this later, thanks again (after finishing the download I was able to update and got 2 new WUs, yes sir !) |
©2024 cpdn.org