Completed tasks not showing on server
Author | Message |
---|---|
Joined: 22 Feb 06 Posts: 491 Credit: 31,361,480 RAC: 15,566 |
One of my tasks has completed on my machine but is not showing as completed on the server. The restart and out zips have both gone OK on 11th April. Task is hadsm4_a01g_201310_6_899_012070423_0 |
Joined: 28 Jul 19 Posts: 150 Credit: 12,830,559 RAC: 228 |
Hmmm, a 6-month job: the last of the 6 months was reported complete 3.5 days ago, but the job itself has not closed. I think this needs someone with access to the server. |
Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
email sent. |
Joined: 18 Feb 06 Posts: 73 Credit: 62,618,764 RAC: 40,118 |
Hello, this one is also showing as not finished: hadam4h_a1sf_209811_5_891_012051406_1 (workunit 12051406). Greetings from Luxembourg |
Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Andy has restarted some scripts. Is it working now? |
Joined: 18 Feb 06 Posts: 73 Credit: 62,618,764 RAC: 40,118 |
This one is still in progress: task 22019407, workunit 12051406, sent 19 Feb 2021, 20:01:12 UTC, deadline 2 Feb 2022, 1:21:12 UTC, status In progress, run time ---, CPU time ---, credit 33,854.34, application UK Met Office HadAM4 at N216 resolution v8.52 i686-pc-linux-gnu. Albert Hentzen |
Joined: 22 Feb 06 Posts: 491 Credit: 31,361,480 RAC: 15,566 |
Not as of late Friday 16th. I will restart the VMware VM that task is running on over the weekend and see if that makes a difference. I am catching up on a couple of VB tasks at the moment. |
Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
I'm running a hadcm3s from batch 900 on a plain vanilla MacBook Air. It has 2 more zips to finish and upload. Then we'll be in a better position to work this out. |
Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
OK, my task finished, reported, and is showing Success. Are any of the people who reported the original problem still having it? |
Joined: 7 Aug 04 Posts: 2187 Credit: 64,822,615 RAC: 5,275 |
I've also had issues with successfully completed tasks that were acknowledged by the server but not labeled as "SUCCESS" on the task page. It was discussed earlier in this thread: https://www.cpdn.org/forum_thread.php?id=9006#63089 It appears to have something to do with the task reporting while the server is very busy, so that the server somehow fails to mark the task as a SUCCESS in the database. |
Joined: 18 Feb 06 Posts: 73 Credit: 62,618,764 RAC: 40,118 |
Yes, this one is not showing success. https://www.cpdn.org/cpdnboinc/result.php?resultid=22019407 |
Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Is it running in a VM? |
Joined: 18 Feb 06 Posts: 73 Credit: 62,618,764 RAC: 40,118 |
No, see below the starting indication this morning:
Mon 19 Apr 2021 07:52:33 AM CEST | | Starting BOINC client version 7.16.6 for x86_64-pc-linux-gnu
Mon 19 Apr 2021 07:52:33 AM CEST | | log flags: file_xfer, sched_ops, task
Mon 19 Apr 2021 07:52:33 AM CEST | | Libraries: libcurl/7.68.0 OpenSSL/1.1.1f zlib/1.2.11 brotli/1.0.7 libidn2/2.2.0 libpsl/0.21.0 (+libidn2/2.2.0) libssh/0.9.3/openssl/zlib nghttp2/1.40.0 librtmp/2.3
Mon 19 Apr 2021 07:52:33 AM CEST | | Data directory: /var/lib/boinc-client
Mon 19 Apr 2021 07:52:33 AM CEST | | No usable GPUs found
Mon 19 Apr 2021 07:52:34 AM CEST | | libc: Ubuntu GLIBC 2.31-0ubuntu9.2 version 2.31
Mon 19 Apr 2021 07:52:34 AM CEST | | Host name: user1linux
Mon 19 Apr 2021 07:52:34 AM CEST | | Processor: 8 GenuineIntel Intel(R) Core(TM) i7-6700K CPU @ 4.00GHz [Family 6 Model 94 Stepping 3]
Mon 19 Apr 2021 07:52:34 AM CEST | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb invpcid_single pti ssbd ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid ept_ad fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm mpx rdseed adx smap clflushopt intel_pt xsaveopt xsavec xgetbv1 xsaves dtherm ida arat pln pts hwp hwp_notify hwp_act_window hwp_epp md_clear flush_l1d
Mon 19 Apr 2021 07:52:34 AM CEST | | OS: Linux Ubuntu: Ubuntu 20.04.2 LTS [5.8.0-50-generic|libc 2.31 (Ubuntu GLIBC 2.31-0ubuntu9.2)]
Mon 19 Apr 2021 07:52:34 AM CEST | | Memory: 15.57 GB physical, 1.94 GB virtual
Mon 19 Apr 2021 07:52:34 AM CEST | | Disk: 99.96 GB total, 49.05 GB free
Mon 19 Apr 2021 07:52:34 AM CEST | | Local time is UTC +2 hours
Mon 19 Apr 2021 07:52:34 AM CEST | | Config: GUI RPCs allowed from:
Mon 19 Apr 2021 07:52:34 AM CEST | climateprediction.net | General prefs: from climateprediction.net (last modified 14-Mar-2021 18:31:43)
Mon 19 Apr 2021 07:52:34 AM CEST | climateprediction.net | Computer location: school
Mon 19 Apr 2021 07:52:34 AM CEST | | General prefs: using separate prefs for school
Mon 19 Apr 2021 07:52:34 AM CEST | | Reading preferences override file
Mon 19 Apr 2021 07:52:34 AM CEST | | Preferences:
Mon 19 Apr 2021 07:52:34 AM CEST | | max memory usage when active: 15938.64 MB
Mon 19 Apr 2021 07:52:34 AM CEST | | max memory usage when idle: 15938.64 MB
Mon 19 Apr 2021 07:52:34 AM CEST | | max disk usage: 86.11 GB
Mon 19 Apr 2021 07:52:34 AM CEST | | max CPUs used: 6
Mon 19 Apr 2021 07:52:34 AM CEST | | (to change preferences, visit a project web site or select Preferences in the Manager)
Mon 19 Apr 2021 07:52:34 AM CEST | | Setting up project and slot directories
Mon 19 Apr 2021 07:52:34 AM CEST | | Checking active tasks
Mon 19 Apr 2021 07:52:34 AM CEST | climateprediction.net | URL https://climateprediction.net/; Computer ID 1515379; resource share 100
Mon 19 Apr 2021 07:52:34 AM CEST | | Setting up GUI RPC socket
Mon 19 Apr 2021 07:52:34 AM CEST | | gui_rpc_auth.cfg is empty - no GUI RPC password protection
Mon 19 Apr 2021 07:52:34 AM CEST | | Checking presence of 230 project files |
Joined: 22 Feb 06 Posts: 491 Credit: 31,361,480 RAC: 15,566 |
Still not showing as completed on the task list. Another task on the same machine has completed this afternoon and is showing as complete. |
Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
A long while ago, for reasons I no longer remember, I used to fiddle with things a bit to get them going again. This involved making a copy, or better yet several, of client_state.xml while BOINC was stopped. Then I would find the place where BOINC was keeping track of what it was up to with each model, which is (was) a series of numbers, and change it so that BOINC thought the model was at an earlier stage, e.g. just before the model was "reported". When restarted, BOINC would check this, move to the next step (reporting), and things would move on. Tricky enough when one only has one project running, with one or two models. Probably far too daunting with multiple projects. |
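The backup step described above could be sketched roughly like this in Python. It is only an illustration: the function name `backup_client_state` and the backup folder are made up for the example, and the path in the usage comment assumes the data directory shown in the client log earlier in this thread (/var/lib/boinc-client), which will differ on other installs. It only makes copies; it does not edit client_state.xml.

```python
import shutil
from datetime import datetime
from pathlib import Path
from typing import List


def backup_client_state(state_file: Path, backup_dir: Path, copies: int = 3) -> List[Path]:
    """Make several timestamped copies of the state file.

    Hypothetical helper: run it only while the BOINC client is stopped,
    so the file is not rewritten halfway through the copy.
    """
    if not state_file.is_file():
        raise FileNotFoundError(f"state file not found: {state_file}")
    backup_dir.mkdir(parents=True, exist_ok=True)
    stamp = datetime.now().strftime("%Y%m%d-%H%M%S")
    made = []
    for i in range(1, copies + 1):
        # One file per copy, e.g. client_state.xml.20210419-075234.copy1
        target = backup_dir / f"{state_file.name}.{stamp}.copy{i}"
        shutil.copy2(state_file, target)  # copy2 also keeps the timestamps
        made.append(target)
    return made


# Example call (assumed paths, adjust to your own install):
# backup_client_state(Path("/var/lib/boinc-client/client_state.xml"),
#                     Path.home() / "boinc-state-backups")
```

Keeping more than one copy means a slip while editing one of them still leaves an untouched original to restore from.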
Joined: 18 Feb 06 Posts: 73 Credit: 62,618,764 RAC: 40,118 |
Many thanks for the info. I think I will leave it like that, being happy that Linux mostly works well. Greetings from Luxembourg |
Joined: 6 Oct 06 Posts: 204 Credit: 7,608,986 RAC: 0 |
What if instead of exiting the model in VB/VM which does cause problems, we instead save the machine state? I have been experimenting for the last four days and it has so far caused no problem. Another advantage, on restart we do not have to start from the last trickle. I do not know exactly, any feedback? |
Joined: 15 May 09 Posts: 4541 Credit: 19,039,635 RAC: 18,944 |
Makes sense to me: saving the machine state is the equivalent of suspend or hibernation on a machine that is not using virtualisation. That has never lost me a task, whereas shutting down and restarting loses me between one in ten and one in twenty tasks on average when I do it. |
Joined: 22 Feb 06 Posts: 491 Credit: 31,361,480 RAC: 15,566 |
This task is still showing as in progress although it has completed. I did look at the client state file and there was no trace of the task (!). |
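For anyone wanting to repeat that check without reading the whole file by eye, a plain text search is enough. The sketch below is just that: `find_task_mentions` is a made-up name, and it deliberately makes no assumptions about the XML layout of client_state.xml, it simply reports every line that contains the task name.

```python
from pathlib import Path
from typing import List, Tuple


def find_task_mentions(state_file: Path, task_name: str) -> List[Tuple[int, str]]:
    """Return (line number, stripped line) for each line mentioning the task.

    Hypothetical helper: a plain substring search, not an XML parse.
    """
    hits = []
    with state_file.open("r", encoding="utf-8", errors="replace") as handle:
        for line_no, line in enumerate(handle, start=1):
            if task_name in line:
                hits.append((line_no, line.strip()))
    return hits


# Example call (assumed path, adjust to your own install):
# find_task_mentions(Path("/var/lib/boinc-client/client_state.xml"),
#                    "hadsm4_a01g_201310_6_899_012070423_0")
```

An empty list means the client no longer has any record of the task under that name, which matches what was seen here.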
Joined: 18 Jul 13 Posts: 438 Credit: 25,705,191 RAC: 5,539 |
Hi folks, I have this one successfully finished but still In progress on the web https://www.cpdn.org/result.php?resultid=22089752 |