climateprediction.net home page
Completed tasks not showing on server

Alan K

Joined: 22 Feb 06
Posts: 491
Credit: 30,946,057
RAC: 13,930
Message 63864 - Posted: 14 Apr 2021, 22:52:04 UTC
Last modified: 14 Apr 2021, 22:52:45 UTC

One of my tasks has completed on my machine but is not showing as completed on the server. The restart and out zips have both gone OK on 11th April. Task is hadsm4_a01g_201310_6_899_012070423_0
ID: 63864
Bryn Mawr

Joined: 28 Jul 19
Posts: 149
Credit: 12,830,559
RAC: 228
Message 63865 - Posted: 15 Apr 2021, 1:01:23 UTC - in response to Message 63864.  

One of my tasks has completed on my machine but is not showing as completed on the server. The restart and out zips have both gone OK on 11th April. Task is hadsm4_a01g_201310_6_899_012070423_0


Hmmm, 6 month job, last of the 6 months reported complete 3.5 days ago but the job itself has not closed.

I think this needs someone with access to the server.
ID: 63865
Les Bayliss
Volunteer moderator

Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 63866 - Posted: 15 Apr 2021, 3:03:08 UTC

email sent.
ID: 63866
Albert H.

Joined: 18 Feb 06
Posts: 73
Credit: 61,493,450
RAC: 47,882
Message 63867 - Posted: 15 Apr 2021, 7:53:25 UTC

Hello,
This one is also showing as not finished:
hadam4h_a1sf_209811_5_891_012051406_1
Workunit 12051406
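
For anyone wanting to match a task name to its workunit number, here is a rough Python sketch. It assumes the name always has seven underscore-separated fields; only the workunit field is confirmed by the example above (012051406 matches Workunit 12051406). The other field names (run_id, start, length, batch, attempt) are my guesses at the meaning, not official CPDN terminology.

```python
from typing import NamedTuple


class TaskName(NamedTuple):
    model: str      # e.g. "hadam4h"
    run_id: str     # e.g. "a1sf" (meaning assumed)
    start: str      # e.g. "209811" (assumed to be a YYYYMM start date)
    length: int     # e.g. 5 (assumed to be the run length in months)
    batch: int      # e.g. 891 (assumed to be the batch number)
    workunit: int   # e.g. 12051406 (leading zero dropped)
    attempt: int    # trailing index (assumed to be a resend counter)


def parse_task_name(name: str) -> TaskName:
    """Split a task name into its underscore-separated fields."""
    parts = name.strip().split("_")
    if len(parts) != 7:
        raise ValueError(f"expected 7 fields, got {len(parts)}: {name!r}")
    model, run_id, start, length, batch, workunit, attempt = parts
    return TaskName(model, run_id, start, int(length), int(batch),
                    int(workunit), int(attempt))


if __name__ == "__main__":
    task = parse_task_name("hadam4h_a1sf_209811_5_891_012051406_1")
    print(task.workunit, task.batch)
```

The integer conversion is what strips the leading zero, so the result can be compared directly with the workunit number shown on the website.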

greetings from Luxembourg
ID: 63867
Les Bayliss
Volunteer moderator

Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 63871 - Posted: 15 Apr 2021, 21:01:46 UTC

Andy has restarted some scripts.
Is it working now?
ID: 63871
Albert H.

Joined: 18 Feb 06
Posts: 73
Credit: 61,493,450
RAC: 47,882
Message 63872 - Posted: 15 Apr 2021, 21:36:26 UTC

This one is still in progress:
Task: 22019407
Work unit: 12051406
Sent: 19 Feb 2021, 20:01:12 UTC
Deadline: 2 Feb 2022, 1:21:12 UTC
Status: In progress
Run time: ---
CPU time: ---
Credit: 33,854.34
Application: UK Met Office HadAM4 at N216 resolution v8.52 i686-pc-linux-gnu
Albert Hentzen
ID: 63872
Alan K

Joined: 22 Feb 06
Posts: 491
Credit: 30,946,057
RAC: 13,930
Message 63877 - Posted: 16 Apr 2021, 22:41:04 UTC - in response to Message 63871.  
Last modified: 16 Apr 2021, 22:42:41 UTC

Not as of late Friday 16th. I will restart the VMware VM that task is running on over the weekend and see if that makes a difference. I am catching up on a couple of VB tasks at the moment.
ID: 63877
Les Bayliss
Volunteer moderator

Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 63878 - Posted: 17 Apr 2021, 1:08:29 UTC

I'm running a hadcm3s from batch 900 on a plain vanilla MacBook Air.
It has 2 more zips to finish and upload.
Then we'll be in a better position to work this out.
ID: 63878
Les Bayliss
Volunteer moderator

Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 63883 - Posted: 18 Apr 2021, 1:26:38 UTC
Last modified: 18 Apr 2021, 1:27:02 UTC

OK, my task finished, reported, and is showing Success.

Are any of the people who reported the original problem still having it?
ID: 63883
geophi
Volunteer moderator

Joined: 7 Aug 04
Posts: 2185
Credit: 64,822,615
RAC: 5,275
Message 63884 - Posted: 18 Apr 2021, 4:45:11 UTC

I've also had issues with successfully completed tasks that were acknowledged by the server, but not labeled as "SUCCESS" on the task page.

This thread was where it was discussed earlier. It appears to have something to do with the task reporting when the server is very busy, and it somehow fails to assign the task as a SUCCESS in the database.

https://www.cpdn.org/forum_thread.php?id=9006#63089
ID: 63884
Albert H.

Joined: 18 Feb 06
Posts: 73
Credit: 61,493,450
RAC: 47,882
Message 63885 - Posted: 18 Apr 2021, 8:56:57 UTC

Yes, this one is not showing success.

https://www.cpdn.org/cpdnboinc/result.php?resultid=22019407
ID: 63885
Les Bayliss
Volunteer moderator

Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 63886 - Posted: 18 Apr 2021, 9:14:57 UTC - in response to Message 63885.  

Is it running in a VM?
ID: 63886
Albert H.

Joined: 18 Feb 06
Posts: 73
Credit: 61,493,450
RAC: 47,882
Message 63887 - Posted: 19 Apr 2021, 5:58:58 UTC

No.
See below for the startup messages from this morning:

Mon 19 Apr 2021 07:52:33 AM CEST | | Starting BOINC client version 7.16.6 for x86_64-pc-linux-gnu
Mon 19 Apr 2021 07:52:33 AM CEST | | log flags: file_xfer, sched_ops, task
Mon 19 Apr 2021 07:52:33 AM CEST | | Libraries: libcurl/7.68.0 OpenSSL/1.1.1f zlib/1.2.11 brotli/1.0.7 libidn2/2.2.0 libpsl/0.21.0 (+libidn2/2.2.0) libssh/0.9.3/openssl/zlib nghttp2/1.40.0 librtmp/2.3
Mon 19 Apr 2021 07:52:33 AM CEST | | Data directory: /var/lib/boinc-client
Mon 19 Apr 2021 07:52:33 AM CEST | | No usable GPUs found
Mon 19 Apr 2021 07:52:34 AM CEST | | libc: Ubuntu GLIBC 2.31-0ubuntu9.2 version 2.31
Mon 19 Apr 2021 07:52:34 AM CEST | | Host name: user1linux
Mon 19 Apr 2021 07:52:34 AM CEST | | Processor: 8 GenuineIntel Intel(R) Core(TM) i7-6700K CPU @ 4.00GHz [Family 6 Model 94 Stepping 3]
Mon 19 Apr 2021 07:52:34 AM CEST | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb invpcid_single pti ssbd ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid ept_ad fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm mpx rdseed adx smap clflushopt intel_pt xsaveopt xsavec xgetbv1 xsaves dtherm ida arat pln pts hwp hwp_notify hwp_act_window hwp_epp md_clear flush_l1d
Mon 19 Apr 2021 07:52:34 AM CEST | | OS: Linux Ubuntu: Ubuntu 20.04.2 LTS [5.8.0-50-generic|libc 2.31 (Ubuntu GLIBC 2.31-0ubuntu9.2)]
Mon 19 Apr 2021 07:52:34 AM CEST | | Memory: 15.57 GB physical, 1.94 GB virtual
Mon 19 Apr 2021 07:52:34 AM CEST | | Disk: 99.96 GB total, 49.05 GB free
Mon 19 Apr 2021 07:52:34 AM CEST | | Local time is UTC +2 hours
Mon 19 Apr 2021 07:52:34 AM CEST | | Config: GUI RPCs allowed from:
Mon 19 Apr 2021 07:52:34 AM CEST | climateprediction.net | General prefs: from climateprediction.net (last modified 14-Mar-2021 18:31:43)
Mon 19 Apr 2021 07:52:34 AM CEST | climateprediction.net | Computer location: school
Mon 19 Apr 2021 07:52:34 AM CEST | | General prefs: using separate prefs for school
Mon 19 Apr 2021 07:52:34 AM CEST | | Reading preferences override file
Mon 19 Apr 2021 07:52:34 AM CEST | | Preferences:
Mon 19 Apr 2021 07:52:34 AM CEST | | max memory usage when active: 15938.64 MB
Mon 19 Apr 2021 07:52:34 AM CEST | | max memory usage when idle: 15938.64 MB
Mon 19 Apr 2021 07:52:34 AM CEST | | max disk usage: 86.11 GB
Mon 19 Apr 2021 07:52:34 AM CEST | | max CPUs used: 6
Mon 19 Apr 2021 07:52:34 AM CEST | | (to change preferences, visit a project web site or select Preferences in the Manager)
Mon 19 Apr 2021 07:52:34 AM CEST | | Setting up project and slot directories
Mon 19 Apr 2021 07:52:34 AM CEST | | Checking active tasks
Mon 19 Apr 2021 07:52:34 AM CEST | climateprediction.net | URL https://climateprediction.net/; Computer ID 1515379; resource share 100
Mon 19 Apr 2021 07:52:34 AM CEST | | Setting up GUI RPC socket
Mon 19 Apr 2021 07:52:34 AM CEST | | gui_rpc_auth.cfg is empty - no GUI RPC password protection
Mon 19 Apr 2021 07:52:34 AM CEST | | Checking presence of 230 project files
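
In case it helps anyone sift through a log like this, here is a small Python sketch that splits each line into timestamp, project and message, assuming the "timestamp | project | message" layout seen in the lines above. The names LogLine and parse_log_line are made up for illustration and are not part of BOINC.

```python
from typing import NamedTuple, Optional


class LogLine(NamedTuple):
    timestamp: str
    project: Optional[str]  # None for client-level messages
    message: str


def parse_log_line(line: str) -> LogLine:
    """Split one event-log line on its first two '|' separators.

    Raises ValueError if the line has fewer than two separators.
    """
    timestamp, project, message = (part.strip() for part in line.split("|", 2))
    return LogLine(timestamp, project or None, message)


if __name__ == "__main__":
    sample = ("Mon 19 Apr 2021 07:52:34 AM CEST | | "
              "OS: Linux Ubuntu: Ubuntu 20.04.2 LTS")
    entry = parse_log_line(sample)
    # Client-level lines such as "OS:" and "Processor:" are where one
    # would look for signs of a virtual machine.
    if entry.project is None and entry.message.startswith("OS:"):
        print(entry.message)
```

Splitting with a maximum of two cuts keeps any further "|" characters inside the message, as in the kernel and libc details on the OS line.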
ID: 63887
Alan K

Joined: 22 Feb 06
Posts: 491
Credit: 30,946,057
RAC: 13,930
Message 63895 - Posted: 19 Apr 2021, 22:12:09 UTC - in response to Message 63877.  

Still not showing as completed on the task list. Another task on the same machine has completed this afternoon and is showing as complete.
ID: 63895
Les Bayliss
Volunteer moderator

Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 63898 - Posted: 20 Apr 2021, 0:15:55 UTC - in response to Message 63895.  

A long while ago, for reasons I no longer remember, I used to fiddle with things a bit to get them going again.

This involved making a copy, or better yet several, of client_state.xml while BOINC was stopped.

Then I found the place where BOINC kept track of what it was up to with each model, which is (was) a series of numbers, and changed it so that BOINC thought the model was at an earlier stage,
e.g. just before the model was "reported".
When restarted, BOINC would check this, move on to the next step (reporting), and things would move on.

Tricky enough when one only has one project running, with one or two models.
Probably far too daunting with multiple projects.
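
The backup step, at least, is easy to script. Below is a minimal Python sketch that makes several timestamped copies of client_state.xml. It assumes BOINC has already been stopped (the script does not check) and that you pass in the right data directory, for example /var/lib/boinc-client on the Ubuntu install logged earlier in this thread. The function name backup_client_state is made up for illustration. It does not attempt the editing itself, which stays a manual job.

```python
import shutil
import time
from pathlib import Path
from typing import List


def backup_client_state(data_dir: str, copies: int = 2) -> List[Path]:
    """Copy client_state.xml to timestamped backups in the same directory.

    Assumes the BOINC client is stopped; this function does not verify it.
    """
    source = Path(data_dir) / "client_state.xml"
    stamp = time.strftime("%Y%m%d-%H%M%S")
    backups = []
    for index in range(1, copies + 1):
        target = source.with_name(f"client_state.xml.{stamp}.bak{index}")
        shutil.copy2(source, target)  # copy2 also preserves timestamps
        backups.append(target)
    return backups


if __name__ == "__main__":
    # The path is an assumption; adjust it to your own data directory.
    for path in backup_client_state("/var/lib/boinc-client"):
        print("saved", path)
```

Keeping the copies next to the original makes it simple to put one back if an edit goes wrong, but a copy somewhere outside the BOINC directory would be safer still.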
ID: 63898
Albert H.

Joined: 18 Feb 06
Posts: 73
Credit: 61,493,450
RAC: 47,882
Message 63900 - Posted: 20 Apr 2021, 21:32:05 UTC

Many thanks for the info. I think I will leave it like that, being happy that Linux works fairly well.

greetings from Luxembourg
ID: 63900
KAMasud

Joined: 6 Oct 06
Posts: 204
Credit: 7,608,986
RAC: 0
Message 63918 - Posted: 28 Apr 2021, 12:45:03 UTC

What if instead of exiting the model in VB/VM which does cause problems, we instead save the machine state? I have been experimenting for the last four days and it has so far caused no problem. Another advantage, on restart we do not have to start from the last trickle. I do not know exactly, any feedback?
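
For VirtualBox, saving the machine state can be done from the command line with VBoxManage controlvm and its savestate action. A minimal Python sketch, assuming VBoxManage is on the PATH; the VM name "cpdn-vm" and both function names are placeholders made up for illustration.

```python
import subprocess
from typing import List


def savestate_command(vm_name: str) -> List[str]:
    """Build the VBoxManage command that saves a running VM's state."""
    return ["VBoxManage", "controlvm", vm_name, "savestate"]


def save_vm_state(vm_name: str) -> None:
    """Run the command; raises CalledProcessError if VBoxManage fails."""
    subprocess.run(savestate_command(vm_name), check=True)


if __name__ == "__main__":
    # "cpdn-vm" is a hypothetical name; use the name of your own VM.
    print(" ".join(savestate_command("cpdn-vm")))
```

Building the command separately from running it makes it possible to check what would be executed before touching a VM with a model in progress.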
ID: 63918
Dave Jackson
Volunteer moderator

Joined: 15 May 09
Posts: 4535
Credit: 18,966,742
RAC: 21,869
Message 63919 - Posted: 28 Apr 2021, 12:48:43 UTC - in response to Message 63918.  

What if instead of exiting the model in VB/VM which does cause problems, we instead save the machine state? I have been experimenting for the last four days and it has so far caused no problem. Another advantage, on restart we do not have to start from the last trickle. I do not know exactly, any feedback?


Makes sense to me: it is the equivalent of suspend or hibernation on a machine that is not using virtualisation. That has never lost me a task, whereas shutting down and restarting loses me between one in ten and one in twenty tasks on average.
ID: 63919
Alan K

Joined: 22 Feb 06
Posts: 491
Credit: 30,946,057
RAC: 13,930
Message 63920 - Posted: 28 Apr 2021, 22:29:31 UTC - in response to Message 63895.  
Last modified: 28 Apr 2021, 22:31:58 UTC

This task is still showing as in progress although it has completed. I did look at the client state file and there was no trace of the task (!).
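
For anyone who wants to repeat the check, here is a rough Python sketch that searches client_state.xml for a task by name. It assumes the file parses as well-formed XML and that each task appears as a result element with a name child; that is my understanding of the layout rather than something I have confirmed against BOINC documentation. The function name find_results is made up for illustration.

```python
import xml.etree.ElementTree as ET
from typing import Dict, List


def find_results(xml_text: str, task_name: str) -> List[Dict[str, str]]:
    """Return the child fields of every <result> whose <name> matches.

    An empty list means the file holds no trace of the task.
    """
    root = ET.fromstring(xml_text)
    matches = []
    for result in root.iter("result"):
        if result.findtext("name", default="").strip() == task_name:
            matches.append(
                {child.tag: (child.text or "").strip() for child in result}
            )
    return matches


if __name__ == "__main__":
    # The path is an assumption; adjust it to your own data directory.
    with open("/var/lib/boinc-client/client_state.xml") as handle:
        found = find_results(handle.read(),
                             "hadsm4_a01g_201310_6_899_012070423_0")
    print(found if found else "no trace of the task")
```

It only reads the file, so it is safe to run on a copy taken while BOINC is running.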
ID: 63920
bernard_ivo

Joined: 18 Jul 13
Posts: 438
Credit: 25,620,508
RAC: 4,981
Message 64082 - Posted: 25 Jun 2021, 14:21:53 UTC

Hi folks,

I have this one, which finished successfully but is still showing as In progress on the web: https://www.cpdn.org/result.php?resultid=22089752
ID: 64082

©2024 cpdn.org