No trickles on webpage
Author | Message |
---|---|
Send message Joined: 9 Sep 04 Posts: 228 Credit: 30,744,972 RAC: 4,718 |
once again... For example https://www.cpdn.org/cpdnboinc/workunit.php?wuid=11935086 Cheers, Bonsai911 |
Send message Joined: 7 Aug 04 Posts: 2185 Credit: 64,822,615 RAC: 5,275 |
@Bonsai911 How far along are the tasks that aren't trickling (what is their progress)? You have a whole bunch of tasks on that PC that are trickling just fine and have been all along. Usually when someone reports missing trickles, it's because a server reboot occurred and the trickle storage task didn't get restarted. The missing trickles here don't look to be a server-side problem. |
Send message Joined: 9 Sep 04 Posts: 228 Credit: 30,744,972 RAC: 4,718 |
Thanks for your reply! I posted the message too early. The trickle appeared this evening. Thanks and keep up the good work. Bonsai911 |
Send message Joined: 9 Sep 04 Posts: 228 Credit: 30,744,972 RAC: 4,718 |
After further examination: there is a delay between a trickle being sent and it showing up on the website. After some hours I noticed that the delay is growing. Is a server slowing down? Questions over questions, bonsai911 |
Send message Joined: 9 Sep 04 Posts: 228 Credit: 30,744,972 RAC: 4,718 |
|
Send message Joined: 11 Dec 19 Posts: 108 Credit: 3,012,142 RAC: 0 |
This is just a guess. I think that trickles updating and the work unit "back off timer" might be synced. If this is true then updating for new work would delay trickles updating on your account and vice versa. If this is not true then perhaps there is some other CPU intensive load on the systems. It could be that the weekly script to calculate points is delaying the trickle updates for a day or two. |
Send message Joined: 9 Sep 04 Posts: 228 Credit: 30,744,972 RAC: 4,718 |
Maybe you are right. So let's wait and see. In approx. two hours we will know more. We started from this point: Last time contacted server 10 Jan 2020, 6:41:01 UTC |
Send message Joined: 15 May 09 Posts: 4535 Credit: 18,976,682 RAC: 21,948 |
[quote]This is just a guess. I think that trickles updating and the work unit "back off timer" might be synced. If this is true then updating for new work would delay trickles updating on your account and vice versa. If this is not true then perhaps there is some other CPU intensive load on the systems. It could be that the weekly script to calculate points is delaying the trickle updates for a day or two.[/quote] There was a power outage on two of the physical machines in Oxford last night. Andy says all services have now been restored. For the record, the time out for new work does not affect trickles being reported. |
Send message Joined: 11 Dec 19 Posts: 108 Credit: 3,012,142 RAC: 0 |
It does seem that something is up with the trickles on my N216 jobs. None of the trickles sent in the past six hours have shown up yet. I am not that worried about it though. My event logs show that the uploads completed without error and that means the model data went where it is supposed to. |
Send message Joined: 15 May 09 Posts: 4535 Credit: 18,976,682 RAC: 21,948 |
[quote]It does seem that something is up with the trickles on my N216 jobs. None of the trickles sent in the past six hours have shown up yet. I am not that worried about it though. My event logs show that the uploads completed without error and that means the model data went where it is supposed to.[/quote] Will let Andy know. I suspect one of the scripts didn't get restarted after the power outage that affected some of the servers last night. |
Send message Joined: 11 Dec 19 Posts: 108 Credit: 3,012,142 RAC: 0 |
Like I said, it's not a big deal to me. I know that data is there and will get processed sooner or later. |
Send message Joined: 15 May 09 Posts: 4535 Credit: 18,976,682 RAC: 21,948 |
Not a big deal for me either but sooner or later if Andy isn't told, there will be a flood of people posting here because they aren't getting their credit. (Assuming the trickles aren't going through and just not showing up on the model pages!) Email has been sent. |
Send message Joined: 22 Feb 06 Posts: 491 Credit: 30,952,705 RAC: 14,003 |
The N216 models are for 4 months so the webpage will only show these at 25% completion intervals, which in my case is about 3.5 days (14-day estimated completion). How fast is your machine? |
Send message Joined: 11 Dec 19 Posts: 108 Credit: 3,012,142 RAC: 0 |
[quote]The N216 models are for 4 months so the webpage will only show these at 25% completion intervals, which in my case is about 3.5 days (14-day estimated completion). How fast is your machine?[/quote] Right now I am running 6 models, 3 per 4-core CCX on a Ryzen 3700X. They are progressing at a little over 0.56% per hour. If my math is right that is between 17 and 19 sec/TS. They all reached the 25% trickle point before I posted to report the missing info. I can't check my math yet due to the missing info. https://www.cpdn.org/result.php?resultid=21871315 https://www.cpdn.org/result.php?resultid=21872035 https://www.cpdn.org/result.php?resultid=21872143 https://www.cpdn.org/result.php?resultid=21871965 https://www.cpdn.org/result.php?resultid=21872208 https://www.cpdn.org/result.php?resultid=21871936 |
Send message Joined: 7 Aug 04 Posts: 2185 Credit: 64,822,615 RAC: 5,275 |
If interested, you can check the average sec/TS by going into "BOINC client directory"/projects/climateprediction.net/"name of model task directory" and tailing the stdout_mon.txt file: `tail -f stdout_mon.txt` |
Send message Joined: 11 Dec 19 Posts: 108 Credit: 3,012,142 RAC: 0 |
[quote]If interested, you can check the average sec/TS by going into the[/quote] Useful! Thank you. Edit: These N216 models seem to have semi-regularly spaced time-steps that take more than 4x the CPU time of most of the other time-steps. I think it is a little bit like the "key frames" used to encode video. It does make sense if you consider that in the models every "cell" (temp, air pressure, humidity, and a lot more than I can think of off the top of my head) in the 3D matrix will influence its neighboring cells (and those cells must influence their neighbors and so on until you reach the edges of the zone being modeled) at each time-step. If this is true then sooner or later a model will reach a "large change" state where one or more cells change in a way that critically influences the entire model. |
Send message Joined: 7 Aug 04 Posts: 2185 Credit: 64,822,615 RAC: 5,275 |
[quote]These N216 models seem to have semi-regularly spaced time-steps that take more than 4x the CPU time of most of the other time-steps. I think it is a little bit like the "key frames" used to encode video. It does make sense if you consider that in the models every "cell" (temp, air pressure, humidity, and a lot more than I can think of off the top of my head) in the 3D matrix will influence its neighboring cells (and those cells must influence their neighbors and so on until you reach the edges of the zone being modeled) at each time-step. If this is true then sooner or later a model will reach a "large change" state where one or more cells change in a way that critically influences the entire model.[/quote] Those are the timesteps that include the radiation calculations. Most of the models do that calculation every 3 hours. The hadam4/hadam4h models are global models with no regional component. The Weather At Home (WAH) models that run on Windows have both global and regional components, where the global model (at a lower resolution) is run for one day, then the regional model grid is run for that day, with boundary conditions from the global model, then on to the next day, etc. |
Send message Joined: 11 Dec 19 Posts: 108 Credit: 3,012,142 RAC: 0 |
Thank you for the clarifications. One of the reasons I love DC projects so much is that I can always learn something new. |
Send message Joined: 15 May 09 Posts: 4535 Credit: 18,976,682 RAC: 21,948 |
And, getting back to the trickles, Andy emailed yesterday to say that the problem should be sorted. |
Send message Joined: 18 Jul 13 Posts: 438 Credit: 25,620,508 RAC: 4,981 |
Hi. It seems there might be a problem with trickles. Since 21-22 March I have at least 3 N216 tasks that do not show their 3rd and 4th trickles on the web, even though they finished successfully and the upload queues are empty. Here is an example: https://www.cpdn.org/cpdnboinc/result.php?resultid=21871312 |
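
For anyone who wants to redo the arithmetic from the thread (progress in % per hour converted to sec/TS, and the spacing of the 25% trickle points), here is a minimal Python sketch. The function names are made up for illustration, and the total number of timesteps is an assumption: the thread never states it, so the value used below is hypothetical and should be replaced with the real figure for your model.

```python
def seconds_per_timestep(percent_per_hour: float, total_timesteps: int) -> float:
    """Average elapsed seconds per timestep implied by a progress rate.

    total_timesteps is NOT given anywhere in the thread; the caller must
    supply it (the value used in the demo below is only an assumption).
    """
    if percent_per_hour <= 0 or total_timesteps <= 0:
        raise ValueError("both arguments must be positive")
    hours_for_full_run = 100.0 / percent_per_hour  # hours to go from 0% to 100%
    return hours_for_full_run * 3600.0 / total_timesteps


def trickle_interval_days(estimated_run_days: float, fraction: float = 0.25) -> float:
    """Days between trickles when one is shown every `fraction` of the run."""
    return estimated_run_days * fraction


if __name__ == "__main__":
    # Hypothetical timestep count, chosen only to illustrate the calculation.
    ASSUMED_TOTAL_TIMESTEPS = 34_560

    rate = 0.56  # % per hour, as reported in the thread
    print(f"{seconds_per_timestep(rate, ASSUMED_TOTAL_TIMESTEPS):.1f} sec/TS")

    # 14-day estimated completion with trickles shown at 25% intervals
    print(f"{trickle_interval_days(14):.1f} days between trickles")
```

With the assumed timestep count, 0.56% per hour works out to roughly 18.6 sec/TS, which sits inside the 17 to 19 sec/TS range quoted above. A 14-day run with trickles at 25% intervals gives 3.5 days between trickles, matching the figure in the thread.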
©2024 cpdn.org