Message boards : Number crunching : Download Checksum Error
Message board moderation
Previous · 1 · 2 · 3 · Next
Author | Message |
---|---|
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Please DON'T post email addresses on these web pages. Well, unless you really WANT spammers and their automated search programs to find them. :( |
Send message Joined: 29 Sep 04 Posts: 2363 Credit: 14,611,758 RAC: 0 |
Yes, what a nuisance about having the daily task quota reduced when it isn't your fault. Just for a moment's light relief I'll show you somebody who has no difficulty grabbing models. Art, I wouldn't mess with the computer that has two models running on 7.0.55. Richard's working on the problem and if he needs extra evidence of problems he'll post back and ask one or both of you to do whatever is required. He's probably doing some testing of his own. Take a backup of the complete contents of the BOINC Data folder now because if the problem turns out to affect uploads as well you may need to restore it. Exit completely from BOINC before copying for the backup. Cpdn news |
Send message Joined: 16 Oct 11 Posts: 254 Credit: 15,954,577 RAC: 0 |
Sorry...I've deleted my email addresses from my post. I still don't "get" how to post an image (jpg) file which is local to my computer. I'm obviously suffering from a dumb user error... I understand the format is but how do I refer to a local file? I've tried but that doesn't seem to work Any help appreciated. Art Masson |
Send message Joined: 16 Oct 11 Posts: 254 Credit: 15,954,577 RAC: 0 |
Trying again I understand the format is (img]http://example.com/pic.jpg[/img) but how do I refer to a local file? I've tried (img]file:///C:/eventlog1.jpg[/img) but that doesn't seem to work Any help appreciated. Art Masson |
Send message Joined: 29 Sep 04 Posts: 2363 Credit: 14,611,758 RAC: 0 |
You can show files and photos from your own computer on the independent phpBB forum (the one that's down) but I don't think you can here. AFAIK on this forum you have to upload files or pics to a website or a dropbox and link to that. Cpdn news |
Send message Joined: 16 Oct 11 Posts: 254 Credit: 15,954,577 RAC: 0 |
OK, Thanks. I'll just let it to the experts to fix the problem and assume someone will ask if any more info is needed. Thanks so much for your help Art Masson |
Send message Joined: 1 Jan 07 Posts: 1061 Credit: 36,719,896 RAC: 7,946 |
I've sent a report to the boinc_alpha bug reporting list, with copies for the CPDN staff. There appears to be a incompatibility between what the CPDN server is sending out and what these late (v7.0.55 to v7.0.59) clients expect to receive. I expect it could be fixed by changing either the server or the client, but it would be very difficult for individual users to fix it on their home machines. So, either go back to a client version which is known to have worked for CPDN in the past, or wait for news that one party or other has changed the things under their control. The problem is in the area of downloading new tasks - it should be safe to run existing tasks, and to upload/report them when done. |
Send message Joined: 29 Sep 04 Posts: 2363 Credit: 14,611,758 RAC: 0 |
Thank you very much for your time, Richard. We know where we stand and what we have to do. The BOINC main page: http://boinc.berkeley.edu/download.php is still offering 7.0.28 which works with our models that contain some errors server-end. As far as I can see this problem affects all current CPDN models and probably all versions of BOINC after 7.0.28. If you have running models, keep the BOINC version you've got until they finish and set CPDN in the Projects tab to No New Tasks. Depending on the time it takes Andy to fix things in Oxford, and bearing in mind that it's a holiday weekend and next week he also has to grapple with the iframe exploit problem, you will have to decide whether to go back to 7.0.28 now or later. Any version after 7.0.28 is an alpha test version so you'll have to change version at some point. Back up the complete contents of your BOINC Data folder after exiting fully from BOINC before changing version. If you have running models and by chance they don't reappear after the version downgrade you'll be able to delete the entire contents of the new data folder and copy into it the contents that you saved. Do not try to back up and restore individual models. I've seen the instructions, which were for BOINC 6 or even 5, and they would fill at least two screens. Let's say this doesn't work. I repeat - we back up and restore the entire contents of the BOINC Data folder. If you have 7.0.28 or any version lower than that and want to run climate models, do not upgrade your BOINC to a version higher than 7.0.28 until this problem is fixed. Many thanks to WB and Art for reporting your problems and explaining things clearly. If anyone else is suffering from the same failed downloads with MD5 checksum errors, especially if you don't know what to do, please post here. Cpdn news |
Send message Joined: 1 Jan 07 Posts: 1061 Credit: 36,719,896 RAC: 7,946 |
One small note for volunteers who crunch for more than one project. This affects people who use graphics cards (GPUs) for crunching, and who use 'Anonymous Platform' - usually optimised applications. SETI is the project which comes to mind, but there are others too. Shortly after v7.0.28, there was a change made to the estimate that BOINC makes about the speed of your GPU. If you download (GPU, anonymous platform) work with v7.0.28 or before, and then upgrade to a later version of BOINC, the tasks may fail with 'maximum time exceeded'. New work downloaded with a new version of BOINC is fine. I'd just suggest that you avoid crossing the boundary between v7.0.28 and later BOINCs more often than you strictly have to, especially if you use GPUs. I haven't found which version of BOINC introduced this MD5 problem: there's nothing which obviously catches my eye in the list of changes (and the likely-looking one happened a long time before v7.0.28). But there might be one in the middle which would suit everyone until we get this fully sorted out. |
Send message Joined: 16 Oct 11 Posts: 254 Credit: 15,954,577 RAC: 0 |
You guys are great. Thanks so much...time to take some time off for Easter now that we have some good work-around solutions (notably back migrating to 7.0.28) once any processing of new work-units is completed...and marking our clients to receive no new tasks). Wondering what happens to work units that fail this way...hopefully they are not "lost" and are resent to other clients until there is a successful downloading! One final note. I do have a machine (which is remote from my home and in Atlanta) which has been running 7.0.44. It is in fact processing 4 work units at this time, but I note in the log multiple download failures -- which I can only assume is the check-sum problem. I therefore suspect the problem was introduced with client version 7.0.44 or earlier. Art Masson St. Charles, IL VP IT Strategy Kellogg Company |
Send message Joined: 1 Sep 04 Posts: 161 Credit: 81,522,141 RAC: 1,164 |
This is small data sample, but after downgrading to 7.0.44 I have downloaded two tasks successfully. I need at least 7.0.42 because of the app_config.xml capabilities introduced in that version for GPU's. Both 7.0.54 and 7.0.55 show the following fix: "Selective upgrades of the core client based on md5 hash compares." Maybe those "fixes" broke something else. |
Send message Joined: 1 Jan 07 Posts: 1061 Credit: 36,719,896 RAC: 7,946 |
Both 7.0.54 and 7.0.55 show the following fix: "Selective upgrades of the core client based on md5 hash compares." May I ask where you're reading that from? I don't recognise the language from my usual BOINC reading, and in particular it doesn't appear in BOINC 7 Change Log and news. [edit - except for the special Android version of the client, which doesn't apply here. CPDN on a smartphone, anyone?] My attention is lighting on 7.0.54 --> 7.0.55: client (and maybe others): fix XML parsing bug when skipping large unrecognized elements. 7.0.52 --> 7.0.54: MGR (and possibly other components): XML parsing fix. but the BOINC code-searching tool seems to be broken at the moment, to add to our other woes. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Thanks for researching this Richard. Subtle bugs like this are the bane of programmers. |
Send message Joined: 1 Sep 04 Posts: 161 Credit: 81,522,141 RAC: 1,164 |
Richard Haselgrove - The "fixes" I referenced are at - http://boinc.berkeley.edu/dev/forum_thread.php?id=8203 Look at the 7.0.55 and 7.0.54 lists of fixes. |
Send message Joined: 1 Jan 07 Posts: 1061 Credit: 36,719,896 RAC: 7,946 |
Richard Haselgrove - That's the BOINC 7 for Android Change Log and news thread. The MD5s they're talking about there are used to find differences in the source code, and merge the work of two different teams of developers together. They're nothing to do with checking the integrity of climate data being transferred over the internet to our computers for further processing. (Same principle, but very different implementation). On a different topic entirely, two HADcm3n models seen to have downloaded successfully to a v6.12.34 host while I was out at dinner. That seems to confirm that BOINC's 'new' XML parser is - may I say? - buggered, but the old one is still working. Other developers have pitched in to support that interpretation. |
Send message Joined: 1 Sep 04 Posts: 161 Credit: 81,522,141 RAC: 1,164 |
Richard - I guess I didn't look at the title close enough. I did a search on "md5" and found that thread. Sorry for the incorrect info. |
Send message Joined: 1 Jan 07 Posts: 1061 Credit: 36,719,896 RAC: 7,946 |
Berkeley have made BOINC v7.0.60 available for download, specifically in the expectation of solving the CPDN / MD5 download problem. I've got it installed on a couple of machines for general testing - seems safe enough to install - but there are no CPDN models available to download at the moment, so I can't confirm that the CPDN problem is fixed yet. Further news as we get it. |
Send message Joined: 16 Oct 11 Posts: 254 Credit: 15,954,577 RAC: 0 |
Richard -- would it help for WB and I to load 7.0.60 on one of our machines now to help validate the problem is fixed (or not) when CPDN models become available? Art Masson |
Send message Joined: 1 Jan 07 Posts: 1061 Credit: 36,719,896 RAC: 7,946 |
Richard -- would it help for WB and I to load 7.0.60 on one of our machines now to help validate the problem is fixed (or not) when CPDN models become available? If you're prepared to live on the cutting edge, yes, it would probably help if more than one pair of eyes was keeping a watching brief. Since I wrote that, I've independently heard from Andy Bowery that he has changed the CPDN server settings so that the problem should not recur when new work is generated. However, I suspect that if an old job fails (for any reason) on a volunteer's machine, and a replacement is sent out, that replacement might still have the trigger problem. So, it would be useful not only to run v7.0.60, but to look carefully at any new task that arrives - assuming it downloads successfully, try to work out whether it's Andy's fix (newly created work) or David's fix (handle the old work properly) which has been effective in that particular case. |
Send message Joined: 16 Oct 11 Posts: 254 Credit: 15,954,577 RAC: 0 |
Thanks. I'll load 7.0.60 on one of my machines this evening. Living Large here! Art Masson |
©2024 cpdn.org