Message boards : Number crunching : wah tasks failed
Author | Message |
---|---|
Joined: 28 May 14 Posts: 34 Credit: 705,936 RAC: 0 |
The first WAH task on each of my PCs erred. Is this normal for these tasks? I do have a WAH task on each PC that seems to be running alright for now, but it was weird that the original task failed on each computer. I remember when I first joined the project we would have a lot of errors because the models weren't accurate. Is that what is happening with these WAH tasks, or is it my PCs or settings that are screwed up? NOTE: I'm just talking about the WAH tasks. The errors on my tasks page from last month are from a bad BOINC installation on the new PC and interrupted tasks from a BOINC upgrade on the old PC. |
Joined: 29 May 08 Posts: 128 Credit: 6,289,876 RAC: 0 |
Andrew Sanchez wrote: "First wah task on each of my pcs erred..." Interesting. The first one on one of my PCs did the same thing (the other two were okay). I had the same kind of error as you had -- a series of what look like upload failures similar to the following: upload failure: <file_xfer_error> <file_name>wah2_eu2_g79c_1967_1_010161324_0_1.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> |
Joined: 5 Aug 04 Posts: 1496 Credit: 95,522,203 RAC: 0 |
Most of mine are running but a few failed. I'm working up an email to staff. "We have met the enemy and he is us." -- Pogo Greetings from coastal Washington state, the scenic US Pacific Northwest. |
Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
<error_code>-161 (not found)</error_code> That's not a failure to upload. It's a failure (by BOINC) to find the zip file in order to upload it. The usual reason for this is that the model crashed before that zip could be created. First guess in the case of both posters is that it's a computer problem. We'll know more when the first zips/trickles arrive, or more fail to do so. ************ While previewing this, I see that astroWX has posted. So more info is coming in now. |
Joined: 28 May 14 Posts: 34 Credit: 705,936 RAC: 0 |
Just got another failure on my older laptop. That is 2 failed tasks on that laptop (computer name "Andy") and 1 fail so far on the new laptop ("Beats"). Seems to be a problem with the zip file each time. Watching a movie on Beats right now so task is suspended at 1.189% for now. Downloaded 3 more tasks on Andy and 2 of them are running. We'll see how they go... |
Joined: 22 Feb 06 Posts: 493 Credit: 31,669,049 RAC: 10,904 |
4 WAH tasks running at the moment but no trickles/zips uploaded yet. |
Joined: 1 Jan 07 Posts: 1066 Credit: 36,887,369 RAC: 1,533 |
"Just got another failure on my older laptop. That is 2 failed tasks on that laptop (computer name "Andy") and 1 fail so far on the new laptop ("Beats")." We can't see your computer names, so we'll have to guess which is which. But I see that all computers on your account have been upgraded to Windows 10, and this might be significant. The project staff are going to check in the morning whether there is a significant correlation across the database between running Windows 10 and these new task failures. |
Joined: 15 Nov 10 Posts: 43 Credit: 6,118,949 RAC: 0 |
All of the tasks I ran failed after 3 or 4 minutes of running. It's a Windows 10 laptop. |
Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Thanks for that. Knowing how far into the work they were is useful. |
Joined: 5 Aug 04 Posts: 1496 Credit: 95,522,203 RAC: 0 |
A look ahead: Not only do these tasks have a covey of large downloads, my first .zip upload is 69.04 MB! (This won't do my DSL link any good . . .) |
Joined: 29 May 08 Posts: 128 Credit: 6,289,876 RAC: 0 |
Les Bayliss wrote: ...First guess in the case of both posters is that it's a computer problem... I'm willing to concede that the error my PC generated was computer related -- perhaps some transient condition or just bad luck. This same host is 8-10 hours into 4 other WAH tasks with no apparent problems (that I know of). My lone Win10 PC seems to be blissfully grinding away, 6-7 hours into two of its WAHs. |
Joined: 31 Dec 07 Posts: 1152 Credit: 22,363,583 RAC: 5,022 |
The new WAH2 tasks seem to be running fine on my Win7 machine. Three of them are at 4.4% and 16+ hours. No problems. No graphics, so no data on s/TS. No trickles yet. Does anyone know how many zip files these tasks produce? |
Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Jim: The failed tasks say 13 zips. ***************** ritterm: There are now failures on Windows 7 and Windows 10. Also a "signal 11" and a "model crashed". So, it looks random. For now. As Richard said, the project people are going to dig into it tomorrow. And then we have the weekend. (Surprise!) ***************** astro: Linux uploads for the hadam3prm3pm2t_eu variety are ~72 Megs, and ~92 for zip 13. Ahhh, the joys of big stash files. Computing power is sort of keeping up with the researchers' wants, but cheap 'net speeds are not. Although, while my phone line was dead (wires broken in 2 spots), I got hold of a gadget with WiFi input, and output to the nearest 4G phone tower. Now THAT's fast. But expensive. I'm only using it now for my day-to-day activity, which means I don't get held up because BOINC is hogging the land line connection with zip uploads. Which it's doing right now, and will be for another 45 minutes or so. |
Joined: 8 Jul 05 Posts: 33 Credit: 1,274,211 RAC: 0 |
Just had the one WU, but it errored out on Win 7 64-bit with the 13 absent zip file errors. |
Joined: 15 Feb 06 Posts: 137 Credit: 35,517,114 RAC: 10,523 |
Running 3 WAH2 WUs at present on Windows 10 64 bit. All sent their first ZIP with no problems. |
Joined: 29 May 08 Posts: 128 Credit: 6,289,876 RAC: 0 |
How often should trickles be uploaded with the WAHs? I have 9 tasks across three hosts that have been running for 15-20 hours and I see no trickles logged. And, I see no upload attempts in the BOINC message logs. |
Joined: 15 May 09 Posts: 4571 Credit: 19,039,635 RAC: 18,944 |
Not sure but for me a lot of tasks only trickle once a day or twice at the most these days. I am sure that someone will have an answer soon and be able to say how long it took them with their hardware for comparison. |
Joined: 15 May 09 Posts: 4571 Credit: 19,039,635 RAC: 18,944 |
I notice these tasks have all gone and the number of tasks in progress hasn't gone up enough to account for this. Have they been recalled? |
Joined: 1 Jan 07 Posts: 1066 Credit: 36,887,369 RAC: 1,533 |
"How often should trickles be uploaded with the WAHs? I have 9 tasks across three hosts that have been running for 15-20 hours and I see no trickles logged. And, I see no upload attempts in the BOINC message logs." With the great variation in computer speeds, it's probably best to answer that in terms of progress made, rather than absolute time. With 12 'simulation months' to be completed by each model, the trickle+upload pair should be generated at around 8.3%, 16.7%, 25% ... progress. My leader is still only at 4.064% (after 21 hours), so it will be some time before I can fill in the third decimal place for the actual moment when it happens. |
Joined: 28 May 14 Posts: 34 Credit: 705,936 RAC: 0 |
I didn't have any failures overnight. Got one task on the AMD laptop running at 3.368% and 1 task on the Intel laptop running at 1.822%. I think the failures yesterday all happened below 2% completion. |
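
The trickle schedule described earlier in the thread (12 simulation months, with one trickle+upload pair expected per month) comes down to simple arithmetic. The sketch below is only an illustration, not project code: the function names are made up, and the time estimate assumes progress is roughly linear in elapsed time, which may not hold on a real host.

```python
def trickle_progress_points(simulation_months: int = 12) -> list[float]:
    """Progress percentages at which each simulation month would complete.

    Assumes the months are evenly spaced across the run (an assumption,
    based on the "12 simulation months" figure quoted in the thread).
    """
    return [round(100 * m / simulation_months, 2)
            for m in range(1, simulation_months + 1)]


def hours_to_first_trickle(progress_percent: float,
                           elapsed_hours: float,
                           simulation_months: int = 12) -> float:
    """Rough estimate of total hours until the first trickle.

    Hypothetical helper: extrapolates linearly from current progress.
    """
    if progress_percent <= 0:
        raise ValueError("progress_percent must be positive")
    first_point = 100 / simulation_months
    return elapsed_hours * first_point / progress_percent


points = trickle_progress_points()
print(points[:3])  # [8.33, 16.67, 25.0]
```

Plugging in the figures from the post above (4.064% after 21 hours), `hours_to_first_trickle(4.064, 21)` suggests the first trickle at roughly 43 hours of run time on that host, under the linear-progress assumption.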
©2025 cpdn.org