Questions and Answers : Windows : Visual Fortran Run-Time Error
Author | Message |
---|---|
Send message Joined: 8 Aug 04 Posts: 14 Credit: 14,420,896 RAC: 0 |
I'm getting exactly the same error on every single Coupled Model Full resolution job I get. It happens within a few seconds of start. It looks like climate.cpdc is an initial data file for the job, and a batch of bad WUs got out and are being recycled indefinitely. Hopefully they will be tracked down and killed soon. (Is there a function that pulls a job out of circulation when it returns n times with error status?) It's a pain because it occupies a slot doing nothing useful until I click away all 6 Visual Fortran messages. --miw |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
The FORTRAN error, in this case, is because there's a problem running these models on Windows systems. miw: Yes. Look at the Workunit page for any model. The 6th item down contains the number of times certain things happen for a model. PS There's another FORTRAN thread a couple below this one, where it's talked about a bit more. And there's a thread in Number crunching on the model type in general: hadcm3n Full Res Ocean out of memory error. As you can see from there, there's been a variety of error messages. |
Send message Joined: 8 Aug 04 Posts: 14 Credit: 14,420,896 RAC: 0 |
OK. Thanks. I've disabled this application for now. I had read the other threads. I don't actually mind if the odd unit fails because the envelope is being pushed, and I was aware that stopping and starting these models was somewhat risky, so I could also accept the odd bomb at PC restart, which happens pretty rarely anyhow. But this behaviour is 100% mortality, which I had not seen before. I had assumed it was because there was a group of units that were bad, and that they were all that was left in the pool. --miw |
Send message Joined: 19 Feb 06 Posts: 1 Credit: 844,121 RAC: 0 |
As others are having this problem currently I'm just adding a 'me too'. I've been running climate-prediction on this PC for a couple of years with no significant problem. This morning I turned on the screen to see: "forrtl severe (17) unit 5 on C:\...\hadcm3s_3if3_2005_2_009099917\jobs\climate.cpdc, line 393, position 19 at: hadcm3s_um_7.24_w 007CC8A, 00774F50, 0077412A, 00758F59, 00710A43, 0053C726, 0053E2C9, 0071AC3B, KERNELBASE.dll 749014DE, kernel32.dll 75F41194, hadcm3s_um_7.24_w 0071AC3B". The job was ~60% complete; on closing the popup it reappeared about 10 times, then the job jumped to 100% and finished. I'm waiting to see what happens when the next climate-prediction job starts. |
Send message Joined: 5 Sep 04 Posts: 21 Credit: 2,536,666 RAC: 1,507 |
I am also getting this. The last two work units have terminated with errors: the first after three quarters of the unit completed, the second within four hours of work.
13/11/2014 02:17:04 | | cc_config.xml not found - using defaults
13/11/2014 02:17:04 | | Starting BOINC client version 7.2.42 for windows_x86_64
13/11/2014 02:17:04 | | log flags: file_xfer, sched_ops, task
13/11/2014 02:17:04 | | Libraries: libcurl/7.25.0 OpenSSL/1.0.1 zlib/1.2.6
13/11/2014 02:17:04 | | Data directory: C:\ProgramData\BOINC
13/11/2014 02:17:04 | | Running under account Jurgen
13/11/2014 02:17:04 | | CUDA: NVIDIA GPU 0: GeForce GTX 650 (driver version 344.65, CUDA version 6.5, compute capability 3.0, 1024MB, 901MB available, 823 GFLOPS peak)
13/11/2014 02:17:04 | | OpenCL: NVIDIA GPU 0: GeForce GTX 650 (driver version 344.65, device version OpenCL 1.1 CUDA, 1024MB, 901MB available, 823 GFLOPS peak)
13/11/2014 02:17:04 | | Host name: Jurgen-PC
13/11/2014 02:17:04 | | Processor: 4 GenuineIntel Intel(R) Core(TM)2 Quad CPU Q9650 @ 3.00GHz [Family 6 Model 23 Stepping 10]
13/11/2014 02:17:04 | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 cx16 sse4_1 syscall nx lm vmx smx tm2 pbe
13/11/2014 02:17:04 | | OS: Microsoft Windows 7: Ultimate x64 Edition, Service Pack 1, (06.01.7601.00)
13/11/2014 02:17:04 | | Memory: 15.00 GB physical, 89.99 GB virtual
13/11/2014 02:17:04 | | Disk: 931.22 GB total, 357.86 GB free
13/11/2014 02:17:04 | | Local time is UTC +0 hours
13/11/2014 02:17:04 | climateprediction.net | URL http://climateprediction.net/; Computer ID 1317381; resource share 100
13/11/2014 02:17:04 | | Reading preferences override file
13/11/2014 02:17:04 | | Preferences:
13/11/2014 02:17:04 | | max memory usage when active: 11519.29MB
13/11/2014 02:17:04 | | max memory usage when idle: 13823.15MB
13/11/2014 02:17:04 | | max disk usage: 30.00GB
13/11/2014 02:17:04 | | (to change preferences, visit a project web site or select Preferences in the Manager)
13/11/2014 02:17:04 | | Not using a proxy
13/11/2014 02:18:15 | climateprediction.net | Computation for task hadcm3n_scuf_1940_40_009114654_3 finished
13/11/2014 02:18:15 | climateprediction.net | Output file hadcm3n_scuf_1940_40_009114654_3_3.zip for task hadcm3n_scuf_1940_40_009114654_3 absent
13/11/2014 02:18:15 | climateprediction.net | Output file hadcm3n_scuf_1940_40_009114654_3_4.zip for task hadcm3n_scuf_1940_40_009114654_3 absent
13/11/2014 02:20:22 | climateprediction.net | update requested by user
13/11/2014 02:20:23 | climateprediction.net | Sending scheduler request: Requested by user.
13/11/2014 02:20:23 | climateprediction.net | Reporting 1 completed tasks
13/11/2014 02:20:23 | climateprediction.net | Not requesting tasks: don't need
13/11/2014 02:20:26 | climateprediction.net | Scheduler request completed
13/11/2014 03:15:01 | | Project communication failed: attempting access to reference site
13/11/2014 03:15:03 | | Internet access OK - project servers may be temporarily down.
13/11/2014 03:15:12 | | Project communication failed: attempting access to reference site
13/11/2014 03:15:13 | | Internet access OK - project servers may be temporarily down.
13/11/2014 07:29:43 | climateprediction.net | Sending scheduler request: To fetch work.
13/11/2014 07:29:43 | climateprediction.net | Requesting new tasks for CPU
13/11/2014 07:29:46 | climateprediction.net | Scheduler request completed: got 1 new tasks
13/11/2014 07:29:48 | climateprediction.net | Started download of hadcm3s_4r8h_1985_2_009181663.zip
13/11/2014 07:29:48 | climateprediction.net | Started download of SPARC_O3_RCP4.5_1979_2021_bs.gz
13/11/2014 07:29:50 | climateprediction.net | Finished download of hadcm3s_4r8h_1985_2_009181663.zip
13/11/2014 07:29:50 | climateprediction.net | Started download of 40lu_1985_rd.astart.gz
13/11/2014 07:29:51 | climateprediction.net | Finished download of SPARC_O3_RCP4.5_1979_2021_bs.gz
13/11/2014 07:29:51 | climateprediction.net | Started download of DMSSO2NH3_1970_2030_EAS_2.gz
13/11/2014 07:29:55 | climateprediction.net | Finished download of 40lu_1985_rd.astart.gz
13/11/2014 07:29:55 | climateprediction.net | Started download of sulpc_oxidants_19_A2_1990f.gz
13/11/2014 07:30:08 | climateprediction.net | Finished download of sulpc_oxidants_19_A2_1990f.gz
13/11/2014 07:30:08 | climateprediction.net | Started download of 40lu_1985_rd.ostart.gz
13/11/2014 07:30:17 | climateprediction.net | Finished download of DMSSO2NH3_1970_2030_EAS_2.gz
13/11/2014 07:30:17 | climateprediction.net | Finished download of 40lu_1985_rd.ostart.gz
13/11/2014 07:30:43 | climateprediction.net | Starting task hadcm3s_4r8h_1985_2_009181663_3
13/11/2014 13:16:50 | climateprediction.net | Computation for task hadcm3s_4r8h_1985_2_009181663_3 finished
13/11/2014 13:16:50 | climateprediction.net | Output file hadcm3s_4r8h_1985_2_009181663_3_1.zip for task hadcm3s_4r8h_1985_2_009181663_3 absent
13/11/2014 13:16:50 | climateprediction.net | Output file hadcm3s_4r8h_1985_2_009181663_3_2.zip for task hadcm3s_4r8h_1985_2_009181663_3 absent
"All man born has a right to life and no man born has the right to take that life" |
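For anyone wanting to spot these failures without reading the whole event log, here is a minimal Python sketch that picks out tasks whose output files are reported as absent. It assumes the log lines have the `date time | project | message` layout shown in the post above; the function name `tasks_with_absent_output` is made up for illustration and is not part of BOINC.

```python
import re

# One event-log line: "<dd/mm/yyyy hh:mm:ss> | <project or empty> | <message>"
# (layout assumed from the log pasted above)
LINE_RE = re.compile(
    r"^(\d{2}/\d{2}/\d{4} \d{2}:\d{2}:\d{2}) \|\s*(.*?)\s*\|\s*(.*)$"
)
# The message text that marks a missing result file
ABSENT_RE = re.compile(r"^Output file (\S+) for task (\S+) absent$")


def tasks_with_absent_output(log_text):
    """Return {task_name: [missing output files]} for the given log text."""
    missing = {}
    for raw in log_text.splitlines():
        line = LINE_RE.match(raw.strip())
        if not line:
            continue  # not an event-log line, skip it
        absent = ABSENT_RE.match(line.group(3).strip())
        if absent:
            filename, task = absent.group(1), absent.group(2)
            missing.setdefault(task, []).append(filename)
    return missing


if __name__ == "__main__":
    sample = "\n".join([
        "13/11/2014 02:17:04 | | Not using a proxy",
        "13/11/2014 02:18:15 | climateprediction.net | Computation for task "
        "hadcm3n_scuf_1940_40_009114654_3 finished",
        "13/11/2014 02:18:15 | climateprediction.net | Output file "
        "hadcm3n_scuf_1940_40_009114654_3_3.zip for task "
        "hadcm3n_scuf_1940_40_009114654_3 absent",
    ])
    for task, files in tasks_with_absent_output(sample).items():
        print(task, "is missing", ", ".join(files))
```

A task that shows up here finished without producing its result zips, which is how both failed work units appear in the log above.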
Send message Joined: 5 Sep 04 Posts: 21 Credit: 2,536,666 RAC: 1,507 |
If you ignore the 'Intel(R) Visual Fortran' notice (leave it alone), the work unit will still run. It appears to be a notice letting you know the graphics are not working. But if you click the 'OK' or close buttons it will terminate the work unit and return it as a computation error. "All man born has a right to life and no man born has the right to take that life" |
Send message Joined: 18 Feb 11 Posts: 44 Credit: 9,975,761 RAC: 0 |
I am getting the aforementioned errors daily. As you mention, Ancient One, if nothing is done, the tasks continue to run and complete. The trouble is the error messages are so frequent and I can't get rid of them - clicking on OK or trying to close the windows by hitting the X at upper right does nothing. The only way to take them off my screen is to use Task Manager to kill them individually, a real pain since there are usually four of them and they pop up after any additional run time has elapsed, or to abort the tasks. The tasks are always, without exception, hadCM3_short 7.24, so I wonder if there is a way to filter so that I don't receive them. |
Send message Joined: 15 May 09 Posts: 4542 Credit: 19,039,635 RAC: 18,944 |
If you go to Your Account, the 9th link down from the top in the left hand menu, and then click on "climateprediction.net preferences", you can choose which model types you receive and which you don't. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
You also need to untick the "If no work for selected ..." or this option may give them to you anyway. And, if you don't accept the "short" models, then you won't get anything at the moment, as there's no other work for Windows computers. See the Server Status page in the blue menu to the left. |
Send message Joined: 15 May 09 Posts: 4542 Credit: 19,039,635 RAC: 18,944 |
Thanks for the additions Les, I didn't think to check the status page before replying. |
Send message Joined: 16 Jan 10 Posts: 1084 Credit: 7,903,241 RAC: 2,063 |
If BOINC is installed in service mode then the FORTRAN error messages will not appear on screen: however, I can't remember whether HADCM3S is one of the applications that won't run in service mode ... |
Send message Joined: 18 Feb 11 Posts: 44 Credit: 9,975,761 RAC: 0 |
Thanks, I will check that out. |
Send message Joined: 25 Jan 15 Posts: 1 Credit: 72,171 RAC: 0 |
Got a whole swag of Visual Fortran errors today. Completely hung my machine. Had to do a restart. Checking stdoutdae.txt, I note the following for that time: 01-Mar-2015 13:19:05 [climateprediction.net] Task hadam3p_anz_n91c_2013_1_009526489_0 exited with zero status but no 'finished' file. I haven't done anything as yet with regard to clearing any errors, such as resetting the project. Advice please. Alison PS During a major storm last night we had a complete power cut, lasting 15 to 20 minutes. Everything went down suddenly. The above may be a result of this event. |
Send message Joined: 5 Aug 04 Posts: 1496 Credit: 95,522,203 RAC: 0 |
Hi, Alison, Welcome to the project and the boards. That is a diagnostic message we usually ignore. I see it occasionally and ignore it. It has always been transient, not persistent, and I've never reset the project in response. However, it is not unusual for power interruptions to result in failed tasks on restart. (Power loss means the many files in memory are not saved and, often, the result is a mismatched file-set on disk.) "We have met the enemy and he is us." -- Pogo Greetings from coastal Washington state, the scenic US Pacific Northwest. |
Send message Joined: 12 May 05 Posts: 34 Credit: 1,436,930 RAC: 2,182 |
The last 7 WUs on computer http://climateapps2.oerc.ox.ac.uk/cpdnboinc/show_host_detail.php?hostid=1347460 have thrown run-time errors. The client shows the task advancing but Task Manager shows no CPU usage. Finally, after I respond to the error dialogs, the WU ends in a computation error. Should I reset the project? Do you need more information about the errors submitted? |
Send message Joined: 14 Apr 05 Posts: 31 Credit: 16,491,691 RAC: 0 |
I've recently been plagued with Fortran run-time errors, but only on my laptop, never on the desktop. (The laptop is Vista x86, whereas the desktop is Windows 7 x64.) Now I can see why misbehaved models could result in this, but my problem is that the laptop is only used intermittently by me. Its main use is to run CPDN tasks. Just now I checked the laptop to find that one of the tasks had crashed a couple of days ago, and it had been sitting there since then waiting for me to click on OK several times to dismiss the message boxes. By no means the first time this has happened either. I must have lost weeks of work to this problem. Is there any way to disable the message boxes, so that the model crashes and burns silently? Brian |
Send message Joined: 16 Jan 10 Posts: 1084 Credit: 7,903,241 RAC: 2,063 |
The message does not appear in service mode. However, most of the applications don't work in service mode, so that's a solution that's not a solution. So the real solution is to find what causes the message: in one instance for me it was a berserk printer driver that was running at 100% CPU - so the cause could be practically anything. Which isn't a solution either. So this topic continues. |
Send message Joined: 12 May 05 Posts: 34 Credit: 1,436,930 RAC: 2,182 |
"The message does not appear in service mode. However, most of the applications don't work in service mode, so that's a solution that's not a solution. So the real solution is to find what causes the message: in one instance for me it was a berserk printer driver that was running at 100% CPU - so the cause could be practically anything. Which isn't a solution either. So this topic continues." I haven't seen the FORTRAN run-time errors and the message boxes demanding input for weeks, but this machine managed to complete 1 WU while another failed. 3 are running and are 130 to 165 hours in. The machine has been up for days, maybe weeks, and this browser instance has 40-50 tabs open without issues. I ran a game a few days back, played videos, and have the print spooler shut down along with various other unnecessary services. I'm trying to say that the machine is working nicely and no other projects seem to be failing WUs, so ... why the computing errors? |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Probably because no other project uses programs that are close to a million lines of source code, or are so complex in what they do. Add to this the auxiliary files, such as the new, more detailed analysis of the latest version of MOSES + Triffid, and you have a supercomputer program that doesn't tolerate desktop/laptop computers that aren't "just so". And, of course, there are also the failures due to planetary physics. Plus, sometimes there's a data file error, when one of the junior researchers mismatches what's in the data strings in several of the files. |
Send message Joined: 12 May 05 Posts: 34 Credit: 1,436,930 RAC: 2,182 |
"Probably because no other project uses programs that are close to a million lines of source code, or are so complex in what they do." There seems to be a problem with the WU itself. 4 different machines have failed it with an error while computing. Since it's approaching its 5th and last failure I'm not going to worry any more about it. If I see any more 'error while computing' it's good to be able to track the WU history. |
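The "5th and last failure" idea, which also answers the earlier question about pulling a job out of circulation after n errors, can be sketched roughly as below. This is only an illustration in Python: the limit of 5 is taken from the post above, and the names `WorkunitHistory`, `outcomes` and `is_retired` are hypothetical, not the actual BOINC server code or its database fields.

```python
from dataclasses import dataclass, field


@dataclass
class WorkunitHistory:
    """Outcomes of every task issued so far for one workunit (hypothetical)."""
    name: str
    outcomes: list = field(default_factory=list)  # "error" or "success"


def is_retired(history, max_errors=5):
    """True once the workunit has errored max_errors times.

    max_errors=5 is an assumption based on the post above; the real
    limit is whatever the Workunit page shows for that model.
    """
    errors = sum(1 for outcome in history.outcomes if outcome == "error")
    return errors >= max_errors


if __name__ == "__main__":
    wu = WorkunitHistory("example_wu", ["error"] * 4)
    print(is_retired(wu))        # four failures: still reissued
    wu.outcomes.append("error")
    print(is_retired(wu))        # fifth failure: no more copies sent out
```

Until the error count reaches the limit, a bad workunit keeps being reissued to new machines, which matches what the WU history shows in a case like this.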
©2024 cpdn.org