climateprediction.net home page
end process

Message boards : Number crunching : end process

enginerd

Joined: 31 Aug 04
Posts: 13
Credit: 134,268
RAC: 0
Message 19454 - Posted: 20 Jan 2006, 1:25:36 UTC

I accidentally did an "End this process" on my sulfur workunit (stupid non-optical mouse had dirt all in it). Are there any options to get this workunit going again? Can I back it up a few model days? Or is all hope lost?

;-^-(

sulphur_e0yj_000654427_0

Result ID 1329886
Workunit 871996

astroWX
Volunteer moderator
Joined: 5 Aug 04
Posts: 1496
Credit: 95,522,203
RAC: 0
Message 19458 - Posted: 20 Jan 2006, 3:29:06 UTC

If you have a backup, you can restore it and continue the run. It will continue to show as errored but the Trickles will be accepted, credit will be given (as soon as the Run passes its last posted Trickle), and the science results will be used.

If you have other active projects, they will also be returned to the backup point, because the entire BOINC folder must be restored.

"We have met the enemy and he is us." -- Pogo
Greetings from coastal Washington state, the scenic US Pacific Northwest.

enginerd
Joined: 31 Aug 04
Posts: 13
Credit: 134,268
RAC: 0
Message 19469 - Posted: 20 Jan 2006, 15:50:06 UTC

How exactly can I restore?

old_user71
Joined: 5 Aug 04
Posts: 19
Credit: 16,547
RAC: 0
Message 19471 - Posted: 20 Jan 2006, 16:35:03 UTC - in response to Message 19469.  

>>How exactly can I restore?

Have you got a backup copy of the BOINC folder?

enginerd
Joined: 31 Aug 04
Posts: 13
Credit: 134,268
RAC: 0
Message 19474 - Posted: 20 Jan 2006, 19:13:59 UTC

>>have you got a backup copy of the BOINC folder?

Ummm... from before the WU started!

Is there any way to delete some recent output files to trick BOINC into letting me start this WU again? 40.5 days is a lot of lost crunching.

Actually, the process I canceled was a non-running sulfur thread (aborted long ago) that was still present in Task Manager. When I accidentally killed it, it took the running thread with it.

Les Bayliss
Volunteer moderator
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 19475 - Posted: 20 Jan 2006, 20:12:12 UTC

To back up:

On a regular basis, suspend BOINC, wait until the model stops running, then exit BOINC and copy the entire BOINC folder to a safe location.

To restore:

Exit BOINC, delete the BOINC folder, copy the saved BOINC folder and paste it where the deleted folder was, then click the boincmgr icon in the BOINC folder to start it again.
The model will continue from the last checkpoint before the backup.

If you are running multiple projects, it gets messy.
See the BOINC Wiki (http://boinc-doc.net/boinc-wiki/index.php?title=Backup_BOINC) for details.
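
For anyone who would rather script the copy than do it by hand, here is a rough Python sketch of the backup and restore steps. It only does the folder copy, so BOINC must already be fully exited before either function is called. The function names, the timestamped folder naming, and the paths are illustrative assumptions, not anything BOINC itself provides.

```python
import shutil
import time
from pathlib import Path


def backup_boinc_folder(boinc_dir, backup_root):
    """Copy the whole BOINC folder into a new timestamped folder under backup_root.

    Hypothetical helper: exit BOINC first so no file changes mid-copy.
    """
    boinc_dir = Path(boinc_dir)
    stamp = time.strftime("%Y%m%d-%H%M%S")
    target = Path(backup_root) / f"{boinc_dir.name}-backup-{stamp}"
    # copytree creates the target (and missing parents); it fails if target exists.
    shutil.copytree(boinc_dir, target)
    return target


def restore_boinc_folder(backup_dir, boinc_dir):
    """Replace the current BOINC folder with a saved copy.

    Hypothetical helper: every project in the folder goes back to the backup point.
    """
    backup_dir = Path(backup_dir)
    boinc_dir = Path(boinc_dir)
    if not backup_dir.is_dir():
        raise FileNotFoundError(f"no backup folder at {backup_dir}")
    if boinc_dir.exists():
        shutil.rmtree(boinc_dir)  # delete the current (damaged) folder
    shutil.copytree(backup_dir, boinc_dir)  # put the saved copy in its place
    return boinc_dir
```

After a restore, start BOINC again as usual. Because the whole folder is replaced, this has the same side effect as the manual method: all projects return to the backup point.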

Thyme Lawn
Volunteer moderator
Joined: 5 Aug 04
Posts: 1283
Credit: 15,824,334
RAC: 0
Message 19477 - Posted: 20 Jan 2006, 20:57:00 UTC - in response to Message 19474.  
Last modified: 20 Jan 2006, 20:59:03 UTC

>>>>have you got a backup copy of the BOINC folder?

>>Ummm... from before the WU started!

>>Is there any way to delete some recent output files to trick BOINC into letting me start this WU again? 40.5 days is a lot of lost crunching.

>>Actually, the process I canceled was a non-running sulfur thread (aborted long ago) that was still present in Task Manager. When I accidentally killed it, it took the running thread with it.

Does your backup include sulphur_e0yj_000654427.zip or is it still on your system?

Do you have a client_state.xml file containing the 6 file_info sections and the workunit and result sections for that result?

If the answer to both is yes, it ought to be possible to recover the model in its current state, but you need to have a reasonable understanding of how everything hangs together. I've done it a few times, and if you need any help you can PM me the relevant client_state.xml files (in code blocks) on the TP forums.
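
As a quick way to check the second question, a small Python sketch like the one below can list which sections of a client_state.xml mention the workunit. It assumes the usual layout, where file_info and workunit sections carry a name element and result sections carry name and wu_name elements, and it assumes the file parses as well-formed XML, which may not hold for every copy. The function name is made up for illustration, and the script only reports what it finds; it does not attempt the recovery itself.

```python
import xml.etree.ElementTree as ET


def summarize_workunit(client_state_xml, wu_name):
    """Report which sections of a client_state.xml string refer to wu_name.

    Assumed layout: <file_info><name>, <workunit><name>, <result><name>/<wu_name>.
    """
    root = ET.fromstring(client_state_xml)

    # file_info sections whose file name contains the workunit name
    file_infos = [
        el.findtext("name", default="").strip()
        for el in root.iter("file_info")
        if wu_name in el.findtext("name", default="")
    ]
    # workunit sections with exactly this name
    workunit_count = sum(
        1
        for el in root.iter("workunit")
        if el.findtext("name", default="").strip() == wu_name
    )
    # result sections that point back at this workunit
    results = [
        el.findtext("name", default="").strip()
        for el in root.iter("result")
        if el.findtext("wu_name", default="").strip() == wu_name
    ]
    return {"file_info": file_infos, "workunit": workunit_count, "result": results}
```

Reading the file into a string and calling summarize_workunit(text, "sulphur_e0yj_000654427") should show whether the 6 file_info sections, the workunit section, and the result section are all still present.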
"The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer

enginerd
Joined: 31 Aug 04
Posts: 13
Credit: 134,268
RAC: 0
Message 19608 - Posted: 24 Jan 2006, 19:41:12 UTC

OK, Thyme Lawn,

thanks for offering to help. I found sulphur_e0yj_000654427.zip and the client_state.xml file, but am unclear about what to do with them. Any help (from you or anybody else) would be greatly appreciated.

-christo

P.S. I also PM'd you on the TP forums.

©2024 cpdn.org