Message boards : Number crunching : Thousands of computation errors
Message board moderation

To post messages, you must log in.

AuthorMessage
sesq

Send message
Joined: 21 Aug 15
Posts: 7
Credit: 12,567,685
RAC: 0
Message 1914 - Posted: 30 Jan 2017, 7:54:03 UTC

About a week ago two of my computers started throwing computation errors instantly upon new WU download. Since it started, some WUs compute fully and others error out instantly with no obvious pattern. Today the errors became constant again. Event log examples:

1/30/2017 12:22:26 AM | Universe@Home | Started download of universe_bh2_160803_62_52_20000_1-999999_445000
1/30/2017 12:22:28 AM | Universe@Home | Finished download of universe_bh2_160803_62_52_20000_1-999999_445000
1/30/2017 12:22:28 AM | Universe@Home | [error] Can't copy projects/universeathome.pl_universe/BHspin2_1_windows_intelx86.exe to slots/1/BHspin2_1_windows_intelx86.exe: Error 5
1/30/2017 12:22:29 AM | Universe@Home | Computation for task universe_bh2_160803_62_52_20000_1-999999_445000_0 finished
1/30/2017 12:22:29 AM | Universe@Home | Output file universe_bh2_160803_62_52_20000_1-999999_445000_0_r1263741599_0 for task universe_bh2_160803_62_52_20000_1-999999_445000_0 absent

Win7x64, AMD FX 8370, 16GB RAM. Not overclocked, not running hot, plenty of empty HD/RAM space and plenty of HD/RAM space allocated to BOINC, reboot doesn't help, project reset/remove-and-add doesn't help, and POGS is currently running fine on six cores. Anyone have any ideas?
ID: 1914 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Yavanius
Avatar

Send message
Joined: 13 May 15
Posts: 87
Credit: 4,320,738
RAC: 0
Message 1926 - Posted: 2 Feb 2017, 5:24:07 UTC - in response to Message 1914.  

It's the radiation from the black holes. It's affecting your system. You'll need to shield your house. :)
ID: 1926 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
mmonnin

Send message
Joined: 2 Jun 16
Posts: 169
Credit: 317,253,046
RAC: 1
Message 1932 - Posted: 3 Feb 2017, 20:13:43 UTC - in response to Message 1914.  

About a week ago two of my computers started throwing computation errors instantly upon new WU download. Since it started, some WUs compute fully and others error out instantly with no obvious pattern. Today the errors became constant again. Event log examples:

1/30/2017 12:22:26 AM | Universe@Home | Started download of universe_bh2_160803_62_52_20000_1-999999_445000
1/30/2017 12:22:28 AM | Universe@Home | Finished download of universe_bh2_160803_62_52_20000_1-999999_445000
1/30/2017 12:22:28 AM | Universe@Home | [error] Can't copy projects/universeathome.pl_universe/BHspin2_1_windows_intelx86.exe to slots/1/BHspin2_1_windows_intelx86.exe: Error 5
1/30/2017 12:22:29 AM | Universe@Home | Computation for task universe_bh2_160803_62_52_20000_1-999999_445000_0 finished
1/30/2017 12:22:29 AM | Universe@Home | Output file universe_bh2_160803_62_52_20000_1-999999_445000_0_r1263741599_0 for task universe_bh2_160803_62_52_20000_1-999999_445000_0 absent

Win7x64, AMD FX 8370, 16GB RAM. Not overclocked, not running hot, plenty of empty HD/RAM space and plenty of HD/RAM space allocated to BOINC, reboot doesn't help, project reset/remove-and-add doesn't help, and POGS is currently running fine on six cores. Anyone have any ideas?


Did resetting the project remove the Universe project folder? I've never reset a project. Seems like this might be a permission and/or access error:
1/30/2017 12:22:28 AM | Universe@Home | [error] Can't copy projects/universeathome.pl_universe/BHspin2_1_windows_intelx86.exe to slots/1/BHspin2_1_windows_intelx86.exe: Error 5
ID: 1932 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
sesq

Send message
Joined: 21 Aug 15
Posts: 7
Credit: 12,567,685
RAC: 0
Message 1937 - Posted: 5 Feb 2017, 9:39:02 UTC - in response to Message 1932.  

Yes, remove/re-add removed the project folder. Permissions problem is an interesting thought, but I checked before and after and there don't appear to be any issues. I've pretty much given up on the project, though I'll let my remaining four machines keep burning until I find something better.
ID: 1937 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Jim1348

Send message
Joined: 28 Feb 15
Posts: 253
Credit: 200,562,581
RAC: 0
Message 1938 - Posted: 5 Feb 2017, 16:01:50 UTC - in response to Message 1937.  

Possibly an anti-virus can cause a problem like that. Have you excluded the BOINC data folder?
I have seen a few access problems due to the write-cache I use also, but never in the BOINC folder; they have been elsewhere.
ID: 1938 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
sesq

Send message
Joined: 21 Aug 15
Posts: 7
Credit: 12,567,685
RAC: 0
Message 1939 - Posted: 6 Feb 2017, 1:40:36 UTC - in response to Message 1938.  
Last modified: 6 Feb 2017, 1:40:54 UTC

Good call. I forgot that the two machines with the problem were running a different virus scanner than the others. Guess it updated without telling me (grumble) and wiped out my whitelist in the process. I'm back to full capacity. Thanks.
ID: 1939 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Message boards : Number crunching : Thousands of computation errors




Copyright © 2024 Copernicus Astronomical Centre of the Polish Academy of Sciences
Project server and website managed by Krzysztof 'krzyszp' Piszczek