Message boards : Number crunching : Problem with BHspin2 v0.02
Message board moderation

To post messages, you must log in.

AuthorMessage
JugNut

Send message
Joined: 11 Mar 15
Posts: 37
Credit: 271,242,973
RAC: 0
Message 2080 - Posted: 28 Mar 2017, 15:53:36 UTC
Last modified: 28 Mar 2017, 16:08:22 UTC

So far I have not had one of the new BHspin2 v0.02 finish. They get to somewhere near the end of the work unit(94-96%) then restart again at 0%

The message logs on all my PC's are full of messages like this..
Task universe_bh2_160803_104_37_20000_1-999999_300000_0 exited with zero status but no 'finished' file
If this happens repeatedly you may need to reset the project.

Has anyone actually completed BHspin2 v0.02 task yet? Cause I sure haven't.
I'm suspending all work here as this is just wasting electricity

A little help please Krzysztof.
ID: 2080 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Krzysztof Piszczek - wspieram ...
Project administrator
Project developer
Project tester
Avatar

Send message
Joined: 4 Feb 15
Posts: 840
Credit: 144,180,465
RAC: 33
Message 2081 - Posted: 28 Mar 2017, 17:28:45 UTC - in response to Message 2080.  

I'm looking for the problem.
Probably you can kill the tasks, I will reproduce it when I sort the problem :(
Apologise, this is probably my fault... :(
Krzysztof 'krzyszp' Piszczek

Member of Radioactive@Home team
My Patreon profile
Universe@Home on YT
ID: 2081 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
fruehwf

Send message
Joined: 5 Jul 16
Posts: 31
Credit: 18,447,833
RAC: 0
Message 2082 - Posted: 28 Mar 2017, 20:45:03 UTC - in response to Message 2081.  

Same Problem here.

After 1.6 to 1.7 % done the workunit resets to 0. On windows and Linux Machines with Univers BHspinv2 0.02
ID: 2082 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Kasim

Send message
Joined: 28 Mar 17
Posts: 1
Credit: 7,000
RAC: 0
Message 2083 - Posted: 28 Mar 2017, 21:53:04 UTC
Last modified: 28 Mar 2017, 21:57:48 UTC

Same problem here on Windows Vista. .Not over 1,5 %
ID: 2083 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Krzysztof Piszczek - wspieram ...
Project administrator
Project developer
Project tester
Avatar

Send message
Joined: 4 Feb 15
Posts: 840
Credit: 144,180,465
RAC: 33
Message 2084 - Posted: 28 Mar 2017, 21:55:03 UTC - in response to Message 2082.  

As I says, my fault, I will cancel those tasks tomorrow...
Krzysztof 'krzyszp' Piszczek

Member of Radioactive@Home team
My Patreon profile
Universe@Home on YT
ID: 2084 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
provost

Send message
Joined: 17 Nov 16
Posts: 1
Credit: 10,923,404
RAC: 0
Message 2089 - Posted: 29 Mar 2017, 17:00:06 UTC

Same problem across various platforms (Windows 7 64bit, Debian Jesse 64bit).

Please at least recall the erroneous tasks so that we don't have to abort them manually - if at all possible. This is quite important for those who run headless workstations.
ID: 2089 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Yavanius
Avatar

Send message
Joined: 13 May 15
Posts: 87
Credit: 4,320,738
RAC: 78
Message 2092 - Posted: 30 Mar 2017, 2:12:27 UTC

Glad I was checking in here after seeing the new version. Got a big batch this morning and still had a big batch when I came home.

I just confirmed none of my WUs finish. They reset to 0 progress after close to 3 minutes.

Windows 10 (64)
ID: 2092 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Yavanius
Avatar

Send message
Joined: 13 May 15
Posts: 87
Credit: 4,320,738
RAC: 78
Message 2093 - Posted: 30 Mar 2017, 2:13:14 UTC

Also, Krys, you updated the other apps on the app list but not BHSpin...

Was Android updated too??
ID: 2093 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Luigi R.

Send message
Joined: 10 Sep 15
Posts: 12
Credit: 20,067,933
RAC: 0
Message 2094 - Posted: 30 Mar 2017, 5:46:55 UTC
Last modified: 30 Mar 2017, 5:47:54 UTC

Same problem here.
This night I switched my host to Universe@Home without knowing the issue. No BH2 task completed.

Here's one of them: http://universeathome.pl/universe/result.php?resultid=21499390
ID: 2094 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Krzysztof Piszczek - wspieram ...
Project administrator
Project developer
Project tester
Avatar

Send message
Joined: 4 Feb 15
Posts: 840
Credit: 144,180,465
RAC: 33
Message 2096 - Posted: 30 Mar 2017, 15:28:24 UTC - in response to Message 2094.  

Please cancel all WU's where BHspin2 application is in version 0.02.
There is major bug in this version.
Krzysztof 'krzyszp' Piszczek

Member of Radioactive@Home team
My Patreon profile
Universe@Home on YT
ID: 2096 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Etienne Guyot

Send message
Joined: 10 Jun 16
Posts: 1
Credit: 322,000
RAC: 0
Message 2097 - Posted: 30 Mar 2017, 17:22:37 UTC - in response to Message 2096.  

Same for me: four tasks continuously running for more than 24 hours.
Following your advice, I have canceled the 4 jobs, but it's not enough: still got some CPU activity.
I had to reset the project, but BOINC complained with the following message:
"30/03/2017 19:01:42 | Universe@Home | [error] Couldn't delete file projects/universeathome.pl_universe/BHspin2_2_windows_x86_64.exe.gzt"
I had to start the task manager and manually kill each task that remained active despite the job cancellation and the project reset...
Then, I reset once again the project to get ride of the last file... Yes! Got it!
I will wait a little before restarting the work for this project...
ID: 2097 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
NotRealName

Send message
Joined: 5 Feb 17
Posts: 6
Credit: 2,135,900
RAC: 0
Message 2099 - Posted: 31 Mar 2017, 22:47:15 UTC

Ooohh... my computing resources are being wasted again :-(.
The tasks look normally in the BOINC Manager (I am on 80% now) but CPU time remains 00:00:00 in BoincView 1.2.5. Est. credits and CPU efficiency is zero as well, est. speed is not showing up. Ultraviolet reionization v0.02 is showing mentioned data without any problem. I hope it helps.
Einstein@home has a separate project for tests (Albert@home). Please, consider making something similar. As for now, whole Universe@home seems to be a test project (testing our patience).
ID: 2099 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
fruehwf

Send message
Joined: 5 Jul 16
Posts: 31
Credit: 18,447,833
RAC: 0
Message 2100 - Posted: 1 Apr 2017, 6:28:55 UTC

It seems that the Project has successfilly backedup.
since the working Version for BHSpin is(0.01) deeper than the last downloaded (0.02) I had to cancel all running tasks , then reset the Project and at last update the project..

Now everything works fine.
But the not finishing Task Problem from Version 0.01 will appear again.
So I will have an eye on the machines.

hth

Franz
ID: 2100 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Jim1348

Send message
Joined: 28 Feb 15
Posts: 253
Credit: 200,562,581
RAC: 0
Message 2101 - Posted: 1 Apr 2017, 9:22:18 UTC - in response to Message 2100.  

Now everything works fine.
But the not finishing Task Problem from Version 0.01 will appear again.
So I will have an eye on the machines.

Thanks for the information. But I think krzyszp should make a formal announcement of where they are. I don't want to start again and find out the hard way. In particular, when they have a fixed version of 0.02 they should let us know.
ID: 2101 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
alex

Send message
Joined: 21 Feb 15
Posts: 64
Credit: 65,733,511
RAC: 5,095
Message 2113 - Posted: 2 Apr 2017, 18:32:18 UTC
Last modified: 2 Apr 2017, 18:35:11 UTC

Indeed, the 2.0.02 all fail.

Edit:
I killed them; BM said 'Abgebrochen durch Benutzer'.
But for a short time, just befor reporting them, BM said 'Berechnungsfehler'. Looks like the calculation error is not reported correct to BM during runtime.
ID: 2113 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Yavanius
Avatar

Send message
Joined: 13 May 15
Posts: 87
Credit: 4,320,738
RAC: 78
Message 2119 - Posted: 5 Apr 2017, 5:54:54 UTC

The error isn't caught by BOINC so it keeps running the app endlessly (or at least until it ages out).
ID: 2119 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Yavanius
Avatar

Send message
Joined: 13 May 15
Posts: 87
Credit: 4,320,738
RAC: 78
Message 2125 - Posted: 6 Apr 2017, 4:07:05 UTC

Oh where, oh where are thou Kryz??

Is it safe to come out of the moon bunkers yet? The Universe awaits us!
ID: 2125 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
QuantumHelos
Avatar

Send message
Joined: 30 Apr 17
Posts: 8
Credit: 64,583
RAC: 0
Message 2180 - Posted: 30 Apr 2017, 18:07:25 UTC

ram usage for the project is in the 7mb range .... could be something to do with memory buffer not being big enough .....

read > boinc http://esa-space.blogspot.com

30/04/2017 17:47:35 | | OpenCL: AMD/ATI GPU 0: AMD Radeon R9 200 Series (driver version 2348.3, device version OpenCL 1.2 AMD-APP (2348.3), 3072MB, 3072MB available, 4178 GFLOPS peak)

30/04/2017 17:47:35 | | Processor: 8 AuthenticAMD AMD FX-8320E Eight-Core Processor [Family 21 Model 2 Stepping 0]
30/04/2017 17:47:35 | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 htt pni ssse3 fma cx16 sse4_1 sse4_2 popcnt aes f16c syscall nx lm avx sse4a osvw xop wdt fma4 topx page1gb rdtscp bmi1

feature set could be optimized and buffer increased to the 32mb range .... or 64mb without this being an issue .... + feature optimized in the compiler >

AMD Platform Optimization - please read for all developers

https://community.amd.com/thread/213045

http://32ipi028l5q82yhj72224m8j.wpengine.netdna-cdn.com/wp-content/uploads/2017/03/GDC2017-Optimizing-For-AMD-Ryzen.pdf

http://www.agner.org/optimize/

http://www.agner.org
http://esa-space.blogspot.com/

boinc optimization > http://esa-space.blogspot.rs/2017/04/boinc.html

T/C/RNG/Entropy Drivers and sources > http://esa-space.blogspot.ru/2017/04/rng-and-random-web.html
ID: 2180 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
wallerby

Send message
Joined: 9 Dec 18
Posts: 1
Credit: 7,333
RAC: 0
Message 3126 - Posted: 10 Dec 2018, 20:09:55 UTC

BHspin2_6_windows_x86_64 is a memory hog.
ID: 3126 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Krzysztof Piszczek - wspieram ...
Project administrator
Project developer
Project tester
Avatar

Send message
Joined: 4 Feb 15
Posts: 840
Credit: 144,180,465
RAC: 33
Message 3129 - Posted: 10 Dec 2018, 23:22:46 UTC - in response to Message 3126.  

BHspin2_6_windows_x86_64 is a memory hog.

Some more info?
Krzysztof 'krzyszp' Piszczek

Member of Radioactive@Home team
My Patreon profile
Universe@Home on YT
ID: 3129 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Message boards : Number crunching : Problem with BHspin2 v0.02




Copyright © 2024 Copernicus Astronomical Centre of the Polish Academy of Sciences
Project server and website managed by Krzysztof 'krzyszp' Piszczek