21) Message boards : Number crunching : extreme long wu's (Message 2310)
Posted 28 Jul 2017 by JugNut
Post:
Hey krzys
I've just found a bunch of these bad WUs across all my PCs, what a mess.
I found out the hard way that even if I abort them they still don't die. They have to be killed manually from Task Manager. If you don't kill them manually after aborting them, BOINC thinks they're gone and assigns new work to the slot that is still loaded & running. Not good!!
Will kill this WU now, this is the fifth in the last few hours :(

The only way I can tell they're locked up & not just long running is by keeping an eye on the checkpointing. The WU below has been running for 15hrs 23mins and hasn't checkpointed for the last 13hrs. At least now I know what to look for & how to treat them. Aggressively...
As usual the stderr is empty, but if you're interested I kept a copy of the slot directory before I aborted it. If you would like it, just ask.
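
The checkpoint check described above can be sketched in a few lines of Python. This is only an illustration, not anything BOINC ships: the function name, its inputs (elapsed run time and the run time at the last checkpoint, both in seconds) and the 2-hour threshold are all assumptions, and you would have to read the two values off the task properties yourself.

```python
def is_probably_stalled(elapsed_s: float,
                        last_checkpoint_s: float,
                        max_gap_s: float = 2 * 3600) -> bool:
    """Return True when the task has run longer than max_gap_s
    since its last checkpoint.

    elapsed_s          -- total run time of the task, in seconds
    last_checkpoint_s  -- run time at which the last checkpoint was written
    max_gap_s          -- hypothetical threshold; 2 hours is an arbitrary choice
    """
    if elapsed_s < 0 or last_checkpoint_s < 0:
        raise ValueError("times must be non-negative")
    if last_checkpoint_s > elapsed_s:
        raise ValueError("checkpoint cannot be later than elapsed time")
    return (elapsed_s - last_checkpoint_s) > max_gap_s


# The WU from this post: 15hrs 23mins elapsed, no checkpoint for 13hrs.
elapsed = 15 * 3600 + 23 * 60
last_checkpoint = elapsed - 13 * 3600
print(is_probably_stalled(elapsed, last_checkpoint))
```

A healthy long-running task keeps checkpointing, so the gap stays small no matter how large the elapsed time gets; only the gap is compared against the threshold.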

http://universeathome.pl/universe/results.php?hostid=1679&offset=0&show_names=0&state=6&appid=

Contents of error.dat3.....
error: bondi() accreted mass (6.024443) larger than envelope mass (5.618883) (60413)
error: bondi() accreted mass (7.907802) larger than envelope mass (7.402827) (144779)
error: bondi() accreted mass (5.415336) larger than envelope mass (2.590284) (146410)
error: bondi() accreted mass (9.456944) larger than envelope mass (9.022832) (258705)
error: bondi() accreted mass (9.386890) larger than envelope mass (5.976198) (356863)
error: bondi() accreted mass (11.491090) larger than envelope mass (7.780758) (361139)
error: bondi() accreted mass (6.318919) larger than envelope mass (5.818937) (438696)
error: bondi() accreted mass (5.645096) larger than envelope mass (5.213384) (445394)
error: bondi() accreted mass (5.773027) larger than envelope mass (5.230975) (671283)
error: bondi() accreted mass (12.410371) larger than envelope mass (8.284976) (693333)
error: bondi() accreted mass (8.904030) larger than envelope mass (6.786716) (702075)
error: bondi() accreted mass (6.480212) larger than envelope mass (6.192082) (750103)
error: bondi() accreted mass (5.496527) larger than envelope mass (4.505465) (818009)
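
If you end up with several of these files, a small Python sketch like the one below can pull the numbers out for comparison. It assumes every line has exactly the layout shown above; the meaning of the last number in brackets isn't stated anywhere in the file, so the code just carries it along as an opaque "tag".

```python
import re

# Matches lines of the form shown in the error file above.
LINE_RE = re.compile(
    r"error: bondi\(\) accreted mass \(([\d.]+)\) "
    r"larger than envelope mass \(([\d.]+)\) \((\d+)\)"
)


def parse_bondi_errors(text):
    """Return a list of (tag, accreted, envelope, excess) tuples.

    'tag' is the trailing bracketed number; its meaning is unknown here.
    'excess' is accreted mass minus envelope mass.
    """
    rows = []
    for line in text.splitlines():
        match = LINE_RE.search(line)
        if match:
            accreted = float(match.group(1))
            envelope = float(match.group(2))
            tag = int(match.group(3))
            rows.append((tag, accreted, envelope, accreted - envelope))
    return rows


sample = """error: bondi() accreted mass (6.024443) larger than envelope mass (5.618883) (60413)
error: bondi() accreted mass (5.415336) larger than envelope mass (2.590284) (146410)"""

for tag, accreted, envelope, excess in parse_bondi_errors(sample):
    print(f"{tag}: accreted {accreted:.6f} exceeds envelope {envelope:.6f} by {excess:.6f}")
```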

EDIT: Just tried to kill another one, but this time it killed my PC instead (blue screened).
22) Message boards : Number crunching : Problem with BHspin2 v0.02 (Message 2080)
Posted 28 Mar 2017 by JugNut
Post:
So far I have not had a single one of the new BHspin2 v0.02 tasks finish. They get somewhere near the end of the work unit (94-96%), then restart at 0%.

The message logs on all my PCs are full of messages like this:
Task universe_bh2_160803_104_37_20000_1-999999_300000_0 exited with zero status but no 'finished' file
If this happens repeatedly you may need to reset the project.

Has anyone actually completed a BHspin2 v0.02 task yet? Cause I sure haven't.
I'm suspending all work here as this is just wasting electricity.

A little help please Krzysztof.
23) Message boards : Number crunching : Erroneous validations or incorrect CPU time? Part 2 (Message 2078)
Posted 28 Mar 2017 by JugNut
Post:
I've been wondering the same thing. If I go to any of my PCs, click on a random valid BHspin v2 work unit and look at who the wingman is, there's a 98% chance that the wingman will be an anonymous host running Linux 3.2.0-56-virtual with either an i3, i5 or i7, always using just one core in a VM and usually with an extremely fast run time (try it yourself). Is this possible? Well, anything's possible, but what would be the chances?
24) Message boards : Number crunching : What do you guys make of this? (Message 1602)
Posted 30 Sep 2016 by JugNut
Post:
Thank you Krzysztof for your speedy reply & fix . Much appreciated.

All is well in the Universe once more ;)


@ Stiller Cruncher: Good luck in finding a fix.

@Tex1954: What happens if you tick the checkbox in Boinc Manager preferences called "Skip image file verification"? Of course this is usually not a good idea and a proper fix would be best, but in your case maybe it's worth a shot? It's just a last resort if no fix is found. Try it on a few tasks and see what happens.
25) Message boards : Number crunching : What do you guys make of this? (Message 1597)
Posted 29 Sep 2016 by JugNut
Post:
There appears to be a special magic type of ARM device going around. It can do BHspin v2 tasks in 1 - 2 seconds, and every task validates against hosts that take hundreds or thousands of times longer to complete :(

http://universeathome.pl/universe/results.php?hostid=38724&offset=0&show_names=0&state=4&appid=

What do you guys make of this?

If nothing else, the validator is not doing its job..
26) Message boards : Number crunching : application source code... (Message 723)
Posted 9 Nov 2015 by JugNut
Post:
Source code is closed at the moment...
I have spoken with its owner about opening/publishing it, but I still don't have permission to do this...


What a pity!


+1
27) Message boards : Cafe : Points (Message 306)
Posted 4 May 2015 by JugNut
Post:
I know that points in BOINC are a topic of very hot discussion, but I have an impression that Universe@Home gives too many points for work...


I don't think so.
IMO, Point inflation is something from the past. Crossproject comparison does not make sense anymore since the upcoming of GPU Apps, Bitcoin Utopia or some very "generous" projects.

Crunchers are only able to compare themselves in a reasonable way with others within the same project. F.e. comparing the Points of MindModeling with those of Einstein@home just makes no sense.

From this point of view, all you have to do to be "fair" is, to keep the U@H credits per App in reasonable relations within U@H.

In the beginning of U@H, you would have been free to choose any amount of point per wu - the people with much production would be on the top, the guys with less production below.
Changing the amount of points NOW would mean touching a running credit system, where very many people have already collected points in the times of higher credit ratios, while others who joined later won't be so lucky.


Well said. Spot on...

The project is finally working great & crunchers are happy, the forums are almost empty. Why would you want to change anything and risk alienating crunchers that have stuck with you? It makes no sense.

Put it this way: have you ever heard anyone say "I'm leaving this project because it gives good credit"?

Just to make a point, this is a link to a guy's host at asteroids@home (as close as you can get to a direct peer): http://asteroidsathome.net/boinc/results.php?hostid=37104&offset=0&show_names=0&state=4&appid=
His computer is 255th fastest. They're getting almost twice what we get here. If you look for Haswell based CPUs that use the optimised AVX app, they're even 20% faster again for the same credit.

I guess it all depends on what you call too much? As there are clearly some that give much more. From my point of view the credits here are perfect.
28) Message boards : News : The brief summary of the scientific article (Message 268)
Posted 16 Apr 2015 by JugNut
Post:
Very cool stuff indeed. Well done Grzegorz & team.
29) Message boards : News : The first scientific outcome! (Message 253)
Posted 2 Apr 2015 by JugNut
Post:
Congratulations Grzegorz & team. Good stuff.
30) Message boards : Number crunching : WU deadlines (Message 219)
Posted 31 Mar 2015 by JugNut
Post:
Remember folks, anything with a "9" in the workunit name is to be deleted, or will eventually be deleted for you :( as mentioned on the front page.

EG: universe_xray_9_20000_300001-350000_304187_1
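
If you have a long task list to go through, the rule can be sketched in Python as below. Big assumption here: going by the example name, the "9" is taken to be a whole underscore-separated field, not just any digit 9 somewhere in the name (otherwise ranges like 1-999999 would match too). The function name is made up for illustration.

```python
def has_nine_field(wu_name: str) -> bool:
    """Return True if any underscore-separated field of the workunit
    name is exactly "9" (assumed reading of the rule above)."""
    return "9" in wu_name.split("_")


print(has_nine_field("universe_xray_9_20000_300001-350000_304187_1"))
```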
31) Message boards : Number crunching : Universe@Home | Scheduler request completed: got 0 new tasks (Message 206)
Posted 26 Mar 2015 by JugNut
Post:
For me work has been flowing very nicely for the last few days :)

Well done Krzysztof.
32) Message boards : Number crunching : Universe@Home | Scheduler request completed: got 0 new tasks (Message 191)
Posted 20 Mar 2015 by JugNut
Post:
Yea, I have plenty of work also, but unfortunately work has been patchy all day & only comes in bursts. Having a bigger cache does help even things out though & so far has prevented me from running dry.

I'll keep looking..
33) Message boards : Number crunching : Universe@Home | Scheduler request completed: got 0 new tasks (Message 184)
Posted 20 Mar 2015 by JugNut
Post:
Well I woke up this morning with piles of work..

I'll check how it progresses throughout the day, but so far, fingers crossed, things are looking up :)

Thanks Krzysztof
34) Message boards : Number crunching : Universe@Home | Scheduler request completed: got 0 new tasks (Message 180)
Posted 19 Mar 2015 by JugNut
Post:
I got a burst of work & it seemed to start flowing properly again for a while, but now it's back to one at a time every 20th or so request for work. I'm stumped. Time for bed (down under), we'll see what tomorrow brings.
35) Message boards : Number crunching : Universe@Home | Scheduler request completed: got 0 new tasks (Message 179)
Posted 19 Mar 2015 by JugNut
Post:
Mostly I can only get 1 WU at a time as well. The only way I've been able to get enough work is by constantly hammering the servers, and even then I get 1 work unit out of about 20 or more attempts. If I just leave BOINC alone to do its own thing, no matter how much I need work the scheduler will only ask for work a few times, then it just gives up. After that it won't ask again for another hour or so, or until work is uploaded, & then it still gets nothing. Eventually I run out of work. Very frustrating & far from optimum for me or the server.

Occasionally I can get a burst of work that fills the buffers, but then the symptoms return & it's only a matter of time before, without intervention, I'd be empty again.
36) Message boards : Number crunching : Universe@Home | Scheduler request completed: got 0 new tasks (Message 175)
Posted 19 Mar 2015 by JugNut
Post:
Same here Daniel, it seems to be a recurring problem.
37) Message boards : Number crunching : Server is dry - or what? (Message 155)
Posted 11 Mar 2015 by JugNut
Post:
I just attached my last host but now there's no work. It seems I'm in the same boat as the original poster: the server says there's plenty of work but I can't get any.






Copyright © 2024 Copernicus Astronomical Centre of the Polish Academy of Sciences
Project server and website managed by Krzysztof 'krzyszp' Piszczek