extreme long wu's
Author | Message |
---|---|
Joined: 4 Feb 15 Posts: 847 Credit: 144,180,465 RAC: 0 |
"Are the new apps released now, it has been a week?" Not yet. I had an accident and the motherboard in my main machine died... I just got a new one, but it will take me a few days to recover all my current jobs :( Krzysztof 'krzyszp' Piszczek Member of Radioactive@Home team My Patreon profile Universe@Home on YT |
Joined: 1 Oct 16 Posts: 32 Credit: 268,033 RAC: 0 |
Fair enough. I'll look in again next weekend. |
Joined: 23 Feb 15 Posts: 2 Credit: 2,107,545 RAC: 0 |
Today I canceled the result http://universeathome.pl/universe/result.php?resultid=20240557 after more than 10 days of calculation without it reaching 50% progress. Matthias |
Joined: 1 Oct 16 Posts: 32 Credit: 268,033 RAC: 0 |
How are you getting on Chris, are we ready to go on the new versions now? |
Joined: 4 Feb 15 Posts: 847 Credit: 144,180,465 RAC: 0 |
"How are you getting on Chris, are we ready to go on the new versions now?" Early next week, I think. |
Joined: 29 Aug 15 Posts: 3 Credit: 341,000 RAC: 0 |
Had some WUs in this category today, now aborted. Examples: universe_bh2_160803_85_3_20000_1-999999_480000_1 and universe_bh2_160803_85_1_20000_1-999999_800000_0. Both ran for over 2.5 days with BOINC Manager reporting 3% complete. |
Joined: 1 Apr 15 Posts: 49 Credit: 30,557,740 RAC: 0 |
http://universeathome.pl/universe/results.php?hostid=47318 The same with an Android TV box: runtimes went from 8-10 hours to 35-40 hours. At the moment, I prefer to use it for World Community Grid and POGS. ASUS X570 E-Gaming AMD Ryzen 9 3950X, 16 core / 32 thread 4.4 GHz AMD Radeon Sapphire RX 480 4GB Nitro+ Nvidia GTX 1080 Ti Gaming X Trio 4x16 GB Corsair Vengeance RGB 3466 MHz |
Joined: 1 Oct 16 Posts: 32 Credit: 268,033 RAC: 0 |
So, another week, and it seems problems still arise, so I assume the fix has not been released. Update? *** Off topic *** Something I noticed when beginning to post this: I did not realise I was already logged in, so I went to the login page and entered my name and password, but on trying to submit the form I got a 404. Trivial. |
Joined: 21 Feb 15 Posts: 53 Credit: 1,385,888 RAC: 0 |
It is utterly unacceptable that this project still continues to waste CPU cycles, with a known bad app or batch of tasks. Literally, unacceptable. Wasting crunching power!! I may never turn "No New Tasks" off, because of this non-responsiveness! :( krzyszp, can't you do something to stop the bleeding, even?? |
Joined: 4 Feb 15 Posts: 847 Credit: 144,180,465 RAC: 0 |
We will upgrade the application shortly, once we sort out some compilation problems. The very long units are still a mystery, as it does not happen very often and only on some computers (e.g. on one of my machines it never happens). Anyway, the upgrade changes quite a big part of the application, so the problem should go away soon... |
Joined: 1 Oct 16 Posts: 32 Credit: 268,033 RAC: 0 |
I think you should suspend work unit production until this is fixed. There are people's machines wasting days of CPU time, which obviously could be doing useful work. You are not winning friends by continuing the way you are. |
Joined: 21 Feb 15 Posts: 53 Credit: 1,385,888 RAC: 0 |
Unattended machines could be wasting more than just days, on this issue! Admin, please consider both of these: - Stopping sending work for tasks that could end up in a never-ending state. - Server-side-aborting tasks that could end up in a never-ending state. That's what I'd do. Wasting CPU cycles is equivalent to stealing CPU cycles from other projects. |
Joined: 28 Feb 15 Posts: 253 Credit: 200,562,581 RAC: 0 |
If it is a short time until the new application is ready, and you need the results of the present application, then I would keep it going. It is a relatively small problem for me. If other people have machines that are more susceptible to it, then they can turn them off as they desire. But it makes no sense to prevent the completion of a scientific study for a few bad work units. All projects have them, and the crunchers can choose other projects anytime they want. |
Joined: 21 Feb 15 Posts: 53 Credit: 1,385,888 RAC: 0 |
To be clear ... Tasks that error out eventually, are a pain to deal with, but a non-attended setup will handle them gracefully enough. Tasks that run continuously without end, are a pain to deal with, but a non-attended setup will end up crunching indefinitely, wasting electricity and wasting resources indefinitely. I speak loudly, because I think we're dealing with the 2nd case here. It sounds like you are saying "Oh, it's okay to render machines and CPUs completely useless and have them waste energy, if we make progress overall." ... and that's a very very bad idea. |
Joined: 28 Feb 15 Posts: 253 Credit: 200,562,581 RAC: 0 |
If my machines were susceptible to that problem to the extent that I found it unacceptable, I would choose another project. You seem to be asking them to cancel the project, or some portion thereof, for a problem that affects some people more than others and they can't find the solution for. Are you expecting them to find the bad work units in advance? If they could do that, they could fix them. |
Joined: 1 Oct 16 Posts: 32 Credit: 268,033 RAC: 0 |
What he has said is that there is a fix, but for some reason he can't compile it. Sounds odd to me, but then, I've only been a software engineer for 30 years, what would I know. Nobody has said stop the project. What has been said, quite rightly, is that he seems happy to live with the situation while some of his crunchers are not. He fixes the problem or loses crunchers, his choice. |
Joined: 21 Feb 15 Posts: 53 Credit: 1,385,888 RAC: 0 |
"If my machines were susceptible to that problem to the extent that I found it unacceptable, I would choose another project. You seem to be asking them to cancel the project, or some portion thereof, for a problem that affects some people more than others and they can't find the solution for." I am attached to every possible project, about 60 of them. I routinely do work for about 15 of them. I'm also one of the main BOINC Alpha testers. What I am asking for is not unreasonable. The request is: if a project has a situation where a task can get stuck in the worst possible state of running indefinitely (100% waste), the project should do everything in its power to stop the bleeding, including possibly taking the app offline or cancelling affected batches of tasks. It has happened to other projects before, and they have responded correctly. I'm hoping for a correct response from this project. In the meantime, I'm lucky I don't have unattended setups, and I easily set No New Tasks on all 4 of my PCs. |
Joined: 4 Feb 15 Posts: 847 Credit: 144,180,465 RAC: 0 |
"including possibly taking the app offline or cancelling affected batches of tasks." It's never a whole "batch" of tasks: it is always just a few tasks (14 WUs in the worst batch) in a batch of around 40,000, and they mostly calculate correctly on the wingman's machine. This is why it is so difficult to find an answer. Even when I manually ran those tasks on my machine I didn't get any answers, because they finished properly... |
Joined: 21 Feb 15 Posts: 53 Credit: 1,385,888 RAC: 0 |
If there is anything you'd like me to test or try, tell me what to do and I'll do it. I want it solved, and am willing to try things for you. |
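
As a rough illustration of the runtimes reported in this thread (2.5 days at 3% complete; more than 10 days at under 50%), the Python sketch below extrapolates a task's total runtime from its elapsed time and reported progress, and flags tasks that look stalled. The function names, the `normal_days` baseline, and the `factor` threshold are assumptions made for illustration only; none of this is part of BOINC or of the Universe@Home application, and linear extrapolation is only a crude estimate.

```python
def projected_total_days(elapsed_days: float, fraction_done: float) -> float:
    """Linearly extrapolate total runtime from elapsed time and progress."""
    if not 0.0 < fraction_done <= 1.0:
        raise ValueError("fraction_done must be in (0, 1]")
    return elapsed_days / fraction_done


def looks_stalled(elapsed_days: float, fraction_done: float,
                  normal_days: float, factor: float = 10.0) -> bool:
    """Flag a task whose projected runtime exceeds factor * normal_days.

    normal_days and factor are hypothetical tuning values, not project settings.
    """
    projected = projected_total_days(elapsed_days, fraction_done)
    return projected > factor * normal_days


if __name__ == "__main__":
    # Figures taken from posts in this thread.
    print(f"{projected_total_days(2.5, 0.03):.1f} days projected")
    print(f"{projected_total_days(10.0, 0.5):.1f} days projected (lower bound)")
    # Assumed baseline of 0.4 days (roughly 10 hours) for a normal task.
    print(looks_stalled(2.5, 0.03, normal_days=0.4))
```

A check like this could only suggest which tasks to abort by hand; it cannot tell a genuinely long work unit from a stuck one.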