Work Units cancelled

Author Message
Profile krzyszp
Project administrator
Project developer
Project tester

Joined: 4 Feb 15
Posts: 393
Credit: 13,669,033
RAC: 4,964
Message 217 - Posted: 31 Mar 2015, 13:13:07 UTC

I had to cancel the whole '9' series of work units on the server.
Please cancel all WUs with '9' as the first number in the WU name.

Apologies for the inconvenience.
____________
Krzysztof 'krzyszp' Piszczek

Member of Radioactive@Home project team
Radioactive Android Map
Tsu

TheHoosh

Joined: 6 Mar 15
Posts: 12
Credit: 5,083,869
RAC: 0
Message 218 - Posted: 31 Mar 2015, 14:23:32 UTC - in response to Message 217.
Last modified: 31 Mar 2015, 14:23:55 UTC

Thanks for letting us know! I was already wondering why they were running for 22+ hours and were still not done.

Profile Euphoriabuzz

Joined: 22 Feb 15
Posts: 2
Credit: 235,544
RAC: 686
Message 220 - Posted: 31 Mar 2015, 16:15:04 UTC - in response to Message 217.

Thanks for the continued updates

Vlaamse Leeuw

Joined: 4 Feb 15
Posts: 5
Credit: 218,889
RAC: 0
Message 222 - Posted: 31 Mar 2015, 16:47:42 UTC

Thanks for the update

Profile krzyszp
Project administrator
Project developer
Project tester

Joined: 4 Feb 15
Posts: 393
Credit: 13,669,033
RAC: 4,964
Message 223 - Posted: 31 Mar 2015, 16:55:17 UTC - in response to Message 222.

I have released two new small batches of WUs (10 and 11).
If these batches run too long again, please give me a shout on the forum...
____________
Krzysztof 'krzyszp' Piszczek

Member of Radioactive@Home project team
Radioactive Android Map
Tsu

Profile Sebastian M. Bobrecki
Volunteer tester

Joined: 4 Feb 15
Posts: 14
Credit: 133,059,924
RAC: 822
Message 225 - Posted: 31 Mar 2015, 18:36:26 UTC - in response to Message 223.

I have some tasks from the 10 series, and they look the same: 0% progress after ~40 minutes.

Profile krzyszp
Project administrator
Project developer
Project tester

Joined: 4 Feb 15
Posts: 393
Credit: 13,669,033
RAC: 4,964
Message 226 - Posted: 31 Mar 2015, 18:40:44 UTC - in response to Message 225.
Last modified: 31 Mar 2015, 18:44:30 UTC

I have some tasks from the 10 series, and they look the same: 0% progress after ~40 minutes.

Are none of them working properly?

Edit:

OK, I have cancelled both batches until we solve the problem. There is no point in wasting your power...
____________
Krzysztof 'krzyszp' Piszczek

Member of Radioactive@Home project team
Radioactive Android Map
Tsu

Profile Sebastian M. Bobrecki
Volunteer tester

Joined: 4 Feb 15
Posts: 14
Credit: 133,059,924
RAC: 822
Message 227 - Posted: 31 Mar 2015, 18:46:50 UTC - in response to Message 226.

Are none of them working properly?
I checked on three hosts, and all of them show the same behavior.

Vlaamse Leeuw

Joined: 4 Feb 15
Posts: 5
Credit: 218,889
RAC: 0
Message 228 - Posted: 31 Mar 2015, 18:55:04 UTC
Last modified: 31 Mar 2015, 19:00:00 UTC

Thanks for the notice

I just received some WUs (series 8). They seem to work fine.

Profile Ananas

Joined: 26 Mar 15
Posts: 52
Credit: 1,737,270
RAC: 0
Message 229 - Posted: 31 Mar 2015, 19:00:43 UTC - in response to Message 217.

I'm not sure if this information helps: after 15+ hours, the _9_ series had checkpoint files that consisted of a single line containing only the character "0" (0x30, not 0x00) followed by CRLF.
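
For anyone who wants to check their own checkpoint files for the same pattern, here is a minimal sketch. The function name `is_stuck_checkpoint` is hypothetical, and the location and name of the checkpoint file are not given in this thread, so the path has to be supplied by hand. The sketch also accepts a bare LF line ending, which is an assumption for non-Windows hosts.

```python
from pathlib import Path


def is_stuck_checkpoint(path):
    """Return True if the file holds only the ASCII digit "0" plus a line ending.

    Hypothetical helper: it only mirrors the pattern described above
    (a single "0", byte 0x30, followed by CRLF). A bare LF ending is
    accepted as well, which is an assumption for non-Windows hosts.
    """
    data = Path(path).read_bytes()
    return data in (b"0\r\n", b"0\n")
```

Call it with the path to a checkpoint file in the task's slot directory; a `True` result after many hours of run time would match the symptom reported here.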

Profile bcavnaugh

Joined: 28 Mar 15
Posts: 3
Credit: 1,283,722
RAC: 0
Message 230 - Posted: 31 Mar 2015, 19:21:13 UTC
Last modified: 31 Mar 2015, 20:19:02 UTC

After 12 hours, a restart, and starting over from zero, I killed the series 9 tasks.

Now I have some series 8 tasks running. I have my CPU set to run only 7 CPU tasks, and there are 14 tasks running and taking 100% CPU usage.
What is going on?
BOINC Version is 7.4.42
Running Tasks

Profile krzyszp
Project administrator
Project developer
Project tester

Joined: 4 Feb 15
Posts: 393
Credit: 13,669,033
RAC: 4,964
Message 231 - Posted: 31 Mar 2015, 19:38:29 UTC - in response to Message 230.

I have no idea...
Your host shows 7 tasks in progress and has 12 threads, so Universe should take only 7 threads. If it takes more, then something is wrong with your manager (Universe is not an MT application, and can't use more than one thread per task).
____________
Krzysztof 'krzyszp' Piszczek

Member of Radioactive@Home project team
Radioactive Android Map
Tsu

Profile Nemrod

Joined: 21 Feb 15
Posts: 11
Credit: 1,376,238
RAC: 2,035
Message 232 - Posted: 31 Mar 2015, 19:45:55 UTC - in response to Message 231.

Every time I have to restart my computer, WUs from the 10 series start again from zero (I have the same problem with the 9 series).

Profile krzyszp
Project administrator
Project developer
Project tester

Joined: 4 Feb 15
Posts: 393
Credit: 13,669,033
RAC: 4,964
Message 233 - Posted: 31 Mar 2015, 19:57:56 UTC - in response to Message 232.

Do you mean that checkpoints are not working properly?
I have checked on a Linux machine (works fine), but at the moment I can't check on Windows.
____________
Krzysztof 'krzyszp' Piszczek

Member of Radioactive@Home project team
Radioactive Android Map
Tsu

Profile Ananas

Joined: 26 Mar 15
Posts: 52
Credit: 1,737,270
RAC: 0
Message 234 - Posted: 31 Mar 2015, 19:59:32 UTC - in response to Message 231.
Last modified: 31 Mar 2015, 20:07:18 UTC

I have no idea....

BOINC's "suspend" doesn't work on your application (p.s.: on Windows; not sure about *ix): suspended tasks keep running. So if you have 7 tasks running and the host goes into panic mode and starts 7 different tasks with an earlier deadline, you will end up with 14 active tasks (BOINC doesn't notice that and marks the paused ones as "suspended" in the GUI).

The same problem exists at FiND@Home, still unsolved, so if you find a solution, maybe you could share it with them. Unfortunately they have not created their application properly, so the executable doesn't contain the BOINC API version string (I suspected that it might be a version issue but could not verify it).

p.s.: This behaviour might have a bad side effect: when a workunit ends while it is supposed to sleep, it reports an error "result file present too long" (the wording is from my memory, might be slightly different), as BOINC doesn't pick up the result immediately (it doesn't watch suspended applications).
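
The arithmetic behind the 14 active tasks can be written out as a toy model. This is not BOINC code; the function and parameter names are hypothetical and only restate the explanation above: tasks that the client has marked as suspended keep using CPU if the application does not honor the suspend request.

```python
def active_process_count(running, preempted, suspend_honored):
    """Toy model of how many task processes actually use CPU.

    running:         tasks the client is currently running
    preempted:       tasks the client has marked as suspended
    suspend_honored: whether the application really pauses when suspended
    """
    # Preempted tasks only stop consuming CPU if suspend is honored.
    still_running = 0 if suspend_honored else preempted
    return running + still_running


print(active_process_count(running=7, preempted=7, suspend_honored=False))  # 14
print(active_process_count(running=7, preempted=7, suspend_honored=True))   # 7
```

With 7 tasks allowed and 7 preempted tasks that never pause, the count matches the 14 processes reported earlier in the thread.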

Profile bcavnaugh

Joined: 28 Mar 15
Posts: 3
Credit: 1,283,722
RAC: 0
Message 235 - Posted: 31 Mar 2015, 20:25:41 UTC - in response to Message 230.

After 12 hours, a restart, and starting over from zero, I killed the series 9 tasks.

Now I have some series 8 tasks running. I have my CPU set to run only 7 CPU tasks, and there are 14 tasks running and taking 100% CPU usage.
What is going on?
BOINC Version is 7.4.42
Running Tasks



I restarted the computer and now only 7 tasks are running.
Note that I do not keep tasks in memory, and after I exited the BOINC client, 6 tasks were still running in Task Manager.

Profile DoctorNow

Joined: 21 Feb 15
Posts: 9
Credit: 814,658
RAC: 32
Message 239 - Posted: 1 Apr 2015, 5:48:29 UTC
Last modified: 1 Apr 2015, 5:50:28 UTC

What is going on here?
I just lost about 50 hours of crunching time because the server cancelled all my workunits. But these weren't from series 9, they were from 10 and nearly finished...
____________
Life is Science, and Science rules. To the universe and beyond
Proud member of BOINC@Heidelberg
My BOINC-Stats

Profile Ananas

Joined: 26 Mar 15
Posts: 52
Credit: 1,737,270
RAC: 0
Message 244 - Posted: 1 Apr 2015, 15:25:07 UTC - in response to Message 239.

What is going on here? ...

Explanation in posting 226 (earlier in this thread).

Profile Blurf

Joined: 25 Feb 15
Posts: 4
Credit: 81,063
RAC: 0
Message 245 - Posted: 1 Apr 2015, 20:44:18 UTC

Any estimate of when new work will arrive? Our team designated Universe to be the Project of the Month. :)

Profile krzyszp
Project administrator
Project developer
Project tester

Joined: 4 Feb 15
Posts: 393
Credit: 13,669,033
RAC: 4,964
Message 251 - Posted: 2 Apr 2015, 9:46:40 UTC - in response to Message 245.

Any estimate of when new work will arrive? Our team designated Universe to be the Project of the Month. :)

I think it will take a few days...
____________
Krzysztof 'krzyszp' Piszczek

Member of Radioactive@Home project team
Radioactive Android Map
Tsu
