Message boards : Number crunching : "Aborted by project" and "Cancelled by server"
Message board moderation

To post messages, you must log in.

AuthorMessage
Jan Henrik
Avatar

Send message
Joined: 22 Mar 16
Posts: 13
Credit: 1,113,528,333
RAC: 940
Message 5978 - Posted: 6 Dec 2022, 13:16:30 UTC

Hello Project,

Occasionally I have 1 or 2 "Aborted by project". It looks like it's increasing recently and yesterday I got 11 "Aborted by project" on just 1 machine.

What does that actually mean? Is that the same as the 21 "Cancelled by server" I can see on my task lists?

Greetings, Happy Holiday Season and good luck with the new server

JH
______________
"less than a pixel"
ID: 5978 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Keith Myers
Avatar

Send message
Joined: 10 May 20
Posts: 308
Credit: 4,733,484,700
RAC: 229,771
Message 5979 - Posted: 6 Dec 2022, 17:51:38 UTC

I believe it is something different than the standard cancelled by server which is just when the task sent out is no longer needed because a quorum result has already been achieved.

I believe that "aborted by project" is when the files on the host aren't the same as what the server believes they should be. Probably caused by communication errors or lags when sending you tasks. Same kind of thing when you get sent "lost tasks" because your client_state.xml file does not reflect what the server thinks it sent you.

I believe the message is just the "cleaning up" of your host by the server. But you probably should check out that host and see if something else is amiss.

I have never received a "aborted by project" message at Universe before. Have at other projects at earlier times though.

A proud member of the OFA (Old Farts Association)
ID: 5979 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Jan Henrik
Avatar

Send message
Joined: 22 Mar 16
Posts: 13
Credit: 1,113,528,333
RAC: 940
Message 5981 - Posted: 10 Dec 2022, 8:01:55 UTC - in response to Message 5979.  

thanks for your input

I could have answered the second question myself if I had looked at the tasks in detail[never did this before].

compared the task/WU ids and they all are the same:

on the client side it's labeled "Aborted by project" on the web account list under "errors" it's labeled as "cancelled by server"

and if one looks at the details is says this:

Server state Over
Outcome Computation error
Client state Cancelled by server
Exit status 202 (0x000000CA) EXIT_ABORTED_BY_PROJECT
. . .
Run time
CPU time
Validate state Invalid
____

got no answer from the project
so my humble guess is that the sever cleans tasks that were in my cache as "ready to start" and hadn't started yet since there is no Run time or CPU time . . .
. . . so I know now that they are the same but don't really know why . . .
______________
"less than a pixel"
ID: 5981 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Message boards : Number crunching : "Aborted by project" and "Cancelled by server"




Copyright © 2024 Copernicus Astronomical Centre of the Polish Academy of Sciences
Project server and website managed by Krzysztof 'krzyszp' Piszczek