1) Message boards : Number crunching : "Aborted by project" and "Cancelled by server" (Message 5981)
Posted 10 Dec 2022 by Jan Henrik
Post:
thanks for your input

I could have answered the second question myself if I had looked at the tasks in detail [never did this before].

compared the task/WU ids and they all are the same:

on the client side it's labeled "Aborted by project"; on the web account list under "errors" it's labeled "cancelled by server"

and if one looks at the details it says this:

Server state Over
Outcome Computation error
Client state Cancelled by server
Exit status 202 (0x000000CA) EXIT_ABORTED_BY_PROJECT
. . .
Run time
CPU time
Validate state Invalid
____

got no answer from the project
so my humble guess is that the server cleans up tasks that were in my cache as "ready to start" and hadn't started yet, since there is no Run time or CPU time . . .
. . . so I know now that they are the same but don't really know why . . .
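
The guess above can be checked mechanically. This is a minimal Python sketch, assuming the task details are pasted as plain text exactly as in this post. The helper names `parse_task_details` and `never_started` are made up for illustration and are not part of BOINC.

```python
# Field labels as they appear in the task details quoted in the post.
FIELDS = [
    "Server state", "Outcome", "Client state", "Exit status",
    "Run time", "CPU time", "Validate state",
]

def parse_task_details(text):
    """Map each known field label to whatever follows it on the same line."""
    details = {}
    for line in text.splitlines():
        line = line.strip()
        for name in FIELDS:
            if line.startswith(name):
                details[name] = line[len(name):].strip()
                break
    return details

def never_started(details):
    """True when both Run time and CPU time are empty (nothing was computed)."""
    return not details.get("Run time") and not details.get("CPU time")

# Sample copied from the post; the ". . ." line is simply ignored.
SAMPLE = """Server state Over
Outcome Computation error
Client state Cancelled by server
Exit status 202 (0x000000CA) EXIT_ABORTED_BY_PROJECT
. . .
Run time
CPU time
Validate state Invalid"""

details = parse_task_details(SAMPLE)
print(details["Exit status"])
print(never_started(details))
# The decimal and hex forms of the exit status are the same number.
print(int("0x000000CA", 16) == 202)
```

If `never_started` is true for every cancelled task, that is consistent with the server cancelling work that was still waiting in the cache, though it does not explain why.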
2) Message boards : Number crunching : "Aborted by project" and "Cancelled by server" (Message 5978)
Posted 6 Dec 2022 by Jan Henrik
Post:
Hello Project,

Occasionally I have 1 or 2 "Aborted by project". It looks like it's increasing recently and yesterday I got 11 "Aborted by project" on just 1 machine.

What does that actually mean? Is that the same as the 21 "Cancelled by server" I can see on my task lists?

Greetings, Happy Holiday Season and good luck with the new server

JH
3) Message boards : Number crunching : "Output file . . .absent" (Message 5690)
Posted 1 Jul 2022 by Jan Henrik
Post:
. . . almost forgot about this:

yes! I could reproduce it on another machine and another project [though a different app and different log-lingo, yet the same 1-2 second thing]

So it's not hardware or project-specific.

It happens sometimes after updates to the runtime environment.

A restart of the client is not enough; I have to restart the computer, and then it's gone.
4) Message boards : Number crunching : "Output file . . .absent" (Message 5352)
Posted 9 May 2022 by Jan Henrik
Post:
Thanks for all the input.

First of all it didn't reproduce so far, so it's not urgent, more a curiosity.


But why would it occur after a few seconds? That should result in a computation error, and even that should result in some sort of result files being produced.

So I guess corrupted downloads are a possibility, but the resulting errors are very odd if that was the case.

Yeah. I have errors that clearly identify as "Error while downloading", and those don't start to compute. So that's OK then. But why did the others even start to compute?

As for the client. Yes it's 7.18.1
I have 4 with the very same OS/update/upgrade/client etc. but it happened only with one.
I know 4 is a small sample, but that has me tilted back to the hardware side.

Although it's a recent NVMe SSD and I checked/benchmarked it, I could still be the "lucky" guy with a faulty brand new NVMe SSD.
The perpetrator has a complete hardware twin that managed to behave. So I will take both off the project and do tests/benchmarks and see if there is much difference.
(It's a good time for that now since the pride show obviously overloads the project-server, so when all egos are satisfied I might be done with my tests and we can go back to regular boincing then)
5) Message boards : Number crunching : "Output file . . .absent" (Message 5342)
Posted 7 May 2022 by Jan Henrik
Post:
You have a slow storage system or an anti-virus program running that is locking the slots directories so the client can't access or read the output file.


Thanks for your input.
I'm on 22.04 LTS, have no AV (or would you recommend that?) and the storage is a recent NVMe which I checked anyway but is OK.
Could it be the server?
It obviously couldn't handle yesterday's uploads, and since the pride-show effectively starts before its official start, when the needy bunker tasks,
the downloads were getting more and more difficult around that time. Buggy downloads? . . . just a theory.

Anyway it didn't happen since then. So for now it is just a peculiar curiosity.

Thanks again for your thoughts.

BTW: what are the requirements for joining the OFA?
6) Message boards : Number crunching : LOOK OUT!!!! Pentathlon! (Message 5289)
Posted 4 May 2022 by Jan Henrik
Post:
Guilty as charged. I downloaded 5 days of work at once, most of which will be returned in 24 hours when the race starts. Feels like cheating to me but they all do it. They shouldn't announce the project name until the start of the race. I got a couple of days work at a time, so asked 3 times in a row.


so it has already started, as I can see from the lots of pending tasks

and I put the "Output file absent"-issue in a separate thread since I'm not sure if this is related[sorry]
7) Message boards : Number crunching : "Output file . . .absent" (Message 5288)
Posted 4 May 2022 by Jan Henrik
Post:
got about 130 tasks with compute errors after 1 or 2 seconds
and the event log printed something like this:

Sun 01 May 2022 10:02:33 PM CEST | Universe@Home | Computation for task universe_bh2_190723_398_6021738979_20000_1-999999_745100_1 finished
Sun 01 May 2022 10:02:33 PM CEST | Universe@Home | Output file universe_bh2_190723_398_6021738979_20000_1-999999_745100_1_r565751666_0 for task universe_bh2_190723_398_6021738979_20000_1-999999_745100_1 absent
Sun 01 May 2022 10:02:33 PM CEST | Universe@Home | Output file universe_bh2_190723_398_6021738979_20000_1-999999_745100_1_r565751666_1 for task universe_bh2_190723_398_6021738979_20000_1-999999_745100_1 absent
Sun 01 May 2022 10:02:33 PM CEST | Universe@Home | Output file universe_bh2_190723_398_6021738979_20000_1-999999_745100_1_r565751666_2 for task universe_bh2_190723_398_6021738979_20000_1-999999_745100_1 absent
Sun 01 May 2022 10:02:33 PM CEST | Universe@Home | Output file universe_bh2_190723_398_6021738979_20000_1-999999_745100_1_r565751666_4 for task universe_bh2_190723_398_6021738979_20000_1-999999_745100_1 absent
Sun 01 May 2022 10:02:33 PM CEST | Universe@Home | Output file universe_bh2_190723_398_6021738979_20000_1-999999_745100_1_r565751666_5 for task universe_bh2_190723_398_6021738979_20000_1-999999_745100_1 absent
Sun 01 May 2022 10:02:33 PM CEST | Universe@Home | Starting task universe_bh2_190723_398_6021763979_20000_1-999999_770100_0

happened only on 1 machine though and not on another [completely identical one]
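
To see how many output files went missing per task, the event log can be tallied. This is a minimal Python sketch, assuming the log lines look exactly like the ones pasted above. The function name `count_absent_outputs` and the regular expression are illustrative only and are not taken from the BOINC client.

```python
import re
from collections import Counter

# Matches the "Output file <file> for task <task> absent" part of a log line.
ABSENT = re.compile(r"Output file (\S+) for task (\S+) absent")

def count_absent_outputs(log_lines):
    """Count 'absent' output-file messages per task name."""
    counts = Counter()
    for line in log_lines:
        match = ABSENT.search(line)
        if match:
            counts[match.group(2)] += 1
    return counts

# Three lines copied from the log above (two "absent", one "Starting task").
TASK = "universe_bh2_190723_398_6021738979_20000_1-999999_745100_1"
LOG = [
    "Sun 01 May 2022 10:02:33 PM CEST | Universe@Home | Output file "
    + TASK + "_r565751666_0 for task " + TASK + " absent",
    "Sun 01 May 2022 10:02:33 PM CEST | Universe@Home | Output file "
    + TASK + "_r565751666_1 for task " + TASK + " absent",
    "Sun 01 May 2022 10:02:33 PM CEST | Universe@Home | Starting task "
    "universe_bh2_190723_398_6021763979_20000_1-999999_770100_0",
]

counts = count_absent_outputs(LOG)
for task, missing in counts.items():
    print(task, missing)
```

Run over the full event log of both machines, a tally like this would show whether the identical twin really had zero affected tasks.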
8) Message boards : Number crunching : LOOK OUT!!!! Pentathlon! (Message 5279)
Posted 3 May 2022 by Jan Henrik
Post:
More tasks are on the way :)
As you said - SSD's test is started ;)


are you sure?

It didn't start yet but I already get lots of this:
Tue 03 May 2022 03:28:05 AM CEST | Universe@Home | Scheduler request completed: got 0 new tasks
Tue 03 May 2022 03:29:01 AM CEST | Universe@Home | Project has no tasks available
(while the project status page still shows plenty of tasks available)

. . . instead of that:
Tue 03 May 2022 03:32:35 AM CEST | Universe@Home | This computer has reached a limit on tasks in progress


and also some:

Sun 01 May 2022 10:02:33 PM CEST | Universe@Home | Computation for task universe_bh2_190723_398_6021738979_20000_1-999999_745100_1 finished
Sun 01 May 2022 10:02:33 PM CEST | Universe@Home | Output file universe_bh2_190723_398_6021738979_20000_1-999999_745100_1_r565751666_0 for task universe_bh2_190723_398_6021738979_20000_1-999999_745100_1 absent
Sun 01 May 2022 10:02:33 PM CEST | Universe@Home | Output file universe_bh2_190723_398_6021738979_20000_1-999999_745100_1_r565751666_1 for task universe_bh2_190723_398_6021738979_20000_1-999999_745100_1 absent
Sun 01 May 2022 10:02:33 PM CEST | Universe@Home | Output file universe_bh2_190723_398_6021738979_20000_1-999999_745100_1_r565751666_2 for task universe_bh2_190723_398_6021738979_20000_1-999999_745100_1 absent
Sun 01 May 2022 10:02:33 PM CEST | Universe@Home | Output file universe_bh2_190723_398_6021738979_20000_1-999999_745100_1_r565751666_4 for task universe_bh2_190723_398_6021738979_20000_1-999999_745100_1 absent
Sun 01 May 2022 10:02:33 PM CEST | Universe@Home | Output file universe_bh2_190723_398_6021738979_20000_1-999999_745100_1_r565751666_5 for task universe_bh2_190723_398_6021738979_20000_1-999999_745100_1 absent
Sun 01 May 2022 10:02:33 PM CEST | Universe@Home | Starting task universe_bh2_190723_398_6021763979_20000_1-999999_770100_0
9) Message boards : Cafe : Sad (Message 5278)
Posted 3 May 2022 by Jan Henrik
Post:
My condolences. Remember the good times.
10) Message boards : News : No tasks (Message 5083)
Posted 13 Feb 2022 by Jan Henrik
Post:

And another news from this night...


Thanks for the quick answers and I wish you a speedy recovery or a very mild case in the first place. And good luck with the new SSDs . . .
11) Message boards : News : No tasks (Message 5074)
Posted 6 Feb 2022 by Jan Henrik
Post:
just curious (when there is time for it):

1.)
The recent server problem comes as with current load we have massive traffic between main and storage server and between main and database server.

I'm working in it but still can't recognize what exactly makes troubles in whole system.

. . . we need to replace two hard drives on main server - currently used are dying now.

Was the "massive traffic" because the hard drives are "dying" or are the hard drives "dying" because of the "massive traffic" ?


2.)
Anyway, I will prepare website monitoring script for better reactivity in case of major problems in future.

You're not planning to use Log4j, are you?


3.)
then I will go to Warsaw . . . "remote hands" will replace it for me

Are you managing this project from the UK?
12) Message boards : Number crunching : Upload failure: file size too big (Message 4173)
Posted 24 Apr 2020 by Jan Henrik
Post:
app-arently ULX 0.12 keeps craving for attention of your qualified personnel;

Application
Universe ULX 0.12
Name
universe_ulx_512_3963_20000_1-999999_500000
State
Computation error
Received
4/22/2020 6:18:27 AM
Report deadline
5/6/2020 6:18:26 AM
Estimated computation size
807 GFLOPs
CPU time
07:07:37
Elapsed time
07:08:11
Executable
universe-ULX_12_windows_x86_64.exe

how peculiar;

Universe@Home | Output file universe_ulx_512_3963_20000_1-999999_500000_3_r1296757251_1 for task universe_ulx_512_3963_20000_1-999999_500000_3 exceeds size limit.
Universe@Home | File size: 2080570381.000000 bytes. Limit: 1700000000.000000 bytes

whoever can fix this

greetings
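
For scale, the overrun in the message above can be worked out directly. This is a minimal Python sketch, assuming the "File size ... Limit ..." line has exactly the format pasted in this post. The function name `size_overrun` is made up for illustration.

```python
import re

# Matches the size and limit figures in the client message quoted above.
SIZE_LINE = re.compile(r"File size: ([\d.]+) bytes\. Limit: ([\d.]+) bytes")

def size_overrun(line):
    """Return (excess bytes, size/limit ratio), or None if the line does not match."""
    match = SIZE_LINE.search(line)
    if match is None:
        return None
    size = int(float(match.group(1)))
    limit = int(float(match.group(2)))
    return size - limit, size / limit

LINE = "Universe@Home | File size: 2080570381.000000 bytes. Limit: 1700000000.000000 bytes"
excess, ratio = size_overrun(LINE)
print(excess)            # bytes above the limit
print(round(ratio, 3))   # size as a multiple of the limit
```

The output file is 380570381 bytes over, roughly 22% above the 1700000000-byte limit, so this is not a borderline case.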
13) Message boards : Number crunching : Upload of WUs failed (Message 4033)
Posted 8 Feb 2020 by Jan Henrik
Post:


DietPi

853 Universe@Home 16-Feb-19 0:47:05 Temporarily failed download of universe_bh2_190723_296_178509822_20000_1-999999_510100: transient HTTP error

. . .

and many more


16-Feb-19 ???

remember PNT

is your Pi in 2019?
the server might be already in 2020 . . .
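
The date question can be made concrete. This is a minimal Python sketch, assuming the client stamp reads day-month-two-digit-year (which is how this post reads it) and comparing it with the date of this post. The function name `parse_client_timestamp` is illustrative only.

```python
from datetime import datetime, date

def parse_client_timestamp(stamp):
    """Parse a stamp like '16-Feb-19 0:47:05' as day-month-year (assumed order).

    Note: %b uses English month abbreviations in the default C locale.
    """
    return datetime.strptime(stamp, "%d-%b-%y %H:%M:%S")

logged = parse_client_timestamp("16-Feb-19 0:47:05")
posted = date(2020, 2, 8)  # date of this post

print(logged.date())
print((posted - logged.date()).days)
```

Under that reading the Pi's clock is nearly a year behind the post date, which would fit the suspicion that the machine's date is wrong.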

Copyright © 2024 Copernicus Astronomical Centre of the Polish Academy of Sciences
Project server and website managed by Krzysztof 'krzyszp' Piszczek