Message boards : Number crunching : Totally false server report/status
Message board moderation

To post messages, you must log in.

AuthorMessage
Profile marsinph

Send message
Joined: 22 Mar 18
Posts: 29
Credit: 24,402,488
RAC: 0
Message 2809 - Posted: 15 Apr 2018, 16:44:15 UTC

Hello,
If I believe status of server on 15 apr 16:30:56 UTC, there are
- 698186 WU ready to send
- 61410 in progress
- waiting validation : 3 WU

OK so far.

But I already have 10 WU finbished the two cruncher and awaiting validation !
And there are at least 2 hours ago returned.by both cruncher
See on
https://yafu.myfirewall.org/yafu/results.php?hostid=33275

I know server room renovation. But status server show all is running, so for me it will say it runs (or am I so stupid ?)
lease not ask me if I stay on page without refresh. I have three hosts. The couldn't have the salme IE cache.

Who can explain this ???
In my account : 10 WU , on server status only three.
If the server loose WU, I will not spend energy for nothing !

Best regards from Belgium
ID: 2809 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Gibson Praise
Avatar

Send message
Joined: 26 Feb 15
Posts: 3
Credit: 56,424,411
RAC: 12
Message 2811 - Posted: 15 Apr 2018, 17:11:08 UTC

Definitely something off-base. Validators say they are running yet I have loads of wu waiting for validation and almost all of them have completed wingmen in the same state.
ID: 2811 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Greger

Send message
Joined: 29 Oct 15
Posts: 3
Credit: 755,014,467
RAC: 726,705
Message 2812 - Posted: 15 Apr 2018, 17:15:53 UTC

marsinph:
This is not YAFU and your task there are finished or aborted, so no pending.
ID: 2812 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile marsinph

Send message
Joined: 22 Mar 18
Posts: 29
Credit: 24,402,488
RAC: 0
Message 2815 - Posted: 16 Apr 2018, 4:21:07 UTC - in response to Message 2812.  

State: All (199) · In progress (0) · Validation pending (87) · Validation inconclusive (0) · Valid (85) · Invalid (0) · Error (27)
Application: All (199) · Black Hole Database (198) · Universe Ultraviolet reionization (0) · Universe BHspin v2 (1) · Universe OpenCL test (0) · Universe QuarkStars (0) · Universe ULX (0)
Task name:



Task
click for details
Show names

Work unit
click for details

Computer

Sent

Time reported
or deadline
explain

Status

Run time
(sec)

CPU time
(sec)

Credit

Application

34056200 15007012 484648 15 Apr 2018, 12:41:28 UTC 16 Apr 2018, 0:41:48 UTC Completed, waiting for validation 5,912.24 5,557.19 pending Black Hole Database v0.05
34056202 15007013 484648 15 Apr 2018, 12:41:28 UTC 15 Apr 2018, 22:52:12 UTC Completed, waiting for validation 5,841.34 5,563.43 pending Black Hole Database v0.05
34055735 15006779 484648 15 Apr 2018, 12:41:28 UTC 15 Apr 2018, 17:41:50 UTC Completed, waiting for validation 6,346.10 5,869.57 pending Black Hole Database v0.05
34055740 15006782 484648 15 Apr 2018, 12:41:28 UTC 15 Apr 2018, 16:47:12 UTC Completed, waiting for validation 6,029.82 5,805.50 pending Black Hole Database v0.05
34055755 15006789 484648 15 Apr 2018, 12:41:28 UTC 15 Apr 2018, 16:24:24 UTC Completed, waiting for validation 6,165.63 5,852.08 pending Black Hole Database v0.05
34056304 15007064 484648 15 Apr 2018, 12:41:28 UTC 15 Apr 2018, 18:54:11 UTC Completed, waiting for validation 6,285.25 5,764.07 pending Black Hole Database v0.05
34056306 15007065 484648 15 Apr 2018, 12:41:28 UTC 16 Apr 2018, 0:52:21 UTC Completed, waiting for validation 5,939.93 5,564.40 pending Black Hole Database v0.05
34056205 15007014 484648 15 Apr 2018, 12:41:27 UTC 15 Apr 2018, 17:01:44 UTC Completed, waiting for validation 6,073.20 5,767.03 pending Black Hole Database v0.05
34056236 15007030 484648 15 Apr 2018, 12:41:27 UTC 15 Apr 2018, 22:52:12 UTC Completed, waiting for validation 5,860.14 5,558.14 pending Black Hole Database v0.05
34056264 15007044 484648 15 Apr 2018, 12:41:27 UTC 15 Apr 2018, 21:13:36 UTC Completed, waiting for validation 5,854.29 5,569.42 pending Black Hole Database v0.05
34056014 15006919 484648 15 Apr 2018, 12:41:27 UTC 15 Apr 2018, 15:22:57 UTC Completed, waiting for validation 5,798.65 5,678.03 pending Black Hole Database v0.05
34056284 15007054 484648 15 Apr 2018, 12:41:27 UTC 15 Apr 2018, 16:50:15 UTC Completed, waiting for validation 5,871.08 5,615.13 pending Black Hole Database v0.05
34056051 15006937 484648 15 Apr 2018, 12:41:27 UTC 15 Apr 2018, 19:25:14 UTC Completed, waiting for validation 6,094.01 5,680.43 pending Black Hole Database v0.05
34056052 15006938 484648 15 Apr 2018, 12:41:27 UTC 15 Apr 2018, 21:22:07 UTC Completed, waiting for validation 5,946.01 5,669.64 pending Black Hole Database v0.05
34055622 15006723 484648 15 Apr 2018, 12:41:27 UTC 16 Apr 2018, 1:02:41 UTC Completed, waiting for validation 5,925.39 5,614.04 pending Black Hole Database v0.05
34055624 15006724 484648 15 Apr 2018, 12:41:27 UTC 15 Apr 2018, 15:05:21 UTC Completed, waiting for validation 5,851.08 5,739.99 pending Black Hole Database v0.05
34055162 15006493 484648 15 Apr 2018, 12:41:27 UTC 15 Apr 2018, 17:09:08 UTC Completed, waiting for validation 7,027.81 6,617.33 pending Black Hole Database v0.05
34056251 15007037 484648 15 Apr 2018, 12:41:26 UTC 15 Apr 2018, 16:44:20 UTC Completed, waiting for validation 5,984.43 5,695.08 pending Black Hole Database v0.05
34056281 15007052 484648 15 Apr 2018, 12:41:26 UTC 15 Apr 2018, 21:09:17 UTC Completed, waiting for validation 5,867.84 5,539.16 pending Black Hole Database v0.05
34056036 15006930 484648 15 Apr 2018, 12:41:26 UTC 15 Apr 2018, 14:53:27 UTC Completed, waiting for validation 5,720.01 5,608.19 pending Black Hole Database v0.05
ID: 2815 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Greger

Send message
Joined: 29 Oct 15
Posts: 3
Credit: 755,014,467
RAC: 726,705
Message 2817 - Posted: 16 Apr 2018, 23:10:53 UTC

Fully normal validate or waiting for your wingman to send back task to workunit. First then it start to validate your task.
If you check on workunit it is replication 2 and minimum quorum 2. If you find a workunit that are completed you could link that but if it would be pending validation as 2 task needs to be completed without error. As there is a deadline of 15 days it could take up to this time until it got canceled and resend to another wingman.

Validator could be set on hold if project admins need free up process to be able to generate big batch of work. This could take a few minus up to an hour and server status page would not pick up this fast enough as page update on specific times and would not be real time.

Work units would not get lost on server, work unit stay on server and it only send a copy of unit and and be send as a task to host. It would require hardware failure to lose data.
ID: 2817 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile marsinph

Send message
Joined: 22 Mar 18
Posts: 29
Credit: 24,402,488
RAC: 0
Message 2819 - Posted: 20 Apr 2018, 13:10:21 UTC

It is not so much the problem.
The server status page show all running.
But my "event log" report full opposite !
And my WU stay in status "ready to report" on my host/

20/04/2018 14:24:13 | Universe@Home | Sending scheduler request: To fetch work.
20/04/2018 14:24:13 | Universe@Home | Reporting 1 completed tasks
20/04/2018 14:24:13 | Universe@Home | Requesting new tasks for CPU
20/04/2018 14:24:15 | Universe@Home | Scheduler request completed: got 0 new tasks
20/04/2018 14:24:15 | Universe@Home | Server error: feeder not running
20/04/2018 14:59:09 | Universe@Home | update requested by user
20/04/2018 14:59:12 | Universe@Home | Sending scheduler request: Requested by user.
20/04/2018 14:59:12 | Universe@Home | Reporting 1 completed tasks
20/04/2018 14:59:12 | Universe@Home | Requesting new tasks for CPU
20/04/2018 14:59:14 | Universe@Home | Scheduler request completed: got 0 new tasks
20/04/2018 14:59:14 | Universe@Home | Server error: feeder not running
20/04/2018 15:03:15 | Universe@Home | update requested by user
20/04/2018 15:03:15 | Universe@Home | Sending scheduler request: Requested by user.
20/04/2018 15:03:15 | Universe@Home | Reporting 1 completed tasks
20/04/2018 15:03:15 | Universe@Home | Requesting new tasks for CPU
20/04/2018 15:03:17 | Universe@Home | Scheduler request completed: got 0 new tasks
20/04/2018 15:03:17 | Universe@Home | Server error: feeder not running
ID: 2819 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile planetclown

Send message
Joined: 8 Nov 15
Posts: 1
Credit: 14,456,767
RAC: 122,316
Message 2820 - Posted: 20 Apr 2018, 14:29:09 UTC
Last modified: 20 Apr 2018, 14:30:23 UTC

+1 I'm seeing the same behavior as marsinph ("Server error: feeder not running" and tasks hanging with "Ready to report" even after an update)
ID: 2820 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile marsinph

Send message
Joined: 22 Mar 18
Posts: 29
Credit: 24,402,488
RAC: 0
Message 2821 - Posted: 20 Apr 2018, 15:41:16 UTC

And it stay so !!!
I see producing WU increase, about 2,000,000 at 15/40UTC
But why to produce if we are unable to report ???
See my log :(our are UTC+2 : Belgium)
20/04/2018 17:35:23 | Universe@Home | update requested by user
20/04/2018 17:35:26 | Universe@Home | Sending scheduler request: Requested by user.
20/04/2018 17:35:26 | Universe@Home | Reporting 9 completed tasks
20/04/2018 17:35:26 | Universe@Home | Requesting new tasks for CPU
20/04/2018 17:35:28 | Universe@Home | Scheduler request completed: got 0 new tasks
20/04/2018 17:35:28 | Universe@Home | Server error: feeder not running
ID: 2821 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
frankhagen

Send message
Joined: 12 Aug 17
Posts: 12
Credit: 52,505,947
RAC: 35,627
Message 2822 - Posted: 20 Apr 2018, 19:06:10 UTC - in response to Message 2821.  

here we go again: Rückstand des Transitioners (Stunden) 6.97
ID: 2822 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Senilix

Send message
Joined: 22 Feb 15
Posts: 9
Credit: 3,107,110
RAC: 0
Message 2823 - Posted: 20 Apr 2018, 20:16:36 UTC - in response to Message 2822.  

5 days ago the project admin announced that he will be out of town for about a week. So we might have to wait for another couple of days until he returns and finds the time to kick that feeder's ass...
ID: 2823 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile lost68er
Avatar

Send message
Joined: 21 Mar 16
Posts: 9
Credit: 14,620,950
RAC: 0
Message 2824 - Posted: 21 Apr 2018, 6:34:33 UTC
Last modified: 21 Apr 2018, 6:49:48 UTC

...never touch a running system...

@Senilix: maybe he should not kick the feeder´s ass, but subject him to a long, painful interrogation .... ;-)
ID: 2824 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Krzysztof Piszczek - wspieram ...
Project administrator
Project developer
Project tester
Avatar

Send message
Joined: 4 Feb 15
Posts: 760
Credit: 142,142,465
RAC: 5,737
Message 2831 - Posted: 22 Apr 2018, 19:01:28 UTC - in response to Message 2823.  

5 days ago the project admin announced that he will be out of town for about a week. So we might have to wait for another couple of days until he returns and finds the time to kick that feeder's ass...

I'm BACK :)

Yes, somebody has probably moved server to it's original location but didn't start properly all services. Now I have done it and everything should be ok in next few hours.
Krzysztof 'krzyszp' Piszczek

Member of Radioactive@Home team
My Patreon profile
Universe@Home on YT
ID: 2831 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
CasinoAsist

Send message
Joined: 8 Apr 20
Posts: 2
Credit: 1,354,400
RAC: 0
Message 4162 - Posted: 16 Apr 2020, 14:07:56 UTC

I have similiar situation after excatly 1 year later .

96031544 44125459 555588 16 Apr 2020, 1:49:23 UTC 16 Apr 2020, 5:47:28 UTC Completed, waiting for validation 12,624.36 12,624.36 pending Universe ULX v0.12
windows_x86_64
95577089 43924054 556762 15 Apr 2020, 21:54:32 UTC 16 Apr 2020, 0:11:58 UTC Completed, waiting for validation 5,516.64 5,488.11 pending Universe BHspin v2 v0.19
x86_64-pc-linux-gnu
95577090 43924055 556762 15 Apr 2020, 21:54:32 UTC 16 Apr 2020, 0:15:25 UTC Completed, waiting for validation 5,455.77 5,434.87 pending Universe BHspin v2 v0.19
x86_64-pc-linux-gnu
95577001 43924010 556762 15 Apr 2020, 21:54:32 UTC 16 Apr 2020, 0:10:09 UTC Completed, waiting for validation 5,435.76 5,403.23 pending Universe BHspin v2 v0.19
x86_64-pc-linux-gnu
95576999 43924009 556762 15 Apr 2020, 15:21:49 UTC 15 Apr 2020, 22:39:17 UTC Completed, waiting for validation 5,113.69 5,076.22 pending Universe BHspin v2 v0.19
x86_64-pc-linux-gnu
95499883 43885500 555588 13 Apr 2020, 20:17:27 UTC 15 Apr 2020, 9:34:31 UTC Completed, waiting for validation 13,148.49 13,095.73 pending Universe BHspin v2 v0.19
windows_x86_64
95493459 43882292 555588 13 Apr 2020, 17:41:57 UTC 15 Apr 2020, 4:50:00 UTC Completed, waiting for validation 17,042.85 17,042.85 pending Universe BHspin v2 v0.19
windows_x86_64
95490665 43880896 555588 13 Apr 2020, 16:16:25 UTC 14 Apr 2020, 19:08:03 UTC Completed, waiting for validation 16,107.64 16,107.64 pending Universe BHspin v2 v0.19
windows_x86_64
95481349 43876244 555588 13 Apr 2020, 12:09:30 UTC 14 Apr 2020, 9:17:40 UTC Completed, waiting for validation 16,991.61 16,991.61 pending Universe BHspin v2 v0.19
windows_x86_64


these are waiting almost for 2-4 days .
ID: 4162 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Thyme Lawn

Send message
Joined: 15 Oct 17
Posts: 11
Credit: 3,811,677
RAC: 2,247
Message 4165 - Posted: 18 Apr 2020, 15:30:52 UTC - in response to Message 4162.  
Last modified: 18 Apr 2020, 15:32:06 UTC

All workunits have a minimum quorum of 2, so a second task has to be completed with the same result before tasks can change to validated. All of your tasks are waiting for the second completion.

For example, the last task in your list is from WU 43924054 and the second task still has 11 days to go until it'll be timed out (deadline 29 Apr 2020, 14:46:28 UTC). If that task fails or times out the server will issue an additional task and continue to do that until a pair of tasks are validated, 5 tasks fail or 10 tasks have been reported (any aborted tasks won't count as a failed task but will count as a reported task).

95577088 	546390 	15 Apr 2020, 14:46:28 UTC 	29 Apr 2020, 14:46:28 UTC 	In progress 	--- 	--- 	--- 	Universe BHspin v2 v0.19 x86_64-pc-linux-gnu
95577089 	556762 	15 Apr 2020, 21:54:32 UTC 	16 Apr 2020, 0:11:58 UTC 	Completed, waiting for validation 	5,516.64 	5,488.11 	pending 	Universe BHspin v2 v0.19 x86_64-pc-linux-gnu

"The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer
ID: 4165 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Message boards : Number crunching : Totally false server report/status




Copyright © 2022 Copernicus Astronomical Centre of the Polish Academy of Sciences
Project server and website managed by Krzysztof 'krzyszp' Piszczek