Message boards : Number crunching : Problems with Work Units.
Message board moderation

To post messages, you must log in.

1 · 2 · Next

AuthorMessage
Profile Cruncher Pete

Send message
Joined: 23 Feb 15
Posts: 10
Credit: 106,637,804
RAC: 1
Message 72 - Posted: 24 Feb 2015, 6:21:46 UTC

Congratulations Krzysztof on the setting up of the production server. As one can expect, problems will occur . "Huston, we have a problem". I only been running WU's for 4 to 8 hours on various machines. Results to date: In progress=1146, Pending =17, Errors=16 (All cancelled by project), Valid=1.

Since sending WU's are halted, the validator and assimilator are Not running, shall I continue to crunch the remaining 1146 units? will it be some use for the project? or should I abort them. I am not worried about the credit but I do not wish to waste Electricity if they are not worth anything..

You might also like to know that although I am on cable, my upload speed is only 13kbps and looking at the upload of just one unit, it will take over one hour to upload a unit that is close to 5MBps. Not complaining, but I thought you ought to know.
ID: 72 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Krzysztof Piszczek - wspieram ...
Project administrator
Project developer
Project tester
Avatar

Send message
Joined: 4 Feb 15
Posts: 740
Credit: 140,379,798
RAC: 12,754
Message 74 - Posted: 24 Feb 2015, 11:34:39 UTC - in response to Message 72.  

I'm surprised for errors on your computers. Overall fail rate for all WU's is 0.2208% (without counting aborted WU's).
Yes, you can continue as the cancelled by server units are probably from very first short batches started at the day one of project and cancelled by me due to bug in results template file.

I know that results files are large, but is nothing what I can do with this at the moment (the files are compressed, before compression they are up to 400MB together).
Krzysztof 'krzyszp' Piszczek

Member of Radioactive@Home project team
My Patreon profile
ID: 74 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile erik*
Avatar

Send message
Joined: 21 Feb 15
Posts: 4
Credit: 182,428,894
RAC: 48,977
Message 85 - Posted: 24 Feb 2015, 21:54:00 UTC

Hello all guys,
It seems the servor abort for the third user is due to tasks sent after the deadline for the second user.
http://universeathome.pl/universe/workunit.php?wuid=8187
it's not the only one for me.

Regards.
ID: 85 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Volodymyr Goroshko

Send message
Joined: 3 Jun 15
Posts: 4
Credit: 16,528,220
RAC: 7,237
Message 345 - Posted: 3 Jun 2015, 20:17:27 UTC
Last modified: 3 Jun 2015, 20:22:37 UTC

Hi to all!
I have problems downloading tasks for servers in the domain (Win-2003 (x32), Win-2008 (x64)) boinc-client v.6.10.45
Part of log file:
 03-Jun-2015 22:37:55 [http://universeathome.pl/universe/] Fetching scheduler list
03-Jun-2015 22:38:02 [---] Account manager: BAM! Host: 493047
03-Jun-2015 22:38:02 [---] Account manager: Number of BAM! connections for this host: 4777
03-Jun-2015 22:38:02 [---] Account manager contact succeeded
03-Jun-2015 22:38:02 [---] General prefs: from http://bam.boincstats.com/ (last modified 14-Mar-2013 13:04:53)
03-Jun-2015 22:38:02 [---] Host location: home
03-Jun-2015 22:38:02 [---] General prefs: no separate prefs for home; using your defaults
03-Jun-2015 22:38:02 [---] Reading preferences override file
03-Jun-2015 22:38:02 [---] Preferences limit memory usage when active to 4887.30MB
03-Jun-2015 22:38:02 [---] Preferences limit memory usage when idle to 7330.96MB
03-Jun-2015 22:38:02 [---] Preferences limit disk usage to 68.29GB
03-Jun-2015 22:38:04 [http://universeathome.pl/universe/] Master file download succeeded
03-Jun-2015 22:38:18 [http://universeathome.pl/universe/] Sending scheduler request: Project initialization.  Requesting 1 seconds of work, reporting 0 completed tasks
03-Jun-2015 22:38:29 [Universe@Home] Scheduler request succeeded: got 1 new tasks
03-Jun-2015 22:38:29 [---] General prefs: from http://bam.boincstats.com/ (last modified 03-Jun-2015 20:58:55)
03-Jun-2015 22:38:29 [---] Host location: home
03-Jun-2015 22:38:29 [---] General prefs: using separate prefs for home
03-Jun-2015 22:38:29 [---] Reading preferences override file
03-Jun-2015 22:38:29 [---] Preferences limit memory usage when active to 4887.30MB
03-Jun-2015 22:38:29 [---] Preferences limit memory usage when idle to 7330.96MB
03-Jun-2015 22:38:29 [---] Preferences limit disk usage to 68.29GB
03-Jun-2015 22:38:30 [Universe@Home] [error] Couldn't find suitable URL for universe_xray_18_20000_300001-500000_353423_1_0
03-Jun-2015 22:38:30 [---] [error] No URL for file transfer of universe_xray_18_20000_300001-500000_353423_1_0
03-Jun-2015 22:38:30 [Universe@Home] [error] Couldn't find suitable URL for universe_xray_18_20000_300001-500000_353423_1_1
03-Jun-2015 22:38:30 [---] [error] No URL for file transfer of universe_xray_18_20000_300001-500000_353423_1_1
03-Jun-2015 22:38:30 [Universe@Home] [error] Couldn't find suitable URL for universe_xray_18_20000_300001-500000_353423_1_2
03-Jun-2015 22:38:30 [---] [error] No URL for file transfer of universe_xray_18_20000_300001-500000_353423_1_2
03-Jun-2015 22:38:30 [Universe@Home] [error] Couldn't find suitable URL for universe_xray_18_20000_300001-500000_353423_1_3
03-Jun-2015 22:38:30 [---] [error] No URL for file transfer of universe_xray_18_20000_300001-500000_353423_1_3
03-Jun-2015 22:38:31 [Universe@Home] Started download of universe-xray_sources_v2_3_windows_intelx86.exe
03-Jun-2015 22:38:31 [Universe@Home] Started download of job_3.0.xml
03-Jun-2015 22:38:34 [Universe@Home] Finished download of job_3.0.xml
03-Jun-2015 22:38:34 [Universe@Home] Started download of wu_idum3_18_353423
03-Jun-2015 22:38:36 [Universe@Home] Finished download of wu_idum3_18_353423
03-Jun-2015 22:38:36 [Universe@Home] Started download of universe_xray_18_20000_300001-500000_353423_1_0
03-Jun-2015 22:38:37 [Universe@Home] Finished download of universe_xray_18_20000_300001-500000_353423_1_0
03-Jun-2015 22:38:37 [Universe@Home] Started download of universe_xray_18_20000_300001-500000_353423_1_1
03-Jun-2015 22:38:38 [Universe@Home] Finished download of universe_xray_18_20000_300001-500000_353423_1_1
03-Jun-2015 22:38:38 [Universe@Home] Started download of universe_xray_18_20000_300001-500000_353423_1_2
03-Jun-2015 22:38:38 [Universe@Home] [error] Couldn't find suitable URL for universe_xray_18_20000_300001-500000_353423_1_0
03-Jun-2015 22:38:38 [---] [error] No URL for file transfer of universe_xray_18_20000_300001-500000_353423_1_0
03-Jun-2015 22:38:39 [Universe@Home] Finished download of universe_xray_18_20000_300001-500000_353423_1_2
03-Jun-2015 22:38:39 [Universe@Home] Started download of universe_xray_18_20000_300001-500000_353423_1_3
03-Jun-2015 22:38:39 [Universe@Home] [error] Couldn't find suitable URL for universe_xray_18_20000_300001-500000_353423_1_1
03-Jun-2015 22:38:39 [---] [error] No URL for file transfer of universe_xray_18_20000_300001-500000_353423_1_1


Loading permanent tasks that fall in error.
ID: 345 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Krzysztof Piszczek - wspieram ...
Project administrator
Project developer
Project tester
Avatar

Send message
Joined: 4 Feb 15
Posts: 740
Credit: 140,379,798
RAC: 12,754
Message 346 - Posted: 3 Jun 2015, 20:42:02 UTC - in response to Message 345.  

Hmm, I see few problems.

I can't find on your account computer with BOINC version 6.xx, but I can see v 5.10.45. I didn't test application with that old version of BOINC but I strongly suspect that this will not work due to file compression used in our project. Also as far as I know, only versions above 7.0 fully support file compression...

Also, we have small server problem today, can you maybe try to reset project?
Krzysztof 'krzyszp' Piszczek

Member of Radioactive@Home project team
My Patreon profile
ID: 346 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Volodymyr Goroshko

Send message
Joined: 3 Jun 15
Posts: 4
Credit: 16,528,220
RAC: 7,237
Message 348 - Posted: 4 Jun 2015, 17:01:10 UTC - in response to Message 346.  

Over the past day counted and checked several project tasks of a server. However, the situation has not changed. Must I refuse computing of these computers? Thank you for answer.
ID: 348 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Krzysztof Piszczek - wspieram ...
Project administrator
Project developer
Project tester
Avatar

Send message
Joined: 4 Feb 15
Posts: 740
Credit: 140,379,798
RAC: 12,754
Message 349 - Posted: 4 Jun 2015, 18:14:26 UTC - in response to Message 348.  

Over the past day counted and checked several project tasks of a server. However, the situation has not changed. Must I refuse computing of these computers? Thank you for answer.

Which computers?
On your account I see only one broken task...
Can you give me a link to "problematic" computer, please?
Krzysztof 'krzyszp' Piszczek

Member of Radioactive@Home project team
My Patreon profile
ID: 349 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Volodymyr Goroshko

Send message
Joined: 3 Jun 15
Posts: 4
Credit: 16,528,220
RAC: 7,237
Message 350 - Posted: 4 Jun 2015, 19:08:03 UTC - in response to Message 349.  

Which computers?
I mean computers with boinc-client v.6.10.45 (3pcs in domain)
On your account I see only one broken task...
Yes, I know, but process download new WU's is continuously. Maybe it not important, but log file update everytime.
Can you give me a link to "problematic" computer, please?
I can only provide a log file. If look in general situation - project is computing. Question is correctly or not. Is there an additional burden on my PC's & your server?
Thank you.
ID: 350 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Volodymyr Goroshko

Send message
Joined: 3 Jun 15
Posts: 4
Credit: 16,528,220
RAC: 7,237
Message 355 - Posted: 5 Jun 2015, 15:17:18 UTC - in response to Message 350.  

Now the situation has stabilized.
Only one server has constant attempts to download the WU's.
Maybe it suffer...
ID: 355 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Conan
Avatar

Send message
Joined: 4 Feb 15
Posts: 46
Credit: 3,934,546
RAC: 7,058
Message 359 - Posted: 10 Jun 2015, 1:17:12 UTC
Last modified: 10 Jun 2015, 1:34:50 UTC

If you are referring to the constant downloading of files when you have a Universe task on your computer and you are using an older BOINC Client of 6.xx.xx then this is normal.
I have BOINC Clients 6.12.41 and 6.12.43 and I get constant downloading files that clog up my log file.
It does not stop the Universe tasks being run, and whilst it is annoying there is nothing that seems to able to be done to stop it, other than upgrading BOINC Client to a 7.xx.xx version, this I have yet to do.

The downloading files are of zero byte length but take up some small amount of bandwidth. There are a number of earlier posts about this issue.
As I said it wont stop the project from running.

Conan
ID: 359 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Conan
Avatar

Send message
Joined: 4 Feb 15
Posts: 46
Credit: 3,934,546
RAC: 7,058
Message 434 - Posted: 26 Jul 2015, 3:18:58 UTC

This is one of the last of the current batch of work units still out there See Here
It is still out there because it has a major problem.
The first volunteer had it running for over 171 hours before aborting the WU.
I aborted the same WU last night as it had reached over 36 Hours (still using CPU) and only showed just between 52% and 53%, which it had shown for many, many hours without moving.

Seems to be in an endless loop.

Conan
ID: 434 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Yavanius
Avatar

Send message
Joined: 13 May 15
Posts: 78
Credit: 1,972,738
RAC: 0
Message 435 - Posted: 26 Jul 2015, 5:26:54 UTC - in response to Message 434.  

This is one of the last of the current batch of work units still out there See Here
It is still out there because it has a major problem.
The first volunteer had it running for over 171 hours before aborting the WU.
I aborted the same WU last night as it had reached over 36 Hours (still using CPU) and only showed just between 52% and 53%, which it had shown for many, many hours without moving.

Seems to be in an endless loop.

Conan


Got caught in an event horizon. ;)

When will there be a new batch of work?
ID: 435 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
mikey
Avatar

Send message
Joined: 4 Apr 15
Posts: 26
Credit: 27,394,567
RAC: 156,011
Message 436 - Posted: 27 Jul 2015, 16:29:52 UTC

Website shows 1000 tasks available but when I ask for tasks on my pc I get this:

7/27/2015 12:26:10 PM | Universe@Home | Project has no tasks available

Besides being painfully slow it doesn't work!!!
ID: 436 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Yavanius
Avatar

Send message
Joined: 13 May 15
Posts: 78
Credit: 1,972,738
RAC: 0
Message 437 - Posted: 27 Jul 2015, 21:16:17 UTC - in response to Message 436.  

Aye, but if you look at the number of WUs out, it only says 34. It's actually been saying 10,000 for a whiles. Either there's not really 10,000 WUs or something is stuck. I think it's the former actually.

Krys, you are strangely quiet. Did you have another emergency departure?
ID: 437 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
cykodennis
Avatar

Send message
Joined: 4 Feb 15
Posts: 24
Credit: 7,035,527
RAC: 0
Message 438 - Posted: 28 Jul 2015, 11:26:46 UTC

I've got a whole bunch of problems with U@H in the moment.
Upload problems, no new work, calculation errors on several machines. I'll pause U@H until knowing more.
"I should bring one important point to the attention of the authors and that is, the world is not the United States..."
ID: 438 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Krzysztof Piszczek - wspieram ...
Project administrator
Project developer
Project tester
Avatar

Send message
Joined: 4 Feb 15
Posts: 740
Credit: 140,379,798
RAC: 12,754
Message 439 - Posted: 28 Jul 2015, 14:24:29 UTC

Ok.

Is only some resend ready to go at the moment, I'm also strongly work with MySQL server as database are growth rapidly recently (so, I have to remove some old results to get it back correctly).

We will not create new job until work on new app finishes and it takes while until UW team finish their work on it and I will port application to BOINC.

I hope this not takes to long.
Krzysztof 'krzyszp' Piszczek

Member of Radioactive@Home project team
My Patreon profile
ID: 439 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
mikey
Avatar

Send message
Joined: 4 Apr 15
Posts: 26
Credit: 27,394,567
RAC: 156,011
Message 440 - Posted: 29 Jul 2015, 1:23:39 UTC - in response to Message 439.  

Ok.

Is only some resend ready to go at the moment, I'm also strongly work with MySQL server as database are growth rapidly recently (so, I have to remove some old results to get it back correctly).

We will not create new job until work on new app finishes and it takes while until UW team finish their work on it and I will port application to BOINC.

I hope this not takes to long.


Thank you!!
ID: 440 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Yavanius
Avatar

Send message
Joined: 13 May 15
Posts: 78
Credit: 1,972,738
RAC: 0
Message 441 - Posted: 29 Jul 2015, 6:28:54 UTC - in response to Message 439.  



We will not create new job until work on new app finishes and it takes while until UW team finish their work on it and I will port application to BOINC.
.


Will you be adding any new clients beyond what you currently have?

dziękuję
ID: 441 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Krzysztof Piszczek - wspieram ...
Project administrator
Project developer
Project tester
Avatar

Send message
Joined: 4 Feb 15
Posts: 740
Credit: 140,379,798
RAC: 12,754
Message 442 - Posted: 29 Jul 2015, 13:55:24 UTC - in response to Message 441.  


Will you be adding any new clients beyond what you currently have?
dziękuję

It will be new app to replace current one.
Krzysztof 'krzyszp' Piszczek

Member of Radioactive@Home project team
My Patreon profile
ID: 442 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Highlander_6596
Avatar

Send message
Joined: 8 Aug 15
Posts: 2
Credit: 344,667
RAC: 0
Message 446 - Posted: 9 Aug 2015, 4:15:17 UTC
Last modified: 9 Aug 2015, 4:17:56 UTC

The follow issue is observed... Server error: feeder not running
Apologies if this is already known.

8/8/2015 9:05:05 PM | Universe@Home | update requested by user
8/8/2015 9:05:06 PM | Universe@Home | sched RPC pending: Requested by user
8/8/2015 9:05:06 PM | Universe@Home | [sched_op] Starting scheduler request
8/8/2015 9:05:06 PM | Universe@Home | Sending scheduler request: Requested by user.
8/8/2015 9:05:06 PM | Universe@Home | Requesting new tasks for CPU and AMD/ATI GPU
8/8/2015 9:05:06 PM | Universe@Home | [sched_op] CPU work request: 33222.84 seconds; 4.00 devices
8/8/2015 9:05:06 PM | Universe@Home | [sched_op] AMD/ATI GPU work request: 117924.97 seconds; 0.50 devices

8/8/2015 9:05:09 PM | Universe@Home | Scheduler request completed: got 0 new tasks
8/8/2015 9:05:09 PM | Universe@Home | Server error: feeder not running
8/8/2015 9:05:09 PM | Universe@Home | Project requested delay of 3600 seconds
8/8/2015 9:05:09 PM | Universe@Home | [sched_op] Deferring communication for 01:00:00
8/8/2015 9:05:09 PM | Universe@Home | [sched_op] Reason: project requested delay

ID: 446 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
1 · 2 · Next

Message boards : Number crunching : Problems with Work Units.




Copyright © 2021 Copernicus Astronomical Centre of the Polish Academy of Sciences
Project server and website managed by Krzysztof 'krzyszp' Piszczek