41) Message boards : Number crunching : Automatic abort work unit (Message 6123)
Posted 10 Apr 2023 by Grant (SSSF)
Post:
does the speed of pcie matter in Boinc? according to the motherboard specs. i am now only run 4x for all 3 gpus. is there any benefit if the gpus are operating in 8x or 16x?
It would depend on the application, but in general the PCIe bus bandwidth has little (if any) impact on processing GPU Tasks (at least if it's PCIe 4).


as of now, there are still 3-4 errors a day from each project. im ok with that but if i can get rid of them, it will be super.
Look at the std_err output for those errored Tasks to see what is occurring.

As far a Universe is concerned, you haven't had any for 4 or more days now (and the only one that was recent wasn't really an error, it was just the Project cancelling a Task before you started processing it).
42) Message boards : Number crunching : Automatic abort work unit (Message 6119)
Posted 8 Apr 2023 by Grant (SSSF)
Post:
Remove the 5700XT and see if the issues go away- something else could have happened around the same time the new video card was added that's causing the problem. If removing the card fixes it, then it's the problem, if not, then something else has occurred that's causing issues.

It's worth getting Process Explorer to see what's running on the system; as much as Task Manager has improved over the years, Process Explorer is still way better for a more detailed look at what is using system resources.
In my case, after using Windows Security for years without issue, it started sucking up CPU time and none of the following updates sorted the problem out so i disabled it & now use a 3rd party security application.
43) Message boards : Number crunching : Automatic abort work unit (Message 6116)
Posted 7 Apr 2023 by Grant (SSSF)
Post:
Here's the stderr.txt file from one of your systems for a Task that errored out.

<core_client_version>7.20.2</core_client_version>
<![CDATA[
<message>
Process still present 5 min after writing finish file; aborting</message>
<stderr_txt>
22:32:10 (2992): Can't acquire lockfile (32) - waiting 35s
22:32:46 (2992): Can't acquire lockfile (32) - exiting
22:32:46 (2992): Error: The process cannot access the file because it is being used by another process.

 (0x20)
01:18:54 (11308): called boinc_finish(0)

</stderr_txt>
]]>




And another

<core_client_version>7.20.2</core_client_version>
<![CDATA[
<message>
Process still present 5 min after writing finish file; aborting</message>
<stderr_txt>
22:32:11 (5604): Can't acquire lockfile (32) - waiting 35s
22:32:46 (5604): Can't acquire lockfile (32) - exiting
22:32:46 (5604): Error: The process cannot access the file because it is being used by another process.

 (0x20)
01:32:30 (3792): Can't acquire lockfile (32) - waiting 35s
01:33:05 (3792): Can't acquire lockfile (32) - exiting
01:33:05 (3792): Error: The process cannot access the file because it is being used by another process.

 (0x20)
02:06:33 (13224): Can't acquire lockfile (32) - waiting 35s
02:07:08 (13224): Can't acquire lockfile (32) - exiting
02:07:08 (13224): Error: The process cannot access the file because it is being used by another process.

 (0x20)
02:21:37 (8564): Can't acquire lockfile (32) - waiting 35s
02:22:12 (8564): Can't acquire lockfile (32) - exiting
02:22:12 (8564): Error: The process cannot access the file because it is being used by another process.

 (0x20)
02:41:58 (2464): Can't acquire lockfile (32) - waiting 35s
02:42:33 (2464): Can't acquire lockfile (32) - exiting
02:42:33 (2464): Error: The process cannot access the file because it is being used by another process.

 (0x20)
02:43:30 (10852): called boinc_finish(0)

</stderr_txt>
]]>


It's either a mechanical HDD that can't keep up with the disk input/output load, or more likely a badly behaved AV programme.
White list the C:\ProgramData\BOINC directory and the issue should no longer occur.
Reducing the number of CPU cores/threads in use shouldn't be necessary.


Then there was another system where all the work timed out- it looks like for whatever reason you added it as a new system/it got a new ID number, so all the existing work timed out.
44) Questions and Answers : Preferences : Universe@home (Message 6112)
Posted 4 Apr 2023 by Grant (SSSF)
Post:
Is Universe@home done?
No.


I have stopped receiving tasks.
Are you running more than one project? Processing time could be owed to the other project(s). The more projects you do & the larger your cache, and the less time BOINC has to do work then the longer it takes for your Resource Share settings to be met.
What messages do you get in your BOINC Event Log when you Update Universe@home?

BOINC Manager, Advanced view, Tools, Event log.


Looking at your processing times, it looks as though your system is doing quite a bit of work other than just BOINC- there's a significant difference between the CPU time & the Run time (over a couple of hours i'd expect 2-4min difference for a lightly used system, not 25min).
45) Message boards : Number crunching : Server Thread (Message 6083)
Posted 26 Mar 2023 by Grant (SSSF)
Post:
And we're back again after being MIA for a few hours.
46) Questions and Answers : Windows : Upload problem (Message 6077)
Posted 22 Mar 2023 by Grant (SSSF)
Post:
You're running an older client, so it could be an issue with the security certificate.

I'd suggest downloading & installing the current BOINC client over you existing installation to upgrade it.
If after upgrading & restarting BOINC the files don't start to upload, it will be a case of checking out the Event log, possibly enabling some diagnostic flags in order to see what's going on.
47) Message boards : Number crunching : Oracle VM & Ubuntu (Message 6056)
Posted 19 Mar 2023 by Grant (SSSF)
Post:
Pretty atrocious credit by the looks atm.
The amount of Credit per Task is fixed.
Linux gives better processing times than Windows.

Instead of mucking around with VMs you could just make use of Windows Subsystem for LINUX?
48) Questions and Answers : Getting started : Why 50% CPU faster than 100? (Message 6052)
Posted 19 Mar 2023 by Grant (SSSF)
Post:
Well, in my first run I let my computer use 100% CPU with 10 WUs
It's took 1h30m to complete all.
But when I limit to 50% and 5 WUs
It's took only 1h10m (2x5WUs) to complete.
Why and how?
Different Tasks require different lengths of time to complete.
Even if running exactly the same Task- Reducing the number of cores in use allows higher clocking of the CPU, and less cache contention, so less time to process each Task. But as less Tasks will be done, that will result in less work actually being done per hour.
49) Questions and Answers : Getting started : What is happening? (Message 6050)
Posted 18 Mar 2023 by Grant (SSSF)
Post:
Hmm, thats odd because I had thought my Mac has been returning results for the last several weeks - I may have been misreading it but there was no problem raised when I added the project (unlike LHC which requires VirtualBox).
Looking at your systems, your Linux & Windows systems are returning work, but your Darwin system which was attached to Rosetta on Feb 19 hasn't done a single Task.

You can add a Project, even if it doesn't have an application for that system's OS, it just won't be able to get any work. Go to the BOINC Manger advanced view and bring up the Event Log (Tools Tab, Event Log).
Go to the Projects Tab, select Universe & click on Update. See what messages come up in the Event Log.
50) Message boards : Number crunching : Universe Disappeared (Message 6046)
Posted 18 Mar 2023 by Grant (SSSF)
Post:
You now show two Ryzen 7 5700G systems- one with Windows 10, the other with Windows 11.
Looks like you upgraded the OS on your system, and upgraded the BOINC Manager, and for some reason the system got a new ID number- Created 18 Mar 2023, 4:44:57 UTC


Did you upgrade the OS 3 days ago? Because that's the last time the Win10 system contacted the Universe server.
51) Message boards : Number crunching : Universe Disappeared (Message 6043)
Posted 17 Mar 2023 by Grant (SSSF)
Post:
Tried to add Universe, but got another message saying that the project could not be added.
That would be because of the old BOINC version you have.
Too late to edit- or if you have upgraded BOINC to the current version then it's an indication of a configuration issue with your computer/modem/isp (in order of most likely cause) that is blocking access to the Universe servers.
52) Message boards : Number crunching : Universe Disappeared (Message 6042)
Posted 17 Mar 2023 by Grant (SSSF)
Post:
Tried to add Universe, but got another message saying that the project could not be added.
That would be because of the old BOINC version you have.
53) Questions and Answers : Getting started : What is happening? (Message 6039)
Posted 17 Mar 2023 by Grant (SSSF)
Post:
Looking at the Applications page, there doesn't appear to be one for Apple OS, just Windows & LINUX.
54) Message boards : Number crunching : Server Thread (Message 6033)
Posted 15 Mar 2023 by Grant (SSSF)
Post:
No problems here at all since my earlier post outage post.


Exit BOINC, wait a while, then restart it.
Check the Event log to see what messages are there.

No recent changes to your AV software? These days they often also take over the firewall settings.
You have an older version of BOINC there, if there's nothing obvious in the Event log, try upgrading to the current BOINC version.
55) Message boards : Number crunching : Server Thread (Message 6027)
Posted 13 Mar 2023 by Grant (SSSF)
Post:
Thanks for the update.

Things seem to have settled down now; uploads, downloads & Scheduler requests all going through OK without need of manual intervention.
56) Message boards : Number crunching : Server Thread (Message 6025)
Posted 13 Mar 2023 by Grant (SSSF)
Post:
We seem to be back after an extended break.
Forums are sluggish, web site extremely slow to respond. And uploads & downloads are taking lots of Retrys to get through, but they are slowly clearing. Scheduler response is slow, but not too bad compared to everything else.
57) Message boards : Number crunching : Running Universe@Home - progress stuck (Message 6022)
Posted 3 Mar 2023 by Grant (SSSF)
Post:
Likewise- i've never had Universe Task stall.
In the past on other projects if i've had a Task stall for longer than it's expected runtime, i'd exit BOINC, give it 20-30 seconds & then restart. If the Task gets stuck again, then i'd abort it.
Once or twice the Task has completed normally, but most times it's had to be aborted.
58) Message boards : Number crunching : Server Thread (Message 6015)
Posted 10 Feb 2023 by Grant (SSSF)
Post:
Thanks, nice to know what's happening.
59) Message boards : Number crunching : Server Thread (Message 6013)
Posted 10 Feb 2023 by Grant (SSSF)
Post:
But, there is also a good news, that I have delivered another server to CAMK this morning and the machine will replace our old, main server in January.
Wondering if the main server has been upgraded yet or not?
60) Message boards : Number crunching : No new tasks for one of my machines? (Message 6010)
Posted 25 Jan 2023 by Grant (SSSF)
Post:
Glad to hear it was an easy fix.


Previous 20 · Next 20




Copyright © 2024 Copernicus Astronomical Centre of the Polish Academy of Sciences
Project server and website managed by Krzysztof 'krzyszp' Piszczek