1) Message boards : Number crunching : Automatic abort work unit (Message 6122)
Posted 10 Apr 2023 by kcharuso
Post:
Hi, I'm running 26 Universe, 2 Milkyway and 2 Einstein tasks. In total that's 30 work units, so I still have 1 core (2 threads) available for whatever else the system needs. As for memory, a little under 10 GB of the 32 GB available is in use. CPU temperature is stable at 76 °C and the hottest GPU is the ATI one at 81 °C. Everything is within the recommendations as far as I can tell from the tools I have.

Also, running 2 Milkyway tasks at once on the 5700 XT gives more points than running just 1, even after the points are halved. Strange but good. However, I did not get similar results with Einstein on the 1080s, where running 1 task per GPU is always better than running 2.

Does PCIe speed matter in BOINC? According to the motherboard specs, I am now running only x4 on all 3 GPUs. Is there any benefit if the GPUs operate at x8 or x16?

As of now, there are still 3-4 errors a day from each project. I'm OK with that, but if I can get rid of them, it will be super.
2) Message boards : Number crunching : Automatic abort work unit (Message 6118)
Posted 8 Apr 2023 by kcharuso
Post:
Good day Mr. Myers,

I am wondering about the completion time of work units, not just from Universe but from other projects as well. Let me describe my setup, which may point to a possible issue. I'm running a B550 board with a 3950X, 32 GB of RAM, a 1 TB M.2 drive and a pair of 1080s. In BOINC I'm running Universe for the CPU (30 tasks) and Einstein for the GPUs (2 tasks). The machine has been at full load 24/7 for months at a time with zero issues.

A month ago I was given a 5700 XT for free, so I installed it in my last available PCIe slot and added Milkyway just for this new GPU; it crunches nothing else. The machine operated flawlessly, as if nothing had changed... for a week or so. I could do my daily work at the same time with no lag or heat, and plenty of RAM was still available. I measured 780 W at the socket, while the PSU can supply over 1000 W. Everything looked well within the capability of the hardware and software.

However, I noticed that random BOINC tasks in every project started to take longer to complete, and errors or invalid tasks began to appear. Lastly, there is the issue with Universe tasks I described. It seems that the newly added 5700 XT is the issue, but I couldn't figure out why. I monitored as many parameters as the machine can provide, but nothing indicates stress or over-utilization. In fact, less than 50% of resources and capabilities were put to use. CPU @ 100% (70 °C), all GPUs @ 98-100% (82 °C).

I really want to fix this issue while also putting the free 5700 XT to use. I don't know if this is possible or whether there is any setting I can configure. Most importantly, I want to know the cause of the gradual slowdown. Have you come across a similar issue, or is there anything I can look into?

thank you
3) Message boards : Number crunching : Automatic abort work unit (Message 6117)
Posted 8 Apr 2023 by kcharuso
Post:
Thank you very much. I am enlightened. All suggestions are now being implemented. I'll keep y'all updated. Thanks again.
4) Message boards : Number crunching : Automatic abort work unit (Message 6114)
Posted 7 Apr 2023 by kcharuso
Post:
Hi people,
I was wondering if there is a way to automatically abort a work unit. Often I notice work units that are in their 9th hour of crunching while showing 99% completion. I usually assume these work units have hit some error and will not be credited, so I abort the task to free up computing resources for other work units.

Is there an option I can set in cc_config.xml or app_config.xml to have these work units aborted automatically after a certain time, say 5 hours of crunching, if the task has not yet completed?

From my rough observations, one or two work units end up like this every day. I never had this problem before, so I don't know what's up. Please assist.

thanks
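
To make the idea concrete, here is a rough sketch of the kind of thing I have in mind, done with a script rather than a config option. It assumes `boinccmd --get_tasks` prints one block per task with fields named `name`, `project URL`, `active_task_state` and `elapsed task time`; those field names are my assumption and may differ between client versions, so check the real output first. The function names and the 5-hour cutoff are just placeholders for illustration.

```python
import re
import subprocess

MAX_ELAPSED_S = 5 * 3600  # the 5-hour cutoff mentioned above (placeholder)


def parse_tasks(text):
    """Split `boinccmd --get_tasks` style output into one dict per task.

    Assumes each task starts with a line like "1) -----------" followed
    by "key: value" lines (an assumption about the output layout).
    """
    tasks = []
    current = None
    for line in text.splitlines():
        line = line.strip()
        if re.match(r"^\d+\) -+$", line):
            current = {}
            tasks.append(current)
        elif current is not None and ":" in line:
            # Split on the first colon only, so URLs stay intact.
            key, _, value = line.partition(":")
            current[key.strip()] = value.strip()
    return tasks


def stuck_tasks(tasks, max_elapsed=MAX_ELAPSED_S):
    """Return (project_url, task_name) for running tasks past the cutoff."""
    picked = []
    for t in tasks:
        try:
            url = t["project URL"]
            name = t["name"]
            elapsed = float(t["elapsed task time"])
        except (KeyError, ValueError):
            continue  # field missing or unparsable: leave the task alone
        if t.get("active_task_state") != "EXECUTING":
            continue
        if elapsed > max_elapsed:
            picked.append((url, name))
    return picked


def abort(url, name, dry_run=True):
    """Build (and optionally run) the boinccmd abort command for one task."""
    cmd = ["boinccmd", "--task", url, name, "abort"]
    if not dry_run:
        subprocess.run(cmd, check=True)
    return cmd


def run_once(dry_run=True):
    """One pass: list tasks, pick the overdue ones, abort (or just print)."""
    out = subprocess.run(["boinccmd", "--get_tasks"],
                         capture_output=True, text=True, check=True).stdout
    for url, name in stuck_tasks(parse_tasks(out)):
        print(" ".join(abort(url, name, dry_run=dry_run)))
```

Calling `run_once()` from a scheduled job would only print the commands it would run; passing `dry_run=False` would actually abort. A fixed cutoff would also kill legitimately long tasks from other projects, so the limit probably needs to be set per project.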

Copyright © 2024 Copernicus Astronomical Centre of the Polish Academy of Sciences
Project server and website managed by Krzysztof 'krzyszp' Piszczek