Message boards : Number crunching : Number of errors
Message board moderation

To post messages, you must log in.

AuthorMessage
i9

Send message
Joined: 10 Mar 18
Posts: 1
Credit: 9,282,733
RAC: 0
Message 2678 - Posted: 11 Mar 2018, 11:52:46 UTC

Hello,

Is it normal for half or more of tasks to end in error?

State: All (713) · In progress (40) · Validation pending (155) · Validation inconclusive (0) · Valid (253) · Invalid (0) · Error (265)

Thanks,
Luis
ID: 2678 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
entigy

Send message
Joined: 10 Mar 18
Posts: 3
Credit: 16,667
RAC: 0
Message 2680 - Posted: 11 Mar 2018, 12:47:58 UTC

Only just started on this project, but I've had 3 out of 4 units go on error:

11/03/2018 11:01:00 | Universe@Home | Aborting task universe_bhdb_180109_4_3099997_20000_1-999999_100200_5: exceeded disk limit: 745.71MB > 667.57MB
11/03/2018 12:41:03 | Universe@Home | Aborting task universe_bhdb_180109_5_130824870_20000_1-999999_825200_1: exceeded disk limit: 1034.78MB > 858.31MB

Boinc preferences are set to use 10Gb of space, so how can it be running out??
ID: 2680 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
mmonnin

Send message
Joined: 2 Jun 16
Posts: 169
Credit: 317,253,046
RAC: 0
Message 2681 - Posted: 11 Mar 2018, 13:16:50 UTC

See this thread. Lots of issues with this new app:
https://universeathome.pl/universe/forum_thread.php?id=312
ID: 2681 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Weltall

Send message
Joined: 22 Feb 15
Posts: 12
Credit: 16,678,134
RAC: 0
Message 2894 - Posted: 21 May 2018, 7:57:29 UTC

Since i have my new Rig all tasks became errors.
<core_client_version>7.8.3</core_client_version>
<![CDATA[
<message>
Disk usage limit exceeded</message>
<stderr_txt>

</stderr_txt>
]]>
ID: 2894 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Weltall

Send message
Joined: 22 Feb 15
Posts: 12
Credit: 16,678,134
RAC: 0
Message 2895 - Posted: 21 May 2018, 10:34:55 UTC

21.05.2018 12:31:27 | Universe@Home | Temporarily failed upload of universe_bhdb_180327_17_48429952_20000_1-999999_430200_5_r1302013804_0: transient HTTP error
21.05.2018 12:31:27 | Universe@Home | Backing off 00:35:24 on upload of universe_bhdb_180327_17_48429952_20000_1-999999_430200_5_r1302013804_0
21.05.2018 12:31:27 | Universe@Home | Temporarily failed upload of universe_bhdb_180327_17_48429952_20000_1-999999_430200_5_r1302013804_4: transient HTTP error
21.05.2018 12:31:27 | Universe@Home | Backing off 01:01:25 on upload of universe_bhdb_180327_17_48429952_20000_1-999999_430200_5_r1302013804_4
21.05.2018 12:31:29 | | Internet access OK - project servers may be temporarily down.
It is not possible to upload any task.
ID: 2895 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
mmonnin

Send message
Joined: 2 Jun 16
Posts: 169
Credit: 317,253,046
RAC: 0
Message 2896 - Posted: 21 May 2018, 11:41:50 UTC - in response to Message 2894.  

Since i have my new Rig all tasks became errors.
<core_client_version>7.8.3</core_client_version>
<![CDATA[
<message>
Disk usage limit exceeded</message>
<stderr_txt>

</stderr_txt>
]]>


The project keeps sending out these tasks to people each batch since they keep erroring out. Don't run BHDB unless there is a new batch.
ID: 2896 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile marsinph

Send message
Joined: 22 Mar 18
Posts: 29
Credit: 24,402,488
RAC: 0
Message 2907 - Posted: 25 May 2018, 16:40:58 UTC

Hello, the same for me about 50% WU finish in error !
Always the same "disk limit exceed"
All my host runs with SSD 250Gb and all have about 100Gb free.
In BAM, my setting are set to "not use more than 50% of disk space"
It will say 125Gb if I can count ?!
Considering I run 8 WU at same time, it will say 15GB
I come to conclusion that sometimes a WU need it.
But, I monitor the use of my system and never I see a peak.

Who can explain ???
Best regards from Belgium' s first team
ID: 2907 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
mmonnin

Send message
Joined: 2 Jun 16
Posts: 169
Credit: 317,253,046
RAC: 0
Message 2908 - Posted: 25 May 2018, 16:53:08 UTC

There's an artificial disk limit and the tasks exceed it at the end of computation. The admin raised it before and it needs to be done again.

Also all you're going to get now are resends from other people having errors. It basically happens for everyone who runs a given task. So if it aborts for them, it'll abort for you too. The BHDB tasks come in batches. Once the tasks run out then about the only thing left are the bad tasks being resent. Just stop running that app after the batch has been sent out.
ID: 2908 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile marsinph

Send message
Joined: 22 Mar 18
Posts: 29
Credit: 24,402,488
RAC: 0
Message 2910 - Posted: 25 May 2018, 18:45:08 UTC - in response to Message 2908.  

There's an artificial disk limit and the tasks exceed it at the end of computation. The admin raised it before and it needs to be done again.

Also all you're going to get now are resends from other people having errors. It basically happens for everyone who runs a given task. So if it aborts for them, it'll abort for you too. The BHDB tasks come in batches. Once the tasks run out then about the only thing left are the bad tasks being resent. Just stop running that app after the batch has been sent out.
ID: 2910 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Message boards : Number crunching : Number of errors




Copyright © 2024 Copernicus Astronomical Centre of the Polish Academy of Sciences
Project server and website managed by Krzysztof 'krzyszp' Piszczek