Inconclusive/invalid

Message boards : Number crunching : Inconclusive/invalid

Author Message
TylerChris
Send message
Joined: Mar 29 07
Posts: 23
Credit: 513,393
RAC: 2

Seeing a large rise in this issue on 'B'WUs only.
On my machine it only occurs on the longer work units.
eg.

explain

Status

Run time
(sec)

CPU time
(sec)

Credit

Application



113069493

442248

8 Feb 2012 4:53:10 UTC

8 Feb 2012 16:44:18 UTC

Completed, marked as invalid

26,483.56

26,443.82

0.00

openMalaria: A simulator of malaria epidemology and control (Branch B) v6.57



113075412

152827

8 Feb 2012 5:06:19 UTC

8 Feb 2012 21:39:50 UTC

Completed, marked as invalid

24,596.31

22,597.09

0.00

openMalaria: A simulator of malaria epidemology and control (Branch B) v6.57



113144426

406355

8 Feb 2012 21:52:19 UTC

12 Feb 2012 12:31:08 UTC

Completed and validated

20,686.63

20,440.56

148.91

openMalaria: A simulator of malaria epidemology and control (Branch B) v6.57



113496957

439960

12 Feb 2012 9:16:16 UTC

13 Feb 2012 0:52:09 UTC

Completed, marked as invalid

15,746.01

15,746.01

0.00

openMalaria: A simulator of malaria epidemology and control (Branch B) v6.57



113564291

203599

13 Feb 2012 1:08:37 UTC

13 Feb 2012 12:44:18 UTC

Completed and validated

19,291.73

17,691.00

148.91

openMalaria: A simulator of malaria epidemology and control (Branch B) v6.57
.


Task is Here
Cannot find anything wrong in the logs.
Thanks
Chris

ukjohnd
Send message
Joined: Jan 14 07
Posts: 2
Credit: 642,105
RAC: 0

Same here.

This one invalid
https://malariacontrol.net/workunit.php?wuid=60472250

This one valid
https://malariacontrol.net/workunit.php?wuid=60746137

Plus quite a lot of others

P . P . L .
Avatar
Send message
Joined: Aug 27 08
Posts: 56
Credit: 500,976
RAC: 0

Hi.

Add me to this group of errors, they are only on these long tasks run on (Branch B) v6.57 app

i don't have any trouble with v6.58 or other projects that have longer tasks!

Some are taking up to 12hrs to finish, what a waste.


https://malariacontrol.net/workunit.php?wuid=61821326

Name wu_1203_417_152316_0_1330661466_1
Workunit 61821326
Created 2 Mar 2012 5:48:35 UTC
Sent 2 Mar 2012 5:58:24 UTC
Received 3 Mar 2012 6:50:31 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x0)
Computer ID 176059
Report deadline 5 Mar 2012 17:18:24 UTC
Run time 48,907.79
CPU time 48,304.58
Validate state Invalid
Credit 0.00
Application version openMalaria: A simulator of malaria epidemology and control (Branch B) v6.57

=======================================================================

https://malariacontrol.net/workunit.php?wuid=61670804


Name wu_1204_416_151196_0_1330474869_1
Workunit 61670804
Created 29 Feb 2012 1:39:46 UTC
Sent 29 Feb 2012 2:03:54 UTC
Received 29 Feb 2012 23:15:26 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x0)
Computer ID 176059
Report deadline 3 Mar 2012 13:23:54 UTC
Run time 41,119.67
CPU time 40,605.60
Validate state Invalid
Credit 0.00
Application version openMalaria: A simulator of malaria epidemology and control (Branch B) v6.57

==========================================================

I have/had others like this one marked as inconclusive, i don't what your problem is with these.

https://malariacontrol.net/workunit.php?wuid=61737301

Completed, validation inconclusive 47,069.73 46,397.23 pending openMalaria: A simulator of malaria epidemology and control (Branch B) v6.57

This one will more than likely go the same way, i've now switched off all (Branch B) v6.57 on my rigs over this.

https://malariacontrol.net/workunit.php?wuid=61795996

Completed, waiting for validation 21,968.61 21,068.62 pending openMalaria: A simulator of malaria epidemology and control (Branch B) v6.57
____________

Profile mikey
Avatar
Send message
Joined: Mar 23 07
Posts: 4382
Credit: 5,361,193
RAC: 1,084

Are you guys crunching multiple projects meaning Malaria has to pause while another project runs and then start back up again? The reason I ask is I run Malaria on a pc that ONLY runs Malaria and I am having NO problems at all. Knock on wood!!

Profile Ananas
Send message
Joined: Mar 7 06
Posts: 58
Credit: 752,054
RAC: 408

Unsuspended result (openMalariaB v6.57, it ran uninterrupted), invalid, I guess it's a Linux vs. Windows issue :
wu_1190_506_177621_0_1337957189
Linux :


Warning: will use heterogeneity workaround.
sim end
T/A: 1406908/1406908 <======================
20:30:10 (27468): called boinc_finish



Windows :

Warning: will use heterogeneity workaround.
sim end
T/A: 1403145/1403145 <======================
22:15:50 (1792): called boinc_finish




The box does not tend to have invalid/inconclusive results, usually it has only trouble with the checkpoints (BOINC heartbeat bug) when several Malaria results checkpoint simuntanously

swiftmallard
Avatar
Send message
Joined: Jul 24 09
Posts: 651
Credit: 1,130,259
RAC: 0

Here is one that is marked as successful but no credit is granted:
https://malariacontrol.net/result.php?resultid=125933327
Neither I nor my wingman received credit.

Profile Ananas
Send message
Joined: Mar 7 06
Posts: 58
Credit: 752,054
RAC: 408

That's the default reward for crunching long running workunits :-(

swiftmallard
Avatar
Send message
Joined: Jul 24 09
Posts: 651
Credit: 1,130,259
RAC: 0

I completed several others almost as long and received credit.
https://malariacontrol.net/result.php?resultid=126025075
https://malariacontrol.net/result.php?resultid=126025076
https://malariacontrol.net/result.php?resultid=126025074

Profile Ananas
Send message
Joined: Mar 7 06
Posts: 58
Credit: 752,054
RAC: 408

probably just below the limit. Check the other threads lately, others have the problem too ("Unusual result", "Errors Overnight" and "Long Run Times")

swiftmallard
Avatar
Send message
Joined: Jul 24 09
Posts: 651
Credit: 1,130,259
RAC: 0

What limit?

Profile Ananas
Send message
Joined: Mar 7 06
Posts: 58
Credit: 752,054
RAC: 408

Claimed credits or used-up CPU operations (those values are directly related). Those are a multiplication of runtime, benchmark results and a factor.

Above that limit (I guess has a fixed ratio to some average value), BOINC doesn't grant any credits, below it does acording to the project rules.

michaelT
Volunteer moderator
Project administrator
Project developer
Project scientist
Send message
Joined: Jul 20 10
Posts: 47
Credit: 16,359
RAC: 0

Regarding the non credit issue the problem :

It seems that this is due to some issues with the validator : there is a MAX_GRANTED_CREDIT parameter which should in theory grant MAX_GRANTED_CREDIT (it avoids cheating with high credit request) if WU_CREDIT > MAX_GRANTED_CREDIT but in our case it granted 0 credit ... :(

Some of the 0 granted workunits have already been purged but we manage to get all the hosts and the average credits for all those ones. So for the one who didn't get credit before it's fixed now.

We increased the MAX_GRANTED_CREDIT like that this should not be a problem anymore. But let me know if it happen again.

Post to thread

Message boards : Number crunching : Inconclusive/invalid


Return to malariacontrol.net main page


Copyright © 2013 africa@home