openMalaria test version v6.63

Message boards : Number crunching : openMalaria test version v6.63

Author Message
Thyme Lawn
Send message
Joined: Jun 20 06
Posts: 181
Credit: 1,233,724
RAC: 1,389

My first 2 tasks have successfully completed.

They had the same estimated run time but one took almost 4 times longer to run than the other. That's consistent with the run time variation I see on all other MCDN applications.
____________
"The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer

TylerChris
Send message
Joined: Mar 29 07
Posts: 23
Credit: 513,393
RAC: 2

Same here.
All went well with my bunch.
Shortest 960 secs longest 17.500 secs.

Thyme Lawn
Send message
Joined: Jun 20 06
Posts: 181
Credit: 1,233,724
RAC: 1,389

My Q6600 has now completed 32 tasks with the run time ranging from 1,178 to 17,517 seconds.
____________
"The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer

John C MacAlister
Send message
Joined: Feb 20 11
Posts: 20
Credit: 192,407
RAC: 0

I have processed 21 of these tasks using an AMD Phenom II X4 955 @ 3.8 GHz,4 GB memory, with the following results:

Average Run time: 4016.71 sec
Average CPU Time: 3088.73 sec
Average points/h Run Time 28.20283
____________

John C MacAlister
Send message
Joined: Feb 20 11
Posts: 20
Credit: 192,407
RAC: 0

The Run Time ranged from 955.82 to 12 508.49 sec on my 21 tasks.
____________

Thyme Lawn
Send message
Joined: Jun 20 06
Posts: 181
Credit: 1,233,724
RAC: 1,389

My Q6600 has now completed 88 of the wu_1210_* tasks with no change in the run time range, but overnight it ran 21 wu35_* tasks with much shorter run times of between 150 and 194 seconds.
____________
"The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer

Thyme Lawn
Send message
Joined: Jun 20 06
Posts: 181
Credit: 1,233,724
RAC: 1,389

All 5 tasks from WU 63091874 (wu_1223_317_483_0_1332376758) ran for a decent time (possibly close to completion) before failing with exit status 72 and the following stderr output:

Error: effectiveEIR is not finite: 1.#QNAN

Call stack, starting from ..\..\model\Host\InfectionIncidenceModel.cpp:194:
sorry, no trace from this platform!
OpenMalaria: Domain error


WU 63122045 (wu_1223_318_549_0_1332412325) had 2 failures on Darwin systems before completing successfully on my Q6600. The failures ran for a decent time (again possibly close to completion), failing with exit status 74 and the following stderr output:

Error: initialKappa is invalid
Call stack, starting from /Users/africa/workspace/openmalaria_r885/trunk/model/Transmission/NonVectorModel.cpp:128:
sorry, no trace from this platform!

____________
"The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer

Thyme Lawn
Send message
Joined: Jun 20 06
Posts: 181
Credit: 1,233,724
RAC: 1,389

WU 63128226 (wu_1217_507_562_0_1332418809) has failed with exit status 74 and the following stderr output on 4 Windows systems:

Error: initialKappa is invalid
Call stack, starting from /Users/africa/workspace/openmalaria_r885/trunk/model/Transmission/NonVectorModel.cpp:128:
sorry, no trace from this platform!

____________
"The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer

Snagletooth
Send message
Joined: Dec 24 09
Posts: 10
Credit: 81,468
RAC: 83

My mac was the fourth (after three Windows computers made the attempt) to fail with Exit status 74 (0x4a): wu_1213_505_472_0_1332369669

same stdrr out for all:

process exited with code 74 (0x4a, -182)


Error: initialKappa is invalid
Call stack, starting from /Users/africa/workspace/openmalaria_r885/trunk/model/Transmission/NonVectorModel.cpp:128:
sorry, no trace from this platform!


Best,
Snags

Thyme Lawn
Send message
Joined: Jun 20 06
Posts: 181
Credit: 1,233,724
RAC: 1,389

My Q6600 XP system is now up to 300 completed tasks with 7 failures; the exit status 72 and 74 tasks previously mentioned plus 5 more exit status 74 "Error: initialKappa is invalid" tasks. The workunits for the additional failures are:


  • 63200030 (wu_1223_515_739_0_1332497051)
  • 63200964 (wu_1222_35_741_0_1332498184)
  • 63201004 (wu_1222_512_741_0_1332498186)
  • 63216510 (wu_1222_524_785_0_1332515408)
  • 63233373 (wu_1223_327_832_0_1332533045)


____________
"The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer

Thyme Lawn
Send message
Joined: Jun 20 06
Posts: 181
Credit: 1,233,724
RAC: 1,389

I've just aborted 7 tasks (of the 43 in my most recent work download) because there's been at least one failure with exit status 74 "Error: initialKappa is invalid" on another Widows+Intel system. They were from workunits:


  • 63127012 (wu_1223_514_558_0_1332417488)
  • 63194899 (wu_1213_507_725_0_1332491527)
  • 63195204 (wu_1215_507_726_0_1332491711)
  • 63209859 (wu_1222_522_766_0_1332507967)
  • 63264196 (wu_1213_507_933_0_1332565327)
  • 63270814 (wu_1223_173_953_0_1332573245)
  • 63271071 (wu_1222_327_954_0_1332573666)


____________
"The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer

Thyme Lawn
Send message
Joined: Jun 20 06
Posts: 181
Credit: 1,233,724
RAC: 1,389

wu_1212_31_1492_0_1332775684_1 completed successfully but stderr.txt includes the following:

Warning: human life-span (6) shorter than length of warm-up requested by
transmission model (55). Transmission may be unstable; perhaps use forced
transmission (mode="forced") or a longer life-span.

____________
"The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer

Thyme Lawn
Send message
Joined: Jun 20 06
Posts: 181
Credit: 1,233,724
RAC: 1,389

WU 63122045 (wu_1223_318_549_0_1332412325) had 2 failures on Darwin systems before completing successfully on my Q6600.

And here's one the other way round. WU 63807822 (wu_1223_523_2446_0_1333141807) had 3 exit status 74 failures on Windows systems (plus the _3 task I aborted) before completing successfully on a Darwin system.
____________
"The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer

Snagletooth
Send message
Joined: Dec 24 09
Posts: 10
Credit: 81,468
RAC: 83

wu_1211_505_5345_0_1334192046

4 down so far all with exit code 74

From my Mac:

Error: initialKappa is invalid
Call stack, starting from /Users/africa/workspace/openmalaria_r885/trunk/model/Transmission/NonVectorModel.cpp:128:
sorry, no trace from this platform!

From Windows 7:

Error: initialKappa is invalid
Call stack, starting from ..\..\model\Transmission\NonVectorModel.cpp:128:
sorry, no trace from this platform!

From linux:

Error: initialKappa is invalid
Call stack, starting from /home/tino/devel/read_only_om/model/Transmission/NonVectorModel.cpp:128:
+0x2da OM::Transmission::NonVectorModel::initIterate()
+0x554 OM::Simulation::start()
+0x17b main()
+0xed __libc_start_main()
()

Another Windows 7:

Error: initialKappa is invalid
Call stack, starting from ..\..\model\Transmission\NonVectorModel.cpp:128:
sorry, no trace from this platform!
OpenMalaria: No such file or directory

tigerfeet
Send message
Joined: Aug 9 08
Posts: 2
Credit: 82,938
RAC: 78

3 tasks down with error code 74 (0x4a) on a Windows Vista 32-bit machine

120077888
120078238 and
120079670

7.0.25

- exit code 74 (0x4a)


Error: initialKappa is invalid
Call stack, starting from ..\..\model\Transmission\NonVectorModel.cpp:128:
sorry, no trace from this platform!
04:43:35 (4344): called boinc_finish


]]>

tigerfeet
Send message
Joined: Aug 9 08
Posts: 2
Credit: 82,938
RAC: 78

2 tasks more with the same error code on the same machine:

120249440 and
120077888 (second time)

see above this....

swiftmallard
Avatar
Send message
Joined: Jul 24 09
Posts: 651
Credit: 1,130,259
RAC: 0

I have had two of these:
Stderr output

7.0.25

- exit code 74 (0x4a)


Error: initialKappa is invalid
Call stack, starting from ..\..\model\Transmission\NonVectorModel.cpp:128:
sorry, no trace from this platform!
OpenMalaria: Domain error
01:14:51 (3920): called boinc_finish


]]>
otherwise they ran smoothly.

Thyme Lawn
Send message
Joined: Jun 20 06
Posts: 181
Credit: 1,233,724
RAC: 1,389

I started receiving tasks from a new batch yesterday, with the first 2 completing successfully:

wuindicesRun3_2151_1337595750_0
wuindicesRun3_35_1337595750_0
____________
"The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer

Zapp
Send message
Joined: Mar 27 11
Posts: 2
Credit: 46,458
RAC: 0

WU 68582080 failed with an invalid initial Kappa.

Thyme Lawn
Send message
Joined: Jun 20 06
Posts: 181
Credit: 1,233,724
RAC: 1,389

The wu_3220_* batch of work which started appearing on my systems just over 2 hours ago has a particularly poor completion record.

So far I've had 2 successful tasks, 10 exit status 74 (Error: initialKappa is invalid) failures and I've aborted 28 because the WU had had at least one exit status 74 failure.

The successful completions (wu_3220_31_10801_0_1337894116_0 and wu_3220_31_10845_0_1337939168_0) both included the following in their stderr_txt:

Warning: human life-span (6) shorter than length of warm-up requested by
transmission model (55). Transmission may be unstable; perhaps use forced
transmission (mode="forced") or a longer life-span.

____________
"The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer

Profile Ananas
Send message
Joined: Mar 7 06
Posts: 58
Credit: 752,054
RAC: 408

Invalid Kappa seems not to be a version problem, I have it here :

wu_3152_506_177610_0_1337953391_0 which is a v6.57 result

edit: As results tend to disappear very quick here ... the complete message says :

Exception: initialKappa is invalid
OpenMalaria: Result too large
21:51:48 (1512): called boinc_finish

This looks a bit like it always said "initialKappa is invalid" if any error occurs - afterall it has been at the very end of the calculation, somewhat late to check the initialKappa.

next one, different operating system, same error.

michaelT
Volunteer moderator
Project administrator
Project developer
Project scientist
Send message
Joined: Jul 20 10
Posts: 47
Credit: 16,359
RAC: 0

Thanks for the info...
We're trying to find the source of the problem. It could be that one of the parameters which is automatically generated using a genetic algorithm is out of the boundaries, so the human infectivity is too low and some workunits are crashing.
The automatic generation of new workunits have been disabled until the problem is solved.

We will let you now as soon as the problem is solved. :)


____________
Michael Tarantino
Swiss Tropical and Public Health Institute
http://www.swisstph.ch

Zapp
Send message
Joined: Mar 27 11
Posts: 2
Credit: 46,458
RAC: 0

This looks a bit like it always said "initialKappa is invalid" if any error occurs - afterall it has been at the very end of the calculation, somewhat late to check the initialKappa.

On Linux these jobs fail within seconds with an

Assertion `totalDensity == totalDensity' failed.

so it looks like there is something wrong that can be checked early on.

Post to thread

Message boards : Number crunching : openMalaria test version v6.63


Return to malariacontrol.net main page


Copyright © 2013 africa@home