| Author | Message |
|
|
|
I received an error with the mappredictor application on one of my boxes.
27/05/2007 02:34:06|malariacontrol.net beta|Can't copy projects/www.malariacontrol.net/mapwca0000013.txt_0_0 to slots/0/phi_8km_pslines.txt
27/05/2007 02:34:06|malariacontrol.net beta|Unrecoverable error for result mapwca0000013.txt_0 (Couldn't start or resume: 2)
27/05/2007 02:34:06||request_reschedule_cpus: start failed
27/05/2007 02:34:06|malariacontrol.net beta|Computation for result mapwca0000013.txt_0 finished
The link to the wu is here: http://www.malariacontrol.net/workunit.php?wuid=2535675
Mine was the only one of the three to error out.
Can't say what may have caused it, though.
[edit]
And another one, different box.
28/05/2007 04:03:29|malariacontrol.net beta|Can't copy projects/www.malariacontrol.net/mapwca0000016.txt_1_0 to slots/0/phi_8km_pslines.txt
28/05/2007 04:03:29|malariacontrol.net beta|Unrecoverable error for result mapwca0000016.txt_1 (Couldn't start or resume: 2)
28/05/2007 04:03:29|malariacontrol.net beta|Deferring scheduler requests for 1 minutes and 0 seconds
28/05/2007 04:03:29||Rescheduling CPU: start failed
28/05/2007 04:03:29|malariacontrol.net beta|Computation for task mapwca0000016.txt_1 finished
WU = http://www.malariacontrol.net/workunit.php?wuid=2535678
And another, also different box.
27/05/2007 13:31:25|malariacontrol.net beta|Can't copy projects/www.malariacontrol.net/mapwca0081234.txt_1_0 to slots/0/phi_8km_pslines.txt
27/05/2007 13:31:25|malariacontrol.net beta|Unrecoverable error for result mapwca0081234.txt_1 (Couldn't start or resume: 2)
27/05/2007 13:31:25|malariacontrol.net beta|Deferring scheduler requests for 1 minutes and 0 seconds
27/05/2007 13:31:25||Rescheduling CPU: start failed
27/05/2007 13:31:25|malariacontrol.net beta|Computation for task mapwca0081234.txt_1 finished
WU = http://www.malariacontrol.net/workunit.php?wuid=2535690
And another two, on yet another box.
This one is interesting because it also has an unexpected state 7 error message.
27/05/2007 03:25:44|malariacontrol.net beta|Can't copy projects/www.malariacontrol.net/mapwca0000008.txt_0_0 to slots/0/phi_8km_pslines.txt
27/05/2007 03:25:44|malariacontrol.net beta|Unrecoverable error for result mapwca0000008.txt_0 (Couldn't start or resume: 2)
27/05/2007 03:25:44|malariacontrol.net beta|Deferring scheduler requests for 1 minutes and 0 seconds
27/05/2007 03:25:44||Rescheduling CPU: start failed
27/05/2007 03:25:44|malariacontrol.net beta|Computation for task mapwca0000008.txt_0 finished
27/05/2007 03:25:45|malariacontrol.net beta|Can't copy projects/www.malariacontrol.net/mapwca0000009.txt_0_0 to slots/0/phi_8km_pslines.txt
27/05/2007 03:25:45|malariacontrol.net beta|Unrecoverable error for result mapwca0000009.txt_0 (Couldn't start or resume: 2)
27/05/2007 03:25:45|malariacontrol.net beta|Deferring scheduler requests for 1 minutes and 0 seconds
27/05/2007 03:25:45||Rescheduling CPU: start failed
27/05/2007 03:25:45|malariacontrol.net beta|Unexpected state 7 for task mapwca0000009.txt_0
27/05/2007 03:25:46|malariacontrol.net beta|Computation for task mapwca0000009.txt_0 finished
WU = http://www.malariacontrol.net/workunit.php?wuid=2526703
WU = http://www.malariacontrol.net/workunit.php?wuid=2526704
[/edit] |
|
|
|
|
|
I got the same error:
28/05/2007 21:26:47|malariacontrol.net beta|Sending scheduler request to http://www.malariacontrol.net/malariacontrol_cgi/cgi
28/05/2007 21:26:47|malariacontrol.net beta|Reason: To fetch work
28/05/2007 21:26:47|malariacontrol.net beta|Requesting 25920 seconds of new work, and reporting 1 completed tasks
28/05/2007 21:26:51|malariacontrol.net beta|Scheduler request succeeded
28/05/2007 21:26:53|malariacontrol.net beta|Started download of file mappredictor_5.17_windows_intelx86
28/05/2007 21:26:53|malariacontrol.net beta|Started download of file centers_aez_517.txt
28/05/2007 21:26:54|malariacontrol.net beta|Finished download of file centers_aez_517.txt
28/05/2007 21:26:54|malariacontrol.net beta|Throughput 537 bytes/sec
28/05/2007 21:26:54|malariacontrol.net beta|Started download of file jobmap517.xml
28/05/2007 21:26:56|malariacontrol.net beta|Finished download of file mappredictor_5.17_windows_intelx86
28/05/2007 21:26:56|malariacontrol.net beta|Throughput 334620 bytes/sec
28/05/2007 21:26:56|malariacontrol.net beta|Finished download of file jobmap517.xml
28/05/2007 21:26:56|malariacontrol.net beta|Throughput 1255 bytes/sec
28/05/2007 21:26:56|malariacontrol.net beta|Started download of file locations_265_517.txt
28/05/2007 21:26:56|malariacontrol.net beta|Started download of file predictor_5.17_windows_intelx86
28/05/2007 21:26:57|malariacontrol.net beta|Finished download of file locations_265_517.txt
28/05/2007 21:26:57|malariacontrol.net beta|Throughput 44517 bytes/sec
28/05/2007 21:26:57|malariacontrol.net beta|Started download of file winbugs_output.out.txt
28/05/2007 21:27:00|malariacontrol.net beta|Finished download of file predictor_5.17_windows_intelx86
28/05/2007 21:27:00|malariacontrol.net beta|Throughput 332179 bytes/sec
28/05/2007 21:27:00|malariacontrol.net beta|Started download of file wca0046849.txt
28/05/2007 21:27:01|malariacontrol.net beta|Finished download of file wca0046849.txt
28/05/2007 21:27:01|malariacontrol.net beta|Throughput 1060 bytes/sec
28/05/2007 21:27:01|malariacontrol.net beta|Started download of file wca0055134.txt
28/05/2007 21:27:02|malariacontrol.net beta|Finished download of file wca0055134.txt
28/05/2007 21:27:02|malariacontrol.net beta|Throughput 1473 bytes/sec
28/05/2007 21:27:02|malariacontrol.net beta|Started download of file wca0096849.txt
28/05/2007 21:27:03|malariacontrol.net beta|Finished download of file wca0096849.txt
28/05/2007 21:27:03|malariacontrol.net beta|Throughput 1048 bytes/sec
28/05/2007 21:27:44|malariacontrol.net beta|Finished download of file winbugs_output.out.txt
28/05/2007 21:27:44|malariacontrol.net beta|Throughput 448180 bytes/sec
28/05/2007 21:27:46||Rescheduling CPU: files downloaded
28/05/2007 21:27:46||Rescheduling CPU: files downloaded
28/05/2007 21:27:46||Rescheduling CPU: files downloaded
28/05/2007 21:27:46||Using earliest-deadline-first scheduling because computer is overcommitted.
28/05/2007 21:27:46|XtremLab|Pausing task BRM395363_0 (left in memory)
28/05/2007 21:27:47|malariacontrol.net beta|Can't copy projects/www.malariacontrol.net/mapwca0046849.txt_1_0 to slots/4/phi_8km_pslines.txt
28/05/2007 21:27:47|malariacontrol.net beta|Unrecoverable error for result mapwca0046849.txt_1 (Couldn't start or resume: 2)
28/05/2007 21:27:47|malariacontrol.net beta|Deferring scheduler requests for 1 minutes and 0 seconds
28/05/2007 21:27:47||Rescheduling CPU: start failed
28/05/2007 21:27:47|malariacontrol.net beta|Computation for task mapwca0046849.txt_1 finished
That happened with all the mappredictor WUs I got.
A few also gave:
29/05/2007 01:16:40|malariacontrol.net beta|Unexpected state 7 for task mapwca0142889.txt_1
but most simply errored out.
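For anyone who wants to count these failures in a saved message log rather than by eye, here is a minimal sketch. It assumes the log lines use the `timestamp|project|message` layout seen in the excerpts above; the function names and the sample list are made up for illustration and are not part of BOINC.

```python
import re
from collections import Counter

# Matches the "Unrecoverable error" messages quoted in this thread.
# Group 1 is the result name, group 2 is everything inside the outer parentheses.
ERROR_RE = re.compile(r"Unrecoverable error for result (\S+) \((.*)\)\s*$")


def parse_line(line):
    """Split one log line into (timestamp, project, message), or return None."""
    parts = line.rstrip("\n").split("|", 2)
    if len(parts) != 3:
        return None
    return tuple(parts)


def tally_errors(lines):
    """Count how often each error reason appears in the given log lines."""
    reasons = Counter()
    for line in lines:
        parsed = parse_line(line)
        if parsed is None:
            continue
        match = ERROR_RE.search(parsed[2])
        if match:
            reasons[match.group(2).strip()] += 1
    return reasons


# Sample lines copied from posts in this thread.
SAMPLE_LINES = [
    "27/05/2007 02:34:06|malariacontrol.net beta|Unrecoverable error for result mapwca0000013.txt_0 (Couldn't start or resume: 2)",
    "27/05/2007 02:34:06||request_reschedule_cpus: start failed",
    "28/05/2007 04:03:29|malariacontrol.net beta|Unrecoverable error for result mapwca0000016.txt_1 (Couldn't start or resume: 2)",
    "6/6/2007 11:19:19|malariacontrol.net beta|Reason: Unrecoverable error for result mapwca0184684.txt_6 (An unexpected network error occurred. (0x3b) - exit code 59 (0x3b))",
]

if __name__ == "__main__":
    for reason, count in tally_errors(SAMPLE_LINES).most_common():
        print(count, reason)
```

Grouping by the reason text makes it easier to tell the "Couldn't start or resume: 2" failures apart from the exit-code failures reported later in the thread.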
my result page
____________
 |
|
|
|
|
|
This is under Windows Millennium.
The wu crashed immediately.
29/05/2007 20:04:23 Can't copy projects/www.malariacontrol.net/mapwca0052889.txt_2_0 to slots/4/phi_8km_pslines.txt
29/05/2007 20:04:23 Unrecoverable error for result mapwca0052889.txt_2 (Couldn't start or resume: 2)
29/05/2007 20:04:23 Deferring scheduler requests for 1 minutes and 0 seconds
29/05/2007 20:04:23 Rescheduling CPU: start failed
29/05/2007 20:04:23 Computation for task mapwca0052889.txt_2 finished
---------------------------------------------------------
Result ID : 8275497
Work unit ID : 2575863
Result ID 8275497
Name mapwca0052889.txt_2
Workunit 2575863
Created 29 May 2007 9:24:38 UTC
Sent 29 May 2007 9:26:07 UTC
Received 29 May 2007 18:12:30 UTC
Server state Over
Outcome Client error
Client state Compute error
Exit status -185 (0xffffff47)
Computer ID 7988
Report deadline 1 Jun 2007 20:46:07 UTC
CPU time 0
stderr out
<core_client_version>5.4.9</core_client_version>
<message>
Couldn't start or resume: 2
</message>
Validate state Invalid
Claimed credit 0
Granted credit 0
application version 5.17
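The exit status above is shown both as a signed number and as hex. As a small sketch of how the two forms relate, assuming the status is stored as a 32-bit two's-complement value (the helper names are made up for illustration):

```python
def to_unsigned32_hex(value: int) -> str:
    """Render a signed exit status as its 32-bit two's-complement hex form."""
    return f"0x{value & 0xFFFFFFFF:08x}"


def to_signed32(value: int) -> int:
    """Interpret a 32-bit unsigned value as a signed two's-complement integer."""
    value &= 0xFFFFFFFF
    return value - 0x100000000 if value & 0x80000000 else value


if __name__ == "__main__":
    print(to_unsigned32_hex(-185))   # 0xffffff47
    print(to_signed32(0xFFFFFF47))   # -185
```

The same conversion covers the positive codes that appear later in the thread: 1282 is 0x502 and 59 is 0x3b.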
____________
Do you want to get banned for 31 years and your account & credits deleted at a Boinc project ? Predictor@home is your best choice. |
|
|
maire (Volunteer moderator, Project administrator, Project developer, Project scientist); Joined: Nov 7 05; Posts: 439; Credit: 118,258; RAC: 0
|
Can't copy projects/www.malariacontrol.net/mapwca0052889.txt_2_0 to slots/4/phi_8km_pslines.txt
Unrecoverable error for result mapwca0052889.txt_2 (Couldn't start or resume: 2)
This error is by far the most common problem with the revised mappredictor, accounting for nearly all errors. It seems like the BOINC wrapper tries to copy one of the output files before the science application has created it, probably before the science app has even started. We've never seen that here, so any insights that would help others prevent it would be much appreciated. Could this be some permission problem? Leftovers in the slot directory?
Thanks
Nick
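For volunteers who want to check the two guesses above (permissions, leftovers in the slot directory) before a task starts, a minimal sketch follows. It assumes it is run from the BOINC data directory so that a relative path like `slots/0` from the log messages resolves; `inspect_slot` is a made-up helper, not part of BOINC or the wrapper.

```python
import os
from pathlib import Path


def inspect_slot(slot_dir):
    """Report whether a slot directory exists, what it contains, and whether
    the current user may write to it."""
    path = Path(slot_dir)
    if not path.is_dir():
        return {"exists": False, "leftovers": [], "writable": False}
    return {
        "exists": True,
        # Any file listed here before a task starts is a leftover.
        "leftovers": sorted(entry.name for entry in path.iterdir()),
        "writable": os.access(path, os.W_OK),
    }


if __name__ == "__main__":
    # "slots/0" is the slot named in several of the logs in this thread.
    print(inspect_slot("slots/0"))
```

An empty `leftovers` list together with `writable` being true would argue against both guesses, at least for that slot at that moment.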
____________
Nicolas Maire
Swiss Tropical and Public Health Institute
http://www.swisstph.ch |
|
|
|
|
Can't copy projects/www.malariacontrol.net/mapwca0052889.txt_2_0 to slots/4/phi_8km_pslines.txt
Unrecoverable error for result mapwca0052889.txt_2 (Couldn't start or resume: 2)
This error is by far the most common problem with the revised mappredictor, accounting for nearly all errors. It seems like the BOINC wrapper tries to copy one of the output files before the science application has created it, probably before the science app has even started. We've never seen that here, so any insights that would help others prevent it would be much appreciated. Could this be some permission problem? Leftovers in the slot directory?
Thanks
Nick
I only caught the last of my mappredictor WUs (all the others had already errored out), so I couldn't really play around with it.
The slot directory was empty; I checked that before I allowed the WU to run.
Permissions haven't been changed in the last couple of weeks and BOINC always runs under the same user account, so there shouldn't be a problem with that.
As far as I remember, the science app wasn't started at all, because my firewall didn't ask for permission to run the app (which it normally does with every new app version).
[edit]
I looked through a couple of failed WUs and found that all successful results came from BOINC versions higher than 5.4 (most of them are 5.8 and higher, but at least one 5.6 also worked).
So it might be a problem related to the BOINC manager and the way it handles wrapper apps.
[/edit] |
|
|
|
|
|
Ehm, I'm also running an A64X2 (under XP) but using CC 5.6.4, and on this machine the 5.17 was running fine.
Maybe you can run a query on the DB?
|
|
|
|
|
Looking at my results pages, it seems that this problem is occurring again.
But all my computers work fine. Could it be a BOINC Manager problem, like Stefan said? Except for one, the failing hosts are all using old clients like 4.45 or 5.3.x.
I'm using the current (5.10.x) one and it has worked fine so far. Knock on wood.
____________
|
|
|
|
|
|
This may be a duplicate, but I now have two other WUs with an error and a popup window.
I will only report errors from running mappredictor.
2007-06-05 20:42:22 [malariacontrol.net beta] Starting mapwca0075684.txt_1
2007-06-05 20:42:22 [malariacontrol.net beta] Starting task mapwca0075684.txt_1 using mappredictor version 517
2007-06-05 20:42:28 [malariacontrol.net beta] Deferring communication for 1 min 0 sec
2007-06-05 20:42:28 [malariacontrol.net beta] Reason: Unrecoverable error for result mapwca0075684.txt_1 ( - exit code 1282 (0x502))
2007-06-05 20:42:35 [malariacontrol.net beta] [error] Can't rename output file mapwca0075684.txt_1_0
2007-06-05 20:42:41 [malariacontrol.net beta] [error] Can't rename output file mapwca0075684.txt_1_1
2007-06-05 20:42:41 [malariacontrol.net beta] Computation for task mapwca0075684.txt_1 finished
2007-06-05 20:42:41 [malariacontrol.net beta] Output file mapwca0075684.txt_1_0 for task mapwca0075684.txt_1 absent
2007-06-05 20:42:41 [malariacontrol.net beta] Output file mapwca0075684.txt_1_1 for task mapwca0075684.txt_1 absent
2007-06-05 20:43:21 [malariacontrol.net beta] Sending scheduler request: Requested by user
<core_client_version>5.10.0</core_client_version>
<![CDATA[
<message>
- exit code 1282 (0x502)
</message>
<stderr_txt>
o1
c1
app error: 0x502
</stderr_txt>
]]>
Maybe it is the WU itself that is buggy, because the error also occurs on other computers.
http://www.malariacontrol.net/workunit.php?wuid=2657733
http://www.malariacontrol.net/workunit.php?wuid=2663544 (so far only my error, but I bet the count will increase)
____________
|
|
|
|
|
|
I also had a mappredictor wu error out, result id 8539616
2007-06-06 10:12:15|malariacontrol.net beta|Starting mapwca0183684.txt_2
2007-06-06 10:12:28|malariacontrol.net beta|Starting task mapwca0183684.txt_2 using mappredictor version 517
2007-06-06 10:12:30|malariacontrol.net beta|app reporting negative CPU: -458403381849.693480
2007-06-06 10:12:34|malariacontrol.net beta|app reporting negative CPU: -458403381849.693480
2007-06-06 10:12:34|malariacontrol.net beta|Reason: Unrecoverable error for result mapwca0183684.txt_2 (An unexpected network error occurred. (0x3b) - exit code 59 (0x3b))
2007-06-06 10:12:39|malariacontrol.net beta|[error] Can't rename output file mapwca0183684.txt_2_0
2007-06-06 10:12:45|malariacontrol.net beta|[error] Can't rename output file mapwca0183684.txt_2_1
2007-06-06 10:12:45|malariacontrol.net beta|Computation for task mapwca0183684.txt_2 finished
2007-06-06 10:12:45|malariacontrol.net beta|Output file mapwca0183684.txt_2_0 for task mapwca0183684.txt_2 absent
2007-06-06 10:12:45|malariacontrol.net beta|Output file mapwca0183684.txt_2_1 for task mapwca0183684.txt_2 absent
Using Windows 98 SE with BOINC 5.8.16 |
|
|
|
|
I also had a mappredictor wu error out, result id 8539616
2007-06-06 10:12:15|malariacontrol.net beta|Starting mapwca0183684.txt_2
2007-06-06 10:12:28|malariacontrol.net beta|Starting task mapwca0183684.txt_2 using mappredictor version 517
2007-06-06 10:12:30|malariacontrol.net beta|app reporting negative CPU: -458403381849.693480
2007-06-06 10:12:34|malariacontrol.net beta|app reporting negative CPU: -458403381849.693480
2007-06-06 10:12:34|malariacontrol.net beta|Reason: Unrecoverable error for result mapwca0183684.txt_2 (An unexpected network error occurred. (0x3b) - exit code 59 (0x3b))
2007-06-06 10:12:39|malariacontrol.net beta|[error] Can't rename output file mapwca0183684.txt_2_0
2007-06-06 10:12:45|malariacontrol.net beta|[error] Can't rename output file mapwca0183684.txt_2_1
2007-06-06 10:12:45|malariacontrol.net beta|Computation for task mapwca0183684.txt_2 finished
2007-06-06 10:12:45|malariacontrol.net beta|Output file mapwca0183684.txt_2_0 for task mapwca0183684.txt_2 absent
2007-06-06 10:12:45|malariacontrol.net beta|Output file mapwca0183684.txt_2_1 for task mapwca0183684.txt_2 absent
Using Windows 98 SE with BOINC 5.8.16
Are you running an antivirus with real-time scanning of files? Or some type of indexing service?
____________
|
|
|
|
|
|
If an antivirus program were responsible for these errors, then they should appear more often on WUs running on that computer. So I can't believe that is the cause.
Just have a look at the results pages of these WUs (or my WUs mentioned before). You can see that every host failed running the WU. Some because they are too old, but most of them with the same error.
____________
|
|
|
|
|
|
I am getting the same error on 2 machines:
6/6/2007 11:19:16|malariacontrol.net beta|Starting mapwca0184684.txt_6
6/6/2007 11:19:17|malariacontrol.net beta|Starting task mapwca0184684.txt_6 using mappredictor version 517
6/6/2007 11:19:18|malariacontrol.net beta|[file_xfer] Started upload of file mapwca0010484.txt_0_0
6/6/2007 11:19:18|malariacontrol.net beta|[file_xfer] Started upload of file mapwca0010484.txt_0_1
6/6/2007 11:19:19|malariacontrol.net beta|Deferring communication for 1 min 0 sec
6/6/2007 11:19:19|malariacontrol.net beta|Reason: Unrecoverable error for result mapwca0184684.txt_6 (An unexpected network error occurred. (0x3b) - exit code 59 (0x3b))
6/6/2007 11:19:25|malariacontrol.net beta|[error] Can't rename output file mapwca0184684.txt_6_0
6/6/2007 11:19:30|malariacontrol.net beta|[error] Can't rename output file mapwca0184684.txt_6_1
6/6/2007 11:19:30|malariacontrol.net beta|Computation for task mapwca0184684.txt_6 finished
6/6/2007 11:19:30|malariacontrol.net beta|Output file mapwca0184684.txt_6_0 for task mapwca0184684.txt_6 absent
6/6/2007 11:19:30|malariacontrol.net beta|Output file mapwca0184684.txt_6_1 for task mapwca0184684.txt_6 absent
I am using Boinc 5.8.16 on both machines. No anti-virus or other file scanning proggies running.
EDIT: Added work unit link for convenience: http://www.malariacontrol.net/workunit.php?wuid=2662783
____________

BOINC.BE: For Belgians who love the smell of glowing red cpu's in the morning
Tutta55's Lair |
|
|
|
|
|
Yep, and this confirms my suspicion.
application Prediction of Malaria Prevalence
created 5 Jun 2007 17:14:01 UTC
name mapwca0184684.txt
minimum quorum 2
initial replication 2
max # of error/total/success results 7, 20, 10
errors Too many error results
Maybe maire should have a look at these failing workunits. I would say there must be something different about them compared with the others.
____________
|
|
|
|
|
|
wuid=2673496
wuid=2673502
wuid=2664492
6/6/2007 6:16:48 AM|malariacontrol.net beta|Can't copy projects/www.malariacontrol.net/mapwca0085184.txt_3_0 to slots/0/phi_8km_pslines.txt
6/6/2007 6:16:48 AM|malariacontrol.net beta|Unrecoverable error for result mapwca0085184.txt_3 (Couldn't start or resume: 2)
6/6/2007 6:16:48 AM||Rescheduling CPU: start failed
6/6/2007 6:16:48 AM|malariacontrol.net beta|Unexpected state 7 for task mapwca0085184.txt_3
6/6/2007 6:16:49 AM|malariacontrol.net beta|Computation for task mapwca0085184.txt_3 finished
6/6/2007 9:26:36 AM|malariacontrol.net beta|Can't copy projects/www.malariacontrol.net/mapwca0052284.txt_0_0 to slots/0/phi_8km_pslines.txt
6/6/2007 9:26:36 AM|malariacontrol.net beta|Unrecoverable error for result mapwca0052284.txt_0 (Couldn't start or resume: 2)
6/6/2007 9:26:36 AM||Rescheduling CPU: start failed
6/6/2007 9:26:36 AM|malariacontrol.net beta|Computation for task mapwca0052284.txt_0 finished
6/6/2007 9:26:36 AM|malariacontrol.net beta|Can't copy projects/www.malariacontrol.net/mapwca0051684.txt_2_0 to slots/0/phi_8km_pslines.txt
6/6/2007 9:26:36 AM|malariacontrol.net beta|Unrecoverable error for result mapwca0051684.txt_2 (Couldn't start or resume: 2)
6/6/2007 9:26:36 AM||Rescheduling CPU: start failed
6/6/2007 9:26:36 AM|malariacontrol.net beta|Unexpected state 7 for task mapwca0051684.txt_2
6/6/2007 9:26:37 AM|malariacontrol.net beta|Computation for task mapwca0051684.txt_2 finished
|
|
|
|
|
|
Hi Pengu.....it appears you and the others are using BOINC client 5.4.11 on your machine for these failed WUs. It appears that you must have the 5.8 series BOINC client to run these 'map' WUs. The latest stable version is 5.8.16, available for download from the BOINC website. Once installed, your troubles should be over....:)..Cheers, Rog.
____________
|
|
|
|
|
Hi Pengu.....it appears you and the others are using BOINC client 5.4.11 on your machine for these failed WUs. It appears that you must have the 5.8 series BOINC client to run these 'map' WUs. The latest stable version is 5.8.16, available for download from the BOINC website. Once installed, your troubles should be over....:)..Cheers, Rog.
I run 5.8.16 on all my machines, and so far I have still gotten the error on 2 of them.
|
|
|
|
Hi Pengu.....it appears you and the others are using BOINC client 5.4.11 on your machine for these failed WUs. It appears that you must have the 5.8 series BOINC client to run these 'map' WUs. The latest stable version is 5.8.16, available for download from the BOINC website. Once installed, your troubles should be over....:)..Cheers, Rog.
I'm running CC 5.6.4 and it's fine, so it doesn't appear "that you must have the 5.8 series BOINC client to run these 'map' WUs", but at present a CC 5.5.x or higher will probably help (according to this link).
Anyway, the application has other problems, and I'm afraid that no CC will help with those.
|
|
|
|
Hi Pengu.....it appears you and the others are using BOINC client 5.4.11 on your machine for these failed WUs. It appears that you must have the 5.8 series BOINC client to run these 'map' WUs. The latest stable version is 5.8.16, available for download from the BOINC website. Once installed, your troubles should be over....:)..Cheers, Rog.
I'm running CC 5.6.4 and it's fine, so it doesn't appear "that you must have the 5.8 series BOINC client to run these 'map' WUs", but at present a CC 5.5.x or higher will probably help (according to this link).
Anyway, the application has other problems, and I'm afraid that no CC will help with those.
I stand corrected!....Cheers, Rog.
____________
|
|
|
|
|
Yep, and this confirms my suspicion.
application Prediction of Malaria Prevalence
created 5 Jun 2007 17:14:01 UTC
name mapwca0184684.txt
minimum quorum 2
initial replication 2
max # of error/total/success results 7, 20, 10
errors Too many error results
Maybe maire should have a look at these failing workunits. I would say there must be something different about them compared with the others.
Some of the older mapping workunits have an incorrectly formatted data string which causes the application to crash shortly after starting. This problem is fixed, so you shouldn't get any more of them.
____________
Alain Studer
Swiss Tropical Institute |
|
|
|
|
|
Fixing the badly formatted data string in the WUs has undoubtedly helped. Still, WUs issued after that fix are still crashing frequently (though not always) on systems running older versions of BOINC. Upgrading to BOINC 5.8.16 is highly recommended. BOINC 5.10.2 (beta) is working well here so far. WUs issued after the data string fix announced by Alain that have crashed on pre 5.8.xx systems are completing error free on my BOINC 5.10.2 machine.
____________
-- |
|
|
|
|
Fixing the badly formatted data string in the WUs has undoubtedly helped. Still, WUs issued after that fix are still crashing frequently (though not always) on systems running older versions of BOINC. Upgrading to BOINC 5.8.16 is highly recommended. BOINC 5.10.2 (beta) is working well here so far. WUs issued after the data string fix announced by Alain that have crashed on pre 5.8.xx systems are completing error free on my BOINC 5.10.2 machine.
As I (already) said, I have no problem with CC 5.6.4, and I would be curious to see how a CC 5.5.x would work...
|
|
|
|
I also had a mappredictor wu error out, result id 8539616
2007-06-06 10:12:15|malariacontrol.net beta|Starting mapwca0183684.txt_2
2007-06-06 10:12:28|malariacontrol.net beta|Starting task mapwca0183684.txt_2 using mappredictor version 517
2007-06-06 10:12:30|malariacontrol.net beta|app reporting negative CPU: -458403381849.693480
2007-06-06 10:12:34|malariacontrol.net beta|app reporting negative CPU: -458403381849.693480
2007-06-06 10:12:34|malariacontrol.net beta|Reason: Unrecoverable error for result mapwca0183684.txt_2 (An unexpected network error occurred. (0x3b) - exit code 59 (0x3b))
2007-06-06 10:12:39|malariacontrol.net beta|[error] Can't rename output file mapwca0183684.txt_2_0
2007-06-06 10:12:45|malariacontrol.net beta|[error] Can't rename output file mapwca0183684.txt_2_1
2007-06-06 10:12:45|malariacontrol.net beta|Computation for task mapwca0183684.txt_2 finished
2007-06-06 10:12:45|malariacontrol.net beta|Output file mapwca0183684.txt_2_0 for task mapwca0183684.txt_2 absent
2007-06-06 10:12:45|malariacontrol.net beta|Output file mapwca0183684.txt_2_1 for task mapwca0183684.txt_2 absent
Using Windows 98 SE with BOINC 5.8.16
Remember, this problem of negative CPU time is not new for hosts running Win9x.
|
|
|
|
Fixing the badly formatted data string in the WUs has undoubtedly helped. Still, WUs issued after that fix are still crashing frequently (though not always) on systems running older versions of BOINC. Upgrading to BOINC 5.8.16 is highly recommended. BOINC 5.10.2 (beta) is working well here so far. WUs issued after the data string fix announced by Alain that have crashed on pre 5.8.xx systems are completing error free on my BOINC 5.10.2 machine.
As I (already) said, I have no problem with CC 5.6.4, and I would be curious to see how a CC 5.5.x would work...
My cursory investigation of failed WUs returned by quorum partners indicates 5.6.4 and 5.5.x do work sometimes but fail far more frequently than 5.8.x. I imagine you would observe the same if you looked through result reports from your quorum partners. I didn't (already) say that because I thought you'd figure it out. Damn, I was wrong again.
____________
-- |
|
|
|
|
|
For example, Fardringle is running 5.5.0 and getting tons of crashed mappredictor units.
____________
-- |
|
|
|
|
|
Hi Everyone
Hope this helps the admins
Recent errors with Mappredictor 5.17 appear to be associated with the client
I run malaria with the truxoft client tx36; these WUs appear to fail.
I have noticed the following clients appear to fail
Boinc 5.4.x
However
clients on 5.8.x appear ok.
Also somebody else in this thread has observed other working clients.
Ian |
|
|
|
|
|
Well, I've been running 5.10.x for some time now and I've had quite a few failures with it, but it's a near-stable build now (almost ready for public release), so yeah...
____________
 |
|
|
|
|
|
I've had three WUs error out on client 5.8.16 as well.
____________
Freedom of Speech is a cherished right in every democracy and democratic institution. |
|
|
|
|
I've had three WUs error out on client 5.8.16 as well.
Three errors out of how many WUs?
I've had 1 error out of 180 WUs. Running BOINC 5.10.2.
____________
-- |
|
|
|
|
|
Looking at the BOINCview log of 3 hosts running mappredictor, going back to 6/6/2007 when the WUs started flowing again, I have 7 failures out of 303 (2.31%).
A couple were on a host running CC 5.9.5 (now upgraded to 5.10.2), and the other failures were on two hosts running CC 5.10.2. |
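The percentage quoted above is easy to reproduce, and the same arithmetic lets the two failure counts reported in this thread be compared directly. A tiny sketch (the function name is made up for illustration):

```python
def failure_rate_percent(failures: int, total: int) -> float:
    """Return failures as a percentage of total work units."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * failures / total


if __name__ == "__main__":
    # 7 failures out of 303 WUs, as reported in the post above.
    print(f"{failure_rate_percent(7, 303):.2f}%")   # 2.31%
    # 1 error out of 180 WUs, as reported earlier in the thread.
    print(f"{failure_rate_percent(1, 180):.2f}%")   # 0.56%
```

With counts this small, the difference between the two rates may well be noise rather than a real difference between hosts.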
|
|
|
|
|
I have heard that, to stop errors from halting crunching until you acknowledge them, you can do the following:
On Windows XP SP2:
Control Panel -> System -> Advanced -> Error reporting button (bottom - right) -> disable.
More detailed instructions:
http://support.microsoft.com/kb/310414
Live long and BOINC.
____________
Paul
(S@H1 8888)
 |
|
|
|
|
|
No failures on my A64 X2 4400+, running CC 5.6.4.
On my Barton 3200+, I tried CC 5.5.16. Presently I'm using CC 5.6.4.
With both of these CCs, the only problem is that the CPU time is negative (I'm running Millennium), but if I don't close the DOS box, the WUs are valid.
|
|
|
|
Hi Everyone
Hope this helps the admins
Recent errors with Mappredictor 5.17 appear to be associated with the client
I run malaria with the truxoft client tx36; these WUs appear to fail.
I have noticed the following clients appear to fail
Boinc 5.4.x
However
clients on 5.8.x appear ok.
Also somebody else in this thread has observed other working clients.
Ian
See wrapper, please:
* This requires version 5.5 or higher of the BOINC core client.
The application Prediction of Malaria Prevalence uses the BOINC wrapper.
According to the Lattice project, you need at least CC 5.5.1 for any application using the wrapper.
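To check a given client against that requirement, version strings should be compared numerically rather than as text, since "5.10.2" sorts before "5.8.16" alphabetically. A minimal sketch, assuming dotted numeric version strings like the ones reported in this thread and taking 5.5.1 as the minimum; the names below are made up for illustration:

```python
# Minimum core client version, taken from the Lattice project note above.
MIN_WRAPPER_CLIENT = (5, 5, 1)


def parse_version(text):
    """Turn a dotted version string such as '5.8.16' into a tuple of ints."""
    return tuple(int(part) for part in text.strip().split("."))


def meets_minimum(version_text, minimum=MIN_WRAPPER_CLIENT):
    """Return True if the client version is at least the minimum."""
    return parse_version(version_text) >= minimum


if __name__ == "__main__":
    # Client versions mentioned by posters in this thread.
    for version in ["5.4.9", "5.4.11", "5.5.0", "5.6.4", "5.8.16", "5.10.2"]:
        print(version, meets_minimum(version))
```

This only checks the stated minimum; as other posts show, some WUs also failed on clients well above it.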
|
|
|
|
Hi Everyone
Hope this helps the admins
Recent errors with Mappredictor 5.17 appear to be associated with the client
I run malaria with the truxoft client tx36; these WUs appear to fail.
I have noticed the following clients appear to fail
Boinc 5.4.x
However
clients on 5.8.x appear ok.
Also somebody else in this thread has observed other working clients.
Ian
See wrapper, please:
* This requires version 5.5 or higher of the BOINC core client.
The application Prediction of Malaria Prevalence uses the BOINC wrapper.
According to the Lattice project, you need at least CC 5.5.1 for any application using the wrapper.
We have updated the core client minimum version number from 5.41 to 5.51.
____________
Alain Studer
Swiss Tropical Institute |
|
|
|
|
|
Why don't ALL projects email their userbase to update to the latest Core Client as each new one is released as stable?
A lot of people install BOINC as "set and forget" and thus rarely update.
If this were done, we might not have so many people using out-of-date clients, which would save a lot of hassle.
____________
 |
|
|
|
|
|
It would be better if Berkeley gave BOINC auto-update capability something like Windows has.
____________
-- |
|
|
|
|
It would be better if Berkeley gave BOINC auto-update capability something like Windows has.
There was severe resistance to this idea in the early days of BOINC from participants concerned about security. However it appears that the tide has turned. I have seen this topic come up several times recently with almost no resistance.
____________
BOINC WIKI

BOINCing since 2002/12/8 |
|
|
|
|
|
I have no objections to auto update; it could be download and let the user install. Should anyone hack my computer and steal my secrets on eternal youth and infinite wealth I will soon know who they are. They can't hide forever!
____________
 |
|
|
|
|
It would be better if Berkeley gave BOINC auto-update capability something like Windows has.
There was severe resistance to this idea in the early days of BOINC from participants concerned about security. However it appears that the tide has turned. I have seen this topic come up several times recently with almost no resistance.
I personally have no problems with auto-updating, but I do know that a lot of people over at Seti use Boinc in a business environment and would not like this. The resistance in the beginning was from those people because they could not afford to have an untested program run and then crash their PC. There is no way Boinc can be tested in every single configuration that people have, so I can see the concerns. I think an easy way to avoid this would be to have a check box allowing auto-updates, or just downloading them and flashing the icon in some way to indicate an update is available. Just flashing the icon now would be a good intermediate solution. At least for those that actually look at the screen. I do not personally look at most of my PCs more than once or twice a week.
____________
 |
|
|
|
|
|
If the update transaction was done over the HTTPS protocol rather than HTTP and the executables arrived with a certificate of authenticity, then I would feel very secure. Still, as mikey said, some crunchers are ultra paranoid, perhaps for good reason. They can be accommodated too, we have the technology :)
There should be 4 options for auto-update:
- no auto-update
- inform me when updates are available
- download updates when available but do not install
- download and install updates automatically
EDIT ADDED: There is already a thread about auto-updates on the BOINC forums at Berkeley. Looks like it's going to happen :)
____________
--
|
|
|
|
|
|
Hi!
Today I got:
2007-06-14 12:06:48|malariacontrol.net beta|[task_debug] Process for mapwca0079147.txt_1 exited
2007-06-14 12:06:48|malariacontrol.net beta|[task_debug] task_state=EXITED for mapwca0079147.txt_1 from handle_exited_app
2007-06-14 12:06:48|malariacontrol.net beta|Deferring communication for 1 min 0 sec
2007-06-14 12:06:48|malariacontrol.net beta|Reason: Unrecoverable error for result mapwca0079147.txt_1 ( - exit code 1282 (0x502))
2007-06-14 12:06:48|malariacontrol.net beta|[task_debug] result state=COMPUTE_ERROR for mapwca0079147.txt_1 from CS::report_result_error
2007-06-14 12:06:48|malariacontrol.net beta|[task_debug] Process for mapwca0079147.txt_1 exited
2007-06-14 12:06:48|malariacontrol.net beta|[task_debug] exit code 1282 (0x502):
2007-06-14 12:06:53|malariacontrol.net beta|[error] Can't rename output file mapwca0079147.txt_1_0
2007-06-14 12:06:58|malariacontrol.net beta|[error] Can't rename output file mapwca0079147.txt_1_1
2007-06-14 12:06:58|malariacontrol.net beta|Computation for task mapwca0079147.txt_1 finished
2007-06-14 12:06:58|malariacontrol.net beta|Output file mapwca0079147.txt_1_0 for task mapwca0079147.txt_1 absent
2007-06-14 12:06:58|malariacontrol.net beta|Output file mapwca0079147.txt_1_1 for task mapwca0079147.txt_1 absent
2007-06-14 12:06:58|malariacontrol.net beta|[task_debug] result state=COMPUTE_ERROR for mapwca0079147.txt_1 from CS::app_finished
* Prediction of Malaria Prevalence 5.17
* Windows XP
* AMD Athlon(tm) XP 2500+ [x86 Family 6 Model 10 Stepping 0]
* Memory 511.48 MB
<core_client_version>5.9.3</core_client_version>
<![CDATA[
<message>
- exit code 1282 (0x502)
</message>
<stderr_txt>
o1
c1
bp
pwa1
pwa5
pwa6
pwa7
pwa8
app error: 0x502
</stderr_txt>
]]>
I hope this helps to fix the application.
EDIT:
Well, the next WU also finished with the same error.
I don't know why, but now that I'm not listening to music with RealPlayer, the third WU is working correctly. I wonder if it will finish successfully. Strange, isn't it?
____________
|
|
|