Home
Willkommen Gast. Bitte Einloggen oder Registrieren.
September 10, 2010, 12:42:39

Einloggen mit Benutzername, Passwort und Sitzungslänge
Sie sind hier: Startseite >> Forum
Suche:     Erweiterte Suche
2749 Beiträge in 611 Themen von 1225 Mitglieder
Neuestes Mitglied: reinasela
Übersicht Hilfe Suche Kalender Einloggen Registrieren
+  Science@home.de
|-+  Wissenschaftliche Themen
| |-+  Verteiltes Rechnen
| | |-+  Evolution@home (unofficial forum)
| | | |-+  English
| | | | |-+  Other (Moderatoren: Laurence Loewe, Shaktai, Pascal)
| | | | | |-+  Stalling simulations
« vorheriges nächstes »
Seiten: [1] Drucken
Autor Thema: Stalling simulations  (Gelesen 5945 mal)
ColinDB
Newbie
*
Offline Offline

Beiträge: 6


Profil anzeigen E-Mail
Stalling simulations
« am: November 21, 2005, 02:27:10 »

I like the fact that Laurence replies directly to the inquiries that I have sent through. I also like the manual interaction involved in this project as you feel you are actually doing something to contribute instead of just loading up a program and then just giving it run time. I currently have 6 machines commited to E@H.

I struck a problem around June that I still haven't figured out; and I am curious to find out if anyone else has encountered the same. Brief description ---

I have been running the same run file (85.8 MB RAM and 5.79 days) on 3 separate machines.

No.1 machine produced 22 continuous results (AMD 64 3200 - 1Gb ram)
No.2 machine produced 12 continuous results (AMD XP 2400 - 512Mb ram - FSB@216Mhz)
No.3 machine produced 10 continuous results (AMD XP 2400 - 512Mb ram - FSB@200Mhz)

All have run without interruption or stalling for approx 595 hours each – very stable environment. For whatever reason, all 3 machines stopped writing files after the same time period. (eprogress, elastparameters and intermediate results files). Preferences set give acres of headroom for the application to run freely and there is plenty of memory resource available.

595 hours is just shy of 25 days. I have since been resetting all of my machines before they get to the 595hr mark to prevent the stalling. I haven't tried running beyond this point recently but will be running my newest machine until it stops to see how it goes (AMD 64 3700 - 2Gb ram). I will let you know the outcome.

Same run file - 3 different PCs - 3 different stop positions in file - all after the same amount of run time! Any suggestions?

Cheers,

Colin.
Gespeichert
Pascal
Moderator
Jr. Member
*****
Offline Offline

Beiträge: 95


Profil anzeigen
Re: Stalling simulations
« Antworten #1 am: November 21, 2005, 07:22:32 »

Hi Colin,
first of all, welcome aboard!

My suggestion is: take a look at the speed of your PCs and then you know, why there are different numbers of results. The second thing is indeed the behaviour of each simulation. For this issue - please take a second look at the project's website - www.evolutionary-research.net - and enter 'Muller's Ratchet' into your most favorite search engine. On the website you may find any compact information about that.
I also encounter very often, that some simulations only need 40 % of their expected time - others need up to 200 % of that time. So don't worry. There's one statistic on the website which counts the GigaIndividuals - and that's a perfect thing if you want to see your computed time in the stats.
Gespeichert
Laurence Loewe
Moderator
Newbie
*****
Offline Offline

Beiträge: 30


Profil anzeigen
Re: Stalling simulations
« Antworten #2 am: November 21, 2005, 07:33:32 »

Hi Colin,

Thank you for reporting this really strange behaviour. I must admit that I dont have any idea about what could cause this!
As far as I understand it, it should have nothing to do with the total number of GigaIndividuals processed or with predicted CPU-times.

However, there are two things that I would be interested in that might contribute to a solution:
a) Can you exclude a power-failure or someone playing with the console (i.e. all started at the same time and then were hit by an external cause; you can turn off writing these files if you fiddle with the right console parameters)
b) Do you use a 64 bit Windows? (Yes I know, this should not affect the XPs...)

I suggest to try hit the same barreer again to see if it can be reproduced. This is the first time I hear such a report.

Thanks for your patience with this.

Laurence
Gespeichert
ColinDB
Newbie
*
Offline Offline

Beiträge: 6


Profil anzeigen E-Mail
Re: Stalling simulations
« Antworten #3 am: November 22, 2005, 12:34:53 »

Hello again!

I understand PC speeds as I build my own computers to cater for specific applications. As most are not extensively used, they may as well keep busy when they're not engaged for their specific purposes - hence E@H gets the CPU cycles. I thank you Pascal for your input. Glad to see someone reads my postings. I'm clocking up significant numbers already and will continue to do so in the name of science and research.

As for other possible causes - power failure can be ruled out as 2 other PCs running separate applications were fine during the same timeslot. While all were started "simultaneously", there was a 2hr difference between the first and last computer being started. The same 2hr differentials were reproduced on the timestamps of the mentioned files on each machine. (approx 595hrs from launch to stop report)

All machines are networked in a home environment. There is only my wife and I who live here. My wife would rather play sports than pay any attention to my activities on my PCs. Only 2 of the 6 PCs can access the net through a 1.5Mb broadband connection. (My problem occurred before I had broadband and was still on dial-up). OS is a mixture of 2000 Pro, XP home and XP Pro (SP1 + SP2).

Since 3 of my machines rarely run other applications, I will leave them running without resetting to see if I encounter a repeat of my findings. My other PCs are constantly in use, so they cannot be allowed to run under the same scenario. I will report back in around 12 days as I have 13 days up at present on 1.

This is still fun! I love doing research. Sheer curiosity keeps me wanting to find out what's happening. Perhaps I should get out more!!!

The CPUs will stay warm in the mean time!

Cheers,

Colin.
Gespeichert
Laurence Loewe
Moderator
Newbie
*****
Offline Offline

Beiträge: 30


Profil anzeigen
Re: Stalling simulations
« Antworten #4 am: November 22, 2005, 01:53:06 »

Hi Colin,

Looking forward to see what you report,

Cheers,
Laurence
Gespeichert
ColinDB
Newbie
*
Offline Offline

Beiträge: 6


Profil anzeigen E-Mail
Re: Stalling simulations
« Antworten #5 am: Dezember 09, 2005, 01:13:44 »

Hi again!

Time has passed, and I ran 2 of the 3 trial machines without resetting (the third was required for other work for a short time). Both machines stopped reporting eprogress files after 596.5 hours. The first generated a completed run file 8 hours later then stopped, the other 19 hours later.

I looked back through the reporting files to find out which were the last simulations run and created new run files from the "completed" run files taking out those simulations already run (there were plenty of simulations that had not been looked at yet in both)

I re-booted one machine only, relaunched the program on both and both are behaving normally again. The second machine has been running for nearly 31 days (about 730hrs) with the only hiccup at 596.5 hrs.

Will this shed any further light on what's going on? While resetting a machine every 25 days isn't a problem, it just breaks up run continuity and leaves a CPU idle when it could be doing research.

Cheers,

Colin.
Gespeichert
Barrie
Gast


E-Mail
Re: Stalling simulations
« Antworten #6 am: Dezember 09, 2005, 09:54:18 »

It might be interesting to see if stopping and starting the simulator would be enough to circumvent the problem ie is a re-boot of the OS really needed?  If restarting the simulator is enough, it would suggest that the problem is internal to the simulator.  If a re-boot is needed, then that suggests some interaction between the simulator and the OS is involved.  ?
Gespeichert
ColinDB
Newbie
*
Offline Offline

Beiträge: 6


Profil anzeigen E-Mail
Re: Stalling simulations
« Antworten #7 am: Dezember 09, 2005, 10:33:56 »

I intend on experimenting and I am doing just that. 1 machine has been restarted without reboot and will be allowed to run again until the next "incident". It has already been running for 4.5 days already and is behaving normally.

Perhaps at this point I should be asking the obvious question - has anyone else run the simulator non-stop past 600 hours (25 days)?
Gespeichert
Barrie
Gast


E-Mail
Re: Stalling simulations
« Antworten #8 am: Dezember 10, 2005, 07:29:20 »

Looking at the high scores (it's what I live for) there have been simulations longer than 25 days, but single simulations may be a special case which do not hit this problem.
Gespeichert
ColinDB
Newbie
*
Offline Offline

Beiträge: 6


Profil anzeigen E-Mail
Re: Stalling simulations
« Antworten #9 am: Mai 09, 2006, 02:07:28 »

Just reporting in again to say that all 6 of my machines stall around said 595hrs. All I do now is simply halt the application before this time elapses and relaunch the program (without re-boot) to avoid any lost time. I am surprised that no-one else suffers the same problems or can offer any advice on my findings. I no longer consider it an issue though, I just want to know why for simple curiosity!

Cheers, Colin.
Gespeichert
Laurence Loewe
Moderator
Newbie
*****
Offline Offline

Beiträge: 30


Profil anzeigen
Re: Stalling simulations
« Antworten #10 am: Mai 10, 2006, 12:58:57 »

Hi Colin and others!

To revisit the issue, I had a quick glance at the source code and I did not find any obvious reason for this behaviour (at least it was not what I had thought for a moment).

An additional question though: Your reports seem to suggest that the simulations can easily run for longer than the 596.5 hours and will also write their *final* results to the hard disk, even if they stopped writing eprogress, elastparameters and intermediate results files. Is that correct?

A question for other Windows participants: Did you observe the same behavioir as reported by ColinDB on your machine? If Yes, I would be interested to know. I assume that it all would have different symptoms on the Mac, if any.

Currently I cannot spend much time to digg into the details, unfortunately, but I hope that I will be able to fix it, once I can focus again on the technical side of the simulators.

Cheers, Laurence
Gespeichert
Laurence Loewe
Moderator
Newbie
*****
Offline Offline

Beiträge: 30


Profil anzeigen
Re: Stalling simulations
« Antworten #11 am: Mai 10, 2006, 01:06:21 »

An additional question that may help track down the bug:

Does this phenomenon depend in any way on the frequency that eProgress is written?

This can be adjusted, if the simulator is not running, by opening the "S005_Preferences.txt" of the evolution@home-client and changing the value after the line
"WriteProgressFileIntervallSec (Write one "eProgress.txt" file after how many seconds) :"
to 5 (or any other number you want, the default is 120).

If the simulator is running, the menu can be used to change that.

Cheers, Laurence
Gespeichert
ColinDB
Newbie
*
Offline Offline

Beiträge: 6


Profil anzeigen E-Mail
Re: Stalling simulations
« Antworten #12 am: Juni 11, 2006, 06:06:03 »

Hi All,

Slow checking back - sorry.... but feedback is here.

Simulations do run on past 596.5hrs even if progress files stop writing and application will complete it's immediate task. It reports a result, changes the run file to a completed run file then stops. The run file has always had more simulations available to run in each case but sets to completed regardless.

I will adjust the report intervals on 2 of my high end machines. I will change one to"5" and another to "600" and monitor performance.

I assume that if the problem was to occur under these new settings and was attributable to this setting, the machine set to 5 would display it's problems in around 25hrs from start of application. I'll report back in a couple of days.

Keeping on ratcheting!

Cheers,  Colin.
Gespeichert
Laurence Loewe
Moderator
Newbie
*****
Offline Offline

Beiträge: 30


Profil anzeigen
Re: Stalling simulations
« Antworten #13 am: Juni 23, 2006, 03:04:32 »

I'll report back in a couple of days.

Hi Colin,
Thanks for doing these tests. I'm interested in the results. I definitively want to get to the bottom of this (if I can) to avoid such problems with future simulators.
Cheers,
Laurence
Gespeichert
Seiten: [1] Drucken 
« vorheriges nächstes »
Gehe zu:  


Powered by MySQL Powered by PHP Powered by SMF 1.1.11 | SMF © 2006-2009, Simple Machines LLC | Impressum Prüfe XHTML 1.0 Prüfe CSS
Seite erstellt in 0.174 Sekunden mit 22 Zugriffen.