Presentation is loading. Please wait.

Presentation is loading. Please wait.

PARALLEL GENETIC ALGORITHMS AND THE SCIENCE OF ASTEROSEISMOLOGY A Review of the Doctoral Dissertation Research of Dr. Travis Metcalfe.

Similar presentations


Presentation on theme: "PARALLEL GENETIC ALGORITHMS AND THE SCIENCE OF ASTEROSEISMOLOGY A Review of the Doctoral Dissertation Research of Dr. Travis Metcalfe."— Presentation transcript:

1 PARALLEL GENETIC ALGORITHMS AND THE SCIENCE OF ASTEROSEISMOLOGY A Review of the Doctoral Dissertation Research of Dr. Travis Metcalfe

2 Outline Introduction Introduction The Science of Asteroseismology The Science of Asteroseismology The Genetic Algorithm The Genetic Algorithm Parallel Computing Parallel Computing Conclusion Conclusion

3 Introduction Astronomers observe the universe and gather information about it. They then fit this information into mathematical models. The process of “fitting” involves adjusting the many parameters of the model. When they have a good fit, they use the parameter settings to tell them something about the object or phenomenon they are studying. The author uses a parallel genetic algorithm to solve this problem of optimization.

4 The Goal of the Research To Further the Understanding of the Composition and Characteristics of White Dwarves More Generally, Since White Dwarves are the Endpoint for all but the most massive stars, this research can lead to a better understanding of stellar evolution

5 * Source

6 Traditional Technique Make an initial “guess” for parameter values Make an initial “guess” for parameter values Use some iterative technique to improve upon the initial guesses. Use some iterative technique to improve upon the initial guesses.

7 Adjustable Input Parameters Mass Mass Temperature Temperature H and He layer masses H and He layer masses Convective Efficiency Convective Efficiency Core composition Core composition

8 Problem with this technique Results often depend on the initial guess Results often depend on the initial guess The initial guess is inherently subjective, often the result of intuition or past experience The initial guess is inherently subjective, often the result of intuition or past experience

9 The Genetic Algorithm A genetic algorithm provides a more systematic approach to optimizing the results A genetic algorithm provides a more systematic approach to optimizing the results The genetic algorithm used was PIKAIA The genetic algorithm used was PIKAIA PIKAIA is a general purpose “function optimization” genetic algorithm PIKAIA is a general purpose “function optimization” genetic algorithm Public domain software Public domain software Fortran-77 Fortran-77

10 Outline Introduction Introduction The Science of Asteroseismology The Science of Asteroseismology The Genetic Algorithm The Genetic Algorithm Parallel Computing Parallel Computing Conclusion Conclusion

11 White dwarves which show a regular variation in light intensity are known as pulsating white dwarves White dwarves which show a regular variation in light intensity are known as pulsating white dwarves Using photometric techniques, this variation in intensity can be very accurately measured with such instruments as the Whole Earth Telescope (WET) Using photometric techniques, this variation in intensity can be very accurately measured with such instruments as the Whole Earth Telescope (WET)

12 The pulsation is the result of seismic activity within the white dwarf The pulsation is the result of seismic activity within the white dwarf Just as seismological information can be used to study the internal nature of the earth, seismological data, as expressed in varying stellar luminosity, can be used to determine the characteristics of these pulsating white dwarves. Just as seismological information can be used to study the internal nature of the earth, seismological data, as expressed in varying stellar luminosity, can be used to determine the characteristics of these pulsating white dwarves.

13 Observed Light Curve for the White Dwarf GD 358.

14 Outline Introduction Introduction The Science of Asteroseismology The Science of Asteroseismology The Genetic Algorithm The Genetic Algorithm Parallel Computing Parallel Computing Conclusion Conclusion

15 Initial Conditions Population size: 1000 ( in later work this was reduced to 128). Population size: 1000 ( in later work this was reduced to 128). No rationale was given for how the initial population value was chosen, or why it was changed. No rationale was given for how the initial population value was chosen, or why it was changed. For each member of the initial population, parameter values are randomly set For each member of the initial population, parameter values are randomly set

16 Duration Until the difference between the average fitness and the best fitness in the population were less than 1%. Until the difference between the average fitness and the best fitness in the population were less than 1%. In later work, he used a constant 200 generations. In later work, he used a constant 200 generations.

17 Fitness Measurement The model is then run using these initial values The model is then run using these initial values Fitness is based on the root-mean-square differences between the observed and calculated pulsation periods Fitness is based on the root-mean-square differences between the observed and calculated pulsation periods

18 Fitness Measurement The fitness value is converted to a survival probability by normalizing with respect to the most fit member The fitness value is converted to a survival probability by normalizing with respect to the most fit member The next generation is chosen randomly. This random selection is weighted, based on each member’s survivability ratio The next generation is chosen randomly. This random selection is weighted, based on each member’s survivability ratio

19 Crossover Numerical encoding Numerical encoding Each of the initial parameter values are concatenated into one long string Each of the initial parameter values are concatenated into one long string A single point crossover technique is used. The position along the string is picked randomly A single point crossover technique is used. The position along the string is picked randomly

20 Mutation Mutation is achieved by randomly selecting a number in the string and changing it to a new, randomly chosen value Mutation is achieved by randomly selecting a number in the string and changing it to a new, randomly chosen value

21 Illustration Consider two members, each with two parameters. Consider two members, each with two parameters. M 1 has X=2.573 and Y= 4.457. M 1 has X=2.573 and Y= 4.457. M 2 has parameter values X=3.547 and Y=2.332. M 2 has parameter values X=3.547 and Y=2.332. After encoding, M 1 =25734457 and M 2 =35472332 After encoding, M 1 =25734457 and M 2 =35472332

22 Illustration The crossover point is randomly chosen, and the string segments swapped The crossover point is randomly chosen, and the string segments swapped M 1 25734|457  25734332 M 2 35472|332  35472457

23 Illustration Mutating M 1 involves picking a random spot along the string, and changing that value: Mutating M 1 involves picking a random spot along the string, and changing that value: M 1 257|3|4332  25784332

24 Illustration* The strings would then be parsed back into parameter values. For M 1, this would be: The strings would then be parsed back into parameter values. For M 1, this would be: M 1 X= 2.578Y=4.332 * Modified from [1]

25 Crossover and Mutation Rate The cross over rate: 65% The cross over rate: 65% The mutation rate: 0.3%. The mutation rate: 0.3%. In later work, the author increased the crossover rate to 85% and varied the mutation rate from 0.1% to 16.6%, depending on the variation between the mean fitness value, and the best fitness value In later work, the author increased the crossover rate to 85% and varied the mutation rate from 0.1% to 16.6%, depending on the variation between the mean fitness value, and the best fitness value

26 Elitism The most fit solution was passed unaltered the next generation The most fit solution was passed unaltered the next generation

27 Rationale The idea behind the relatively low crossover and mutation rate is to prevent removing promising solutions from each generation too rapidly The idea behind the relatively low crossover and mutation rate is to prevent removing promising solutions from each generation too rapidly

28 Repetition The paper states: “Repeating this procedure many times with different random number seeds helps to ensure that the minimum found is truly global” The paper states: “Repeating this procedure many times with different random number seeds helps to ensure that the minimum found is truly global” It does not elaborate on how many Many times is, though It does not elaborate on how many Many times is, though

29 Repetition In a later paper, he uses 5 repetitions In a later paper, he uses 5 repetitions This result was obtained in the following way… This result was obtained in the following way…

30 Values were put in for the model, and pulsation periods generated. Values were put in for the model, and pulsation periods generated. The genetic algorithm attempted to find the original parameters based on the output of the model The genetic algorithm attempted to find the original parameters based on the output of the model This was done 20 times, and the results were as follows… This was done 20 times, and the results were as follows…

31 Results (second paper) First Order Solution… First Order Solution…

32

33 The genetic algorithm found the exact result 9/20 times, and was close enough on four other occasions for the correct result to be determined by the addition of some other iterative technique, for a total of 65% accuracy. The genetic algorithm found the exact result 9/20 times, and was close enough on four other occasions for the correct result to be determined by the addition of some other iterative technique, for a total of 65% accuracy.

34 If the GA was rerun, and the best result selected, the accuracy increased to 88% If the GA was rerun, and the best result selected, the accuracy increased to 88% After 5 runs, the accuracy was over 99% After 5 runs, the accuracy was over 99% Because no correct answer was found after 200 iterations, the number of generations was reduced to 200 Because no correct answer was found after 200 iterations, the number of generations was reduced to 200

35 Output Curve

36 Outline Introduction Introduction The Science of Asteroseismology The Science of Asteroseismology The Genetic Algorithm The Genetic Algorithm Parallel Computing Parallel Computing Conclusion Conclusion

37 Problem Division Part one: running the numerical model using a large number of different initial parameters. Part one: running the numerical model using a large number of different initial parameters. Part two: determining fitness, selecting the next generation, and performing crossover/mutation Part two: determining fitness, selecting the next generation, and performing crossover/mutation

38 Master-Slave Paradigm Part one – running the model with a given set of parameters was performed by the slave nodes Part one – running the model with a given set of parameters was performed by the slave nodes Part two – fitness evaluation, selection/crossover/mutation was performed by the master node Part two – fitness evaluation, selection/crossover/mutation was performed by the master node

39 PVM PVM was used as the message passing library PVM was used as the message passing library

40 Execution The master machine generates a job pool of parameter values that it passes to the slave machines. The master machine generates a job pool of parameter values that it passes to the slave machines. The slave machines in turn run the model and return the results to the master. The slave machines in turn run the model and return the results to the master. If there are more parameter sets available, the node is given another job. If there are more parameter sets available, the node is given another job.

41 Execution The master calculates variance. The master calculates variance. Determines fitness. Determines fitness. After the models have been run for a given generation, the master determines the members of the next generation and runs the crossover/mutation methods on the appropriate portion of the new population. After the models have been run for a given generation, the master determines the members of the next generation and runs the crossover/mutation methods on the appropriate portion of the new population. As the new parameters are created, they are sent to the workstations. As the new parameters are created, they are sent to the workstations.

42 The Network The Cluster is composed of one master computer and 64 slave nodes The Cluster is composed of one master computer and 64 slave nodes The cluster of computers is divided into three subnets The cluster of computers is divided into three subnets Each subnet is connected to the master serially, using coaxial cable and a 10base-2 (thin Ethernet) system Each subnet is connected to the master serially, using coaxial cable and a 10base-2 (thin Ethernet) system

43 Darwin Pentium-II 333 MHz system with 128 MB RAM Pentium-II 333 MHz system with 128 MB RAM Two 8.4 GB hard disks. Two 8.4 GB hard disks. Three NE-2000 compatible network cards, one for each of the segments Three NE-2000 compatible network cards, one for each of the segments

44 Darwin

45 Nodes Motherboard Motherboard Processor Processor Single 32 MB RAM chip Single 32 MB RAM chip NE-2000 compatible network card NE-2000 compatible network card No Hard drive! No Hard drive!

46 Nodes Half of the nodes contain Pentium-II 300 MHz processors, while the other half are AMD K6-II 450 MHz chips Half of the nodes contain Pentium-II 300 MHz processors, while the other half are AMD K6-II 450 MHz chips

47 The Cluster

48 Conclusion Based on initial results, the use of genetic algorithms appears to be a promising method for minimizing the residual difference between observational data and the Wilson—Devinney model Based on initial results, the use of genetic algorithms appears to be a promising method for minimizing the residual difference between observational data and the Wilson—Devinney model

49 Conclusion It is also a wonderful example of how parallel computing, open source software and clusters of workstations can have a profound impact on the course of research. It is also a wonderful example of how parallel computing, open source software and clusters of workstations can have a profound impact on the course of research.

50 PIKAIA Namesake “Pikaia Gracilens, a little worm-like beast that crawled in the mud of a long gone seafloor of the Cambrian era, 530 million years ago. While not particularly impressive in the tooth and claw department, Pikaia is believed to be the founder of the phylum Chordata, whose subsequent evolution had consequences still very much felt today by the rest of the ecosystem”

51 References 1. Metcalfe, T. S. (1999), Genetic-Algorithm Based Light-Curve Optimization Applied to Observations of the W Ursae Majoris Star Bh Cassiopeiae, The Astronomical Journal, Vol. 117, No. 5, pp. 2503-2510 2. Metcalfe, T. S., R. E. Nather, and D. E. Winget (2000), Genetic- Algorithm-Based Asteroseismological Analysis of the DBV White Dwarf GD 358, The Astrophysical Journal, Vol. 545, No. 2, pp. 974-981 3. Metcalfe, T. S. (2000), The Asteroseismology Metacomputer, Baltic Astronomy, Vol. 9, pp. 479-483

52 References Author’s Web page: http://www.whitedwarf.org Wilson-Devinney: http://cdsads.u-strasbg.fr/cgi-bin/nph- bib_query?1971ApJ...166..605W http://cdsads.u-strasbg.fr/cgi-bin/nph- bib_query?1971ApJ...166..605W PIKAIA Web Page: http://www.hao.ucar.edu/public/research/si/pikaia /pikaia.html http://www.hao.ucar.edu/public/research/si/pikaia /pikaia.html

53 References Image Sources All images were taken from: http://www.whitedwarf.org http://www.whitedwarf.org Except… H-R Diagram http://www.astunit.com/tutorials/stellar.htm Pikaia Gracilens: PIKAIA Website


Download ppt "PARALLEL GENETIC ALGORITHMS AND THE SCIENCE OF ASTEROSEISMOLOGY A Review of the Doctoral Dissertation Research of Dr. Travis Metcalfe."

Similar presentations


Ads by Google