Title: Learning%20and%20Evolution:%20Lessons%20from%20the%20Baldwin-Effect
1Learning and EvolutionLessons from the
Baldwin-Effect
- Georg Theiner
- P747 Complex Adaptive Systems
- March 11th, 2003
2Outline
- A brief history of modern evolutionary biology
- What is the Baldwin Effect?
- Hinton Nowlan's (1987) simulation
- JAVA-applet of BE
- The trade-offs between phenotypic plasticity and
rigidity - Subsequent studies
- Discussion
3Lamarckian Evolution
- Published Philosophie Zoologique (1809)
- Assumption Change in the environment causes
changes in the needs of organisms living in that
environment, which in turn causes changes in
their behavior. - Mechanisms of evolution
- First Law Use or disuse causes structures
(organs) to enlarge or shrink - Second Law All such acquired changes are
heritable - Example long legs and webbed feet of wading
birds, long neck of giraffe
Jean-Baptiste Lamarck (1744-1829)
4Darwinian Evolution
- Published The Origin of Species (1859)
- direct manipulation of one's genetic make-up
impossible - acquired characteristics are not directly passed
on to offspring - Mechanism of evolution
- Genetic variation in species through random
mutations - Natural selection operates on phenotypes
Charles Darwin (1809-82)
5Baldwinian Evolution
- Published "A New Factor in Evolution" (1896)
- Independently identified by Baldwin, Morgan, and
Osborn in 1896 - New factor phenotypic plasticity the ability
of an organism to adapt to its environment during
its lifetime - Examples ability to learn, to increase muscle
strength with exercise, to tan with exposure to
sun
James Mark Baldwin (1861-1934)
6The Baldwin Effect
- A cluster of effects emerging from an interaction
between 2 adaptive processes - genotypic evolution of population (global search)
- individual organism's phenotypic flexibility
(local search) - Concerned with benefits and costs of lifetime
learning - lifetime learning can alter the genetic
composition of an evolving population
7- Hypothesized examples
- bird song (Simpson 1953)
- human language capacity (Pinker and Bloom 1990,
Deacon 1997) - consciousness, intelligence (Dennett 1991, 1995)
- learning capacity eventually becomes genetically
encoded ? resembles Lamarckian sequence - consistent with Darwinian mechanism for
inheritance of traits
8The Baldwin Effect, Step 1
- Evolutionary value of learning accelerates
evolution of an adaptive trait - As a result of mutation, an organism becomes
capable of learning how to do X - Learning how to do X increases an organism's
fitness - Creates new selective pressures because
selection is now also working on the ability to
perform X. - Since the successful X-er has greater
reproductive success, eventually the population
may consist entirely of individuals able to learn
how to do X.
9The Baldwin Effect, Step 2
- Since learning can be costly, evolution favors
rigid solutions in which acquiring X is part of
an organism's genetic make-up (phenotypic
rigidity) - Chance of reproductive success be proportional to
how quickly (reliably) X can be learnt - New selective pressures cause competition between
slow and fast learners - Some individuals are innately better equipped for
performing X, have reproductive advantage - Eventually, capacity to X comes under direct
genetic control genetic assimilation,
canalization of a trait (Waddington 1942)
10Hinton Nowlan Simulation (1987)
- Organism with neural net, 20 connections (phenes)
- 20 genes, one-to-one mapping on phenes
- Each gene can have 3 alleles
- 0 no connection
- 1 connection
- ? undetermined, learning
- one Good Phenotype net works just in case all
nodes are connected - one Good Genotype all 1's
11"Needle in a haystack"-fitness landscape
- Evolutionary search modeled by GA
- Population of 1000 organisms
- Each allele is randomly initialized
- p 0.5 for ?
- p 0.25 for 0 and 1
- performs no better than random
fitness
combination of alleles
12Problem of passing on the good genome
- Even if good solution discovered, not easily
passed on - unless fit organism finds very-close-to-fit mate,
good genome will be destroyed - expected number of good (immediate) offspring lt 1
- can be bypassed in artificial simulations using
elitism operator, asexual reproduction
13The importance of lifetime learning
- Augment evolutionary search with phenotypic
plasticity - Each organism performs 1000 learning trials
during lifetime - learning mechanism random guess
- if correct net is found, stop else keep
searching - all phenes equally hard to learn
- requires that organism recognizes the correct
solution
14Determine next generation
- Use a version of Holland's GA (1975)
- Perform 1000 matings
- Selection algorithm Roulette Wheel
- Select parents with probability proportional to
fitness
- Fitness function F of an individual A in a
population i is - F(Ai) 1 (G g) / G (N 1)
- G number of allowed guesses
- g number of guesses until solution found
- N length of genotype
- in our case 1 (19n/1000)
15- Wheel is spun twice (2 parents) for each mating,
single offspring is generated - cross-over point for combining parental alleles
is chosen randomly - offspring inherit only genome, never learnt
connection settings - Model parameters are fine-tuned
- typical genotype has about 10 connections
genetically determined (0's or 1's) - about 210 learning trials
16Results 1
- Phenotypic plasticity smoothes "needle in a
haystack" fitness landscape - by allowing an organism to explore neighboring
regions of phenotypic space - no unlikely saltations necessary to climb fitness
peak
17Results 2
- if no phenotypic plasticity, about 220 ( 1
million) organisms have to be produced to succeed
in search - with learning, finding the correct net requires
only 16 x 1000 organisms - little selection pressure to fix all phenes
genetically
18JAVA-Simulation (Watson and Wiles 2001)
- Run with "Show all data" check-box to see
frequency of 0's and ?'s - Alter random number seed
- Additional evolutionary operators
- mutation
- chance (as specified in Advanced Options) that a
given allele will be flipped to either 0, 1, or ?
(with equal p) - maintain diversity, avoid local fitness maxima
- elitism
- forces best individual of each population to be
included unchanged in next generation
19- Alternative Selection Algorithms
- Ranked Roulette Wheel
- slice of wheel is proportional to ranked fitness
- minimizes real differences in fitness
- less selection bias for top-fit individuals
- Tournament
- randomly picks 2 individuals from population,
chooses fitter one with p k (as set in Advanced
Options) - runs much faster
- preserves genetic diversity much longer
- Standard combinations for optimization algorithms
- Standard roulette without elitism
- tournament with elitism
20Fundamental insight of BE
- Trade-offs between learning (plasticity) and
instinct (rigidity)
21French and Messinger (1994)
- amount of plasticity and amount of benefit of
learnt behavior is crucial to size of BE - having blue eyes vs. humming Middle C vs. winking
- x-axis agent's normalized distance from Good
Gene (number of bits differing by total number of
bits) - y-axis probability of learning the Good Phene
22- BE is significant only for a narrow window of
plasticity - if too low or too high, virtually no convergence
towards Good Gene
23Mayley (1996a, 1996b, 1997)
- Possible selective disadvantage of learning
Hiding Effect - phenotypic fitness differentials are compensated
by learning capacity - genetic differences are hidden from selection by
learning - trade-offs between Baldwin and Hiding effect
24Discussion
- Unrealistic assumptions about fitness landscape
- extremely rugged fitness landscape makes pure
evolutionary search very hard - How smooth are real search spaces?
- Unrealistic assumption about learning mechanism
- instead e.g. use hillclimbing procedure for local
optimization - enhances BE only if learning procedure is not too
sophisticated, otherwise insufficient selective
pressure for hard-wiring
25- Learning trials are "cheap" genetic experiments
- but biological reality of those two search
strategies differs in many respects - Unrealistic assumption about genome-phenome
mapping - mapping could be one-to-many
- genetic specification and successful guessing of
a trait are treated interchangeably - transformation of phenotype to genotype
(development) is trivialized
26- Do we need an explicit fitness function?
- French Messinger (1994) introduce spatial
dimension - consider 3 areas of plasticity Good Phene more
efficient metabolism, movement, reproduction - world determines fitness of a given genotype
- Using simple models to understand complex
phenomena - Controlled experiments are practically unfeasible
- How simple is too simple?
27Selective Bibliography on BE
A bibliography on BE (last update 2001) http//www.cs.bath.ac.uk/jjb/web/baldwin.html
An online JAVA-simulation of BE http//www.itee.uq.edu.au/jwatson/bdemo/index.html requires JAVA version 1.3.1 or greater
Ancel, L. (2000) Undermining the Baldwin Expediting Effect Does Phenotypic Plasticity Accelerate Evolution? Theoretical Population Biology, 58, 307-19. http//www.santafe.edu/ancel/PAPERS/TPB.pdf
Baldwin, J.M. (1896) A New Factor in Evolution, American Naturalist, 30, 441-51. http//www.santafe.edu/sfi/publications/Bookinforev/baldwin.html
28Belew, R.K. (1990) Evolution, Learning, and Culture Computational Metaphors for Adaptive Search, Complex Systems, 4, 11-49.
Downes, S. (2003) Baldwin Effects and Expansion of the Explanatory Repertoire in Evolutionary Biology, in Weber, B., and Depew, D.J., (eds.), loc.cit.
French, R., and Messinger, A. (1994) Genes, phenes and the Baldwin effect, in Brooks, R., and Maes, P. (eds.), Artificial Life IV, MIT Press, 277-82. http//www.santafe.edu/amessing/baldwin.ps
Hinton, G.E., and Nowlan, S.J. (1987) How Learning Can Guide Evolution, Complex Systems, 1, 495-502. reprinted in Mitchell, M., and Belew, R. (eds.), Adaptive Individuals in Evolving Populations Models and Algorithms (1996) http//www-advancedgec.ge.uiuc.edu/papers/Chap 25 Adaptive Individuals.pdf
29Jones, M., and Konstam, A. (1999) The Use of Genetic Algorithms and Neural Networks to Investigate the Baldwin Effect, in Carroll, J., and Hiddad, H. et al. (eds.), Proceedings of the 1999 ACM Symposium on Applied Computing, 275-79.
Ku, K., and Mak, M. (1998) Empirical Analysis of the Factors that Affect the Baldwin Effect, in Eiben, A.E., and Baeck, T. et al. (eds.), Parallel Problem Solving From Nature, Springer, 481-90.
Mayley, G. (1996a) The evolutionary cost of learning. In Maes, P., Mataric, M., Meyer, J-A., Pollack, J., and Wilson, S. (Eds), From Animals to Animats Proceedings of the Fourth International Conference on Simulation of Adaptive Behaviour, 458-467, MIT Press. http//www.cogs.susx.ac.uk/users/gilesm/sab96.ps
30Mayley, G., (1996) Landscapes, Learning Costs and Genetic Assimilation, Evolution, Learning, and Instinct 100 Years of the Baldwin Effect, a Special Issue of Evolutionary Computation, 4(3), 1996. http//www.cogs.susx.ac.uk/users/gilesm/ec.ps
P.Turney, D. Whitley and R. Anderson (eds), Evolution, Learning, and Instinct 100 Years of the Baldwin Effect, Special Issue of Evolutionary Computation, 4(3), 1996 Check out Table of contents http//alife.ccp14.ac.uk/baldwin/baldwin/toc.html Editorial with a short history of BE http//alife.ccp14.ac.uk/baldwin/baldwin/editorial.html
31Turney, P. (1996) Myths and legends of the Baldwin effect, in Fogarty, T., and Venturini, G. (eds.), Proceedings of the ICML-96, 135-42. ftp//ai.iit.nrc.ca/pub/iit-papers/NRC-39220.pdf
Waddington, C.H. (1942) Canalization of Development and the Inheritance of Acquired Characteristics, Nature, 150, 563-65.
Weber, B., and Depew, D.J. (eds.), Evolution and Learning the Baldwin Effect Reconsidered, MIT Press, 2003. on-order by B-Main Lib