Title: Contagion, Tipping and Navigation in Networks
1Contagion, Tipping and Navigation in Networks
- Networked Life
- CSE 112
- Spring 2007
- Prof. Michael Kearns
2Gladwell, page 7 The Tipping Point is the
biography of the idea that the best way to
understand the emergence of fashion trends, the
ebb and flow of crime waves, or the rise in teen
smoking is to think of them as epidemics. Ideas
and products and messages and behaviors spread
just like viruses do
on networks.
3Gladwell Tipping Examples
- Hush Puppies
- almost dead in 1994 gt 10x sales increase by 96
- no advertising or marketing budget
- claim viral fashion spread from NY teens to
designers - must be certain connectivity and individuals
- NYC Crime
- 1992 gt 2K murders lt 770 five years later
- standard socio-economic explanations
- police performance, decline of crack, improved
economy, aging - but these all changed incrementally
- alternative small forces provoked anti-crime
virus - Technology tipping fax machines, email, cell
phones - Tipping origins 1970s white flight
4Key Characteristics of Tipping(according to
Gladwell)
- Contagion
- viral spread of disease, ideas, knowledge, etc.
- spread is determined by network structure
- network structure will influence outcomes
- who gets infected, infection rate, number
infected - Amplification of the incremental
- small changes can have large, dramatic effects
- network topology, infectiousness, individual
behavior - Sudden, not gradual change
- phase transitions and non-linear phenomena
- How can we formalize some of these ideas?
5Rates of Growth and Decay
linear
linear
nonlinear, tipping
nonlinear, gradual decay
6Gladwells Three Sources of Tipping
- The Law of the Few (Messengers)
- Connectors, Mavens and Salesman
- Hubs and Authorities
- The Stickiness Factor (Message)
- The infectiousness of the message itself
- Still largely treated as a crude property of
transmission - The Power of Context
- global influences affecting messenger behavior
7Epidemos
- Forest fire simulation
- grid of forest and vacant cells
- fire always spreads to adjacent four cells
- perfect stickiness or infectiousness
- connectivity parameter
- probability of forest
- fire will spread to connected component of source
- tip when forest 0.6
- clean mathematical formalization (e.g. fraction
burned) - Viral spread simulation
- population on a grid network, each with four
neighbors - stickiness parameter
- probability of passing disease
- connectivity parameter
- probability of rewiring local connections to
random long-distance - no long distance connections tip at stickiness
0.3 - at rewiring 0.5, often tip at stickiness 0.2
8Mathematizing the Forest Fire
- Start with a regular 2-dimensional grid network
- this represents a complete forest
- Delete each vertex (and its edges) with
probability p (independently) - this represents random clear-cutting or natural
fire breaks - Choose a random remaining vertex v
- this is my campsite
- Q What is the expected size of vs connected
component? - this is how much of the forest is going to burn
9Mathematizing the Epidemic
- Start with a regular 2-dimensional grid network
- this represents a dense population with local
connections (neighbors) - Rewire each edge with probability p to a random
destination - this represents long-distance connections
(chance meetings) - Choose a random remaining vertex v
- this is an infection spreads probabilistically
to each of vs neighbors - Fraction killed more complex
- depends on both size and structure of vs
connected component - Important theme
- mixing regular, local structure with random,
long-distance connections
10Some Remarks on the Demos
- Connectivity patterns were either local or random
- will eventually formalize this model
- what about other/more realistic structure?
- Tipping was inherently a statistical phenomenon
- probabilistic nature of connectivity patterns
- probabilistic nature of disease spread
- model likely properties of a large set of
possible outcomes - can model either inherent randomness or
variability - Formalizing tipping in the forest fire demo
- might let grid size N ? infinity, look at fixed
values of p - is there a threshold value q
- p gt q ? expected fraction burned lt 1/10
- p lt q ? expected fraction burned gt 9/10
11Small Worlds and the Law of the Few
- Gladwells Law of the Few
- a small number of highly connected vertices
(? heavy tails) - inordinate importance for global connectivity (?
small diameter) - Travers Milgram 1969 classic early social
network study - destination a Boston stockbroker lived in
Sharon, MA - sources Nebraska stockowners Nebraska and
Boston randoms - forward letter to a first-name acquaintance
closer to target - target information provided
- name, address, occupation, firm, college, wifes
name and hometown - navigational value?
- Basic findings
- 64 of 296 chains reached the target
- average length of completed chains 5.2
- interaction of chain length and navigational
difficulties - main approach routes home (6.1) and work (4.6)
- Boston sources (4.4) faster than Nebraska (5.5)
- no advantage for Nebraska stockowners
12The Connectors to the Target
- T M found that many of the completed chains
passed through a very small number of penultimate
individuals - Mr. G, Sharon merchant 16/64 chains
- Mr. D and Mr. P 10 and 5 chains
- Connectors are individuals with extremely high
degree - why should connectors exist?
- how common are they?
- how do they get that way? (see Gladwell for
anecdotes) - Connectors can be viewed as the hubs of social
traffic - Note no reason target must be a connector for
small worlds - Two ways of getting small worlds (low diameter)
- truly random connection pattern ? dense network
- a small number of well-placed connectors in a
sparse network
13Small Worlds A Modern Experiment
- The Columbia Small Worlds Project
- considerably larger subject pool, uses email
- subject of Dodds et al. assigned paper
- Basic methodology
- 18 targets from 13 countries
- on-line registration of initial participants, all
tracking electronic - 99K registered, 24K initiated chains, 384 reached
targets - Some findings
- lt 5 of messages through any penultimate
individual - large friend degree rarely (lt 10) cited
- Dodds et al ? no evidence of connectors!
- (but could be that connectors are not cited for
this reason) - interesting analysis of reasons for forwarding
- interesting analysis of navigation method vs.
chain length
14The Strength of Weak Ties
- Not all links are of equal importance
- Granovetter 1974 study of job searches
- 56 found current job via a personal connection
- of these, 16.7 saw their contact often
- the rest saw their contact occasionally or
rarely - Your closest contacts might not be the most
useful - similar backgrounds and experience
- they may not know much more than you do
- connectors derive power from a large fraction of
weak ties - Further evidence in Dodds et al. paper
- TM, Granovetter, Gladwell multiple spaces
distances - geographic, professional, social, recreational,
political, - we can reason about general principles without
precise measurement
15The Magic Number 150
- Social channel capacity
- correlation between neocortex size and group size
- Dunbars equation neocortex ratio ? group size
- Clear implications for many kinds of social
networks - Again, a topological constraint on typical degree
- From primates to military units to Gore-Tex
16A Mathematical Digression
- If theres a Magic Number 150 (degree bound)
- and we want networks with small diameter
- then there may be constraints on the mere
existence of certain NWs - let D be the largest degree allowed
- why? e.g. because there is a limit to how many
friends you can have - suppose we are interested in NWs with
(worst-case) diameter D (or less) - why? because many have claimed that D is often
small - let N(D,D) size of the largest possible NW
obeying D and D - Exact form of N(D,D) is notoriously elusive
- but known that it is between (D/2)D and 2DD
- So, for example, if N 300M (U.S. population)
- to be certain NW exists, solve N lt (D/2)D
- if D lt 150 (e.g. see Gladwell) D gt 4.5
- if D lt 6 (e.g. see Travers Milgram) D gt 52
- so these literatures are consistent (whew!)
- More generally multiple structural properties
may be competing
17