An Introduction to

About This Presentation

Title:

An Introduction to

Description:

Romantic Love. Bickers with (and there are, of course, many other relations inside ... Mainly a graphics program, but is expanding the analytic capabilities ... – PowerPoint PPT presentation

Number of Views:67

Avg rating:3.0/5.0

Slides: 101

Provided by: sociologyr

Learn more at: https://people.duke.edu

Category:

Tags: introduction

more less

Transcript and Presenter's Notes

Title: An Introduction to

1
An Introduction to Social Network Analysis
James Moody Department of Sociology The Ohio
State University
2
Introduction
The world we live in is connected
Jim Moody
3
Introduction
These patterns of connection form a social space.
Social network analysis maps and analyzes this
social space.
4
Adolescent Social Structure
5
(No Transcript)
6
(No Transcript)
7
Introduction
Yet standard social science analysis methods do
not take this space into account. Moreover, the
complexity of the relational world makes it
impossible (in most cases) to understand this
connectivity using only our intuitive
understanding of a setting.
8
Introduction

Why networks matter
Intuitive information travels through contacts
between actors, which can reflect a power
distribution or influence attitudes and
behaviors. Our understanding of social life
improves if we account for this social space.
Less intuitive patterns of inter-actor contact
can have effects on the spread of goods or
power dynamics that could not be seen focusing
only on individual behavior.

9
Introduction

Social network analysis is
a set of relational methods for systematically
understanding and identifying connections among
actors
a body of theory relating to types of observable
social spaces and their relation to individual
and group behavior.

10
Introduction

Network analysis assumes that
How actors behave depends in large part on how
they are linked together
Example Adolescents with peers that smoke are
more likely to smoke themselves.
The success or failure of organizations may
depend on the pattern of relations within the
organization
Example The ability of companies to survive
strikes depends on how product flows through
factories and storehouses.
(continued..)

11
Introduction
Network analysis assumes that

Patterns of relations reflect the power
structure of a given setting, and clustering may
reflect coalitions within the group
Example Overlapping voting patterns in a
coalition government

12
Introduction
An information network Email exchanges within
the Reagan white house, early 1980s (source
Blanton, 1995)
13
Introduction
Power positions and potential influence
14
Overview
15
Basic Concepts

Origins of network analysis
Beginning in the 1930s, a systematic approach to
theory and research, based on the notion that
relations matter, began to emerge
In 1934 Jacob Moreno introduced the ideas and
tools of sociometry
At the end of World War II, Alex Bavelas founded
the Group Networks Laboratory at M.I.T.

16
Basic Concepts

From the outset, the network analysis has been
a. guided by formal theory organized in
mathematical terms, and
b. grounded in the systematic analysis of
empirical data
In the 1970s, when modern discrete combinatorics
(esp. graph theory) developed rapidly and
powerful computers became readily available that
the study of social networks began to flourish

17
Basic Concepts

Actors are nodes
Ideas, Papers, Events, Individuals,
Organizations, Nations
Relations are lines between pairs of nodes
Symmetric (shares a room with)
Asymmetric (gives an order to)
Valued (number of times seen together)

18
Basic Concepts

Network data are familiar to you
For example
- Personal, face-to-face contact
- Telephone contact
- Email contact
- Contact through faxes or wires
- Snail-mail contact
- Membership in the same organization
- Attendance at the same meetings
- Graduates of the same university

19
Basic Concepts
For example, you might be tracking the activities
of a number of people in related, but not
identical cases, including meetings they
attended. You may know little of the content of
the event, or what they may have said to each
other, only whether particular people were at the
event. Your data might look like
20
Basic Concepts
11.19.2001. Meeting at Brussels.
Attending Smith, Johnson, Davis, James,
Jackson 12.22.2001. Meeting at Paris.
Attending Johnson, James, Jones,
Wilson 1.12.2001. Meeting in New York.
Attending Jones, Carter, Burns 2.14.2001.
Meeting in Denver. Attending Wilson, Burns,
Wilf, Newman
(Red bold indicates people who are the focus of
an investigation)
21
Basic Concepts
While perhaps not immediately apparent when
looking at the list of names, a simple algorithm
reveals connections among these actors.
22
Basic concepts
Types of network data 1) Ego-network - Have
data on a respondent (ego) and the people they
are connected to (alters) - May include
estimates of connections among alters
23
Basic concepts
Types of network data 2) Partial network - Ego
networks plus some amount of tracing to reach
contacts of contacts - Something less than full
account of connections among all pairs of actors
in the relevant population - Example CDC
Contact tracing data for STDs
24
Basic concepts
Types of network data 3) Complete - Data on all
actors within a particular (relevant) boundary -
Never exactly complete, but boundaries are set -
Example Coauthorship data among all writers in
the social sciences
25
Examples linked levels of data
Actor
Key contact
Primary Relation
26
Why networks matter
Consider the following (much simplified) scenario

Probability that actor i passes information to
actor j (pij)is a constant over all relations
0.6
S T are connected through the following
structure

S
T

The probability that S passes the information to
T through either path would be 0.09

27
Probability of transfer of information over
independent paths

The probability that the information passes from
i to j is assumed constant at pij.
The probability that the information passes
through multiple links (i to j, and from j to k)
is the joint probability of each (link1 and link2
and link k) pijd where d is the path
distance.
To calculate the probability of of the
information passing through multiple paths, use
the compliment of it not passing through any
paths. The probability of not passing through
path l is 1-pijd, and thus the probability of not
passing through any path is (1-pijd)k, where k is
the number of paths
Thus, the probability of i passing the
information to j given k independent paths is

Why matter
Distance
28
Probability of information passing over
non-independent paths
- To get the probability that I passes the
information to j given that paths intersect at 4,
I calculate
Using the independent paths formula.
29
Why networks matter
Now consider the following (similar?) scenario
S
T

Every actor but one has the exact same number of
contacts
The category-to-category mixing is identical
The distance from S to T is the same (7 steps)
S and T have not changed their behavior
Their contacts contacts have the same behavior
But the probability of the information passing
from S to T is
0.148
Different outcomes different potentials for
intervention

30
Overview
Introduction
Basic Concepts
Flows within Networks
Structure of Social Space
Tools, Models Methods For Flows and Structures
Conclusions
31
Network Flow

In addition to the simple probablity that one
actor passes information on to another (pij), two
factors affect flow through a network
Topology
the shape, or form, of the network
- Example one actor cannot pass information to
another unless they are either directly or
indirectly connected
Time
- the timing of contact matters
- Example an actor cannot pass information he
has not receive yet

32
Topology
Two features of the networks shape are known to
be important connectivity and centrality

Connectivity refers to how actors in one part of
the network are connected to actors in another
part of the network.
Reachability Is it possible for actor i to
reach actor j? This can only be true if there is
a chain of contact from one actor to another.
Distance Given they can be reached, how many
steps are they from each other?
Number of paths How many different paths
connect each pair?

33
Network topology reachability
Without full network data, you cant distinguish
actors with limited information from those more
deeply embedded in a setting.
c
b
a
34
Network topology distance number of paths

Given that ego can reach alter, distance
determines the likelihood of information passing
from one end of the chain to another.
Because information spread is never certain, the
probability of transfer decreases over distance.
However, the probability of transfer increases
with each alternative path connecting pairs of
people in the network.

35
Network topology distance number of paths
Distance is measured by the (weighted) number of
relations separating a pair
Actor a is 1 step from 4 2 steps from 5
3 steps from 4 4 steps from 3 5 steps from 1
a
36
Network topology distance number of paths
Paths are the different routes one can take.
Node-independent paths are particularly important.
There are 2 independent paths connecting a and b.
b
There are many non-independent paths
a
37
Probability of information transfer
by distance and number of paths, assume a
constant pij of 0.6
1.2
1
10 paths
0.8
5 paths
probability
0.6
2 paths
0.4
1 path
0.2
0
2
3
4
5
6
Path distance
38
Reachability in Colorado Springs (Sexual contact
only)

High-risk actors over 4 years
695 people represented
Longest path is 17 steps
Average distance is about 5 steps
Average person is within 3 steps of 75 other
people
137 people connected through 2 independent paths,
core of 30 people connected through 4 independent
paths

(Node size log of degree)
39
Network topology centrality

Centrality refers to (one dimension of) location,
identifying where an actor resides in a network.
For example, we can compare actors at the edge
of the network to actors at the center.
In general, this is a way to formalize intuitive
notions about the distinction between insiders
and outsiders.

40
Centrality example
At the local level, we expect people like NSJMP
and NSOLN to have greater access to information
than others in the network. Network analysis
gives us a set of tools to quantify this
difference.
41
Centrality example
Actors that appear very different when seen
individually, are comparable in the global
network.
(Node size proportional to betweenness centrality
)
42
Information flows
Two factors that affect network
flows Topology - the shape, or form, of the
network - simple example one actor cannot pass
information to another unless they are either
directly or indirectly connected Time - the
timing of contacts matters - simple example an
actor cannot pass information he has not receive
yet
43
Timing in networks
A focus on contact structure often slights the
importance of network dynamics Time affects
networks in two important ways 1) The structure
itself goes through phases that are correlated
with information spread 2) The timing of contact
constrains information flow
44
Sexual Relations among A syphilis outbreak
Changes in Network Structure
Rothenberg et al map the pattern of sexual
contact among youth involved in a Syphilis
outbreak in Atlanta over a one year period.
(Syphilis cases in red)
Jan - June, 1995
45
Sexual Relations among A syphilis outbreak
July-Dec, 1995
46
Sexual Relations among A syphilis outbreak
July-Dec, 1995
47
Drug Relations, Colorado Springs, Year 1
Data on drug users in Colorado Springs, over 5
years
48
Drug Relations, Colorado Springs, Year 2 Current
year in red, past relations in gray
Data on drug users in Colorado Springs, over 5
years
49
Drug Relations, Colorado Springs, Year 3 Current
year in red, past relations in gray
Data on drug users in Colorado Springs, over 5
years
50
Drug Relations, Colorado Springs, Year 4 Current
year in red, past relations in gray
Data on drug users in Colorado Springs, over 5
years
51
Drug Relations, Colorado Springs, Year 5 Current
year in red, past relations in gray
Data on drug users in Colorado Springs, over 5
years
52
What impact does timing have on flow through the
network?
In addition to changes in the shape over time,
contact timing constrains how information can
flow through the network. Consider the
following example
53
A hypothetical contact network
8 - 9
C
E
3 - 7
2 - 5
B
A
0 - 1
3 - 5
D
F
Numbers above lines indicate contact periods
54
The path graph for the hypothetical contact
network
E
C
B
A
D
F
55
Direct contact network of 8 people in a ring
(adjacency matrix cell number of paths from
row to column)
56
Implied contact network of 8 people in a ring All
contacts concurrent
57
Implied contact network of 8 people in a
ring Mixed Concurrent
2
3
2
1
1
2
2
3
Density 0.57
58
Implied contact network of 8 people in a
ring Serial (1)
1
8
2
7
3
6
5
4
Density 0.73
59
Implied contact network of 8 people in a
ring Serial (2)
1
8
2
7
3
6
1
4
Density 0.51
60
Implied contact network of 8 people in a
ring Serial (3)
1
2
2
1
1
2
1
2
Density 0.43
61
Information flows
Summary Topology - Information requires
connected communication chains - Real-world
networks are too complex to map these without
specialized tools. Time - Network topology
changes over time. This has implications for
information flow. - Because small changes in
relationship timing can have dramatic effects on
information flow, it is impossible to know this
intuitively.
62
Overview
Introduction
Basic Concepts
Flows within Networks
Structure of Social Space
Tools, Models Methods For Flows and Structures
Conclusions
63
Structure of Social Space
Information flows are only one use of
networks It is also possible to characterize the
key topological features of any social network.
These features include things such as the extent
of hierarchy and clustering.
64
Structure of Social Space
1) Identify core groups patterns of relations
among groups a. embeddedness in groups constrains
action b. group structure affects stability
resource distribution 2) Locate tensions or
inconsistencies in a relational structure that
might indicate sources of social change.
65
Structure of Social Space
Two features of interest related to network
structure 1) Cohesive groups Sets of people
who interact frequently with each other. These
are often groups that work together. Groups are
often organized into positions within a network
that indicate particular roles or access
resources 2) Hierarchy Relational structure
can identify the leadership positions within a
network, though either direction of ties or
periphery status
66
Structure of cohesive groups
A cohesive group is a set of actors with more
interaction inside the group than outside the
group, mutually connected through multiple paths.
67
Cohesive Group Structure
Immaculate Preparatory High School
68
Cohesive Group Structure 3 types of positions
Immaculate Preparatory High School
69
Cohesive Group Structure Group member
Immaculate Preparatory High School
70
Cohesive Group Structure Group Member
Immaculate Preparatory High School
71
Cohesive Group Structure Bridge between groups
Immaculate Preparatory High School
72
Cohesive Group Structure Outsider
Immaculate Preparatory High School
73
Cohesive Groups Relevance

Identify people who bridge important
constituencies
- people who are between groups have a unique
ability to control information
Such actors are said to bridge structural holes,
the number of holes an actor bridges gives
insight into an actors power position in the
network.

74
Hierarchy and network position
Many cohesive groups are embedded within a
hierarchy, which one can map using relational
tools. Changes in the hierarchical position
indicate changes in the power structure.
75
Examples of Hierarchical Systems
Linear Hierarchy (all triads transitive)
Simple Hierarchy
Branched Hierarchy
Mixed Hierarchy
76
Hierarchy and network position
If you dont know the hierarchy of the network,
asymmetry optimization techniques allow one to
identify levels in a hierarchy
77
Hierarchy and network position
If you dont know the hierarchy of the network,
asymmetry optimization techniques allow one to
identify levels in a hierarchy
78
Group structure through multiple relations
Start with some basic ideas of what a role is
An exchange of something (support, ideas,
commands, etc) between actors. Thus, we might
represent a family as
H
W
C
C
C
Provides food for
(and there are, of course, many other relations
inside the family)
79
Group structure through multiple relations
The key idea, is that we can express a role
through a relation (or set of relations) and thus
a social system by the inventory of roles. If
roles equate to positions in an exchange system,
then we need only identify particular aspects of
a position. But what aspect? Structural
Equivalence
Two actors are structurally equivalent if they
have the same types of ties to the same people.
80
Structural Equivalence
A single relation
81
Structural Equivalence
Graph reduced to positions
82
Alternative notions of equivalence
Instead of exact same ties to exact same alters,
you look for nodes with similar ties to similar
types of alters
83
Overview
Introduction
Basic Concepts
Flows within Networks
Structure of Social Space
Tools, Models Methods For Flows and Structures
Conclusions
84
Tools, Methods Models
Data Representations
Adjacency Matrix
Graph
Arc List
Node List
85
Tools, Methods Models
Graphical Display

Benefits
Intuitive way to display networks.
Helps people see the social space it is a map.
A concise presentation of a great deal of data.
Costs
Lack of standards for how to display can create
misleading images.
Displays of large networks tend to reveal only
the roughest properties of the network

86
Tools, Methods Models
Graphical Display Software

PAJEK
Program for analyzing and plotting very large
networks
Intuitive windows interface
Used for most of the real data plots in this
presentation
Mainly a graphics program, but is expanding the
analytic capabilities
Free
Available from

87
Tools, Methods Models
Graphical Display Software

Cyram Netminer for Windows
Very new largely untested
Price range depends on application
Limited to smaller networks O(100)

88
Tools, Methods Models
Graphical Display Software

NetDraw
Also very new, but by one of the best known names
in network analysis software.
Free
Limited to smaller networks O(100)

89
Tools, Methods Models
Analysis Methods Descriptive / Measurement
The key text for methods and measurement is
Wasserman, Stanley and Katherine Faust. 1994.
Social Network Analysis. Cambridge Cambridge
University Press.
The basic network measures use graph theory to
formalize aspects of the network, and always work
from either an adjacency matrix (slow for large
graphs) or an edge/node list.
90
Tools, Methods Models
Analysis Methods Descriptive / Measurement
Properties of interest include
Individual Level Degree Number of contacts for
each person - Sum over the row/column of the
adjacency matrix. Closeness Centrality Inverse
of the distance to every other node in the
network. Count path distances from ego to
alters. Sub-group Level Group Membership Which
groups are there? Various search algorithms for
identifying groups. Group Position Where does a
given group fit in the overall flow of relations?
Various Equivalence algorithms. Graph
Level Density Number of ties present as a
percentage of all possible ties. Centralization
To what degree are edges focused through a small
number of nodes. Various formulas for different
centrality indices.
91
Tools, Methods Models
Analysis Methods Descriptive / Measurement
Software

1) UCI-NET
General Network analysis program, runs in Windows
Good for computing measures of network topography
for single nets
Input-Output of data is a little clunky, but
workable.
Not optimal for large networks
Available from
Analytic Technologies
Borgatti_at_mediaone.net
2) STRUCTURE
A General Purpose Network Analysis Program
providing Sociometric Indices, Cliques,
Structural and Role Equivalence, Density Tables,
Contagion, Autonomy, Power and Equilibria In
Multiple Network Systems.
DOS Interface w. somewhat awkward syntax
Great for role and structural equivalence models
Manual is a very nice, substantive, introduction
to network methods
Available from a link at the INSNA web site
http//www.heinz.cmu.edu/project/INSNA/soft_inf.ht
ml

92
Tools, Methods Models
Analysis Methods Descriptive / Measurement
Software

3) NEGOPY
Program designed to identify cohesive sub-groups
in a network, based on the relative density of
ties.
DOS based program, need to have data in arc-list
format
Moving the results back into an analysis program
is difficult.
Available from
William D. Richards
http//www.sfu.ca/richards/Pages/negopy.htm
4) SPAN - Sas Programs for Analyzing Networks
(Moody, ongoing)
is a collection of IML and Macro programs that
allow one to
a) create network data structures from nomination
data
b) import/export data to/from the other network
programs
c) calculate measures of network pattern and
composition
d) analyze network models
Allows one to work with multiple, large networks
Easy to move from creating measures to analyzing
data
All of the Add Health data are already in SAS
Available by sending an email to
Moody.77_at_osu.edu

93
Tools, Methods Models
Analysis Methods Statistical Models
There are two general classes of statistical
models for networks
1) Models of the network itself The statistical
question is how an observed network fits into the
class of all possible random graphs with a given
set of topological characteristics. The whole
network is the substantive unit of analysis,
though technically one works with the dyads from
the network. Examples p models (Wasserman
and Pattison), MCMC random graph models (Tom
Snijders, Mark Handcock) 2) Models of individual
behavior that incorporate network
characteristics The statistical question is
whether or not network properties affect
individual behaviors. Examples Network
regressive-autoregressive models (Doriean), Peer
influence models (Friedkin)
94
Tools, Methods Models
Analysis Methods Statistical Models
Exponential Random Graph Models
Where z is a collection of r explanatory
variables, calculated on x 2 is a collection of
r parameters to be estimated k is a normalizing
constant that ensures the probability sums to 1.
As it turns out, k is incredibly difficult to
identify, introducing a number of complexities to
the model.
95
Exponential Random Graph Model Details Kindly
provided by Mark Handcock, University of
Washington Statistics Department.
96
Exponential Random Graph Model Details Kindly
provided by Mark Handcock, University of
Washington Statistics Department.
97
Exponential Random Graph Model Details Kindly
provided by Mark Handcock, University of
Washington Statistics Department.
98
Exponential Random Graph Model Details Kindly
provided by Mark Handcock, University of
Washington Statistics Department.
99
Exponential Random Graph Model Details Kindly
provided by Mark Handcock, University of
Washington Statistics Department.
100
Exponential Random Graph Model Details Kindly
provided by Mark Handcock, University of
Washington Statistics Department.
101
Exponential Random Graph Model Details Kindly
provided by Mark Handcock, University of
Washington Statistics Department.
102
Tools, Methods Models
Analysis Methods Statistical Models
Exponential Random Graph Models
To estimate the model, we work with the
conditional probabilities (XijXcij) instead of
the full graph. This transforms the exponential
model to a logit model on the dyads
103
Analysis Methods Statistical Models
Exponential Random Graph Models
Software for analyzing these models is available
from Logit Pseudo-Likelihood estimation http//
kentucky.psych.uiuc.edu/pstar/index.html (SPSS
programs) http//www.sfu.ca/richards/Pages/pspar
.html (Program for Large graphs) Empirically,
these models are tricky to estimate, as the
potential result space can easily become
degenerate, particularly as z starts to include a
more complicated rage of dependencies. MCMC
Estimation Ongoing work by Mark Handcock, Tom
Snijders and Co.
104
Tools, Methods Models
Analysis Methods Statistical Models
Network Effect Models
Question is whether or not being connected to a
particular set of people affects an individuals
behavior. The key statistical point is that we
have abandoned the assumption that our cases are
independent. These models originated in spatial
statistics looking at the effect of an adjacent
geographic area on outcomes for any given area.
105
Basic Peer Influence Model
Formal Model
(1)
(2)
Y(1) an N x M matrix of initial opinions on M
issues for N actors X an N x K matrix of K
exogenous variable that affect Y B a K x M
matrix of coefficients relating X to Y a a
weight of the strength of endogenous
interpersonal influences W an N x N matrix of
interpersonal influences
106
Basic Peer Influence Model
Formal Model
(1)
This is the basic general linear model. It says
that a dependent variable (Y) is some function
(B) of a set of independent variables (X). At
the individual level, the model says that
Usually, one of the covariates is e, the model
error term.
107
Basic Peer Influence Model
(2)
This part of the model taps social influence. It
says that each persons final opinion is a
weighted average of their own initial opinions
And the opinions of those they communicate with
(which can include their own current opinions)
108
Basic Peer Influence Model
The key to the peer influence part of the model
is W, a matrix of interpersonal weights. W is a
function of the communication structure of the
network, and is usually a transformation of the
adjacency matrix. In general
Various specifications of the model change the
value of wii, the extent to which one weighs
their own current opinion and the relative weight
of alters.
109
Basic Peer Influence Model
Formal Properties of the model
If we allow the model to run over t, we can
describe the model as
The model is directly related to spatial
econometric models
Where the two coefficients (a and b) are
estimated directly (See Doreian, 1982, SMR)
110
Overview
Introduction
Basic Concepts
Flows within Networks
Structure of Social Space
Tools, Models Methods For Flows and Structures
Conclusions
111
(No Transcript)

Write a Comment

User Comments (0)