Title: Modeling and Analysis of Dynamic Social Communication Networks
1Modeling and Analysis of Dynamic Social
Communication Networks
SNAG
- Malik Magdon-Ismail
- CS, RPI.
- www.cs.rpi.edu/magdon
2SNAG Social Network Analgorithms Group
- Mark Goldberg
- M-I
- Al Wallace
- Sponsors
- Jeff Baumes
- Sean Barnes
- Justin Chen
- Matt Francisco
- Mykola Hayvanovich
- Konstantin Mertsalov
- Yingjie Zhou
SNAG
3Communications
Time January 12, 2005, 0935 From
joe_at_xyz.com To sue_at_abc.com Subject
Hello Message Where have you been?
160631 ltFreeTradegt Republicans were the worst
pacifists before ww1 and ww2 160643
ltSweetLeafgt France Fries 160650 ltFreeTradegt
As a generality, of course their were Republican
Hawks. 160713 ltFreeTradegt Sweet, good pun but
bad story! 160718 ltSweetLeafgt yup 160723
ltLupinegt anyways, he's perpetually tormented by
presidential actions 160725 ltSweetLeafgt it
aint good for no one 160747 ltSweetLeafgt I
think they knew it was commiing 160751
ltFreeTradegt Rossevelt met monthly in New York
with mostly trusted Republicans to talk about how
to get america into the war. 160810
ltFreeTradegt and he spent 2 year with Churchill
meeting him sometimes secretly in the ocean to
discuss the same topic. 160822 ltFreeTradegt
Exchanging a lot of letters. 160825
ltFreeTradegt telegrams 160828 ltLupinegt There
really is nothing like a shorn scrotum. It's
breathtaking, I suggest you try it. 160855
ltFreeTradegt Well they didnt literally meet in the
ocean, they were on ships.
4Minimal Intrusion
- Dont use communication content.
- Less intrusive
- Easier
5Overview
- Part I
- Finding groups from communications.
- Part II
- Virtual Social Science Laboratory.
6I Groups from Communications
- Algorithms
- Spatial algorithms (clustering)
- Temporal hidden group algorithms
- Software tool SIGHTS
- Statistical Identification of Groups Hidden in
Time and Space - Applications
- Simulated datasets
- Web logs
- Enron email corpus
7Communications Data
- Email, Telephone, Newsgroup, Weblog, Chatrooms,
Time January 12, 2005, 0935 From
joe_at_xyz.com To sue_at_abc.com Subject
Hello Message Where have you been lately?
Time January 12, 2005, 0935 From
joe_at_xyz.com To sue_at_abc.com Subject
Hello Message Where have you been lately?
8Communication Graph
January 12, 2005, 0935
sue_at_abc.com
joe_at_xyz.com
9Streaming Communications
Time Step
0
10
20
30
10Cycle Model
Time Step
0
10
20
30
11Types of Structure
- Spatial Correlation (spatial groups)
- Temporal Correlation (temporal or planning groups)
12Groups Correlated in Space
13Groups Correlated in Time
14Groups correlated in time
15Spatial Correlation
- Clustering graphs into overlapping clusters
16Groups as Clusters
- Social groups tend to communicate with each other
- Find social groups by finding locally dense
clusters
likely a social group
likely not a social group
17Locally vs. Globally Dense
18Clustering vs. Partitioning
19Clustering density metrics
- PinEin/Eposs
- Ein/(EinEout)
- Pin/(PinPout)
Eout
Ein
20Influential Nodes
21Iterative Improvement
- Improve initial clusters using iterative local
optimization. - Link Agregate (LA) B,G,M-I 05.
- RaRe Iterative Scan (IS) B,G,K,M-I,P 05.
22Some Real Social Networks
23Some Real Social Networks
- CiteSeer (co-authorship graph)
- Example clusters
- Electric circuit design
- An optimization strategy for reconfigurable
control systems - Optimization of Neural Networks
- A new activation function in the Hopfield
network for solving optimization problems - Intersection
- Sensitivity analysis in degenerate quadratic
programming
24Temporal Correlation
- Finding hidden groups that are planning over time
25Connectivity and Planning
Internally connected
Externally connected
26Persistence
- Group connected in successive time periods.
- Persistence ? planning over time.
27Finding Temporal Hidden Groups
- Given communication graphs G1,,GT
- Is there a hidden group of size gt K?
- Find all such hidden groups?
- Over what period is the hidden group active?
28Algorithms
- Low order poly-time algorithms
- B,G,M-I,W 05
- Not all members connected in every time period?
- Connected in most time periods?
- NP-Hard
29Example
30Example
31Example
32SIGHTS
- Statistical Identification of Groups Hidden in
Time and Space
33Statistical Significance
- Background communications
- Nature of hidden group
- Detecting non-trusting hidden groups is easier
34Ali Baba dataset
- Unclassified synthesized data for the Department
of Defense - Used for specific case studies for initial
validation of research - Nine embedded hidden groups
Message content not used
35Ali Baba initial results
- Ground Truth
- Group A
- Dog
- Vulture
- Camel
- Yassir Hussein
- Bird
- (6 others)
- Group B
- Ahmet
- Saleh Sarwuk
- Shaid
- Pavlammed Pavlah
- Osan Domenik
- SIGHTS
- Group A
- Dog
- Vulture
- Camel
- Gopher
- Group B
- Ahmet
- Saleh Sarwuk
- Shaid
- Ahmett
- Dajik
36Cycle vs. Stream Model
Sent at time B
Sent at time B 20
Sent at time B 40
Probability of reaction
min
max
Time since message received
37Stream Example
- Time From To Message
- 1000 Alice Charlie Golf tomorrow? Tell
everyone. - 1005 Charlie Felix Alice mentioned
golf tomorrow. - 1006 Alice Bob Hey, golf
tomorrow. Spread the word. - 1012 Alice Bob Tee off 8am at
Pinehurst. - 1013 Felix Grace Hey guys, golf
tomorrow. - 1013 Felix Harry Hey guys, golf
tomorrow. - 1015 Alice Charlie Pinehurst Tee
time 8am. - 1020 Bob Elizabeth Were playing golf
tomorrow. - 1020 Bob Dave Were playing
golf tomorrow. - 1022 Charlie Felix Tee time 8am at
Pinehurst - 1025 Bob Elizabeth We tee off 8am at
Pinehurst. - 1025 Bob Dave We tee off 8am at
Pinehurst. - 1031 Felix Grace Tee time 8am,
Pinehurst. - 1031 Felix Harry Tee time 8am,
Pinehurst.
38Stream Example
- Time From To
- 1000 Alice Charlie
- 1005 Charlie Felix
- 1006 Alice Bob
- 1012 Alice Bob
- 1013 Felix Grace
- 1013 Felix Harry
- 1015 Alice Charlie
- 1020 Bob Elizabeth
- 1020 Bob Dave
- 1022 Charlie Felix
- 1025 Bob Elizabeth
- 1025 Bob Dave
- 1031 Felix Grace
- 1031 Felix Harry
39Streams vs. Cycles
- Tree threads may overlap.
- Some may be short, some long.
40Stream Algorithms
- Efficient algorithms for small trees (triples,
chains). - Build larger frequent trees from smaller.
- What size tree is statistically significant?
41Enron data in stream model
Earlier
Later
42II Virtual Social Science Laboratory
- A general HMM model.
- Simulation
- social science experiments.
- Reverse engineering
- what makes a society tick?
43Goal
- Given a societys communication history,
- Can we predict the societys future
- eg number of groups after 3 months?
- average group size after 3 months?
- Can we deduce something about the nature of the
society - eg actors have a propensity to join small
groups?
44Social Networks
45Social Networks
46Social Networks
1
2
- Join
3
47Social Networks
1
2
- Join
- Leave
3
48Social Networks
- Join
- Leave
- Disappear
49Social Networks
1
- Join
- Leave
- Disappear
- Appear
3
50Social Networks
4
1
- Join
- Leave
- Disappear
- Appear
- Re-appear
3
51Communication History
52Social Group History
53Learning and Predicting
Societys History (Macro-Laws)
Actors Behavior (Micro-Laws)
Societys History (Macro-Laws)
Learn
Predict
Predict (Simulate)
Societys Future
54Example of Micro-Law
SMALL
LARGE
55Micro-Laws
- Actor micro-laws
- Probabilistically specify actor decisions.
- Group micro-laws
- Probabilistically specify group decisions.
56Hidden Markov Model
- Society is a probabilistically driven complex
system.
P(ST1micro-lawsS0,,ST)
Functions Parameters
History
Social Capital Theory
57Simulation
P(ST1micro-lawsS0,,ST)
Observe
Postulate
58Reverse Engineering
P(ST1micro-lawsS0,,ST)
Observe
Learn
59Putnam on Social Capital
- Collapse of social capital in United States
communities - Actors build social capital by belonging to
social groups.
60Why?
- Technological innovation?
- Cultural change?
- Demographics change?
61Test Such Hypotheses in VSSL
62Reverse Engineering
Simulated data proof of concept.
Small Medium Large
Small 49.2 0.8 0.0
Medium 0.3 73.3 1.5
Large 0.0 3.8 371.2
Newsgroups actors prefer small
groups Butler 1999
63Reverse Engineering can
- Obtain actor preferences (eg. size).
- Determine society reward structure.
- Probabilistic micro-laws governing actor and
group dynamics. - ...
64Summary
- Discovering groups in space and time
- Societys social group history.
- VSSL Virtual Social Science Lab
- Simulation social science experiments.
- Reverse engineering learn behavior.
- Algorithms, tools, applications (data).
65Ongoing Work
- Data
- Weblogs, Chatrooms, Email (eg. Enron)
- Finding hidden groups
- Stream, cycle (NP-hard)
- Modeling and reverse engineering
- Visualization
- Dynamic networks
- Information visualization (Knowledgization)
66Thank You
- http//www.cs.rpi.edu/magdon