People Forecasting Where people are going - PowerPoint PPT Presentation

1 / 24

About This Presentation

Title:

People Forecasting Where people are going

Description:

vt-1. gt. rt. lt. yt. vt. Ft-1. D: Time-of-day (discrete) W: Day of week (discrete) ... Predict origin/destinations and routes taken by an individual. ... – PowerPoint PPT presentation

Number of Views:21

Avg rating:3.0/5.0

Slides: 25

Provided by: vib34

Category:

more less

Transcript and Presenter's Notes

Title: People Forecasting Where people are going

1
People ForecastingWhere people are going?

Vibhav Gogate-Computer Science
Rina Dechter-Computer Science
Other Collaborators
Bozhena Bidyuk-Computer Science
James Marca-Transportation Science
Craig Rindt-Transportation Science
University of California, Irvine, CA 92967

2
Motivation

Origin/Destination (O-D) matrix
A necessary input to most microscopic simulation
models in transportation literature.
Old method (Peeta et al. 2002) uses paper and
pencil surveys to generate O-D matrices
Can IT help?
Our proposed Activity Model learns and predicts
a users origins/destinations and routes using
his GPS log.
Our hope is that even if a small sample of
population agrees to share their GPS data, we can
compute the aggregate O-D matrix for a large area
like a city.

3
How O-D matrices are estimated? (Peeta et al.
2002)
4
Architecture
Learning Engine
O-D matrix for a region
Probabilistic Model
GIS Database
Inference Engine
5
Probabilistic model Hybrid Dynamic Mixed Networks

Extends Hybrid Dynamic Bayesian Networks (Lerner
2002) to include Discrete constraints
Able to model all of the following
Discrete, Continuous Gaussian variables
Markov processes
Deterministic (constraint networks)probabilistic
information (Bayesian Networks)

6
Building the activity model
dt
wt
dt-1
wt-1
D Time-of-day (discrete) W Day of week
(discrete) Goal collection of locations where
the person spends significant amount of time.
(discrete) F Counter to control goal
switching. Route A hidden variable that just
predicts what path the person takes
(discrete) Location A pair (e,d) e is the edge
on which the person is and d is the distance of
the person from one of the end-points of the edge
(continuous) Velocity Continuous GPS reading
(lat,lon,spd,utc).
gt-1
gt
Ft-1
Ft
rt-1
rt
vt-1
vt
lt-1
lt
yt-1
yt
7
Constraints in the model
If (distance(lt-1,gt-1)ltthreshold and Ft-10)
Then FtD If (distance(lt-1,gt-1)ltthreshold and
Ft-1gt0) Then FtFt-1-1 If(distance(lt-1,gt-1)gtthre
shold and Ft-1 0) Then Ft0 If(distance(lt-1,gt-
1)gtthreshold and Ft-1 gt 0) Then Ft0 If(Ft-1gt0
and Ft0) gt is given by P(gtgt-1) If(Ft-10 and
Ft0) gt is same as gt-1 If(Ft-1gt0 and Ftgt0) gt
is same as gt-1 If(Ft-10 and Ftgt0) gt is given
by P(gtgt-1)
gt
gt-1
Ft-1
Ft
lt-1
8
Example Queries

Where the person will be 10 minutes from now?
P(lTd1t,w1t,y1t) where Tt10 minutes
What is the persons next goal?
P(gTd1t,w1t,y1t)

9
Example of Goals
10
Example of Route
Grocery store
Route Seen Route Predicted
11
Contributions

A new modeling framework of Hybrid Dynamic Mixed
Networks
A Hybrid Dynamic Mixed Network model for
transportation routines
Predict origin/destinations and routes taken by
an individual.
Novel inference algorithms for reasoning in
Hybrid Dynamic Mixed Networks
An Expectation propagation based algorithm
A new algorithm that combines Particle Filtering
and Generalized Belief Propagation in a
systematic way.

12
Inference in Hybrid Dynamic Mixed Networks (HDMN)

Filtering problem
The Belief state at time t given evidence until
time t P(Xte1t)
Complexity of exact inference NP-hard
Exponential in treewidth
Discrete Dynamic Mixed Networks
Treewidth number of variables in each time
slice
Hybrid Dynamic Mixed Networks
Treewidth O(T) where T is the number of
time-slices.
Approximation is a must in most cases!

13
Approximate Inference

Two popular approximate inference algorithms for
Dynamic Networks
Generalized Belief Propagation (Heskes et al.
02)
Rao-Blackwellised Particle Filtering (Doucet et
al. 02)
Our contribution Extend these two algorithms to
allow discrete constraints
Iterative Join Graph Propagation-Sequential
(IJGP-S)
A new Rao-Blackwellised Particle Filtering (RBPF)
algorithm called IJGP-RBPF.
Use output of IJGP to compute an importance
function
Parameterized by two complexity parameters of i
and w which provides us with a range of
algorithms to choose from.

14
Steps in IJGP-S(i)

Create a Join graph in a sequential manner.
Extends a method by Murphy 02 that creates
junction-tree in a sequential manner.
Perform message passing in slice t and its
interfaces with slice t-1 and t1
Complexity O(exp(i)) where i is maximum number
of variables in a clique of a join-graph.

15
Rao-Blackwellised Particle Filtering (RBPF)

Divide the current state-space into Rt and Xt
where
Rt Rao-Blackwellised (RB) variables
Xt Marginal Variables.
For i1 to N do
Sample Rt(i) and compute marginals on Xt(i) given
Rt(i),Rt-1(i),Xt-1(i) and observation yt using an
exact inference algorithm.
W-cutset (Bidyuk and Dechter 04)
An elegant way to select the RB variables.

16
Extending RBPF to Hybrid Dynamic Mixed Networks

Naïve Extension
Sample from the distribution given by the
Bayesian Network and reject all samples which are
not solutions to the constraint portion.
However, If the distribution generates
non-solutions to the constraint portion with a
high probability, most samples will be rejected.

17
Extending RBPF to Hybrid Dynamic Mixed Networks

Use IJGP for Hybrid Mixed Networks (Gogate and
Dechter 05, UAI) to generate an importance
function.
Sample from this importance function
All other steps are same as in Doucet et al.2002
The resulting algorithm IJGP-RBPF
Complexity O(MAX(exp(i),exp(w)))
i is the i-bound of the join-graph
w is the treewidth of the RB-variables also
called the w-cutset.

18
Experimental ResultsData Collection

GPS data was collected by one of the authors for
a period of 6 months.
Latitude and longitude pairs
3 months data was used for training and 3 months
for testing.
Data divided into segments
A segment is a series of GPS readings such that
two consecutive readings are less than 15 minutes
apart.

19
Experimental ResultsModels and algorithms

Test if adding new variables improves prediction
accuracy.
Model-1 Model as described before
Model-2 Remove variables dt and wt
Model-3 Remove variables dt, wt,ft,rt,gt from
each time slice.
Algorithms
IJGP-RBPF(1,2), IJGP-RBPF(2,1), IJGP-S(1) and
IJGP-S(2)

20
Various Activity models
dt
wt
dt-1
wt-1
gt-1
gt
Model-1
Ft-1
Ft
rt-1
rt
Model-2
vt-1
vt
Model-3
lt-1
lt
yt-1
yt
21
Learning the models from data

EM algorithm used for learning the models
Takes about 3 to 5 days to learn data that is
distributed over 3 months.
Since EM uses inference as a sub-step, we have 4
EM algorithms corresponding to the 4 algorithms
used for inference
IJGP-RBPF(1,2), IJGP-RBPF(2,1), IJGP-S(1) and
IJGP-S(2)

22
Predicting Goals (MODEL-1)

Compute P(gte1t) and compare it with the actual
goal.
Accuracy percentage of goals predicted
correctly.
N number of particles
Column learning algorithm
Row inference algorithm

23
Predicting Goals (Model-2)

Compute P(gte1t) and compare it with the actual
goal.
Accuracy percentage of goals predicted
correctly.
N number of particles
Column learning algorithm
Row inference algorithm

24
Predicting Goals (Model-3)

Compute P(gte1t) and compare it with the actual
goal.
Accuracy percentage of goals predicted
correctly.
N number of particles
Column learning algorithm
Row inference algorithm

25
Predicting Routes

Compare the path of the person predicted by the
model with the actual path.
False positives (FP)---Precision
count the number of roads that were not taken by
the person but were in the predicted path.
False Negatives (FN)---Recall
count the number of roads that were taken by the
person but were not in the predicted path.

26
False Positives and False Negatives for Route
prediction
Model-1 shows the highest route prediction
accuracy, given by low false positives and false
negatives.
27
Future Work O-D estimation through Simulation