Statistical Analysis of Geographical Information(2) - PowerPoint PPT Presentation

About This Presentation
Title:

Statistical Analysis of Geographical Information(2)

Description:

Dr. Marina Gavrilova * Autocorrelation is needed to understand the relationship between locations and observed variables Line Pattern Analyzers and Polygon Pattern ... – PowerPoint PPT presentation

Number of Views:135
Avg rating:3.0/5.0
Slides: 35
Provided by: cwa81
Category:

less

Transcript and Presenter's Notes

Title: Statistical Analysis of Geographical Information(2)


1
Statistical Analysis of Geographical
Information(2)
  • Dr. Marina Gavrilova

2
Topics
  • Autocorrelation
  • Line Pattern Analyzers
  • Polygon Pattern Analyzers
  • Network Pattern Analyzes

3
Spatial Autocorrelation of Points
  • Spatial autocorrelation coefficients measure and
    test how clustered/dispersed the point locations
    are with respect to their attribute values.
  • Spatial autocorrelation of a set of points refers
    to the degree of similarity between points or
    events occurring at these points and points or
    evens in nearby locations.
  • With the spatial autocorrelation coefficient, we
    can measure
  • The proximity of location
  • The similarity of the characteristics of these
    locations.

4
Measures for Spatial Autocorrelation
  • Two popular indices for measuring spatial
    autocorrelation applicable to a point
    distribution Gearys Ratio and Morans I Index.
  • sij representing the similarity of point i s and
    point j s attributes.
  • wij representing the proximity of point i s and
    point j s locations, wii0 for all points.
  • xi representing the value of the attribute of
    interest for point i .
  • n representing the total number of points.

5
SAC (1)
  • The spatial autocorrelation coefficient (SAC) is
    proportional to the weighted similarity of the
    point attribute values.

6
SAC (2)
  • The spatial weights in the computations of the
    spatial autocorrelation coefficient may take on a
    form other than a distance-based format. For
    example
  • wij can take a binary form of 1 or 0, depending
    on whether point i and point j are spatially
    adjacent.
  • If tow regions share a common boundary, the two
    centroids of these regions can be defined as
    spatially adjacent wij 1 otherwise wij 0.

7
Gearys Ratio
  • In Gearys Ratio, the similarity attribute
    values between two points is defined
  • The computation of Gearys Ratio

8
Morans I Index
  • In Morans I Index, the similarity attribute
    values between two points is defined
  • The computation of Morans I Index

9
Gearys Ratio vs. Morans I Index
Numerical scales of Gearys Ratio and Morans I Numerical scales of Gearys Ratio and Morans I Numerical scales of Gearys Ratio and Morans I
Spatial Patterns Gearys C Morans I
Clustered pattern in which adjacent or nearby points show similar characteristics 0ltClt1 I gt E(I)
Random pattern in which points do not show particular patterns of similarity C 1 I E(I)
Dispersed pattern in which adjacent or nearby points show different characteristics 1ltClt2 I lt E(I)
E(I) (-1)/(n-1), which n denoting the number of points in distribution E(I) (-1)/(n-1), which n denoting the number of points in distribution E(I) (-1)/(n-1), which n denoting the number of points in distribution
10
Scales of Gearys Ratio and Morans I Index
  • The indexs scale for Gearys Ratio does not
    correspond to our conventional impression of the
    correlation coefficient of the (-1, 1) scale,
    while the scale of Morans I resembles more
    closely the scale conventional correlation
    measure
  • The value for no spatial autocorrelation is not
    zero but -1/n-1
  • The values of Morans I Index in some empirical
    studies are not bounded by (-1,1), especially the
    upper bound of 1.

11
Introduction of Linear Features
  • In a vector GIS database, linear features are
    best described as line objects. The
    representation of geographic features by
    geographic objects is scale dependent.
  • For instance, on a small-scale map (1
    1,000,000), a mountain range may be represented
    by a line showing its approximate location. When
    a large geographic scale is adopted (124,000), a
    polygon object is more appropriate to represent
    the detail of a mountain range.

12
Linear Features
  • Some linear features do not have to be connected
    to each other to form a network. Each of these
    linear segments can be interpreted alone.
    Examples include extensive features such as
    mountain ranges and touchdown paths of tornados.
  • Besides linear geographic features, line objects
    in a GIS environment can represent phenomena or
    events that have beginning locations and ending
    locations. For example, we often use lines with
    arrow to show wind direction and magnitudes.

13
Spatial Attributes of Linear Features
  • Linear features can have attributes just like
    other types of features.
  • Length
  • Orientation and Direction

14
Directional Mean
  • Direction mean is similar to the concept of an
    average in classical statistics. It shows the
    general direction of a set of vectors. It can be
    simplified to 1 unit in length (unit vectors).

15
Spatial Attribute of Network Features (1)
  • In a network database, linear features are linked
    together topologically.
  • The length of a network can be defined as the
    aggregated length of individual segments of
    links.
  • Orientation or direction is also essential. For
    example, the flow direction of tributaries of
    river network should relatively consistent if the
    watershed is not very large or is elongated in
    shape.

16
Spatial Attribute of Network Features (2)
  • Connectivity of a network is how many different
    links or edges are connected to each other.
  • Connectivity matrix store and represent how
    different links are joined together. The labels
    of the columns and rows in the connectivity
    matrix are the IDs or the links in the network.
    If two links are directly joined to each other,
    the cell have a value of 1. Otherwise, the value
    will be 0.

17
Railroads Centering at Washington, D.C.
18
Length Attribute Analysis of Linear Features
  • The spatial dataset for the application example
    is the Breeding Bird Survey Routes of North
    America from the National Atlas.
  • The database includes routes for the annual bird
    survey. Routes for the survey are represented as
    polyline segments.
  • The data describing the Continental Divide the
    Rocky Mountains from the National Atlas is also
    used.

19
Breeding Bird Survey Routes
  • Breeding bird survey routes at 100 miles and
    between 100 and 200 miles from the Continental
    Divide

20
Summary statistics of route report
  • From these descriptive statistics, it is quite
    obvious that the routes closer to the Continental
    Divide have a slightly higher degree of geometric
    complexity than those farther away.
  • Still, we would like to confirm if the difference
    in the mean is due to sampling error or to some
    systematic processes by performing the
    difference-of-means test.

21
Application Example for Network Analysis
  • We use a dataset modified from the shape file of
    major U.S. interstate highways included in the
    dissemination of ArcView GIS by ESRI.
  • The data theme is Roads_rt.shp with 147 line
    segments, which represent major interstate
    highways and some state highways.
  • The data must conform to the properties of a
    planar graph. When two lines cross each other, a
    vertex will be created. However, the highway data
    do not need meet it.

22
Highway networks
23
Mapping the networks
24
Introduction of Polygon Pattern Analyzers
  • The spatial patterns of geographic objects and
    phenomena are often the result of physical of
    cultural-human processes taking place on the
    surface of the earth.
  • Spatial pattern is a static concept since a
    pattern only show how geographic objects
    distribute at one given time.
  • Spatial process is a dynamic concept because it
    depicts and explains how the distribution of
    geographic objects comes to exist and may change
    over time.

25
Spatial Relationships
  • A spatial pattern can generally categorized as
    clustered, dispersed, or random.
  • In clustered case, darker shades representing a
    certain characteristic appear to cluster on the
    western side.
  • In dispersed case, countries with darker shades
    appear to be spaced evenly.
  • In random case, there may be no particular
    systematic structure or mechanism controlling the
    way these polygons are distributed.

26
Types of Patterns
27
Spatial Dependency
  • In classifying spatial patterns of polygons as
    either clustered, dispersed, or random, we can
    focus on how various polygons are arranged
    spatially.
  • We can measure the similarity or dissimilarity of
    any pair of neighboring polygons, or polygons
    within a given neighborhood.
  • When these similarities and dissimilarities are
    summarized for the entire spatial pattern, we
    essentially measure the magnitude of spatial
    autocorrelation, or spatial dependency.

28
Strength of spatial autocorrelation
  • In addition to its type or nature, spatial
    autocorrelation can be measured by its strength.
  • Strong spatial autocorrelation means that the
    attribute values of adjacent geographic objects
    are strongly related.
  • If attribute values of adjacent geographic
    objects do not appear have a clear order or a
    relationship, the distribution is said to have a
    weak spatial autocorrelation, or a random
    pattern.

29
Joint Count Statistics
  • Joint Count Statistics can be used to measure the
    magnitude of spatial autocorrelation among
    polygons with binary nominal data.
  • For interval or ratio data, we may use Morans I
    index, Gearys Ratio C, and G-statistic.
  • These global measures assume that the magnitude
    of the spatial autocorrelation is reasonably
    stable across the study region.

30
Neighbor Definition
  • Elements in spatial weight matrices are often
    used as weights in the calculation of spatial
    autocorrelation statistics or in the spatial
    regression models.
  • The neighboring polygons of X First order
    neighbors, high order neighbors.

31
Binary Connectivity Matrix
  • The cell will either be 0 or 1 in a binary
    matrix.
  • cij1 when the i th polygon is adjacent to the j
    th polygon.
  • cij0 when the i th polygon is not adjacent to
    the j th polygon.

32
Centroid Distance
  • There are several ways to measure the distance
    between any two polygons. A very popular practice
    is to use the centroid of the polygon to
    represent the polygon.
  • There are different ways to determine the
    centroid of a polygon.
  • In general, the shape of the polygon affects the
    location of its centroid. Polygons with unusual
    shapes may generate centroids that are located in
    undesirable locations.

33
Nearest Distances
  • One method to determine the distance between any
    two features is based on the distance of their
    nearest parts.
  • An interesting situation involving the distance
    of nearest parts occurs when the two features are
    adjacent to each other. When this is the case,
    the distance between two features is 0.

34
Conclusions
  • Autocorrelation is needed to understand the
    relationship between locations and observed
    variables
  • Line Pattern Analyzers and Polygon Pattern
    Analyzers are used to udenrstand complex spatial
    processes
  • Network Pattern analysis can be performed using
    advanced mathematical modeling tools
Write a Comment
User Comments (0)
About PowerShow.com