Statistical Pattern Recognition Part I: Clustering - PowerPoint PPT Presentation

1 / 21
About This Presentation
Title:

Statistical Pattern Recognition Part I: Clustering

Description:

scene object ( car', road', ...) alphabet letter ( a', b', c' ... Insurance: profile identification of low- or high-risk clients. data samples: clients ... – PowerPoint PPT presentation

Number of Views:194
Avg rating:3.0/5.0
Slides: 22
Provided by: claudecch
Category:

less

Transcript and Presenter's Notes

Title: Statistical Pattern Recognition Part I: Clustering


1
Fac. of Comp., Eng. Tech. Staffordshire
University
Image Processing, Computer Vision, and Pattern
Recognition
Statistical Pattern RecognitionPart I
Clustering
Dr. Claude C. Chibelushi
2
Outline
  • Introduction
  • Features
  • Clustering
  • k-means clustering algorithm
  • Pattern Similarity
  • Summary

3
Introduction
  • Examples of pattern recognition tasks

Image scene
  • scene object (car, road, ...)
  • alphabet letter (a, b, c, ...)
  • writers name

Handwriting
  • word (one, two, three, ...)
  • speakers name

Speech
4
Introduction
  • Pattern recognition
  • Categorisation of patterns into a finite number
    of classes
  • Common approaches
  • statistical pattern recognition
  • connectionist pattern recognition is sometimes
    considered as subset of statistical recognition
  • syntactic pattern recognition

5
Introduction
  • Pattern recognition
  • Statistical
  • classification based on statistical distribution
    of patterns
  • Syntactic
  • classification based on structural relationship
    between elements of pattern
  • Connectionist
  • classification using artificial neural networks
    (statistical basis?)

6
Introduction
  • Terminology
  • Feature(s) representation or description of
    salient pattern attribute(s)
  • Class category , group, type
  • patterns with common characteristics /
    properties
  • patterns dissimilar from patterns in other
    classes
  • Classification assignment of class label to
    observed pattern

7
Features
  • Patterns often considered as points in space
  • selected pattern attributes or properties
    (features) represented by
  • numerical measurements
  • linguistic labels
  • space often multidimensional, hence
  • set of numerical property values called feature
    vector

8
Features
  • Examples attributes for
  • pixel-based image segmentation
  • pixel colour, edge strength,
  • object recognition
  • object shape, colour, size,

9
Features
  • Example 2D feature plot for two image types (5
    samples of each)

10
Clustering
  • Clustering unsupervised learning that identifies
    groups of similar data samples
  • partial supervision (labelling of some samples)
    is possible
  • Cluster group of similar data samples, which are
    dissimilar from samples in other groups
  • for pattern recognition sample is synonymous
    with pattern

11
Clustering
  • Example applications
  • Image processing image segmentation into
    homogeneous regions
  • data samples examples
  • pixels
  • sample details pixel colour, edge strength, edge
    direction,
  • image regions (e.g. pixel blobs)
  • sample details region colour, region texture,
    region edge density, region shape, region spatial
    configuration / topology,

12
Clustering
  • Example applications
  • Marketing market segmentation into groups of
    customers with similar buying behaviour
  • data samples customers
  • sample details database of purchases,
    demographics,
  • Insurance profile identification of low- or
    high-risk clients
  • data samples clients
  • sample details database of client claim history,
    demographics,

13
Clustering
  • k-means clustering algorithm
  • Iterative algorithm for finding k clusters
  • number of clusters assumed known
  • Each cluster represented by prototype
  • cluster centroid (mean)

14
Clustering
  • k-means clustering algorithm pseudo code
  • kMeansClustering(dataSamples, k)
  • initialise cluster centroids
  • do
  • // assign each sample to a cluster
  • for each data sample
  • find nearest cluster centroid
  • // update each cluster mean
  • for each cluster
  • update centroid
  • until (centroids change is insignificant)

15
Clustering
  • k-means clustering algorithm
  • Limitations
  • requires prior knowledge of number of clusters
  • may fail to find optimal grouping of samples
  • sensitive to choice of
  • initial cluster prototypes
  • e.g. what if a prototype is far from any data
    sample?
  • distance measure (similarity measure)

16
Clustering
  • k-means clustering algorithm
  • As result of limitations, experimentation is
    often required
  • multiple trials with different
  • k
  • initial prototype values
  • distance measures

17
Pattern similarity
  • Similarity can be tied to geometrical distance
    between points that represent samples
  • Many similarity measures are available
  • e.g. Minkowski metric, cosine similarity measure,
    ...
  • choice is application dependent

18
Pattern similarity
  • Minkowski metric
  • Popular similarity measure
  • Family of distance measurements between two
    points
  • v point (represents feature vector)
  • vk coordinate of point (i.e. feature)
  • m dimensionality of data space (i.e. number of
    features)

19
Pattern similarity
  • Minkowski metric (ctd.)
  • Special cases
  • p 1 gives city-block distance (a.k.a. Manhattan
    distance or taxi-cab distance)
  • equivalent to Hamming distance if observation
    vectors are binary
  • p 2 gives Euclidean distance

20
Pattern similarity
  • Minkowski metric pseudo code
  • sum 0
  • for feature from 1 to m
  • diff featVector1feature featVector2feature
  • pDiff power( abs(diff), p )
  • sum sum pDiff
  • sum power( sum, 1 / p )

21
Pattern similarity
  • Minkowski metric (ctd.)
  • Limitations
  • sensitive to non-uniform scale across features
  • large-scale features may dominate distance
    calculation
  • e.g. when features have different measurement
    units
  • sensitive to correlation between features
  • see limitations of minimum-distance classifier
    (in Part II)

22
Pattern similarity
  • Cosine similarity measure
  • Related to angle between two vectors

Numerator dot product of vi and vj
Denominator product of magnitudes of vi and vj
Denominator effectively acts as scaling factor so
that -1 ? cos(vi, vj) ? 1
23
Summary
  • Clustering unsupervised identification groups of
    similar data samples
  • Features salient pattern attributes
  • k-means clustering algorithm
  • iterative algorithm for finding k clusters
  • each represented by prototype
  • experimentation is often required as result of
    algorithm limitations
  • Geometrical distance or angle can be used as
    similarity measure
  • e.g. Minkowski metric, cosine measure
Write a Comment
User Comments (0)
About PowerShow.com