General Signal Processing and Machine Learning Tools for BCI Analysis (Part-1) - PowerPoint PPT Presentation

1 / 22
About This Presentation
Title:

General Signal Processing and Machine Learning Tools for BCI Analysis (Part-1)

Description:

Md. Zia Uddin Bio-Imaging Lab, Department of Biomedical Engineering Kyung Hee University Abstract Introduction Spectral Filtering Special Filtering Classification ... – PowerPoint PPT presentation

Number of Views:96
Avg rating:3.0/5.0
Slides: 23
Provided by: Bilal8
Category:

less

Transcript and Presenter's Notes

Title: General Signal Processing and Machine Learning Tools for BCI Analysis (Part-1)


1
General Signal Processing and Machine Learning
Tools for BCI Analysis (Part-1)
  • Md. Zia Uddin
  • Bio-Imaging Lab, Department of Biomedical
    Engineering
  • Kyung Hee University

2
Contents
  • Abstract
  • Introduction
  • Spectral Filtering
  • Special Filtering
  • Classification

3
Abstract
  • This presentation is about signal processing and
    machine learning techniques and their
    applications to BCI.
  • Overview of general signal processing and
    classification methods as used in single-trial
    EEG analysis is given.
  • For further study, original publications are
    encouraged.

4
Why ML for BCI
  • Subject wise experiments
  • Subject to subject result variances for same kind
    of experiments.
  • Session wise experiments
  • Session to session huge result variability for
    the same person.
  • Real time experiments
  • The system needs to identify the subjects mental
    state from single trial.
  • Much more complexity arises.
  • Solution
  • A session and user brain signature adaptable
    system is necessary to overcome the subject to
    subject and session to session huge variability.

5
Why Preprocessing?
  • Relevant information extraction is difficult
    because of large dimensional data (i.e., Curse of
    dimensionality).
  • Dimensionality has to be reduced keeping the
    discriminative information and eliminating
    undiscriminative information.
  • Most of the classification methods calculate
    covariance matrix of the data for further feature
    analysis. Huge covariance matrix is required in
    the case of large dimensional data.
  • Thus, Prepropcessiong steps regarding
    dimensionality reduction is required
  • In some cases
  • A priory knowledge is used (e.g., spatial Laplace
    filter at predefined scalp locations)
  • Automatic methods (e.g., spatial filters
    determined by common spatial pattern analysis)

6
Spectral Filter FIR IIR
  • Common approach is to use digital frequency
    filter
  • To consider desired frequency range
  • Two sequences of poles (a) and zeroes (b) with
    length na and nb are necessary that can be
    calculated by Butterworth or elliptic.
  • The source signal x is filtered to y as
  • a(1)y(t)b(1)x(t) b(2)x(t-1) ...
    b( nb )x(t- nb -1) a(2)y(t-1) -...- a( na )y(t-
    na -1)
  • Where a and na are constrained to be 1, is called
    FIR filter (i.e., considering all zeros).
  • Advantage of FIR
  • Produce steeper slopes in between pass and stop
    band.
  • In most of the BCI applications, band pass filter
    is required to consider specific frequency range.

7
Spectral Filter Fourier-Based Filter
  • A good alternative than FIR and IIR is to use
    temporal Fourier-based filtering in BCI.
  • A signal switches from temporal to the spectral
    domain.
  • The filtered signal is obtained by choosing
    suitable weighting to the relevant frequency
    components and applying Inverse Fourier
    Transformation (IFT).
  • The short time window determines the frequency
    resolution

8
Spatial Filter Bipolar Filtering
  • EEG channels are measured as voltage potential
    relative to a standard reference (referential
    recording).
  • Also, it is possible to record all the channels
    as voltage difference between the electrode
    pairs.
  • From referential EEG, bipolar channels can be
    obtained by subtracting the respective channels
  • FC4-CP4(FC4-ref) - (CP4-ref)FC4ref -CP4ref
  • Reduces the effect of local smearing by computing
    local gradient.
  • Focuses on the local activity while contributions
    of more distant sources are attenuated

9
Spatial Filter Common Average Reference
  • The mean of all EEG channels are subtracted from
    each channel to get the common average reference
    signals.
  • Reduces the influence of far field sources but
    may introduce some undesired spatial smearing
  • Artifacts of one channel may spread to all other
    channels.

10
Spatial Filter Laplace Filtering
  • More localize filter can be obtained through
    this.
  • Laplace signals are obtained by subtracting the
    average of surrounding electrodes from each
    individual channel.
  • C4Lap C4ref- ¼(C2ref C6ref FC4ref CP4ref)
  • The choice of surrounding channels determine the
    characteristics of the filter.
  • Usually, small Laplacians are used (as example
    given above).
  • Large Laplacians use neighbors at 20 distance
    as defined in international 10-20 system.

11
Spatial Filter Principle Component Analysis(1)
  • Represent multidimensional data with fewer number
    of variables retaining main features of the data.
  • It is inevitable that by reducing dimensionality
    some features of the data will be lost. It is
    hoped that these lost features are comparable
    with the noise and they do not tell much about
    underlying population.
  • The method PCA tries to project multidimensional
    data to a lower dimensional space retaining as
    much as possible variability of the data.
  • Its simplicity makes it very popular. But care
    should be taken in applications. First it should
    be analyzed if this technique can be applied.

12
Spatial Filter Principle Component Analysis(2)
  • Orthogonal directions of greatest variance in
    data
  • Projections along PC1 discriminate the data most
    along anyone axis
  • First principal component is the direction of
    greatest variability (covariance) in the data.
  • Second is the next orthogonal (uncorrelated)
    direction of greatest variability
  • So first remove all the variability along the
    first component, and then find the next direction
    of greatest variability
  • And so on

13
Spatial Filter Principle Component Analysis(3)
  • We can ignore the components of lesser
    significance.
  • We do lose some information, but if the
    eigenvalues are small, we dont lose much
  • n dimensions in original data
  • calculate n eigenvectors and eigenvalues
  • choose only the first p eigenvectors, based on
    their eigenvalues
  • final data set has only p dimensions

Basic Steps
Eigenplot
14
Visual Evoked Potential Extraction from Single
Trial EEG signals using PCA filtering(1)
  • Problem Definition
  • Remove the noise to get VEP in the single trial
    29 channels EEG data without ensemble averaging
  • Technique adopted to solve the Problem
  • Selection of principal components as basis for
    the reconstruction of signal
  • Methodology
  • Given signal is divided into an ensemble of
    signals, for each channel
  • An ensemble average for each channel is obtained
    as a reference
  • Apply PCA to find out the orthonormal
    eigenvectors which are used as basis for signal
    approximation
  • Selection of Principal components as basis by
    looking at the frequency components present in
    the prototype signal i.e. the averaged signal

15
Visual Evoked Potential Extraction from Single
Trial EEG signals using PCA filtering(2)
Original Signal
Single epoch after PCA filtering
Reconstructed epoch stacks
16
Spatial Filter Independent Component Analysis(1)
  • Basically ICA is applied for Blind Source
    Separation (BSS)
  • Assume an observation (signal) is a linear mix of
    unknown independent source signals
  • The mixing (not the signals) is stationary
  • We have as many observations as unknown sources
  • To find sources in observations
  • Need to define a suitable measure of independence
  • For example - the cocktail party problem
    (sources are speakers) Find Z
  • Formal Statement
  • N independent sources Zmn ( M xN )
  • linear square mixing Ann ( N xN )
  • produces a set of observations Xmn ( M xN )
  • .. XT AZT

17
Spatial Filter Independent Component Analysis(2)
  • demix observations XT ( N xM ) into YT WXT
    YT ( N xM ) ? ZT W ( N xN )
    ? A-1
  • How do we recover the independent sources?
  • (We are trying to estimate W ? A-1 )
  • . We require a measure of independence!

18
Spatial Filter Independent Component Analysis(3)
  • The source signals are mixed by random non
    orthogonal matrix
  • JADE algorithm was applied to demix the signals
  • After reordering and scaling, the demixed signals
    are very similar to sources.
  • PCA would fail here as the mixed signals are not
    orthogonal to each other, which is the key
    assumption of PCA.
  • Other ICA algorithms
  • Infomax
  • FastICA

19
Spatial Filter Independent Component
Analysis(4) Applying ICA to single-trial EEG
epochs
  • Data Collection
  • EEG data were recorded from 31 scalp electrodes
  • 29 placed at locations based on a modified
    International 10-20 system
  • one placed below the right eye (VEOG),
  • one placed at the left outer canthus (HEOG). All
  • 31 channels were referred to the right mastoid
    and were digitally sampled for analysis at 256
    Hz with a 0.01- to 100-Hz analog bandpass plus a
    50-Hz lowpass filter.
  • Subjects participated in a 2-hour visual spatial
    selective attention task in which they were
    instructed to attend to filled circles flashed
    in random order in five locations.
  • Component IC1, generated by blinks
  • IC4 generated by temporal muscle activity.

20
Spatial Filter Independent Component
Analysis(5) Applying ICA to single-trial EEG
epochs (2)
  • The scalp maps and power spectra of the 31
    independent components derived from target
    response epochs from a 32-year-old autistic
    subject.
  • Blink and eye movement artifact components (IC1
    and IC9) had a typical strong low frequency
    peak.
  • Temporal muscle artifact components (i.e., ICs
    14, 22, 27, and 29) had characteristic focal
    optima at temporal sites and power plateaus at 20
    Hz and higher.

21
Conclusion
  • Next class
  • more classification techniques and some
    practical examples.

22
  • Thank you
Write a Comment
User Comments (0)
About PowerShow.com