Natural Language Processing for - PowerPoint PPT Presentation

1 / 1
About This Presentation
Title:

Natural Language Processing for

Description:

Natural Language Processing for Internet Security: the AMiCA project V. Hoste, W. Daelemans, G. De Pauw, E. Lefever, B. Desmet, S. Schulz, B. Verhoeven & C. Van Hee – PowerPoint PPT presentation

Number of Views:67
Avg rating:3.0/5.0
Slides: 2
Provided by: Louise188
Category:

less

Transcript and Presenter's Notes

Title: Natural Language Processing for


1
Natural Language Processing for Internet
Security the AMiCA project
V. Hoste, W. Daelemans, G. De Pauw, E.
Lefever, B. Desmet, S. Schulz, B. Verhoeven C.
Van Hee
Rationale
Project overview
  • Young people spend a lot of time online
  • Online environments are not without risks
  • Unfeasible for stakeholders to keep track of
    potentially harmful situations
  • Protection detect and curate threats

Development
Grounding
Issues and risks of social media use
AMiCA kernel
Platform
Dataflow management
Context mining analysis
Manual monitoring infeasible because of
information overload
Urgent demand for automatic monitoring
Validation 3 use cases
Automutilation suicidal behavior
AMiCA Goals
Cross-media analysis
Core technologies
  • Detection and filtering of unwanted and illegal
    online content
  • Cross-media analysis (text, image, video)
  • Context and profile analysis
  • Aggregated data gt quantitative information on
    risk incidence
  • Embedded monitoring and privacy by design

Text Analytics
Transgressive sexual behavior
Image Processing Audio Mining
Cyberbullying
Text analytics
Normalisation
  • Translate noisy language into its canonical form
  • Approaches spelling correction, machine
    translation,
  • G2P2G, classification,

Original Normalized
hey sarahke tis al lang gelde dak hier ng op ben geweest ma hey bffl eh ) hey sarahke het is al lang geleden dat ik hier nog op ben geweest maar hey best friends for life he )
Deep text analytics
Profiling
  • Automatic extraction of information about the
    author of a text identity, gender, age,
    educational level, personality, etc.
  • Challenges single out feature types and
    discriminative methods that are able to
    efficiently deal with large author set sizes,
    small data sizes, and a variety of topics and
    genres
  • Text analysis pipeline that automatically
    analyzes text up to the level of discourse
  • Modules that deal with non-propositional aspects
    of meaning (e.g. modality, negation) , necessary
    for filtering and mining social media

Frame-based detection
  • Script temporal sequence of event frames with
    different roles (participants, action, location,
    time, )
  • Script detection through an ensemble of
    classifiers trained on the detection of
    participant features and their interactions

Transgressive sexual behaviour script with
series of event frames in which participants
(minor, adult) experience a number of grooming
steps Cyberbullying script with series of event
frames in which participants (bully, bystander,
victim) experience a number of interactions
with the support of
Write a Comment
User Comments (0)
About PowerShow.com