Novelty Detection in ATLAS - PowerPoint PPT Presentation

1 / 17
About This Presentation
Title:

Novelty Detection in ATLAS

Description:

Novelty Detection in ATLAS. Topic Detection and Tracking (TDT) New Event Detection Task in TDT ... on Retrospective and Online Event Detection. Yiming Yang, ... – PowerPoint PPT presentation

Number of Views:33
Avg rating:3.0/5.0
Slides: 18
Provided by: nycLti
Category:

less

Transcript and Presenter's Notes

Title: Novelty Detection in ATLAS


1
Novelty Detection in ATLAS
  • Topic Detection and Tracking (TDT)
  • New Event Detection Task in TDT
  • Existing Problem
  • Topic-Conditioned Novelty Detection
  • Future Work
  • References

2
TDT (Topic Detection and Tracking)
  • Started from 1997
  • Five tasks
  • Topic Tracking
  • Topic Detection
  • New Event Detection (Novelty Detection, a.k.a.
    First Story Detection)
  • Story Link Detection
  • Segmentation
  • Benchmark Evaluation

3
Topic Tracking
4
Topic Detection
5
New Event Detection (a.k.a. First Story Detection)
6
New Event Detection
  • Characteristics (difficulties)
  • Online learning (vs. retrospective learning)
  • Unsupervised learning (vs supervised learning)
  • Large number of targets (events)
  • Harder to correctly predict the first one

7
NED Approach I Cosine Similarity as Similarity
Metric
8
NED Approach I Cosine Similarity Formula
  • Where represents the ith document as a
    vector
  • represents the term weight of the kth
    term in the ith document

9
NED Approach I Use Centroid to Represent Cluster
10
NED Approach II More Information Learning
Machine
Score gt Threshold
Old
New
11
2002 NED Benchmark Evaluation Results
12
Problems in Current Approaches
Story 1 airplane crash Korean 747 . ,
injured dead January 10, investigate
reason
Story 2 TWA-800 airplane crash
investigate dead December unknown people
13
Events and Topics
  • Event an action happening during a certain time
    period and at a certain location.
  • Topic a recurring and broader class of events.

14
Topic-Conditioned Novelty Detection
15
Topic-Conditioned Novelty Detection
16
Future Work
  • Explore new similarity metrics
  • Metrics are the most fundamental item in
    clustering
  • Explore more learning machines
  • Learning Machines can be any efficient online
    regression/classification algorithm, like SVM,
    logistic regression, etc.
  • Use clustering instead of classification at
    topic-level
  • Try to reduce human efforts as far as we can
  • Explore the role of Named Entities (NEs)
  • Named Entities like persons name, location,
    organization, date, etc., would be informative
    for novelty detection, many of them can useful
    features.

17
References
  • Topic-Conditioned Novelty Detection. Yiming Yang,
    Jian Zhang, Jaime Carbonell and Chun Jin. SIGKDD
    2002.
  • A Study on Retrospective and Online Event
    Detection. Yiming Yang, Tom Piece and Jaime
    Carbonell. SIGIR 98
  • The 2001 topic detection and tracking task
    definition and evaluation plan. In
    http//www.nist.gov/speech/tests/tdt/tdt2001/evalp
    lan.htm
  • Nists 1998 topic detection and tracking
    evaluation. J. Fiscus, G. Doddington, J.
    Garofolo, and A. Martin. In proceedings of the
    DARPA Broadcast News Transcription and
    Understanding Workshop.
  • New Event and Link Detection at CMU for TDT 2002.
    J. Carbonell, Y. Yang, R. Brown, J. Zhang, and J.
    Ma.
Write a Comment
User Comments (0)
About PowerShow.com