Metda Ontea - PowerPoint PPT Presentation

1 / 7
About This Presentation
Title:

Metda Ontea

Description:

This results to executing more ontology queries and thus consuming more time. ... Michal Laclavik, Martin Seleng, Emil Gatial, Zoltan Balogh, Ladislav Hluchy: ... – PowerPoint PPT presentation

Number of Views:24
Avg rating:3.0/5.0
Slides: 8
Provided by: mriabi
Category:
Tags: martin | metda | ontea

less

Transcript and Presenter's Notes

Title: Metda Ontea


1
Metóda Ontea
  • Pracovná dielna NAZOU
  • 21-23. 9. 2007. Polana

2
Príspevok k stavu poznania
  • Pattern based annotation
  • Podobné metódy
  • C-PANKOW, SemTag
  • Iné jazyky ako anglictina
  • Slovencina
  • Rýchlejie a presnejie ako C-PANKOW
  • Umonuje aj tvorbu intancií, SemTag nie

3
Príspevok k stavu poznania nástroj Ontea
  • Pattern
  • PatternRegExp annotate(), vráti mnoinu resultov
  • Result napr. (Bratislava, regionSettlement)
  • ResultRegExp
  • ResultOnto
  • ResultTransformer
  • LuceneRelevance
  • SesameIndividualSearch
  • SesameIndividualSearchAndCreate
  • TvaroslovnikLemmatizer

NAZOU, 21-23. 9. 2007, Polana
4
Overovanie úpenost (1)
  • Nový experiment pre
  • Ontea creation
  • Ontea Creation indexovanie Experiment s RFTS a
    Lucene indexing
  • Lematizácia

5
Overovanie rýchlost (2)
  • Ontea Creation the instances of ontological
    concepts are created in the input text collection
    based on regular patterns matching.
  • produce OWL ontology files which need to be
    integrated on central machine.
  • Created instances are evaluated by computing
    their relevance using RTFS or Lucene indexing
    tool. The instances with relevance value above
    given threshold are identified as relevant and
    filled in result domain ontology OWL file. (stage
    related to RTFS tool)
  • Ontea Search process for searching annotation
    tags within annotated text similarly to step one
    but using general keyword matching patterns. This
    results to executing more ontology queries and
    thus consuming more time.
  • Last stage integrated produced semantic metadata
    to one knowledge base represented by OWL file.

NAZOU, 21-23. 9. 2007, Polana
6
Overovanie rýchlost (3)
  • 500 job offers documents takes 67 minutes
  • Intel(R) Pentium(R) 4 CPU 2.40GHz
  • About 35000 Slovak offers on the web, many more
    in English language
  • This means that periodic annotation of jobs takes
    78 hours more then 3 days
  • Step 1 and 3 can run as distributed
  • Tests run on 500 job offers documents which takes
    67 minutes
  • This means that periodic annotation of jobs takes
    78 hours more then 3 days
  • When submitting jobs with e.g. 1000 documents of
    job offers on one node 134 minutes 1000 doc
    on 35 nodes in grid 35000 doc
  • (1000 document set 3M)
  • 10 minutes of grid middleware overhead 60
    minutes data integration
  • On grid 204 minutes 3 hours 24 minutes

NAZOU, 21-23. 9. 2007, Polana
7
Publikácie
  • Michal Laclavík, Marek Ciglan, Martin eleng,
    Ladislav Hluchý Empowering Automatic Semantic
    Annotation in Grid, PPAM 2007, Springer, LNCS
  • Michal Laclavík, Marek Ciglan, Martin eleng,
    Stanislav Krajcí, Peter Vojtek, Ladislav Hluchý
    Semi-automatic Semantic Annotation of Slovak
    Texts, SLOVKO 2007
  • Michal Laclavík, Marek Ciglan, Martin eleng
    Ontea Semi-automatic Pattern based Text
    Annotation empovered with Information Retrieval
    Methods NAZOU-ITAT, 2007
  • Michal Laclavik, Martin Seleng, Emil Gatial,
    Zoltan Balogh, Ladislav Hluchy Ontology based
    Text Annotation OnTeA Information Modelling
    and Knowledge Bases XVIII. IOS Press, Amsterdam,
    Marie Duzi, Hannu Jaakkola, Yasushi Kiyoki, Hannu
    Kangassalo (Eds.), Frontiers in Artificial
    Intelligence and Applications, Vol. 154, February
    2007, pp.311-315. ISBN 978-1-58603-710-9, ISSN
    0922-6389.
Write a Comment
User Comments (0)
About PowerShow.com