Semi-Automatic Semantic Annotation for Hidden-Web Tables

Slides: 22
Provided by: cui1
Learn more at: https://www.deg.byu.edu

Transcript and Presenter's Notes

1
Semi-Automatic Semantic Annotation for Hidden-Web Tables
  • Cui Tao, David W. Embley
  • Data Extraction Research Group
  • Department of Computer Science
  • Brigham Young University

Supported by NSF
2
Semantic Annotation
  • The Hidden Web
      • Hidden behind forms
      • Hard to query

3
Semantic Annotation
  • The Hidden Web
      • Hidden behind forms
      • Hard to query

"... to find the protein and the amino-acid information for gene cdk-4"
4
Semantic Annotation
  • The Hidden Web
      • Hidden behind forms
      • Hard to query
  • Semantic annotation
      • Machine-understandable
      • Publicly accessible

5
System Overview
  • Initial semantic annotation
      • Manually annotate a sample page
      • With respect to a selected ontology
  • Table interpretation
      • Automatic
      • Tables from hidden web pages
  • Final semantic annotation
      • Automatic
      • Annotate interpreted tables

6
Initial Semantic Annotation
  • SMORE: Semantic Markup, Ontology and RDF Editor
    (Maryland Information and Network Dynamics Lab)

7
(No Transcript)
8
Table Interpretation
  • Table interpretation
      • Locate labels and values
      • Pair labels with values
      • Remember path
  • TISP: Table Interpretation by Sibling Pages
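
The pairing step above can be sketched as follows. This is a minimal illustration, not the TISP implementation: it assumes the cells of one table row have already been classified as labels or values, and that each value belongs to the nearest label before it. The `Cell` tuple and the function name are hypothetical.

```python
from typing import List, Optional, Tuple

# Hypothetical cell record: (kind, text, path), where kind is "label" or "value"
# and path is whatever locator was remembered for the cell.
Cell = Tuple[str, str, str]


def pair_labels_with_values(cells: List[Cell]) -> List[Tuple[str, str, str]]:
    """Pair each value with the most recent label seen before it.

    Returns (label, value, value_path) triples. Values that appear
    before any label are skipped.
    """
    pairs = []
    current_label: Optional[str] = None
    for kind, text, path in cells:
        if kind == "label":
            current_label = text
        elif current_label is not None:
            pairs.append((current_label, text, path))
    return pairs
```

Keeping the path next to each pair is what lets the same position be looked up again on other pages generated from the same form.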

9
TISP
10
Interpretation Technique: Sibling Page Comparison
Same
11
Interpretation Technique: Sibling Page Comparison
Almost Same
12
Interpretation Technique: Sibling Page Comparison
Different
Same
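
One way to picture the comparison on these slides: cells whose text is identical across sibling pages are likely labels, and cells whose text differs are likely values. The sketch below is a simplified illustration under strong assumptions (two tables of exactly the same shape, already parsed into rows of cell text). It does not handle the "Almost Same" case, and the function name is hypothetical.

```python
from typing import List

Table = List[List[str]]  # rows of cell text


def classify_cells(page_a: Table, page_b: Table) -> List[List[str]]:
    """Mark each cell position as "label" (same text on both sibling
    pages) or "value" (text differs between the pages).

    Assumes both tables have identical shape.
    """
    classified = []
    for row_a, row_b in zip(page_a, page_b):
        classified.append(
            ["label" if a == b else "value" for a, b in zip(row_a, row_b)]
        )
    return classified
```

A value that happens to be identical on both pages would be misclassified as a label here; comparing more than two sibling pages reduces that risk.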
13
Interpretation Technique: Sibling Page Comparison
Structure Pattern of a Table
Label Path: Identification.Gene model(s).Gene Model
XPath: html[1]//table[3]/tr[1]/td[2]/table[1]/tr[6]/td[2]/table[1]/tr[2]/td[1]
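
A positional path like the one on this slide can be computed for any cell and then replayed on a sibling page. The sketch below uses Python's standard `xml.etree.ElementTree` and assumes the page is well-formed XML (real HTML would need an HTML parser first). The helper name `positional_path` is hypothetical.

```python
import xml.etree.ElementTree as ET


def positional_path(root: ET.Element, target: ET.Element) -> str:
    """Build a path such as ./body[1]/table[1]/tr[2]/td[2] from root to target.

    Each step records the 1-based position of the node among its
    same-tag siblings, so the path can be passed to root.find().
    """
    # ElementTree nodes have no parent pointer, so build a child -> parent map.
    parent_of = {child: parent for parent in root.iter() for child in parent}
    steps = []
    node = target
    while node is not root:
        parent = parent_of[node]
        same_tag = [c for c in parent if c.tag == node.tag]
        steps.append(f"{node.tag}[{same_tag.index(node) + 1}]")
        node = parent
    return "./" + "/".join(reversed(steps)) if steps else "."
```

Applying the remembered path to another page with the same structure (`other_root.find(path)`) should land on the cell in the same position.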
14
Annotation
Protein Name
15
Annotation Split
Nucleotide Size
16
Annotation Merge
Protein Information
17
Annotation Union
Name
18
Annotation Selection
Molecular Function
19
Generated RDF Annotation
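
The transcript does not show the generated RDF itself. As a rough illustration of what turning label-value pairs into RDF could look like, the sketch below emits N-Triples lines. The URIs, the way labels become predicate names, and the function name are all assumptions made for this example, not the system's actual output format.

```python
from typing import Iterable, Tuple


def to_ntriples(
    subject_uri: str,
    pairs: Iterable[Tuple[str, str]],
    predicate_base: str,
) -> str:
    """Serialize (label, value) pairs as N-Triples with plain string literals.

    The predicate is predicate_base plus the label with spaces removed
    (an assumption for this sketch). Only backslashes and double quotes
    are escaped, so values containing newlines are not supported.
    """
    lines = []
    for label, value in pairs:
        predicate = predicate_base + label.replace(" ", "")
        literal = value.replace("\\", "\\\\").replace('"', '\\"')
        lines.append(f'<{subject_uri}> <{predicate}> "{literal}" .')
    return "\n".join(lines)
```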
20
Querying Annotated Data
"... to find the protein and the amino-acid information for gene cdk-4"
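
Once pages are annotated, a request like this one reduces to pattern matching over (subject, predicate, object) statements. The transcript does not say which query language the system uses, so the sketch below is a plain in-memory pattern matcher standing in for a real RDF query engine. The function name is hypothetical, and `None` acts as a wildcard.

```python
from typing import List, Optional, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)


def match(
    triples: List[Triple],
    s: Optional[str] = None,
    p: Optional[str] = None,
    o: Optional[str] = None,
) -> List[Triple]:
    """Return the triples that agree with every non-None argument."""
    return [
        t
        for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    ]
```

For the query on this slide, one would match on the gene as subject and read off the predicates of interest from the results.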
21
Summary
  • Semi-automatic semantic annotation for hidden web
    tables
  • Facilitates large-scale annotation of the web