Kno'e'sis Center: Overview - PowerPoint PPT Presentation

1 / 74
About This Presentation
Title:

Kno'e'sis Center: Overview

Description:

Kno'e'sis Center: Overview – PowerPoint PPT presentation

Number of Views:71
Avg rating:3.0/5.0
Slides: 75
Provided by: jsel3
Category:
Tags: center | hog | kno | overview | sis

less

Transcript and Presenter's Notes

Title: Kno'e'sis Center: Overview


1
Kno.e.sis CenterOverview Accomplishments
  • January 2008 to December 2008

2
Kno.e.sis Background
  • Started LSDIS at University of Georgia (UGA) in
    1994 after a decade in industry
  • One of the three largest (perhaps the largest)
    group in Semantic Web in the US
  • Major influence in Services Computing
  • 70 of funding in CS dept., 60 of GRAs (about
    20)
  • Two successful startups
  • Second hired 35 employees in Athens first
    Internet Software company with 7m payroll
    before being its third acquisition
  • Highly successful students
  • 2006 graduate Dr. Kunal Verma has 1000 citations
    (5 papers over 100 each), successfully competed
    with two Stanford PhDs
  • 2007 graduate Dr. Kemafor Anyanwu 3 papers in
    WWW, faculty at NCSU, Raleigh

3
About myself
  • Educator
  • Philosophy for students learning how to learn
    successful students that have started making
    mark, new courses
  • Researcher
  • Philosophy for research lead through vision and
    build collaborations
  • Areas information integration, workflow,
    semantic web applications to healthcare,
    biomedical research, financial services,
    intelligence
  • 12000 citations, 250 publications, h-index 50,
    100 citations 22, 30 keynotes, 200 invited
    talks
  • 10m high-quality research funds from NIH, NSF,
    DARPA, NIST, ARDA, and others
  • EIC (2) International Journal of Semantic Web
    Info. Systems, Distributed and Parallel Databases
    (Springer) Editor of 2 Springer book series
  • EB (5) IEEE Internet Computing, Applied
    Ontology, etc. Chaired 40 events member/chair
    of 125 program committees
  • W3C standard, community standard
  • Entrepreneur 2 companies based on technology
    developed at university, 5 major commercial
    products, many deployed applications advisory
    boards etc.

4
Kno.e.siss Mission
  • Best in US in two new multidisciplinary research
    areas associated with the Next Generation of the
    Web and Distributed Computing Semantic Web and
    Services Science
  • well recognized in other areas associated with
    Advanced Data Management Analysis with
    applications in Biomedicine, Health Informatics,
    Defense Intelligence, building on strengths in
    traditional computer science areas such as
    databases (including data mining), AI (including
    machine learning and NLP), distributed computing,
    and bioinformatics
  • Some of the best and most progressive courses in
    our areas
  • Graduates that successfully compete with those
    from the best universities
  • Motto Learning how to learn
  • Well rounded Publications with significant
    impact, Leadership, Networking, Communication,
    Teamwork
  • Significant regional impact through technology
    development and transfer (partnership with
    daytaOhio), development of globally competitive
    workforce for Ohio

5
Kno.e.sis People
  • 7 faculty (looking to add 1 more)
  • 17 PhD students
  • 10 moved from UGA with Sheth, one followed
    recently (WSU gained 30 person year GRA
    experience)
  • Few MS students
  • Increasing number of BS students

6
News Events
  • 2007
  • Keynotes 3
  • Invited Talks 6
  • Tutorials 3
  • Conference/Workshop Organization
  • 1 Steering Committee member 15 Program
    Committees members 3 Workshop Co-organizer 1
    Special Session organizer, 1 General Chair, 1
    Poster Chair
  • Editorships
  • Special Issues 1
  • Book Series 2
  • Journal Editorial Board 13
  • Journal EIC 2
  • 2008
  • Keynotes 1
  • Invited Talks 10
  • Conference/Workshop Organization
  • 3 Steering Committee member 1 Publicity Chair
    (student) 9 Program Committees members (4 are
    students) 2 Workshop co-organizer 1 Symposium
    co-organizer 1 Organizer Committee
  • Editorships
  • Special Issues 2
  • Conference Proceedings 1
  • Journal EIC 1

7
Kno.e.sis Center Labs (3rd Floor, Joshi)
  • Amit Sheth
  • Semantic Science Lab
  • Semantic Web Lab
  • Service Research Lab
  • TK Prasad
  • Metadata and Languages Lab
  • Shaojun Wang
  • Statistical Machine Learning
  • Michael Raymer
  • Bioinformatics Lab
  • Guozhu Dong
  • Data Mining Lab
  • Keke Chen
  • Data Intensive Analysis and Computing Lab

8
Funded Research
9
NSF-Funded Research
  • Medium ITR SemDisSemDis Discovering Complex
    Relationships in Semantic Web
  • Sheth PI/PM (sub UMBC, UGA) Oct 2005 - Dec 31,
    2008
  • 1250K total Sheth led part 925K (317K moved
    to WSU)
  • 30 publications, 10 keynotes, funded all or 75
    of PhD duration for 6 and many more for 50
    duration
  • SGER Spatio-Temporal-Thematic Queries of Semantic
    Web Data A Study of Expressivity and Efficiency
  • 146K
  • A National Model for Engineering Mathematics
    Education
  • 500K
  • RI-Small Exploiting Syntactic, Semantic and
    Lexical Regularities in Statistical Language
    Modeling
  • 102K

10
NIH-Funded Research
  • National Heart, Lung, and Blood Institute,
    NIHSemantics Services enabled Problem Solving
    Environment for T.cruzi
  • 1.5m total, 04/08-03/12
  • WSU lead Project PI Prof. Amit Sheth
  • Subcontract to UGA (Co-PI Prof. Rick Tarleton)
  • Subcontract to NCBO, Stanford (Co-PI Prof. Mark
    Musen)
  • NCRR/NIH (WSU subcontr. from UGA
    78K)Integrated Technology Resource for
    Biomedical GlycomicsJuly 2003-June 30, 2008.
  • Part led by Sheth 709,401 of total 6m

11
Funded Research (Selected)
  • daytaOhio/LexisNexis (90K28K)Metadata for
    Timelining Events, Jan 2006 - Aug 2007,
    Research Prasad PI Dong Co-PI
  • AFRL (32K)Sensor Data Mgmt Architecture (part
    of larger SAVig project), Apr -Aug 2007
  • Ohio/WSU (240K)Res. Challenge Award Advanced
    Data Mgmt Resource for Biomedical Research,
    Sheth PI N. Reo, M Raymer Co-PIs
  • OBR (235K)Ohio University Bioinformatics Choose
    Ohio First Scholarship, Prof. Raymer (WSU PI)
    235K in scholarships for Wright State students
  • OBR (235K) Ohio Consortium for Bioinformatics
  • OBR (20K) Research Challenge Enhancement and
    Commercialization of Comparative Web Search Tech

12
Funded Research (Selected)
  • N/A (126K) Trusted Sensor Web PI Sheth with
    Wang and Prasad
  • Henry Jackson Foundation (100K) Human
    Performance Ontology
  • Ohio/WSU (20K)Res. Challenge Enhancement and
    Commercialization of Comparative Web Search
    Technologies, Prof Dong (PI)
  • AFRL/DAGSI (121K)Architecture for Secure
    Semantic Sensor Network Ohio
    Student-Faculty Research Fellowships to two
    Kno.e.sis teams Cory Henson with
    Prof. Amit Sheth and Josh Pschorr with Prof. T.K.
    Prasad

13
Proposals in Progress
  • SAIC Assuring Trust in Semantic Sensor
    Networks Gap Analysis (31K)
  • NSF UIMA-based Collarboration Research A
    Computational Environment for Comparative
    Analyses of Prokaryotic Genomes (404K) NSF
  • NSF Computing for Billions (1,378K)
  • WSU Research Challenge Comparative Analysis and
    Exploration of Collections of Data Clusterings
    (25K)
  • NSF CDI-Type II The Open Metabolomics
    Workspace An inquiry-driven environment for
    metabolomics research and training (569K)
  • NSF Collaborative Research CDI-Type II
    Computationally-Enabled Modeling, Inference and
    Prediction of Synergistic Effectors of Biological
    Systems (569K)

14
Proposals in Progress
  • NSF A Computational Environment for Comparative
    Analyses of Prokaryotic Genomes (404K)
  • NSF Comparative Analysis and Exploration of
    Collections of Data Clusterings (400K)
  • NSF IIISmall - Integrating Sensor Networks
    using Collective Intelligence (400K)
  • NIH TBD (NIDA Secondary Drug Abuse)
  • NIH TBD (Knowledge discovery - multidrug
    treatment)
  • NIH TBD (Metabolomics)

15
Industry Support
  • IBM UIMA Innovation Award (23K Gift) 11/2007
    Sheth
  • UIMA-based Infrastructure for Summarizing
    Casual, Unstructured Text
  • Microsoft Research Award (60K Gift) 02/2008
    Sheth
  • Chatter, Intent, Good Karma, and Contextual
    Advertisements in Social Networks
  • HP Research (50K) 03/2008 - Sheth
  • for research in unstructured text extraction
    01/2008 (25K, 25K to come)
  • Google Research Award (50K) Shaojun Wang
  • for his proposal "Syntactic, Semantic and Lexical
    Language Models for Machine Translation"

16
Collaborations
17
Collaborations
  • AFRL Distributed Collaborative Sensors System
    Technology Branch of Sensor Directorate and SAIC
    - Semantic Sensor Web and Trusted in Distributed
    Sensor Networks
  • AFRL Human Effectiveness Directorate Human
    Performance Ontology and Semantic Search of
    Biomedical Literature
  • CCHMC work in semantic metadata extraction and
    knowledge discovery over biomedical literature.
  • Center for Healthy Communities (CHC), Boonshoft
    School of Medicine semantic Web applications to
    health care informatics enhancing the HIEx
    system
  • Complex Carbohydrate Research Center, University
    of Georgia (glycomics)
  • daytaOhio  a trusted partner connecting
    innovators, entrepreneurs, and investors.
  • eBiquity Group,University of Maryland, Baltimore
    County (Profs. Finin and Joshi), and LSDIS at
    the UGA (Profs. Arpinar, Kochit and Miller) are
    research partners in our NSF-funded Medium ITR
    SemDis project (PI Prof. Amit Sheth).
  • HP Labs Palo Alto  we collaborate in automatic
    ontology creation applications HP funds our
    research in statistical attribute and
    relationship extraction. Our collaboration has
    resulted in HP commercial offering.

18
Collaborations
  • IBM Almaden Services Research Group Services
    Science (Dr. Michael Maximilien)
  • IBM UIMA Innovation Award to our proposal
    UIMA-based Infrastructure for Summarizing
    Casual, Unstructured Text based on Meena
    Nagarajan's summer internship at IBM Almaden in
    2007 and continuing collaborations.
  • Yahoo! Search Relevance Labs Dr. Chen
    collaborates with Yahoo! Labs on developing novel
    web search ranking functions, cross-domain
    ranking, and clickthrough log data mining.
  • Lister Hill National Center for Biomedical
    Communications, National Library of Medicine
    (NIH) National Institute on Drug Abuse (NIH)
  • Tarleton Research Group at University of Georgia
    (Prof. Rick Tarleton) and the National Center for
    Biomedical Ontology at Stanford University (Prof.
    Mark Musen) partner in our National Heart, Lung,
    and Blood Institute, NIH-funded R01 project
    Semantics and Services enabled Problem Solving
    Environment for Tcruzi (PI Prof. Amit Sheth).
  • W3C  WSUniversity is an official member and
    Prof. Sheth is Advisory Committee member of the
    W3C. Kno.e.sis contributions to W3C activities
    include Semantic Web for Health Care Life
    Sciences (HCLS), Semantic Web Service Testbed
    Incubator (SWS Testbed XG), and the RDB2RDF
    Incubator Group. Prof. Sheth co-chairs SWS
    Testbed XG. Karthik Gomadam and AjithRanabahu
    actively participate in SWS Testbed XG Satya
    Sahoo actively participates in RDB2RDF XG, and
    Amit Sheth participates in HCLS.
  • GRIDs Lab University of Melbourne. We are
    jointly developing an open source software
    combining their SWE middleware with our SSW.
  • Also Accenture lab, National Library of Medicine,
    UC- Berkeley, DERI-Ireland, etc.

19
Our Latest Projects
20
Machine Learning and Natural Language Processing
  • Research Area Machine learning and its
    applications to language, speech, image and
    biological signal processing. Research
    projects
  • Exploiting syntactic, semantic and lexical
    regularities in language modeling
  • Syntactic, semantic and lexical language models
    for machine translation
  • Semi-supervised discriminative learning and
    structured prediction

21
Statistical Machine Translation
24 hours!
22
Noisy Channel ModelChinese as Garbled English
The urgent response to
E
Given input C,software chooses Ethat
maximizes p(EnglishE)
xp(ChineseC EnglishE)
StatisticalModel
C
23
What are the models?
  • Source model p(E) language model
  • Guarantees semi-fluent English
  • Channel model p(CE) translation model
  • Stochastically translates each word allows a
    little random rearrangement with high prob,
    words stay more or less put
  • Maximizing p(CE) would give really lousy Chinese
    translation of English
  • Random word translation is stupid need word
    sense from context
  • Random word rearrangement is stupid phrases
    rearrange!
  • This channel has no idea what fluent Chinese
    looks like
  • But maximizing p(E)p(CE) gives a better English
    translation of Chinese because p(E) knows what
    English should look like.
  • Currently trying to make these models less stupid.

24
Mining Data from Social Content
  • Wealth of information available from blogs,
    social networks, chats etc.
  • Free medium of self-expression makes mass
    opinions / interests available
  • Polling for popular culture opinions is easier
  • Social Production undeniably affects markets
  • Results of analysis more effectively tailored to
    specific audience geo-specific retail ads,
    demographic interests in music
  • Challenges
  • Mining content generated in social software is a
    different problem compared to the well-behaved
    text we are used to seeing
  • Atypical sentence constructions
  • Demographic slangs, webisms to express opinions
  • You are so baadd is no longer a bad opinion
  • Try this! Jus heard da album n I was lyk woooman

25
Things we work on
  • Mining popular music artists from chatter on
    MySpace
  • Sentiment extraction, Entity Identification
  • Ambiguous entities smile, it, music,
    yesterday
  • Entity Identification using domain knowledge and
    statistical NLP
  • Sentiment expressions
  • identifying expressions, computing polarities
  • they killed the song is a compliment!
  • Implemented on SoundIndex (BBC IBM)

26
Things we work on
  • Understanding how people write guides what
    analytics we perform on the data
  • Multivariate factor analysis to observe gender
    differences in language usage by men and women in
    their Online Profiles
  • Joint work with UCB

27
Things we work on
  • Eliminating Off-topic noise and Targeted Content
    delivery on social media
  • Example, enabling extremely targeted ads on
    social networks

28
Our Collaborations
1. IBM Semantic SuperComputing Group
Almaden Supporting our free text analysis through
the UIMA Innovation grant 2. Microsoft
AdLabs Supporting our research on targeted advs
in social networks through the Search and
Beyond award 3. Prof. Dr. Marti Hearst, School
of Information, UC Berkeley Joint investigations
on social network profiles
29
Three Dimensions of Information
Thematic Dimension What
Temporal Dimension When
North Korea detonates nuclear device on October
9, 2006 near Kilchu, North Korea
Spatial Dimension Where
30
Using named relationships to connect thematic
entities with spatial locations in a variety of
meaningful ways (different contexts)
E2Soldier
E4Address
lives_at
located_at
located_at
E6Address
lives_at
Georeferenced Coordinate Space (Spatial Regions)
E1Soldier
E1Soldier
occurred_at
E7Battle
assigned_to
participates_in
E8Military_Unit
E8Military_Unit
participates_in
assigned_to
E5Battle
occurred_at
Residency
E3Soldier
Battle Participation
Dynamic Entities
Spatial Occurrents
Named Places
31
Spatio Temporal Thematic Query Processing
Find dams in state of Ohio
Information of Ohio
Dams in Ohio
Dams in counties of Ohio
Query Processing Engine
KB
No information
Alum Creek,Wills Creek,Hoover Dam,
componentscounties,cities,.
Resolve state into components
Ohio counties-dams
Partonomy Resolution Engine
32
Semantic Sensor Web
High-level Sensor
Low-level Sensor
  • How do we determine if the three images depict
  • the same time and same place?
  • same entity?
  • a serious threat?

33
Semantic Sensor Observation Service Architecture
SemSOS Semantic Sensor Observation Service
Interface/Access
SOS Query
52North
SPARQL Query Engine
SML-S/ OM-S
Knowledge Base
Ontologies
Data Collection
Analysis and Reasoning
SML-S/ OM-S
RDF
34
Query What is the weather in region R at time T?
Abduction-based explanation of observations
  • Answer
  • winter weather
  • possible snow storm
  • possible blizzard
  • Trust Report
  • Conflict between sensor 2 4
  • If sensor 2 trusted
  • snow storm, possible blizzard
  • If sensor 4 trusted
  • winter weather

precipitation (2)
temp (1)
windspeed (3)
visibility (4)
35
Description of Semantic Sensor Web
All subscribe to a shared formally defined world
view
Prototype Demo
http//www.opengeospatial.org/projects/groups/sens
orweb
36
Next Generation Mobile Services
?
Tell what you think! Find what you want! Mix em
up..Smashem up Run em anywhere
User/ Developer
AIR
Current State of Web API Search
37
I need a mapping service, a product review
service and a local serach service for
Apihut Beakon Air Speed Dial your Mobile
Apps
38
Timelining News Events
  • Index, search, and visualize News documents
    datasets (150GB) using variety of metadata-based
    Timelines

39
Mining Data from Social Content
  • THINGS WE WORK ON
  • Mining popularity from chatter on MySpace
  • Most talked about artist / product of the week by
    19 yr olds in New York?
  • Sentiment Analysis ( slang ) in casual text
  • Spotting subjective expressions in light of
    language productivity
  • Separating junk from informative content the
    problem of digressing
  • Enabling extremely targeted ads on social networks
  • Wealth of information available from blogs,
    social networks, chats etc.
  • Free medium of self-expression makes mass
    opinions / interests available
  • Polling for popular culture opinions is easier
  • Social Production undeniably affects markets
  • Results of analysis more effectively tailored to
    specific audience geo-specific retail ads,
    demographic interests in music
  • CHALLENGES
  • Mining content generated in social software is a
    different problem compared to the well-behaved
    text we are used to seeing
  • Atypical sentence constructions
  • Demographic slangs, webisms to express opinions
  • You are so baadd is no longer a bad opinion
  • Try this! Jus heard da album n I was lyk
    woooman

Our Collaborations 1. IBM Semantic
SuperComputing Group Almaden Supporting our
free text analysis through the UIMA Innovation
grant 2. Microsoft AdLabs Supporting our
research on targeted advs in social networks
through the Search and Beyond award 3. Prof.
Dr. Marti Hearst, School of Information, UC
Berkeley Joint investigations on social network
profiles
40
Semantic Analytics
UNDISCOVERED PUBLIC KNOWLEDGE
Discovering connections hidden in text
41
Application in Biology
42

Understanding the Genetic-Basis of Nicotine
Dependence Collaborators Gene-Pathway Data
Integration- Complex Carbohydrate Research
Center, University of Georgia
  • Objectives
  • Understand the role of genes in nicotine
    addiction
  • Treatment of drug addiction based on genetic
    factors
  • Identify important genes and use for
    pharmaceutical productions
  • Data Sources
  • Two NCBI gene databases Three biological
    pathway databases

Reference S.S. Sahoo, O. Bodenreider, J.L.
Rutter, K.J. Skinner, A.P. Sheth, An
ontology-driven semantic mash-up of gene and
biological pathway information Application to
the domain of nicotine dependence, J. Biomedical
Informatics (in press)
43
Semantic Provenance Annotation for Data in
protEomics (SPADE)
Collaborators Complex Carbohydrate Research
Center, University of Georgia
  • Objectives
  • Help biologists with automated workflow to
    conduct experiments
  • Automated provenance creation for experimental
    data
  • Resources
  • Web based scientific workflow deployed and in
    use
  • Two large ontologies listed at Open Biomedical
    Ontologies at Stanford University
  • Stanford University and University of Georgia
    partners in recently funded NIH RO1 grant

Reference S.S. Sahoo, C. Thomas, A. Sheth, W.S.
York and S. Tartir Knowledge Modeling and Its
Application in Life Sciences A Tale of Two
Ontologies, 15th Intl WWW2006 Conf., May 2006
44
Extracting from Social Knowledge
  • in Sydney, New South Wales, Australia
  • Sydney is the most populous city in Australia
  • Canberra, the Australian capital city
  • Canberra is the capital city of the Commonwealth
    of Australia
  • Canberra, the Australian capital
  • in Sydney, New South Wales, Australia
  • Sydney is the most populous city in Australia
  • Canberra, the Australian capital city
  • Canberra is the capital city of the Commonwealth
    of Australia
  • Canberra, the Australian capital

We know that countries have capitals. Which one
is Australias?
45
Privacy-Preserving Data Publishing and Mining
Multiparty Collaborative Computing
Single-Party Data Publishing
Data perturbation Cryptographic protocols
Perturbed data
Data Service
G(X) RXTD
Return classifiers
Data Miner Training classifiers
?
G1
Optimal G

G2
Perturbed dataset
Original dataset
Perturbation optimization
G3
  • Classifiers that can use
  • geometric perturbation
  • Kernel methods
  • linear classifiers
  • SVM classifiers

Attacks Attack Analysis
1. Criteria for good perturbation 2. Analysis of
attacks 3. Randomized algorithm
46
Multidimensional Cluster Visualization
47
Data Mining Research Lab
  • Directed by Professor Guozhu Dong
  • Currently has 2 PhD students graduated 3 PhD
    students since 2007
  • Representative areas of interest
  • Contrast data mining
  • Data mining on sequence data and microarray gene
    expression data
  • Data warehousing and OLAP
  • Document collection analysis and management
  • Clustering collection analysis and management
  • Comparative and summarative web search
  • Research has been funded by NSF, AFRL, Ohio
    Agencies, Lexis-Nexis and other private companies

48
Doozer
Building Domain Hierarchies and Connecting
Concepts with Named Relationships
49
Collecting Instances
50
Creating a Hierarchy
51
Creating a Hierarchy
52
Hierarchy Creation - summary
53
MEtaData and Languages Laboratory
  • Semantic Web and Information Retrieval
  • Develop models, query languages and techniques,
    to represent , retrieve, and reason with Web
    Documents and Semantic Web Data
  • Trusted Semantic Sensor Web and Services
  • Apply semantic web technology and abductive
    reasoning techniques for abstracting,
    understanding, diagonising, standardizing, and
    enhancing trust in sensor data
  • Navigation and Visualization of Large Datasets
  • Develop novel techniques and tools for indexing,
    searching, and browsing metadata enriched, large
    (News) datasets
  • Programming Languages Design and
    Implementation
  • Develop novel constructs and algorithms for
    enhancing reliability and efficiency of
    programming languages

54
MEtaData and Languages Laboratory
  • Recent Graduates
  • Ph.D. Graduate Trivikram Immaneni
  • Employed at Technorati.com
  • Dissertation Title A Hybrid Approach to
    Retrieving Web Documents and Semantic Web Data
  • M.S. Graduate Mastan Shaik
  • Employed as Consultant
  • Thesis Title "Design and Implementation of
    Timeline Application for News Documents".

55
Teaching and Education
56
Courses
  • Dr. Sheth
  • Semantic Web
  • Service Science
  • Web Information Systems
  • Dr. Raymer
  • Bioinformatics
  • Comparative Languages
  • Dr. Dong
  • Data Mining
  • Formal Languages
  • Dr. S Wang
  • Natural Language Processing
  • Machine learning
  • Artificial Intelligence
  • Dr. Prasad
  • Information Retrieval
  • Compiler Design Construction
  • Comparative Languages
  • Programming Languages
  • Dr. Chen
  • Privacy-Aware Computing
  • Cloud Computing
  • Data structures and Algorithms

57
Kno.e.sis Students
58
Two Example of Successful PhD students
  • Kunal Verma PhD 2006
  • 1000 citations
  • beat 2 Stanford PhDs for
  • the position he took
  • Kemafor Anyanwu PhD 2007
  • 3 papers in WWW
  • 3 offers from 4 interviews at Research
    Universities
  • Faculty at North Carolina State University,
    Raleigh

59
Student Achievement Overview
60
Meena Nagarajan
  • Internships and Collaborations
  • HP Labs Systems and Storage Team
  • IBM Almaden Research Semantic SuperComputing
    group School of Information, UC Berkeley
  • Professional Achievements (including funding)
  • WSDL-S W3C submission (co-author)
  • IBM UIMA Innovation Award 2007 Primary
    contributor to UIMA-Based Infrastructure for
    Summarizing Casual, Unstructured Text
  • Microsoft's Beyond Search - Semantic Computing
    and Internet Economics Award 2008 Primary
    contributor to Chatter, Intent and Good Karma
    for Targeted Advertising in Social Networks
  • Project Page
  • Analysis of Social Media Content
  • Accurate Entity Identification
  • Sentiment Analysis
  • Content Monetization
  • Program Committee Member
  • CIKM 2007, IEEE ICSC 2008, Ontology Matching
    Workshop (ISWC 2007)

http//knoesis.wright.edu/research/semweb/projects
/socialmedia/
61
Karthik Gomadam
  • Collaborations
  • Almaden, Accenture and UGA on device-independent
    computing applications using SaRest-based search
    (Beakon, Air, and apihut)
  • Summer Internships
  • 2007 IBM India Research Labs
  • 2006IBM TJ Watson Research Center, Advanced
    Delivery Platforms Group
  • 2004 IBM TJ Watson Research Center, Component
    Systems Group
  • Recent Publications
  • ICWS, ICSC, Web Intelligence, IEEE Internet
    Computing
  • Program Committee Membership
  • Semantics4WS (06,07,08) WSCA 2008 IEEE 23rd
    Advanced Information Networking Applications
    (2009) SOCASE 2007, 2008, SOLI 2008
  • Patents
  • Pending distributed error management
  • IBM Innovation Achievement Award for pending
    patent on error management

62
Prateek Jain
  • Research Interest
  • Spatial Temporal Thematic Analysis of Data
  • Collaborators
  • Accenture Technology Labs
  • Summer Internships
  • 2008-Accenture Technology Labs, San Jose,
    CA-Requirements Analysis Tool (RAT)
  • Publications
  • W3C Video on the web workshop(2007)
  • Geospatial Semantics (2007)
  • Ontology-supported Business Intelligence (2008)
  • India Software Engineering Conference (2009)

63
Matthew Perry
Publications 11 highlights Conference paper M.
Perry, A. Sheth, F. Hakimpour, P. Jain,
"Supporting Complex Thematic, Spatial and
Temporal Queries over Semantic Web Data," 2d
Intl Conf.ce on Geospatial Semantics (GEOS '07)
  • Book chapters
  • M. Perry, A. Sheth, I.B. Arpinar. "Geospatial
    and Temporal Semantic Analytics, Encylopedia of
    Geoinformatics, 2008, forthcoming.
  • F. Hakimpour, B.Aleman-Meza, M.Perry, A. Sheth.
    "Spatiotemporal-Thematic Data Processing in
    Semantic Web," The Geospatial Web,
    Springer-Verlag, 2007
  • Program Committee Memberships
  • ODBASE 2008 ESWC 2008 Workshop on Semantic
    Metadata Management GEOS 2007 ISWC 2006
    Workshop TerraCognita Directions to Geospatial
    Semantic Web
  • Internships
  • 2006 Oracle Semantic Technologies Group
  • Collaborations
  • Preparing collaborative paper for VLDB 2008
    Funding
  • Significant contribution to NSF Small Proposal
    "Spatial, Temporal and Thematic Analysis of
    Semantic Web Data"

64
Cory Henson
  • Recent Research Collaborations
  • AFRL and SAIC Trusted Semantic Sensor Web
  • DaytaOhio Sensor Aided Vigilance, Survey on
    Sensor Data Management
  • Internships/Fellowships
  • 2009 University of Melbourne, and CSIRO
  • Australian Commonwealth Scientific and
    Industrial Research Organization)
  • 2008/2009 AFRL/DAGSI
  • Architectures for Secure Semantic Sensor
    Networks for Multi-Layered Sensing
  • Presentations
  • Semantic Technology Conference, 2008 Semantic
    Sensor Web
  • Sensor Standards Harmonization WG Meeting, NIST,
    2008 Semantic Sensor Web
  • Program Committee Memberships
  • International Symposium on Collaborative
    Technologies and Systems (CTS2009)
  • Workshop on Sensor Web Enablement 2009 (SWE2009)
  • Recent Publications on Semantic Sensor Web
  • SemSOS Semantic Sensor Observation Service,
    International Symposium on Collaborative
    Technologies and Systems (CTS2009), Workshop on
    Sensor Web Enablement (SWE2009), 2009.
    (Submitted)
  • Situation Awareness via Abductive Reasoning from
    Semantic Sensor Data A Preliminary Report,
    International Symposium on Collaborative
    Technologies and Systems (CTS2009), Workshop on
    Collaborative Trusted Sensing, 2009. (Submitted).
  • Semantic Sensor Web, IEEE Internet Computing,
    2008.
  • Video on the Semantic Sensor Web, W3C Video on
    the Web Workshop, 2007.

65
Pablo Mendes
  • Collaborations/Funding
  • NIH R01 T.cruzi Problem Solving Environment, with
    UGA and Stanford
  • Expert-stipulated knowledge extraction, with
    CCHMC at Cincinnati
  • Microbial Genome Analysis System, with IOB at UGA
  • Semi-supervised CRFs with Shaojun Wang
    (applications in protein folding and text mining)
  • Recent Publication
  • Dávila, AMR Mendes, PN et al., ProtozoaDB
    dynamic visualization and exploration of
    protozoan genomes, Nucl. Acids Res., 2007
  • Awards
  • Training Innovations in Parasitological Studies
    (TIPS) Fellowship awarded by the Ellison Medical
    Foundation and Center for Tropical and Emerging
    Global Diseases, UGA
  • Research projects 2007
  • Semantic Web application to bioinformatics and
    online communities
  • Automation of data analysis workflows
  • Creation of genome databases (Java, Oracle)

66
Cartic Ramakrishnan
  • Summer Internships
  • 2006 IBM Almaden relationship extraction to
    create ontology for cardiology
  • 2002, 2004 NLM semi-automated development of
    large-scale ontologies
  • Patents
  • Automatically Generated Ontology by combining
    structured and semi-structured knowledge
    sources, by Tanveer Syeda-Mahmood, Cartic
    Ramakrishnan, US patent filed, Docket No.
    ARC920070046US1
  • Journal Articles
  • Blazing Semantic Trails between Web Resources,''
    with A. Sheth, IEEE Internet Computing (2007)
  • Book Chapters
  • Semantics for the Semantic Web The Implicit,
    the Formal, and the Powerful, with A. Sheth C.
    Thomas, chap. 28 in Online and Distance
    Learning, IGI, 2008
  • Geospatial Ontology Development and Semantic
    Analytics, with B. Arpinar, A. Sheth, L. Usery,
    M. Azami M. Kwan, in Handbook of Geographic
    Information Science.
  • Program Committee Member
  • ICSC 2008 2d IEEE Intl Conf. on Semantic
    Computing CIKM 2007 ACM 16th Conf. on
    Information and Knowledge Management CIA 2007
    11th Intl Workshop on Cooperative Information
    Agents
  • External Reviewer
  • KDD 2007 13th ACM SIGKDD Intl Conf. on Knowledge
    Discovery and Data Mining

67
Ajith Ranabahu
  • Research Interests
  • Lightweight services, situational Web
    applications, services for mobile computing,
    cloud computing
  • Recent Publications
  • ICWS (2008),  A Faceted Classification Based
    Approach to Search and Rank Web APIs
  • ICSC (2008),Estimating User Effort to Mediate
    Between Two XML Schemas
  • IEEE Internet Computing (2008), An Online
    Platform for Web APIs and Service Mashups
  • Internship
  • 2007 IBM Almaden Research Group
  • Collaborators
  • Dr Michael Maximilien (IBM Almaden)domain-specifi
    c languages for situational Web?, cloud
    unification
  • Patent filed with IBM
  • Professional Activities
  • Member of the Apache Software Foundation
  • Successful publication of two IBM Alphaworks
    projects (http//services.alphaworks.com)
  • Senior Software Engineer at WSO2 (before joining
    PhD)?

68
Satya Sahoo
  • Research Projects
  • 2008-09NIH Semantics and Services enabled
    Problem Solving Environment for Trypanosoma cruzi
  • 2008-09 Provenance Management Framework (in
    collaboration with Microsoft Research)
  • 2007NCRR/NIH Biomedical Glycomics
  • Summer Internships
  • 2008 Microsoft Research, Technical Computing
    Group (eScience), Redmond
  • 2006, 2007 Lister Hill National Center, NLM/NIH
  • Program Committee Memberships
  • WDPP 2009, SEMAPRO 2009, SAINT 2008, 7th Intl
    Conf. ODBASE 2008 4th Intl Workshop SWSP 07
    Doctoral Consortium, 9th ICEIS07 Symposium on
    Applications Internet (SAINT2007)
  • Contributions to Standards
  • Glycan Data Exchange (GLYDE) standard XML-based
    standard for representation of complex glycan
    structures. Accepted by research institutions
    such as Kyoto Encyclopedia for Genes and Genomes
    (Japan), German Cancer Research (Heidelberg),
    Consortium for Functional Glycomics (MIT)
  • Representing WSU in W3C incubator group for
    Relational Database to Resource Description
    Framework (RDB2RDF)

69
Christopher Thomas
  • Projects
  • Ontology-building in glycobiology information
    extraction from community-generated knowledge
    analysis of community generated knowledge
    extracting relationships from community generated
    content
  • Recent Publications
  • C. Thomas and A. Sheth, Semantic Convergence of
    Wikipedia Articles, WI 2007
  • Summer Internship and Ongoing Collaboration
  • 2007 Hewlett-Packard
  • Program Committee Member
  • COMBEK 2008, Workshop on Community-Based
    Evolution of Knowledge-intensive Systems
  • WISM 2008, 5th Intl Workshop on Web Information
    Systems Modeling
  • Grants Funding
  • HP Labs Funding
  • "Growing Dense Fields of Interest" 50,000

70
Facilities, Support, and Challenges
71
Facilities and Support
  • 3rd floor of Joshi Center
  • Newly furnished, good space with room for two new
    faculty members and their lab
  • Rack of servers adequate for present but will
    need to grow/improve infrastructure before we can
    add more servers Dean has already supported -
    thanks
  • Access to daytaOhio
  • Support for Ms. Davis thanks

72
Good News
73
Good News
  • 2/5/2008
  • From Kevin Haas (Mentor of Meena Nagarajan at IBM
    Almaden Research, now at Yahoo!)
  • BTW, Meena is an absolute find.  If all of your
    other students are as talented, you are very
    lucky.  I definitely will do what I can to see
    Meena with a postdoc from either IBM or Yahoo (or
    anything else I can help her with).  Id
    definitely like to work with more interns of her
    caliber, so Ill happily endorse them for Yahoo
    internships.  If you havent reached out to Dan
    Gruhl yet, he should be able to get them into the
    IBM pipeline as well.

74
Discussion
  • Feedback?
  • Any thoughts on bringing even more value?
  • Your continuing involvement is vital to
    realizing and leveraging the full potential of
    Kno.e.sis!
Write a Comment
User Comments (0)
About PowerShow.com