Karen Fraser, 16G06 - PowerPoint PPT Presentation

1 / 24
About This Presentation
Title:

Karen Fraser, 16G06

Description:

Risen from 5th before doing anything else is this because I had added content? ... Spider/robot 'crawls' Web, collecting URLs and keywords extracted from pages ... – PowerPoint PPT presentation

Number of Views:30
Avg rating:3.0/5.0
Slides: 25
Provided by: karenfr1
Category:

less

Transcript and Presenter's Notes

Title: Karen Fraser, 16G06


1
Karen Fraser, 16G06
  • email, k.fraser_at_ulster.ac.uk, website,
    karenfraser.co.uk

2
Search Engine Optimisation
  • 27th October 2006 4th on first page
  • Risen from 5th before doing anything else is
    this because I had added content?

3
Search Engine Optimisation
  • Added address to author
  • Added name to keywords
  • Removed comma in title and replaced it with
    hyphen
  • Added the meta data to each page
  • Updated metadata on every page

4
Search Engine Optimisation
13th November 2007 Google 1st (me) 2nd (Uni)
Yahoo 1st (me) 4th (Uni) Altavista 1st
(me) 4th (Uni) MSN search 1st (me) 2nd
(Uni) Lycos/Jeeves 1st (me) - gt 100 Uni)
  • 7th November 2006
  • Google 5th (me) 2nd (Uni)
  • Yahoo 5th (me) -14th (Uni)
  • Altavista 1st (me) 2nd (Uni)
  • MSN Search 1st (me) 2nd (Uni)
  • Lycos/Jeeves dont want to talk about it!
  • (bottom of page 5 50th!!!)

5
Search Engine Optimisation
  • Content is king! There is no substitute for it
  • PageRanking
  • Check site for errors
  • Everyone add a link
  • Give away something useful
  • This Semesters challenge
  • To find site when the following are entered into
    the search engine
  • Karen Frazer
  • Karen Frasier

6
Hypertext and Hypermedia
  • Digital Multimedia, 2nd edition
  • Nigel Chapman Jenny Chapman
  • Chapter 12

7
Hypertext
  • Text augmented with links
  • Link pointer to another piece of text in same or
    different document
  • Navigational metaphor
  • User follows a link from its source to its
    destination, usually by clicking on source with
    the mouse
  • Use browser to view and navigate hypertext

8
Cursory History
  • Memex V Bush, 1945
  • Concept of linked documents photo-mechanical
    realization never implemented
  • Xanadu Ted Nelson, late 1960s/early 1970s
  • Intended as global system
  • Hypercard Apple, 1987
  • Shipped with every Mac popularized concept
  • World Wide Web 1992

9
Non-linearity
  • Hypertext not usually read linearly (from start
    to finish)
  • Links encourage branching off
  • History and back button permit backtracking
  • Not an innovation, but the immediacy of following
    links by clicking creates a different experience
    from traditional non-linearity (e.g.
    cross-references in encyclopedia)

10
Links
  • Simple unidirectional links
  • Connect single point on one page with a point on
    another page (e.g. WWW)
  • Extended links
  • Regional links (ends may be regions within a
    page)
  • Bidirectional links (may be followed in both
    directions)
  • Multilinks (may have more than two ends)

11
Browsing Searching
  • Browsing retrieve information by association
  • Follow links, backtrack
  • Maintain history, bookmarks
  • del.icio.us
  • Searching retrieve information by content
  • Construct indexes of URLs
  • Search by keyword/description of page

12
Search Engines
  • Document retrieval system
  • enterprise search engines
  • personal search engines
  • mobile search engines
  • 1st search engine developed in 1990 by a student
    in Canada (Alan Emtage)
  • Followed by Gopher in which used keywords to
    search
  • WebCrawler was launched in 1994
  • XML and RSS now being used to index

13
Web Indexes
  • Manual (Yahoo!, Open Directory Project,)
  • Classify sites on basis of human evaluation of
    their content
  • Navigate hierarchy, or search entries by keyword
  • Automatic (Google, AltaVista,)
  • Spider/robot 'crawls' Web, collecting URLs and
    keywords extracted from pages
  • Highly efficient search engine processes queries

14
Google
  • Started as a research project by 2 PhD students
    1996 (Larry Page and Sergey Brin)
  • Became popular around 2001
  • Uses keywords
  • PageRank and crosslinks (remembered by the
    algorithm)
  • Uses minimalist user interface (since been
    copied)
  • Impact on society - Google is now a verb

15
Yahoo, Microsoft, Ask.com
  • Yahoo also started at Stanford by PhD students
    Feb 94
  • David Filo and Jerry Yang
  • Until 2004 Yahoo used the Google search engine
  • Microsoft Windows Live Search
  • 2004 beta version 2006 live version
  • Ask.com
  • In February 2006, Ask Jeeves was rebranded as
    Ask.com.

16
Automatic Indexing
  • Must extract keywords automatically from pages
  • Apply heuristics to identify meaningful words
    within text
  • Use metadata added by page's author
  • ltmeta name"keywords" content""gt
  • ltmeta name"description" content""gt
  • Google applies weighting based on number of links
    pointing to a page

17
URLs
  • Uniform Resource Locators
  • Resource is something that can be accessed by a
    higher level Internet protocol
  • Often a file, but may be dynamically generated
    data
  • The way in which data can be accessed is
    constrained by the protocol used
  • e.g. mailbox

18
URL Syntax
  • Protocol // domain name / path
  • N.B. This is a slight simplification, covering
    the most common usage
  • e.g. http//www.digitalmultimedia.org/Materials/ke
    ypoints.html
  • Domain name identifies a host within a
    hierarchical naming scheme
  • Path is like Unix pathname segments separated by
    /s, identify resource in a hierarchy (e.g. file
    system)

19
URL Paths
  • Complete specification of the location of a file
    containing HTML
  • e.g. /Materials/index.html
  • Implicit specification of a standard file within
    a directory
  • e.g. /Materials/
  • Specification of a program that generates HTML
    dynamically
  • In special place (cgi-bin) or identified by
    extension (e.g. .php)

20
Partial URLs
  • URL with some of the leading components missing
  • Missing components filled in from the base URL of
    the document in which the partial URL occurs
  • Usually, base URL is the URL used to retrieve the
    document, but it can be set explicitly with
    ltbasegt tag

21
Fragment Identifiers
  • Links can point to a location within a page
  • URL identifies the entire page
  • Append a fragment identifier to a URL
  • name
  • e.g. http//www.digitalmultimedia.org/index.htmlt
    op
  • Use a named anchor to identify the corresponding
    location in the page

22
HTML Hypermedia
  • href of an a element might not point to an HTML
    file
  • Server response will include MIME type when
    resource is retrieved (deduced from extension)
  • Browser will either
  • Deal with data itself
  • Call on a helper application to display the
    retrieved resource externally
  • Use a plug-in to display it in browser window

23
Hypermedia Markup
  • If non-textual data is rendered within the
    browser, can integrate images, video, etc within
    Web page
  • img element is established way of embedding
    bitmapped images (GIF, JPEG, PNG)
  • object element can be used for any type of
    embedded data
  • embed element not standard, but widely supported
    for embedding video, audio and applets

24
Next week
  • Networks
  • Tomorrow
  • we will look at importing sound into your Flash
    Project
Write a Comment
User Comments (0)
About PowerShow.com