Title: Searching the World Wide Web
1Searching the World Wide Web
And Why You Cant Find Everything There
Arlington Campus Library George Mason University
2Our Goals
- Directories, portals, and search engines
- How do the engines work?
- Getting the most from your searches
- Invisible Web
- Authoritative Information
3Three Methods of Searching
- Directories
- Portal
- Search Engine
4Directories
- Directories are organized indexes that allow you
to browse through lists of Web sites by subject
or topic. - Directories are created by people.
5Directories
- Excellent for browsing
- Like visiting a library
- Clearly defined subjects
6Who Creates Directories?
- Libraries
- Nonprofit organizations
- Universities
- Dot-Com businesses
- but they are probably portals too
7A Sampling of Directories
- Librarians Index to the Internet www.lii.org/
- Open Directory Project www.dmoz.org
- Looksmart www.looksmart.com
- Yahoo www.yahoo.com
8(No Transcript)
9(No Transcript)
10(No Transcript)
11(No Transcript)
12Portals
- Portals offer a one-stop shopping look.
- Portals include e-mail, chat, auctions, news,
weather, horoscopes, stock info, and more. - Portals want to be YOUR starting point.
13A Sampling of Popular Portals
- Yahoo! www.yahoo.com
- Portals to the World from the Library of
Congress www.loc.gov/rr/international/portals.htm
l - AltaVista www.altavista.com
14(No Transcript)
15(No Transcript)
16(No Transcript)
17Search Engines
- Crawler-based Search Engines
- Spiders aka Crawlers visit websites and some
of their pages periodically, and adds to index - Scans links and adds them to their index
- Returns the information to the index or catalog
- Search engine software sifts the index and ranks
in relevant order - Some are Focused Crawlers
18Search Engine Watch http//searchenginewatch.com
19Other Search Engine Types
- News Search Engines
- Multimedia Search Engines
- Metacrawlers
- Kids Search Engines
- Regional Search Engines
- See http//searchenginewatch.com/links/
20Power Searching
- Command charts for major search engines at
http//searchenginewatch.com/facts/ataglance.html
- Use Advanced Search at the search engine home
page to zero in on relevant sites
21(No Transcript)
22(No Transcript)
23(No Transcript)
24www.google.com
25www.google.com
26Understand Limitations of Search Engines
- Search spiders or crawlers do not crawl in
real time - Lag times getting info to the index vary by
search engine - If a website is not submitted to the search
engine it wont be crawled - Not every page from a website is crawled
- A webmaster can choose to not have a page crawled
- Formats like PDF, Flash, Zip files, executable
programs, and others cannot be searched - The Invisible Web
27Start Your Search Engines Here
- Google www.google.com
- AllTheWeb www.alltheweb.com
- Yahoo www.yahoo.com
- MSN http//search.msn.com
- Why? See http//searchenginewatch.com/links/majo
r.html
28Other Resources for Search Engine Info
- Search Engine Showdown Search Engine Features
http//www.notess.com/search/features/byfeature.sh
tml - Search Engine Watch Search Engine Features for
Searchers http//www.searchenginewatch.com/facts/a
taglance.html - How to Choose a Search Engine or Directory
http//library.albany.edu/internet.choose.html
29Directories Vs Search Engines
- When should you use a directory?
- When you have a broad topic
- When you want experts to recommend sites
- When you want to avoid irrelevant sites
- Examples topics
- Disabilities
- Civil War
- Welfare
30Directories Vs Search Engines
- When should you use a search engine?
- When you have a narrow topic
- When you are looking for a specific website
- When you want to search for a file type or
language - Examples
- Americans with Disabilities Act
- Battle of Gettsyburg
- Welfare to Work
31The Invisible Web 4 Types
- Opaque search engines choose not to index
- The Private Web password protected
- The Proprietary Web registration required
(either fee or free) - The Truly Invisible Web cant search certain
file formats and databases
32Examples of Invisible Web Sites
- Dictionaries http//www.m-w.com
- Telephone Numbers http//www.infospace.com
- Clinical Trials http//www.clinicaltrials.gov
- Library Catalogs http//www.libdex.com/webcats
- Philanthropy and Grant Information
http//lnp.fdncenter.org/finder - Translation Tools http//world.altavista.com
33More Information on the Invisible Web
- The Invisible Web Uncovering Information Sources
Search Engines Cant See, by Chris Sherman and
Gary Price, ZA 4550 .S54 2001 - http//www.invisible-web.net/ website that goes
with the book. Includes a subject directory of
invisible web sites.
34Evaluating Online Information
- Always judge a web page for relevance and
accuracy - Who is the Author?
- What is the Purpose of the site?
- Is the information Fact, Opinion, or Propaganda?
- Is the information Reliable and Credible?
- Is the information Current?
35Evaluating Web Sites Continued
- Can you find this news reported on a legitimate
news website? - Who is the sponsor of the website?
- Are there inconsistencies or inaccuracies in the
information? - If an organization is mentioned by name, does the
organization have any related information on this
website?
36Thank You.!!! ?
- You can find this PowerPoint presentation on the
Arlington Campus Library Website at
http//library.gmu.edu/libinfo/acl.html under
Useful Sites - bottom right of screen.