Harvesting - PowerPoint PPT Presentation

1 / 30
About This Presentation
Title:

Harvesting

Description:

Repeated searches of the electronic directory used to collect email addresses in batches ... By design, the directory is open to everyone in the world ... – PowerPoint PPT presentation

Number of Views:59
Avg rating:3.0/5.0
Slides: 31
Provided by: tompa6
Category:

less

Transcript and Presenter's Notes

Title: Harvesting


1
Harvesting
  • Joy Veronneau
  • JV11_at_Cornell.edu
  • Campus Developers Meeting
  • July 14, 2004

2
What is Harvesting?
  • Repeated searches of the electronic directory
    used to collect email addresses in batches
  • An example from our logs

3
"((snpne)(cornelledumiddlenamepne)(cnpne)
(givenNamepne)(cornelledudeptname1pne)
(uidpne)(edupersonnicknamepne))"
4
There are at least five ways to harvest Cornell's
directory
  • From the directory search web page
  • By email
  • From the command line
  • From a browser
  • Using finger
  • Other?

5
from the electronic directory search web page
  • This is the most common way to harvest the
    directory

Recent attacks cycled through common first names
(John, Mary, Tom, William)
Another used just two letter wildcards like lu
and nh
6
by email
  • Nickname fuzzy name matching
  • Send mail to "somebody_at_cornell.edu"
  • Get back up to 2000 email addresses for names
    containing the string "somebody"

7
(No Transcript)
8
from the command line
  • using ldapsearch with the directory server as the
    host
  • ldapsearch -h directory.cornell.edu -b
    "ouPeople, oCornell University, cus" -x
    "(uidab)"

9
from a browser
  • Using the ldap URL such as

ldap//directory.cornell.edu/ocornell20universit
y,cus??sub?cnparker
10
using finger
  • finger somebody_at_cornell.edu returns up to
    2000 entries including email addresses

11
maybe others?
  • Any finger/LDAP enabled application such as
    Eudora..

12
How does harvesting affect us?
  • Slows down our machinery
  • We get annoying SPAM
  • Users are asking us to do something about it

13
CPU loads on directory server
normal
harvesting
14
What do we currently do to prevent harvesting?
  • Right now, not much
  • By design, the directory is open to everyone in
    the world
  • Limit of 2000 entries returned per search (thats
    high)
  • No warning on web page about proper use of
    information (but are warnings even enforceable?)

15
(No Transcript)
16
What do other universities do to control
harvesting?
17
Yale
18
Yale
19
University of Florida
20
University of Florida
21
University of Florida
22
Georgetown University
23
Georgetown University
24
Georgetown University
25
Georgetown University
26
Stanford University
27
Stanford University
28
Summary of Options
  • Reduce the search limit. Easy to do protects
    against all harvesting methods.
  • Put a warning on the search results page. Easy
    to do but maybe no benefit.
  • Display email addresses only to authenticated
    users and use a clickable mail UI. This would
    take some development work.
  • Allow users to choose whether or not their email
    address will be displayed. This would take a lot
    of development work as well as user education.
  • Display email addresses as graphical images for
    non-authenticated users. Requires further
    investigation.

txt2_at_cornell.edu
Searchable text field
Unsearchable .jpeg image
29
Things to think about
  • Even if we put restrictions on our search page,
    other departments could still publish some
    information we restrict.
  • What if an on-campus department has been
    harvesting information for legitimate purposes?

30
Discussion?
Write a Comment
User Comments (0)
About PowerShow.com