The Needle in the Haystack: Find the Offending File - PowerPoint PPT Presentation

1 / 48
About This Presentation
Title:

The Needle in the Haystack: Find the Offending File

Description:

The Needle in the Haystack: Find the Offending File – PowerPoint PPT presentation

Number of Views:50
Avg rating:3.0/5.0
Slides: 49
Provided by: bob490
Category:
Tags: ey | file | find | haystack | needle | offending

less

Transcript and Presenter's Notes

Title: The Needle in the Haystack: Find the Offending File


1
The Needle in the HaystackFind the Offending
File
  • Robert K. Henry
  • CISSP, GCIH, GCFA
  • Information Security Officer

2
(No Transcript)
3
(No Transcript)
4
(No Transcript)
5
(No Transcript)
6
(No Transcript)
7
(No Transcript)
8
(No Transcript)
9
(No Transcript)
10
(No Transcript)
11
(No Transcript)
12
(No Transcript)
13
(No Transcript)
14
(No Transcript)
15
(No Transcript)
16
(No Transcript)
17
(No Transcript)
18
HR Has an Employee Grievance
  • Hostile Workplace Sexual Harassment
  • Inappropriate/offensive files stored on web
    server and displayed in office
  • College Staff Already Involved

19
College Investigation
  • Course Site Files Deleted
  • Six weeks prior to HR grievance report
  • No Backups!
  • Backup System on the fritz at time files were
    deleted

20
College Investigation
  • How do we get the goods?
  • College systems admin made manual backups to
    local PC drive
  • Not removed from local drive after backup system
    was repaired

21
The Mission
  • Find inappropriate material among 6 GB of mixed
    images, word-processed, and text files.
  • Identify owner/creator of files
  • gt 7000 files

22
Search Options
  • Manual
  • grep
  • ssdeep
  • foremost
  • sorter
  • Content Based Image Retrieval, CBIR
  • Evaluation Criteria
  • Easy!
  • Free!

23
Search Options
  • Manual (The First Responder's Strategy)?
  • Thumbnails
  • Slide Show
  • One-at-a-time
  • zzzzzzzzzzzzzzzzzz!
  • Too much room for error
  • Pretty Inefficient (32 hours of searching)?
  • Two people spent two workdays each going through
    the DVD's

24
Search Options
  • But . . .
  • it worked!
  • Identified inappropriate word-processed files and
    images in one directory on one of the DVDs
  • Due to multiple file copying, creator/owner of
    files doesn't show up in Windows file properties
  • Did I mention the files were uploaded via ftp
    with shared userIDs?
  • Not much accountability!

25
Search Options
  • Theres gotta be an easier way!

26
Search Options-- grep
  • Built-in nix string search command also
    available for Windows?
  • Steps to conduct search with grep (1)
  • Make a forensic image of the disks
  • dd if/dev/sr0 ofdvdimage.img convnoerror,sync

27
Search Options--grep
  • Steps to conduct search with grep (2)
  • Extract Strings
  • Ascii strings first
  • cat dvdimage.img strings --radixd
    dvdimage.img gt dvdimage.str
  • Unicode strings second
  • cat dvdimage.img srch_strings -t d -e gt
    dvdimage.uni.str

28
Search Options--grep
  • Steps to conduct search with grep (3)
  • Examine Strings Files
  • Create dirty word file
  • Use dirty word file to search strings for,
    well, dirty words
  • grep -f dirtyWords.txt dvdimage.str gt
    grepOutput.txt
  • grep -f dirtyWords.txt dvdimage.uni.str gt
    grepOutput.uni.txt

29
Search Options--grep
  • Results
  • process sounds a little involved, however . . .
  • Took about 30 minutes to image DVDs and run
    commands.
  • Not Bad!
  • Identified Word-Processed files with
    inappropriate jokes
  • Doesn't get image files (didn't expect it to)?
  • Doesn't Identify Creator of files
  • Zero non-repudiation
  • Doesn't help investigation confirm or deny
    ownership of files
  • Bonus found survey data with Too Much
    Information
  • Protected student information in clear text

30
Search Options--ssdeep
  • linux and Windows
  • http//ssdeep.sourceforge.net/
  • Uses fuzzy hashing
  • A partial or inexact hashing of files to
    identify similar files
  • Its author, Jesse Kornblum, even uses the phrase
    finding needles in haystacks in his
    documentation!
  • Haven't heard of it being used to find
    questionable pictures, but why not give it a try?

31
Search Options--ssdeep
  • ssdeep! Go find files in the test directory
    that look like files in the homeStuff
    directory!
  • ssdeep -lrd test homeStuff
  • Bummer--
  • Identified exact matches only

32
Search Options--ssdeep
  • Need to try carving out portion of file for true
    fuzziness
  • Skip the first 20 blocks (header info and more)
    of file and cut out the next 70 blocks for the
    hash comparison
  • dd ifdsc00219.jpg of219partial.jpg skip20
    count70
  • Create file for comparison
  • ssdeep dsc00219partial.jpg gt testhash.txt
  • Compare fuzzy hash of image to images in
    directory
  • ssdeep -lrm testhash.txt homeStuff

33
Search Options--ssdeep
  • Results
  • Not Promising
  • Can check for similarities in files on a
    file-by-file basis, but that's too much like a
    manual search
  • Can easily find exact matches
  • so you must have the file you are looking for
    ???
  • However . . .
  • Useful for an intellectual property issue or
    finding known bad files

34
Search Options--foremost
  • linux and Windows
  • http//sourceforge.net/projects/foremost/
  • Identifies files based on a database of file
    headers and footers
  • Find a list of most file headers at
    http//www.wotsit.org

35
Search Options--foremost
  • This is the header of a gzip file displayed in a
    hex editor
  • The gzip header is 0x1f 0x8b 0x08

36
Search Options--foremost
  • foremost o pathToOuptutFile c pathToConfigFile
    pathToImageFile

37
foremost--Results
38
Search Options--sorter
  • linux and Windows
  • perl wrapper for several Sleuthkit tools
  • http//www.sleuthkit.org/
  • Runs against a disk image
  • Finds active or deleted files
  • Then displays thumbnail view of the files

39
Search Options--sorter
  • sorter s d pathToutputFile pathToInputFile

40
Search Options--sorter
  • Results
  • Save many steps compared to foremost
  • Still have a bunch of thumbnails to look through

41
Search Options
  • Theres gotta be an easier way!

42
(No Transcript)
43
Search Options--CBIR
  • Content Based Image Retrieval
  • Commercial Versions Available
  • My Office (me) too cheapdidnt even look into
    commercial options!
  • Free and Open Source
  • imgSeek
  • Linux and Windows
  • http//www.imgseek.net/
  • Gnu Image Finding Tool
  • Linux
  • http//www.gnu.org/software/gift/gift.html

44
Search Options--CBIR
  • ImgSeek Demo

45
Lessons Learned
  • Mission Accomplished!
  • Not so much
  • Found inappropriate material among 6 GB of mixed
    images, word-processed, and text files
  • Failed to identify owner/creator of files
  • Identified a potentially useful tool

46
Lessons Learned
  • Need to develop incident response procedure for
    entire organization
  • Procedure for breaches of Personally Identifiable
    Information and Payment Card data are on the
    books
  • Procedures for responding to HR requests needs
    documentation
  • And needs distribution to de-centralized IT units

47
References
  • The Sleuthkit (includes sorter)
  • http//www.sleuthkit.org/
  • foremost
  • http//sourceforge.net/projects/foremost/
  • ssdeep
  • http//ssdeep.sourceforge.net/
  • imgSeek
  • http//www.imgseek.net/
  • GIFT (Gnu Image Finding Tool)
  • http//www.gnu.org/software/gift/gift.html
  • Presentation available at
  • http//boisestate.edu/oit/iso/HTCIACBIR.ppt

48
Questions?bhenry_at_boisestate.edu
Write a Comment
User Comments (0)
About PowerShow.com