Title: Efficient Plagiarism Detection Using a Rolling Hash Scheme
1Efficient Plagiarism Detection Using a Rolling Hash Scheme
2Manual Plagiarism Detection
3Select a Sentence Fragment
4Search with Google
7Downsides to Manual Detection
- Boring
- Time Consuming: 20 students × 10 minutes per paper = 3 hours 20 minutes
- Repetitive: select sentence, copy, switch to browser, paste, search, repeat
12Adler32 Hash
All Strings of Binary Data -> Adler32 Function -> 32 Bit Integers
13Adler32 Hashes
Variable Length String
Now is the time for all good men
Now is the time for all good den
Now is teh time for all good men
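A minimal sketch of hashing the three example strings, assuming Python's standard `zlib.adler32` as the Adler32 implementation (the slides do not name one). Each variable-length string maps to a single 32-bit integer, and a one-letter substitution or a transposition yields a different value.

```python
import zlib


def adler32_of(text):
    """Return the Adler32 checksum of the string's UTF-8 bytes (unsigned 32-bit)."""
    return zlib.adler32(text.encode("utf-8")) & 0xFFFFFFFF


samples = [
    "Now is the time for all good men",
    "Now is the time for all good den",  # one letter substituted
    "Now is teh time for all good men",  # two letters transposed
]

# Map each sample string to its checksum.
hashes = {s: adler32_of(s) for s in samples}

for s, h in hashes.items():
    print(f"{h:#010x}  {s}")
```

The hash differs for all three strings, so comparing hashes is enough to tell these near-identical sentences apart.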
15Rolling Hashes
Now is the time for all good men to come to the
aid of their country
Hash Function
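The idea of a rolling hash can be sketched as follows. This is an illustration under assumptions, not the presenter's code: it slides a fixed-size byte window over the sentence and updates the two Adler32 halves in constant time per step instead of rehashing the whole window. The function names and the window size of 16 are hypothetical.

```python
import zlib

MOD = 65521  # largest prime below 2**16, the modulus used by Adler32


def adler32_parts(window):
    """Compute the two 16-bit halves (a, b) of Adler32 from scratch."""
    a, b = 1, 0
    for byte in window:
        a = (a + byte) % MOD
        b = (b + a) % MOD
    return a, b


def roll(a, b, size, out_byte, in_byte):
    """Slide the window one byte: drop out_byte on the left, add in_byte on the right."""
    a = (a - out_byte + in_byte) % MOD
    b = (b - size * out_byte + a - 1) % MOD
    return a, b


def rolling_hashes(data, size):
    """Return the Adler32 value of every size-byte window of data."""
    if len(data) < size:
        return []
    a, b = adler32_parts(data[:size])
    hashes = [(b << 16) | a]
    for i in range(size, len(data)):
        a, b = roll(a, b, size, data[i - size], data[i])
        hashes.append((b << 16) | a)
    return hashes


text = b"Now is the time for all good men to come to the aid of their country"
window_hashes = rolling_hashes(text, 16)
```

Each rolled value equals what `zlib.adler32` returns for the same window, which is an easy way to check the update formula.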
19Normalizing Text
Forward, the Light Brigade!' Was there a man
dismay'd? Not tho' the soldier knew Some one
had blunder'd Theirs not to make reply, Theirs
not to reason why, Theirs but to do and die Into
the valley of Death Rode the six hundred.
20Convert to Lowercase
forward, the light brigade!' was there a man
dismay'd? not tho' the soldier knew some one
had blunder'd theirs not to make reply, theirs
not to reason why, theirs but to do and die into
the valley of death rode the six hundred.
21Remove All Punctuation
forward the light brigade was there a man dismay
d not tho the soldier knew some one had
blunder d theirs not to make reply theirs not to
reason why theirs but to do and die into the
valley of death rode the six hundred
22Remove Short Words
forward the light brigade was there man
dismay not tho the soldier knew some one had
blunder theirs not make reply theirs not reason
why theirs but do and die into the valley of
death rode the six hundred
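The three normalization steps above can be sketched in a few lines. The slides do not state the exact rule for "short words"; the examples are consistent with dropping single-letter words plus a small stop list, so the `STOP_WORDS` set below is an assumption inferred from the sample output, as is the function name.

```python
import re

# Hypothetical stop list: the slides drop "to" (and "it" later) but keep "do" and "of".
STOP_WORDS = frozenset({"to", "it"})


def normalize(text, stop_words=STOP_WORDS):
    """Lowercase, replace punctuation with spaces, and drop short words."""
    lowered = text.lower()
    # Replacing (not deleting) punctuation turns "dismay'd" into "dismay d".
    no_punct = re.sub(r"[^a-z0-9\s]", " ", lowered)
    return [w for w in no_punct.split() if len(w) > 1 and w not in stop_words]


poem = (
    "Forward, the Light Brigade!' Was there a man dismay'd? Not tho' the "
    "soldier knew Some one had blunder'd Theirs not to make reply, Theirs "
    "not to reason why, Theirs but to do and die Into the valley of Death "
    "Rode the six hundred."
)
words = normalize(poem)
print(" ".join(words))
```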
23Chunk into Six Word Groups
forward the light brigade was there
man dismay not tho the soldier
knew some one had blunder theirs
not make reply theirs not reason
why theirs but do and die
into the valley of death rode
the six hundred
24Compute Hashes of Word Groups
103892 forward the light brigade was there
192831 man dismay not tho the soldier
032910 knew some one had blunder theirs
492012 not make reply theirs not reason
582101 why theirs but do and die
281923 into the valley of death rode
192893 the six hundred
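A rough sketch of the chunk-and-hash step, assuming non-overlapping six-word groups and `zlib.adler32` as the hash. The six-digit numbers on the slide appear to be illustrative, so the values this code prints will not match them. The helper names are hypothetical.

```python
import zlib


def chunk_words(words, size=6):
    """Split a word list into consecutive groups of `size` words (last may be shorter)."""
    return [" ".join(words[i:i + size]) for i in range(0, len(words), size)]


def hash_chunks(chunks):
    """Pair each word group with the Adler32 checksum of its UTF-8 bytes."""
    return [(zlib.adler32(c.encode("utf-8")), c) for c in chunks]


normalized = (
    "forward the light brigade was there man dismay not tho the soldier "
    "knew some one had blunder theirs not make reply theirs not reason "
    "why theirs but do and die into the valley of death rode the six hundred"
)
chunks = chunk_words(normalized.split())
hashed = hash_chunks(chunks)

for value, chunk in hashed:
    print(value, chunk)
```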
25Database Storage
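The slide gives no details about the database, so the following is only one possible arrangement: an SQLite table, indexed by hash, that maps each word-group hash back to the document it came from. The table name, column names, and document id are all hypothetical.

```python
import sqlite3
import zlib


def create_index(conn):
    """Create a table of (hash, doc_id, chunk) rows with an index on hash."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS chunk_hashes ("
        " hash INTEGER NOT NULL, doc_id TEXT NOT NULL, chunk TEXT NOT NULL)"
    )
    conn.execute("CREATE INDEX IF NOT EXISTS idx_hash ON chunk_hashes (hash)")


def store_chunks(conn, doc_id, chunks):
    """Hash each word group and store it alongside its source document id."""
    rows = [(zlib.adler32(c.encode("utf-8")), doc_id, c) for c in chunks]
    conn.executemany("INSERT INTO chunk_hashes VALUES (?, ?, ?)", rows)
    conn.commit()


def lookup(conn, chunk):
    """Return ids of documents containing this exact word group."""
    h = zlib.adler32(chunk.encode("utf-8"))
    # Comparing the text as well guards against hash collisions.
    cur = conn.execute(
        "SELECT doc_id FROM chunk_hashes WHERE hash = ? AND chunk = ?", (h, chunk)
    )
    return [row[0] for row in cur]


conn = sqlite3.connect(":memory:")
create_index(conn)
store_chunks(
    conn,
    "light-brigade",
    ["forward the light brigade was there", "man dismay not tho the soldier"],
)
```

Looking up by the integer hash first keeps the query on the index; the stored text is only compared for rows whose hash already matches.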
26Searching for Plagiarism
The author then said, Was there a man dismay'd?
Not tho' the soldier knew. It was with this
the author then said was there man dismay not tho
the soldier knew was with this statement
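The search step might look like the sketch below. It assumes the suspect text is normalized the same way as the source, and that a six-word window slides one word at a time across it, since a copied passage will rarely line up with the source's group boundaries. For brevity each window is rehashed with `zlib.adler32` rather than updated incrementally, and an in-memory dict stands in for the database. The stop list and all names are assumptions.

```python
import re
import zlib

STOP_WORDS = frozenset({"to", "it"})  # hypothetical list inferred from the slide examples


def normalize(text):
    """Lowercase, replace punctuation with spaces, and drop short words."""
    cleaned = re.sub(r"[^a-z0-9\s]", " ", text.lower())
    return [w for w in cleaned.split() if len(w) > 1 and w not in STOP_WORDS]


def group_hash(words):
    """Adler32 checksum of a word group joined by single spaces."""
    return zlib.adler32(" ".join(words).encode("utf-8"))


def build_index(words, size=6):
    """Index the full, non-overlapping word groups of a source text by hash."""
    index = {}
    for i in range(0, len(words) - size + 1, size):
        group = words[i:i + size]
        index.setdefault(group_hash(group), []).append(" ".join(group))
    return index


def find_matches(suspect_words, index, size=6):
    """Slide a window one word at a time and report groups found in the index."""
    matches = []
    for i in range(len(suspect_words) - size + 1):
        window = suspect_words[i:i + size]
        text = " ".join(window)
        if text in index.get(group_hash(window), []):
            matches.append(text)
    return matches


source = (
    "Forward, the Light Brigade!' Was there a man dismay'd? Not tho' the "
    "soldier knew Some one had blunder'd Theirs not to make reply"
)
suspect = (
    "The author then said, Was there a man dismay'd? Not tho' the soldier "
    "knew. It was with this"
)
index = build_index(normalize(source))
matches = find_matches(normalize(suspect), index)
print(matches)
```

In this example the window starting at "man" reproduces a stored group exactly, which flags the quoted passage.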