Plagiarism Detection Tools
Plagiarism Detection Tools
In the past two decade, several plagiarism detection tools have been developed. Some
Of these tools are discussed in brief, next. Also, we have analyzed their pros and cons, and
reported in a tabular form in Table 2 We reported the classification of tools in Figure 4
i. SafeAssignment : This anti-plagiarism checker claims to search an index of 8
billion documents available in the Web. It uses some major scholastic databases like
ProQuestTM, FindArticlesTM and Paper Mills during searching and detection process.
SafeAssignment maintains a database where user account is essential to keep
fingerprints of the submitted documents in order to avoid any legal or copy right
problem. This tool uses proprietary searching and ranking algorithms for match
detection of fingerprints with its resources. The results of plagiarism detection is
presented to the user within couple of minutes.
ii.Docol©c: This Web based service uses capabilities like searching and ranking of
Google API. The submitted document is uploaded to a server and evaluation is done
in the server side. The software provides a simple console to set fingerprint (search
fragments) size, date constraints, filtering and other report related options. The
evaluation result is sent to the user through email identifying plagiarized sections
and sources of plagiarism. This is totally Google API dependent and so it may be
unavailable at any point of time.
iii.Urkund: This is another Web based service which carry out plagiarism
detection in server side. This is an integrated and automated solution for plagiarism
detection. This is a paid service which uses standard email system for document
submission and for viewing results. This system claims to process 300 different
types of document submissions and it searches through all available online sources.
It gives more priority to educational sources of documents more during searching.
iv.Copycatch: This is a client-based tool which utilizes the local database of
documents during comparison. It offers ‘gold’ and ‘campus versions’, providing
comparison capabilities against large repository of local resources. It has another
Web version which utilizes the capabilities of Google API for plagiarism detection
across the Internet. To use the Web version, user needs personal Google API licence
through signup.
v.WCopyfind: It is an open source plagiarism detection tool for detection of words
or phrases of defined length within a local repository of documents. Its extended
version has the capabilities of searching across the Internet using Google API to
check plagiarism online.
vi.Eve2 (Essay Verification Engine: This system is installed in user's
computer and it checks plagiarism of a document against Internet sources. It does
not contact any online database. It accepts text in several formats but internally
converts the input file into text for processing. It presents the user with a report
identifying matches found in the Web.
vii.GPSP - Glatt Plagiarism Screening Program : This system uses different
approaches unlike other mentioned services. It finds and uses the writing style of the
author(s) to detect plagiarism. This service works locally and it asks the author to go
through a test by filling the blank spaces. The number of correctly filled spaces and
time taken to complete the test are used to make a hypothesis about plagiarism. This
system is basically developed for teachers and it cannot detect source code
plagiarism.
vii.MOSS - a Measure of Software Similarity : This system is used to detect
source code plagiarism. This service takes batches of documents as input and
attempts to present a set of HTML pages to specify the sections of a pair of
documents where matches detected. The tool specializes in detecting plagiarism in
C, C++, Java, Pascal, Ada, ML, Lisp, or Scheme programs.
JPlag [47][6]: It is a Web based source code plagiarism detection tool started in
1997. The tool accepts a set of programs as input to be compared and to present a
report identifying matches. JPlag carry out programming language syntax and
structure aware analysis to find results. It can detect plagiarism in Java, C and C++
programs. The execution time of this service is less than one minute for submissions
of 100 programs of several hundred lines each.
viii.Copyscape: This system takes URL as input and search for copies of a Web
page in the Internet. Copyscape helps to find sites that have copied from someone's
Web page content without permission. It has both free and premium version and it
pushes the free users to buy their premium by limiting the search features.
ix.DOC Cop : This plagiarism detection system creates report displaying the
correlation and matches between documents or between documents and the Web. It
is free plagiarism detection system. \
x.Ephorus : To access this tool, user is to register with the Ephorus site. Hence,
no downloads or installation is needed. The search engine compares a text document
to millions of others on the Web and reports back with an originality report [50].
This tool can be freely tried but license needs to be purchased. It is well known in
many European universities and organization.
xi.ithenticate : This is a successful Web based plagiarism detection tool for
any text document. This tool is not required to install in client computer. This
application compares input documents against the document sources available on
the Web. This well-known tool is used by most well-known journal publishers. It is
a easy to use, quick plagiarism checker for professionals. It is designed to be used
by institutions rather than personal, but lastly they provided a limit service for single
plagiarism detection user like master and doctoral students and this allows them to
check a single document of up to 25,000 words.
xii.Plagiarism Detect: To use this tool, user needs to register by providing correct
information. After registration, users are allowed to input text in a given text box or
as a file by uploading for analysis. This is a free service which finally sends
evaluation report to the user's email account with a list of links from where
information are copied. It also specifies amount of plagiarism (in \%) detected. User
needs to download and install the software in order to use it.
xiii.Exactus Like : This plagiarism detection system is not able to find simple
copy-paste plagiarism but also can detect moderately disguised borrowing
(word/phrase reordering, substitution of some words with synonyms [52]. To do
this, the system leverages deep parsing techniques. This Web based tool supports
most of the popular file formats such as Adobe PDF, Microsoft Word, RTF, ODT
and HTML. Currently Exactus Like includes about 8.5 million indexed documents.
Internally this tool is basically a distributed system and a demo version of this tool is
available online.
xiv.DupliChecker : It is a free online plagiarism checker. This tool can be accessed
by unregistered user only once, but registered user can check for plagiarism for 50
times in a day. The input file must contain more than 1000 words per similarity
search. User can check content's originality by number of ways such as via copy
paste, uploading file or by submitting URL.