Professional Documents
Culture Documents
The term “plagiarize” is defined as to take (ideas, documents, code, image, etc) from another and
pass them off as one's own without citation. So plagiarism is a global problem, which occurs in
many different areas of our life. There are many different forms of plagiarism, Plagiarism at
schools can be a highly de-motivating factor for teachers and also for students. If plagiarism is
not addressed sufficiently, plagiarists could gain undeserved advantage, e.g. more marks for their
assignments with less effort.
Disadvantages:
Proposed System:
. The objective is to determine whether the application of machine learning in the plagiarism
detection task is helpful. This is achieved by comparing a threshold setting approach against a
supervised machine learning classifier. Third, the prospect of applying the proposed framework
in a large-scale scenario is explored. The objective is to investigate the scalability of the
proposed framework and algorithms. This is achieved by experimenting with a large-scale corpus
in three stages. The first two stages are based on longer text lengths and the final stage is based
on segments of texts. Finally, the plagiarism direction identification problem is explored as
supervised machine learning classification and ranking tasks. Statistical and linguistic features
are investigated individually or in various combinations. The objective is to introduce a new
perspective on the traditional brute-force pair-wise comparison of texts. Instead of comparing
original texts against rewritten texts, features are drawn based on traits of texts to build a pattern
for original and rewritten texts. Thus, the classification or ranking task is to fit a piece of text into
a pattern. The framework is tested by empirical experiments, and the results from initial
experiments show that deep linguistic analysis contributes to solving the problems.
Advantages:
Software Requirements:
Language : JDK (1.7.0)
Frontend : JSP, Servlets
Backend : Oracle10g
IDE : my eclipse 8.6
Operating System : windows XP
Server : Tomcat
Hardware Requirements:
Processor : Pentium IV
Hard Disk : 80GB
RAM : 2GB