Abstract: Plagiarism is the act of taking someone else's idea or work and passing it off as one's own. A number of countries classify it as an infringement of moral rights. It has become increasingly common as technology changes and Internet usage grows, and it is often observed in educational contexts such as research papers, blogs, articles, and assignments. This study focuses primarily on plagiarism in schools and colleges, where many students copy assignments from their classmates. For the convenience of teachers, a system can be developed that checks the amount of plagiarism in students' assignments. Such a system improves on manual checking by eliminating tedious work and increasing speed and efficiency.
Keywords: plagiarism, cosine similarity, data mining, hash tag, stop word, cleaning, stemming.
DOI: 10.17148/IJARCCE.2022.114158