← Back to VOLUME 2, ISSUE 12, DECEMBER 2013
This work is licensed under a Creative Commons Attribution 4.0 International License.
Effective Pre-Processing Activities in Text Mining using Improved Porter’s Stemming Algorithm
C.RAMASUBRAMANIAN, R.RAMYA PG Student, ANNA UNIVERSITY, Nodal Center- Kamaraj College Of Engineering & Technology, Virudhunagar, Tamilnadu, India Assistant Professor, Depatrment of IT, Kamaraj College Of Engineering & Technology, Virudhunagar, Tamilnadu, India
Downloads: Download PDF
👁 38 views📥 1 download
Abstract: Text Databases are rapidly growing due to the increasing amount of information available in various electronic forms. User need to access relevant information across multiple documents. Initial process in Text Mining system is Pre- Processing steps. Our approach to make an effective Pre-Processing steps to save both space and time requirements by using improved Stemming Algorithm. Stemming algorithms are used to transform the words in texts into their grammatical root form. Several algorithms exist with different techniques. The most widely used stemming algorithm is “M.F Porter stemming algorithm. However, it still has certain drawbacks of handling Named Entities. Our paper is to improve its structure by refining with certain constraints, so that improve the Information Retrieval System’s Efficiency. Thus our paper is demonstrate how we can effectively overcome the problem of Named Entity during stemming process.
Keywords: Extraction, NamedEntity, Stemming, StopWordRemoval.
Keywords: Extraction, NamedEntity, Stemming, StopWordRemoval.
How to Cite:
[1] C.RAMASUBRAMANIAN, R.RAMYA PG Student, ANNA UNIVERSITY, Nodal Center- Kamaraj College Of Engineering & Technology, Virudhunagar, Tamilnadu, India Assistant Professor, Depatrment of IT, Kamaraj College Of Engineering & Technology, Virudhunagar, Tamilnadu, India, “Effective Pre-Processing Activities in Text Mining using Improved Porter’s Stemming Algorithm,” International Journal of Advanced Research in Computer and Communication Engineering (IJARCCE)
