📞 +91-7667918914 | ✉️ ijarcce@gmail.com
IJARCCE Logo
International Journal of Advanced Research in Computer and Communication Engineering A monthly Peer-reviewed & Refereed journal
ISSN Online 2278-1021ISSN Print 2319-5940Since 2012
IJARCCE adheres to the suggestive parameters outlined by the University Grants Commission (UGC) for peer-reviewed journals, upholding high standards of research quality, ethical publishing, and academic excellence.
← Back to VOLUME 5, ISSUE 7, JULY 2016

Novel Sequence Clustering Approach for Biological Data

Sunila, Rishipal, Sanjeev Kumar

DOI: 10.17148/IJARCCE.2016.57151

Abstract: Clustering is one of the unsupervised learning technique in which a set of basics is separated into uniform groups. It is more hard job as compared to supervised classification where classes are already known for training the system. This dilemma becomes most awful when sequential data are to be measured. Hidden Markov Models (HMM) comprise a tool for sequential data modeling. In this paper a scheme for HMM based sequential clustering is proposed and compared with K-Means using machine learning tool WEKA. In this approach proximity based methods are used, in which the main endeavor of the clustering process is in formulating similarity or distance measures between sequences. Proposed K-Means is a useful tool for identifying co-expressed genes, biologically relevant groupings of genes and samples. Experimental results demonstrate that HMM based K-Means outperforms K-Means in terms of accuracy. But Proposed K-Means has an intense computational load.



Keywords: Data mining, Clustering, K-Means Clustering, HMM, Distance measure.

How to Cite:

[1] Sunila, Rishipal, Sanjeev Kumar, “Novel Sequence Clustering Approach for Biological Data,” International Journal of Advanced Research in Computer and Communication Engineering (IJARCCE), DOI: 10.17148/IJARCCE.2016.57151