
A Study on Clustering Algorithms Used In Large Datasets

A.J. Anju, P.O. Sinciya

Abstract



A data set is a collection of related sets of information that is composed of separate elements but can be manipulated as a unit by a computer. Data mining is the analysis of (often large) observational data sets to find unsuspected relationships and to summarize the data in novel ways that are both understandable and useful to the data owner. The Markov clustering algorithm operates on a graph built from pairwise similarity information of the input data. Edge weights stored in the stochastic similarity matrix are alternately fed to the two main operations, inflation and expansion, and are normalized in each main loop to maintain the probabilistic constraint. Clustering large data sets is difficult in terms of processing speed, time delay, searching across different databases, and similar factors. In hierarchical clustering, large-scale data sets are difficult to capture, store, search, analyze, and visualize. Hierarchical clustering also cannot represent distinct clusters with similar expression patterns. This paper surveys different clustering algorithms that obtain maximum similarity on large protein sequence data sets.
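The expansion and inflation loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the inflation exponent of 2, the added self-loops, the convergence tolerance, and the way clusters are read from the rows of the final matrix are all assumptions chosen for illustration, and the dense pure-Python matrices would not scale to large protein sequence data sets.

```python
def normalize_columns(m):
    """Rescale each column to sum to 1 (the probabilistic constraint)."""
    n = len(m)
    sums = [sum(m[i][j] for i in range(n)) for j in range(n)]
    return [[m[i][j] / sums[j] if sums[j] else 0.0 for j in range(n)]
            for i in range(n)]


def expand(m):
    """Expansion: square the stochastic matrix (two-step random walks)."""
    n = len(m)
    return [[sum(m[i][k] * m[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def inflate(m, r):
    """Inflation: raise every entry to the power r, then renormalize columns."""
    return normalize_columns([[x ** r for x in row] for row in m])


def markov_cluster(similarity, inflation=2.0, max_iter=100, tol=1e-9):
    """Cluster nodes from a symmetric pairwise similarity matrix.

    Assumptions (not from the paper): self-loops of weight 1 are added,
    and clusters are the distinct sets of non-zero columns per row.
    """
    n = len(similarity)
    m = [[1.0 if i == j else float(similarity[i][j]) for j in range(n)]
         for i in range(n)]
    m = normalize_columns(m)
    for _ in range(max_iter):
        nxt = inflate(expand(m), inflation)
        converged = all(abs(nxt[i][j] - m[i][j]) < tol
                        for i in range(n) for j in range(n))
        m = nxt
        if converged:
            break
    clusters = []
    for row in m:
        members = [j for j, x in enumerate(row) if x > 1e-6]
        if members and members not in clusters:
            clusters.append(members)
    return sorted(clusters)


# Toy input: two fully connected groups of three nodes with no links between them.
toy = [
    [0, 1, 1, 0, 0, 0],
    [1, 0, 1, 0, 0, 0],
    [1, 1, 0, 0, 0, 0],
    [0, 0, 0, 0, 1, 1],
    [0, 0, 0, 1, 0, 1],
    [0, 0, 0, 1, 1, 0],
]
print(markov_cluster(toy))  # [[0, 1, 2], [3, 4, 5]]
```

On this toy input the two disconnected groups are recovered as separate clusters. A practical implementation would typically use sparse matrices and prune small entries after each inflation step to keep memory use manageable.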


