Open Access Open Access  Restricted Access Subscription or Fee Access

Data Classification with the Genetic Algorithm AGR

Raul Robu, Stefan Holban

Abstract


The specific literature contains an important number of genetic algorithms which successfully classify data. The paper presents a synthesis of these algorithms. The models discovered by genetic algorithms should have an accuracy and coverage as good as possible. In order to encourage the discovery of some rules with good coverage, a factor that determines the fitness of a rule to grow proportional with its coverage could be included in the fitness function, but this will not guarantee the discovery of new rules with a satisfactory coverage. The proposed genetic algorithm was named AGR and comes with a solution to this problem. The models made of rules discovered by AGR guarantee a configurable percent of minimum coverage on the instances from the training set. The discovered rules contain the logical conditions “AND” and “OR”. The algorithm can recalculate the class following the crossover, it can work only with the best rule that was discovered in each iteration. In order to ensure a maximum coverage, the algorithm can calculate a default rule. The algorithm was implemented in Weka and in a dedicated application. The analysis of the algorithm’s results on medical datasets showed that it can be successfully used for data classification.

Keywords


AGR, classification, data mining, genetic algorithm, Weka.

Full Text:

PDF

Refbacks

  • There are currently no refbacks.


Disclaimer/Regarding indexing issue:

We have provided the online access of all issues and papers to the indexing agencies (as given on journal web site). It’s depend on indexing agencies when, how and what manner they can index or not. Hence, we like to inform that on the basis of earlier indexing, we can’t predict the today or future indexing policy of third party (i.e. indexing agencies) as they have right to discontinue any journal at any time without prior information to the journal. So, please neither sends any question nor expects any answer from us on the behalf of third party i.e. indexing agencies.Hence, we will not issue any certificate or letter for indexing issue. Our role is just to provide the online access to them. So we do properly this and one can visit indexing agencies website to get the authentic information.