eCite Digital Repository

A novel approach to mining maximal frequent itemsets based on genetic algorithm

Citation

Kabir, M and Xu, S and Kang, BH and Zhao, Z, A novel approach to mining maximal frequent itemsets based on genetic algorithm, Proceedings of the 9th International Conference on Information Technology and Applications, 1-4 July 2014, Sydney, Australia, pp. 1-6. ISBN 978-0-9803267-6-5 (2014) [Refereed Conference Paper]

Copyright Statement

Copyright 2014 ICITA

Official URL: http://www.icita.org/2014/CD/abstracts/au-kabir.ht...

Abstract

We present a new approach based on Genetic Algorithm to generate maximal frequent itemsets from large databases. This new algorithm called GeneticMax is heuristic which mimics natural selection approaches to finding maximal frequent itemsets in an efficient way. The search strategy of this algorithm uses lexicographic tree that avoids level by level searching, which finally reduces the time required to mine maximal frequent itemsets in a linear way. Our implementation of the search strategy includes bitmap representation of the nodes in a lexicographic tree and from superset-subset relationship of the nodes it identifies frequent itemsets. Since this new algorithm uses the principles of Genetic Algorithm, it performs global search and its time complexity is less than that of other algorithms, for the reason that genetic algorithm is based on greedy approach. We separate the effect of each step of this algorithm by experimental analysis on real databases including Tic Tac Toe, Zoo, a 100008 Database, and so on. Our experimental results show that this approach is efficient and scalable for different sizes of itemsets. It accesses a major database to calculate a support value for fewer number of nodes to find frequent itemsets even when the search space is very large, which dramatically reduces the search time.

Item Details

Item Type:Refereed Conference Paper
Keywords:data mining, genetic algorithm, lexicographic tree, maximal frequent itemset
Research Division:Information and Computing Sciences
Research Group:Artificial Intelligence and Image Processing
Research Field:Neural, Evolutionary and Fuzzy Computation
Objective Division:Information and Communication Services
Objective Group:Computer Software and Services
Objective Field:Information Processing Services (incl. Data Entry and Capture)
Author:Kabir, M (Mr Mir Kabir)
Author:Xu, S (Dr Shuxiang Xu)
Author:Kang, BH (Professor Byeong Kang)
Author:Zhao, Z (Mr Zongyuan Zhao)
ID Code:96359
Year Published:2014
Deposited By:Computing and Information Systems
Deposited On:2014-10-31
Last Modified:2017-11-13
Downloads:0

Repository Staff Only: item control page