eCite Digital Repository

Consensus Clustering and Supervised Classification for Profiling Phishing Emails in Internet Commerce Security

Citation

Dazeley, R and Yearwood, JL and Kang, BH and Kelarev, AHJ, Consensus Clustering and Supervised Classification for Profiling Phishing Emails in Internet Commerce Security, Knowledge Management and Acquisition for Smart Systems and Services, 20 Aug - 3 Sept 2010, Daegu, Korea, pp. 235-246. ISBN 978-3-642-15036-4 (2010) [Refereed Conference Paper]


Preview
PDF
Restricted - Request a copy
165Kb
  

Copyright Statement

Copyright Springer-Verlag Berlin Heidelberg 2010

Abstract

This article investigates internet commerce security applica- tions of a novel combined method, which uses unsupervised consensus clustering algorithms in combination with supervised classification meth- ods. First, a variety of independent clustering algorithms are applied to a randomized sample of data. Second, several consensus functions and so- phisticated algorithms are used to combine these independent clusterings into one final consensus clustering. Third, the consensus clustering of the randomized sample is used as a training set to train several fast super- vised classification algorithms. Finally, these fast classification algorithms are used to classify the whole large data set. One of the advantages of this approach is in its ability to facilitate the inclusion of contributions from domain experts in order to adjust the training set created by consensus clustering. We apply this approach to profiling phishing emails selected from a very large data set supplied by the industry partners of the Cen- tre for Informatics and Applied Optimization. Our experiments compare the performance of several classification algorithms incorporated in this scheme.

Item Details

Item Type:Refereed Conference Paper
Research Division:Information and Computing Sciences
Research Group:Artificial Intelligence and Image Processing
Research Field:Expert Systems
Objective Division:Information and Communication Services
Objective Group:Computer Software and Services
Objective Field:Computer Software and Services not elsewhere classified
Author:Kang, BH (Professor Byeong Kang)
ID Code:64975
Year Published:2010
Deposited By:Information and Communication Technology
Deposited On:2010-09-20
Last Modified:2014-12-09
Downloads:3 View Download Statistics

Repository Staff Only: item control page