eCite Digital Repository

Experimental investigation of three machine learning algorithms for ITS dataset

Citation

Yearwood, JL and Kang, BH and Kelarev, A, Experimental investigation of three machine learning algorithms for ITS dataset, Proceedings of Future Generation Information Technology, 10-12 December 2009, Jeju Island, Korea, pp. 308-316. ISBN 978-3-642-10508-1 (2009) [Refereed Conference Paper]


Preview
PDF
Restricted - Request a copy
154Kb
  

Copyright Statement

The original publication is available at http://www.springerlink.com

Official URL: http://www.springerlink.com

DOI: doi:10.1007/978-3-642-10509-8

Abstract

The present article is devoted to experimental investigation of the performance of three machine learning algorithms for ITS dataset in their abil- ity to achieve agreement with classes published in the biological literature be- fore. The ITS dataset consists of nuclear ribosomal DNA sequences, where rather sophisticated alignment scores have to be used as a measure of distance. These scores do not form a Minkowski metric and the sequences cannot be re- garded as points in a finite dimensional space. This is why it is necessary to de- velop novel machine learning approaches to the analysis of datasets of this sort. This paper introduces a k-committees classifier and compares it with the dis- crete k-means and Nearest Neighbour classifiers. It turns out that all three machine learning algorithms are e␣cient and can be used to automate future biologically significant classifications for datasets of this kind. A simplified ver- sion of a synthetic dataset, where the k-committees classifier outperforms k-means and Nearest Neighbour classifiers, is also presented.

Item Details

Item Type:Refereed Conference Paper
Research Division:Information and Computing Sciences
Research Group:Computer Software
Research Field:Software Engineering
Objective Division:Information and Communication Services
Objective Group:Computer Software and Services
Objective Field:Computer Software and Services not elsewhere classified
Author:Kang, BH (Professor Byeong Kang)
ID Code:62422
Year Published:2009
Deposited By:Computing and Information Systems
Deposited On:2010-03-12
Last Modified:2014-12-19
Downloads:0

Repository Staff Only: item control page