eCite Digital Repository

Experimental investigation of three machine learning algorithms for ITS dataset


Yearwood, JL and Kang, BH and Kelarev, A, Experimental investigation of three machine learning algorithms for ITS dataset, Proceedings of Future Generation Information Technology, 10-12 December 2009, Jeju Island, Korea, pp. 308-316. ISBN 978-3-642-10508-1 (2009) [Refereed Conference Paper]

Restricted - Request a copy

Copyright Statement

The original publication is available at

Official URL:

DOI: doi:10.1007/978-3-642-10509-8


The present article is devoted to experimental investigation of the performance of three machine learning algorithms for ITS dataset in their abil- ity to achieve agreement with classes published in the biological literature be- fore. The ITS dataset consists of nuclear ribosomal DNA sequences, where rather sophisticated alignment scores have to be used as a measure of distance. These scores do not form a Minkowski metric and the sequences cannot be re- garded as points in a finite dimensional space. This is why it is necessary to de- velop novel machine learning approaches to the analysis of datasets of this sort. This paper introduces a k-committees classifier and compares it with the dis- crete k-means and Nearest Neighbour classifiers. It turns out that all three machine learning algorithms are e␣cient and can be used to automate future biologically significant classifications for datasets of this kind. A simplified ver- sion of a synthetic dataset, where the k-committees classifier outperforms k-means and Nearest Neighbour classifiers, is also presented.

Item Details

Item Type:Refereed Conference Paper
Research Division:Information and Computing Sciences
Research Group:Software engineering
Research Field:Software testing, verification and validation
Objective Division:Information and Communication Services
Objective Group:Information systems, technologies and services
Objective Field:Information systems, technologies and services not elsewhere classified
UTAS Author:Kang, BH (Professor Byeong Kang)
ID Code:62422
Year Published:2009
Deposited By:Information and Communication Technology
Deposited On:2010-03-12
Last Modified:2014-12-19

Repository Staff Only: item control page