eCite Digital Repository
Use of dual-filtering to create training sets leading to improved accuracy in quantitative structure-retention relationships modelling for hydrophilic interaction liquid chromatographic systems
Citation
Taraji, M and Haddad, PR and Amos, RIJ and Talebi, M and Szucs, R and Dolan, JW and Pohl, CA, Use of dual-filtering to create training sets leading to improved accuracy in quantitative structure-retention relationships modelling for hydrophilic interaction liquid chromatographic systems, Journal of Chromatography A, 1507 pp. 53-62. ISSN 0021-9673 (2017) [Refereed Article]
Copyright Statement
Crown Copyright © 2017
DOI: doi:10.1016/j.chroma.2017.05.044
Abstract
The development of quantitative structure retention relationships (QSRR) having sufficient accuracy to support high performance liquid chromatography (HPLC) method development is still a major issue. To tackle this challenge, this study presents a novel QSRR methodology to select a training set of compounds for QSRR modelling (i.e. to filter the database to identify the most appropriate compounds for the training set). This selection is based on a dual filtering strategy which combines Tanimoto similarity (TS) searching as the primary filter and retention time (tR) similarity clustering as the secondary filter, using a database of pharmaceutical compound retention times collected over a wide range of hydrophilic interaction liquid chromatography (HILIC) systems. To employ tR similarity filtering, correlation to a molecular descriptor is used as a measure of retention time. For the retention time of a compound to be modelled a relationship between experimental chromatographic data and various molecular descriptors is calculated using a genetic algorithm-partial least squares (GA-PLS) regression. The proposed dual-filtering-based QSRR model significantly improves the retention time predictability compared to the diverse, global, and TS-based QSRR models, with an average root mean square error in prediction (RMSEP) of 11.01% over five different HILIC stationary phases. The average CPU time for implementing the proposed approach is less than 10 min, which makes it quite favorable for rapid method development in HILIC. In addition, interpretation of the molecular descriptors selected by this novel approach provided some insight into the HILIC mechanism.
Item Details
Item Type: | Refereed Article |
---|---|
Keywords: | QSRR, prediction accuracy, dual-filtering, retention prediction, similarity searching, HILIC mechanism |
Research Division: | Chemical Sciences |
Research Group: | Inorganic chemistry |
Research Field: | F-block chemistry |
Objective Division: | Expanding Knowledge |
Objective Group: | Expanding knowledge |
Objective Field: | Expanding knowledge in the chemical sciences |
UTAS Author: | Taraji, M (Ms Maryam Taraji) |
UTAS Author: | Haddad, PR (Professor Paul Haddad) |
UTAS Author: | Amos, RIJ (Dr Ruth Amos) |
UTAS Author: | Talebi, M (Dr Mohammad Talebi) |
ID Code: | 120523 |
Year Published: | 2017 |
Funding Support: | Australian Research Council (LP120200700) |
Web of Science® Times Cited: | 22 |
Deposited By: | Austn Centre for Research in Separation Science |
Deposited On: | 2017-08-28 |
Last Modified: | 2018-03-28 |
Downloads: | 0 |
Repository Staff Only: item control page