Towards a chromatographic similarity index to establish localised quantitative structure-retention relationships for retention prediction. II Use of Tanimoto similarity index in ion chromatography

Park, SH; Talebi, Mohammad; Amos, RIJ; Tyteca, E; Haddad, Paul; Szucs, R; Pohl, CA; Dolan, JW

File(s) under permanent embargo

Towards a chromatographic similarity index to establish localised quantitative structure-retention relationships for retention prediction. II Use of Tanimoto similarity index in ion chromatography

journal contribution

posted on 2023-05-19, 13:29 authored by Park, SH, Mohammad TalebiMohammad Talebi, Amos, RIJ, Tyteca, E, Paul HaddadPaul Haddad, Szucs, R, Pohl, CA, Dolan, JW

Quantitative Structure-Retention Relationships (QSRR) are used to predict retention times of compounds based only on their chemical structures encoded by molecular descriptors. The main concern in QSRR modelling is to build models with high predictive power, allowing reliable retention prediction for the unknown compounds across the chromatographic space. With the aim of enhancing the prediction power of the models, in this work, our previously proposed QSRR modelling approach called “federation of local models” is extended in ion chromatography to predict retention times of unknown ions, where a local model for each target ion (unknown) is created using only structurally similar ions from the dataset. A Tanimoto similarity (TS) score was utilised as a measure of structural similarity and training sets were developed by including ions that were similar to the target ion, as defined by a threshold value. The prediction of retention parameters (a- and b-values) in the linear solvent strength (LSS) model in ion chromatography, log k = a – blog[eluent], allows the prediction of retention times under all eluent concentrations. The QSRR models for a- and b-values were developed by a genetic algorithm-partial least squares method using the retention data of inorganic and small organic anions and larger organic cations (molecular mass up to 507) on four Thermo Fisher Scientific columns (AS20, AS19, AS11HC and CS17). The corresponding predicted retention times were calculated by fitting the predicted a- and b-values of the models into the LSS model equation. The predicted retention times were also plotted against the experimental values to evaluate the goodness of fit and the predictive power of the models. The application of a TS threshold of 0.6 was found to successfully produce predictive and reliable QSRR models (Q_ext(F2)² > 0.8 and Mean Absolute Error < 0.1), and hence accurate retention time predictions with an average Mean Absolute Error of 0.2 min.

Funding

Australian Research Council

Pfizer

Thermo Fisher Scientific Australia

History

Publication title

Journal of Chromatography A

Volume

1523

Pagination

173-182

ISSN

0021-9673

Department/School

School of Natural Sciences

Publisher

Elsevier Science Bv

Place of publication

Po Box 211, Amsterdam, Netherlands, 1000 Ae

Rights statement

Repository Status

Restricted

Socio-economic Objectives

Expanding knowledge in the chemical sciences

Usage metrics

Keywords

QSRR ion chromatography Tanimoto similarity partial least squares genetic algorithm linear solvent strength (LSS) model

Licence

In Copyright

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

File(s) under permanent embargo

Towards a chromatographic similarity index to establish localised quantitative structure-retention relationships for retention prediction. II Use of Tanimoto similarity index in ion chromatography

Funding

Australian Research Council

Pfizer

Thermo Fisher Scientific Australia

History

Publication title

Volume

Pagination

ISSN

Department/School

Publisher

Place of publication

Rights statement

Repository Status

Socio-economic Objectives

Usage metrics

Categories

Keywords

Licence

Exports