Towards a chromatographic similarity index to establish localized quantitative structure-retention models for retention prediction: use of retention factor ratio

Tyteca, E; Talebi, Mohammad; Amos, R; Park, S; Taraji, Maryam; Wen, Y; Szucs, R; Pohl, CA; Dolan, JW; Haddad, Paul

File(s) under permanent embargo

journal contribution

posted on 2023-05-19, 02:08 authored by Tyteca, E; Mohammad Talebi; Amos, R; Park, S; Maryam Taraji; Wen, Y; Szucs, R; Pohl, CA; Dolan, JW; Paul Haddad

Quantitative Structure-Retention Relationships (QSRR) have the potential to speed up the screening phase of chromatographic method development, as the initial exploratory experiments are replaced by prediction of analyte retention based solely on the structure of the molecule. The present study offers further proof-of-concept of localized QSRR modelling, in which the retention of any given compound is predicted using only the most chromatographically similar compounds in the available dataset. To this end, each compound in the dataset was sequentially removed from the database and individually utilized as a test analyte. In this study, we propose the retention factor k as the most relevant chromatographic similarity measure and compare it with the Tanimoto index, the most popular similarity measure based on chemical structure. Prediction error was reduced by up to 8-fold when QSRR was based only on chromatographically similar compounds rather than on the entire dataset. The study therefore shows that, to establish accurate predictive localized QSRR models, a practically useful structural similarity index should select the same compounds in the dataset as the k-similarity filter does. While low average prediction errors (Mean Absolute Error (MAE) < 0.5 min) and slopes of the regression lines through the origin close to 1.00 were obtained using k-similarity searching, the use of the structural Tanimoto similarity index, considered the gold standard in Quantitative Structure-Activity Relationships (QSAR) studies, generally resulted in much higher prediction errors (MAE > 1 min) and significant deviations from the reference slope of 1.00. The Tanimoto similarity index therefore appears to have limited general utility in QSRR studies.
Future studies therefore aim at designing a more appropriate chromatographic similarity index that can then be applied for unknown compounds (that is, compounds which have not been tested previously on the chromatographic system used, but for which the chemical structures are known).
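
The leave-one-out procedure described in the abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the toy dataset is invented, the single descriptor `x`, the cut-off `max_ratio` and all function names are hypothetical, and a one-descriptor least-squares line stands in for whatever regression method the study actually used. As in the abstract, the k-similarity filter relies on the measured k of the test analyte, so it serves as a proof-of-concept benchmark rather than a tool for unknown compounds.

```python
from statistics import mean


def tanimoto(a: set, b: set) -> float:
    """Tanimoto index of two fingerprints given as sets of 'on' bit positions."""
    union = len(a | b)
    return len(a & b) / union if union else 0.0


def k_ratio(k1: float, k2: float) -> float:
    """Retention factor ratio, oriented so that the result is >= 1."""
    return max(k1, k2) / min(k1, k2)


def k_similar(dataset: dict, target: str, max_ratio: float) -> list:
    """Names of all other compounds whose k lies within max_ratio of the target's k."""
    k_t = dataset[target]["k"]
    return [name for name, c in dataset.items()
            if name != target and k_ratio(c["k"], k_t) <= max_ratio]


def fit_line(xs: list, ys: list) -> tuple:
    """Ordinary least-squares slope and intercept for a single descriptor."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    return slope, my - slope * mx


def loo_mae(dataset: dict, select) -> float:
    """Leave-one-out MAE: each compound is removed in turn and predicted
    from a model trained only on the compounds returned by select()."""
    errors = []
    for target, c in dataset.items():
        train = select(dataset, target)
        slope, intercept = fit_line([dataset[n]["x"] for n in train],
                                    [dataset[n]["k"] for n in train])
        errors.append(abs(slope * c["x"] + intercept - c["k"]))
    return mean(errors)


def toy_dataset() -> dict:
    """Invented data: two groups with different descriptor-retention trends."""
    data = {f"A{i}": {"x": float(i), "k": 0.5 * i} for i in range(1, 5)}
    data.update({f"B{i}": {"x": float(i), "k": 10.0 + 2.0 * i} for i in range(1, 5)})
    return data


if __name__ == "__main__":
    data = toy_dataset()
    local = loo_mae(data, lambda d, t: k_similar(d, t, max_ratio=5.0))
    global_ = loo_mae(data, lambda d, t: [n for n in d if n != t])
    print(f"MAE with k-similarity filter: {local:.3f}")
    print(f"MAE with entire dataset:      {global_:.3f}")
```

On this artificial dataset the k-similarity filter recovers each compound's own group, so the local model fits far better than the global one. A structural filter could be compared in the same way by passing a `select` function that ranks compounds by `tanimoto` on their fingerprints.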

Funding

Australian Research Council

Pfizer

Thermo Fisher Scientific Australia

History

Publication title

Journal of Chromatography A

Volume

1486

Pagination

50-58

ISSN

0021-9673

Department/School

School of Natural Sciences

Publisher

Elsevier Science BV

Place of publication

PO Box 211, Amsterdam, Netherlands, 1000 AE

Repository Status

Restricted

Socio-economic Objectives

Expanding knowledge in the chemical sciences

Licence

In Copyright
