Linking protolith rocks to altered equivalents by combining unsupervised and supervised machine learning
Hood, SB and Cracknell, MJ and Gazley, MF, Linking protolith rocks to altered equivalents by combining unsupervised and supervised machine learning, Journal of Geochemical Exploration, 186 pp. 270-280. ISSN 0375-6742 (2018) [Refereed Article]
Metasomatism occurs when fluid interacts with rock to add, or remove, its chemical constituents; these processes form mineral deposits where economic element(s) are concentrated into small volumes of rock. It can be difficult, or impossible, to visually determine original rock types for samples that are significantly altered, e.g., when rocks have experienced texturally destructive metasomatism or deformation. A typical solution using chemical data involves the separation and labelling of chemically distinct rocks using discrimination diagrams.
However, such approaches can be subjective, and manual sample-by-sample consideration of large mining or exploration databases is untenable. Here we present an example workflow to facilitate relating rocks with similar origins but differing geological histories. We employ a combination of unsupervised and supervised machine learning algorithms to automate classification tasks typically undertaken manually by a geologist with domain expertise. In this study, data are first normalised and then clustered into natural groupings that represent protolith lithologies or rock-type subunits. These clusters are then used to inform a classification algorithm that assigns protolith equivalent labels to samples of altered rocks. Applied to problems involving large chemical datasets, machine learning provides objectivity, reproducibility and rapidity; useful advantages as compared to geostatistical domaining methods that involve manual determination and selection of geochemically similar regions. We utilise k-means++ unsupervised clustering to create objective and reproducible groupings of data points, with many geochemical variables considered simultaneously. Subsequently, Random Forests supervised classification is used to label samples while accommodating interactions and/or correlations between data points. We present a case study from the Minto Cu-Au-Ag mine, Whitehorse, Yukon, Canada. Interpretation of multi-element geochemical data using the approach that we have outlined here allows reconstruction of protolith geometry and an understanding of how rock type may have influenced later partitioning of hydrothermal fluids and ductile deformation.