eCite Digital Repository

The utility of machine learning in identification of key geophysical and geochemical datasets: a case study in lithological mapping in the Central African copper belt


Kuhn, SD and Cracknell, MJ and Reading, AM, The utility of machine learning in identification of key geophysical and geochemical datasets: a case study in lithological mapping in the Central African copper belt, Proceedings of the Australasian Exploration Geoscience Conference 2018, 18-21 February 2018, Sydney, Australia, pp. 1-4. (2018) [Refereed Conference Paper]


Copyright Statement

Copyright 2018 the Authors

Official URL:


Random Forests, a supervised machine learning algorithm, provides a robust, data driven means of predicting lithology from geophysical, geochemical and remote sensing data. As an essential part of input selection, datasets are ranked in order of importance to the classification outcome. Those ranked most important provide, on average, the most decisive split between lithological classes. These rankings provide explorers with an additional line of reasoning to complement conventional, geophysical and geochemical interpretation workflows. The approach shows potential to aid in identifying important criteria for distinguishing geological map units during early stage exploration. This can assist in directing subsequent expenditure towards the acquisition and further development of datasets which will be the most productive for mapping.

In this case study, we use Random Forests to classify the lithology of a project in the Central African Copper-Belt, Zambia. The project area boasts extensive magnetic, radiometric, electromagnetic and multi-element geochemical coverage but only sparse geological observations. Under various training data paradigms, Random Forests produced a series of varying but closely related lithological maps. In this study, training data were restricted to outcrop, simulating the data available at the early stages of the project. Variable ranking highlighted those datasets which were of greatest importance to the result. Both geophysical and geochemical datasets were well represented in the highest ranking variables, reinforcing the importance of access to both data types. Further analysis showed that in many cases, the importance of high ranking datasets had a plausible geological explanation, often consistent with conventional interpretation. In other cases the method provides new insights, identifying datasets which may not have been considered from the outset of a new project.

Item Details

Item Type:Refereed Conference Paper
Keywords:random forests, sedimentary copper, mineral exploration, geochemistry, geological mapping
Research Division:Earth Sciences
Research Group:Geophysics
Research Field:Geophysics not elsewhere classified
Objective Division:Mineral Resources (Excl. Energy Resources)
Objective Group:Mineral exploration
Objective Field:Copper ore exploration
UTAS Author:Kuhn, SD (Mr Stephen Kuhn)
UTAS Author:Cracknell, MJ (Dr Matthew Cracknell)
UTAS Author:Reading, AM (Professor Anya Reading)
ID Code:126311
Year Published:2018
Deposited By:CODES ARC
Deposited On:2018-06-05
Last Modified:2022-09-01
Downloads:125 View Download Statistics

Repository Staff Only: item control page