eCite Digital Repository

Gully erosion susceptibility mapping (GESM) using machine learning methods optimized by the multi-collinearity analysis and K-fold cross-validation


Ghorbanzadeh, O and Shahabi, H and Mirchooli, F and Valizadeh Kamran, K and Lim, S and Aryal, J and Jarihani, B and Blaschke, T, Gully erosion susceptibility mapping (GESM) using machine learning methods optimized by the multi-collinearity analysis and K-fold cross-validation, Geomatics, Natural Hazards and Risk, 11, (1) pp. 1653-1678. ISSN 1947-5705 (2020) [Refereed Article]

PDF (Published version)

Copyright Statement

Copyright 2020 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group. This is an Open Access article distributed under the terms of the Creative Commons Attribution 4.0 International (CC BY 4.0) License, ( which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

DOI: doi:10.1080/19475705.2020.1810138


Gully erosion is a severe form of soil erosion that results in a wide range of environmental problems such as, dams’ sedimentation, destruction of transportation and energy transmission lines, decreasing and losing farmland productivity, and land degradation. The main objective of this study is to accurately map the areas prone to gully erosion, by developing two machine learning (ML) models, namely artificial neural network (ANN) and random forest (RF) models within 4-fold cross-validation (CV). Moreover, we used the multi-collinearity analysis to select 11 variables among 15 conditioning factors to train the ML models for gully erosion susceptibility mapping (GESM). Lamerd county, Iran, is chosen for a study area because Lamerd county is one of the most affected areas by gully erosion in this country. From 232 gully samples, 75% was used to train the two ML models and the rest of the samples (25%) were used to validate the generated GEMSs using 4-fold CV. The RF model produced a higher accuracy with an accuracy value of 93%. The GEMS generated by the RF model shows that the areas classified as highly vulnerable and very highly vulnerable are 1,869 ha and 5,148 ha, respectively. Results from the two models indicated that the most vulnerable land use/landcover class is bare land because of the low vegetation cover. The outcome of this study can help managers in Lamerd county to mitigate the soil erosion problem and prevent future gully erosion by taking preventive measures.

Item Details

Item Type:Refereed Article
Keywords:Soil erosion, spatial modeling, artificial neural networks (ANN), random forest (RF)
Research Division:Information and Computing Sciences
Research Group:Artificial intelligence
Research Field:Artificial intelligence not elsewhere classified
Objective Division:Environmental Management
Objective Group:Other environmental management
Objective Field:Other environmental management not elsewhere classified
UTAS Author:Aryal, J (Dr Jagannath Aryal)
ID Code:151811
Year Published:2020
Web of Science® Times Cited:16
Deposited By:Information and Communication Technology
Deposited On:2022-08-05
Last Modified:2022-11-10
Downloads:1 View Download Statistics

Repository Staff Only: item control page