Dieu Tien BuiEmail author;;Quoc Phi Nguyen; Nhat-Duc Hoang;Harald Klempe
This research represents a novel soft computing approach that combines the fuzzy k-nearest neighbor algorithm (fuzzy k-NN) and the differential evolution (DE) optimization for spatial prediction of rainfall-induced shallow landslides at a tropical hilly area of Quy Hop, Vietnam. According to current literature, the fuzzy k-NN and the DE optimization are current state-of-the-art techniques in data mining that have not been used for prediction of landslide. First, a spatial database was constructed, including 129 landslide locations and 12 influencing factors, i.e., slope, slope length, aspect, curvature, valley depth, stream power index (SPI), sediment transport index (STI), topographic ruggedness index (TRI), topographic wetness index (TWI), Normalized Difference Vegetation Index (NDVI), lithology, and soil type. Second, 70 % landslide locations were randomly generated for building the landslide model whereas the remaining 30 % landslide locations was for validating the model. Third, to construct the landslide model, the DE optimization was used to search the optimal values for fuzzy strength (fs) and number of nearest neighbors (k) that are the two required parameters for the fuzzy k-NN. Then, the training process was performed to obtain the fuzzy k-NN model. Value of membership degree of the landslide class for each pixel was extracted to be used as landslide susceptibility index. Finally, the performance and prediction capability of the landslide model were assessed using classification accuracy, the area under the ROC curve (AUC), kappa statistics, and other evaluation metrics. The result shows that the fuzzy k-NN model has high performance in the training dataset (AUC = 0.944) and validation dataset (AUC = 0.841). The result was compared with those obtained from benchmark methods, support vector machines and J48 decision trees. Overall, the fuzzy k-NN model performs better than the support vector machines and the J48 decision trees models. Therefore, we conclude that the fuzzy k-NN model is a promising prediction tool that should be used for susceptibility mapping in landslide-prone areas.