1 s2.0 S0098300419304364 Main
1 s2.0 S0098300419304364 Main
A R T I C L E I N F O A B S T R A C T
Keywords: This paper aims to use recurrent neural networks (RNNs) to perform landslide susceptibility mapping in Yongxin
Landslides susceptibility mapping County, China. The two main contributions of this study are summarized as follows. First, the regular RNN is
Recurrent neural networks compared to its three variants in the case study of landslide susceptibility mapping for the first time, including
Long short term memory
long short term memory, gated recurrent unit and simple recurrent unit. Second, a sequential data representation
Gated recurrent unit
method is proposed to fully explore the predicting potential of RNNs. The study area consists of 364 historical
Simple recurrent unit
landslide locations that were divided into two parts: 255 (70%) for training and 109 (30%) for validation, and 16
landslide influencing factors were considered for spatial prediction. To validate the effectiveness of these RNN-
related methods, several objective measures of accuracy, recall, F-measure, Matthews correlation coefficient and
the receiver operating characteristic were used for evaluation. Experimental results demonstrate that very high
and high susceptible areas are concentrated in the northwest and south of Yongxin County, while landslides in
the central area are less prone to occur. Based on quantitative results, all the RNN-related methods achieved area
under the curve values above 0.83 and produced accurate prediction results with the optimized parameters.
Therefore, the RNN framework can be used as a useful tool for the landslide susceptibility mapping task to
mitigate and manage landslides.
* Corresponding author. Institute of Geophysics and Geomatics, China University of Geosciences, Wuhan, 430074, China.
** Corresponding author. Department of Geography and Regional Research, University of Vienna, Vienna, 1010, Austria.
E-mail addresses: [email protected] (Y. Wang), [email protected], [email protected], [email protected] (H. Hong).
https://doi.org/10.1016/j.cageo.2020.104445
Received 10 May 2019; Received in revised form 4 February 2020; Accepted 12 February 2020
Available online 15 February 2020
0098-3004/© 2020 Elsevier Ltd. All rights reserved.
Y. Wang et al. Computers and Geosciences 138 (2020) 104445
machine (Pham et al., 2019b), naïve Bayes (Pham et al., 2017a) and variants are connected using hidden layers that use a "state vector"
frequency ratio (Lee and Pradhan, 2007). pattern to obtain the previous important information for the next state.
In recent years, deep learning has received great attention in Second, it is a comparative study of the regular RNN and its variants for
different fields, such as speech recognition (Noda et al., 2015), auto landslide spatial prediction in the case study of Yongxin County, China.
matic translation (Pan et al., 2016), automatic driving (Hoermann et al., To verify the effectiveness of different methods, a series of statistical
2018), etc. The concept of deep learning stems from artificial neural measures were used, including accuracy (ACC), Matthews correlation
networks, which are multi-layer perceptron structures with multiple coefficient (MCC), recall, F-measure, receiver operating characteristic
hidden layers (Hinton and Salakhutdinov, 2006). More recently, deep (ROC) curve and area under the curve (AUC). In addition, the significant
learning techniques have been applied to natural disaster susceptibility difference between the proposed RNNs was validated by using a chi-
mapping tasks (Bui et al., 2019; Wang et al., 2019c; Xiao et al., 2018; square test.
Zhang et al., 2019). For example, we first used convolutional neural
networks in LSM and achieved promising results (Wang et al., 2019b). 2. Study area
As for the another effective deep learning technique of recurrent neural
networks (RNNs), it have been widely used in natural language pro 2.1. Description
cessing because they can capture dynamic information in data by peri
odically connecting hidden layer nodes. Furthermore, the RNNs can Yongxin County has a total area of 2187 km2, located in the western
demonstrate excellent classification capability by making full use of of Jiangxi Province. This study area has a subtropical humid monsoon
context information (i.e., domain information). However, the main climate with sufficient sunshine, abundant rainfall and mild climate.
disadvantage of the regular RNN model is that it only considers the state The average annual rainfall is 1530.7 mm and the rainy season is from
of the most recent moment. To solve this problem, several variants of March to August. The geological environment is part of the South China
RNN have been presented by introducing the concept of cell state, fold system, and the structural changes are very obvious. Also, folds and
including long short term memory (LSTM) (Chen et al., 2015; Graves, faults are well developed. Excepting for the Sinian, Silurian and Tertiary,
2012), gated recurrent unit (GRU) (Cho et al., 2014) and simple recur the Cambrian to Quaternary strata were exposed to the study area with a
rent unit (SRU) (Lei et al., 2018). It should be noted that Mutlu et al. total thickness of over 20,000 m.
(2019) performed landslide susceptibility analysis using RNN, demon Historical landslide locations provide the underlying conditions for
strating its effective prediction ability. Moreover, the RNN technique has predicting the likelihood of future landslides (Wang et al., 2016). As
been used in landslide detection as well. For example, the RNN model shown in Fig. 1, local government departments in Jiangxi Province
was used for landslide deformation prediction (Chen et al., 2015). provided a total of 364 landslide locations. According to statistics, 70%
Moreover, Xu et al. successfully predicted landslide displacement of these landslides are rotational landslide and 30% of them belong to
induced by rainfall using the LSTM model (Xu and Niu, 2018). However, translational landslide. The largest and smallest landslides being 750,
these studies mainly focused on the assessment of monomer landslide 000 m2 and 32 m2, respectively. All the landslides in the study area can
risk. be divided into three scales: large (>1000 m2), moderate (400–1000 m2)
To explore more excellent deep learning techniques and further and small (<400 m2). According to the local government report, almost
extend the application of the RNN technique to regional-scale landslide all landslides occurred during or after heavy rain, of which 64.3% of the
susceptibility prediction, we present in this study an RNN framework for landslides occurred with 100 mm of precipitation every day. Moreover,
LSM. The two main contributions of this study are summarized as fol the lives of about 2174 people were affected by these landslide disasters,
lows. First, a sequential data representation method is proposed to fully and their property suffered huge losses. Therefore, it is necessary to
explore the predicting potential of RNNs. The regular RNN and its perform LSM in this area to prevent and mitigate the adverse effects of
2
Y. Wang et al. Computers and Geosciences 138 (2020) 104445
3
Y. Wang et al. Computers and Geosciences 138 (2020) 104445
where cov(X, Y) is the covariance of X and Y, Var[X] and Var[Y] are the
variances of X and Y, respectively. In practice, the degree of linear
correlation is summarized as follows (Booth et al., 1994; Bui et al., 2016;
4
Y. Wang et al. Computers and Geosciences 138 (2020) 104445
Fig. 3. Landslide influencing factor maps. (a) Lithology, (b) distance to faults, (c) land use, (d) soil, (e) distance to road, (f) altitude, (g) slope, (h) aspect, (i) profile
curvature, (j) plan curvature, (k) stream power index (SPI), (l) topographic wetness index (TWI), (m) sediment transport index (STI), (n) rainfall, (o) distance to river
and (p) normalized difference vegetation index (NDVI).
5
Y. Wang et al. Computers and Geosciences 138 (2020) 104445
Fig. 3. (continued).
8 pp
> a pps CF value ranges between 1 and 1. A positive value indicates a high
< ppa ð1 pps Þ ppa � pps
>
probability of landslides, where as a negative value indicates a low
CF ¼ (6)
>
> ppa pps probability of landslides. In particular, if this value is equal to 0, it is
: ppa < pps
pps ð1 ppa Þ difficult to determine the certainty between influencing factors and the
occurrence of landslides.
where ppa is the conditional probability of a landslide in a certain class a,
and pps is the prior probability of a landslide in the entire study area. The
6
Y. Wang et al. Computers and Geosciences 138 (2020) 104445
3.5. Recurrent neural network and its variants Fig. 6. Long short term memory (LSTM) architecture.
7
Y. Wang et al. Computers and Geosciences 138 (2020) 104445
Fig. 7. Gated recurrent unit (GRU) architecture. Fig. 8. Simple recurrent unit (SRU) architecture.
controls how much new information is added, the forget gate decides whereσg and σ h represent the sigmoid function and the hyperbolic
what information should be discarded from the state of the previous tangent, respectively, W and U are parameter matrices, and b is the
unit, and the output gate modulates the extent to which the state of the corresponding bias vector.
unit is filtered (Donahue et al., 2015). Fig. 6 demonstrates the LSTM
architecture. Let ft, It, and Ot be the activation vectors of the forget gate, 3.5.4. Simple recurrent unit
input gate and output gate at time t, respectively, and Ct cell state vector Lei et al. (2018) proposed a SRU model based on the LSTM and GRU
at time t, they can be calculated as follows: models. Similar to LSTM and GRU, SRU utilizes the gate structure to
� control the transmission of information flow. The main design principle
ft ¼ σ Wf1 xt þ Wf2 ht 1 þ bf (9)
of SRU is that the calculation of the gate depends only on the loop of the
current input. Therefore, the calculation of the model only multiplying
It ¼ σðWi1 xt þ Wi2 ht þ bi Þ (10)
point-by-point matrix dependent on the previous time step, which
1
Ot ¼ σðWo1 xt þ Wo2 ht þ bo Þ (11) makes the network easy to parallelize. The SRU sets the forget gate and
1
reset gate, so that the SRU is a deeper network because each layer re
Ct ¼ ft *Ct þ It *C
~t (12) quires less computational power and a higher processing speed. Fig. 8
1
demonstrates a typical SRU architecture. Let ft, rt, It and Ot be the acti
where W1 represents the weight between the input node and the hidden vation vectors of the forget gate, reset gate, input gate and output gate at
node, W2 represents the weight connecting the hidden node to the time t, respectively, they can be calculated as follows:
output node, and b is the corresponding bias vector. �
ft ¼ σ Wf xt þ Uf � ht 1 þ bf (16)
LSTM because the number of its parameters is less than those of LSTM, rt ¼ σðWr xt þ Ur � ht 1 þ br Þ (18)
so the final model is simpler than the regular LSTM model. GRUs have
been very popular due to their better performance on smaller data sets Ot ¼ rt � ht þ ð1 rt Þ � x t (19)
(Chung et al., 2014b). The GRU merges the input and forget gates into a
single update gate with the addition of a reset gate. The update gate where � is the point-wise multiplication operation, W and U are
controls the degree to which the status information of the previous parameter matrices, b is the corresponding bias vector. (Jiang et al.,
moment is brought to the current state. The larger the value of the up 2018).
date gate, the more the status information is brought in at the previous
moment. The reset gate controls the degree of ignoring the status in
3.6. Data representation for recurrent neural networks
formation of the previous moment. The smaller the value of the reset
gate, the more it is neglected. Fig. 7 demonstrates a typical GRU ar
From the mathematical background of the above four RNN-based
chitecture. Let Zt, rt, and Ot be the activation vectors of the update gate,
methods, it can be concluded that RNN is good at processing sequen
reset gate and output gate at time t, respectively, they can be calculated
tial inputs through special recurrent hidden states. Therefore, proper
as follows:
data representation is critical to exploring the predictive potential of
Zt ¼ σ g ðWz xt þ Uz ht 1 þ bz Þ (13) RNNs. In this section, the landslide data representation for RNNs is
presented, as shown in Fig. 9. Firstly, each landslide influencing factor
rt ¼ σg ðWr xt þ Ur ht 1 þ br Þ (14) can be viewed as a single-band image, and all factor layers are stacked
together. Then, as described in section 3.3, the importance of the
Ot ¼ ð1 Zt Þ * ht 1 þ Zt *σ h ðWh xt þ Uh ðrt ht 1 Þ þ bh Þ (15) influencing factors is calculated by using an information gain algorithm.
Next, these factor layers are sorted in descending order of importance.
8
Y. Wang et al. Computers and Geosciences 138 (2020) 104445
9
Y. Wang et al. Computers and Geosciences 138 (2020) 104445
alt - altitude, asp - aspect, fau - distance to fault, lan - land use, lit - lithology, ndv – NDVI, pla - plan curvature, pro - profile curvature, rai – rainfall, riv-distance to river, roa - distance to road, soi – soil, spi – SPI, sti – STI, twi
0.278
0.007
0.118
0.166
0.451
0.029
0.111
0.062
0.468
0.084
0.011
0.276
0.345
0.336
0.208
1.000
twi
0.009
0.015
0.288
0.330
0.333
0.161
0.215
0.176
0.059
0.099
0.085
0.524
0.064
0.512
1.000
sti
0.006
0.009
0.089
0.152
0.121
0.058
0.083
0.080
0.034
0.015
0.032
0.094
0.002
1.000
spi
0.002
0.299
0.013
0.021
0.092
0.020
0.116
0.200
0.009
0.065
0.052
0.128
1.000
soi
0.005
0.018
0.490
0.075
0.545
0.246
0.328
0.035
0.081
0.185
0.133
1.000
slo
0.020
0.164
0.005
0.034
0.030
0.126
0.006
0.115
0.169
1.000
roa
the results (Bagui, 2005; Pham et al., 2017b). If the chi-square value
exceeds 3.841 and the p-value is less than 0.005, there is a significant
difference between the two methods.
0.260
0.046
0.319
0.059
0.002
0.016
0.126
0.009
0.035
1.000
riv
4. Results
0.021
0.007
0.141
0.140
0.003
1.000
rai
0.027
0.044
0.421
0.002
0.000
0.081
1.000
tors. It can be observed that all correlation coefficients are lower than
pro
0.7 and the VIF values of those factors are lower than 5 (TOL > 0:2).
Therefore, all the factors are independent of each other and there is no
0.039
0.010
0.013
1.000
information gain. Based on these results, the lithology factor has the
highest AM value of 0.2573, indicating that it is the dominant factor to
0.028
0.044
0.471
0.254
1.000
ndv
aspect, and land use are all between 0.1 and 0.2, whereas the AM values
of the other factors are between 0 and 0.1, indicating that these factors
can contribute to landslide occurrence.
0.002
0.242
0.294
0.369
1.000
encing factors, the CF model was used. Table 5 lists reclassification re
0.022
0.011
1.000
lan
the flat, northeast, east and southwest classes exhibit positive correla
0.001
1.000
fau
tion and the remaining classes are negatively correlated. For distance to
fault, the CF value gradually decreases as the distance is increased. For
land use, the grass and bare classes are positively correlated to land
0.009
1.000
the O and P classes are more prone to high landslide susceptibility. For
NDVI, the 0.2–0.3 and 0.3–0.4 classes demonstrate a very strong positive
correlation. For rainfall, the highest CF value of the >1100 class in
1.000
soil, the ACu, ALh and ACh classes are more likely to cause landslides.
Table 4
- TWI.
ndv
pro
asp
roa
twi
fau
lan
pla
spi
soi
slo
riv
rai
alt
sti
lit
10
Y. Wang et al. Computers and Geosciences 138 (2020) 104445
Table 5
Analysis of the relationship between each impact factor and landslide occurrence.
Influencing factor Class No. of pixels in domain Percent of domain (%) No. of landslide Percent of landslide (%) CF
11
Y. Wang et al. Computers and Geosciences 138 (2020) 104445
Table 5 (continued )
Influencing factor Class No. of pixels in domain Percent of domain (%) No. of landslide Percent of landslide (%) CF
the direction of northwest and southeast. In Fig. 11 (b), almost all the
Table 6
historical landslide locations are distributed in the very high susceptible
Parameter settings of RNNs for landslide susceptibility mapping.
zone. In Fig. 11 (c), the very high susceptible zone is distributed in the
Optimized Suitable Value Description direction of northwestern and the southern and it was less than those of
Parameter
RNN LSTM GRU SRU the other three models. In Fig. 11 (d), the very high susceptible zone is
Epoch 50 100 150 300 Train all concentrated in the direction of southwest and some areas without the
sample times historical landslide locations were classified into the high-prone zone.
Batch size 64 32 64 32 The size of Table 7 lists the percentage of landslide susceptibility classes of each
each batch of model. Landslide susceptibility assessment can produce a sensitive
data
Dropout 0 0 0.5 0 Mitigate the
model that focuses on highly susceptible areas and processes them in a
occurrence of concise manner (Kornejady et al., 2017a, 2017b). It can be observed that
over-fitting the four methods have the highest proportion of the very low susceptible
Optimizer Adam Adaptive class in the landslide susceptibility maps. Meanwhile, high susceptible
moment
classes occupy the lowest proportion in the resultant maps by RNN, GRU
estimation
Loss Categorical Binary and SRU, respectively, while low susceptible class takes up the lowest
function cross- classification proportion in LSTM-derived map. The evaluation of the practicability of
entropy problem the four models can be measured using the sum of the percentages of the
Learn rate 0.001 Determines the very high and high susceptible classes. From Table 7, the GRU model is
speed of weight
update
more practical than its variants due to the lowest practicality. It can be
also seen that the landslide density (LD) value gradually increases as the
landslide susceptible level is increased from very low to very high.
4.3. Models validation and comparison The results of the four models were assessed using the test set.
Table 8 lists the statistical results of four evaluation indicators of ACC,
All the influencing factors were input into the RNN, LSTM, GRU and recall, F-measure and MCC. It can be observed that GRU has the highest
SRU models, which were implemented in Python under the Tensorflow ACC of 0.7890, which is over 0.02 higher than that of SRU (0.7569). The
(http://www.tensorflow.org) and Scikit-learn (https://scikit-learn. highest recall, F-measure and MCC values were achieved by LSTM. In
org/stable) frameworks. To construct training and test sets, 364 non- addition, the performance of the SRU model is worse than that of the
landslide points were randomly selected from areas where no land other three models in terms of all the statistical measures.
slide occurred. Then, 364 landslide samples and 364 non-landslide Fig. 12 plots the ROC curves of the four models. It can be observed
samples were used to construct the training and validation sets: 70% that all the AUC values by the four models were above 0.83, indicating
of landslides (255) and non-landslides (255) for training and the that the RNN techniques can demonstrate very satisfactory prediction
remaining data (109 and 109) for testing. In addition, to objectively capability. Furthermore, the GRU model was better than the other
describe the behaviors of the four RNN-based models, five-fold cross- models due to its highest AUC value. Table 9 list the chi-square and p
validation was used to find the optimal parameters. Table 6 list the values between different models. It can be observed that all the chi-
parameter settings of the four models for experiments. The output re square values were much larger than 3.841 and all the p values were
sults were analyzed and processed by ArcGIS and the natural breakpoint lower than 0.005, thus there is a significant difference between the
method was used to classify a landslide susceptibility map into five landslide models.
groups, i.e., very high, high, moderate, low and very low. Finally, the Producing accurate and reliable susceptibility maps is important for
landslide susceptibility maps of the four models were obtained and preventing landslide disasters. Recently, a novel performance evalua
shown in Fig. 11. It can be observed that the historical landslide loca tion approach using the photogrammetric technique was presented to
tions are basically in the high susceptibility area, and the low suscepti assess landslide susceptibility maps (Sevgen et al., 2019). It is instructive
bility areas are distributed in the northeast direction of the study area. In in verifying the reliability of the landslide susceptibility map. However,
the northwest and south of the study area, the landslide locations are we tried to collect the remote sensing images of the study area, but we
evenly distributed. In Fig. 11 (a), the proportion of the very high class is could not obtain the stereo pairs to complete the same experiment as in
very lower than that of the other three models, mainly concentrated in the literature. To cope with this situation, we adopted another strategy
12
Y. Wang et al. Computers and Geosciences 138 (2020) 104445
Fig. 11. Landslide susceptibility maps by (a) RNN, (b) LSTM, (c) GRU and (d) SRU.
Table 7
Landslide distribution in predicted landslide susceptible zones.
Model Landslide-susceptible zones No. of pixels in domain Area of zones (%) Practicality (IV þ V) No. of landslide Landslides percentage (%) LD
13
Y. Wang et al. Computers and Geosciences 138 (2020) 104445
5. Discussion
14
Y. Wang et al. Computers and Geosciences 138 (2020) 104445
Fig. 13. Prediction performance assessment using two high-resolution remote sensing images.
studies (Juliev et al., 2019; Ko and Lo, 2018; Yan et al., 2019). Moreover, in selecting parameters. By setting an appropriate learning rate, the
the effectiveness of the resultant susceptibility maps was further verified objective function can converge to a local minimum in an appropriate
using Gaogfen-1 satellite data. Specifically, comparing two different time. Therefore, setting the optimal parameters of the RNN model has a
temporal high-resolution images can show the validity of the resultant crucial impact on the optimization of the model and the experimental
susceptibility maps. Fig. 13 shows that landslide disasters did occur in results. Moreover, searching for optimized parameters is necessary and
these predicted high susceptible areas where no historical landslide important to perform a reliable landslide susceptibility map using the
existed, which indicates that our methods can accurately predicate the RNN techniques.
high risk areas of landslide disasters. In general, these deep learning
models are very effective for LSM, and the deep learning techniques 6. Conclusion
have broad prospects in landslide susceptibility assessment.
It was recorded that the parameters of the RNNs play a crucial role in Landslides are one of the most dangerous natural disasters in China,
demonstrating their predictive performance (Ghorbanzadeh et al., 2019; causing serious casualties and socioeconomic losses every year. This
Yu et al., 2017). Four key parameters were selected to discuss the impact paper mainly aims at the construction of LSM based on RNNs in Yongxin
on landslide susceptibility results, namely epoch, batch size, dropout County, China. A total of 364 historical landslide locations were
and optimizer. Within the range of values, each model has its optimal randomly divided into two parts: 70% for training and 30% for verifi
parameters. The main function of setting the epoch is to divide the entire cation, and 16 influencing factors were chosen based on previous studies
training process of the model into several stages so that we can better and the techniques of the correlation matrix, multi-collinearity analysis,
observe and adjust the training of the model. The batch size can be IG and CF were used to evaluate the correlation between landslides and
regarded as a batch parameter, which controls the number of input these factors. The final landslide susceptibility maps were obtained by
samples that the deep learning network can process during the iteration. the four RNN-related models for comparison. From the experiment re
Fig. 14 (b) shows that 32 and 64 are the optimal batch size. The results in sults, the two main conclusions can be summarized as follows. First, all
Fig. 14 (c) shows that the LSM results of GRU is better when 50% of the RNN-related models obtained satisfactory results in LSM. The AUC
neurons are eliminated during training. This is reasonable because values by these models were higher than 0.83 and the highest AUC value
setting the optimal dropout value can avoid the over-fitting problem was achieved by GRU. Second, the RNN-related models can produce
(Srivastava et al., 2014). Optimizer optimizes the weights of the con accurate landslide susceptibility maps in other prone zones with similar
structed neural network, and Adam is generally considered to be robust geo-environment. In the future, our research will focus on comparing the
15
Y. Wang et al. Computers and Geosciences 138 (2020) 104445
Fig. 14. Area under the curve (AUC) values by RNNs under different parameters for the study area. (a) Epoch, (b) batch size, (c) dropout and (d) optimizer.
RNN-based methods with other benchmark models and deep learning Acknowledgments
models in the field of LSM.
This work was supported by the National Natural Science Foundation
Data availability of China (61271408, 41602362). The authors acknowledge the joint
PhD scholarship awarded to Haoyuan Hong (201906860029) supported
All source codes related to this article can be found at https://github. by the China Scholarship Council. The authors would also like to thank
com/xmblb/RNN.git, an open-source online data repository hosted at the associate editor Candan Gokceoglu and the three anonymous re
GitHub. viewers for their valuable comments and suggestions, which signifi
cantly improved the quality of this paper.
Author Contributions
Appendix A. Supplementary data
Yi Wang had the original idea of this study, finished the first draft
and supervised the research, Zhice Fang and Mao Wang coded the Supplementary data to this article can be found online at https://doi.
research project and conducted the experiments, Ling Peng and org/10.1016/j.cageo.2020.104445.
Haoyuan Hong collected and analyzed research material and contrib
uted to the editing and review of the manuscript. References
Declaration of competing interest Atkinson, P.M., Massari, R., 1998. Generalised linear modelling of susceptibility to
landsliding in the central Apennines, Italy. Comput. Geosci. 24, 373–385.
Ayalew, L., Yamagishi, H., 2005. The application of GIS-based logistic regression for
The authors declare that they have no known competing financial landslide susceptibility mapping in the Kakuda-Yahiko Mountains, Central Japan.
interests or personal relationships that could have appeared to influence Geomorphology 65, 15–31.
Bagui, S.C., 2005. Combining pattern classifiers: methods and algorithms. Technometrics
the work reported in this paper. 47, 517–518.
Basu, T., Pal, S., 2018. RS-GIS based morphometrical and geological multi-criteria
approach to the landslide susceptibility mapping in Gish River Basin, West Bengal,
India. Adv. Space Res. 63, 1253–1269.
16
Y. Wang et al. Computers and Geosciences 138 (2020) 104445
Booth, G.D., Niccolucci, M.J., Schuster, E.G., 1994. Identifying Proxy Sets in Multiple Hong, H., Liu, J., Bui, D.T., Pradhan, B., Acharya, T.D., Pham, B.T., Zhu, A.-X., Chen, W.,
Linear Regression: an Aid to Better Coefficient Interpretation. Research paper INT Ahmad, B.B., 2018a. Landslide susceptibility mapping using J48 decision tree with
(USA). AdaBoost, bagging and rotation forest ensembles in the guangchang area (China).
Bui, D.T., Ho, T.-C., Pradhan, B., Pham, B.-T., Nhu, V.-H., Revhaug, I., 2016. GIS-based Catena 163, 399–413.
modeling of rainfall-induced landslides using data mining-based functional trees Hong, H., Pradhan, B., Sameen, M.I., Kalantar, B., Zhu, A., Chen, W., 2018b. Improving
classifier with AdaBoost, Bagging, and MultiBoost ensemble frameworks. Environ. the accuracy of landslide susceptibility model using a novel region-partitioning
Earth Sci. 75, 1101. approach. Landslides 15, 753–772.
Bui, D.T., Tuan, T.A., Hoang, N.-D., Thanh, N.Q., Nguyen, D.B., Van Liem, N., Jaafari, A., Panahi, M., Pham, B.T., Shahabi, H., Bui, D.T., Rezaie, F., Lee, S., 2019. Meta
Pradhan, B., 2017. Spatial prediction of rainfall-induced landslides for the Lao Cai optimization of an adaptive neuro-fuzzy inference system with grey wolf optimizer
area (Vietnam) using a hybrid intelligent approach of least squares support vector and biogeography-based optimization algorithms for spatial prediction of landslide
machines inference model and artificial bee colony optimization. Landslides 14, susceptibility. Catena 175, 430–445.
447–458. Jiang, C., Chen, S., Chen, Y., Bo, Y., Han, L., Guo, J., Feng, Z., Zhou, H., 2018.
Bui, Q.-T., Nguyen, Q.-H., Nguyen, X.L., Pham, V.D., Nguyen, H.D., Pham, V.-M., 2019. Performance analysis of a deep simple recurrent unit recurrent neural network (SRU-
Verification of novel integrations of swarm intelligence algorithms into deep RNN) in MEMS gyroscope de-noising. Sensors 18, 4471.
learning neural network for flood susceptibility mapping. J. Hydrol 124379. Jiao, Y., Zhao, D., Ding, Y., Liu, Y., Xu, Q., Qiu, Y., Liu, C., Liu, Z., Zha, Z., Li, R., 2019.
Chen, H., Zeng, Z., Tang, H., 2015. Landslide deformation prediction based on recurrent Performance evaluation for four GIS-based models purposed to predict and map
neural network. Neural Process. Lett. 41, 169–178. landslide susceptibility: a case study at a World Heritage site in Southwest China.
Chen, W., Zhang, S., Li, R., Shahabi, H., 2018. Performance evaluation of the GIS-based Catena 183, 104221.
data mining techniques of best-first decision tree, random forest, and naïve Bayes Juliev, M., Mergili, M., Mondal, I., Nurtaev, B., Pulatov, A., Hübl, J., 2019. Comparative
tree for landslide susceptibility modeling. Sci. Total Environ. 644, 1006–1018. analysis of statistical methods for landslide susceptibility mapping in the Bostanlik
Cho, K., Van Merri€enboer, B., Bahdanau, D., Bengio, Y., 2014. On the Properties of District, Uzbekistan. Sci. Total Environ. 653, 801–814.
Neural Machine Translation: Encoder-Decoder Approaches arXiv preprint arXiv: Ko, F.W., Lo, F.L., 2018. From landslide susceptibility to landslide frequency: a territory-
1409.1259. wide study in Hong Kong. Eng. Geol. 242, 12–22.
Chung, C.-J.F., Fabbri, A.G., 1993. The representation of geoscience information for data Kornejady, A., Bahremand, A., Ownegh, M.J.C., 2017a. Landslide susceptibility
integration. Nonrenewable Resour. 2, 122–139. assessment using maximum entropy model with two different data sampling
Chung, J., Gulcehre, C., Cho, K., Bengio, Y., 2014a. Empirical Evaluation of Gated methods. Catena 152, 144–162.
Recurrent Neural Networks on Sequence Modeling (arXiv: Neural and Evolutionary Kornejady, A., Ownegh, M., Rahmati, O., Bahremand, A., 2017b. Landslide susceptibility
Computing). assessment using three bivariate models considering the new topo-hydrological
Chung, J., Gulcehre, C., Cho, K., Bengio, Y., 2014b. Empirical Evaluation of Gated factor: hand. Geocarto Int. 33, 1–19.
Recurrent Neural Networks on Sequence Modeling arXiv preprint arXiv:1412.3555. Lee, S., Pradhan, B., 2007. Landslide hazard mapping at Selangor, Malaysia using
Chung, J., Gulcehre, C., Cho, K., Bengio, Y., 2015. Gated Feedback Recurrent Neural frequency ratio and logistic regression models. Landslides 4, 33–41.
Networks (arXiv: Neural and Evolutionary Computing). Lei, T., Zhang, Y., Wang, S.I., Dai, H., Artzi, Y., 2018. Simple recurrent units for highly
Clerici, A., Perego, S., Tellini, C., Vescovi, P., 2006. A GIS-based automated procedure for parallelizable recurrence. In: Proceedings of the 2018 Conference on Empirical
landslide susceptibility mapping by the conditional analysis method: the Baganza Methods in Natural Language Processing, pp. 4470–4481.
valley case study (Italian Northern Apennines). Environ. Geol. 50, 941–961. Matthews, B.W., 1975. Comparison of the predicted and observed secondary structure of
Crozier, M.J., 2018. Reprint of “A proposed cell model for multiple-occurrence regional T4 phage lysozyme. Biochim. Biophys. Acta Protein Struct. 405, 442–451.
landslide events: implications for landslide susceptibility mapping”. Geomorphology Meusburger, K., Alewell, C., 2009. On the influence of temporal change on the validity of
307, 3–11. landslide susceptibility maps. Natural Hazards and Earth System Sciences (NHESS) &
Ding, Q., Chen, W., Hong, H., 2017. Application of frequency ratio, weights of evidence Discussions (NHESSD) 9, 1495–1507.
and evidential belief function models in landslide susceptibility mapping. Geocarto Mutlu, B., Nefeslioglu, H.A., Sezer, E.A., Akcayol, M.A., Gokceoglu, C., 2019. An
Int. 32, 619–639. experimental research on the use of recurrent neural networks in landslide
Donahue, J., Anne Hendricks, L., Guadarrama, S., Rohrbach, M., Venugopalan, S., susceptibility mapping. ISPRS Int. J. Geo-Inf. 8, 578.
Saenko, K., Darrell, T., 2015. Long-term recurrent convolutional networks for visual Nepal, N., Chen, J., Chen, H., Sharma, T.P.P., 2019. Assessment of landslide
recognition and description. Proc. IEEE.Conf. Comput. Vis. Pattern Recogn. susceptibility along the araniko highway in poiqu/bhote koshi/sun koshi watershed,
2625–2634. Nepal himalaya. Progress in Disaster Science 3, 100037.
Dou, J., Yunus, A.P., Bui, D.T., Merghadi, A., Sahana, M., Zhu, Z., Chen, C.-W., Neuh€ auser, B., Geomorphology, B.T.J., 2007. Landslide susceptibility assessment using
Khosravi, K., Yang, Y., Pham, B.T., 2019a. Assessment of advanced random forest “weights-of-evidence” applied to a study area at the Jurassic escarpment (SW-
and decision tree algorithms for modeling rainfall-induced landslide susceptibility in Germany). Geomorphology 86, 12–24.
the Izu-Oshima Volcanic Island, Japan. Sci. Total Environ. 662, 332–346. Nicu, I.C., As�andulesei, A., 2018. GIS-based evaluation of diagnostic areas in landslide
Dou, J., Yunus, A.P., Tien Bui, D., Sahana, M., Chen, C.-W., Zhu, Z., Wang, W., Pham, B. susceptibility analysis of Bahluieț River Basin (Moldavian Plateau, NE Romania). Are
T., 2019b. Evaluating GIS-based multiple statistical models and data mining for Neolithic sites in danger? Geomorphology 314, 27–41.
earthquake and rainfall-induced landslide susceptibility using the LiDAR DEM. Rem. Noda, K., Yamaguchi, Y., Nakadai, K., Okuno, H.G., Ogata, T., 2015. Audio-visual speech
Sens. 11, 638. recognition using deep learning. Appl. Intell. 42, 722–737.
Dou, J., Yunus, A.P., Xu, Y., Zhu, Z., Chen, C.-W., Sahana, M., Khosravi, K., Yang, Y., Nohani, E., Moharrami, M., Sharafi, S., Khosravi, K., Pradhan, B., Pham, B.T., Lee, S., M
Pham, B.T., 2019c. Torrential rainfall-triggered shallow landslide characteristics and Melesse, A., 2019. Landslide susceptibility mapping using different GIS-based
susceptibility assessment using ensemble data-driven models in the Dongjiang bivariate models. Water 11, 1402.
Reservoir Watershed, China. Nat. Hazards 97, 579–609. Nourani, V., Pradhan, B., Ghaffari, H., Sharifi, S.S.J.N.H., 2014. Landslide susceptibility
Ercanoglu, M., Gokceoglu, C., 2002. Assessment of landslide susceptibility for a mapping at Zonouz Plain, Iran using genetic programming and comparison with
landslide-prone area (north of Yenice, NW Turkey) by fuzzy approach. Environ. frequency ratio, logistic regression, and artificial neural network models. Nat.
Geol. 41, 720–730. Hazards 71, 523–547.
Erener, A., Sivas, A.A., Selcuk-Kestel, A.S., Düzgün, H.S., 2017. Analysis of training Nsengiyumva, J.B., Luo, G., Hakorimana, E., Mind’je, R., Gasirabo, A., Mukanyandwi, V.,
sample selection strategies for regression-based quantitative landslide susceptibility 2019. Comparative analysis of deterministic and semiquantitative approaches for
mapping methods. Comput. Geosci. 104, 62–74. shallow landslide risk modeling in Rwanda. Risk Anal. 39, 2576–2595.
Galli, M., Ardizzone, F., Cardinali, M., Guzzetti, F., Reichenbach, P., 2008. Comparing O’brien, R.M., 2007. A caution regarding rules of thumb for variance inflation factors.
landslide inventory maps. Geomorphology 94, 268–289. Qual. Quantity 41, 673–690.
Gao, H., 2011. Analyses on coupling correlation between landslides and rainfall and Oh, H.-J., Pradhan, B., 2011. Application of a neuro-fuzzy model to landslide-
application in the early-warning and forecast. In: 2011 International Conference on susceptibility mapping for shallow landslides in a tropical hilly area. Comput.
Electric Technology and Civil Engineering (ICETCE), pp. 6432–6436. Geosci. 37, 1264–1276.
Ghorbanzadeh, O., Blaschke, T., Gholamnia, K., Meena, S.R., Tiede, D., Aryal, J., 2019. Ozer, B., Mutlu, B., Nefeslioglu, H., Sezer, E., Rouai, M., Dekayir, A., Gokceoglu, C.,
Evaluation of different machine learning methods and deep-learning convolutional 2019. On the use of hierarchical fuzzy inference systems (HFIS) in expert-based
neural networks for landslide detection. Rem. Sens. 11, 196. landslide susceptibility mapping: the central part of the Rif Mountains (Morocco).
G€
okceoglu, C., Aksoy, H., 1996. Landslide susceptibility mapping of the slopes in the Bull. Eng. Geol. Environ. 1–18.
residual soils of the Mengen region (Turkey) by deterministic stability analyses and Pan, Y., Mei, T., Yao, T., Li, H., Rui, Y., 2016. Jointly modeling embedding and
image processing techniques. Eng. Geol. 44, 147–161. translation to bridge video and language. Proc. IEEE.Conf. Comput. Vis. Pattern
Graves, A., 2012. Long short-term memory. In: Graves, A. (Ed.), Supervised Sequence Recogn. 4594–4602.
Labelling with Recurrent Neural Networks. Springer Berlin Heidelberg, Berlin, Parry, S., 2011. Chapter fifteen - the application of geomorphological mapping in the
Heidelberg, pp. 37–45. assessment of landslide hazard in Hong Kong. In: Smith, M.J., Paron, P., Griffiths, J.
Guzzetti, F., Mondini, A.C., Cardinali, M., Fiorucci, F., Santangelo, M., Chang, K.-T., S. (Eds.), Developments in Earth Surface Processes. Elsevier, pp. 413–441.
2012. Landslide inventory maps: new tools for an old problem. Earth Sci. Rev. 112, Peshevski, I., Jovanovski, M., Abolmasov, B., Papi�c, J., Đuri�c, U., Marjanovi�c, M.,
42–66. Haque, U., Nedelkovska, N., 2019. Preliminary regional landslide susceptibility
Hinton, G.E., Salakhutdinov, R.R., 2006. Reducing the dimensionality of data with neural assessment using limited data. Geol. Croat. 72, 81–92.
networks. Science 313, 504–507. Pham, B.T., 2018. A novel classifier based on composite hyper-cubes on iterated random
Hoermann, S., Bach, M., Dietmayer, K., 2018. Dynamic occupancy grid prediction for projections for assessment of landslide susceptibility. J. Geol. Soc. India 91,
urban autonomous driving: a deep learning approach with fully automatic labeling. 355–362.
In: 2018 IEEE International Conference on Robotics and Automation (ICRA). IEEE, Pham, B.T., Bui, D.T., Dholakia, M., Prakash, I., Pham, H.V., 2016. A comparative study
pp. 2056–2063. of least square support vector machines and multiclass alternating decision trees for
17
Y. Wang et al. Computers and Geosciences 138 (2020) 104445
spatial prediction of rainfall-induced landslides in a tropical cyclones area. Geotech. trees classifier with AdaBoost, Bagging, and MultiBoost ensemble frameworks.
Geol. Eng. 34, 1807–1824. Environ. Earth Sci. 75, 1101.
Pham, B.T., Bui, D.T., Dholakia, M.B., Prakash, I., Pham, H.V., Mehmood, K., Le, H.Q., Tsangaratos, P., Ilia, I., Hong, H., Chen, W., Xu, C., 2017. Applying Information Theory
2017a. A novel ensemble classifier of rotation forest and Naïve Bayer for landslide and GIS-based quantitative methods to produce landslide susceptibility maps in
susceptibility assessment at the Luc Yen district, Yen Bai Province (Viet Nam) using Nancheng County, China. Landslides 14, 1091–1111.
GIS. Geomatics, Nat. Hazards Risk 8, 649–671. Van Den Eeckhaut, M., Vanwalleghem, T., Poesen, J., Govers, G., Verstraeten, G.,
Pham, B.T., Bui, D.T., Prakash, I., Dholakia, M.B.J.C., 2017b. Hybrid integration of Vandekerckhove, L., 2006. Prediction of landslide susceptibility using rare events
Multilayer Perceptron Neural Networks and machine learning ensembles for logistic regression: a case-study in the Flemish Ardennes (Belgium). Geomorphology
landslide susceptibility assessment at Himalayan area (India) using GIS. Catena 149, 76, 392–410.
52–63. Wang, L.-J., Guo, M., Sawada, K., Lin, J., Zhang, J., 2016. A comparative study of
Pham, B.T., Prakash, I., Dou, J., Singh, S.K., Trinh, P.T., Tran, H.T., Le, T.M., Van landslide susceptibility maps using logistic regression, frequency ratio, decision tree,
Phong, T., Khoi, D.K., Shirzadi, A., 2019a. A novel hybrid approach of landslide weights of evidence and artificial neural network. Geosci. J. 20, 117–136.
susceptibility modelling using rotation forest ensemble and different base classifiers. Wang, Q., Wang, Y., Niu, R., Peng, L., 2017. Integration of information theory, K-means
Geocarto Int. 1–25. cluster analysis and the logistic regression model for landslide susceptibility
Pham, B.T., Prakash, I., Khosravi, K., Chapi, K., Trinh, P.T., Ngo, T.Q., Hosseini, S.V., mapping in the Three Gorges Area, China. Rem. Sens. 9, 938.
Bui, D.T., 2019b. A comparison of Support Vector Machines and Bayesian algorithms Wang, Y., Duan, H., Hong, H., 2019a. A comparative study of composite kernels for
for landslide susceptibility modelling. Geocarto Int. 34, 1385–1407. landslide susceptibility mapping: a case study in Yongxin County, China. Catena 183,
Piacentini, D., Troiani, F., Soldati, M., Notarnicola, C., Savelli, D., Schneiderbauer, S., 104217.
Strada, C., 2012. Statistical analysis for assessing shallow-landslide susceptibility in Wang, Y., Fang, Z., Hong, H., 2019b. Comparison of convolutional neural networks for
South Tyrol (south-eastern Alps, Italy). Geomorphology 151, 196–206. landslide susceptibility mapping in Yanshan County, China. Sci. Total Environ. 666,
Pourghasemi, H., Pradhan, B., Gokceoglu, C., Moezzi, K.D., 2012a. Landslide 975–993.
Susceptibility Mapping Using a Spatial Multi Criteria Evaluation Model at Haraz Wang, Y., Fang, Z., Hong, H., Peng, L., 2019c. Flood susceptibility mapping using
Watershed, Iran, Terrigenous Mass Movements. Springer, pp. 23–49. convolutional neural network frameworks. J. Hydrol 582, 124482.
Pourghasemi, H.R., Pradhan, B., Gokceoglu, C., 2012b. Application of fuzzy logic and Xiao, L., Zhang, Y., Peng, G., 2018. Landslide susceptibility assessment using integrated
analytical hierarchy process (AHP) to landslide susceptibility mapping at Haraz deep learning algorithm along the China-Nepal Highway. Sensors 18, 4436.
watershed, Iran. Nat. Hazards 63, 965–996. Xu, S., Niu, R., 2018. Displacement prediction of Baijiabao landslide based on empirical
Powers, D.M., 2011. Evaluation: from Precision, Recall and F-Measure to ROC, mode decomposition and long short-term memory neural network in Three Gorges
Informedness, Markedness and Correlation. area, China. Comput. Geosci. 111, 87–96.
Regmi, A.D., Yoshida, K., Pourghasemi, H.R., DhitaL, M.R., Pradhan, B., 2014. Landslide Yalcin, A., 2008. GIS-based landslide susceptibility mapping using analytical hierarchy
susceptibility mapping along Bhalubang—shiwapur area of mid-Western Nepal using process and bivariate statistics in Ardesen (Turkey): comparisons of results and
frequency ratio and conditional probability models. J. Mt. Sci. 11, 1266–1285. confirmations. Catena 72, 1–12.
Regmi, N.R., Giardino, J.R., Geomorphology, J.D.V.J., 2010. Modeling susceptibility to Yan, F., Zhang, Q., Ye, S., Ren, B., 2019. A novel hybrid approach for landslide
landslides using the weight of evidence approach: western Colorado, USA. susceptibility mapping integrating analytical hierarchy process and normalized
Geomorphology 115, 172–187. frequency ratio methods with the cloud model. Geomorphology 327, 170–187.
Sevgen, E., Kocaman, S., Nefeslioglu, H.A., Gokceoglu, C., 2019. A novel performance Yang, J., Song, C., Yang, Y., Xu, C., Guo, F., Xie, L., 2019. New method for landslide
assessment approach using photogrammetric techniques for landslide susceptibility susceptibility mapping supported by spatial logistic regression and GeoDetector: a
mapping with logistic regression, ANN and random forest. Sensors 19, 3940. case study of Duwen Highway Basin, Sichuan Province, China. Geomorphology 324,
Shirzadi, A., Soliamani, K., Habibnejhad, M., Kavian, A., Chapi, K., Shahabi, H., 62–71.
Chen, W., Khosravi, K., Thai Pham, B., Pradhan, B., 2018b. Novel GIS based machine Yu, S., Jia, S., Xu, C., 2017. Convolutional neural networks for hyperspectral image
learning algorithms for shallow landslide susceptibility mapping. Sensors 18, 3777. classification. Neurocomputing 219, 88–98.
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., Salakhutdinov, R., 2014. Yu, X., Wang, Y., Niu, R., Hu, Y., 2016. A combination of geographically weighted
Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. regression, particle swarm optimization and support vector machine for landslide
Res. 15, 1929–1958. susceptibility mapping: a case study at wanzhou in the three gorges area, China. Int.
Steger, S., Brenning, A., Bell, R., Petschko, H., Glade, T., 2016. Exploring discrepancies J. Environ. Res. Publ. Health 13, 487.
between quantitative validation results and the geomorphic plausibility of statistical Z^ezere, J., Pereira, S., Melo, R., Oliveira, S., Garcia, R., 2017. Mapping landslide
landslide susceptibility maps. Geomorphology 262, 8–23. susceptibility using data-driven methods. Sci. Total Environ. 589, 250–267.
Süzen, M.L., Doyuran, V., 2004. Data driven bivariate landslide susceptibility assessment Zhang, G., Wang, M., Liu, K., 2019. Forest fire susceptibility modeling using a
using geographical information systems: a method and application to Asarsuyu convolutional neural network for yunnan Province of China. Int. J.Disaster Risk.Sci.
catchment. Turkey. Eng. Geol. 71, 303–321. 10, 386–403.
Tallarida, R.J., Murray, R.B., 1987. Chi-square test. In: Tallarida, R.J., Murray, R.B. Zhou, C., Yin, K., Cao, Y., Ahmed, B., Li, Y., Catani, F., Pourghasemi, H.R., 2018.
(Eds.), Manual of Pharmacologic Calculations: with Computer Programs. Springer Landslide susceptibility modeling applying machine learning methods: a case study
New York, New York, NY, pp. 140–142. from Longju in the Three Gorges Reservoir area, China. Comput. Geosci. 112, 23–37.
Tang, D., Qin, B., Liu, T., 2015. Document modeling with gated recurrent neural network Zhu, A.-X., Miao, Y., Liu, J., Bai, S., Zeng, C., Ma, T., Hong, H., 2019. A similarity-based
for sentiment classification. In: Proceedings of the 2015 Conference on Empirical approach to sampling absence data for landslide susceptibility mapping using data-
Methods in Natural Language Processing, pp. 1422–1432. driven methods. Catena 183, 104188.
Thai Pham, B., Bui, D.T., Prakash, I., 2019. Landslide susceptibility modelling using Zhu, A.-X., Miao, Y., Yang, L., Bai, S., Liu, J., Hong, H., 2018. Comparison of the
different advanced decision trees methods. Civ. Eng. Environ. Syst. 35, 139–157. presence-only method and presence-absence method in landslide susceptibility
Tien Bui, D., Ho, T.-C., Pradhan, B., Pham, B.-T., Nhu, V.-H., Revhaug, I., 2016. GIS- mapping. Catena 171, 222–233.
based modeling of rainfall-induced landslides using data mining-based functional
18