Agriculture 12 01707
Agriculture 12 01707
Article
Winter Wheat Yield Prediction Using an LSTM Model from
MODIS LAI Products
Jian Wang 1 , Haiping Si 1 , Zhao Gao 2 and Lei Shi 1, *
1 College of Information and Management Science, Henan Agricultural University, Zhengzhou 450002, China
2 The First Geodetic Survey Team of the Ministry of Natural Resources, Shaanxi Bureau of Surveying,
Mapping and Geoinformation, Xi’an 710054, China
* Correspondence: [email protected]; Tel.: +86-0371-56990030
Abstract: Yield estimation using remote sensing data is a research priority in modern agriculture. The
rapid and accurate estimation of winter wheat yields over large areas is an important prerequisite
for food security policy formulation and implementation. In most county-level yield estimation
processes, multiple input data are used for yield prediction as much as possible, however, in some
regions, data are more difficult to obtain, so we used the single-leaf area index (LAI) as input data for
the model for yield prediction. In this study, the effects of different time steps as well as the LAI time
series on the estimation results were analyzed for the properties of long short-term memory (LSTM),
and multiple machine learning methods were compared with yield estimation models constructed by
the LSTM networks. The results show that the accuracy of the yield estimation results using LSTM
did not show an increasing trend with the increasing step size and data volume, while the yield
estimation results of the LSTM were generally better than those of conventional machine learning
methods, with the best R2 and RMSE results of 0.87 and 522.3 kg/ha, respectively, in the comparison
between predicted and actual yields. Although the use of LAI as a single input factor may cause
yield uncertainty in some extreme years, it is a reliable and promising method for improving the
yield estimation, which has important implications for crop yield forecasting, agricultural disaster
monitoring, food trade policy, and food security early warning.
Citation: Wang, J.; Si, H.; Gao, Z.; Shi,
L. Winter Wheat Yield Prediction
Keywords: winter wheat; yield estimation; LSTM; LAI; deep learning
Using an LSTM Model from MODIS
LAI Products. Agriculture 2022, 12,
1707. https://doi.org/10.3390/
agriculture12101707
1. Introduction
Academic Editor: Koki Homma
Wheat is an important crop in China, and its yield is directly related to the development
Received: 25 August 2022 of the national economy. Timely, accurate, and wide-ranging monitoring and forecasting of
Accepted: 12 October 2022 wheat yields is of great practical significance for national economic development and food
Published: 17 October 2022 policy formulation [1,2]. Due to its large coverage area and short detection period, satellite
Publisher’s Note: MDPI stays neutral
remote sensing provides a new technical tool for large-scale crop estimation and is rapidly
with regard to jurisdictional claims in
becoming the most widely used technology in crop estimation.
published maps and institutional affil- At present, the methods of crop yield estimation using remote sensing technology can
iations. be broadly classified into three categories according to the characteristics of the models used:
(1) the empirical modeling method; (2) the mechanistic modeling method; and (3) the semi-
empirical (semi-mechanistic) modeling method. The empirical model directly uses spectral
vegetation indices or canopy remote sensing inversion parameters to establish relationships
Copyright: © 2022 by the authors. with crop yields, which are characterized by their simplicity and ease, involving fewer
Licensee MDPI, Basel, Switzerland. crop yield formation mechanisms, and the relationships are generally established using
This article is an open access article conventional machine learning methods, such as support vector machine and random
distributed under the terms and forest, with NDVI or leaf area index (LAI) as input parameters [3–8]. Such relationships
conditions of the Creative Commons are usually localized and difficult to generalize to other agricultural areas. Semi-empirical
Attribution (CC BY) license (https://
models and semi-mechanical models are also known as parametric models, among which
creativecommons.org/licenses/by/
the light energy utilization efficiency model is the most widely used [6,9–11], but some
4.0/).
parameters are difficult to quantitatively simulate. Mechanical models fully consider the
mechanism of crop yield formation, but their solution process is complex and requires
more input parameters, and some necessary parameters in the operation are difficult to
obtain at a regional scale; meanwhile, most of these models estimate crop yields at the
field scale [12–15]. Although the models work well for archiving yields, the accuracy may
be reduced when scaled to the national level. Many a priori parameters are required in
regional estimation. Due to the heterogeneity of the ground surface, the accuracy of the
ground parameters is generally low, especially in the case of small farmland in China,
resulting in low regional accuracy. Moreover, the computational process is complicated
and requires many parameters, which can be limited in practical use.
In recent years, deep learning has been successfully applied to several fields, such as
image recognition and language translation [11,16–22]. Compared with traditional machine
learning methods, deep learning techniques often achieve better performance. CNN and
recurrent neural network (RNN) are more widely used models in neural networks and
have also been applied to crop yield estimation and prediction [19,23–29]. LSTM is a special
kind of RNN [30,31], due to its recursive structure and gating mechanism that regulates
the entry and exit of information into and out of cells, as well as its processing of sequential
data. The LSTM has feedback connections and can handle the input sequences of arbitrary
length and is often preferred in the classification, processing, and prediction based on time
series data. Several studies used LSTM for crop yield prediction with impressive results.
LSTM not only captures trends in the data but also describes the dependencies of the time
series data. Tian et al. built an LSTM model by integrating the meteorological data and
two remote sensing indices (vegetation temperature condition index (VTCI) and LAI) to
estimate wheat yield in Guanzhong Plain [32]. Jeong et al. used water-related indices and
the maximum temperature as inputs for rice yield prediction using an LSTM model, which
showed reliable early prediction accuracy [16]. Sun et al. used a CNN-LSTM model to
predict the end-of-season and in-season yields of soybean in the county. The input data for
the model included meteorological data and MODIS surface temperature (LST) [24]. The
LSTM model was shown to have high prediction accuracy for crop yield estimation, but all
of the above methods for estimating crop yield use multiple data as input parameters, and
it is difficult to obtain non-remote sensing data in some regions [33–35]. There are a large
number of mature remote sensing products for LAI data, so this study mainly considered
using LAI remote sensing products as single model input data and what kind of accuracy
could be achieved when using deep learning algorithms for yield estimation.
In this study, LSTM was used to estimate the winter wheat yield at the county scale
based on the relationship between time series LAI products and winter wheat yield. Con-
sidering the simplicity of obtaining LAI data, the model input parameters were only the
leaf area index to verify the accuracy that could be achieved under the influence of a
single factor. Moreover, the time step of the input remote sensing data of the model was
considered so as to determine the accuracy in different time data.
large amount of quality information associated with each pixel, whether the pixel is la-
Agriculture 2022, 12, 1707 3 of 13
beled as cloudy, clear, or in cloud shadow. In a subsequent study, MODIS LAI selected
high-quality pixels derived using the master algorithm under cloud-free conditions. Fig-
ure 2 shows the time series results of the MODIS LAI for the entire winter wheat growing
monsoons,
region Henan
in Henan, and Province has a wide
the maximum valuevariety
of the of
LAIsoilwas
types and large differences
approximately in climate
2. Considering
resources, resulting in significant differences in crop yields in different regions.
that LAI is at a low level in the early and late stages of wheat growth, and that these According
to the
growth production
stages conditions
do not have of producing
a particularly areas andonthe
strong influence climatic
yield characteristics
formation, of the
we used LAI
wheat growth period, the wheat-growing
from flowering to maturity each year as input data.areas in Henan Province can be divided into
five regions.
Figure 1. The geographic location of the study area and classification results (The red box on the left
Figure 1. The geographic location of the study area and classification results (The red box on the left
is location
is the the location of Henan
of Henan Province.
Province. (a)—false
(a)—false color
color composite
composite withwith the MODIS09
the MODIS09 data;
data; andand (b)—land
(b)—land
classification,
classification, green
green represents
represents the the winter
winter wheat
wheat region).
region).
1. The wheat area in Nanyang Basin, including Nanyang City and Biyang Countyin
Zhumadian, is a typical rainfed and semi-rainfed area due to the relatively poor field
supporting projects and irrigation conditions.
2. Rice stubble and wheat areas in southern Henan, including Xinyang, southern Zhu-
madian, and Nanyang Tongbai. The soil in this area is heavy, and the precipitation
during the wheat growth period is relatively high.
3. In western Henan, southwestern Henan, and northern Henan, dry wheat areas,
including Luoyang, Sanmenxia, Jiyuan, Pingdingshan, Anyang, and other shallow
hilly areas, drought, winter, and spring freezing damage, rust, and yellow dwarf
disease are the main factors affecting wheat yield.
4. The wheat area in north-central Henan Province, including Xuchang, Zhengzhou,
Luoyang, and the irrigated land north of the Yellow River, has good production
conditions and high production levels.
Agriculture 2022, 12, x FOR PEER REVIEW 4 of 13
large amount of quality information associated with each pixel, whether the pixel is la-
beled as cloudy, clear, or in cloud shadow. In a subsequent study, MODIS LAI selected
Agriculture 2022, 12, 1707 4 of 13
high-quality pixels derived using the master algorithm under cloud-free conditions. Fig-
ure 2 shows the time series results of the MODIS LAI for the entire winter wheat growing
region in Henan, and the maximum value of the LAI was approximately 2. Considering
5. LAI
that The is
wheat
at a area
low in the central
level and eastern
in the early partstages
and late of Henan Province
of wheat includes
growth, andthe irrigated
that these
land in the middle- and high-yield wheat areas in the north-central part of Zhumadian,
growth stages do not have a particularly strong influence on yield formation, we used LAI
Luohe, Zhoukou,
from flowering Shangqiu,
to maturity andas
each year Pingdingshan
input data. Mountain.
The topography and climate within each of the above production areas are relatively
consistent; so, we developed a winter wheat yield model for each production area. The
actual yield data for winter wheat in this study were provided by the Henan Provincial
Bureau of Statistics.
Figure 2. The mean curve of MODIS LAI in Henan Province from 2003 to 2016. The red line is the
error bars plotted using standard deviation; blue dots represent the mean; between the two green
dotted lines is the LAI for one year.
Agriculture 2022, 12, 1707 5 of 13
2.4. LSTM
The recurrent neural network (RNN) is a type of neural network with short-term
memory capabilities. In a cyclic neural network, a neuron cannot only receive information
from other neurons but also its own information, forming a network structure with loops.
Compared with feedforward neural networks, recurrent neural networks are more in line
with the structure of biological neural networks. Recurrent neural networks have been
widely used in tasks such as speech recognition, language modeling, and natural language
generation. The parameter learning of recurrent neural networks can be learned by the
backpropagation algorithm over time. The backpropagation algorithm with time transmits
the error information step by step in the reverse order of time. When the input sequence
is relatively long, there will be the problem of gradient explosion and disappearance. In
order to solve this problem, people have made many improvements to the cyclic neural
network. The most effective means of improvement is to introduce a gating mechanism,
one of which is called a long short-term memory network (LSTM). LSTM is a variant of a
cyclic neural network, which can effectively solve the problem of gradient explosion or the
disappearance of a simple cyclic neural network.
The ingenuity of LSTM is that, by increasing the input threshold, the forgetting
threshold, and output threshold, the weight of the self-loop is changed. As such, when
the model parameters are fixed, the integration scale at different times can be dynamically
changed, thereby avoiding the problem of gradient disappearance or gradient expansion.
The LSTM network introduces a gating mechanism to control the path of information
transmission. The three “gates” are the input gate it , the forget gate f t , and the output gate
ot . The functions of these three gates are as follows:
(1) The forgetting gate f t controls how much information needs to be forgotten to
control the internal state ct−1 at the last moment;
(2) The enter gate it controls the candidate state cet at the current moment and how
much information needs to be saved;
(3) The output gate ot controls how much information of the internal state ct at the
current moment needs to be output to the external state ht .
When f t = 0 and it = 1, the memory unit clears the historical information and writes
the candidate state vector cet . However, the memory unit ct is still related to the historical
information at the previous moment. When f t = 1 and it = 0, the memory unit will copy the
content of the previous moment without writing new information.
Figure 3 shows the cyclic unit structure of the LSTM network. The calculation process
is: (1) first use the external state ht−1 at the previous moment and the input xt at the current
moment to calculate the three gates and the candidate state cet ; (2) combine the forget gate
f t and the input gate it to update the memory unit ct ; and (3) combine the output gate ot to
transfer the information of the internal state to the external state ht .
Agriculture 2022, 12, x FOR PEER REVIEW 6 of 13
Figure3.3.Cyclic
Figure Cycliccell
cellstructure
structureof
ofthe
theLSTM
LSTMnetwork.
network.
By
Bymeans
meansof
ofLSTM
LSTMcyclic
cyclicunits,
units,the
thewhole
wholenetwork
networkcan canbe
bebuilt
builtup
upwith
with long-distance
long-distance
temporal dependencies. It can be succinctly described
temporal dependencies. It can be succinctly described as as
𝑐̃ 𝑡𝑎𝑛ℎ
cet tanh
ot 𝑜 = 𝜎 𝑥
𝑖 = σ 𝜎 w𝑤 xℎt ++b𝑏 , (1)
it σ (1)
h t −1
𝑓
f t σ𝜎
𝑐 =K 𝑓 ⨀𝑐 + 𝑖K ⨀𝑐̃ , (2)
ct = f t c t −1 + i t cet (2)
ℎ o=
t 𝑜 ⨀tanh
tan h((𝑐
ct )). (3)
K
ht = (3)
Inthe
In theLSTM
LSTMnetwork,
network, aa memory
memory unitunit cc can
can capture
capture aakey
keypiece
pieceof
ofinformation
information atat aa
certainmoment
certain momentandandcan
canstore
storethis
thiskey
keyinformation
informationfor foraacertain
certaintime
timeinterval.
interval.The
Thelifetime
lifetime
ofinformation
of information stored
stored in
in memory
memory unit
unit cc isislonger
longerthan
thanthat
thatofofshort-term
short-term memory
memory hhbut but
Agriculture 2022, 12, x FOR PEER REVIEW
muchshorter
much shorterthan
thanthat
thatofoflong-term
long-termmemory.
memory.FigureFigure44shows
showsthetheworkflow
workflowfrom
fromLAI
LAI7 data
of 13
data
processingto
processing toLSTM
LSTMmodel
modelbuilding
buildingand
andyieldyieldestimation.
estimation.
𝑅𝑀𝑆𝐸 = ∑ (𝑦 − 𝑦 ) , (4)
∑ (𝑦 − 𝑦 )
𝑅 =1− , (5)
∑ (𝑦 − 𝑦 )
100% 𝑦 −𝑦
𝑀𝐴𝑃𝐸 = , (6)
𝑛 𝑦
where 𝑦 represents the actual yield, 𝑦 is the predicted yield, and 𝑛 is the sample size.
Figure 4. Overall
Figure4. Overall structure
structureof
ofthe
theLSTM
LSTMmodel
modelfor
forwheat
wheatyield
yieldestimation.
estimation.
2
∑in=1 (ŷi − yi )
R2 = 1 − 2
, (5)
∑in=1 (yi − ŷi )
100% n ŷi − yi
n i∑
MAPE = , (6)
=1
yi
where yi represents the actual yield, ŷi is the predicted yield, and n is the sample size.
Figure 5. Comparison of yield estimation results for LSTM models based on different time steps:
Figure 5. Comparison of yield estimation results for LSTM models based on different time steps:
(A) one time
(A) one time step;
step; (B)
(B) two
two time
timesteps;
steps;(C)
(C)three
threetime
timesteps;
steps;(D)
(D)four time
four timesteps; (E)(E)
steps; fivefive
time steps;
time and
steps;
(F) six
and (F) time steps.
six time TheThe
steps. probability distributions
probability of the
distributions statistical
of the and
statistical estimated
and estimatedyields
yieldsareare
plotted on
plotted
thethe
on right and
right toptop
and of of
thethe
Y axis, respectively.
Y axis, respectively.
Table 1.
Table The accuracy
1. The accuracy evaluation
evaluationof
ofthe
the LSTM
LSTMprediction
predictionmodel
modelwas
wascompared
comparedbetween
betweendifferent
different
timesteps
time stepswhen
when1212LAI
LAIdata
datawere
wereinput.
input.
Figure 6.
Figure The model
6. The model performance
performance (predicted
(predicted RMSE)
RMSE)using
usingdifferent
differenttime
timesteps
stepsfor
forthe
thewhole
wholegrowing
grow-
season.
ing (A)(A)
season. oneone
timetime
step; (B) (B)
step; twotwo
timetime
steps; (C) three
steps; timetime
(C) three steps; (D) four
steps; timetime
(D) four steps; (E) five
steps; (E) time
five
steps;
time and (F)
steps; andsix
(F)time steps.steps.
six time
Comparisonofofthe
Figure7.7.Comparison
Figure theyield
yieldestimation
estimationresults
resultsfor
forLSTM
LSTMmodels
modelsbased
basedon ondifferent
differenttime
timesteps
steps
(A) one time step; (B) two time steps; (C) three time steps; (D) four time steps; (E) five
(A) one time step; (B) two time steps; (C) three time steps; (D) four time steps; (E) five time steps;time steps; and
(F) (F)
and six six
time steps.
time TheThe
steps. probability
probabilitydistributions of the
distributions statistical
of the and
statistical estimated
and estimated yields areare
yields plotted on
plotted
thethe
on right
rightand top
and topof of
thethe
Y axis, respectively.
Y axis, respectively.
Table 2. The accuracy evaluation of the LSTM prediction model was compared between different
time steps when 6 LAI data were input.
Figure 8. The model performance (predicted RMSE) using different time steps for only one period of
the growing season. (A) one time step; (B) two time steps; (C) three time steps; (D) four time steps;
(E) five time steps; and (F) six time steps.
models for each agroecological zone. Random forest (RF) is a supervised machine learning
algorithm based on integration learning [39]. Different subsets are randomly drawn from
the provided data and used to build several different decision trees and integrate the results
of one decision tree according to Bagging’s rules. Support vector regression (SVR) is a
regression algorithm that is a variant of SVM in regression analysis [40]. SVR also consid-
ers maximization intervals but considers points within the decision boundary so that as
many sample points as possible lie within the interval. The partial least squares regression
(PLSR) [41] algorithm is a regression modeling method for multiple dependent variables
Y on multiple independent variables X. The algorithm considers extracting as many prin-
cipal components as possible from Y and X in building the regression and maximizing
the correlation between the principal components extracted from X and Y, respectively.
XGBoost [42] is a scalable machine learning system that adds to the objective function of
each iteration regular term to further reduce the risk of overfitting, XGBoost is an all-in-one
machine learning algorithm.
The yield prediction of winter wheat in Henan Province was constructed using the
four methods mentioned above. The prediction model first used winter wheat LAI time
series data from 2003 to 2015, followed by yield prediction for 2016, and the R2 and RMSE
of the prediction results are shown in Table 3. For the whole of Henan Province, the best
performance among the four methods was the SVR with R2, RMSE and MAPE of 0.76,
725.8 kg/ha and 6.33%, respectively, and the worst was PLSR with R2, RMSE and MAPE
of 0.7 and 809.1 kg/ha and 7.74%, respectively. Compared with these machine learning
methods, the prediction results of the LSTM had better accuracy and performance both for
individual wheat growing areas and for the whole of Henan province.
The prediction accuracy of the four machine learning methods was the lowest in the
southwest Henan region, which is a mountainous and hilly area with complex topography, a
small winter wheat growing area, and large yield variation, and there may be a lack of yield
accuracy in this region using conventional methods to construct the model. Compared with
these, using LSTM model for winter wheat yield prediction had a superior performance.
Compared to algorithms such as SVR and RF, the accuracy of estimation was not sufficient,
because the ability to analyze complex nonlinear relationships between long time series
variables is not as good as LSTM, resulting in poor model performance. These machine
learning methods do not consider the time correlation between winter wheat yields in the
modeling process, and the estimation for each year’s yield is conducted independently,
while LSTM takes into account the time series correlation of yields and also can better
handle the nonlinear relationship, so it has higher accuracy compared to machine learning.
Overall, a better performance capability can be obtained using LSTM models for forecasting
time series data.
4. Conclusions
In this study, considering the complexity of data collection, we used LAI as a single
input variable and five models, including four machine learning models (RF, SVR, PLSR,
and XGBOOST) and one deep learning model (LSTM) to predict the winter wheat yield in
Henan Province in 2016. In general, the LSTM model had superior performance compared
with the machine learning models. Moreover, considering the characteristics of the LSTM,
the time step of the modeled data as well as the growth period data were analyzed, and the
Agriculture 2022, 12, 1707 12 of 13
time step needs to be analyzed for different growth regions, while using only the necessary
growth period data can also obtain a high prediction accuracy; however, for the robustness
of the model, the more growth period data are used accordingly, the better. To date, winter
wheat yield prediction based on remote sensing images has been carried out at the county
level. However, the determination of crop yield remains a challenge because the variability
and uncertainty within the region is unknown. The results of our study on winter wheat
yield prediction at a regional scale using publicly available data, using LAI as an input
variable for determining crop yield, can potentially be applied to crop yield estimation in
regions with sparse observational data and worldwide.
References
1. Becker-Reshef, I.; Justice, C.; Sullivan, M.; Vermote, E.; Tucker, C.; Anyamba, A.; Small, J.; Pak, E.; Masuoka, E.; Schmaltz, J.
Monitoring global croplands with coarse resolution earth observations: The Global Agriculture Monitoring (GLAM) project.
Remote Sens. 2010, 2, 1589–1609. [CrossRef]
2. Tilman, D.; Balzer, C.; Hill, J.; Befort, B.L. Global food demand and the sustainable intensification of agriculture. Proc. Natl. Acad.
Sci. USA 2011, 108, 20260–20264. [CrossRef] [PubMed]
3. Becker-Reshef, I.; Vermote, E.; Lindeman, M.; Justice, C. A generalized regression-based model for forecasting winter wheat
yields in Kansas and Ukraine using MODIS data. Remote Sens. Environ. 2010, 114, 1312–1323. [CrossRef]
4. Xie, Y.; Wang, P.; Bai, X.; Khan, J.; Zhang, S.; Li, L.; Wang, L. Assimilation of the leaf area index and vegetation temperature
condition index for winter wheat yield estimation using Landsat imagery and the CERES-Wheat model. Agric. For. Meteorol.
2017, 246, 194–206. [CrossRef]
5. Mkhabela, M.; Bullock, P.; Raj, S.; Wang, S.; Yang, Y. Crop yield forecasting on the Canadian Prairies using MODIS NDVI data.
Agric. For. Meteorol. 2011, 151, 385–393. [CrossRef]
6. Lai, Y.; Pringle, M.; Kopittke, P.M.; Menzies, N.W.; Orton, T.G.; Dang, Y.P. An empirical model for prediction of wheat yield, using
time-integrated Landsat NDVI. Int. J. Appl. Earth Obs. Geoinf. ITC J. 2018, 72, 99–108. [CrossRef]
7. Moriondo, M.; Maselli, F.; Bindi, M. A simple model of regional wheat yield based on NDVI data. Eur. J. Agron. 2007, 26, 266–274.
[CrossRef]
8. Wall, L.; Larocque, D.; Léger, P. The early explanatory power of NDVI in crop yield modelling. Int. J. Remote Sens. 2008, 29,
2211–2225. [CrossRef]
9. Chen, Y.; Zhang, Z.; Tao, F. Improving regional winter wheat yield estimation through assimilation of phenology and leaf area
index from remote sensing data. Eur. J. Agron. 2018, 101, 163–173. [CrossRef]
10. Franch, B.; Vermote, E.; Becker-Reshef, I.; Claverie, M.; Huang, J.; Zhang, J.; Justice, C.; Sobrino, J.A. Improving the timeliness of
winter wheat production forecast in the United States of America, Ukraine and China using MODIS data and NCAR Growing
Degree Day information. Remote Sens. Environ. 2015, 161, 131–148. [CrossRef]
11. Pantazi, X.E.; Moshou, D.; Alexandridis, T.; Whetton, R.L.; Mouazen, A.M. Wheat yield prediction using machine learning and
advanced sensing techniques. Comput. Electron. Agric. 2016, 121, 57–65. [CrossRef]
12. Huang, J.; Ma, H.; Sedano, F.; Lewis, P.; Liang, S.; Wu, Q.; Su, W.; Zhang, X.; Zhu, D. Evaluation of regional estimates of winter
wheat yield by assimilating three remotely sensed reflectance datasets into the coupled WOFOST–PROSAIL model. Eur. J. Agron.
2019, 102, 1–13. [CrossRef]
13. Huang, J.; Sedano, F.; Huang, Y.; Ma, H.; Li, X.; Liang, S.; Tian, L.; Zhang, X.; Fan, J.; Wu, W.; et al. Assimilating a synthetic Kalman
filter leaf area index series into the WOFOST model to improve regional winter wheat yield estimation. Agric. For. Meteorol. 2016,
216, 188–202. [CrossRef]
14. Huang, J.; Tian, L.; Liang, S.; Ma, H.; Becker-Reshef, I.; Huang, Y.; Su, W.; Zhang, X.; Zhu, D.; Wu, W.; et al. Improving winter
wheat yield estimation by assimilation of the leaf area index from Landsat TM and MODIS data into the WOFOST model. Agric.
For. Meteorol. 2015, 204, 106–121. [CrossRef]
Agriculture 2022, 12, 1707 13 of 13
15. Zhuo, W.; Huang, J.; Li, L.; Zhang, X.; Ma, H.; Gao, X.; Huang, H.; Xu, B.; Xiao, X. Assimilating soil moisture retrieved from
Sentinel-1 and Sentinel-2 data into WOFOST model to improve winter wheat yield estimation. Remote Sens. 2019, 11, 1618.
[CrossRef]
16. Jeong, S.; Ko, J.; Yeom, J.-M. Predicting rice yield at pixel scale through synthetic use of crop and deep learning models with
satellite data in South and North Korea. Sci. Total Environ. 2022, 802, 149726. [CrossRef] [PubMed]
17. Bolton, D.K.; Friedl, M.A. Forecasting crop yield using remotely sensed vegetation indices and crop phenology metrics. Agric. For.
Meteorol. 2013, 173, 74–84. [CrossRef]
18. Krizhevsky, A.; Sutskever, I.; Hinton, G.E. Imagenet classification with deep convolutional neural networks. Commun. ACM 2017,
60, 84–90. [CrossRef]
19. Koirala, A.; Walsh, K.B.; Wang, Z.; McCarthy, C. Deep learning–Method overview and review of use for fruit detection and yield
estimation. Comput. Electron. Agric. 2019, 162, 219–234. [CrossRef]
20. Fischer, T.; Krauss, C. Deep learning with long short-term memory networks for financial market predictions. Eur. J. Oper. Res.
2018, 270, 654–669. [CrossRef]
21. LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444. [CrossRef] [PubMed]
22. Schmidhuber, J. Deep learning in neural networks: An overview. Neural Netw. 2015, 61, 85–117. [CrossRef] [PubMed]
23. Khaki, S.; Wang, L.; Archontoulis, S.V. A cnn-rnn framework for crop yield prediction. Front. Plant Sci. 2020, 10, 1750. [CrossRef]
[PubMed]
24. Sun, J.; Di, L.; Sun, Z.; Shen, Y.; Lai, Z. County-level soybean yield prediction using deep CNN-LSTM model. Sensors 2019, 19,
4363. [CrossRef]
25. Khaki, S.; Wang, L. Crop yield prediction using deep neural networks. Front. Plant Sci. 2019, 10, 621. [CrossRef]
26. Maimaitijiang, M.; Sagan, V.; Sidike, P.; Hartling, S.; Esposito, F.; Fritschi, F.B. Soybean yield prediction from UAV using
multimodal data fusion and deep learning. Remote Sens. Environ. 2020, 237, 111599. [CrossRef]
27. Cao, J.; Zhang, Z.; Luo, Y.; Zhang, L.; Zhang, J.; Li, Z.; Tao, F. Wheat yield predictions at a county and field scale with deep
learning, machine learning, and google earth engine. Eur. J. Agron. 2021, 123, 126204. [CrossRef]
28. Nevavuori, P.; Narra, N.; Linna, P.; Lipping, T. Crop yield prediction using multitemporal UAV data and spatio-temporal deep
learning models. Remote Sens. 2020, 12, 4000. [CrossRef]
29. Hara, P.; Piekutowska, M.; Niedbała, G. Selection of independent variables for crop yield prediction using artificial neural network
models with remote sensing data. Land 2021, 10, 609. [CrossRef]
30. Hochreiter, S.; Schmidhuber, J. Long short-term memory. Neural Comput. 1997, 9, 1735–1780. [CrossRef]
31. Graves, A. Generating sequences with recurrent neural networks. arXiv Prepr. 2013, arXiv:1308.0850.
32. Tian, H.; Wang, P.; Tansey, K.; Han, D.; Zhang, J.; Zhang, S.; Li, H. A deep learning framework under attention mechanism for
wheat yield estimation using remotely sensed indices in the Guanzhong Plain, PR China. Int. J. Appl. Earth Obs. Geoinf. ITC J.
2021, 102, 102375. [CrossRef]
33. Liu, Y.; Wang, S.; Wang, X.; Chen, B.; Chen, J.; Wang, J.; Huang, M.; Wang, Z.; Ma, L.; Wang, P.; et al. Exploring the superiority
of solar-induced chlorophyll fluorescence data in predicting wheat yield using machine learning and deep learning methods.
Comput. Electron. Agric. 2022, 192, 106612. [CrossRef]
34. Zhang, L.; Zhang, Z.; Luo, Y.; Cao, J.; Xie, R.; Li, S. Integrating satellite-derived climatic and vegetation indices to predict
smallholder maize yield using deep learning. Agric. For. Meteorol. 2021, 311, 108666. [CrossRef]
35. Cai, Y.; Guan, K.; Lobell, D.; Potgieter, A.B.; Wang, S.; Peng, J.; Xu, T.; Asseng, S.; Zhang, Y.; You, L.; et al. Integrating satellite
and climate data to predict wheat yield in Australia using machine learning approaches. Agric. For. Meteorol. 2019, 274, 144–159.
[CrossRef]
36. Dong, J.; Fu, Y.; Wang, J.; Tian, H.; Fu, S.; Niu, Z.; Han, W.; Zheng, Y.; Huang, J.; Yuan, W. Early-season mapping of winter wheat
in China based on Landsat and Sentinel images. Earth Syst. Sci. Data 2020, 12, 3081–3095. [CrossRef]
37. Sharma, L.K.; Singh, T.N. Regression-based models for the prediction of unconfined compressive strength of artificially structured
soil. Eng. Comput. 2018, 34, 175–186. [CrossRef]
38. Peng, J.; Kim, M.; Kim, Y.; Jo, M.; Kim, B.; Sung, K.; Lv, S. Constructing Italian ryegrass yield prediction model based on climatic
data by locations in South Korea. Grassl. Sci. 2017, 63, 184–195. [CrossRef]
39. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [CrossRef]
40. Chang, C.-C.; Lin, C.-J. LIBSVM: A library for support vector machines. ACM Trans. Intell. Syst. Technol. 2011, 2, 1–27. [CrossRef]
41. Wegelin, J.A. A Survey of Partial Least Squares (PLS) Methods, with Emphasis on the Two-Block Case; Technical Report; University of
Washington: Seattle, DC, USA, 2000.
42. Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International
Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794.