0% found this document useful (0 votes)

17 views27 pages

Tomato Leaf Disease

Uploaded by

sonnetchy19

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views27 pages

Tomato Leaf Disease

Uploaded by

sonnetchy19

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 27

Journal Pre-proof

Attention embedded residual CNN for disease detection in tomato leaves

Karthik R., Hariharan M., Sundar Anand, Priyanka Mathikshara,

Annie Johnson, Menaka R.

PII: S1568-4946(19)30714-8
DOI: https://doi.org/10.1016/j.asoc.2019.105933
Reference: ASOC 105933

To appear in: Applied Soft Computing Journal

Received date : 29 April 2019

Revised date : 8 October 2019
Accepted date : 6 November 2019

Please cite this article as: Karthik R., Hariharan M., S. Anand et al., Attention embedded residual
CNN for disease detection in tomato leaves, Applied Soft Computing Journal (2019), doi:
https://doi.org/10.1016/j.asoc.2019.105933.

This is a PDF file of an article that has undergone enhancements after acceptance, such as the
addition of a cover page and metadata, and formatting for readability, but it is not yet the definitive
version of record. This version will undergo additional copyediting, typesetting and review before it
is published in its final form, but we are providing this version to give early visibility of the article.
Please note that, during the production process, errors may be discovered which could affect the
content, and all legal disclaimers that apply to the journal pertain.

© 2019 Elsevier B.V. All rights reserved.

Journal Pre-proof

Attention Embedded Residual CNN for Disease Detection in Tomato

Leaves

Karthik R1, Hariharan M2, Sundar Anand3, Priyanka Mathikshara4, Annie Johnson5, Menaka R6
1,3-6
School of Electronics Engineering, Vellore Institute of Technology, Chennai.
2
School of Computing sciences and Engineering, Vellore Institute of Technology, Chennai

of
Abstract

Automation in plant disease detection and diagnosis is one of the challenging research areas that

pro
has gained significant attention in the agricultural sector. Traditional disease detection methods rely on
extracting handcrafted features from the acquired images to identify the type of infection. Also, the
performance of these works solely depends on the nature of the handcrafted features selected. This can
be addressed by learning the features automatically with the help of Convolutional Neural Networks
(CNN). This research presents two different deep architectures for detecting the type of infection in
tomato leaves. The first architecture applies residual learning to learn significant features for
classification. The second architecture applies attention mechanism on top of the residual deep network.
re-
Experiments were conducted using Plant Village Dataset comprising of three diseases namely early
blight, late blight, and leaf mold. The proposed work exploited the features learned by the CNN at various
processing hierarchy using the attention mechanism and achieved an overall accuracy of 98% on the
validation sets in the 5-fold cross-validation.
lP

Keywords: Attention, CNN, Residual Connections, Tomato, Deep Learning.

1. Introduction

Tomato holds an inevitable place in the economy of Indian agriculture. India stands third in the
production of tomatoes with a yield of 53,00,000 tons and it is harvested around 3,50,000 hectares of
a

land. The harvest index of tomato in India is comparatively less than in other countries. One of the major
reasons for the reduction in yield is due to diseases that occur frequently on the leaves of the plant.
Tomato crops are highly affected by diseases like bacterial spot, early blight, late blight, and leaf mold.
urn

The blight is the most prevalent disease among others.

The tomato crop is highly susceptible to a wide range of disease at each stage of its growth. This
is due to different factors based on climatic conditions and environmental parameters. By identifying
these diseases, tremendous loss in the yield can be alleviated. Also, the final agricultural product obtained
in terms of quality and quantity can be improved. It is relatively complex in real time to maintain a
manual record of all the symptoms and signs caused by the diseases. Also, monitoring of plants in a large
field requires extensive manual effort. Hence, different automation schemes for disease detection were
Jo

presented in the last two decades [1-5].

As a part of sustainable agriculture, various measures can be taken by leveraging the technology
towards automated inspection of diseases. Factors like pathogen development, modification of host
resistance and wider global transfer of diseases have led to the development of many solutions [6].
Precision farming was aimed at limiting the employment of expensive methods of farming which uses

1
Journal Pre-proof

harmful chemicals. In this type of farming, mobile robotics, remote sensor networks and drones are used
to advocate controlled and measured amounts of medicine to the infected areas of the plants.

The main challenge of precision farming is that, it had numerous challenges in data collection,
processing and make expert inferences. Therefore, precision farming incorporated image processing and
computer vision techniques to process the information in the cultivation field. Image processing was

of
quite successful in solving the problems of disease detection, weed detection, understanding the
symptoms of a disease and even more recently, grading the yield output. As machine learning algorithms
continued to advance, the accuracy in image processing techniques continued to grow consistently.
However, these algorithms demand handpicked features to detect diseases which made deep learning

pro
techniques pertinent. This research places an attempt towards application of deep learning architectures
for detection of diseases in tomato leaves.

2. Related Works

Several research works have been presented in the last two decades towards detection of disease
in different crops. Image processing techniques were applied to extract the features and given as input to
re-
machine learning algorithms for precise classification. In short, these approaches can be broadly
classified into (1) Machine learning methods (2) Deep learning methods.

2.1 Machine learning based methods

Akhtar et al. presented an automated approach for plant disease detection using Gray Level Co-
lP

occurrence Matrix (GLCM) and Wavelet-based features [7]. The features were trained with different
machine learning algorithms namely K- Nearest Neighbor (KNN), Naïve Bayes Classifier, Support
Vector Machine (SVM), Decision Tree and Recurrent Neural Networks. An automated approach for
tomato grading system was presented by Semary et al. [8]. This approach utilized color and texture
features and classified using SVM. Prasad et al. developed an automated approach for leaf disease
diagnosis using Gabor Wavelet Features (GWF) and GLCM features. These multi-resolution features
a

were trained using weighted KNN [9].

Ashourloo et al. presented a method to detect leaf disease using hyperspectral measurement [10].
urn

An approach to detect the severity of the disease in leaves was proposed in [11]. Statistical features in
the RGB and HSV color space were utilized for determining the severity level. H. Sabrol et al. presented
an approach for leaf disease detection in tomatoes by combining Otsu’s segmentation with decision trees
for classification. This approach considered color, shape and texture features for learning the
characteristics of the leaf diseases [12].
Padol et al. presented an approach to detect leaf diseases using color and texture features. The
infected region was initially segmented using K-means clustering. Then, features were extracted from
Jo

the required region of interest and trained using SVM for classification [13]. Another approach using K-
means algorithm was proposed for leaf disease detection and classification [14]. T. Mehra et al. employed
K-means clustering to identify the presence of fungal infections on leaves [15]. One of the major
challenge in applying the above clustering algorithms is the determination of precise number of clusters
and fixing of parameters to differentiate each cluster.

2
Journal Pre-proof

In the past few years, Scale invariant feature transforms were explored for many image processing
problems [16-18]. An approach using Scale Invariant Feature Transform (SIFT) for detection of leaf
disease was presented by Dandawate et al. [19]. In this work, SIFT features were trained using SVM for
detecting the presence of disease. SIFT based features were combined with Johnson SB distribution for
effective classification of diseases in tomatoes [20].
All the above methods for disease detection were based on hand engineered features extracted

of
from the leaf portion of the image. The accuracy of these works solely depends on the nature of the
handcrafted features selected. Also, it is to be noted that the performance of these works needs to be
validated against a wide range of datasets. These drawbacks can be addressed by using deep learning
techniques.

pro
2.2 Deep learning based methods

Unlike machine learning algorithms, deep learning algorithms can be applied directly over the
input data and does not require any handcrafted features. In today’s world, the computing power delivered
by High-Performance Computing (HPC) and Graphics Processing Unit (GPU) allows for efficient
training of deep models while simultaneously implementing parallelism in computing. A number of deep
re-
learning models have been proposed in order to train leaf images to perform disease detection.
Most of the researches were based on applying existing deep learning architectures like VGG16,
AlexNet, ResNet, GoogleNet etc. for detection of infection in tomato leaves. Jia Shijie et al. presented
an approach to detect diseases in tomato leaves using VGG16 architecture [21]. Suryawati et al. presented
another deep CNN using VGG16 architecture to detect infection in tomato leaves [22]. Aravind et al.
lP

compared the performance of VGG16 with AlexNet architecture for disease detection in tomato leaves.
It was inferred that the model trained with AlexNet architecture was accurate than VGG16 by a small
margin [23]. Jayme Garcia et al. utilized a pre-trained GoogleNet CNN architecture for disease detection
in leaves [24]. Zhang et al. presented a transfer learning approach using AlexNet, GoogleNet, and
ResNet architectures for disease detection in tomato leaves [25]. Liang et al. presented an approach
involving the use of Resnet50, Wideresnet50, DPN92 neural networks for classification of plant diseases
a

[26]. A deep architecture based on LeNet was proposed to detect the type of disease in tomato leaves in
[27].
Q. H. Cap et al. used two super resolution (SR) models which are based on super resolution
urn

convolutional neural networks (SRCNN) and enhanced super resolution generative adversarial networks
(ESRGAN). The SRCNN model is used to identify prominent disease features, whereas, the ESRGAN
model focuses on high frequency details to obtain a more accurate prediction [28]. Another deep learning
architecture named ‘PD2SENET’ was proposed to detect and indicate the severity of the disease [29]. In
this architecture, the shallow layers considered raw pixel values of plant images as input and the
progressive feature maps are generated with the help of residual learning. Srdjan Sladojevic et al.
presented a CaffeNet based architecture for detection of leaf diseases. This architecture had eight layers
Jo

for learning the characteristics of the disease patterns and utilized around 30K samples for training the
model [30]. Alvaro Fuentes et al. presented deep learning meta architectures for disease detection by
combining Faster Region-based Convolutional Neural Network (Faster R-CNN), Region-based Fully
Convolutional Network (R-FCN), and Single Shot Multibox Detector (SSD) with ResNet and VGG

3
Journal Pre-proof

architectures. It was inferred that, R-FCN with ResNet combination outperformed the other two methods
[31].

In addition to the application of existing CNN architectures, several custom architectures were
proposed for disease detection in tomato leaves. Ferdouse et al. presented one such CNN to identify
diseases in tomato leaves [32]. This architecture consists of 15 layers to extract a wide range of features

of
for classification. Ruedeeniraman et al. presented a VegeCare tool that made use of Deep Neural Network
(DNN) to classify six tomato diseases [33]. Fuentes et. al. presented another deep architecture to identify
diseases in tomatoes [34]. Melike Sardogan et al. presented a CNN model to identify the type of disease
in tomato plant [35]. This method considered only 400 images for training, which is relatively less for a

pro
deep learning model. Pardede et al. presented an unsupervised convolutional auto-encoder for automatic
detection of plant diseases [36].
In contrast to the above standalone deep learning applications, few CNN models were also
presented as mobile applications focusing on disease detection in tomato leaves. A. Elhassouny et al.
presented a MobileNet CNN model that involves depth-wise separable convolution operations to address
computational burden of the traditional CNN for real time applications [37]. Another mobile application
was developed by H. Durmus et al. which used SqueezeNet for classification of tomato leaf diseases
re-
[38].

2.3 Research gaps and Motivation

Though several approaches were presented for detection of diseases in tomato leaves, there
lP

exist some significant challenges in it.

.
1. It is quite complex to identify and extract significant features in tomato leaves to
differentiate the properties of different diseases using traditional image processing
techniques. As the characteristics of these diseases exhibit huge variation, the properties
of the disease patterns have to be studied exhaustively with a wide range of datasets in an
a

automated way.
2. The performance of the machine learning based models solely depends on the nature of
the manually selected handcrafted features. Hence, feature extraction has to be made
urn

automatic to select and learn an optimal set of features for classification purpose.
3. Most of the deep learning models give equal weightage to all features derived across
different levels. But to make the model more sensitive for classification, feature weighting
has to be done at each stage. By doing so, significant features can be learnt and passed to
deeper levels of the network for precise classification.
4. Some of the deep learning models utilize generic and proven architectures like VGG16,
GoogleNet etc. Hence, it utilizes millions of parameters for classification. For real-time
Jo

deployment of such models, a trade-off has to be achieved between the computational

burden and accuracy.
5. Also, the deep learning network has to be trained with a large collection of samples to
ensure better generalization of features.

4
Journal Pre-proof

2.4 Research Contributions

To address the above research gaps, the proposed research employed two different deep
architectures for disease detection in tomato leaves. The following are the major contributions of the
proposed work.
1. Two different deep learning architectures were proposed in this research. The first
architecture employed residual learning to learn a hierarchy of features for better

of
classification. The second architecture employed the attention mechanism to specifically
learn distinctive feature maps and improve the performance of the residual CNN.
2. To the best of our knowledge, this is the first attempt to develop attention based residual

pro
deep network for disease detection in tomato leaves. Attention mechanism is employed to
learn and weight significant features across different levels. Hence, the significant features
were given more weightage with the help of attention coefficients learnt and passed to
deeper levels for precise classification.
3. The proposed architecture was trained with a large collection of samples. 95999 images
were used for training the model and 24001 images were used for validation purpose.
re-
3. Proposed Work

This research proposes a novel CNN framework that specializes in the task of infestation
detection in the tomato plant. The objective of this work is to design a computationally inexpensive and
accurate learning model for disease detection. Two different deep architectures were proposed in this
work, to detect disease infestation in tomato leaf. The first architecture integrates residual learning on
lP

top of a feed-forward CNN. The second architecture integrates the strengths of Attention mechanism and
Residual Learning on CNN.

3.1 Residual Learning based CNN

The learning pattern of a CNN is generally based on aggregation of feature maps derived at
a

multiple levels. As a consequence of this aggregation occurring in the deep layers of the CNN, it tends
to lose the significance of the fine granular details learnt by the initial layers. The traditional CNN based
urn

methods for tomato leaf infestation detection focusses on learning the features in an orderly fashion
starting from basic image level features like edges and move towards complex texture based differences.
By doing so, few significant details are not passed to the deeper layers of the network. Hence in this
method, residual connections are employed to pass those significant features extracted in the initial layers
to the deeper layers of the network. This supports effective aggregation of feature maps for precise
classification.
The architecture of the proposed residual connection based on CNN is presented in Fig. 1. It
Jo

consists of a sequence of three Residual Progressive Feature Extraction (RPFE) blocks, each set to learn
progressive features. The number of channels increases from 32 to 128 along the depth of the network.
The first RPFE block has a convolution receptive area of size 7x7, trailed by a 5x5 kernel for the second
block and finally a 3x3 filter for the third block. Then, it applies the average pooling over the feature
map. This enables the classifier to model a reduced set of features without much loss of context and also

5
Journal Pre-proof

avoids the risk of overfitting. The entire model is followed by a sequence of 1x1 convolutional layers
after the last RPFE block.

of
pro
Fig. 1. The architecture of the Proposed Residual CNN

3.1.1 Residual Progressive Feature Extraction (RPFE) Block

re-
The Residual Progressive Feature Extraction (RPFE) block consists of a 2D convolutional layer,
a max pooling layer, and a batch norm layer. In the ‘Conv’ layer, a set of variable filters distinctly
convolve across the face of the feature map (padded to the same size), one for each channel. The filter
sizes for the ‘Conv’ layer are receptive to a smaller region of interest along the line of the blocks (7x7
lP

for the first block, 5x5 for the second, 3x3 for the third and so on). This is followed by a Rectified Linear
Unit (ReLU) activation layer that rectifies the convolved image, zeroing out negative values. ReLU was
used because it sustains a steady gradient even for larger activations, thus stabilizing the learning.

The output from the convolutional layers, ‘x’ in the first and second RPFE blocks is directed in two
functional paths, F(x) and G(x). F(x) denotes the set of operations (max pooling and batch normalization)
a

that were applied to take ‘x’, in a simple feed-forward manner to the next block. G(x) indicates the set of
operations that skip ‘x’ to the next block using convolutional and max-pooling layers. Finally, the
response from the RPFE blocks, Y(x) are generated by summing the individual responses of F(x) and
urn

G(x), as given by Eq. 1.

Y(x) = F(x) + G(x) (1)

Fig. 2 presents the visual representation of the skip functions, generically to an RPFE block in the
described Residual CNN.
Jo

6
Journal Pre-proof

of
pro
Fig. 2. A representation of the functions F(x) and G(x) that are skipped through the RPFE block
re-
to generate the output Y(x) from the residual block.

The proposed network was designed end-to-end only with 2D convolution, pooling, batch norm layers,
with no dense layers. The observed bottleneck in the case of the last layers being fully connected is that
the model fails to exploit the run entirely in the GPU. With full convolution, the system can now generate
a spatial map whose correspondences can be tracked to different parts of the input image. This essentially
lP

translates to sliding a classifier over the input image, making predictions at each window, regardless of
the input size. This approach towards identifying the infection makes it possible to,

(a) share parameters (significantly at the initial few layers)

(b) exploit spatial locality (when used with a stride less than the filter size)
a

filters down to the deep layers) all in one-shot.

Convolutional Layer

The convolutional layer defines a set of filters that perform the convolution operation over the entire
image. involve a series of convolution operation among an input volume ‘I’ and a set of ‘n’ convolutional
filters ‘FE’ followed by a non-linear activation. This finally yields an output volume ‘O’ as presented in
Jo

Eq. 2.

𝑂 𝑖, 𝑗 𝑎 ∑ ∑ ∑ 𝐹 𝑢, 𝑣 𝐼 𝑖 𝑢, 𝑗 𝑣 𝑏 (2)

where,

7
Journal Pre-proof

‘2k+1’ is the side of a square with odd convolutional filter

‘a’ refers to the activation function
‘bm’ refers to the bias for the mth feature map

The activation maps produced with the help of above relation are the encoding of the input ‘I’ in a low
dimensional space i.e. it refers to the parameters used to build every feature map ‘Om’. After ‘Om’ is

of
calculated, it is subjected to a max-pooling operation to down-sample it. Intuitively, each convolutional
layer in this architecture learns the various attributes that capture discriminatory patterns to differentiate
the type of infection in the tomato leaf.

pro
ReLU Activation

The Rectified Linear Unit (ReLU) is an activation function adopted in the design of most neural
networks, particularly CNN's. It is the identity function, f(x) = x, for all positive values and zeros out for
negative values of input ‘x’. ReLU is sparsely activated, which helps to mimic the inactivity of the
biological neuron to certain impulses.
re-
Max Pooling Layer

This pooling layer maximally activates only a bunch of neurons from the feature map. It is used with a
stride factor of ‘2’ on a ‘2-by-2’ window, across all the RPFE blocks. This effectively reduces the width
and height of the feature maps while preserving the number of channels.
lP

Batch Normalization Layer

In Deep Neural Networks each layer sees different feature information from the previous layer after every
single gradient update on a batch of data. And the data distribution of this input feature map largely
varies, as the parameter of the previous layers is updated during the training phase. This significantly
affects the training pace and also calls for various heuristics to decide upon the parameter initialization.
a

Batch Normalization is a popular trick used to curtail this problem of Internal Covariate Shift and the
outputs of the BN layer for a batch ‘x’ is given by Eq. 3.
urn

𝑦 𝛽 𝜑 (3)
√

where ‘m’ and ‘s’ are respectively the mean and standard deviation of the batch ‘x’. ‘β’, ‘φ’ are trainable
parameters, that are updated at each iteration. ‘ε’ is set to a small constant, introduced to increase the
variance, as well as prevent the denominator from zeroing out. Batch Normalization overcomes the
vanishing/exploding gradient problem by normalizing the values to a range between -3 to 3, fitting a
Jo

maximum likelihood estimate (along with the line of channel activations, across a batch) for normal
distribution.

The details of the tensor at each layer of this architecture are tabulated in Table 1.

8
Journal Pre-proof

Table 1: A tabulation of the connections between the layers and the dimensions of the output
tensors at each layer, for the entire Residual CNN.
No. of Connected to the previous
Layer (type) Output Shape
Parameters layer
input_1 (InputLayer) (None, 256, 256, 3) 0
conv2d_1 (Conv2D) (None, 256, 256, 32) 4736 input_1 (0,0)

of
conv2d_2 (Conv2D) (None, 128, 128, 32) 9248 conv2d_1(0,0)
max_pooling2d_1 (MaxPooling2D) (None, 128, 128, 32) 0 conv2d_1(0,0)
max_pooling2d_2 (MaxPooling2D) (None, 128, 128, 32) 0 conv2d_39(0,0)
batch_normalization_1 (BatchNorm) (None, 128, 128, 32) 128 max_pooling2d_1(0,0)

pro
add_1 (Add) max_pooling2d_2(0,0),
(None, 128, 128, 32) 0 batch_normalization_1(0,0)

conv2d_3 (Conv2D) (None, 64, 64, 64) 51264 add_1(0,0)

conv2d_4 (Conv2D) (None, 64, 64, 64) 36928 conv2d_1(0,0)
conv2d_5 (Conv2D) (None, 64, 64, 64) 18496 max_pooling2d_2(0,0)
max_pooling2d_3 (MaxPooling2D) (None, 32, 32, 64) 0 conv2d_3(0,0)
re-
max_pooling2d_4 (MaxPooling2D) (None, 32, 32, 64) 0 conv2d_4(0,0)
max_pooling2d_5 (MaxPooling2D) (None, 32, 32, 64) 0 conv2d_5(0,0)
batch_normalization_2 (BatchNorm) (None, 32, 32, 64) 256 max_pooling2d_3(0,0)
add_2 (Add) max_pooling2d_4(0,0),
(None, 32, 32, 64) 0 max_pooling2d_5(0,0),
batch_normalization_2(0,0)
lP

conv2d_6 (Conv2D) (None, 32, 32, 128) 204928 add_2(0,0)

max_pooling2d_6 (MaxPooling2D) (None, 16, 16, 128) 0 conv2d_6(0,0)
batch_normalization_3 (BatchNorm) (None, 16, 16, 128) 512 max_pooling2d_6(0,0)
average_pooling2d_1(AveragePooling2D) (None, 8, 8, 128) 0 batch_normalization_3(0,0)
conv2d_7 (Conv2D) (None, 1, 1, 64) 401472 average_pooling2d_1(0,0)
lambda_1 (Lambda) (None, 1, 1, 64) 0 conv2d_7(0,0)
a

conv2d_8 (Conv2D) (None, 1, 1, 4) 260 lambda_1(0,0)

reshape_1 (Reshape) (None, 4) 0 conv2d_8(0,0)
urn

3.2 Attention-based Residual CNN

The attention model works on top of the RPFE CNN by retaining the context relevant features.
The previous RPFE based model combines the features extracted in each block with the features derived
from its preceding layer. In this way, equal importance is given to all features collected from the earlier
RPFE blocks. For precise feature learning, significant features from the previous blocks need to be
Jo

weighted high relative to other features. Hence, an attention mechanism was introduced on top of the
RPFE architecture to learn and select prominent features from the previous RPFE blocks. This model
learns an attention mask that weighs the relative importance of spatial features at that feature map. This
way it learns attention coefficients for each pixel in the feature map to understand the properties of the

9
Journal Pre-proof

infestation in an effective manner. The architecture of the proposed attention based on CNN, built on top
of the described residual architecture in section 2.1. is presented in Fig. 3.

of
pro
Fig. 3. An overview of the architecture employed to integrate attention within the residual net
framework.
re-
3.2.1 Attention embedded Residual Progressive Feature Extraction (ARPFE) block

This architecture uses the attention mechanism across blocks to learn a weighted function for
modeling the activations from the preceding blocks. The skip connections from the previous blocks are
now weighted across the depth axis for each pixel in the spatial expanse of that layer.
lP

The output from the convolutional layers, ‘x’ in the first and second ARPFE blocks is directed in
two functional paths, F(x) and G’(x). F(x) denotes the set of operations (max pooling and batch
normalization) that were applied to take ‘x’, in a simple feed-forward manner to the next block. G’(x)
indicates the attention-aided weighted set of operations that skip ‘x’ through convolutional and max-
pooling layers. As discussed in 3.1.1 the weighted summation is used to generate the output Y(x) from
a

an ARPFE block, given by Eq. 3.

Y(x) = F(x) + G’(x) (3)

urn

The functional path G’(x) is computed as

G’(x) = G(x) * α (4)

where ‘α’ is the attention weight matrix whose dimensions are the same as the spatial dimensions of G(x).
The attention weight matrix ‘α’ is point-wise multiplied (broadcasted along the depth) across the
Jo

corresponding cross-section of G(x). So, ‘α’ is weighted function in G(x) and G’(x) is derived from ‘α’
as given by Eq. 4. This process is illustrated in Fig. 4.

10
Journal Pre-proof

The method for learning these weights is shown in Fig. 5 The residual feature map G(x) is passed
through a dense layer (with ReLU activation) that learns a parameter ‘𝛼 ’ for each pixel cross-section
volume G(x)(i,j) .

The dense matrix is flattened out, to form a feature vector. The activation values from the feature
vector are passed through a Softmax layer. The weights ‘𝛼 ’ are now computed calculated as a Softmax

of
probability distribution, such that summation of ∑ 𝛼 =1.

pro
re-
lP

Fig. 4. A visual representation for generating the output Y(x) from an ARPFE block using
attention based weights.
a
urn
Jo

11
Journal Pre-proof

Fig. 5 A generic scheme for learning the attention function ɑij . (a): Feature map G(x) (b): A ReLU-
activated dense layer matrix with one unit for each cross-section (i,j) of G(x)i,j . (c): A softmax
activation layer following the dense layer from (b) to learn a probability distribution of weights for
each 𝛼 . The weighted sum of ɑ s over G(x) yields G’(x) (d): Output G’(x) is computed as
∑ 𝛼 ∗ 𝐺 𝑥

Softmax Classifier

of
The proposed system uses a k-way softmax classifier to make classify the image to one among k
categories. This loss is given by Eq. 5.

pro
𝐶𝐸 ∑ 𝑡 𝑙𝑜𝑔 𝑓 𝑠 (5)

where the f(s)i is the output conditional probability P( y = ŷi| si ) for some training example ‘si’, predicted
value ŷi . This probability function for softmax activation is given in Eq. 6.

𝑓 𝑠 ∑
(6)
re-
The details of the tensor at each layer of this architecture are tabulated in Table 2.

Table 2: A tabulation of the connections between the layers and the dimensions of the output
tensors at each layer, for the entire Residual CNN.
lP

No. of Connected to the previous

Layer (type) Output Shape
Parameters layer
input_1 (InputLayer) (None, 256, 256, 3) 0
conv2d_1 (Conv2D) (None, 256, 256, 32) 4736 input_1(0,0)
conv2d_2 (Conv2D) (None, 128, 128, 32) 9248 conv2d_1(0,0)
max_pooling2d_1 (MaxPooling2D) (None, 128, 128, 32) 0 conv2d_2(0,0)
a

conv2d_3 (Conv2D) (None, 64, 64, 64) 18496 max_pooling2d_1(0,0)

max_pooling2d_2 (MaxPooling2D) (None, 32, 32, 64) 0 conv2d_3(0,0)
dense_1 (Dense) multiple 65 max_pooling2d_2(0,0),
urn

max_pooling2d_4(0,0)

dense_2 (Dense) (None, 128, 128, 1) 33 max_pooling2d_1(0,0)

attention_weights (Activation) multiple 0 dense_2(0,0),
dense_1(0,0),
dense_1(0,0)

max_pooling2d_3 (MaxPooling2D) (None, 128, 128, 32) 0 conv2d_1(0,0)

multiply_1 (Multiply) multiple 0 attention_weights(0,0),

max_pooling2d_1(0,0),
attention_weights(0,0),
max_pooling2d_2(0,0),
attention_weights(0,0),
max_pooling2d_4(0,0)

12
Journal Pre-proof

batch_normalization_1 (BatchNorm) (None, 128, 128, 32) 128 max_pooling2d_3(0,0)

add_1 (Add) (None, 128, 128, 32) 0 multiply_1(0,0),
batch_normalization_1(0,0)

conv2d_4 (Conv2D) (None, 64, 64, 64) 51264 add_1(0,0)

conv2d_5 (Conv2D) (None, 64, 64, 64) 36928 conv2d_4(0,0)
max_pooling2d_4 (MaxPooling2D) (None, 32, 32, 64) 0 conv2d_5(0,0)

of
max_pooling2d_5 (MaxPooling2D) (None, 32, 32, 64) 0 conv2d_4(0,0)
batch_normalization_2 (BatchNorm) (None, 32, 32, 64) 256 max_pooling2d_5(0,0)
add_2 (Add) (None, 32, 32, 64) 0 multiply_1(0,0),
multiply_1(0,0),

pro
batch_normalization_2(0,0)

conv2d_6 (Conv2D) (None, 32, 32, 128) 204928 add_2(0,0)

max_pooling2d_6 (MaxPooling2D) (None, 16, 16, 128) 0 conv2d_6(0,0)
batch_normalization_3 (BatchNorm) (None, 16, 16, 128) 512 max_pooling2d_6(0,0)
average_pooling2d_1 (AveragePooling) (None, 8, 8, 128) 0 batch_normalization_3(0,0)
conv2d_7 (Conv2D) (None, 1, 1, 64) 401472 average_pooling2d_1(0,0)
re-
lambda_1 (Lambda) (None, 1, 1, 64) 0 conv2d_7(0,0)
conv2d_8 (Conv2D) (None, 1, 1, 4) 260 lambda_1(0,0)
reshape_1 (Reshape) (None, 4) 0 conv2d_8(0,0)

4. Results and Discussion

The proposed system was trained with the augmented collection of the benchmarked Plant Village
Dataset. The source code was written in Tensorflow Deep Learning programming framework and
compiled to run on the NVIDIA Tesla P100 GPU. The model was evaluated on a 5-fold cross validation
set (of 120K samples) with each fold stratified into roughly equal numbers for each class (by random
sampling with replacement). The loss function was minimized using the Adaptive Moment Estimation
a

(Adam) optimizer. This optimization algorithm uses the running average of both the gradient and the
second moment.
urn

Three different experiments were conducted. The first experiment applied a baseline model for
disease detection and classification in tomato. The second experiment used the residual connections
across the Progressive Feature Extraction blocks. The third experiment integrated both attention and
residual connections in CNN.

4.1 Dataset
Jo

The proposed model for disease detection in Tomato was developed using the Plant Village
Disease Classification Challenge dataset and further data augmentation techniques were applied to
increase the size of the dataset. Table 3 presents the distribution of augmented samples for each fold in
cross-validation process. The dataset used in our experiment includes one healthy class and 3 diseased

13
Journal Pre-proof

classes. Table 4 shows the samples for each disease class and the effects of the data augmentation
techniques on them.

Data augmentation techniques have been applied for increasing the data set, thereby reducing the
overfitting. Central zoom was performed to produce a data set of images that have only the leaf and not
the background information, random crop & zoom was performed to focus on specific parts of the leaf

of
and various contrast levels were used to make the dataset robust to various lighting conditions. The
stratified 5-fold cross-validation used to evaluate the proposed model and this ensured balance between
the classes for each of the 5 folds due to random sampling.

pro
Table 3. Details of distribution of samples in cross-validation process

Healthy Early Blight Late Blight Leaf Mold

Fold
Training Validation Training Validation Training Validation Training Validation
1 50402 12601 20590 5148 17600 4400 7407 1852
2 50402 12601 20590 5148 17600 4400 7407 1852
3 50402 12601 20590 5148 17600 4400 7407 1852
re-
4 50403 12600 20591 5147 17600 4400 7407 1852
5 50403 12600 20591 5147 17600 4400 7408 1851

Table 4. Sample results of data augmentation process.

Random Zoom
Original Image Contrast Central Zoom
Category and Crop
a

Healthy
urn

Early Blight
Jo

14
Journal Pre-proof

Late Blight

of
pro
Leaf Mould

re-
4.2 Experiment 1: Application of the baseline model

The baseline model was a simple feed-forward CNN with no cross-connections or any learning
aided mechanisms. It took around 1.5 days for training the model on the GPU. This resulted in an
lP

accuracy of 84%.

4.3 Experiment 2: Application of Residual CNN

Building on experiment 1, the residual model includes skip connections from one block to the
other. The skip connections take the feature map from the ReLU activated convolutional layer in RPFE
block b, onto the convolutional layer in the RPFE block b+1, as described in section 2.1. The dimensions
a

at the both ends of the skip connection are matched by filtering the admitted feature maps with
convolutional layers and trimming with max pooling.
urn

It took around 10 hours for training the model and it took approx. 150 epochs to reach
convergence. The proposed residual based network is subjected to 5 – fold cross-validation process and
the resultant observations are presented in Table 5.

Table 5: Observation of the proposed Residual CNN

Folds Accuracy Loss

15
Journal Pre-proof

Fold 1

of
Fold 2
pro
re-
lP

Fold 3
a
urn
Jo

16
Journal Pre-proof

Fold 4

of
pro
re-
Fold 5
lP

It could be observed that the Residual CNN was able to detect the type of disease in tomatoes
with an accuracy of 90-95%. Also, the loss of the network decays appreciably during the training phase,
leading to precise classification.

4.4 Experiment 3: Application of Residual CNN with attention

Building on the second experiment, the attention based residual model adds on a weighing scheme
urn

to the output feature map G(x) from the skip connections. The weighing scheme computes an attention
matrix ′𝛼 ′ is point-wise multiplied (broadcasted along the depth) across the corresponding cross-section
of G(x)ij as described in section 2.2. These attention weights ′𝛼 ′ are learnt dynamically upon seeing new
training batches.

It took around 10 hours for training the model and it took approx. 150 epochs to reach
convergence. The proposed residual based network is subjected to 5-fold cross-validation process and
Jo

the resultant observations are presented in Table 6.

17
Journal Pre-proof

Table 6: Observation of the proposed Attention based Residual CNN

Folds Accuracy Loss

of
Fold 1

pro
re-
Fold 2
a lP
urn

Fold 3
Jo

18
Journal Pre-proof

Fold 4

of
pro
re-
Fold 5

.
lP

It was evident that the proposed attention based residual CNN was able to converge better than residual
CNN.

4.5 Performance Analysis

In this research, three different deep architectures were analyzed to detect the performance of
disease detection. The first method applied a baseline model for disease detection in tomato leaves. The
second approach was based on the residual connections across the Progressive Feature Extraction blocks.
urn

The third approach integrated both attention mechanism and residual connections in CNN. The
observations of these experiments are tabulated in Table 7. It could be observed that the proposed
attention based residual CNN performed better in detecting the type of infection with an accuracy of
98%.

Table 7. Summary of the proposed experiments

S. No Method Accuracy
(in %)
1 Baseline CNN model 84
2 Residual CNN model 95

19
Journal Pre-proof

3 Attention embedded Residual 98

CNN model

The performance of the proposed attention based residual CNN is compared against the existing
methods reported in the literature and the resultant observations sorted according to accuracy obtained
are highlighted in Table 8.

of
Table 8. Performance comparison of proposed work with other existing works.

Size of Accuracy
S. No Source Type of features Method dataset (in %)

pro
1 Ferdouse et al. [32] Automatic CNN 3000 76
2 Chit Su Hlaing et al. [20] Hand-Crafted features Quadratic SVM 3535 83.5
3 Melike Sardogan et al. [35] Automatic CNN with LVQ 500 86
4 P. B. Padol et al [13] Hand-Crafted features SVM classifier 137 88.89
5 J. Shijie et al. [21] Automatic VGG16 based CNN 7040 89
6 Azeddine Elhassouny et al. [37] Automatic CNN 7176 90.3
re-
7 Semary et al. [8] Hand-Crafted features SVM 708 92
8 P. Tm et al. [27] Automatic LeNet based CNN 54306 95
9 Suryawati et al. [22] Automatic VGGNet based CNN 18160 95.24
10 Jayme Garcia et al. [24] Automatic GoogleNet based CNN 40409 96
11 Sladojevic et al [30] Automatic CNN 30880 96.3
lP

12 Halil Durmus et al. [38] Automatic SqueezeNet 54309 97.22

13 Keke Zhang et al. [25] Automatic ResNet 5550 97.28
14 Sabrol et al. [12] Hand-Crafted features Decision Tree 383 97.3
Attention based
15 Proposed approach Automatic Residual CNN 95999 98
a

It could be observed that, an accuracy of 83 to 97% was obtained for machine learning methods that
employed hand crafted features for disease detection [8,12,13,20]. Also, the model was trained with less than
urn

4k samples, which is very less to generalize all feature patterns. Recent deep learning researches employed
well trained architectures like VGG16, ResNet, GoogleNet etc. for disease detection in tomato leaves
[21,22,24,25,27]. In addition to these works, certain deep-layered CNN architectures were also proposed for
infestation detection in tomato leaves [30,34,35,37]. Though these works yield appreciable results, the
accuracy of these works were in the range of 76 to 97%. As the proposed model employed attention mechanism
to learn and weight significant features, it was able to achieve an accuracy of 98%, which is a significant
improvement when compared to other works.
Jo

5. Conclusion

This research presents an efficient mechanism to detect the type of infestation in tomato leaves.
To the best of our knowledge, this is the first attempt to employ the attention gating mechanism in

20
Journal Pre-proof

residual CNN for disease detection in tomatoes. The main contribution of this work is the integration of
attention mechanism on top of the Residual network for effective feature learning. It helps to selectively
weigh the features different layers at the inception of a single layer. Hence, the receptive field at a layer
is extended to look at feature maps from different levels of the processing hierarchy. The current layer
can now process its input with more contextual information. Learning at the layers preceding the current
layer is now aided by the perception of the features at the current layer. This is due to back propagation

of
of the tensors along the skip connections.
The proposed network learnt around 600K parameters to detect the type of infection, which is
comparatively less than the existing deep learning approaches reported in the literature. Experimental
results indicate that the proposed attention based residual network was able to detect the type of infection

pro
with an accuracy of 98%. It could also be noted that the ARPFE blocks establish the extensibility of the
design of the proposed system to any input size.

References

1. R. Anand, S. Veni and J. Aravinth, "An application of image processing techniques for detection of
diseases on brinjal leaves using k-means clustering method," 2016 International Conference on Recent
re-
Trends in Information Technology (ICRTIT), pp. 1-6, 2016.

2. K. Thangadurai and K. Padmavathi, "Computer Vision image Enhancement for Plant Leaves Disease
Detection," 2014 World Congress on Computing and Communication Technologies, pp. 173-175, 2014.

3. C. Mattihalli, E. Gedefaye, F. Endalamaw and A. Necho, "Real Time Automation of Agriculture Land, by
lP

automatically Detecting Plant Leaf Diseases and Auto Medicine," 2018 32nd International Conference on
Advanced Information Networking and Applications Workshops (WAINA), pp. 325-330, 2018.

4. Y. Liu, S. Zhou and J. Sun, "Detection of Ginseng leaf cicatrices base on K-means clustering
algorithm," 2017 10th International Congress on Image and Signal Processing, BioMedical Engineering
and Informatics (CISP-BMEI), pp. 1-5, 2017.
a

5. V. Singh, Varsha and A. K. Misra, "Detection of unhealthy region of plant leaves using image processing
and genetic algorithm," 2015 International Conference on Advances in Computer Engineering and
Applications, pp. 1028-1032, 2015.
urn

6. P. Lottes, J. Behley, A. Milioto and C. Stachniss, "Fully Convolutional Networks With Sequential
Information for Robust Crop and Weed Detection in Precision Farming," in IEEE Robotics and
Automation Letters, vol. 3, no. 4, pp. 2870-2877, Oct. 2018.

7. Akhtar, A., A. Khanum, S. A. Khan, and A. Shaukat. Automated Plant Disease Analysis (APDA):
Performance comparison of machine learning techniques. Proceedings of the 11th International
Conference on Frontiers of Information Technology, 60–65, 2013.
Jo

8. Semary, N. A., Tharwat, A., Elhariri, E., & Hassanien, A. E. (2015). Fruit-Based Tomato Grading System
Using Features Fusion and Support Vector Machine. Intelligent Systems’, 401–410, 2014.

9. Prasad, S., Peddoju, S. K., & Ghosh, D. (2015). Multi-resolution mobile vision system for plant leaf
disease diagnosis. Signal, Image and Video Processing, 10(2), 379–388.

21
Journal Pre-proof

10. D. Ashourloo, H. Aghighi, A. A. Matkan, M. R. Mobasheri and A. M. Rad, "An Investigation Into
Machine Learning Regression Techniques for the Leaf Rust Disease Detection Using Hyperspectral
Measurement," in IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing,
vol. 9, no. 9, pp. 4344-4351, 2016.

11. Parikh, M. S. Raval, C. Parmar and S. Chaudhary, "Disease Detection and Severity Estimation in Cotton

of
Plant from Unconstrained Images," 2016 IEEE International Conference on Data Science and Advanced
Analytics (DSAA), pp. 594-601, 2016.

12. H. Sabrol and K. Satish, "Tomato plant disease classification in digital images using classification
tree," 2016 International Conference on Communication and Signal Processing (ICCSP), 2016, pp. 1242-

pro
1246.

13. P. B. Padol and A. A. Yadav, "SVM classifier based grape leaf disease detection," Proceedings of the
Conference on Advances in Signal Processing (CASP), 2016, pp. 175-179.

14. S. Kaur, S. Pandey and S. Goel, "Semi-automatic leaf disease detection and classification system for
soybean culture," in IET Image Processing, vol. 12, no. 6, pp. 1038-1048, 2018.
re-
15. T. Mehra, V. Kumar and P. Gupta, "Maturity and disease detection in tomato using computer vision," 2016
Fourth International Conference on Parallel, Distributed and Grid Computing (PDGC), Waknaghat,
2016, pp. 399-403.

16. Annis Fathima, R. Karthik, V. Vaidehi, Image stitching with combined moment invariants and SIFT
features, Elsevier Procedia Computer Science, Vol. 19, pp. 420 – 427, 2013.
lP

17. R. Karthik, Annis Fathima, V. Vaidehi, Panoramic view creation using Invariant moments and SURF
features, Third IEEE International Conference on Recent trends in Information technology ICRTIT,
2013.

18. Menaka, R. and Karthik, R. ‘A novel feature extraction scheme for visualisation of 3D anatomical
a

structures’, Int. J. Biomedical Engineering and Technology, Vol. 21, No. 1, pp.49–66, 2016.

19. Dandawate, Y., and R. Kokare. An automated approach for classification of plant diseases towards
urn

development of futuristic decision support system in Indian perspective. Proceedings of the International
Conference on Advances in Computing, Communications and Informatics (ICACCI), 794–99, 2015.

20. C. S. Hlaing and S. M. Maung Zaw, "Tomato Plant Diseases Classification Using Statistical Texture
Feature and Color Feature," 2018 IEEE/ACIS 17th International Conference on Computer and
Information Science (ICIS), Singapore, 2018, pp. 439-444.

21. J. Shijie, J. Peiyi, H. Siping and s. Haibo, "Automatic detection of tomato diseases and pests based on leaf
Jo

images," 2017 Chinese Automation Congress (CAC), pp. 2537-2510, 2017.

22. E. Suryawati, R. Sustika, R. S. Yuwana, A. Subekti and H. F. Pardede, "Deep Structured Convolutional
Neural Network for Tomato Diseases Detection," 2018 International Conference on Advanced Computer
Science and Information Systems (ICACSIS), pp. 385-390, 2018.

22
Journal Pre-proof

23. Aravind Krishnaswamy Rangarajan, Raja Purushothaman, Aniirudh Ramesh, Tomato crop disease
classification using pre-trained deep learning algorithm, Procedia Computer Science,Volume 133,
2018,Pages 1040-1047.

24. Jayme Garcia Arnal Barbedo, Plant disease identification from individual lesions and spots using deep
learning,Biosystems Engineering,Volume 180,2019,Pages 96-107.

of
25. Keke Zhang, Qiufeng Wu, Anwang Liu, and Xiangyan Meng, “Can Deep Learning Identify Tomato Leaf
Disease?,” Advances in Multimedia, vol. 2018, Article ID 6710865, 10 pages, 2018.

26. Liang S., Zhang W. (2020) Accurate Image Recognition of Plant Diseases Based on Multiple Classifiers
Integration. In: Jia Y., Du J., Zhang W. (eds) Proceedings of 2019 Chinese Intelligent Systems Conference.

pro
CISC 2019. Lecture Notes in Electrical Engineering, vol 594. Springer.

27. P. Tm, A. Pranathi, K. SaiAshritha, N. B. Chittaragi and S. G. Koolagudi, "Tomato Leaf Disease Detection
Using Convolutional Neural Networks," 2018 Eleventh International Conference on Contemporary
Computing (IC3), pp. 1-5, 2018.

28. Q. H. Cap, H. Tani, H. Uga, S. Kagiwada and H. Iyatomi, "Super-Resolution for Practical Automated
re-
Plant Disease Diagnosis System," 2019 53rd Annual Conference on Information Sciences and Systems
(CISS), Baltimore, MD, USA, 2019, pp. 1-6.

29. Qiaokang Liang, Shao Xiang, Yucheng Hu, Gianmarc Coppola, Dan Zhang, Wei Sun, PD2SE-Net:
Computer-assisted plant disease diagnosis and severity estimation network, Computers and Electronics in
Agriculture,Volume 157,2019,Pages 518-529.
lP

30. Srdjan Sladojevic, Marko Arsenovic, Andras Anderla, Dubravko Culibrk, and Darko Stefanovic, Deep
Neural Networks Based Recognition of Plant Diseases by Leaf Image Classification, Computational
Intelligence and Neuroscience, Volume 2016, Article ID 3289801, 11 pages.

31. Alvaro Fuentes, Sook Yoon, Sang Cheol Kim and Dong Sun Park, A Robust Deep-Learning-Based
Detector for Real-Time Tomato Plant Diseases and Pests Recognition, Sensors 2017, 17, 2022, pp. 1-21.
a

32. Ferdouse Ahmed Foysal M., Shakirul Islam M., Abujar S., Akhter Hossain S. (2020) A Novel Approach
for Tomato Diseases Classification Based on Deep Convolutional Neural Networks. In: Uddin M., Bansal
urn

J. (eds) Proceedings of International Joint Conference on Computational Intelligence. Algorithms for

Intelligent Systems. Springer.

33. Ruedeeniraman N., Ikeda M., Barolli L. (2020) Performance Evaluation of VegeCare Tool for Tomato
Disease Classification. In: Barolli L., Nishino H., Enokido T., Takizawa M. (eds) Advances in Networked-
based Information Systems. NBiS - 2019 2019. Advances in Intelligent Systems and Computing, vol 1036.
Springer.
Jo

34. Fuentes A., Im D.H., Yoon S., Park D.S. (2017) Spectral Analysis of CNN for Tomato Disease
Identification. In: Rutkowski L., Korytkowski M., Scherer R., Tadeusiewicz R., Zadeh L., Zurada J. (eds)
Artificial Intelligence and Soft Computing. ICAISC 2017. Lecture Notes in Computer Science, vol 10245.
Springer.

23
Journal Pre-proof

35. M. Sardogan, A. Tuncer and Y. Ozen, "Plant Leaf Disease Detection and Classification Based on CNN
with LVQ Algorithm," 2018 3rd International Conference on Computer Science and Engineering
(UBMK), pp. 382-385, 2018.

36. H. F. Pardede, E. Suryawati, R. Sustika and V. Zilvan, "Unsupervised Convolutional Autoencoder-Based

Feature Learning for Automatic Detection of Plant Diseases," 2018 International Conference on
Computer, Control, Informatics and its Applications (IC3INA), pp. 158-162, 2018.

of
37. A. Elhassouny and F. Smarandache, "Smart mobile application to recognize tomato leaf diseases using
Convolutional Neural Networks," 2019 International Conference of Computer Science and Renewable
Energies (ICCSRE), Agadir, Morocco, 2019, pp. 1-4.

pro
38. H. Durmuş, E. O. Güneş and M. Kırcı, "Disease detection on the leaves of the tomato plants by using deep
learning," 2017 6th International Conference on Agro-Geoinformatics, Fairfax, VA, 2017, pp. 1-5.

re-
a lP
urn
Jo

24
Journal Pre-proof

*Highlights (for review)

HIGHLIGHTS

 An attention based deep residual network is proposed in this research to detect the type of
infection in tomato leaves.
 This enhanced deep learning architecture is the first of its kind developed for automatic

of
detection of infection in tomato leaves.
 95999 images were used for training the model and 24001 images were used for
validation purpose.
 Experimental results indicate that the proposed attention based residual network was able

pro
to detect the type of infection with an accuracy of 98%.

re-
a lP
urn
Jo
*Declaration of Interest Statement Journal Pre-proof

Conflict of interest

None

of
pro
re-
a lP
urn
Jo

Review - Paper-Tomato Plant
No ratings yet
Review - Paper-Tomato Plant
15 pages
Tomato Disease Detection Review
No ratings yet
Tomato Disease Detection Review
18 pages
Tomato Leaf Disease Detection AI
No ratings yet
Tomato Leaf Disease Detection AI
7 pages
Low-Cost CNN for Tomato Disease Detection
No ratings yet
Low-Cost CNN for Tomato Disease Detection
9 pages
Tomato Plant Diseases Detection Via Image Processing Using ML and DL
No ratings yet
Tomato Plant Diseases Detection Via Image Processing Using ML and DL
8 pages
Deep Learning for Tomato Disorder Detection
No ratings yet
Deep Learning for Tomato Disorder Detection
7 pages
Review - Paper-Tomato Plant
No ratings yet
Review - Paper-Tomato Plant
18 pages
Reference Paper
No ratings yet
Reference Paper
5 pages
An Efficient Deep Learning Model For Tomato Disease Detection
No ratings yet
An Efficient Deep Learning Model For Tomato Disease Detection
18 pages
Early Detection of Tomato Leaf Diseases Based On Deep Learning Techniques
No ratings yet
Early Detection of Tomato Leaf Diseases Based On Deep Learning Techniques
7 pages
Research Paper
No ratings yet
Research Paper
6 pages
IJDSML Vol 5 Iss 3 Paper 6 650 654
No ratings yet
IJDSML Vol 5 Iss 3 Paper 6 650 654
5 pages
Plant Disease Prediction (Tomato) - A4
No ratings yet
Plant Disease Prediction (Tomato) - A4
4 pages
Disease Detection On The Leaves of The Tomato Plants by Using Deep Learning
No ratings yet
Disease Detection On The Leaves of The Tomato Plants by Using Deep Learning
5 pages
Plant Disease Detection with AI
0% (1)
Plant Disease Detection with AI
7 pages
Paper 88-Deep Learning For Early Detection of Tomato Leaf Diseases
No ratings yet
Paper 88-Deep Learning For Early Detection of Tomato Leaf Diseases
8 pages
Research Article: Can Deep Learning Identify Tomato Leaf Disease?
No ratings yet
Research Article: Can Deep Learning Identify Tomato Leaf Disease?
11 pages
Tomato Leaf Disease Detection Using Convolutional Neural Network With Data Augmentation
No ratings yet
Tomato Leaf Disease Detection Using Convolutional Neural Network With Data Augmentation
8 pages
Shrestha 2020
No ratings yet
Shrestha 2020
5 pages
Paper 22732
No ratings yet
Paper 22732
10 pages
394 - ICAECA - IEEE-Camera Ready
No ratings yet
394 - ICAECA - IEEE-Camera Ready
6 pages
SVM-Based Detection of Tomato Leaves Diseases: Abstract. This Article Introduces An e Cient Approach To Detect and
No ratings yet
SVM-Based Detection of Tomato Leaves Diseases: Abstract. This Article Introduces An e Cient Approach To Detect and
12 pages
Lijuan Tan Et Al - 2021 - Tomato Leaf Diseases Classification Based On Leaf Images
No ratings yet
Lijuan Tan Et Al - 2021 - Tomato Leaf Diseases Classification Based On Leaf Images
17 pages
Efficient CNN For Tomato Disease Classification - A Novel Architecture With Reduced Image Size and Comparative Analysis
No ratings yet
Efficient CNN For Tomato Disease Classification - A Novel Architecture With Reduced Image Size and Comparative Analysis
9 pages
Progress Seminar1
No ratings yet
Progress Seminar1
29 pages
Literature Survey - PPT
No ratings yet
Literature Survey - PPT
12 pages
Tomato Disease Prediction Model Using Machine Learning Algorithms and Image Processing Techniques
No ratings yet
Tomato Disease Prediction Model Using Machine Learning Algorithms and Image Processing Techniques
6 pages
Brinjal Disease Detection Using DCNN
No ratings yet
Brinjal Disease Detection Using DCNN
26 pages
IJCRT22A6204
No ratings yet
IJCRT22A6204
7 pages
An Effective Analysis of Tomato Plant Leaf Disease Identification Using Deep Learning
No ratings yet
An Effective Analysis of Tomato Plant Leaf Disease Identification Using Deep Learning
4 pages
Tomato Disease Detection Using CNN
No ratings yet
Tomato Disease Detection Using CNN
18 pages
VotTomNet: Voting-Based Tomato Disease Diagnosis With Transfer Learning
No ratings yet
VotTomNet: Voting-Based Tomato Disease Diagnosis With Transfer Learning
9 pages
Literature Review of Disease Detection in Tomato Leaf Using Deep Learning Techniques
No ratings yet
Literature Review of Disease Detection in Tomato Leaf Using Deep Learning Techniques
5 pages
Image-Based Tomato Disease Identification Using Convolutional Neural Network
No ratings yet
Image-Based Tomato Disease Identification Using Convolutional Neural Network
7 pages
Detection of Tomato Leaf Disease Locations Using Deep Learning
No ratings yet
Detection of Tomato Leaf Disease Locations Using Deep Learning
9 pages
Deep Learning Approach To Automated Tomato Plant Leaf Disease Diagnosis
No ratings yet
Deep Learning Approach To Automated Tomato Plant Leaf Disease Diagnosis
8 pages
Plant Disease Detection Using Machine Learning
No ratings yet
Plant Disease Detection Using Machine Learning
14 pages
Preserving Tomato Crop With Disease Classification and Severity Estimation Using Deep Learni
No ratings yet
Preserving Tomato Crop With Disease Classification and Severity Estimation Using Deep Learni
6 pages
Progress Seminar1 - PPT - Final
No ratings yet
Progress Seminar1 - PPT - Final
27 pages
Potato Disease PDF ADA
No ratings yet
Potato Disease PDF ADA
17 pages
Potato Leaf Disease Detection via CNN
No ratings yet
Potato Leaf Disease Detection via CNN
6 pages
Reference Paper
No ratings yet
Reference Paper
6 pages
LKyz Pixmz M2 FZK P84 VE5988 SH JP SW 8 As WZ78 NJ 8 G
No ratings yet
LKyz Pixmz M2 FZK P84 VE5988 SH JP SW 8 As WZ78 NJ 8 G
9 pages
Tomato Plant Diseases Classification Using Deep Learning Based Classifier From Leaves Images
No ratings yet
Tomato Plant Diseases Classification Using Deep Learning Based Classifier From Leaves Images
5 pages
PLANT LEAF DISEASE DETECTION BASED ON DEEP LEARNING WITH REAL (20-05-2024) Mounika
No ratings yet
PLANT LEAF DISEASE DETECTION BASED ON DEEP LEARNING WITH REAL (20-05-2024) Mounika
19 pages
Progress Seminar1 - PPT - Final
No ratings yet
Progress Seminar1 - PPT - Final
27 pages
Artificial Intelligence in Tomato Leaf Disease Detection: A Comprehensive Review and Discussion
No ratings yet
Artificial Intelligence in Tomato Leaf Disease Detection: A Comprehensive Review and Discussion
20 pages
A Survey On Plant Leaf Disease Identification and Classification by Various Machine-Learning Technique
No ratings yet
A Survey On Plant Leaf Disease Identification and Classification by Various Machine-Learning Technique
8 pages
A Transfer Learning-Based Deep Neural Network For Tomato Plant Disease Classification
No ratings yet
A Transfer Learning-Based Deep Neural Network For Tomato Plant Disease Classification
10 pages
Artificial Intelligence in Tomato Leaf Disease Detection: A Comprehensive Review and Discussion
No ratings yet
Artificial Intelligence in Tomato Leaf Disease Detection: A Comprehensive Review and Discussion
20 pages
INTRODUCTION
No ratings yet
INTRODUCTION
2 pages
Performanec Based Leaf Deseace
No ratings yet
Performanec Based Leaf Deseace
33 pages
Shengyi Zhao Et Al - 2021 - Tomato Leaf Disease Diagnosis Based On Improved Convolution Neural Network by
No ratings yet
Shengyi Zhao Et Al - 2021 - Tomato Leaf Disease Diagnosis Based On Improved Convolution Neural Network by
15 pages
Research Paper
No ratings yet
Research Paper
10 pages
Deep Learning For Tomato Diseases: Classification and Symptoms Visualization
No ratings yet
Deep Learning For Tomato Diseases: Classification and Symptoms Visualization
18 pages
Crop Disease Detection with AI
No ratings yet
Crop Disease Detection with AI
9 pages
(IJCST-V11I3P2) :K.Vivek, P.Kashi Naga Jyothi, G.Venkatakiran, SK - Shaheed
No ratings yet
(IJCST-V11I3P2) :K.Vivek, P.Kashi Naga Jyothi, G.Venkatakiran, SK - Shaheed
4 pages
10.1007@s11554 020 00987 8
No ratings yet
10.1007@s11554 020 00987 8
14 pages
Detection of Plant Diseases in An Industrial Greenhouse Development Validation Amp Exploitation
No ratings yet
Detection of Plant Diseases in An Industrial Greenhouse Development Validation Amp Exploitation
6 pages
Method Overload
No ratings yet
Method Overload
8 pages
Class 6
No ratings yet
Class 6
35 pages
Khan FishNet A Large-Scale Dataset and Benchmark For Fish Recognition Detection ICCV 2023 Paper
No ratings yet
Khan FishNet A Large-Scale Dataset and Benchmark For Fish Recognition Detection ICCV 2023 Paper
11 pages
Few-Shot Fish Image Generation & Classification
No ratings yet
Few-Shot Fish Image Generation & Classification
6 pages
ResNet & VGGNet Deep Learning Guide
No ratings yet
ResNet & VGGNet Deep Learning Guide
44 pages
UNIT3
No ratings yet
UNIT3
17 pages
Quantization and Training of Neural Networks For Efficient Integer-Arithmetic-Only Inference
No ratings yet
Quantization and Training of Neural Networks For Efficient Integer-Arithmetic-Only Inference
14 pages
CS601 - Machine Learning - Unit 2 New
No ratings yet
CS601 - Machine Learning - Unit 2 New
56 pages
Keras - Multiple Outputs and Multiple Losses - PyImageSearch
No ratings yet
Keras - Multiple Outputs and Multiple Losses - PyImageSearch
71 pages
Two Marks Question With Answers
No ratings yet
Two Marks Question With Answers
12 pages
Deep Learning for Sign Language Recognition
No ratings yet
Deep Learning for Sign Language Recognition
9 pages
Unit 2 - Neural Networks (DL Illustrated)
No ratings yet
Unit 2 - Neural Networks (DL Illustrated)
146 pages
YOLOv3 Object Detection on PYNQ-Z2
No ratings yet
YOLOv3 Object Detection on PYNQ-Z2
30 pages
Machine Learning For Corporate Default Risk Multi-Period Prediction, Frailty Correlation, Loan Portfolios, and Tail Probabilities
No ratings yet
Machine Learning For Corporate Default Risk Multi-Period Prediction, Frailty Correlation, Loan Portfolios, and Tail Probabilities
38 pages
Gan Cts
No ratings yet
Gan Cts
8 pages
A High-Throughput and Power-Efficient FPGA Implementation of YOLO CNN For Object Detection
No ratings yet
A High-Throughput and Power-Efficient FPGA Implementation of YOLO CNN For Object Detection
13 pages
Chapter 3 - Training Deep Neural Networks
No ratings yet
Chapter 3 - Training Deep Neural Networks
25 pages
Introduction To Neural Network
No ratings yet
Introduction To Neural Network
20 pages
DL Ut - 1
No ratings yet
DL Ut - 1
14 pages
Applied Information Processing Systems 2022
100% (1)
Applied Information Processing Systems 2022
588 pages
Block Encryption LAyer BELA Zero-Trust Defense Against Model Inversion Attacks For Federated Learning in 5G 6G Systems
No ratings yet
Block Encryption LAyer BELA Zero-Trust Defense Against Model Inversion Attacks For Federated Learning in 5G 6G Systems
13 pages
Deep Learning Unit 2 GPT
No ratings yet
Deep Learning Unit 2 GPT
23 pages
DL Unit 1
No ratings yet
DL Unit 1
21 pages
Using Machine Learning To Detect Dustbathing Behavior of Cage-Free Laying Hens Automatically
No ratings yet
Using Machine Learning To Detect Dustbathing Behavior of Cage-Free Laying Hens Automatically
7 pages
Hyperparameters
No ratings yet
Hyperparameters
15 pages
Corn Leaf Disease Detection Using CNN
No ratings yet
Corn Leaf Disease Detection Using CNN
26 pages
CNN Based Automatic Detection of PV Cell Defects...
No ratings yet
CNN Based Automatic Detection of PV Cell Defects...
15 pages
Retele Neuronale Convolutionale
No ratings yet
Retele Neuronale Convolutionale
60 pages
Unit 4 NNDL
No ratings yet
Unit 4 NNDL
37 pages
Hyper Parameter Tuning Batch Normalization
No ratings yet
Hyper Parameter Tuning Batch Normalization
37 pages
Efficient Scaling with PEER MoE
No ratings yet
Efficient Scaling with PEER MoE
12 pages
Neural Networks & Deep Learning 2025
No ratings yet
Neural Networks & Deep Learning 2025
73 pages
A Survey On Deep Learning For Data-Driven Soft Sensors
No ratings yet
A Survey On Deep Learning For Data-Driven Soft Sensors
14 pages
Unit 3
No ratings yet
Unit 3
21 pages

Tomato Leaf Disease

Uploaded by

Tomato Leaf Disease

Uploaded by

Journal Pre-proof

Attention embedded residual CNN for disease detection in tomato leaves

Karthik R., Hariharan M., Sundar Anand, Priyanka Mathikshara,

To appear in: Applied Soft Computing Journal

Received date : 29 April 2019

© 2019 Elsevier B.V. All rights reserved.

Attention Embedded Residual CNN for Disease Detection in Tomato

Keywords: Attention, CNN, Residual Connections, Tomato, Deep Learning.

The blight is the most prevalent disease among others.

presented in the last two decades [1-5].

2.1 Machine learning based methods

were trained using weighted KNN [9].

2.3 Research gaps and Motivation

exist some significant challenges in it.

deployment of such models, a trade-off has to be achieved between the computational

2.4 Research Contributions

3.1 Residual Learning based CNN

3.1.1 Residual Progressive Feature Extraction (RPFE) Block

G(x), as given by Eq. 1.

Y(x) = F(x) + G(x) (1)

(a) share parameters (significantly at the initial few layers)

filters down to the deep layers) all in one-shot.

‘2k+1’ is the side of a square with odd convolutional filter

Batch Normalization Layer

conv2d_3 (Conv2D) (None, 64, 64, 64) 51264 add_1(0,0)

conv2d_6 (Conv2D) (None, 32, 32, 128) 204928 add_2(0,0)

conv2d_8 (Conv2D) (None, 1, 1, 4) 260 lambda_1(0,0)

3.2 Attention-based Residual CNN

an ARPFE block, given by Eq. 3.

Y(x) = F(x) + G’(x) (3)

The functional path G’(x) is computed as

G’(x) = G(x) * α (4)

No. of Connected to the previous

conv2d_3 (Conv2D) (None, 64, 64, 64) 18496 max_pooling2d_1(0,0)

dense_2 (Dense) (None, 128, 128, 1) 33 max_pooling2d_1(0,0)

max_pooling2d_3 (MaxPooling2D) (None, 128, 128, 32) 0 conv2d_1(0,0)

multiply_1 (Multiply) multiple 0 attention_weights(0,0),

batch_normalization_1 (BatchNorm) (None, 128, 128, 32) 128 max_pooling2d_3(0,0)

conv2d_4 (Conv2D) (None, 64, 64, 64) 51264 add_1(0,0)

conv2d_6 (Conv2D) (None, 32, 32, 128) 204928 add_2(0,0)

4. Results and Discussion

Healthy Early Blight Late Blight Leaf Mold

Table 4. Sample results of data augmentation process.

4.3 Experiment 2: Application of Residual CNN

Table 5: Observation of the proposed Residual CNN

Folds Accuracy Loss

4.4 Experiment 3: Application of Residual CNN with attention

the resultant observations are presented in Table 6.

Table 6: Observation of the proposed Attention based Residual CNN

Folds Accuracy Loss

4.5 Performance Analysis

Table 7. Summary of the proposed experiments

3 Attention embedded Residual 98

12 Halil Durmus et al. [38] Automatic SqueezeNet 54309 97.22

images," 2017 Chinese Automation Congress (CAC), pp. 2537-2510, 2017.

J. (eds) Proceedings of International Joint Conference on Computational Intelligence. Algorithms for

36. H. F. Pardede, E. Suryawati, R. Sustika and V. Zilvan, "Unsupervised Convolutional Autoencoder-Based

*Highlights (for review)

You might also like