Indin
Indin
Abstract— Using high resolution (HR) images collected from to achieve a level of accuracy unattainable with other remote
UAV, aerial craft or satellites is a research hotspot in the field data Earth sounding. After identification and differentiation of
forest areas analyzing. In practice, HR images are available for individual breeds, schemes can be obtained distribution and
a small number of regions, while for the rest, the maximum crown contouring can be done for each breed. So maybe
density various around 1 px/m. HR image reconstruction is a valuable information was obtained, for example, the number
well-known problem in computer vision. In recent years, deep of trees of each species, the volume of wood of each tree. This
learning algorithms have achieved great success in processing information is of great importance for modern forestry.
images, thus we introduced them into the field of processing However, there are still unsolved problems related to the use
orthoimages. At the same time, we noticed that orthoimages
of multispectral images for forest analysis. Difficulty
generally have colorful blocks of different sizes. Based on this
feature, we did not directly apply the classic algorithm, but
classification of breeds can be explained by the following
made some improvements. Experiments show that the effect of factors: a huge variety of breeds on a small area, variability of
our proposed method is equivalent to that of the classic spectral parameters of old trees, healthy trees and trees,
algorithms, however, in the pre-processing stage, it saves time partially or completely shaded by other trees. Researchers
significantly. An approach to the forest areas analyzing, should use the latest data processing techniques to achieve
including image segmentation and the tree spices classification higher accuracy [7]. The use of super resolution in the
is proposed. The results of numerical calculations are presented. problems of processing satellite snapshots is one of the
advanced areas of using the machine learning apparatus in
Keywords—Orthoimages, Forest Area Analysis, Super- processing images [8-10].
Resolution, Deep learning.
I. INTRODUCTION
For effective tracking of forest use, it is necessary to
develop tools that allow detailed accounting of forest Fig. 1. Some orthoimages
resources. Today, the introduction of space-based imagery
and high-resolution systems, coupled with the development of II. PROPOSED METHOD
methods for automated interpretation of digital images, has There are many solutions which allows forest areas
become one of the main engines for the digital transformation analyzing, including image segmentation and the tree spices
of production processes. However, many years of experience classification. It’s important achievement, because these 3
in the use of satellite monitoring systems have revealed methods allow you to track the state of the forest and its
several shortcomings, including insufficient spatial resolution cutting. These methods require more information to divide
of data, long waiting times for surveys, dependence on trees from each other and retrieve features of a tree. There-
cloudiness and other atmospheric processes. The use of fore, they use high resolution images for teaching and
unmanned aerial vehicles over the past few years has shown processing which can be made by UAV or aerial craft.
the possibility of solving problems requiring high detail.
The solution to this problem can be HR satellite imagery
Today, in fact, there are no complete solutions that allow
and geological data, that can help a neural network to
solving problems in subject areas, such as forestry, with a
reconstruct the geometry and solve the segmentation and
sufficient level of reliability due to the low reliability and
classification tasks. And in this case the researcher will not be
detail of the existing remote means of automated obtaining
limited with the research area.
data on forest resources [1-6].
But HR satellite images are available for a small number
Satellite data can definitely be useful for harvesting
of regions, while for the rest, the maximum density various
organizations timber, protection or study of forests, namely
around 1 px/m. It is not enough for the NN to separate trees
for: calculation of available resources; assessment of forest
from each other and determine the tree characteristics. It leads
quality; logging planning; assessing the diversity of breeds in
us to the problem of a low-quality dataset that cannot be used
a particular area; reforestation works. Thanks to additional
for the task described above. But we can use super-resolution
channels (yellow, extreme red and NIR -2) satellites improve
methods to up-sample pixel density of these images.
the capabilities separation of various tree species. High spatial
resolution of images, e.g. satellite WorldView-2, allows you
HR image reconstruction is a well-known problem in the rescaled VGG feature maps from the discriminator. The
computer vision. Its solutions have been widely used in network was trained on 350 thousand images from the
medicine, video and photo processing, social networks and ImageNet database and tested on Set5, Set14 and BSD100
wearable gadgets. Currently, there are several approaches, datasets. The new architecture was able to outperform
including using multiple LR images to restore one HR images. SRResNet on the BSD100 dataset.
But they are not suitable for solving this problem, since it is
not always possible to obtain several images of the same area. However, SRGAN had several weaknesses, some of
Therefore, the best solution seems to be the choice of which were eliminated later.
approaches to restore one image from one LR image. Based on it, the ESRGAN architecture [15] was created. It
Initially, the researchers created small networks with a low introduced a new Residual-in-Residual Dense Block (Fig 3),
level of super-resolution. These networks were not so deep which had a larger number of neurons and was easier to learn.
and had encoder-decoder architecture (Fig. 2). The first In addition, batch normalization has been removed, because it
several layers retrieve futures from an image and then they leads to artefacts in the generated image. Also in this work, an
pass to the decoder which reconstruct the image using inverse improved discriminator is described to obtain a more
convolution. perception beautiful image. These improvements made it
possible to improve the result of the generator: the sharpness
of the images and the number of small details in the
reconstructed image increased.
2
𝑃𝑆𝑁𝑅 = 10 log10 (𝐺𝑀𝐴𝑋 /𝑀𝑆𝐸)
where:
1
𝑀𝑆𝐸 = 𝑖=1 ∑𝑗=1[𝑓(𝑖, 𝑗) − 𝑓𝑏 (𝑖, 𝑗)]
∑𝑁 𝑀 2
𝑁𝑀