Deep Learning for Asphalt Crack Detection
Deep Learning for Asphalt Crack Detection
Informatics Institute for Postgraduate Studies, University of Information Technology and Communications. Iraq
*
Corresponding author.
E-mail address: ms202220727@[Link]
DOI: 10.24237/djes.2025.18305
This work is licensed under a Creative Commons Attribution 4.0 International License.
68
Shemeam T. Muhey, Sinan A. Naji/ Diyala Journal of Engineering Sciences Vol (18) No 3, 2025: 68-87
In recent years, the fast development of method for pavement crack identification
computer technology and Artificial intelligence utilizing a deep convolutional neural network
(AI) has led to its integration across numerous fusion model, which effectively identifies
fields. In recent years, it has become extensively cracks and ensures recognition accuracy
utilized for road crack identification, resulting in through the YOLOv10 model. A detected crack
numerous automated detection techniques. can be segmented using Residual-Attention
Traditional automatic detection techniques UNet 3+, and the resulting binary image can be
encompass the Canny algorithm, which relies on utilized to compute the geometry characteristics
threshold segmentation [3], and the Otsu of the crack. Consequently, the suggested model
approach [4]. Nevertheless, owing to the holds substantial importance for intelligent
intricate characteristics of the road surface and pavement detection and can concurrently
pavement environment, along with the general perform detection and segmentation, thereby
applicability and robustness of the traditional markedly enhancing model efficiency.
Canny and Otsu algorithms, the accuracy of the The main contributions to the suggested model
detection findings is suboptimal. Subsequently, include the following:
the minimal cost path search algorithm [4], the
support vector machine (SVM) detection
algorithm [5], the Crack Tree detection
1. The suggested system employs YOLOv10
technique, and others emerged. These
for object detection because it effectively
techniques have solved low Precision, however,
resolves a significant obstacle in
detecting Precision still takes a long time.
organizational development: balancing
Additionally, the complex architecture of
accuracy and computational efficiency
detection and Crack Tree techniques using SVM
compared to earlier versions.
prevents their practical application. Based on
2. This study presents a novel technique for
these difficulties and the prevalent machine
image segmentation. We introduced
learning technology [6], a deep learning and
Residual-Attention UNet 3+, a composite
neural network-based automatic crack
neural network that amalgamates the
identification and detection technique is
advantages of UNet 3+, residual units, and
developed. Crack detection data can also
attention mechanisms for the segmentation
monitor pavement conditions and determine
of crack images. This technique has
road maintenance strategies. Thus, pavement
improved predicted accuracy relative to
crack identification would considerably impact
earlier methods, hence differentiating our
road monitoring and maintenance automation,
approach from prior methodologies. This
thus its precision and speed must be improved.
results in attaining a high level of precision
Currently, the direction of Road pavement
in the identification of pavement cracks.
crack recognition research is separated into two
3. Utilizing another dataset by capturing
sections. The digital image processing method
pavement crack images of local roads to test
uses artificial feature identification, including
the proposed system in order to reflect its
frequency, edge, HOG, gray level, texture, and
applicability and generalization in the real
entropy, to construct feature recognition
environment.
conditions for limited and total recognition. The
second type uses deep learning to create a This paper is organized into the following
Convolutional Neural Network (CNN) for sections: Section 2 presents the related research,
automatic feature recognition. The network Section 3 outlines our methodology, and Section
adjusts to meet or exceed label accuracy by 4 details the experiments and analysis. In
following specific rules. This paper establishes conclusion, Section 5 encapsulates the entirety
a deep learning-based convolutional network to of the work.
detect pavement cracks automatically.
Considering the aforementioned issues in
pavement crack detection, this paper proposes a
69
Shemeam T. Muhey, Sinan A. Naji/ Diyala Journal of Engineering Sciences Vol (18) No 3, 2025: 68-87
70
Shemeam T. Muhey, Sinan A. Naji/ Diyala Journal of Engineering Sciences Vol (18) No 3, 2025: 68-87
71
Shemeam T. Muhey, Sinan A. Naji/ Diyala Journal of Engineering Sciences Vol (18) No 3, 2025: 68-87
segmentation. In Stage 3, a novel approach for Simultaneously, a novel technique for surface
estimating surface cracks is introduced to feature estimation has been devised to examine
measure the length, width, and orientation of the surface feature data, emphasizing crack
segmented cracks. The primary benefit of the morphology precisely [16]. The specifics of
suggested model is the substantial enhancement each phase of the planned architecture are
in the precision and efficacy of road crack explained in the subsequent subsections.
segmentation among intricate backgrounds.
72
Shemeam T. Muhey, Sinan A. Naji/ Diyala Journal of Engineering Sciences Vol (18) No 3, 2025: 68-87
3.1.3 Splitting the dataset datasets. SUT-Crack datasets were split into
three subsets for this study: 70% for training,
A common technique in machine learning, 20% for validation, and 10% for testing.
data mining, pattern recognition, and other comprehensive information is available in Table
fields is splitting the dataset into smaller sub- 3.
Table 3: Details of splitting dataset
73
Shemeam T. Muhey, Sinan A. Naji/ Diyala Journal of Engineering Sciences Vol (18) No 3, 2025: 68-87
74
Shemeam T. Muhey, Sinan A. Naji/ Diyala Journal of Engineering Sciences Vol (18) No 3, 2025: 68-87
75
Shemeam T. Muhey, Sinan A. Naji/ Diyala Journal of Engineering Sciences Vol (18) No 3, 2025: 68-87
4 5
decoder layer 𝑋𝐷𝑒 and 𝑋𝐷𝑒 . As a result, it will be aggregation process, comprising 320 filters of
formed five same resolution feature maps. To size 3 × 3, batch normalization, and a ReLU
eliminate unnecessary information and further activation function, has been applied on the
standardize the number of channels a concatenated feature map from five scales in
convolution with 64 filters of size 3 × 3 could be order to smoothly combine the shallow exquisite
a good option. Furthermore, a feature information with deep semantic information
[28].
3
Figure 3. Illustration of how to construct the full-scale aggregated feature map of third decoder layer 𝑋𝐷𝑒 in original
Unet 3+
76
Shemeam T. Muhey, Sinan A. Naji/ Diyala Journal of Engineering Sciences Vol (18) No 3, 2025: 68-87
processing techniques and geometric crack width is computed by dividing the crack's
calculations. However, the morphological length by the area of the linked component. The
characteristics of cracks are not thoroughly contour is examined after it is sketched in the
addressed, which reduces efficiency and image of the crack area. The contour is analysed.
accuracy. Finally, the results of the crack length and width
At this stage, the suggested model is are displayed in the crack image, as shown in
subjected to a case study to verify its robust and Figure 4, which shows the steps of crack
dependable performance in a real-world quantification.
environment. We used a dataset comprising
asphalt crack images from local Iraqi roadways. i. Image Pre-processing: As shown in Figure
The segmentation model produces segmented 5, A series of operations is applied to find
images. Through this rigorous testing, the and analyze the contour of the crack
proposed model can be validated for its (Convert image to Gray-scale, to blur the
effectiveness in solving many safety problems, image, apply Gaussian filter [34], to convert
improving road performance, and reducing the image pixels to a binary image, apply
maintenance costs. adaptive Thresholding, Morphological
We provide a region-connected search Operations, most common morphological
method based on the linked component of operations are erosion and dilation. Erosion
cracks to make the visible cracks more removes pixels from image borders,
comprehensive, distinct, and consistent with the whereas dilation adds pixels.
real trend of cracks. Following the acquisition of Morphological processing removes tiny
the crack binary image's contour coordinates, cracks and fills gaps in detected cracks,
the crack's length is computed using the improving crack detection accuracy [35].
coordinates that were obtained, and the average
ii. Find Optimal Contour: We relied on The optimal contour was chosen based on
geometric analysis of the crack contour to the area. The contour area is then calculated
accurately detect real cracks, which helps in using the function counter area (); small area
filtering real cracks from noise in the image. contours are neglected because they often
To achieve this, we applied canny edges represent noise or unimportant details, while
filter to find the edges (connected large areas are considered because they
components) and then find the contour represent mostly cracks.
using the OpenCV, a “library in Python The optimal crack is calculated by assuming
programming language,” simplifies a minimum threshold (Min value), so areas
locating and drawing crack contours smaller than the previously specified value are
through two basic functions: find-Contours neglected, and areas larger than the specified
() and draw-Contours () [35]. value are mostly considered a contour of cracks.
77
Shemeam T. Muhey, Sinan A. Naji/ Diyala Journal of Engineering Sciences Vol (18) No 3, 2025: 68-87
After that, we verify whether the contour Algorithm (1): Analysis Crack
represents a crack or not by determining the
Input: Valid crack contour , Angl_Threshold=30
smallest rectangle surrounding the contour
through (GetMinAreaRect(contour)), and the Output: crack properties
width and length of the rectangle are calculated.
Through (aspect-ratio = length/width) and if the Begin
cracks are real, this ratio is greater or equal to Step 1: Get rotated rectangle properties:
(3), but if this ratio is less, it is not considered a
rect = GetMinAreaRect(contour)
crack [36]. This ratio was adopted as a minimum
because studies of crack analysis in road Step 2: Calculate dimensions:
engineering and materials science show that width = Min([Link], [Link])
actual cracks in substructures have a length-to- length = Max([Link], [Link])
width ratio ranging from 3 to 20 or more... If the
angle = [Link]
ratio is less than 3, this means that the shape is
square or circular, but if the ratio is greater than Step 3: Normalize angle
3, this means that the shape is longitudinal and area = CalculateArea(contour)
thin, as the cracks are considered longitudinal IF width > length:
and thin, which is caused by mechanical stress
angle = angle + 90
and thermal changes, which leads to linear
cracking, making them much longer than their END IF
width. Step 4: Determine orientation
IF (Angle < Angl_Threshold (Angle > (180 -
iii. Crack analysis
Angl_Threshold))Then
The algorithm (1) includes analyzing cracks orientation ="horizontal"
to determine their dimensions (width, length,
IF |Angle - 90| < Angl_Threshold Then
angle, and orientation), which will then be
translated into actual measurements. This orientation ="vertical"
algorithm mixes mathematical geometry, image IF Angle < 90 Then orientation ="diagonal-
analysis, and engineering data processing to right"
present an accurate and efficient method for
IF Angle > 90 Then orientation = "diagonal-
analyzing cracks in infrastructure. It generates
interpretable results, making it suitable for left"
practical applications in road maintenance. We Step 5: return{contour: contour,
employ a specified angle threshold (Angle width: width,
Threshold = 30°) to consistently classify crack
length: length,
direction into horizontal, vertical, and diagonal
orientations. Previous investigations angle: angle,
corroborate this threshold, which indicated that center: CenterPoint,
an angular tolerance of 25°–35° was efficient in orientation: orientation}
categorizing cracks under diverse situations.
End
The 30° threshold value signifies an effective
equilibrium between accuracy and tolerance in
practical settings, particularly where fractures iv. Convert to real measurements
may display minor angular variations due to
surface defects or perspective distortions. It is necessary to translate the resultant
measurements (in pixels) to millimetres (mm)
for informed decision-making in road
maintenance. We derive the conversion factor
for applying it to the real measurements of
78
Shemeam T. Muhey, Sinan A. Naji/ Diyala Journal of Engineering Sciences Vol (18) No 3, 2025: 68-87
length, width, and area, as demonstrated in the and bridges in Diyala governorate. It includes
subsequent equation [37]: various types of images that present various
𝑅𝑒𝑓𝑒𝑟𝑒𝑛𝑐𝑒 𝑤𝑖𝑑𝑡ℎ (𝑚𝑚)
problems for crack detection, such as
𝐶𝑜𝑛𝑣𝑒𝑟𝑠𝑖𝑜𝑛 𝐹𝑎𝑐𝑡𝑜𝑟 (𝐶𝐹) =
𝑅𝑒𝑓𝑒𝑟𝑒𝑛𝑐𝑒 𝑤𝑖𝑑𝑡ℎ (𝑝𝑖𝑥𝑒𝑙)
(1) shadows and stains of oil. A fixed height of
one meter, directly above the pavement, was
𝑙𝑒𝑛𝑔𝑡ℎ𝑚𝑚 = 𝐶𝐹 ∗ 𝐿𝑒𝑛𝑔𝑡ℎ𝑝𝑖𝑥𝑒𝑙 (2) used to capture the high-quality photos. using
a digital camera type (Canon RP + 18-
𝑤𝑖𝑑𝑡ℎ𝑚𝑚 = 𝐶𝐹 ∗ 𝑤𝑖𝑑𝑡ℎ𝑝𝑖𝑥𝑒𝑙 (3) 135mm), with a resolution of (6240 × 4160).
All pictures were captured during morning
𝐴𝑟𝑒𝑎𝑚𝑚 2 = 𝐶𝐹 2 ∗ 𝐴𝑟𝑒𝑎𝑝𝑖𝑥𝑒𝑙2 (4) hours to ensure clarity and similar lighting
conditions. The images in this dataset were
annotated by the use of Labelme application.
4. Experiments results and analysis This dataset was prepared specially to reflect
4.1 Implementation details and Dataset
the real-world environment on local asphalt
Collection
roads. Figure 7 shows a sample of these
The training utilized the Adam optimizer, images with their corresponding masks.
including a learning rate of 0.0001 and a batch These images of datasets present various
size of 32 at 100 epochs. All tests were problems for crack detection, such as oil
conducted with TensorFlow on a Windows 10 stains, shadows, and varying lighting
computer with an Intel Core i7 running at 3.60 conditions. This feature improves the
GHz and 16 GB of RAM. In this study, we reliability of automated pavement crack-
conducted experiments using two datasets to get detecting methods and simulates real-world
more accurate results: circumstances.
In this study, we conducted experiments The training, validation, and testing images
using two datasets to get more accurate results: are from the SUT-Crack dataset, whilst the real-
time test images are sourced from the IRD-
Set 1: The “SUT-Crack Dataset “which Crack dataset. During pre-processing, images
contains 130 high-resolution images in jpg designated for training, validation, and testing
format, with dimensions of 3024 by 4032 are downsized to 640 x 640 x 3 and 320 × 320
pixels [38]. The images are organized dually, pixels. The limited size of the SUT-Crack
meaning that each original image is matched dataset may not furnish sufficient training data
by its corresponding ground truth image, as to attain appropriate outcomes. To tackle this
shown in Figure 6. SUT-Crack is available at difficulty, diverse strategies are utilized to
[Link] enhance the dataset and increase the quantity of
photographs. Augmentation employs rotation,
Set 2: The “IRD-Crack Dataset”: which shifting, shearing, zooming, flipping, and
represents our local dataset. It comprises of reflection as shown in Figure 8.
asphalt crack images that were collected in
cooperation with the directorate of highways
79
Shemeam T. Muhey, Sinan A. Naji/ Diyala Journal of Engineering Sciences Vol (18) No 3, 2025: 68-87
Horizontal Horizontal
Original Rotation Vertical Flip Zoom Vertical shift
Flip shift
1
4.2 Evaluation metrics 𝐴𝑃 ∑𝑁
1 ∫0 𝑝(𝑟)𝑑𝑟
𝑚𝐴𝑃 = = (9)
𝑁 𝑁
To statistically assess the experimental To
statistically assess the experimental results, where N is the number of crack classes, p is the
many performance metrics were analysed, percentage of all anticipated positive samples
including Accuracy (ACC), Precision (Pr), that are successfully detected, and r is the
Recall (Re), Dice Coefficient (DC), mean percentage of all actual positive samples that
Average Precision (mAP), and Intersection over were correctly detected.
Union (IoU). The methods used for metric The Intersection over Union (IoU) is the
calculation are delineated in Equations (5), (6), ratio of the intersection to the union of the
(7), (8), (9), and (10) respectively[39]. predicted mask and the ground truth data,
𝑇𝑃+𝑇𝑁
expressed as:
𝐴𝐶𝐶 = (5)
𝑇𝑃+𝑇𝑁+𝐹𝑃+𝐹𝑁 𝐴∩𝐵
𝐼𝑜𝑈 = (10)
𝑇𝑃 𝐴∪𝐵
𝑃𝑟 = (6)
𝑇𝑃+𝐹𝑃
where A and B indicated the predicting image
𝑇𝑃 mask and ground truth image mask,
𝑅𝑒 = (7)
𝑇𝑃+𝐹𝑁 respectively.
2𝑇𝑃
𝐷𝐶 = 𝐹𝑃+2𝑇𝑃+𝐹𝑁 (8) 4.3 Crack detection results
Where TP = True Positives, TN =True We utilized validation data from the SUT-
Negatives, FP = False Positive, and FN = False Crack dataset to assess the efficacy of the
Negatives. proposed crack detection algorithm in the object
detection phase. The findings are displayed in
The average precision (AP) denotes the area Table 4, illustrating performance indicators like
below the precision-recall curve, whereas mean Precision, Recall, mAP@0.5, and
average precision (mAP) refers to the average of mAP@0.5:0.95. At an IoU threshold of 0.5, the
different classes of AP values: suggested YOLOv10 model attained an
80
Shemeam T. Muhey, Sinan A. Naji/ Diyala Journal of Engineering Sciences Vol (18) No 3, 2025: 68-87
(mAP@0.5) of 68.90%. When the IoU threshold noise, illumination fluctuations, and oil stains.
was varied from 0.5 to 0.95, it produced an YOLOv10 not only delivered great detection
mAP@0.5:0.95 of 54.58%, as seen in Figure 9. accuracy but also enhanced computational
Furthermore, Figure 10 and Figure 11 depict efficiency, rendering it particularly suitable for
the precision-confidence and recall-confidence real-time asphalt crack detection systems. This
curves, affirming the model's resilience across results from its enhanced design, which
different confidence levels. Figure 12 diminishes processing time and resource use
illustrates that YOLOv10 effectively detects while preserving detection accuracy.
fractures, even under adverse situations like
Table 4: Detection Results and compare with previous studies. This mark "N/R" indicates that the
results are not available.
Table 4 demonstrates that the suggested model successfully absorbed the fundamental
YOLOv10 model attained enhanced accuracy patterns present in the training data. Figure 13
and recall relative to prior studies. Although the Illustrates both training/validation loss.
study of Li et al,2024 [7] indicated a diminished Utilizing 100 epochs. It indicates that the
mAP\@0.5 and lacked other metrics, and Deng minimal loss score attained during training and
et al,2023[8] presented merely an aggregate validation was 0.16.
mAP score without a detailed analysis, our The aforementioned values indicate the model's
methodology delivers a more thorough and excellent performance and efficacy,
dependable assessment across various demonstrating its generalizability, as the results
thresholds, substantiating the efficacy of are consistently low and equivalent in both
YOLOv10 in practical detection contexts. training and validation scenarios.
Figure 14 depicts visual representations of
the model's training and validation performance
with epoch = 100 for Accuracy, Precision, and
4.4 Crack Segmentation Results
Recall metrics. The maximum accuracy,
The loss percentages indicate the disparity precision, and recall scores achieved during
between the predicted outcomes and the actual training and validation were 0.9906, 0.974, and
ground truth values. Lower loss percentages 0.999, respectively.
indicate a higher concordance between the
predicted and actual values, so implying that the
81
Shemeam T. Muhey, Sinan A. Naji/ Diyala Journal of Engineering Sciences Vol (18) No 3, 2025: 68-87
Figure 9. The network's performance during the validation process: (a) at an IoU threshold of 0.5, the computed mAP
(mAP@0.5), and (b) with the IoU threshold varying from 0.5 to 0.95, the computed mAP (mAP@0.5: 0.95).
The results indicate that the model performance and robust generalization
demonstrated efficiency in making accurate capability.
predictions. In contrast, the equilibrium between The proposed system assesses the model
recall and precision indicates that the model is utilizing various metrics, including (IoU) and
accurate in identifying cracks and thorough in the Dice coefficient. Our model attained an IoU
encompassing every relevant feature, hence of 0.956 and a Dice coefficient of 0.977,
minimizing false negatives. Reflecting effective signifying a substantial correspondence
between the predicted and actual segmentations.
82
Shemeam T. Muhey, Sinan A. Naji/ Diyala Journal of Engineering Sciences Vol (18) No 3, 2025: 68-87
(a)
83
Shemeam T. Muhey, Sinan A. Naji/ Diyala Journal of Engineering Sciences Vol (18) No 3, 2025: 68-87
(b)
(c)
Figure 14. Training and Validation (a) Accuracy (b) Precision (c) Recall.
84
Shemeam T. Muhey, Sinan A. Naji/ Diyala Journal of Engineering Sciences Vol (18) No 3, 2025: 68-87
Table 5: Segmentation Results and compare with previous studies. This mark "-" indicates that the results are not
available.
Figure 15. Results of the crack segmentation with residual-attention UNet. 3+.
85
Shemeam T. Muhey, Sinan A. Naji/ Diyala Journal of Engineering Sciences Vol (18) No 3, 2025: 68-87
introducing an algorithm to search for connected image processing technique,” Measurement, vol.
components of cracks, find the crack's optimal 215, p. 112832, 2023.
[5] K. Sarkar, A. Shiuly, and K. G. Dhal,
contour, and analyze it for real measurements. “Revolutionizing concrete analysis: An in-depth
Consequently, we reached the following survey of AI-powered insights with image-centric
conclusions: The suggested technique can approaches on comprehensive quality control,
accurately detect at the pixel scale. Our method's advanced crack detection and concrete property
superiority was assessed using precision, recall, exploration,” Constr. Build. Mater., vol. 411, p.
134212, 2024.
mAP@0.5, and mAP@0.5:0.95 measures, [6] K. C. Laxman, N. Tabassum, L. Ai, C. Cole, and P.
demonstrating greater object detection accuracy Ziehl, “Automated crack detection and crack depth
than prior studies. The crack detection method prediction for reinforced concrete structures using
attained 100% precision, 91% recall, and deep learning,” Constr. Build. Mater., vol. 370, p.
68.90% mAP@0.5. Incorporating an attention 130709, 2023.
[7] Y. Li, C. Yin, Y. Lei, J. Zhang, and Y. Yan, “RDD-
gate and residual connection significantly YOLO: Road Damage Detection Algorithm Based
enhances the accuracy of Residual-Attention on Improved You Only Look Once Version 8,”
UNet 3+ for crack segmentation, resulting in an Appl. Sci., vol. 14, no. 8, p. 3360, 2024.
IOU of 95.60% and a dice coefficient of 97.74% [8] L. Deng, A. Zhang, J. Guo, and Y. Liu, “An
for the segmented cracks. The advanced crack integrated method for road crack segmentation and
surface feature quantification under complex
quantification technique can significantly backgrounds,” Remote Sens., vol. 15, no. 6, p. 1530,
mitigate pavement damage by analyzing cracks 2023.
and translating them into precise measures [9] Z. Shu, Z. Yan, and X. Xu, “Pavement Crack
(mm). These data provide an accurate Detection Method of Street View Images Based on
assessment and characterization of the cracks, Deep Learning,” J. Phys. Conf. Ser., vol. 1952, no.
2, 2021, doi: 10.1088/1742-6596/1952/2/022043.
hence aiding maintenance teams in executing [10] Q. An, X. Chen, X. Du, J. Yang, S. Wu, and Y. Ban,
appropriate maintenance strategies. “Semantic Recognition and Location of Cracks by
In future work, we aim to expand the local Fusing Cracks Segmentation and Deep Learning,”
dataset to ensure the diversity and severity of Complexity, vol. 2021, 2021, doi:
pavement defects to detect patching, erosion, 10.1155/2021/3159968.
[11] Z. Zhang, Q. Liu, and Y. Wang, “Road Extraction
and many other defects, not just cracks in road by Deep Residual U-Net,” IEEE Geosci. Remote
pavements. We also aspire to promote the Sens. Lett., vol. 15, no. 5, pp. 749–753, 2018, doi:
system through edge computing to detect cracks 10.1109/LGRS.2018.2802944.
directly from edge devices such as drones and [12] Q. Zhang et al., “Improved U-net network asphalt
IoT evaporators. This can provide real-time pavement crack detection method,” PLoS One, vol.
19, no. 5 May, pp. 1–21, 2024, doi:
crack detection and segmentation from 10.1371/[Link].0300679.
photographs or videos. [13] M. He and T. L. Lau, “CrackHAM: A Novel
Automatic Crack Detection Network Based on U-
References Net for Asphalt Pavement,” IEEE Access, vol. 12,
no. November 2023, pp. 12655–12666, 2024, doi:
[1] X. Feng et al., “Pavement crack detection and 10.1109/ACCESS.2024.3353729.
segmentation method based on improved deep [14] Z. Zhang, K. Yan, X. Zhang, X. Rong, D. Feng, and
learning fusion model,” Math. Probl. Eng., vol. S. Yang, “Automated highway pavement crack
2020, no. 1, p. 8515213, 2020. recognition under complex environment,” Heliyon,
[2] Z. Li, H. Zhang, Z. Li, and Z. Ren, “Residual- vol. 10, no. 4, p. e26142, 2024, doi:
attention UNet++: a nested residual-attention U-net 10.1016/[Link].2024.e26142.
for medical image segmentation,” Appl. Sci., vol. 12, [15] M. Hussain, “Yolov5, yolov8 and yolov10: The go-
no. 14, p. 7149, 2022. to detectors for real-time vision,” arXiv Prepr.
[3] K. Malek, A. Mohammadkhorasani, and F. Moreu, arXiv2407.02988, 2024.
“Methodology to integrate augmented reality and [16] A. J. Yousif and M. H. Al-Jammas, “Real-time
pattern recognition for crack detection,” Comput. Arabic Video Captioning Using CNN and
Civ. Infrastruct. Eng., vol. 38, no. 8, pp. 1000–1019, Transformer Networks Based on Parallel
2023. Implementation,” Diyala J. Eng. Sci., pp. 84–93,
[4] M.-V. Pham, Y.-S. Ha, and Y.-T. Kim, “Automatic 2024.
detection and measurement of ground crack [17] M. M. Islam, M. B. Hossain, M. N. Akhtar, M. A.
propagation using deep learning networks and an Moni, and K. F. Hasan, “CNN based on transfer
86
Shemeam T. Muhey, Sinan A. Naji/ Diyala Journal of Engineering Sciences Vol (18) No 3, 2025: 68-87
87