CrackForest: Advanced Road Crack Detection
CrackForest: Advanced Road Crack Detection
Abstract—Cracks are a growing threat to road conditions and attentions in recent years. As it is known, traditional man-
have drawn much attention to the construction of intelligent ual road crack detection approaches are very time-consuming,
transportation systems. However, as the key part of an intelli- dangerous, labor-intensive and subjective [2]–[5]. Thus, the
gent transportation system, automatic road crack detection has
been challenged because of the intense inhomogeneity along the slow and subjective traditional procedures have been replaced
cracks, the topology complexity of cracks, the inference of noises gradually by automatic crack detection, which is developed for
with similar texture to the cracks, and so on. In this paper, we fast and reliable crack analysis in intelligent transportation sys-
propose CrackForest, a novel road crack detection framework tems [6]. Automated crack detection systems can quantify the
based on random structured forests, to address these issues. Our quality of road surfaces and assist in prioritizing and planning
contributions are shown as follows: 1) apply the integral channel
features to redefine the tokens that constitute a crack and get the maintenance of the road network and thereby accomplish
better representation of the cracks with intensity inhomogeneity; the objective of preserving the roads in good condition and
2) introduce random structured forests to generate a high- extending the service life.
performance crack detector, which can identify arbitrarily com- With the development of image processing techniques, road
plex cracks; and 3) propose a new crack descriptor to characterize crack detection and recognition have been widely discussed
cracks and discern them from noises effectively. In addition, our
method is faster and easier to parallel. Experimental results prove in the past few decades [7]–[11]. In early methods [12], [13],
the state-of-the-art detection precision of CrackForest compared researchers usually use threshold-based approaches to find
with competing methods. crack regions based on the assumption that real crack pixel is
Index Terms—Road crack detection, structured learning, consistently darker than its surroundings. These methods are
machine learning, random structured forests, crack descriptor, very sensitive to noises, since only brightness feature is taken
crack characterization. into consideration. Moreover, these approaches are performed
on individual pixels. Lack of global view also makes these
I. I NTRODUCTION methods unsatisfying.
In terms of the current methods [5], [8], [9], [11], [14], [15],
superior to other state-of-the-art detecting techniques like maps are built by a 2D continuous wavelet transform, wavelet
CrackTree [20], CrackIT [6], FFA [25] and MPS [17], [19]. coefficients maximal values are obtained for crack detection.
CrackForest incorporates complementary features from multi- As a result, differences between crack regions and crack free
ple levels to characterize cracks and to take advantage of the regions could be raised up. However, due to the anisotropic
structured information in crack patches. In specific, we first ex- characteristic of wavelets, these approaches may not handle the
tend the traditional road crack detection feature set by introduc- cracks with low continuity or high curvature properly.
ing the integral channel features [26] to re-define crack tokens Minimal Path Selection: Give both endpoints of the curve
with structured information. After that, we apply random struc- as user’s input, minimal path based method can extract simple
tured forests [27] to exploit such structured information. Ran- open curves in images, that is first proposed by Kass et al. [39].
dom structured forests predict a patch crack of structured tokens In [40], Kaul et al. propose a method that is able to detect
that are aggregated across the image to compute our prelimi- the same types of contour-like image structures with less prior
nary crack detection result. In this step, the structured tokens knowledge about both the topology and the endpoints of the
assigned to each image patch can be obtained simultaneously. desired curves. To avoid false detections that are assimilating
Then, the structured tokens are used to construct the crack de- loops, Amhaz et al. [17], [19] propose an improved algorithm
scriptor which consists of two statistical histograms to charac- to select endpoints at the local scale and then to select minimal
terize cracks with arbitrary topology. With the crack descriptor, paths at the global scale. It can also detect the width of the
a classification method is applied to discriminate the cracks crack. In [25], Nguyen et al. propose a method which takes into
from noises. In addition, we also propose a quantitative eval- account intensity and crack form features for crack detection
uation method for road crack detection task. Extensive exper- simultaneously by introducing Free-Form Anisotropy.
iments demonstrate the efficiency of CrackForest on real road Machine Learning: With the increasing size of image data,
crack dataset and our method shows state-of-the-art precision. machine learning based methods [3], [5], [15], [41]–[43] have
become an important branch in detecting road cracks. In [3],
artificial neural network models are used to separate crack
II. R ELATED W ORK pixels from the background by selecting proper thresholds.
In this section, we first give a brief review of crack detection, [41] deals with the detection of poorly contrasted cracks in
after that, the related crack characterization methods are dis- textured areas using a Markov random field model. In [43],
cussed. Crack characterization exploits the spatial distribution Cord et al. use AdaBoost to distinguish images of road surfaces
of image tokens composing the detected cracks and thereby with defects from road surfaces based on textual information
transforms the structured tokens into discrete labels. with patterns. For all these methods, the training and classifi-
cation are conducted on each sub-image and as local method,
they have drawbacks in finding complete crack curves over the
A. Crack Detection
whole image.
Numerous papers have been written on road crack detection
over the past 30 years. Early works [1], [28]–[30] are mainly
B. Crack Characterization
based on intensity-thresholding for its simplicity and efficiency.
Most recent work explores crack detection under more chal- Existing methods on crack characterization are mainly based
lenging conditions and can be divided into five branches: on shape descriptor, crack seeds and assigning crack type on
methods based on saliency detection, textured-analysis, wavelet each image block.
transform, minimal path and machine learning. An assessment Reference [14] gives the definition of cracks based on math-
of various pavement distress detection methods can be found in ematical morphology and proposes that a crack is thought to
[31] and [32]. be a succession of saddle points with linear features. But this
Salient Detection: Salient regions are visually more con- definition is pretty vague. Reference [2], [44] use the direc-
spicuous due to their contrast with the surroundings. Although tion indices of each pixels and extensible directions for each
existing methods [33], [34] demonstrate their effectiveness in direction to characterize cracks. A chromosome representation
detecting salient regions in the Berkeley database [35], they is applied to encode the different ensemble of directions and its
perform poor on the completeness and continuity of detected extensible directions. Therefore, a crack can be represented as
crack. a long sequence of 0 and 1.
Textured-Analysis: Since road surface images are often Reference [31], [42] categorize the cracks into five types:
highly textured, textured-analysis methods [8], [36], [37] are longitudinal, transverse, diagonal, block, and alligator. Refer-
introduced in road crack detection. In order to distinguish the ence [42] uses a neural network based method to search patterns
cracks and the backgrounds, [8], [36] use the Wigner model, of various crack types horizontally and vertically. Reference
and [37] uses classification method. These methods use a [31] uses curves and buffers to describe certain regions of a
local binary pattern operator to determine whether each pixel crack. Reference [9] uses longitudinal, transverse, or diagonal
belongs to a crack and the local neighbor information is not crack seeds to identify longitudinal and transverse cracks. Ori-
taken into consideration. Therefore, the cracks with intensity entation and strength information are taken into consideration
inhomogeneity can not be detected precisely. by [20], which largely improves the diversity of crack seeds.
Wavelet Transform: Wavelet transform is applied to sepa- In [6], cracks are classified into three types as defined by
rate distresses from noises [38]. In [4], complex coefficient the Portuguese Distress Catalog. They use two block feature
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
SHI et al.: AUTOMATIC ROAD CRACK DETECTION USING RANDOM STRUCTURED FORESTS 3
A. Structured Tokens
Token (segmentation mask) indicates the crack regions of
an image patch. Current block-based methods [6], [38] are
Fig. 2. Examples of tokens learned from a manually labeled image database.
usually used to extract small patches and calculate mean
(a) Most representative token for each token set. (b) Mean contour structure for and standard deviation value on these patches to represent
each token set. an image token. These traditional features are computed on
gray level images and applied to describe the brightness and
including the mean and the standard deviation values of pixel- gradient information. However, local structured information is
normalized intensities to categorize an image block as longi- not taken into consideration. So in the first step, we re-define
tudinal, transversal or miscellaneous. Reference [5] computes the tokens by introducing the integral channel features which
CTA (Conditional Texture Anisotropy) values over the distrib- incorporate the color, gradient information from multiple levels
ution of the mean and the standard deviation values calculated and facets.
on pixels to distinguish crack pixels from defect free pixels. 1) Learning the Tokens: Assume that we have a set of im-
However, there are two main drawbacks in these methods. ages I with a corresponding set of binary images G representing
On the one hand, new types of crack cannot be generated. By the manually labeled crack edge from the sketches. We use a
applying the structured tokens, we extend the crack types into 16 × 16 sliding window to extract image patches x ∈ X from
thousands of dimensions. On the other hand, these methods the original image. Image patch x which contains a labeled
perform poor on the cracks with complex topology. To address crack edge at its center pixel, will be regarded as positive
this issue, we propose a novel crack descriptor to describe the instance and vice versa. y ∈ Y encodes the corresponding local
cracks with arbitrary complex topology. image annotation (crack region or crack free region), which also
indicates the local structured information of the original image.
These tokens cover the diversity of various cracks, which are
III. AUTOMATIC ROAD C RACK D ETECTION
not limited to straight lines, corners, curves, etc.
In this section, we will introduce our novel crack detection From Fig. 4, we can see the extracted image patches and their
method which can take advantage of the structured information hand drawn contour tokens. These image patches and tokens
of cracks. Fig. 3 shows the overall procedure of our proposed will be used to train CrackForest later.
method. This framework can be divided into three parts: In 2) Feature Extraction: To describe the above tokens, fea-
the first part, we extend the feature set of traditional crack tures are computed on the image patches x extracted from the
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
token in each leaf to represent the token class. The class number
Fig. 4. (Top) Example of original image and its ground truth. (Bottom)
Example of extracted image patches and their hand-drawn contours. Notice the
of tokens equals to the number of leaves.
variety of sketches. A forest T can be seen as an ensemble of decision trees ft .
Each tree ft (x) gives a prediction of a sample x ∈ X . The final
training images I, and considered to be weak classifiers in the class prediction of multiple trees is integrated by a majority
next step. voting algorithm. A leaf L(π) ∈ ft can assign a class prediction
We use mean and standard deviation value as features. Two for samples it is reached by, where π stands for the most
matrices are computed for each original image: the mean ma- represented token in the leaf. Each node N (h, ftL , ftR ) ∈ ft is
trix Mm with each block’s average intensity and the standard associated with a binary split function
deviation matrix STDm with corresponding standard devia-
tion value std. Each image patch yields a mean value and a h(x, θj ) ∈ {0, 1} (1)
16 × 16 standard deviation matrix. with feature θj for each node j. If h(x, θj ) = 0, sample x
To characterize the cracks more comprehensively, we also should be branched to the left sub-tree ftL , otherwise the right
apply a set of channel features composed with color, gradient sub-tree ftR .
and oriented gradient information. Integral channel features not 1) Class Prediction: Given a tree ft ∈ T , the class pre-
only perform better than other features including histogram of diction of an image patch x ∈ X can be obtained by recur-
oriented gradient (HOG), but also achieve fast detecting results sively branching it forward until a leaf is reached. An intuitive
and integrate heterogeneous sources of information [26]. example has shown in Fig. 5. The prediction function ψ(x|ft ) :
3 color, 2 magnitude and 8 orientation channels, for a total X → Y for node j is
of 13 channels yield 3328 candidate features. Each of the chan-
nel captures a different aspect of information. Self-similarity ψ x|ftL , for h(x, θj ) = 0
L
ψ x|N h, ft , ft R
=
features are compute for each channel. These features capture ψ x|ftR , for h(x, θj ) = 1
the portion that an image patch contains similar textures based
on color or gradient information [45]. Texture information is ψ (x|L(π)) = π. (2)
on a m × m grid over the patch. These differences
computed The final class prediction of x is obtained from the prediction
yield 5·5
2 = 300 more features per channel. of each tree as the one receiving the majority voting.
2) Randomized Training: Each tree is trained individually.
B. Structured Learning For a given node Nj and training set Sj ⊂ X × Y, the goal is
In previous step, a set of tokens y which indicate the struc- to find the optimal feature θj that results in a good split of the
tured information of local patches, and features which describe data. In other words, the discrepancy of tokens in the same leaf
such tokens, are acquired. In this step, we cluster these tokens should be as small as possible. We apply information gain to
by using a state-of-the-art structured learning framework, ran- measure this discrepancy and maximize the information gain to
dom structured forests, to generate an effective crack detector. choose θj . The form of information gain for node j is defined
Random structured forests can exploit the structured informa- as follow:
tion and predict the segmentation mask (token) of a given image
Ij = I Sj , SjL , SjR (3)
patch. Thereby we can obtain the preliminary result of crack
detection. where Sj = SjL ∪ SjR , SjL = {(x, y) ∈ Sj |h(x, θj ) = 0}
In random structured forests, each decision tree ft (x) clas- stands for a set of samples that reaches the left sub-tree of the
sifies an image patch x ∈ X by recursively branching left or current node and SjR = {(x, y) ∈ Sj |h(x, θj ) = 1} refers to
right down to the tree until a leaf is reached. And the class of the other set of samples that reaches the right sub-tree.
the node is assigned to patch x. The leaf stores the prediction Whether a terminal node should be further split depends on
of the input x, which is a target label y ∈ Y or a distribution the maximum depth, the minimum size of node or the entropy
over Y. By training such a tree, tokens with the same structure of the class distribution. If the node is no longer splitting, a
will be gathered at one leaf. We use the most representative leaf is grown where the class prediction π is set to the most
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
SHI et al.: AUTOMATIC ROAD CRACK DETECTION USING RANDOM STRUCTURED FORESTS 5
Fig. 9. Statistical feature histogram showing the occurrence (in logs) of each
token (sorting in descending order of occurrence). (a) Statistical feature of all
26443 tokens. (b) Only the most representative tokens are shown.
SHI et al.: AUTOMATIC ROAD CRACK DETECTION USING RANDOM STRUCTURED FORESTS 7
Fig. 10. Part of the results of road crack detection using our proposed method. Notice that our method can eliminate the influence of oil stains, shadows, and
complex background, effectively, and can cope with miscellaneous crack topology.
Crack Detection Accuracy: We use precision, recall and The precision, recall and F1 Score on detect region can be
F1 Score to evaluate the performance of different crack detec- similarly computed by (7) and (8)
tion algorithms.
The precision and recall can be computed on true positive TPr
Prregion = (10)
(TP), false negative (FN) and false positive (FP) TPr + FPr
TP TPr
Prpixel = (7) Reregion = (11)
TP + FP TPr + FNr
TP
Repixel =
TP + FN
(8) 2 × Prregion ×Reregion
F 1region = (12)
2 × Prpixel ×Repixel Prregion +Reregion
F 1pixel = . (9)
Prpixel +Repixel
Crack Continuity Assessment: We define the “Continuity
Assume that the detected pixels which are no more than Index (CI)” as a degree of continuity. It measures how much
five pixels away from the manually labeled pixel are true the detected regions are connected if they belong to the same
positive pixels. crack. Denote M as the number of images in the testing set. Ni
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
Fig. 11. Results of different algorithms on CFD (from top to bottom: original image, ground truth, Canny, CrackIT, CrackTree, FFA, and CrackForest).
A. CFD Results
as the number of ground truth cracks in the ith image and nij
as the number of true positive regions that cover the jth ground We propose an annotated road crack dataset called CFD. This
truth crack in the ith image dataset is composed of 118 images, which can generally reflect
⎛ ⎞ urban road surface condition in Beijing, China. Each image has
1 ⎝ 1 hand labeled ground truth contours. All the images are taken by
M Ni
1 ⎠
CI = (13) an iPhone5 with focus of 4mm, aperture of f/2.4 and exposure
M i=1 Ni j=1 nij
time of 1/134s. The width of the images ranges from 1 to 3 mm.
From Fig. 10, we can see that these images contain noises such
The continuity is better as CI gets closer to 1. as shadows, oil spots and water stains.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
SHI et al.: AUTOMATIC ROAD CRACK DETECTION USING RANDOM STRUCTURED FORESTS 9
TABLE I
C RACK D ETECTION R ESULTS E VALUATION ON CFD
Fig. 12. Results of different algorithms on AigleRN (from top to bottom: original image, ground truth, CrackIT, FFA, MPS, and CrackForest).
We use the 60%/40% training/testing split with the images detection due to its high sensitivity. CrackIT does not perform
reduced to 480 × 320. Example detections on CFD are shown well on low-resolution and low-contrast images. As a result,
in Fig. 10. The first column lists the original images. The it fails to detect most of the crack pixels in the images. The
corresponding manually labeled cracks are shown in the second accuracy of CrackTree is acceptable. But it may hallucinate a
column as ground truth. The third column shows the prelimi- crack that does not exist. In addition, the width of the crack can
nary detection results after applying random structured forests. not be observed. As for FFA, it may falsely detect landmarks as
Darker color indicates that the pixel is more likely to contain a defects.
crack. After the binarization, crack pixels with less confidence Our method CrackForest performs better than the alterna-
are removed. The use of crack descriptor allows us to transform tives. To be specific, CrackForest (SVM) gives both good
each detected region into a vector. By applying classification precision and recall.
method such as SVM, we can eliminate the noise regions and
keep the crack regions effectively. The final detection results
B. AigleRN Results
are shown in column 5. Our method is robust to noise.
Five methods are conducted on this dataset: Canny, CrackIT, AigleRN dataset [49] contains 38 images with ground truth.
CrackTree, FFA and CrackForest. Results are shown in Fig. 11 We use 60% for training and the rest for testing.
and summary statistics are in Table I. As it can be observed We compare four methods on this dataset: CrackIT, FFA,
intuitively, our method outperforms the alternatives. Traditional MPS and CrackForest. Example AigleRN results are shown
edge detection method Canny is not suitable for road crack in Fig. 12 and Table II. Although CrackIT can detect most
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
TABLE II
C RACK D ETECTION R ESULTS E VALUATION ON A IGLE RN
TABLE III
C ROSS -D ATASET G ENERALIZATION T EST FOR C RACK F OREST. TRAIN/TEST I NDICATES THE T RAINING /T ESTING D ATASET U SED
of the cracks, a lot of noises are still remained. Besides, the it possible to exploit such structured information and predict
continuity of the detected cracks is not very good. As for FFA, local segmentation masks of the given image patch. Thirdly, a
the precision is acceptable. But when it comes to detecting crack descriptor, which consists of two statistical histograms, is
cracks with complex topology, FFA is less competitive. MPS proposed to characterize the structured information of cracks
performs well on detecting light cracks, but it may hallucinate a and discriminate cracks from noises. In addition, we also
crack that does not exist. CrackForest shows promising results propose an annotated road crack image dataset which can
on most of the indicators. To be specific, CrackForest (SVM) generally reflect the urban road surface condition in China and
still gives both better precision and recall. two indicators to evaluate the performance of crack detection
methods.
C. Cross Dataset Generalization Experimental results prove the effectiveness of our method
To study the ability of our approach to generalize across in suppressing noises compared to several competing methods.
datasets, we ran a final set of experiments. In Table III, we show Our approach yields promising processing speed and state-of-
results on AigleRN using CrackForest trained on CFD and also the-art accuracy.
results on CFD using CrackForest trained on AigleRN. Note Source code is available online: [Link]
that images in the CFD and AigleRN datasets are qualitatively cuilimeng/CrackForest. Our annotated road crack image
quite different, see Figs. 11 and 12, respectively. dataset CFD is also available online: [Link]
In Table III, top, results on AigleRN of the AigleRN and cuilimeng/CrackForest-dataset.
CFD trained models are compared. The precision and recall
do not fluctuate much using two datasets for training. Results VI. L IMITATIONS AND F UTURE W ORK
on CFD of the CFD and AigleRN models, shown in Table III,
bottom, are likewise similar. In our experiments, CrackForest has proven to be quite
The experimental results show that CrackForest could serve promising. However, it does have some limitations:
as a general purpose crack detector without the necessity of • Our method has only performed on static images so far.
retraining. The video streaming is not taken into consideration. In
the future, we will test our method on video datasets.
V. C ONCLUSION • The width of the crack is not measured in our method.
In this paper, we propose an effective and fast automatic road We will focus on the severity level assessment in the
crack detection method, which can suppress noises efficiently future work.
by learning the inherent structured information of cracks. Our
detection framework builds upon representative and discrimina- ACKNOWLEDGMENT
tive integral channel features and combines this representation Thanks professor Qin Zou and professor Manuel Avila for
with random structured forests. This also allows us to train their kind help. And thanks professor Sylvie Chambon and
our framework in a completely supervised manner from a Dr. Rabih Amhaz for their valuable discussions.
small training set. More importantly, we can characterize cracks
and eliminate noises marked as cracks by using two feature R EFERENCES
histograms proposed. [1] H. Oliveira and P. L. Correia, “Automatic road crack segmentation using
Our innovation is shown as follows: Firstly, to capture the entropy and image dynamic thresholding,” in Proc. 17th EUSIPCO,
inherent structure of the road crack, we apply integral channel Aug. 24–28, 2009, pp. 622–626.
[2] H. Cheng, J.-R. Chen, C. Glazier, and Y. Hu, “Novel approach to pave-
features to enrich the feature set of traditional crack detection. ment cracking detection based on fuzzy set theory,” J. Comput. Civil Eng.,
Secondly, the introducing of random decision forests makes vol. 13, no. 4, pp. 270–280, Oct. 1999.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
SHI et al.: AUTOMATIC ROAD CRACK DETECTION USING RANDOM STRUCTURED FORESTS 11
[3] H. Cheng et al., “Novel approach to pavement cracking detection based [30] H. Zhao, G. Qin, and X. Wang, “Improvement of canny algorithm based
on neural network,” Transp. Res. Rec., J. Transp. Res. Board, vol. 1764, on pavement edge detection,” in Proc. 3rd Int. Conf. CISP, 2010, vol. 2,
pp. 119–127, 2001. pp. 964–967.
[4] P. Subirats, J. Dumoulin, V. Legeay, and D. Barba, “Automa- [31] Y.-C. Tsai, V. Kaul, and R. M. Mersereau, “Critical assessment of pave-
tion of pavement surface crack detection using the continuous ment distress segmentation methods,” J. Transp. Eng., vol. 136, no. 1,
wavelet transform,” in Proc. IEEE Int. Conf. Image Process., 2006, pp. 11–19, Jan. 2009.
pp. 3037–3040. [32] S. Chambon and J.-M. Moliard, “Automatic road pavement assessment
[5] T. S. Nguyen, M. Avila, and S. Begot, “Automatic detection and classi- with image processing: Review and comparison,” Int. J. Geophys.,
fication of defect on road pavement using anisotropy measure,” in Proc. vol. 2011, 2011, Art. no. 989354.
Eur. Signal Process. Conf., 2009, pp. 617–621. [33] R. Achanta, F. Estrada, P. Wils, and S. Süsstrunk, “Salient region detection
[6] H. Oliveira and P. L. Correia, “Automatic road crack detection and charac- and segmentation,” in Computer Vision Systems. Berlin, Germany:
terization,” IEEE Trans. Intell. Transp. Syst., vol. 14, no. 1, pp. 155–168, Springer-Verlag, 2008, pp. 66–75.
Mar. 2013. [34] R. Achanta, S. Hemami, F. Estrada, and S. Susstrunk, “Frequency-tuned
[7] H. Oh, N. W. Garrick, and L. E. Achenie, “Segmentation algorithm salient region detection,” in Proc. IEEE CVPR, 2009, pp. 1597–1604.
using iterative clipping for processing noisy pavement images,” in [35] P. Arbelaez, M. Maire, C. Fowlkes, and J. Malik, “Contour detection
Proc. 2nd Int. Conf. Imaging Technol., Tech. Appl. Civil Eng., 1998, and hierarchical image segmentation,” IEEE Trans. Pattern Anal. Mach.
pp. 138–147. Intell., vol. 33, no. 5, pp. 898–916, May 2011.
[8] M. Petrou, J. Kittler, and K. Song, “Automatic surface crack detection [36] K. Y. Song, M. Petrou, and J. Kittler, “Texture crack detection,” Mach.
on textured materials,” J. Mater. Process. Technol., vol. 56, no. 1–4, Vis. Appl., vol. 8, no. 1, pp. 63–75, Jan. 1995.
pp. 158–167, Jan. 1996. [37] Y. Hu and C.-X. Zhao, “A local binary pattern based methods for pave-
[9] Y. Huang and B. Xu, “Automatic inspection of pavement cracking dis- ment crack detection,” J. Pattern Recognit. Res., vol. 1, no. 20103,
tress,” J. Electron. Imag., vol. 15, no. 1, pp. 013 017–013 017, 2006. pp. 140–147, 2010.
[10] S. Cafiso, A. Di Graziano, and S. Battiato, “Evaluation of pavement [38] J. Zhou, P. S. Huang, and F.-P. Chiang, “Wavelet-based pavement distress
surface distress using digital image collection and analysis,” in Proc. 7th detection and evaluation,” Opt. Eng., vol. 45, no. 2, Feb. 2006,
Int. Congr. Adv. Civil Eng., 2006, pp. 1–10. Art. no. 027007.
[11] M. Gavilán et al., “Adaptive road crack detection system by pavement [39] M. Kass, A. Witkin, and D. Terzopoulos, “Snakes: Active contour
classification,” Sensors, vol. 11, no. 10, pp. 9628–9657, Oct. 2011. models,” Int. J. Comput. Vis., vol. 1, no. 4, pp. 321–331, Jan. 1988.
[12] M. S. Kaseko and S. G. Ritchie, “A neural network-based methodol- [40] V. Kaul, A. Yezzi, and Y. Tsai, “Detecting curves with unknown endpoints
ogy for pavement crack detection and classification,” Transp. Res. C, and arbitrary topology using minimal paths,” IEEE Trans. Pattern Anal.
Emerging Technol., vol. 1, no. 4, pp. 275–291, Dec. 1993. Mach. Intell., vol. 34, no. 10, pp. 1952–1965, Oct. 2012.
[13] Q. Li and X. Liu, “Novel approach to pavement image segmentation [41] P. Delagnes and D. Barba, “A Markov random field for rectilinear struc-
based on neighboring difference histogram method,” in Proc. CISP, 2008, ture extraction in pavement distress image analysis,” in Proc. Int. Conf.
vol. 2, pp. 792–796. Image Process., 1995, vol. 1, pp. 446–449.
[14] N. Tanaka and K. Uematsu, “A crack detection method in road surface [42] B. J. Lee and H. Lee, “Position-invariant neural network for digital
images using morphology,” in Proc. MVA, 1998, vol. 98, pp. 17–19. pavement crack analysis,” Comput.-Aided Civil Infrastruct. Eng., vol. 19,
[15] H. Oliveira and P. L. Correia, “Supervised strategies for crack detection no. 2, pp. 105–118, Jan. 2004.
in images of road pavement flexible surfaces,” in Proc. EUSIPCO, 2008, [43] A. Cord and S. Chambon, “Automatic road defect detection by textural
pp. 25–29. pattern recognition based on AdaBoost,” Comput.-Aided Civil Infrastruct.
[16] C. Koch and I. Brilakis, “Pothole detection in asphalt pavement images,” Eng., vol. 27, no. 4, pp. 244–259, Apr. 2012.
Adv. Eng. Inf., vol. 25, no. 3, pp. 507–515, Aug. 2011. [44] L. Ying and E. Salari, “Beamlet transform-based technique for pavement
[17] R. Amhaz, S. Chambon, J. Idier, and V. Baltazart, “A new minimal path crack detection and classification,” Comput.-Aided Civil Infrastruct. Eng.,
selection algorithm for automatic crack detection on pavement images,” vol. 25, no. 8, pp. 572–580, Nov. 2010.
in Proc. IEEE ICIP, 2014, pp. 788–792. [45] J. J. Lim, C. L. Zitnick, and P. Dollár, “Sketch tokens: A learned mid-level
[18] M. Avila, S. Begot, F. Duculty, and T. S. Nguyen, “2D image based representation for contour and object detection,” in Proc. IEEE CVPR,
road pavement crack detection by calculating minimal paths and dynamic 2013, pp. 3158–3165.
programming,” in Proc. IEEE ICIP, 2014, pp. 783–787. [46] P. Dollár, Piotr’s Computer Vision Matlab Toolbox (PMT). [Online].
[19] R. Amhaz, S. Chambon, J. Idier, and V. Baltazart, “Automatic crack Available: [Link]
detection on 2D pavement images: An algorithm based on minimal path [47] J. Canny, “A computational approach to edge detection,” IEEE Trans.
selection,” IEEE Trans. Intell. Transp. Syst., 2015, 24p, DOI: 10.1109/ Pattern Anal. Mach. Intell., vol. PAMI-8, no. 6, pp. 679–698,
TITS.2015.2477675. hal-01206038. Nov. 1986.
[20] Q. Zou, Y. Cao, Q. Li, Q. Mao, and S. Wang, “CrackTree: Automatic crack [48] H. Oliveira and P. L. Correia, “Crackit—An image processing toolbox
detection from pavement images,” Pattern Recognit. Lett., vol. 33, no. 3, for crack detection and characterization,” in Proc. IEEE ICIP, 2014,
pp. 227–238, Feb. 2012. pp. 798–802.
[21] K. Fernandes and L. Ciobanu, “Pavement pathologies classification using [49] S. Chambon, AigleRN. [Online]. Available: [Link]
graph-based features,” in Proc. IEEE ICIP, 2014, pp. 793–797. Chambon/Crack_Detection_Database.html
[22] Y.-C. J, C. Jiang, and Y. Huang, “Multiscale crack fundamental element
model for real-world pavement crack classification,” J. Comput. Civil
Eng., vol. 28, no. 4, pp. 388–396, Jul. 2012, Art. no. 04014012.
[23] Y. J. Tsai, C. Jiang, and Z. Wang, “Implementation of automatic crack
evaluation using crack fundamental element,” in Proc. IEEE ICIP, 2014,
pp. 773–777. Yong Shi received the Ph.D. degree in management
[24] P. Kontschieder, S. R. Bulo, H. Bischof, and M. Pelillo, “Structured class- science and computer system from The University of
labels in random forests for semantic image labelling,” in Proc. IEEE Kansas, Lawrence, KS, USA.
ICCV, 2011, pp. 2190–2197. He is currently a Professor with the Chinese
[25] T. S. Nguyen, S. Begot, F. Duculty, and M. Avila, “Free-form anisotropy: Academy of Sciences, Beijing, China, where he
A new method for crack detection on pavement surface images,” in Proc. serves as the Director of Research Center on
18th IEEE ICIP, 2011, pp. 1069–1072. Fictitious Economy and Data Science. He is also a
[26] P. Dollár, Z. Tu, P. Perona, and S. Belongie, “Integral channel features,” Professor and a Distinguished Chair of Information
in BMVC, 2009, pp. 91.1–91.11. Technology with the College of Information Science
[27] P. Dollár and C. L. Zitnick, “Structured forests for fast edge detection,” in and Technology, University of Nebraska Omaha,
Proc. IEEE ICCV, 2013, pp. 1841–1848. Omaha, USA. His research interests include data
[28] H.-D. Cheng and M. Miyojim, “Automatic pavement distress detection mining, information overload, optimal system designs, multiple-criteria deci-
system,” Inf. Sci., vol. 108, no. 1–4, pp. 219–240, Jul. 1998. sion making, decision support systems, and information and telecommunica-
[29] A. Ayenu-Prah and N. Attoh-Okine, “Evaluating pavement cracks with tions management.
bidimensional empirical mode decomposition,” EURASIP J. Adv. Signal Dr. Shi is the Editor-in-Chief of International Journal of Information
Process., vol. 2008, no. 1, Mar. 2008, Art. no. 861701. Technology and Decision Making and Annals of Data Science.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
Limeng Cui is currently working toward the Ph.D. Fan Meng is currently working toward the Ph.D.
degree with University of Chinese Academy of Sci- degree with University of Chinese Academy of Sci-
ences, Beijing, China, with a focus on computer ences, Beijing, China.
vision and machine learning. His research interests include edge detection, im-
Her work lies in image processing, object detec- age segmentation, road crack detection, and propor-
tion and scene recognition. tion learning.
Zhiquan Qi is an Assistant Professor with the Re- Zhensong Chen is currently working toward the
search Center on Fictitious Economy and Data Sci- Ph.D. degree with University of Chinese Academy
ence, Chinese Academy of Sciences, Beijing, China. of Sciences, Beijing, China.
His research interests include object detection, His research is focused on image segmentation
object tracking, change detecting, and machine and proportion learning.
learning.