Comparative Study of Global Color and Text - 2012 - Journal of Visual Communicat
Comparative Study of Global Color and Text - 2012 - Journal of Visual Communicat
23 (2012) 359–380
Comparative study of global color and texture descriptors for web image retrieval
Otávio A.B. Penatti a,⇑, Eduardo Valle a,b, Ricardo da S. Torres a
a
RECOD Lab – Institute of Computing (IC), University of Campinas (Unicamp), Brazil
b
Department of Computer Engineering and Industrial Automation (DCA), School of Electrical and Computer Engineering (FEEC), University of Campinas (Unicamp), Brazil
a r t i c l e i n f o a b s t r a c t
Article history: This paper presents a comparative study of color and texture descriptors considering the Web as the envi-
Received 10 March 2011 ronment of use. We take into account the diversity and large-scale aspects of the Web considering a large
Accepted 10 November 2011 number of descriptors (24 color and 28 texture descriptors, including both traditional and recently pro-
Available online 28 November 2011
posed ones). The evaluation is made on two levels: a theoretical analysis in terms of algorithms complex-
ities and an experimental comparison considering efficiency and effectiveness aspects. The experimental
Keywords: comparison contrasts the performances of the descriptors in small-scale datasets and in a large hetero-
Comparative study
geneous database containing more than 230 thousand images. Although there is a significant correlation
Color descriptors
Texture descriptors
between descriptors performances in the two settings, there are notable deviations, which must be taken
Web into account when selecting the descriptors for large-scale tasks. An analysis of the correlation is pro-
Content-based image retrieval vided for the best descriptors, which hints at the best opportunities of their use in combination.
Efficiency and effectiveness Ó 2011 Elsevier Inc. All rights reserved.
Asymptotic complexity
Correlation analysis
1. Introduction the feature vectors of the database images, that had been previ-
ously extracted. The comparisons are made by computing distance
This paper presents a comparative study of global color and values and those values are used to rank the database images
texture descriptors considering the Web as the environment of according to their similarities to the query image. The most similar
use. images are finally shown to the user. The image descriptor is in-
In the latest years, the amount of digital images has grown rap- volved in this process in the extraction of images properties and
idly. Among the main reasons for that, one may mention digital in the distances computations. Therefore, it is clear the critical
cameras and high-speed Internet connections. Those elements importance of image descriptors for CBIR systems.
have created a simple way to generate and publish visual content It is known that many image descriptors are application depen-
worldwide. That means that a huge amount of visual information dent, that is, their performances vary from one application to another.
becomes available everyday to a growing number of users. Much Therefore, conducting comparative evaluation of image descriptors
of that visual information is available on the Web, which has be- considering different environments of use is very important.
come the largest and most heterogeneous image database so far. Literature presents several comparative studies for color, tex-
In that scenario, there is a crucial demand for image retrieval ture, and shape descriptors. A recent study [3] compares a large
systems [1,2], which could be satisfied by content-based image re- number of image descriptors in five different image collections
trieval (CBIR) systems. In CBIR systems, the image descriptor is a for tasks of classification and image retrieval. Other studies are spe-
very important element. It is responsible for assessing the similar- cific to certain properties: shape descriptors [4–8], texture descrip-
ities among images. Descriptors can be classified depending on the tors [9–11], or color descriptors [12–14]. Surveys and comparisons
image property analyzed, like, for example, color or texture markedly related to ours can be found in Sections 2.4 and 2.6.
descriptors, that analyze color or texture properties, respectively. In comparative studies of descriptors, the Web is rarely consid-
In CBIR systems, the searching process works as follows. The ered as the environment of use. In general, the amount of descrip-
user queries the system, usually, by using a query image. Its prop- tors considered is small and the application analyzed is specific.
erties are encoded into feature vectors and then compared against Besides that, asymptotic theoretical analysis and other efficiency
considerations are generally not discussed in detail.
⇑ Corresponding author. Our study has many novel aspects. First, it considers a large
E-mail addresses: [email protected] (O.A.B. Penatti), [email protected]. number of descriptors: 24 color and 28 texture descriptors, includ-
unicamp.br (E. Valle), [email protected] (Ricardo da S. Torres). ing both traditional and recently proposed ones. Our evaluation is
1047-3203/$ - see front matter Ó 2011 Elsevier Inc. All rights reserved.
doi:10.1016/j.jvcir.2011.11.002
360 O.A.B. Penatti et al. / J. Vis. Commun. Image R. 23 (2012) 359–380
made in two levels: a theoretical analysis of algorithms complexi- Dencodes image visual properties into feature vectors, as
ties and an experimental comparison. The experimental compari- shown in Fig. 1. A feature vector contains information related
son is made over specific and heterogeneous collections. to the image visual properties, like color, texture, shape, and
Descriptors are analyzed in a Web environment with a database spatial relationship of objects.
containing more than 230 thousand images with very heteroge- dD compares two feature vectors. As shown in Fig. 1, given two
neous content. The experimental analysis considers efficiency feature vectors, the function computes a distance or similarity
and effectiveness aspects. The effectiveness evaluation in the value between these vectors. The distance or similarity between
Web environment takes into account how much the descriptors the vectors is considered as the distance or similarity between
agree with the human perception of semantic similarity, by asking the images from which the vectors were extracted.
a pool of users to annotate the relevance of answers for each query.
Another important aspect of our study is related to scalability It is worth noting that in some papers from literature, what we
and diversity. How does a descriptor perform as the size of the col- call here as feature vector is considered the descriptor, with the dis-
lection increases considerably? And how does the heterogeneity of tance function being accounted elsewhere. We adopt the definition
the collection affect the descriptors effectiveness? Our experi- of [19], in which the descriptor also includes the distance function,
ments in the Web environment address both issues. since feature vectors need a specific distance function to establish
The large-scale, heterogeneous nature of the Web can benefit from the geometry of the description space: the vectors alone, without
the use of descriptors in combination. Although the combination of the metric, are meaningless. That is better understood if we ob-
descriptors is a complex topic, with many competing techniques, serve that the same type of feature vector may have radically dif-
and thus outside the scope of this work, we perform a general analysis ferent performances when used with different distance functions.
of the complementarity of the best descriptors, which should be
taken into account when selecting them for combined use. 2.2. Distance functions
We concentrate our evaluation on global image descriptors, since
local descriptors have a radically different cost benefit compro- Distance functions are very important for the image descriptors.
mise, especially in the context of information retrieval involving Their choice has a huge impact in the descriptor performance.
high-level semantic contexts. Local image detectors and descriptors The most common distance functions are the function L1, also
have been extensively surveyed in [15–17], respectively. named Manhattan or city-block, the function L2, also known as
We have decided not to include shape descriptors in our study, Euclidean distance, and the function L1. Those common functions
because almost all of them are segmentation dependent. As is (or variations of them) are largely used. There are also, more com-
known, segmentation is still a hard and extremely application plex functions, like the Earth Mover’s Distance (EMD) [20].
dependent task. Therefore, shape descriptors are still not mature
for a heterogeneous environment like the Web. Readers interested 2.3. Color descriptors
in shape descriptors should refer to one of the comparative studies
in specific environments [4–8]. Another less common kind of One of the most important visual properties identified by hu-
descriptor, called spatial relationship descriptor, tries to encode man vision is color, making it one of the most used properties in
the spatial relationships between objects [18]. However, those CBIR systems.
descriptors also depend on the segmentation of images.
The paper is organized as follows. Section 2 presents the image 2.3.1. Taxonomy for color descriptors
descriptor definition used throughout this work and taxonomies Literature presents three main approaches for color analysis, as
for color and texture descriptors. Section 3 presents the results of shown in Fig. 2.
theoretical analysis of the descriptors, discussing the evaluation The global approach considers the image color information
criteria used and the theoretical comparative tables. Section 4 pre- globally. As no partitioning or pre-processing stage is applied to
sents the experimental evaluation, showing the experimental mea- the image during features extraction, descriptors from this ap-
sures adopted, the implementation details of each descriptor, and proach usually have simple and fast algorithms for extracting fea-
the results achieved for the specific collections. Section 5 presents ture vectors. However, as no information about the spatial
the evaluation for the Web environment. In Section 6, we conclude distribution of colors is encoded, those descriptors can have little
the paper. discriminating power. Many descriptors from global approach gen-
erate histograms as feature vectors, like, for example, the global
color histogram [21] and the cumulative global color histogram [22].
The fixed-size regions approach divides an image into cells of
2. Descriptors
fixed size and extracts color information from each cell separately.
The descriptors from this approach encode more spatial informa-
Both the effectiveness and the efficiency of content-based im-
tion, at the cost of usually generating larger feature vectors. Exam-
age retrieval systems are very dependent on the image descriptors
ples of descriptors from fixed-size regions approach are local color
that are being used. The image descriptor is responsible for charac-
histogram [21] and cell/color histogram [23].
terizing the image properties and to compute their similarities. In
The segmentation-based approach divides an image in regions
other words, the image descriptor makes it possible to rank images
that may differ in size and quantity from one image to another.
according to their visual properties.
This division is usually made by a segmentation or clustering algo-
rithm, what introduces an extra complexity to the features extrac-
tion process. Another kind of segmentation is the classification of
2.1. Definition pixels before feature extraction. Descriptors from that approach
usually present better effectiveness, although they are often more
The image descriptor can be conceptually understood as complex. Examples of descriptors from segmentation-based ap-
responsible for quantifying how similar two images are. Formally, proach are color-based clustering [24] and dominant-color [25,26].
an image descriptor D can be defined as a pair (D, dD) [19], where Authors often do not give their methods neither a name nor an
D is a feature-extraction algorithm and dD is a function suitable to acronym. To refer to those methods less awkwardly through the
compare the feature vectors generated: text, we have taken the liberty of giving them a short descriptive
O.A.B. Penatti et al. / J. Vis. Commun. Image R. 23 (2012) 359–380 361
texture [27]. For example, texture elements can be modeled as: a which is an on-line process. The on-line part directly affects the re-
dark or a bright spot, an horizontal or vertical transition, corners, sponse time of the system. Therefore, it is important for the feature
lines, etc. Descriptors from that approach work well for regular extraction algorithm to be fast.
textures. The local binary pattern [34] descriptor is an example of Our complexity analysis is informed in asymptotic big-O nota-
model-based descriptor. tion, which ignores additive and multiplicative constants. As such,
Signal processing methods characterize textures by applying fil- it should be taken with a grain of salt, since two O(n) algorithms,
ters over the image. Both spatial domain and frequency domain fil- though theoretically similar in behavior, may vary considerably
ters can be used. Descriptors based on wavelets and Gabor filters in practice, because of the hidden constants.
follow that approach, like the homogeneous texture descriptor It will be seen that almost all extractors are linear, or at worse
[28,35], for instance. (when they work on the frequency domain), log-linear in the num-
We have addressed the issue of methods originally without a ber of image pixels. There are notably exceptions, though, for
name or acronym in the same way as we did for the color descrip- example: methods based on scale-spaces must take into account
tors (Section 2.3). the number of scales analyzed. The rare methods which perform
complex adaptive clustering are more expensive.
2.4.2. Previous comparisons on texture descriptors
Literature has some comparative studies on texture descriptors 3.1.2. Distance function complexity
[9,11,10,36]. A study compares a Fourier-based descriptor with a The distance function of a descriptor is very time consuming
Gabor-based descriptor [36] trying to identify the descriptor that when a query is being processed in a CBIR system. Distance func-
is the most robust to noise. The results show that the Fourier-based tions need to be fast, because during the search process, the query
descriptor has better performance in images with no noise and that image will be compared to a large number of candidate images.
the Gabor-based descriptor is better for noisy images. The distance function is also important for indexing issues. The
Another study compares the texture descriptors of MPEG-7 stan- use of indexing structures is important for systems with large dat-
dard [10]. Considering features extraction cost, the study shows that abases and, therefore, critical in a Web environment. Without
TBD [26,35] is the most expensive and that EHD [26] is the cheapest indexing structures the response time would be unacceptable.
one. Besides that, the study points out that HTD [35,28] captures glo- Indexing, however, imposes restrictions on distance functions, be-
bal information, while TBD captures global and local information cause simple, axis-monotone norm-based functions are much
and EHD captures only local information. The study also shows that friendlier to indexing structures than elaborated and unpredictable
TBD is not indicated for CBIR tasks, being more useful for image decision procedures. The interaction between descriptors and
browsing and for defects detection. Additionally, HTD is suggested indexing structures is, however, complex, depending on the statis-
for being used in CBIR and in texture segmentation tasks, while tical characteristics of the feature vector, the image dataset, and
EHD is recommended for CBIR tasks. The study also shows that the properties of the index employed. Prospective studies on that
HTD and TBD are less sensitive to noise while EHD is not recom- subject may be found in [40,41].
mended in environments with noise. The experiments realized by Again, in our theoretical analysis, the complexity is given
the study [10] show that TBD has low effectiveness and that HTD asymptotically. It will be seen that almost all methods are based
is not good for images with rotation. That last problem of HTD is no- on simple distances (Manhattan, Euclidean, etc.) which are linear
ticed in the literature, given the number of versions proposing HTD in the feature vector size.
with rotation invariance, like the descriptors HTDR [37], HTDI [38],
and Han Gabor [39] present in our study. 3.1.3. Storage requirements
The image descriptor stores into feature vectors the encoded
image properties. Each image managed by a CBIR system is associ-
3. Theoretical comparison
ated with one or more feature vectors. As a result, the storage
space required for feature vectors is proportional to the amount
The comparative study performed in this paper comprises two
of images in the database. In a Web scenario, in addition to the very
analyses. This section presents the first one: a theoretical evalua-
large database size, there is the issue of image heterogeneity, mak-
tion conducted for 24 color descriptors and for 28 texture descrip-
ing almost indispensable to employ several descriptor at once. As a
tors. The theoretical evaluation considers the asymptotic
consequence, the storage space required is also proportional to the
complexities of the descriptors algorithms. The next section pre-
array of descriptors employed.
sents the second analysis: an experimental evaluation for the most
Seldom the feature vector size for a method is fixed, more usu-
promising color and texture descriptors. In the following section,
ally it depends on a set of parameters which can be adapted from
color and texture descriptors are tested in a Web-like environment.
application to application. For color descriptors, for example, many
descriptors allow to customize how many colors are to be chosen
3.1. Evaluation criteria on a quantization step. Methods based on frequency-domain trans-
forms (Wavelets, Fourier, DCT) often have a choice of selecting how
Based on the main elements related to the search process in a many transform coefficients are considered. Many methods can be
content-based image retrieval system (as explained in Section 1), seen as multidimensional histograms, and, as such, grow as fast as
the following criteria are used to compare the descriptors: features the product of the number of bins in each dimension. Our analysis
extraction complexity, distance function complexity, storage considers those mentioned possible variations in vectors sizes.
requirements, effectiveness, and validation environment. Dimensionality reduction techniques may be employed in order
to alleviate the storage requirements of the larger descriptors.
3.1.1. Features extraction complexity However, often (but not always) they also impose a loss in effec-
The features extraction algorithm of a descriptor is used when- tiveness. There are many dimensionality reduction techniques,
ever the features from an image need to be extracted. The extrac- including the linear projection techniques like principal compo-
tion is required basically in two moments in a Web CBIR system. nent analysis (PCA) and linear discriminant analysis (LDA), and
The first one is when the images are being collected from the metric embedding techniques (both linear and non-linear). The
Web to be included in the local database, which is usually an off- interaction between dimensionality reduction and descriptor per-
line process. The second moment is when a query is performed, formance are, however, far from trivial and outside the scope of
O.A.B. Penatti et al. / J. Vis. Commun. Image R. 23 (2012) 359–380 363
Table 1
Color descriptors: reference, short description, and complexity analysis.
Fixed-size Sun et al. [53] CDE⁄ Spatial information O(n) O(vs) 2 NC values
regions of colors, using
entropy
Stehling et al. CCH⁄ For each color in O(n) O(vs) 0.45 NC NB or average case or worst
[23] image, histogram NC NB values case, respectively
representing its
distribution among
the cells
Li [45] SBS⁄ Color histograms for O(n) O(k m NC) NB or NC NB value or histogram for
each block in an values each block; k = NB in query
image combined by image, m = NB in database
weighting according image
to their position
Manjunath CLD⁄ Uses DCT on average O(nlogn) O(vs) 3 NDCT values NDCT = number of DCT
et al. [26] – colors of an image coefficients
(Section E) divided by a grid
Table 1 (continued)
Segmentation Yang et al. DCDI DCD (see below) O(n) + O(NDq NDi) 4 ND values Linear Block Algorithm
based [43] with a new color (LBA) has unknown
clustering algorithm complexity, but lower than
GLA; NDq = ND in query
image, NDi = ND in database
image
Wong et al. DCSD⁄ Combination of DCD O(n) + notec O(vs) 3 ND values
[42] (see below) and CSD
(see above)
Lee et al. [54] Lee SC Color spatial O(n) O(NC) 2 NC + NBord NBord = number of border
information with pixels
color adjacency
histogram and color
vector angle
histogram
Stehling et al. BIC⁄ Two color O(n) O(vs) 2 NC values
[55] histograms in RGB
space, one for border
pixels, one for
interior pixels
Manjunath DCD⁄ Dominant colors O(n) + notec O(NDq NDi) 4 ND values NDq = ND in query image,
et al. [26] – (most representative NDi = ND in database image
(Section D) colors), as well as
spatial coherence
and color variances
Stehling et al. CBC⁄ Characterization for O(nlogn) O(NRq NRi log 6 NR values NRq = NR in query image,
[24] each region of an (NRq NRi)) NRi = NR in database image
image, which are
determined by color
clustering
Pass et al. [56] CCV⁄ Two color O(n) O(vs) 2 NC values
histograms in RGB
space, one for large
regions, one for
small regions
a
Names originally given by the authors are marked with ⁄.
b
n = number of pixels in an image, vs = vector size, NC = number of colors in quantized space, ND = number of dominant colors, NDist = number of distances, NB = number of
blocks or cells, NR = number of regions.
c
Employs the Generalized Lloyd Algorithm (GLA), commonly used in the k-means clustering algorithm, whose complexity is currently unknown.
this work. Here we are interested in evaluating the ‘‘raw’’ behavior considered the ground truth and creates the possibility for an auto-
of the descriptors, with minimum external influences. matic effectiveness evaluation. A second way to evaluate descrip-
tors is conducted over databases with no fixed ground truth. In
3.1.4. Effectiveness those cases, the system performs a pool of queries and selected
Effectiveness stands for the capacity of a descriptor to retrieve users judge how well the system fared according to their own sub-
relevant images. An effective descriptor puts together images jective criteria. That procedure introduces a source of variability,
which are related in the users’ viewpoint, so that during retrieval but has the advantage of evaluating the system according to the
relevant images are ranked first. The success of an image retrie- opinion of potential users.
val system is, of course, directly dependent on the quality of its Once the matter of annotation is settled, there remains the
results, an thus, on the effectiveness of the descriptors. A user choice of which metric to use to summarize effectiveness over
might tolerate occasional delays, but not systematically wrong the set of queries (and occasionally users) employed. The most tra-
answers. ditional choice are the measures of Precision and Recall, often in
combination or as a function of each other (in a Precision Recall
3.1.5. Measuring effectiveness graph, for example). Precision measures the fraction of relevant
Effectiveness poses an issue: it depends upon the agreement images in proportion to the answer set, while Recall measures
about which answers are considered relevant for queries. Reaching the fraction of relevant images retrieved in the answer set in pro-
that agreement may be very difficult, since the concept of relevant portion to all relevant images existing in the database. A perfect
is dependent on each user mental model of the query. system would have a Precision of 1 (all images in the answer set
In practice, experimenters have two choices. First, to use a cat- are relevant) and a Recall also of 1 (all relevant images in the
egorized image database, where the database categorization is database are retrieved in the answer set). In practice there is a
O.A.B. Penatti et al. / J. Vis. Commun. Image R. 23 (2012) 359–380 365
Table 2
Color descriptors: validation environment.
compromise between the two measures: Recall is related to the Measures that incorporate the concept of ranking ab initio have
concept of true positive rate, of the theory of receiver operating also been proposed, notably for the evaluation of the descriptors on
characteristic (ROC), while Precision is related to 1-false positive the MPEG-7 (the Rank itself, AVR, MRR, NMRR, ANMRR, etc. [26]).
rate. Because of the compromise between Precision and Recall, As it will be seen, there is less consensus on the literature on those
sometimes a combination of the two is employed, as a single metric. measures and more tendency of authors on making ad hoc
This is the case of the mAP (mean average precision), F1-score, etc. adjustments each time they are used, instead of keeping them
The problem of Precision and Recall (and their combined mea- consistent.
sures) is that they tend to ignore the ranking of the images. For
example a Precision of 0.5 indicates that half the images in the an-
swer set are correct without saying if it is the first half or the sec- 3.1.6. Validation environment
ond half. In order to count that, those measures can be taken at The validation environment comprised, in addition to the cho-
specific points, considering answer sets of (usually small) growing sen metrics of efficiency and effectiveness, the image databases
sizes. Comparing the Precision at an answer set of 10, 20, and 30 in which the descriptor was tested, and, most importantly, other
images (called P10, P20, and P30, respectively), one can have a better descriptors to which it was compared. Those aspects reveal, for
idea of how the correct images are ranked. Usually, one is inter- example, if the descriptor was adequately assessed.
ested only in the top 10 or 20 answers, corresponding to the first It will be seen that there is little standardization in the valida-
page of answers, since it is observed in real-world search engines tion environment, specially for color descriptors. Several different
that users seldom take the effort of checking the second page, in- evaluation measures and image databases are employed, making
stead preferring to reformulate the query. Corresponding measures it difficult to make a meta-analysis. For texture descriptors, valida-
for the Recall (R10, R20, etc.) also exist. tion is commonly based on classification experimental designs,
366 O.A.B. Penatti et al. / J. Vis. Commun. Image R. 23 (2012) 359–380
Table 3
Texture descriptors: reference, short description, and complexity analysis.
Table 3 (continued)
instead of retrieval ones. In general, it will be seen little interest in [24], that have more complex extraction algorithms. It is observed
measuring efficiency. that many of the more complex descriptors are based on segmen-
tation techniques.
3.2. Analysis for color descriptors Considering the distance functions complexity, it is observed
that the great majority of descriptors is linear to the feature vector
Table 1 presents the asymptotic complexities of features extrac- size. Notwithstanding, descriptors like DCDI [43], DCD [26], SBS
tion algorithms and distance functions, the feature vectors size, [45], and CBC [24] are more complex for distance computations.
and the taxonomy class of each descriptor. Descriptors are sorted, For the storage requirements criterium, we highlight the
inside each taxonomy class, in reverse chronological order. descriptors CW-LUV [46] and CW-HSV [47] because of the compact
The great majority of color descriptors extracts the feature vec- feature vectors they generate.
tor in linear time on the number of pixels. However, there are some Table 2 shows the validation environment of each descriptor.
descriptors, like DCSD [42], DCDI [43], DCD [26], WC [44], and CBC Many of the color descriptors are validated using simple image
368 O.A.B. Penatti et al. / J. Vis. Commun. Image R. 23 (2012) 359–380
Table 4
Texture descriptors: validation environment.
databases and the databases are usually small, rarely having more 4.1. Selected descriptors
than 10,000 images. One of the image databases commonly used is
Corel, in whole or in parts, but it is often used in non-standard sub- In this section we present the descriptors selected for the exper-
sets. Table 2 also shows that some descriptors are not compared imental evaluation, justifying each choice. The implementation de-
with other descriptors. The global color histogram (GCH) comes clo- tails are shown in Sections 4.3 and 4.4 for color and texture
ser to a standard method again which most other methods tend to descriptors, respectively.
compare, but this is not an absolute rule – some methods for exam-
ple, compare only with versions of themselves. 4.1.1. Color descriptors
The most common evaluation measures are Precision, Recall, Global color histogram descriptor (GCH) [21], cumulative global
and ANMRR. There is seemingly little interest in measuring effi- color histogram (CGCH) [22], local color histogram descriptor
ciency: retrieval and extraction times are rarely offered, and no for- (LCH) [21], and color coherence vectors (CCV) [56] are popular
mal comparison of the complexities is ever considered. descriptors from literature, usually used as baseline for compari-
sons. LCH is a traditional descriptor based on fixed-size regions
and CCV is segmentation-based. GCH, CGCH, and LCH are all very
3.3. Analysis for texture descriptors
simple descriptors.
Descriptors based on correlograms are promising because they
Table 3 presents the results of the theoretical analysis of texture
encode spatial information, as the traditional color autocorrelo-
descriptors in terms of algorithms complexity for features extrac-
gram descriptor (ACC) [51] and the more elaborated joint autocor-
tion and distance computation. Table 3 also shows the feature vec-
relogram descriptor (JAC) [48]. Spatial information is usually lost
tors sizes and presents the taxonomy class of each texture
when using simple color histograms.
descriptor. Descriptors are sorted in reverse chronological order in-
The border/interior pixel classification descriptor (BIC) [55] and
side each taxonomy class.
color-based clustering descriptor (CBC) [24] are descriptors based
From Table 3, it is noticed that several descriptors present fea-
on segmentation. CBC has a complex extraction algorithm while
tures extraction complexity equal to O(nlogn). Most of these more
BIC has a very simple one. BIC has also presented promising results
complex descriptors are based on image filtering algorithms, using
in heterogeneous collections [55].
Gabor filters in many cases.
The Color Bitmap descriptor [52] was selected because it ana-
Considering the complexities of distance functions, it is pre-
lyzes image color properties globally and by using fixed-size re-
dominant the use of linear distance functions, which shows that
gions. Color structure descriptor (CSD) [26] was selected because,
the most laborious step related to texture description is the fea-
according to previous comparisons among the MPEG-7 color
tures extraction phase.
descriptors [13,14], it achieves the best performance.
From the analysis of feature vectors sizes, we highlight the com-
CW-HSV [47] and CW-LUV [46] descriptors are very simple and
pact feature vectors generated by the descriptors TBDI [57], TBD
generate very compact feature vectors. The chromaticity moments
[26,35], EHD [26], and LBP [34].
descriptor (CM) [50] is very simple and generates compact feature
Table 4 shows the validation environments of each texture
vectors.
descriptor. Considering the databases used for validation, there is
a predominance in the use of Brodatz database [58], which indi- 4.1.2. Texture descriptors
cates a certain standardization in the validation of texture descrip- The local binary pattern descriptor (LBP) [34] is very popular in
tors. However, some works use only part of the Brodatz database the literature and has simple algorithms presenting also good
instead of using the complete collection. It is also noteworthy that effectiveness and invariance to rotations and gray-scale variation.
because of the nature of the Brodatz textures, many articles use The homogeneous texture descriptor (HTD) [35,28] is a traditional
classification experimental designs instead of retrieval ones. Few descriptor from MPEG-7 standard.
descriptors are validated in heterogeneous databases, showing The Steerable Pyramid Decomposition descriptor with scale and
the lack of validation of texture descriptors for general image re- rotation invariance (M-SPyd) [71] is used in our experimental com-
trieval tasks. parison due to its good results [71] and its invariance to scale and
Considering the descriptors used as baselines for comparisons, rotation. The statistical analysis of structural information descrip-
there is a clear predominance in the use of descriptors based on tor (SASI) [62] was chosen because it presented better results than
Gabor filters. There are also descriptors that are compared with descriptors based on Gabor filters, as reported in [62].
no other descriptor or only compared with variations of The color co-occurrence matrix (CCOM) [66] was chosen due to
themselves. the popularity of the co-occurrence matrix for analyzing textures.
The evaluation measures most used are related to average re- It also aggregates color information to the original gray-level co-
trieval rate. There are some works worried about extraction and re- occurrence matrix (GCOM) [32]. Unser descriptor [67] was chosen
trieval times, but they are still rare. due to its ability to generate more compact feature vectors, to have
lower complexity and to present effectiveness similar to the GCOM
4. Experimental comparison in small datasets descriptor.
The quantized compound change histogram (QCCH) [60] was
Given the large amount of descriptors selected for theoretical chosen due to its simple extraction algorithm, its fast distance func-
evaluation, we have to choose the most promising descriptors for tion and its compact feature vector. The main reason for choosing
the experimental comparison. In order to contrast with the large- local activity spectrum descriptor (LAS) [65] was its simplicity for
scale Web-like environment, experiments were performed aiming both feature vectors extraction and distance computations.
to evaluate the descriptors’ performance in a controlled small-scale
4.2. Evaluation measures
scenario.
All those experiments were performed in the Eva tool,1 which 4.2.1. Efficiency in features extraction and distance computations
creates a standardized environment for the evaluation of the image The times required for features extraction and distance compu-
descriptors. Details of the Eva tool can be found in [79]. tations of each descriptor were measured in seconds, milliseconds or
microseconds. Time measures were taken by the Eva tool [79],
1
http://www.ic.unicamp.br/penatti/eva/ (as of September 15, 2011). which measures the average and standard deviation value for each
370 O.A.B. Penatti et al. / J. Vis. Commun. Image R. 23 (2012) 359–380
Table 5
Color descriptors implementation details.
Fig. 4. Precision Recall curves for color descriptors in ETH database. The zoomed region represents the high-precision/low-recall zone, which is most important for a Web
environment. (The graph is best viewed in color.)
Table 7
Texture descriptors implementation details.
3,280 full color images categorized in 8 classes (apples, pears, toma- As observed, CBC is the most time-consuming descriptor for fea-
toes, cows, dogs, horses, cups, and cars), each class containing 410 tures extraction. In the theoretical evaluation, CBC is the most
images. Each image is composed by one object in the center over a complex descriptor for features extraction, having complexity
homogeneous background. The objects appear in different positions O(nlogn), while all the other 12 descriptors are O(n). Although all
and point of views. other descriptors are O(n), there still were large differences in their
In the following sections, the results of the experimental evalua- actual time measurements.
tion with the color descriptors are presented. The relative values are In distance computations. Table 6(b) shows the average time for
computed using global color histogram (GCH) as the reference one distance computation and the standard deviation for each
descriptor, because GCH is one of the most popular descriptors from descriptor, sorted by average time. The times relative to GCH are
the literature and it is also often used as baseline for comparisons. also shown. The values were obtained from 32,275,200 distance
computations for each descriptor (3,280 3,280 3).
4.3.1. Efficiency The descriptors BIC, CCV, CSD, ACC, LCH, and JAC are consider-
In features extraction. Table 6(a) shows the average time for ably slower than GCH descriptor. On the other hand, CM, CW-
extracting one feature vector, as well as the standard deviation, HSV, CW-LUV, and Color Bitmap are considerably faster than
for each descriptor, sorted by average time. Table 6(a) also shows GCH descriptor for distance computations.
the relative times of each descriptor in relation to GCH descriptor. According to the theoretical analysis, CBC was the only descrip-
The values were obtained from a total of 9,840 features extractions tor with distance function more complex than O(vs). That higher
for each descriptor (3 times for each of the 3,280 images). complexity is observed in the times showed in Table 6(b) where
Only CCV, CSD, CW-LUV, JAC, and CBC are significantly slower CBC was the slowest descriptor. The differences in the times con-
than GCH for features extraction. None of the tested color descrip- sidering the other descriptors that are O(vs) were basically due to
tors is significantly faster than GCH. the feature vector sizes.
372 O.A.B. Penatti et al. / J. Vis. Commun. Image R. 23 (2012) 359–380
4.4.1. Efficiency
4.4. Results for texture descriptors In features extraction. Table 8(a) shows the average times for
extracting one feature vector and the standard deviation values
The 8 texture descriptors showed in Section 4.1.2 were imple- for each descriptor, ordered by average time. The values were com-
mented according to the details showed in Table 7. puted from 5,328 features extractions (3 times for each of the
Given our application scenario, we had to adjust some descrip- 1,776 images). Relative times are also shown.
tors implementations. Our implementation of LBP uses values 1 for All descriptors are significantly slower than LBP for features
the parameter R (radio) and 8 for P (neighbors). Also, we use L1 dis- extraction. As observed, the descriptors M-SPyd, SASI, and HTD
tance instead of the original proposed for LBP, which is non-sym- are considerably slower than the others. As presented in the theo-
metric. The original M-SPyd descriptor, has invariances to scale retical comparison in Section 3.3, those three descriptors have
and rotation independently. In the implemented version, however, higher complexity than the other five descriptors compared here.
both invariances are computed at the same time. In other words, That high complexity resulted in their larger times for features
the feature vector generated by our implementation of M-SPyd is extraction. HTD is very time-consuming when extracting feature
invariant to scale and rotation at the same time. HTD, M-SPyd vectors, being yet more than 10 times slower than SASI.
and SASI had database dependent normalization steps which were The five descriptors with complexity O(n) presented little vari-
forgone, as they are incompatible with the highly dynamic Web- ation among themselves.
like scenario intended. Our implementation of LAS descriptor In distance computations. The average time and standard devia-
quantizes components non-uniformly, to avoid generating histo- tion for one distance computation by each descriptor are shown in
grams with an overwhelming majority of values in the first bin. Table 8(b), ordered by average time. The times relative to LBP are
When the descriptor proposed no distance function (which was also shown. The values were obtained from 9,462,528 distance
the case for Unser), we used L1. computations (1,776 1,776 3).
The texture descriptors were tested on Brodatz database. Bro- The difference in times relative to LBP is considerable only for
datz [58] is one of the most popular image databases for the eval- Unser and CCOM descriptors. Unser is faster than LBP and CCOM
uation of texture descriptors. As observed in Section 3.3, Brodatz is slower.
collection was the most widely used in the evaluation of the tex- Although there exist some difference in the times measured for
ture descriptors present on this work. Brodatz contains 111 differ- each descriptor, it is small. The theoretical analysis showed that
ent textures. As done in the great majority of papers, each texture those 8 descriptors have distance function with linear complexity.
is divided into fixed-size cells. Our experiments divided each tex- That fact is observed in the times measured, as they are similar
ture in 16 blocks, totalizing 1,776 images. The resulting database among the descriptors. The small differences are related to the fea-
is composed by 111 categories having 16 images each. ture vector sizes.
O.A.B. Penatti et al. / J. Vis. Commun. Image R. 23 (2012) 359–380 373
In feature vectors storage. The feature vector sizes of each descrip- images, but no semantical labeling. From now on, we refer to this
tor are presented in Table 8(c). The sizes were computed based on database as WebSample.
the parameters given in Table 7 as explained in Section 4.2.2. CCOM As we did for the previous experiments, efficiency and effective-
descriptor generates variable-size feature vectors, thus the size ness measures were computed for each descriptor. The reference
showed in Table 8(c) is the average size for Brodatz database. descriptor is global color histogram (GCH).
The biggest difference in feature vectors sizes is for LAS descrip-
tor. Although it generates feature vectors more than 25 times 5.1. Efficiency
larger than LBP’s feature vectors, the size is equal to ACC color
descriptor, which is a feasible size in many situations. For the efficiency measures, a subset of 500 images was ran-
The differences in feature vectors sizes become evident when domly selected from the original WebSample database.
we consider a Web environment. Considering a database with In features extraction. Table 9(a) presents the average time for
109 images, LBP descriptor would require nearly 37 GB for storing extracting one feature vector by each descriptor, as well as the
feature vectors, while HTD would use 179 GB, SASI 238 GB, and LAS standard deviation and the relative times. The values were com-
954 GB, for example. puted from 1,500 features extractions for each descriptor (3 times
for each of the 500 images).
4.4.2. Effectiveness WebSample has, in general, larger images and more image size
For the effectiveness evaluation, all images from Brodatz collec- variability than the dedicated databases used earlier. That is re-
tion were used as query images. Fig. 5 shows the average values flected in the larger averages and standard deviations reported in
among all queries performed. the extraction times.
LAS and SASI descriptors are clearly superior in terms of effec- No descriptor was considerably faster or slower than the others,
tiveness than the other descriptors. Considering small Recall val- although they present large differences in the average times. For
ues, the three best descriptors are LAS, SASI, and CCOM. example, the texture descriptors M-SPyd, SASI, and HTD are very
slow to process one image. HTD specially takes almost 20 s, on
average, to extract one feature vector. CBC descriptor is also slow,
5. Evaluation in a Web-like environment taking almost 1 s to process one image, on average. Considering a
Web search engine, M-SPyd, SASI, and HTD face considerable chal-
All the 13 color and 8 texture descriptors were evaluated in a lenges in order to be employed.
Web-like environment. That environment is mainly characterized In distance computations. Table 9(b) shows the average times
by a huge image database with highly heterogeneous content and standard deviation values for one distance computation, or-
and with no ‘‘fixed’’ or ‘‘final’’ ground-truth giving an authoritative dered by average time. The values were obtained from 750,000 dis-
classification of the images. tance computations (500 500 3) for each descriptor.
The database used was collected by researchers from Federal The times observed were similar to the small-scale experi-
University of Amazonas (UFAM), Brazil, with the objective to create ments on the previous section because most of the descriptors
a collection with representative data from the Web. The data gath- generate fixed-size feature vectors. CBC descriptor, however, gen-
ering started recursively from the Yahoo directory and generated a erates variable-size feature vectors and their distance times were
database with 234,143 images (excluding icons and banners) and more than 10 times larger in WebSample than in ETH database. To
1.2 million HTML documents. The database contains all kinds of compare one pair of feature vectors, CBC takes, on average, more
Fig. 5. Precision Recall for texture descriptors in Brodatz database. The zoomed region represents the high-precision/low-recall zone, which is most important for a Web
environment. (The graph is best viewed in color.)
374 O.A.B. Penatti et al. / J. Vis. Commun. Image R. 23 (2012) 359–380
Table 10
Query images, their identification numbers, and the sizes of their respective pool of relevants.
Table 11 for one query, while BIC, LCH, and CCOM reached P10 of 90% in
Average P1, P5, P10, R1, R5, and R10 values over all the query images for WebSample one case.
database, sorted by P10. Texture descriptors appear in italic. The bold values show the To illustrate the effectiveness of the descriptors in some queries,
sorting criterium.
we have selected four queries, corresponding to the two easiest
Descriptor P1 P5 P10 R1 R5 R10 queries, and two cases that illustrate well the problem of semantic
JAC 0.667 0.433 0.287 0.036 0.118 0.146 gap. The worst descriptors in terms of effectiveness (CGCH, CM,
BIC 0.567 0.367 0.287 0.029 0.085 0.141 Color Bitmap, CW-LUV, LBP, M-SPyd, QCCH, and Unser) were elim-
ACC 0.500 0.367 0.267 0.025 0.092 0.138 inated from the following analysis since their precision was zero or
LCH 0.467 0.320 0.200 0.028 0.092 0.109
close to zero for almost all queries considered.
CCOM 0.367 0.227 0.167 0.017 0.055 0.077
CCV 0.333 0.180 0.113 0.013 0.042 0.053 Table 12 shows the precision and recall values for the two eas-
CSD 0.233 0.153 0.103 0.012 0.041 0.054 iest queries. Despite that, only 6 descriptors had a top-10 precision
HTD 0.233 0.147 0.093 0.011 0.031 0.038 above 50%, showing the challenge of the task. Query 11 is very ste-
GCH 0.367 0.113 0.090 0.020 0.031 0.049 reotypical, and query 29 has a large pool of relevant answers,
SASI 0.233 0.120 0.080 0.011 0.034 0.040
which are both factors that help to ease the task. However, we have
CW-HSV 0.167 0.107 0.070 0.010 0.023 0.030
LAS 0.200 0.073 0.053 0.010 0.021 0.028 observed that those are neither necessary nor sufficient conditions
CBC 0.100 0.073 0.043 0.007 0.024 0.030 to explain query difficulty.
LBP 0.133 0.067 0.040 0.005 0.015 0.016 Table 13 and Fig. 7 shows the results for two queries that illus-
Color Bitmap 0.067 0.053 0.033 0.003 0.012 0.014
trate well the semantic gap phenomenon.3
CW-LUV 0.133 0.060 0.033 0.009 0.017 0.019
CGCH 0.100 0.033 0.023 0.005 0.007 0.010 In general, color descriptors performed better than texture
QCCH 0.167 0.040 0.023 0.008 0.010 0.010 descriptors, but there was a large variability among queries. As
M-SPyd 0.067 0.027 0.020 0.004 0.006 0.009 we hinted above, the most common explanations (size of the pool
Unser 0.033 0.013 0.013 0.002 0.002 0.005 of relevants, simplicity of the image) were not sufficient to explain
CM 0.033 0.007 0.003 0.002 0.002 0.002
that variability. We have observed that the specific choice of the
query image may have an important effect on the user interpreta-
tion of its contents, which poses a challenge for query-by-example
systems based on a single query, and reinforces the importance of
query refinement/user feedback.
The precision and recall values indicate that the best overall
effectiveness is achieved by descriptors JAC, BIC, and ACC, which
have reached more than 26% of P10 and more than 13% of R10, in 5.3. Comparing effectiveness in small and large-scale experiments
the average case. The three are all color descriptors. The best
texture descriptor is CCOM with average P10 over 16%. Local col- In general, it is remarkable the effect of heterogeneity in the
or histogram (LCH) also has average P10 of 20%. descriptors’ effectiveness. To evaluate that effect, we have com-
Fig. 6 shows that many descriptors present very poor effec- puted the correlation between the effectiveness measures in the
tiveness, having P10 values close to zero for most of the queries. small-scale experiments and the results in the WebSample data-
This was the case for descriptors CGCH, CM, Color Bitmap, CW- base. Fig. 8 shows the correlation between the rank of the descrip-
LUV, LBP, M-SPyd, QCCH, and Unser. We can observe that there tors in each scenario and plots the linear regression model for
is a large variation in the behavior of most descriptors, having them. The rank is based on the mAP for the small-scale and the
queries with both very low and very high precisions. P10 for the large-scale experiment. For both color and texture
ACC, BIC, and JAC, which present the best average precisions,
also have similar medians in Fig. 6, except that BIC has more que- 3
We have provided an interface for the visualization of all queries results: <http://
ries with higher P10 values. For example, JAC reached P10 of 100% www.recod.ic.unicamp.br/eva/yahoo230k> (as of September 15, 2011).
376 O.A.B. Penatti et al. / J. Vis. Commun. Image R. 23 (2012) 359–380
Table 13
Average P10 and R10 values for queries 5 and 15, sorted by P10.
Texture descriptors appear in italic.
Fig. 6. Box plot with each descriptor P10 values on all 30 queries. Values above or few texture descriptors designed with heterogeneous database in
below 1.5 the interquartile range were considered outliers and indicated as small
mind. Accordingly, it has the best relative performance on the
circles. Color descriptors appear in light gray while texture ones appear in the right
in dark gray. Descriptors are sorted alphabetically. large-scale experiment, though its performance on the Brodatz
dataset is unremarkable.
Fig. 7. Results for query 5 and 15 showing the semantic gap phenomenon. The query image is the first in the top-left position of the grid. In (a), there are many images having
people, but only one of them with people graduating. In (b), there are some images very similar in color properties, but not necessarily in semantics. Images retrieved by SASI
and BIC, respectively.
14 9
ColorBitmap
13 SASI
BIC 8
12
11 7 LAS
ACC JAC
10
CSD 6 HTD
9
rank(mAP)
rank(mAP)
CBC QCCH
8 5
7 CCV
LCH 4 CCOM
6
CGCH
5 3 LBP
4
CW-LUV GCH 2
3
M-SPyd
2
CW-HSV 1
1 Unser R=0.6429
CM R=0.7951
0 0
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 0 1 2 3 4 5 6 7 8 9
rank(P10) rank(P10)
(a) (b)
Fig. 8. Correlation of effectiveness results of (a) color and (b) texture descriptors between small-scale experiments and the experiments in the Web scenario. The correlation
is based on the rank of the descriptors in each experiment. The correlation coefficient and the regression for the data are also shown. For the color descriptors regression, Color
Bitmap was considered an outlier. Note that the performance in small and large-scale experiments has significant correlation, but there are important deviations for both
color and texture descriptors.
Table 14 The effectiveness results have shown a real impact when chang-
Correlation coefficients between each pair of the five best descriptors considering the ing the scenario from specific to broad. Overall, descriptors
P10 values over the queries in WebSample database. Texture descriptors appear in performance fall considerably, making many of them not adequate
italic.
for the Web environment. Such decrease may come from both
ACC BIC JAC LCH CCOM the size and heterogeneity of the database, which aggravate the
ACC – – – – – semantic gap in images, since their interpretation by users in an
BIC 0.801 – – – – open setting is more subtle than in a specific context.
JAC 0.578 0.670 – – – We have noticed the influence of the image background in the
LCH 0.661 0.478 0.610 – –
descriptors’ effectiveness. Heterogeneous backgrounds, with colors
CCOM 0.659 0.712 0.622 0.431 –
and textures that mix up with the image main object, affected the
values generated in the feature vectors. While the human vision eas-
ily discerns the background and the main object, the extraction
it was noticed an increase in extraction times mainly due to the algorithms usually do not distinguish what is in evidence in an im-
larger dimensions of WebSample images. Some descriptors like age. In query images with homogeneous background, the feature
M-SPyd, SASI, HTD, and CBC face challenges in a Web search en- vectors generated were more representative for the image’s main
gine, because of their large extraction times. object.
Considering the times to compute distances, changing the sce- Considering all the experimental results, we could point five
nario caused very few impact for most of the descriptors, because good descriptors: JAC, BIC, ACC, LCH, and CCOM. However, consid-
most of them generate fixed-size feature vectors. For the descrip- ering the Web scenario, JAC would face the challenge of the effi-
tors whose vectors sizes depend on the complexity of the input im- ciency problems presented. LCH and ACC are slower than BIC for
age, some of them have suffered an increase in distance times more distance computations while ACC and CCOM generate larger fea-
than others. Descriptors like CBC and JAC show times challenging ture vectors than BIC. LCH generates larger vectors than BIC, CCOM,
for a Web scenario. and ACC. BIC presented better effectiveness. Therefore, BIC is the
378 O.A.B. Penatti et al. / J. Vis. Commun. Image R. 23 (2012) 359–380
−1.0 −0.5 0.0 0.5 1.0 cal comparison is conducted over 24 color and 28 texture descrip-
tors, including traditional and recently proposed descriptors. The
11 theoretical evaluation shows that most of the descriptors have
0.4
1.0
27
O(n) complexity for features extraction and have also linear com-
8
plexity for distance computations. The most promising descriptors
LCH
25 in the theoretical analysis were implemented and evaluated exper-
20
0.2
0.5
JAC imentally. In the experiments it was possible to measure the effi-
2nd component (11% of variability)
0.0
5 3 12 descriptors with high theoretical complexity presented poor effi-
29 28 24
ACC 21
14 ciency performance in the experiments.
16 17 4
CCOM 19
Experiments were conducted over specific and heterogeneous
9
−0.5
BIC
−0.2
−1.0
−0.4
[65] B. Tao, B.W. Dickinson, Texture recognition and image retrieval using gradient [73] Q. Wang, D.D. Feng, Z. Chi, B-spline over-complete wavelet based fractal
indexing, Journal of Visual Communication and Image Representation 11 (3) signature analysis for texture image retrieval, in: International Symposium on
(2000) 327–342. Intelligent Multimedia, Video and Speech Processing, 2004, pp. 462–466.
[66] V. Kovalev, S. Volmer, Color co-occurrence descriptors for querying-by-example, [74] M. Kokare, B.N. Chatterji, P.K. Biswas, Cosine-modulated wavelet based texture
in: International Conference on MultiMedia Modeling, 1998, pp. 32-38. features for content-based image retrieval, Pattern Recognition Letters 25 (4)
[67] M. Unser, Sum and difference histograms for texture classification, IEEE (2004) 391–398.
Transactions on Pattern Analysis and Machine Intelligence 8 (1) (1986) 118–125. [75] D. Sim, H. Kim, R. Park, Invariant texture retrieval using modified Zernike
[68] V. Takala, T. Ahonen, M. Pietikäinen, Block-based methods for image retrieval moments, Image and Vision Computing 22 (4) (2004) 331–342.
using local binary patterns, in: Scandinavian Conference on Image Analysis, [76] X. Yang, J. Liu, Maximum entropy random fields for texture analysis, Pattern
2005, pp. 882–891. Recognition Letters 23 (1–3) (2002) 93–101.
[69] P. Janney, Z. Yu, Invariant features of local textures – a rotation invariant local [77] D. Sim, H. Kim, R. Park, Fast texture description and retrieval of DCT-based
texture descriptor, in: IEEE Conference on Computer Vision and Pattern compressed images, Electronics Letters 37 (1) (2001) 18–19.
Recognition, 2007, pp. 1–7. [78] Y. Rubner, C. Tomasi, Texture-based image retrieval without segmentation, in:
[70] J.A.M. Zegarra, J. Beeck, N.J. Leite, R. da S. Torres, A.X. Falcão, Combining global IEEE International Conference on Computer Vision, vol. 2, 1999, p. 1018.
with local texture information for image retrieval applications, in: [79] O.A.B. Penatti, R. da S. Torres, Eva – an evaluation tool for comparing
International Symposium on Multimedia, vol. 0, 2008, pp. 148–153. descriptors in content-based image retrieval tasks, in: ACM International
[71] J.A.M. Zegarra, N.J. Leite, R. da S. Torres, Rotation-invariant and scale-invariant Conference on Multimedia Information Retrieval, 2010, pp. 413–416.
steerable pyramid decomposition for texture image retrieval, in: Brazilian [80] P.A.S. Kimura, J.M.B. Cavalcanti, P.C. Saraiva, R. da S. Torres, M.A. Gonçalves,
Symposium on Computer Graphics and Image Processing, 2007, pp. 121–128. Evaluating retrieval effectiveness of descriptors for searching in large image
[72] P.W. Huang, S.K. Dai, P.L. Lin, Texture image retrieval and image segmentation databases, Journal of Information and Data Management 2 (3) (2011) 305–320.
using composite sub-band gradient vectors, Journal of Visual Communication
and Image Representation 17 (5) (2006) 947–957.