0% found this document useful (0 votes)

31 views10 pages

Behavioral Segmentation For Supermarket Customers Using Unsupervised Machine Learning Algorithms

Uploaded by

zemene yeshambel

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

31 views10 pages

Behavioral Segmentation For Supermarket Customers Using Unsupervised Machine Learning Algorithms

Uploaded by

zemene yeshambel

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

Proceedings of the International Conference on Industrial Engineering and Operations Management

Monterrey, Mexico, November 3-5, 2021

Behavioral Segmentation for Supermarket Customers

Using Unsupervised Machine Learning Algorithms
Carlos Hernández
Departamento de Procesos Industriales
Universidad Católica de Temuco
Temuco, Chile
[email protected]

Galo Paiva
Departamento de Ingeniería Industrial y Sistemas
Universidad de La Frontera
Temuco, Chile
[email protected]

Abstract
Clustering is a machine learning technique to analyze data and to discover groups that share some similarities or
closeness. It is useful for marketing segmentations because it allows classifying customers into groups based on
certain characteristics. In literature, the most commonly studies segmentation types are: geographic, demographic,
psychographic, behavioristic, volume, product-space, and benefit segmentation. This research is focused on
behavioristic segmentations for supermarket chains.
Behavior patterns are the core of the behavioristic segmentation. It considers customers’ attitude toward brands, the
knowledge of brands, purchasing habits and frequency. The segmentation related to loyalty is crucial to identify
loyal customers and to focus marketing strategies and tactics. The revenue depends greatly on that segment.
The research has been carried in four stages: analysis, design, development, and discussion. During the analysis,
1.073 customer loyalty surveys are preprocessed and analyzed. Two algorithms are employed during the
investigation: Simple K-Means Algorithm (SKMA) and Expectation-Maximization Clustering (EMC).
While SKMA cluster sizes are 27%, 15%, 15%, 26%, and 17%, EMC cluster sizes are 2%, 29%, 24%, 32%, and
13%. In conclusion, both SKMA and EMC help segmenting supermarket customers based on their behavior.
However, behavioral segmentation requires a deeper analysis since the cluster boundaries are not evident.

Keywords
Behavioristic Market Segmentation, Customer Loyalty, Machine Learning, Clustering Analysis, Supermarket Chains

1. Introduction
Market segmentation is one of the most relevant part of the marketing strategy. It was first introduced by W. Smith
in 1056 and it has been deeply analyzed ever since. The objective of segmentation is to find a set of variable which
allow identify homogeneous groups within a heterogeneous market to help focus the marketing strategies and
tactics. To accomplish it, a variety of models and techniques have been developed and used thru the years, from
simple statistical models to algorithms based on Artificial Intelligence (AI) (Mckechnie, 2006). Nowadays, the
availability of new technologies makes possible to access customer data to be analyzed quickly. In the retail
industry, for example, point of sales (POS) already allow apply data mining techniques and AI models.
Customer segmentation can be carried out following different criteria. For instance, it can be based on
demographics (age, sex, income, occupation, social class, stage of life, Internet access and use), geographical
(country, region, city, rural, density), behavioral (frequency of purchase, loyalty, where you buy, quantity
purchased), purchase occasion (routine, special, hours or days of purchase, fixed place or while traveling),
psychographic (lifestyle, personality, needs, values, attitudes, motivations), benefits (comfort, quality, economy,
ease, speed, etc.), beliefs and attitudes (towards brands, products, purchase channels) (Tynan and Drayton, 1987;
Rayport, 2001; Wyner, 2002; Kotler et al., 2009; Villarreal, R., 2014).

© IEOM Society International 1391

Proceedings of the International Conference on Industrial Engineering and Operations Management
Monterrey, Mexico, November 3-5, 2021

As expected, different segmentation criteria give rise to different results or segments. The degree of difficulty to
complete the segmentation varies too. In particular, psychographic variables are the most complex mainly because
they are related to the internal structure of individuals and are subjective in nature. Despite that, studies show that it
is an approach that can be appropriate to guide marketing strategies (Barry and Weinstein, 2009; Kaze and Skapars,
2011; Scheuffelen et al., 2019).
Selecting segmentation criteria is not simple and yet crucial. It depends on the purpose of the segmentation and on
the data availability, among others. Data-based segmentation is a useful tool to understand customer, however most
of the times data are not easy to be understood (Dolnicar and Leisch, 2014; Venter et al., 2015). Once the
segmentation is completed, it is necessary to identify the segments of mayor interest and the way they will be
approached, which depends on the specific customers’ needs and the products the company offers. (Wyner, 2002;
Wedel and Kamakura, 2002).
Knowing the behavior of consumers is crucial to create value and communicate it. Along the years, several models
to explain customers’ behavior have been created. They are based on a paradigm commonly referred in the literature
as CAB (cognition, affect, behavior). Howar and Sheth (1969) proposed that the brand recognition influences the
attitude and the purchase intention.
This investigation make use of an extensive survey based on behavioral CAB paradigm to analyze supermarket
customers by means of the following constructs: purchasing objectives (Baltas, 1997, Putrevu and Lord, 2001);
supermarket image (Semeijn et. Al, 2004; Collins et. Al, 2002, Grewal et al., 1998); brand loyalty (Garretson et. Al,
2002; Harcar and Kucukemiroglu, 2006, Ailawadi, 2001); shopping experience (Ailawadi, 2001); convenience of
the commercial relationship (Flavián et al., 2001); brand satisfaction (Burton et al., 1998); private label perception
(Collins-Dodd, and Lindley, 2003; Garretson, 2002; Burton et al., 1998, Dick et al., 1995).

1.1 Objective
To carry out a customer behavioristic segmentation for supermarket chains by means of applying unsupervised
machine learning techniques and clustering algorithms.

2. Literature Review

2.1 Market segmentation

Market segmentation can be understood as the split separation of the market in groups of customers with different
characteristics and behavior, who might require separate products (Kotler and Armstrong, 1999).

Several criteria can be applied to define segment. Some of the most studied segmentation types in the literature are:
• Behavioral : brand loyalty, buyer journey stage, price sensitivity, purchasing style, etc.
• Benefit : customer service, quality, etc.
• Demographic : age, education level, gender, income, family members, status, religion, etc.
• Geographic : country, city, district, etc.
• Psychographic : hobbies, interests, lifestyle, etc.

2.2 Behavioral segmentation

This investigation is focused on the behavioral segmentation for supermarket customers. This type of segmentation
classifies customers based on pattern of their behavior. Four types of behavioral segmentation have been widely
studied: purchase behavior, occasion-based purchases, benefits sought, and customer loyalty

© IEOM Society International 1392

Proceedings of the International Conference on Industrial Engineering and Operations Management
Monterrey, Mexico, November 3-5, 2021

Figure 1. Behavioral segmentation

2.2.1 Segmentation based on purchase and usage behavior

This type of segmentation is useful to understand the stage of the buyer’s journey in which customers are, and
therefore to determine appropriate purchase triggers.

Figure 2. Consumer’s journey-stage model

2.2.2 Segmentation based on occasion

This segmentation classifies customers based on the specific dates or moments of the day they purchase.

2.2.3 Segmentation based on sought benefits

This segmentation groups customers based on value proposition they look for. Understanding the benefits
consumers expect can help redefined strategies and tactics to satisfice each segment.

2.2.4 Segmentation based on customer loyalty

This segmentation classifies customers based on the level of loyalty, which can be measured in terms of the
purchase frequency.

2.3 Machine learning

Machine learning is usually referred as the branch of artificial intelligence (AI) that uses algorithms to find patterns
and to learn from datasets through experience. There several types of machine learning algorithms: supervised,
unsupervised, and reinforcement algorithms. In supervised learning, the training is carried out using labelled
datasets. This means that the class or the value to be predicted is included in the dataset so it can be used for
training. In the case of unsupervised learning, instead, the desired class is not known. The machine learning
algorithms used in this work have been implemented with WEKA 3.8.5. (Witten et al., 2017)

2.4 Clustering algorithms

A clustering machine learning algorithm is an unsupervised machine learning algorithm used for discovering natural
groupings or patterns. Some of the most popular clustering algorithms are:

• Agglomerative hierarchical clustering

• Density-based spatial clustering
• Expectation-maximization clustering (EMC)
• Simple K-Means algorithm (SKMA)
• Mean-shift clustering

2.5 Simple k-means algorithm (SKMA)

This type of clustering is probably the most popular of all. K-means clustering is an unsupervised machine learning
algorithm that is used to group or categorize unlabeled data. The algorithm works iteratively and assign every new
instance to one of the existing K clusters. The classification is carried out by similarity using the attributes or
features of the instances.

© IEOM Society International 1393

Proceedings of the International Conference on Industrial Engineering and Operations Management
Monterrey, Mexico, November 3-5, 2021

2.5.1 Number of clusters

Finding the optimal number of clusters is important because not all values of K will produce the best model. There
are several methods to determine the optimal number of clusters. Some of the most common are: average silhouette
method, elbow method, and gap statistic method.

2.6 Expectation-maximization clustering (EMC)

The expectation-maximization (EM) algorithm is an iterative procedure for the maximum likelihood estimate of a
parametric distribution. A particular case of this algorithm is the parameter estimation of a Gaussian Mixture Model
(GMM) when the generating Gaussian of each observation is unknown, commonly known as Expectation-
Maximization Clustering (EMC) (Garriga et al., 2016; Jung et al., 2014).

Figure 3. Clustering data

3. Methods
This investigation is carried out following a 4-stage model: analysis, design, construction, and discussion (Figure 5).

Figure 4. Four-stage model

3.1 Analysis
In the first stage, a complete review of the data gathered during an extensive survey about the purchasing habits and
supermarket customers’ preferences is carried out. The survey took place in Temuco, a city in the south of Chile
with a population of approximately 221.000 inhabitants, and it considered five supermarket chains each of them

© IEOM Society International 1394

Proceedings of the International Conference on Industrial Engineering and Operations Management
Monterrey, Mexico, November 3-5, 2021

targeting different demographic market segments. These chains are quite distinguishable. For instance, one of them
is focused on convenient stores, while other one offers exclusive products at higher prices.
The original questionnaire was based on previous well-documented survey created by renowned authors. It has 69
questions grouped in several domains such us: product quality, product availability and variety, value/price
proposition, discount and loyalty campaigns, customer service, facility organization, etc. For the purposes of the
present work only 25 questions grouped in 5 domains are considered, all of which were answered using a scale from
1 to 7 (Table 1).

Table 1. Selected domains

Domain Questions
D.1 Products’ quality 4
D.2 Prices and discounts 6
D.3 Facility and service 5
D.4 Products’ availability and variety 5
D.5 Overall purchase experience 5

Since the chains considered in the original survey target different demographic segments, this research focusses in
only one of them. The selected chain is the middle of the price range, it counts with stores in different districts of the
city and it has a well-established private label with a variety of products.

3.2 Design
The original unlabeled dataset, a matrix of 1073 rows (instances) by 25 columns, is prepared to be clustered by
means of applying the algorithms SKMA and EMC. During the experiments, different number of clusters will be
tested and compared. Survey domains and their corresponding questions are presented below (Table 2 and Table 3).

Table 2. Questions per domain

Domain Q.3 Q.4 Q.5 Q.6 Q.7 Q.16 Q.17 Q.18 Q.19 Q.20 Q.21 Q.22 Q.23
D.1 ✓ ✓ ✓
D.2 ✓ ✓ ✓
D.3 ✓ ✓ ✓ ✓ ✓
D.4 ✓ ✓
D.5

Table 3. Questions per domain (continuation)

Domain Q.41 Q.42 Q.43 Q.44 Q.45 Q.46 Q.47 Q.48 Q.49 Q.50 Q.52 Q.53
D.1 ✓
D.2 ✓ ✓ ✓
D.3
D.4 ✓ ✓ ✓
D.5 ✓ ✓ ✓ ✓ ✓

3.3 Construction
As aforementioned, the objective of this investigation is to carry out the behavioral segmentation of a specific
supermarket chain customers by means of applying clustering algorithms. However, behavioral segmentation differs
significantly from other types. For instance, in demographic segmentation is evident whether customers are in
certain age range. In this case, instead, a deeper analysis is required.
In particular, the survey data used in this work is organized in discrete values in the range 0 to 7. The summary of
the answers to que questions are presented below (Table 4)

Table 3. Questions’ answer summary

Question ID Value 0 Value 1 Value 2 Value 3 Value 4 Value 5 Value 6 Value 7
Q.3 17 9 5 19 47 94 190 692

© IEOM Society International 1395

Proceedings of the International Conference on Industrial Engineering and Operations Management
Monterrey, Mexico, November 3-5, 2021

Q.4 17 30 18 53 110 265 247 333

Q.5 18 13 16 31 73 161 240 521
Q.6 21 87 40 71 105 190 219 340
Q.7 27 7 10 26 46 146 263 548
Q.16 19 8 9 48 87 144 347 411
Q.17 20 3 9 36 54 128 320 503
Q.18 23 2 8 35 64 140 307 494
Q.19 24 6 8 38 94 189 306 408
Q.20 24 3 7 31 85 137 333 453
Q.21 25 3 11 17 50 133 326 508
Q.22 23 6 3 35 100 161 333 412
Q.23 40 141 36 64 172 145 193 282
Q.41 25 92 54 91 144 278 230 159
Q.42 22 186 76 111 224 168 168 118
Q.43 51 134 52 108 264 170 152 142
Q.44 30 127 61 107 195 205 211 137
Q.45 30 29 11 60 104 177 337 325
Q.46 31 91 36 68 137 218 289 205
Q.47 34 116 41 100 133 226 228 195
Q.48 22 13 8 24 60 150 309 487
Q.49 21 6 9 9 73 154 309 492
Q.50 23 5 6 31 83 181 326 418
Q.52 24 15 16 29 96 188 340 365
Q.53 21 8 6 22 79 120 344 473

3.4 Discussion
Even though there exist more algorithms, the scope of this work is restricted to 2 of the most common algorithms:
SKMA and EMC, which belong to a broader family usually called Gaussian mixture models.
The first algorithm studies, SMKA, requires the definition of K centroids and the iterations until certain degree of
convergence to a local minimum is achieved. The latter, EMC, is meant to solve some of the weaknesses of SKMA.
Rather than focusing on the accuracy of the classification, due to the nature of the behavioral clustering the interest
is set on the number of clusters and the distribution of them.

4. Data Collection
Finding the optimal number of clusters for a given dataset requires the application of optimization algorithms.
However, it might have some drawbacks. Especially when the number of clusters is too high to be applied in reality.
The following tables present the data (instances) classification distribution when clustering algorithms are force to
generate 1, 2, 3, 4, and 5 clusters (Table 4 and Table 5).

Table 4. SKMA with k=2, 3, 4, and 5

K Cluster 1 Cluster 2 Cluster 3 Cluster 4 Cluster 5
% Instances % Instances % Instances % Instances % Instances
2 55 585 45 488
3 42 454 44 468 14 151
4 26 282 19 204 16 175 38 412
5 27 291 15 157 15 161 26 277 17 187

Table 5. EMC with 2, 3, 4, and 5 clusters

K Cluster 1 Cluster 2 Cluster 3 Cluster 4 Cluster 5
% Instances % Instances % Instances % Instances % Instances
2 38 404 62 669
3 31 328 44 474 25 271
4 41 444 25 266 2 26 31 337
5 2 24 29 316 24 256 32 338 13 139

Proceedings of the International Conference on Industrial Engineering and Operations Management
Monterrey, Mexico, November 3-5, 2021

5. Results and Discussion

It might be interesting to determine the number of cluster that mathematically minimized the total squared distance
between each data point and its closest centroid. SKMA can do that after several iterations. In spite of being the
correct this approach might lead a not practical clustering. The following table presents how the dataset is
distributed after applying SKMA with the optimal K equals to 16. The clustering produced by EMC is also shown.
Implementing a marketing strategy for 16 different segment might be troublesome. From them, in the case of SKMA
only 4 clusters concentrate between 10% and 20% of the data and only 2 of them in the case of EMC (Table 6).

Table 6. EMC and SKMA with 16 clusters

# Cluster EMC SKMA
% Instances % Instances
1 4 47 10 104
2 9 92 8 88
3 2 22 5 57
4 7 70 7 80
5 4 45 13 135
6 5 53 9 99
7 8 89 5 56
8 17 181 4 44
9 10 102 7 72
10 12 130 5 59
11 1 15 6 66
12 13 141 7 76
13 2 22 5 51
14 3 30 3 31
15 1 10 3 34
16 2 24 2 21

5.1 Numerical Results

Behavioral segmentation might not be as evident as other types of segmentation due to the difficulty to differentiate
customers. The definition of each cluster or segment requires a deep analysis. The success of a marketing strategy
depends greatly on an appropriate understanding of customers purchase practices.
As mentioned earlier, the data used in this work were taken from an extensive supermarket customer survey whose
questions are organized in several domains (Table 1 and Table2). The following tables present the distribution of the
clusters generated by SKMA and EMC when the survey domains are separately analyzed (Table 7, Table 8, Table 9,
Table 10, and Table 11).

Table 7. Clustering for domain D.1 (Q.3, Q.4, Q.7, and Q.53)
SKMA EMC
Cluster % Instances % Instances
C.1 31 328 51 549
C.2 11 121 1 8
C.3 40 430 6 60
C.4 9 99 2 18
C.5 9 95 41 438

Table 8. Clustering for domain D.2 (Q.5, Q.6, Q.16, Q.48, Q.50, and Q.52)
SKMA EMC
Cluster % Instances % Instances
C.1 35 374 2 22
C.2 17 179 34 360
C.3 9 101 35 376
C.4 15 156 18 196

Proceedings of the International Conference on Industrial Engineering and Operations Management
Monterrey, Mexico, November 3-5, 2021

C.5 25 263 11 119

Table 9. Clustering for domain D.3 (Q.17, Q.18, Q.19, Q.20, and Q.22)
SKMA EMC
Cluster % Instances % Instances
C.1 26 282 2 26
C.2 19 202 32 345
C.3 18 188 9 101
C.4 8 82 36 382
C.5 30 319 20 219

Table 10. Clustering for domain D.4 (Q.21, Q.23, Q.43, Q.47, and Q.49)
SKMA EMC
Cluster % Instances % Instances
C.1 30 325 31 334
C.2 21 227 30 322
C.3 20 214 4 41
C.4 17 186 22 235
C.5 11 121 13 141

Table 11. Clustering for domain D.5 (Q.41, Q.42, Q.44, Q.45 and Q.46)
SKMA EMC
Cluster % Instances % Instances
C.1 34 365 3 28
C.2 25 266 15 159
C.3 14 150 33 355
C.4 13 141 12 129
C.5 14 151 37 402

6. Conclusion
Behavioral segmentation, different from other types, is not so evident and usually requires a deeper analysis to
establish the difference between segments. This investigation takes advantage of an extensive supermarket customer
survey to outline a segmentation based on the well-known SKMA and EMC clustering algorithms.
Clustering algorithms are a special case type of machine learning algorithms used to classified unlabeled data by
means of grouping data points having similarities or some degree of closeness with each other.
Both SKMA and EMC are iterative optimization methods. Depending on the circumstances and necessities it might
be possible to find the optimal number of clusters or define a given number that is more practical. Experiments
revealed that the optimal number of clusters for the dataset is 16. However, only between 2 and 4 clusters
concentrates more than 10% of the data points. A collection of small clusters or market segments might complicate
excessively the design of an effective marketing campaign. Instead of that, a fixed number of clusters, from K=2 to
K=5, was analyzed. In the case of five clusters, while SKMA cluster sizes are 27%, 15%, 15%, 26%, and 17%,
EMC cluster sizes are 2%, 29%, 24%, 32%, and 13%.
An additional analysis was carried out to determine whether the nature of the survey’s domain has an influence on
the cluster definition. The difference in the sizes of the resulting clusters confirms that the segmentation depends, up
to certain point, on the criteria being applied.
In conclusion, both clustering algorithms SKMA and EMC can help segmenting supermarket customers based on
their behavior. However, behavioral segmentation requires a deeper analysis since the cluster borders usually are not
evident.

References

Ailawadi, K., The retail power-performance conundrum, Journal of Retailing, vol. 77, no. 3, pp. 299-318, 2001

Proceedings of the International Conference on Industrial Engineering and Operations Management
Monterrey, Mexico, November 3-5, 2021

Baltas, G., Determinants of store brand choice: a behavioral analysis, Journal of Product & Brand Management, vol.
6, no. 5, pp. 315-324, 1997.
Barry, J., and Weinstein, A., Business psychographics revisited: From segmentation theory to successful marketing
practice, Journal of Marketing Management, vol. 25, no. 3-4, pp. 315–340, 2009.
Burton, S., Lichtenstein, D., Netemeyer, R., and Garretson, J., A scale for measuring attitude toward private label
products and an examination of its psychological and behavioral correlates, Journal of the Academy of
Marketing Science, vol. 26, no. 4, pp. 293-306, 1998.
Collins-Dodd, C., and Lindley, T., Store brands and retail differentiation: the influence of store image and store
brand attitude on store own brand perceptions, Journal of Retailing and Consumer Services, vol. 10, no. 6,
pp. 345-352, 2003.
Dick, A., Jain, A., and Richardson, P., Correlates of store brand proneness: Some empirical observations, Journal of
Product & Brand Management, vol. 4, no. 4, pp. 15-22. 1995.
Dolnicar, S., and Leisch, F., Using graphical statistics to better understand market segmentation solutions,
International Journal of Market Research, vol. 56, no. 2, pp. 207-230, 2014.
Flavián, C., Martínez, E., and Polo, Y., Loyalty to grocery stores in the Spanish market of the 1990s, Journal of
Retailing and Consumer Services, vol. 8, no. 8), pp. 85-93, 2001.
Garretson, J. A., Fisher, D., and Burton, S., Antecedents of private label attitude and national brand promotion
attitude: similarities and differences, Journal of Retailing, vol. 78, no. 1, pp. 91-99, 2002.
Garriga J., Palmer J., Oltra A., and Bartumeus F., Expectation-Maximization Binary Clustering for Behavioural
Annotation, PLoS ONE, vol. 11, no. 3 2016.
Grewal, D., Krishnan, R., Baker, J., and Borin, N., The effect of store name, brand name and price discounts on
consumers’ evaluations and purchase intentions, Journal of Retailing, vol. 74, no. 3, pp. 331-352, 1998.
Harcar, T., Kara, A., and Kucukemiroglu, O., Consumer’ s perceived value and buying behavior of store brands: An
empirical investigation, The Business Review, Cambridge, vol. 5, no. 2, pp. 55-62, 2006.
Jung Y., Kang m., and Heo M., Clustering performance comparison using K-means and expectation maximization
algorithms, Biotechnology & Biotechnological Equipment, vol. 28, pp. 44-48, 2014.
Kaze, V., and Skapars, R., Paradigm shift in consumer segmentation to gain competitive advantages in post-crisis
FMCG markets: Lifestyle or social values? Economics and Management, vol. 16, no. 1956, pp. 1266-1274,
2011.
Kotler, P. and Keller, K., Marketing Management, 13th edition, Pearson Prentice-Hall, 2009.
Mckechnie, S., Integrating intelligent systems into marketing to support market segmentation decisions, Intelligent
Systems in Accounting, Finance and Management, vol. 14, no. 3, pp.117-127, 2006.
Putrevu, S., and Lord, K., Search dimension, patterns segment profiles of grocery shoppers. Journal of Retailing and
Consumer Service, vol. 8, no. 3, pp. 127-137, 2001.
Scheuffelen, S., Kemper, J., and Brettel, M., How do human attitudes and values predict online marketing
responsiveness? Comparing consumer segmentation bases toward brand purchase and marketing response,
Journal of Advertising Research, vol. 59, no. 2, pp. 142-157, 2019.
Semeijn, J., van Riel, A., and Ambrosini, A., Consumer evaluations of store brands: effects of store image and
product attributes, Journal of Retailing and Consumer Services, vol. 11, no. 11, pp. 247-258, 2004.
Sigurdsson, V., and Kahamseh, S. (2013). An econometric examination of the behavioral perspective model in the
context of Norwegian retailing, The Psychological Record, vol. 63, no. 1, pp. 277-294, 2013.
Smith, W., Product differentiation and market segmentation as alternative marketing strategies, Journal of
Marketing, 21, 3–8, 1956.
Tynan, A., and Drayton, J., Market segmentation, Journal of Marketing Management, vol. 2, no. 3, pp. 301-335,
1987.
Venter, P., Wright, A., and Dibb, S., Performing market segmentation: a performative perspective. Journal of
Marketing Management, vol. 31, pp. 62-83, 2015.
Villarreal, R., Improving multicultural marketing: between-group and within-group segmentation approaches,
Journal of Brand Strategy, vol. 3, no. 3, pp. 278-289, 2014.
Wedel, M., and Kamakura, W., Introduction to the special issue on market segmentation. International Journal of
Research in Marketing, vol. 19, pp. 181-183, 2002.
Witten, I., Frank, E., Hall, M., and Pal, C., Data Mining: Practical Machine Learning Tools and Techniques, 4th
Edition, Morgan Kaufmann, Cambridge, 2017.
Yoseph, F., Malim, N., Heikkilä, M., Brezulianu, A., Geman, O., and Rostam, N., The impact of big data market
segmentation using data mining and clustering techniques. Journal of Intelligent and Fuzzy Systems, vol.
38, no. 5, pp. 6159-6173, 2020.

Proceedings of the International Conference on Industrial Engineering and Operations Management
Monterrey, Mexico, November 3-5, 2021

Biographies

Carlos Hernández is an industrial engineer, consultant, and university professor. He earned Licentiate Degree in
Engineering from Universidad de La Frontera, Temuco, Chile, Master of Sciences in Computational Engineering
and Doctor of Engineering from Technische Universität Braunschweig, Brunswick, Germany. He is the author of
several scientific and engineering articles. He has taught lectures in Discrete Event Simulation, Supply Chain
Management, Engineering Economics, Corporate Finances, Financial Engineering, Business Analytics, Data Mining
and Machine Learning for engineering students. He has developed a professional career working for large
multinational companies (PricewaterhouseCoopers, BHP Billiton, and Merck Sharp & Dohme). He also worked as a
scientific researcher in the Institut für Produktionsmesstechnick at TU Braunschweig, Germany. His research
interests include manufacturing process simulation, transportation systems simulation, supply chain design and
simulation, and machine learning for finances. He is a member of IEOM.

Galo Paiva is an industrial engineer, consultant, and university professor. He earned Licentiate Degree in
Engineering from Universidad de Santiago de Chile, Chile, and Doctor of Business Management from Universidad
Autónoma de Madrid, Spain. He has taught lectures in Strategic Management, Operations Management, Industrial
Engineering, and Project Planning & Management. His research interests include manufacturing process simulation,
industrial design, business management, and entrepreneurship.

Customer Segmentation Based On Machine Learning Method
No ratings yet
Customer Segmentation Based On Machine Learning Method
7 pages
Customer Segmentation With Machine Learning
No ratings yet
Customer Segmentation With Machine Learning
7 pages
Customer Segmentation Via Data Mining Techniques: State-of-the-Art Review
No ratings yet
Customer Segmentation Via Data Mining Techniques: State-of-the-Art Review
20 pages
Data Science for Customer Segmentation
No ratings yet
Data Science for Customer Segmentation
7 pages
A Cluster-Based Analysis For Targeting Potential Customers in A Real-World Marketing System
No ratings yet
A Cluster-Based Analysis For Targeting Potential Customers in A Real-World Marketing System
8 pages
Customer Segmentation Using K Means Clustering IJERTV11IS030152
No ratings yet
Customer Segmentation Using K Means Clustering IJERTV11IS030152
6 pages
Updated Thesis
No ratings yet
Updated Thesis
29 pages
Customer Segmentation Using Machine Learning
No ratings yet
Customer Segmentation Using Machine Learning
8 pages
Machine Learning for Customer Segmentation
No ratings yet
Machine Learning for Customer Segmentation
7 pages
Amazon User Segmentation: Ankit Chaudhary, Abhishek Pal, Ankit Saraswat, Harshit Jindal, Jagbeer Singh
No ratings yet
Amazon User Segmentation: Ankit Chaudhary, Abhishek Pal, Ankit Saraswat, Harshit Jindal, Jagbeer Singh
7 pages
Janardhanan 2020 J. Phys. Conf. Ser. 1706 012160
No ratings yet
Janardhanan 2020 J. Phys. Conf. Ser. 1706 012160
9 pages
Behavioural Customer Segmentation Based
No ratings yet
Behavioural Customer Segmentation Based
7 pages
1 (Buchdadi)
No ratings yet
1 (Buchdadi)
21 pages
IJCSP23D1055
No ratings yet
IJCSP23D1055
9 pages
Paper 7-Application of K Means Algorithm For Efficient Customer Segmentation
No ratings yet
Paper 7-Application of K Means Algorithm For Efficient Customer Segmentation
5 pages
Yoseph 2019
No ratings yet
Yoseph 2019
19 pages
Mall Customer Segmentation Kalash Daf
No ratings yet
Mall Customer Segmentation Kalash Daf
12 pages
Digital Marketing: Audience Insights
No ratings yet
Digital Marketing: Audience Insights
4 pages
IGI - Book 270 292
No ratings yet
IGI - Book 270 292
24 pages
маркетинг инжениринг
No ratings yet
маркетинг инжениринг
27 pages
Hariprasath Conferencepaper
No ratings yet
Hariprasath Conferencepaper
6 pages
Employee Mangement System
No ratings yet
Employee Mangement System
60 pages
Segmentation of Bank Consumers For Artificial Intelligent Marketing PDF
No ratings yet
Segmentation of Bank Consumers For Artificial Intelligent Marketing PDF
16 pages
Market Segmentation Using High-Dimensional Sparse Consumers Data - ScienceDirect
No ratings yet
Market Segmentation Using High-Dimensional Sparse Consumers Data - ScienceDirect
7 pages
Basepaper 4
No ratings yet
Basepaper 4
7 pages
Papar
No ratings yet
Papar
11 pages
Segmentation of Shopping Mall Customers Using Machine Learning
No ratings yet
Segmentation of Shopping Mall Customers Using Machine Learning
11 pages
Janardhanan 2020 J. Phys. Conf. Ser. 1706 012160
No ratings yet
Janardhanan 2020 J. Phys. Conf. Ser. 1706 012160
9 pages
Women & FMCG Segmentation in India
No ratings yet
Women & FMCG Segmentation in India
31 pages
Strategic Marketing & Planning
No ratings yet
Strategic Marketing & Planning
23 pages
Market Segmentation Essentials
No ratings yet
Market Segmentation Essentials
14 pages
2024-10-31T08 50 30.040Z NguyenThiNgocTran BA1905 MKT101 ESSAY.
No ratings yet
2024-10-31T08 50 30.040Z NguyenThiNgocTran BA1905 MKT101 ESSAY.
4 pages
Market Segmentation Strategies
100% (2)
Market Segmentation Strategies
11 pages
Multiple PCWare
No ratings yet
Multiple PCWare
8 pages
Research Paper Mini Project
No ratings yet
Research Paper Mini Project
13 pages
Fin Irjmets1653303840
No ratings yet
Fin Irjmets1653303840
4 pages
Unit 2
No ratings yet
Unit 2
54 pages
Goods and Services Demographics: Requirements of Market Segments
No ratings yet
Goods and Services Demographics: Requirements of Market Segments
4 pages
Difference Between Industrial Market Segmentation and Traditional Market Segmentation
No ratings yet
Difference Between Industrial Market Segmentation and Traditional Market Segmentation
5 pages
Segmentation 091008102605 Phpapp01
No ratings yet
Segmentation 091008102605 Phpapp01
23 pages
Segmentation Targeting Positioning (STP)
No ratings yet
Segmentation Targeting Positioning (STP)
20 pages
Market Segmentation: Criteria For Segmenting
No ratings yet
Market Segmentation: Criteria For Segmenting
7 pages
Market Segmentation
No ratings yet
Market Segmentation
30 pages
Trends in Consumer Segmentation
No ratings yet
Trends in Consumer Segmentation
9 pages
The Need For Market Segmentation
No ratings yet
The Need For Market Segmentation
4 pages
Rediscovering Market Segmentation
100% (1)
Rediscovering Market Segmentation
14 pages
Market Basket Analysis
No ratings yet
Market Basket Analysis
86 pages
Buying Behavior Study With Basket Analysis: Pre-Clustering With A Kohonen Map
No ratings yet
Buying Behavior Study With Basket Analysis: Pre-Clustering With A Kohonen Map
14 pages
Market Segmentation Insights
No ratings yet
Market Segmentation Insights
3 pages
Yao, Mikaila Mari C. RO2
No ratings yet
Yao, Mikaila Mari C. RO2
72 pages
Marketing Segmentation Guide
100% (2)
Marketing Segmentation Guide
11 pages
Updated Thesis
No ratings yet
Updated Thesis
28 pages
Mall Customer Segmentation Using Cluster
No ratings yet
Mall Customer Segmentation Using Cluster
6 pages
Final Synopsis
No ratings yet
Final Synopsis
9 pages
Identifying Market Segments and Targets
No ratings yet
Identifying Market Segments and Targets
2 pages
Topic 2-Marketing Segmentation
No ratings yet
Topic 2-Marketing Segmentation
23 pages
Market Segmentation Project
No ratings yet
Market Segmentation Project
9 pages
CertificateOfCompletion - HTTP Essential Training
No ratings yet
CertificateOfCompletion - HTTP Essential Training
1 page
Deep Learning For Image Spam Detection
No ratings yet
Deep Learning For Image Spam Detection
44 pages
Shashu Brhane
No ratings yet
Shashu Brhane
55 pages
Quantitative Worksheet Marryland
No ratings yet
Quantitative Worksheet Marryland
2 pages
Atsede Abebe
No ratings yet
Atsede Abebe
70 pages
SMU Chala Tamirat Customers Identity Card Data Detection and Recognition
No ratings yet
SMU Chala Tamirat Customers Identity Card Data Detection and Recognition
96 pages
Tajudin Mohammed
No ratings yet
Tajudin Mohammed
78 pages
Master's Thesis Juho Jaaskelainen
No ratings yet
Master's Thesis Juho Jaaskelainen
97 pages
2018 MCS 039
No ratings yet
2018 MCS 039
120 pages
Customer Segmentation
No ratings yet
Customer Segmentation
119 pages
FeMoUS Go Live Communication 1018
No ratings yet
FeMoUS Go Live Communication 1018
26 pages
CAT-5941 FleX-Net FX-2003-12NDS Network Fire Alarm Control Panel
No ratings yet
CAT-5941 FleX-Net FX-2003-12NDS Network Fire Alarm Control Panel
7 pages
Python List Assignments
No ratings yet
Python List Assignments
4 pages
IT Model Exam With Brief Answers
No ratings yet
IT Model Exam With Brief Answers
98 pages
Lecture TWP Python A03 1a General Features of Python
No ratings yet
Lecture TWP Python A03 1a General Features of Python
37 pages
Database Design for Students
No ratings yet
Database Design for Students
29 pages
Free MS Office 2013 Keys
100% (1)
Free MS Office 2013 Keys
3 pages
Module 4
No ratings yet
Module 4
64 pages
Shri Saibaba Sansthan Trust, Shirdi (Offical Booking Portal)
No ratings yet
Shri Saibaba Sansthan Trust, Shirdi (Offical Booking Portal)
1 page
PHD Thesis On Wireless Ad - Hoc Network
100% (3)
PHD Thesis On Wireless Ad - Hoc Network
6 pages
Information Technology in Business
No ratings yet
Information Technology in Business
89 pages
Ds Europe An401plus Series
No ratings yet
Ds Europe An401plus Series
20 pages
Oracle - Examcollection.1z0 083.vce.2023 Jul 20.by - Fabian.169q.vce
No ratings yet
Oracle - Examcollection.1z0 083.vce.2023 Jul 20.by - Fabian.169q.vce
13 pages
12 RGB Display
No ratings yet
12 RGB Display
113 pages
Lecture Notes w4 - Scrum and Kanban
No ratings yet
Lecture Notes w4 - Scrum and Kanban
65 pages
System Realization
No ratings yet
System Realization
22 pages
Compare Traditional Programming and Machine Learning.
No ratings yet
Compare Traditional Programming and Machine Learning.
2 pages
Social Media The Convergence of Public and Personal Communication 2nd Edition Graham Meikle Download
No ratings yet
Social Media The Convergence of Public and Personal Communication 2nd Edition Graham Meikle Download
52 pages
Evermotion Archinteriors Vol 3 PDF
No ratings yet
Evermotion Archinteriors Vol 3 PDF
2 pages
Mortyr
No ratings yet
Mortyr
32 pages
Cessna Aircraft Company: Wichita Ansas
No ratings yet
Cessna Aircraft Company: Wichita Ansas
252 pages
Deep Learning-Based Quality Assessment of 3d Point Clouds Without Reference
No ratings yet
Deep Learning-Based Quality Assessment of 3d Point Clouds Without Reference
6 pages
Review of Related Literature
100% (1)
Review of Related Literature
5 pages
Blockchain Development Tasks & Instructions - CodeAlpha
No ratings yet
Blockchain Development Tasks & Instructions - CodeAlpha
2 pages
Vemag Robot 500 Spec Sheet
No ratings yet
Vemag Robot 500 Spec Sheet
2 pages
Instagram Marketing Tips
No ratings yet
Instagram Marketing Tips
11 pages
PhonePe Statement Jun2025 Jul2025
No ratings yet
PhonePe Statement Jun2025 Jul2025
3 pages
Sampling Techniques and Sample Size Determination
No ratings yet
Sampling Techniques and Sample Size Determination
35 pages
Implementation and Validation of NSGA-II Algorithm For Constrained and Unconstrained Multi-Objective Optimization Problem
No ratings yet
Implementation and Validation of NSGA-II Algorithm For Constrained and Unconstrained Multi-Objective Optimization Problem
7 pages
Python Network Programming Guide
100% (1)
Python Network Programming Guide
73 pages
Learn Linux System Programming With C - Katie Millie
No ratings yet
Learn Linux System Programming With C - Katie Millie
257 pages

Behavioral Segmentation For Supermarket Customers Using Unsupervised Machine Learning Algorithms

Uploaded by

Behavioral Segmentation For Supermarket Customers Using Unsupervised Machine Learning Algorithms

Uploaded by

Proceedings of the International Conference on Industrial Engineering and Operations Management

Monterrey, Mexico, November 3-5, 2021

Behavioral Segmentation for Supermarket Customers

© IEOM Society International 1391

2.1 Market segmentation

2.2 Behavioral segmentation

© IEOM Society International 1392

Figure 1. Behavioral segmentation

2.2.1 Segmentation based on purchase and usage behavior

Figure 2. Consumer’s journey-stage model

2.2.2 Segmentation based on occasion

2.2.3 Segmentation based on sought benefits

2.2.4 Segmentation based on customer loyalty

2.3 Machine learning

2.4 Clustering algorithms

• Agglomerative hierarchical clustering

2.5 Simple k-means algorithm (SKMA)

© IEOM Society International 1393

2.5.1 Number of clusters

2.6 Expectation-maximization clustering (EMC)

Figure 3. Clustering data

Figure 4. Four-stage model

© IEOM Society International 1394

Table 1. Selected domains

Table 2. Questions per domain

Table 3. Questions per domain (continuation)

Table 3. Questions’ answer summary

© IEOM Society International 1395

Q.4 17 30 18 53 110 265 247 333

Table 4. SKMA with k=2, 3, 4, and 5

Table 5. EMC with 2, 3, 4, and 5 clusters

© IEOM Society International 1396

5. Results and Discussion

Table 6. EMC and SKMA with 16 clusters

5.1 Numerical Results

© IEOM Society International 1397

C.5 25 263 11 119

© IEOM Society International 1398

© IEOM Society International 1399

© IEOM Society International 1400

You might also like