Basepaper 4
Basepaper 4
Abstract—Recently, the use of accumulated data in the trade profitable market segments. The era of big data challenges new
and financial sector has become increasingly important to tasks associated with efficient aggregation, processing and
improve the quality of services provided. For successful business, interpretation of large amounts of weakly formalized data. The
decision-making systems in retail should combine various main tasks closely related to the analysis of data in the socio-
marketing strategies with analysis of available data. For suppliers
of goods and services and banks acquirers the forecast of the
economic sphere are associated with predictive analytics and
success of outlets and customer's churn prediction are crucial, forecasting in retail, performing optimal marketing strategies,
especially when price policy changes. We propose data-driven customer’s churn prediction models etc. [2], [6].
approach that evaluates clusters of the stores based on the success One of the tasks of price discrimination is to set prices that
of each segment of suppliers and develop the model for assigning
the optimal bank’s acquiring rate within the shopping center
will allow you to sell the product to those categories of
taking into account different customer's loyalty rate accordingly customers who would not buy it at higher prices. And, at the
to their success. In this study we analyze the data on consumer's same time, increase them for segments with high purchasing
activity inside large shopping centers on the basis of the power. Using data-driven approach it is possible to analyze the
transactional dataset provided by our partner bank. We highlight dynamics of sales, comparing it with statistics accumulated
the segments of outlets based on their location, category of goods over the previous time, and to take into account many factors
and services, average monthly turnover and the number of in pricing. But the idea remains the same - to maximize
transactions and apply the constructed model to recommend the company profits.
optimal rate. The proposed approach can be applied in the retail
sector by large suppliers of goods and services and banks. Most dynamic pricing models involve complex
mathematical calculations and computer programming [5].
I. INTRODUCTION The basis of dynamic pricing is the use of information about
As it states from World Payments Report 2018 in the last various characteristics of customers (outlets) and related
year, the volume of non-cash transactions almost doubled the market conditions. Based on this information, the optimal
number of cash withdrawal transactions from bank cards [1]. interest rate for each client group selected using cluster
This indirectly confirms the statement, which analysts have analysis is established, which provides the maximum income
been talking about for a long time that money becomes taking into account the probabilities of an outflow of
“electronic”. customers from target groups. If the margin is too high, they
would go to the competitor, and in case of too low margin,
However, given the ever-increasing importance of they could pay more for the same offer.
electronic data for business in e-commerce, the significance of
using in-depth knowledge of market features and the The main objective of this study is to develop a data-based
characteristics of its main players - customers, suppliers and approach to optimize marketing strategies and bank acquiring
competitors - also in-creases. In order to sale best business systems by the example of shopping centers. The rest of the
marketers should analyze what market segment has the highest paper is organized as follows. Section II presents related work
willingness to buy and to pay and what specific price level will on methods which were applied to solve similar problems
generate the highest market share, revenue and profits [2]. earlier. Section III gives formal description of the available
Detailed segmentation based on socioeconomics, value drivers, data on which we made extrapolation of the data and further
decision drivers and loyalty to pay more generates the most modeling, Section IV presents the mathematical formalization
important data about the market [2], [4]. of the proposed dynamic loyalty model. Section V is for the
methodic of experimental study and experimental results. And
The modern approach to the differentiated pricing strategy finally, Section VI presents conclusion and discussion.
involves wider use of the available information that could be
collected from the retailers for the further assigning of different II. RELATED WORK
prices for the same goods and services in relation to different
categories of customers or at different times [2], [3]. Such The dependence of consumer sensitivity to price changes
strategies allow suppliers to regulate demand throughout the was discussed in [4], [6], [7] on the basis of historicaldataon
year (weeks, days) and enable them to work with the most relations between suppliers DQG FRQVXPHUV
ISSN 2305-7254
Authorized licensed use limited to: KIIT University. Downloaded on January 14,2025 at 16:17:50 UTC from IEEE Xplore. Restrictions apply.
______________________________________________________PROCEEDING OF THE 25TH CONFERENCE OF FRUCT ASSOCIATION
Fig. 1. Distribution of the number of the stores in different categories: left: in total for Northwest region of Russia, right: for 10 largest shopping centers in St.
Petersburg
Main ideas of the customer’s satisfaction analytics can be proposed model is based on aggregated transactions dataset
summarized as follows: (i) the longer the relationship, the provided by our bank-partner, supplemented by data from the
higher the propensity to use a certain product of the company; web-sites of banks-acquirers. Such data in most cases is the
(ii) the longer the relationship, the less sensitivity to price only thing analysts can work with, therefore the development
changes for a certain product of the company; (iii) the greater of such a decision-making model is a popular business task.
the Relationship Breadth (RB, the amount of goods delivered
to the customer), the higher the propensity to use a particular III. DATA AND MODELING
product of the company, but depending on the duration of the
relationship; (iv) the greater the RB, the greater the sensitivity A. Data
to price changes for a specific product. Nevertheless, 1) Shopping centers characteristics
historical data on customer interactions are often either
We use the dataset provided by our bank-partner (one of
inaccessible or insufficient to provide a comprehensive
the largest regional banks in Russia) to extract the information
analysis and identify the degree of customer satisfaction.
about electronic purchases in 10 largest shopping malls T(i),
Providing special marketing research or special surveys is also
i={1,…10} in Saint-Petersburg during the last 5 years. The
often not applicable, therefore, it is necessary to use an
binding of transactions to a specific shopping center was made
available data to form the optimal marketing strategies.
on the basis of the addresses of the shopping center and the
The degree of customer satisfaction is directly related to the geo-location of the successful transaction. The data was
price elasticity of this customer segment [5]. Many studies extracted from the following 10 shopping centers: “Galeria”,
investigating customer’s churn and the dependence of “Europolis”, “Mercury”, “MEGA Dybenko”, “MEGA
customer’s satisfaction on the existing pricing policy Parnas”, “Piterland”, “LETO”, “City Mall”, “Continent
(including discussed above) used data accumulated over a Zvezdnaya” and “Nevskiy Center”. Given information about
large historical period [4], [7], [8]. This made it possible to the purchases in different stores O(ij), i={1,…10}, j={1,….N(i)}
investigate the response of various customers to the changes in from the specific shopping center T(i) all the transactions were
pricing strategy or targeted offers, solving the classification or merged into 18 groups accordingly to the Merchant Category
regression problem [6]. Codes (MCC), which were assigned to the category of each
transaction.
In this study we propose an approach dealing with customer
churn and dynamic pricing strategy in the case where complete Fig. 1 represents the distributions of the number of the
historical data on the interaction of retail outlets with banks (or stores in specific category summarizing the data: (i) in total for
any suppliers of goods and services) is not available. The every transaction in the dataset (to the left) and (ii) only for the
Authorized licensed use limited to: KIIT University. Downloaded on January 14,2025 at 16:17:50 UTC from IEEE Xplore. Restrictions apply.
______________________________________________________PROCEEDING OF THE 25TH CONFERENCE OF FRUCT ASSOCIATION
extracted transaction in the largest shopping centers of Saint- To evaluate monthly turnover for the specific store O(j) in
Petersburg (to the right). One can see that the most popular (in the shopping center T(i) we have taken into account
terms of the number of stores) categories significantly differs information about the market share for our bank partner, and
in these two cases: in shopping centers, the most popular have multiplied the cumulative turnover value for the store in
categories are “Clothing, shoes & accessories” stores, the specific category in our data by this coefficient. The result
“Restaurants”, and “Children's wear”, while in the whole of evaluation shows that 96% of the outlets in the shopping
dataset markets mostly correspond to the “Food” category. centers have relatively large turnover (with > 300 000 rubles
per month).
We added new values to the existing characteristics of
transactions in shopping centers: the distance from the nearest
metro station, reflecting the geo-availability of the outlet, the
average number of transactions in a particular category in the
store, and the average monthly transaction value in a given
category.
For clustering and highlighting segments of retail
categories in shopping centers, we have used data with
extracted set of 62 “category in the mall” groups of retail
stores in 10 shopping centers. Next, using implementation of
the DBSCAN algorithm by scikit-learn Python library, 5
groups of stores were identified. 2D representation of these
clusters made by the tSNE algorithm is shown in Fig. 3.
Taking into account marketing research described above
(see discussions in [3], [4], [6], [7]), we have marked the
segment with a high average monthly turnover and top
categories (big black markers on the Fig. 3) as a segment with Fig. 2. The distribution of acquiring rates of 20 largest Russian banks for retail
greater loyalty to increasing the rate of acquiring, and the stores in dependence on its monthly turnover. Data was obtained from public
sources and official websites of banks
remaining groups as a segment with reduced loyalty
(“Others”).
The study [8] proposes an approach that relates tolerance
B. MODELING to price change with an indicator of customer satisfaction. And
since customer satisfaction in the case of store-bank
1) Calculation of acquiring rates interaction directly depends on sales, it is reasonable to
The rate of payment (authorization) consists of: associate the value of the acquiring rate with the turnover of
interchange fee (commission re-turned to the bank that issued the point of sale, which we observe in bank’s practice and in
the card), commissions charged by International Payment the experimental results of clustering in Section III.
Systems and the mark-up of the acquiring bank and. Thus the 2) The model of dynamic loyalty
bank’s revenue is determined only by the last term. As the rate
of payment and interchange fee can vary a lot depending on We develop the mathematical model to evaluate the optimal
the bank’s marketing strategy and existing agreements rate for the specific segment of retailers in the shopping center.
between banks in this study we will evaluate full rate of This model must meet the following conditions: (i) acquiring
payment which should pay the store to the bank-acquirer for rate should be different for retail outlets from different network
each transaction. retail segments; (ii) the probability of leaving for the customer
(point of sale) increases with an increase of the acquiring rate;
Russian banks have different strategies for assigning the (iii) the loyalty of retailer’s outlets to the increase of the
acquiring rate and very few of them take into account the acquiring rate differs for different segments.
categories of goods and services which are of the point of In order to distinguish between the segment of loyalty stores
interest for the specific store. The most popular characteristic (customers of the bank acquiring system) and the segment of
which affects the acquiring rate as the bank offer for retailers customers who violently reacting to any changes in pricing
is the monthly turnover of the specific store. In the Fig. 2 we policy we introduce the loyalty rate ݇ௌ . This rate should be
collect the open sources data from the Top-20 largest Russian greater than 0, and the case when ݇ௌ is close to 0 corresponds
bank’s web-sites about the dependence of the acquiring rates the situation when the segment is intolerant for the price price
on the turnover of specific store. It can be seen that there is a changes, and will almost certainly go away when it rises. Bank’s
clear tendency to a decrease in the rate with an increase in proceeds from the store O(j) from the shopping center T(i) will
thus depend on the loyalty rate ݇ௌ , which corresponds to this
ͳ ሺሻ (1)
ܲሺሻ ሺݎǡ ݇ௌ ǡ ௌ ሻ ൌ ௌ ሺ ݎെ ݎ ሻ store. The probability of the store’s churn P(i) for the specific
݇ௌ loyalty segment S is:
monthly turnover. where r0 – current acquiring rate for the store O(i), r – chosen
2) Stores segmentation acquiring rate, ௌ – is the initial probability of the outlet’s churn.
Authorized licensed use limited to: KIIT University. Downloaded on January 14,2025 at 16:17:50 UTC from IEEE Xplore. Restrictions apply.
______________________________________________________PROCEEDING OF THE 25TH CONFERENCE OF FRUCT ASSOCIATION
ሺሻ ሺሻ
ܨ௫ ൌ ܨ௦ ሺݎ௧ ǡ ݇ௌ ǡ ௌ ሻ (3)
௦
The model formulation (1) reflects the idea that even with no
change in acquiring rate there is still the probability ௌ of
customer’s churn and this value increases proportionally to the
change of the acquiring rate r and depends on the loyalty rate
݇ௌ . We propose to split all the stores into two segments
accordingly to the loyalty rate which is ruled by success of the
store, as we have done in Section III (A): “Segment 1” consists
of non-loyalty customers (stores with low turnover and low
frequencies of transactions) and “Segment 2” consists of Fig.4. Revenue dynamics due to the acquiring system for three customer
successful, and therefore, potentially loyalty stores, which are segments with different loyalty rate
Authorized licensed use limited to: KIIT University. Downloaded on January 14,2025 at 16:17:50 UTC from IEEE Xplore. Restrictions apply.
______________________________________________________PROCEEDING OF THE 25TH CONFERENCE OF FRUCT ASSOCIATION
Fig. 5. Experimental results of the probabilistic model for bank's income (relative values) from the acquiring system implemented in 10 largest shopping centers in
St. Petersburg, depending on the cost of acquiring for two segments of stores-customers with different loyalty rate to price changes. Bold line corresponds to the
loyal segment of the stores and dashed line is for the non-loyal ones
Authorized licensed use limited to: KIIT University. Downloaded on January 14,2025 at 16:17:50 UTC from IEEE Xplore. Restrictions apply.
______________________________________________________PROCEEDING OF THE 25TH CONFERENCE OF FRUCT ASSOCIATION
Authorized licensed use limited to: KIIT University. Downloaded on January 14,2025 at 16:17:50 UTC from IEEE Xplore. Restrictions apply.
______________________________________________________PROCEEDING OF THE 25TH CONFERENCE OF FRUCT ASSOCIATION
Authorized licensed use limited to: KIIT University. Downloaded on January 14,2025 at 16:17:50 UTC from IEEE Xplore. Restrictions apply.