Intelliarts ML in Insurance White Paper 2022
Intelliarts ML in Insurance White Paper 2022
Insurance Business
Using Data to Foster Innovation
Table of contents Get in touch
Andriy Zakharchuk
Executive summary 03
CEO @ Intelliarts
volodymyr-mudryi
Path to ML implementation 19
alexander-barinov
About us 28
Oleksandr Stefanovskyi
Head of R&D @ Intelliarts
oleksandr-stefanovskyi
02
Executive summary
The most critical asset in insurance
Data has always played a central role in the insurance industry. Today insurers have access to
more data than ever before — in the past few years, we have produced twice as much data as
decades before.
Despite this, a recent study1 revealed that most insurers are struggling to maximize the benefits
of data. They process not over 10-15% of the data they have access to. Moreover, most of this
data is structured and housed in traditional databases. This means insurance companies fail to
benefit from all structured data available to them while also ignoring the value of semi-structured
and unstructured data.
Analysis of this unstructured data, as well as better use of structured one, requires the
introduction of advanced data science techniques. Innovative data analytics technologies
centered on artificial intelligence (AI) and machine learning (ML) can bring order and purpose to
insurance data and help an insurer drive better business decisions.
AI/ML in Insurance
03
Executive summary
This doesn’t surprise, though. AI/ML technology is widely known for its potential to bring greater
efficiency and innovation across the entire insurance value chain. The potential applications of
ML are plentiful, from better risk assessment to faster claims processing and more accurate
fraud detection. ML is also useful for improving the insurer’s customer service with efficient lead
management, more personalized marketing offers, and customer churn prediction.
Together, these improvements in business operations allow insurers to enhance productivity and
generate better revenue streams. Check the survey results below that compare the revenue
distribution of traditional insurance organizations vs. so-called data master insurance
organizations with the technical capabilities, culture, and behaviors to support their data-driven
programs, which often rely on ML/AI technology.
$20-$50 $10-$20
$5-$10
billion billion
billion
51% 8%
12%
Source: Capgemini Research Institute, Data Mastery in Insurance survey, September-October 2021, N=204 insurance
organizations with N=37 data masters.
As we observe a global push toward digital insurance, this white paper will
help you understand the potential gains of introducing ML into your
business operations.
04
Data as the untapped resource in
insurance
Insurance has always revolved around data. Its traditional data sources ranged from financial
data to actuarial data to historical claims and risk data. These used to affect every single
important decision that an insurer made.
Over the years, the critical role of data in the insurance sector hasn’t changed. With no physical
products to manufacture and sell, data remains one of the most valuable assets in the industry.
Yet, we can boldly say that the nature of insurance data has altered, with data growing in
volumes, variety, and velocity.
The times when data was stored in rows and columns are long gone. Now
data comes in structured, semi-structured, and unstructured formats. And
insurance companies could use any of these to put insight into perspective.
Variety Claims and policy data, as well as consumer information, are well-known
examples of structured data. Insurers also use semi- and unstructured data,
such as real-time weather feeds, public records, social media posts,
geospatial data, cold calling records, and emails.
They say that, estimatedly, the volume of data doubles every two years.
Velocity Insurance data is growing really fast, with businesses aiming to make data
flow as quickly as possible — real-time data is preferred.
05
Data as the untapped resource in insurance
40%
Developing new features and coverages in 46%
traditional products 33%
35%
We continually revisit actuarial
assumptions (such as loss ratios, 35%
39%
emerging-risk scenarios at regional level,
30%
etc.) through data-derived insight
06
Unlocking the power of insurance
data with ML
As consumer expectations are growing bigger in terms of efficiency and customer service, we
see the shift to data-driven insurance businesses. By unlocking the power of data, ML
technology has a big chance of becoming the frontrunner in the industry.
A subfield of AI and computer science that focuses on the use of data and algorithms to
imitate how humans learn by gradually improving its accuracy.
Deriving from historical data, an ML algorithm can learn on its own and make predictions and
conclusions without being explicitly programmed. So, an ML model fuels data-driven decision-
making, where an insurer moves from data to insights to decisions. It generates the results as
accurately as possible, but the model is important to be fed with enough quality data.
There is a substantial difference between AI and ML, yet the two concepts are frequently
confused in business. As implied by its definition, ML is a branch of artificial intelligence. While
both belong to computer science, AI is a broader concept than ML that is designed to simulate
human thinking capability and behavior.
Another critical concept to mention is deep learning. Alike, it’s a subset of ML that handles
algorithms inspired by the structure and function of the human brain. The major difference
between the two lies in how the data gets presented to the machine.
While traditional ML algorithms rely on structured data, deep learning can work with both
structured and unstructured information working based on the multiple layers of artificial neural
networks.
07
Unlocking the power of insurance data with ML
08
Unlocking the power of insurance data with ML
The key drivers of ML in insurance vary. We can distinguish between the main ones:
So, what are specific ways to unleash the value of insurance data?
09
Use cases Machine learning in insurance
Today, we can speak about at least eight use cases of machine learning in
insurance that can bring insurance services to an entirely new level:
10
Use cases of machine learning in insurance
Use-case examples:
Case Study
A midsize insurance company has reached out to us to optimize their resources with better
lead scoring. Since the company got their leads from multiple sources, they wanted to
prioritize them efficiently.
Throughout our cooperation, the Intelliarts team has built a predictive scoring model that could
forecast the probability of how likely the lead would buy a policy. With this system in place,
insurance agents could call leads with high scores first and leave those leads with low scores
for later or skip them entirely in case of a lack of resources.
NewLeads
Your
Data
Lead Scoring
Models Segmentation
Ordering
External Machine Prioritization
Data Learning
Action
The ML-based lead scoring model we’ve built has shown over 90% accuracy and good results
in production. The model divided leads into several groups based on their buying capacity. The
most promising leads had an 80%+ probability of purchasing a policy, and the expected
conversion rate was 3.5 times bigger than on average. The insurer became able to process
leads more efficiently and cut costs, saving time on less profitable prospects.
11
Use cases of machine learning in insurance
Besides, insurers can use ML for market segmentation. By analyzing consumer data, such as
income, age, gender, and locations, ML algorithms can classify customers into different groups.
An insurer can then develop specific attitudes or adjust marketing tactics based on these groups
to target different customer segments.
Use-case examples:
Building innovative products and services that would target specific customer
segments
Our research proved a zero correlation between the company’s numbers being marked as spam
and the low success rate of the cold calls. Instead, Intelliarts has discovered multiple other
factors affecting the effectiveness of cold calls, such as the insurance agent who was calling
and the way how the calls were managed. The timing of the original and follow-up calls, as well
as the demographic state of the lead, also mattered.
Our data engineers offered the customer to build an ML system that could detect the
underperformance of phone numbers, track the factors causing the underperformance of cold
calls, and avoid such patterns in the future. We created the MVP for this solution that provided
valuable outcomes for the customer and created opportunities for the company to optimize
their cold calling approach. Now we’re implementing this full-scale solution.
12
Use cases of machine learning in insurance
Use-case examples:
More accurate pricing models, which discern patterns from data, integrate extra
sources and information, and notice trends and demands at the early stages
More dynamic prices that respond to monetary policies and market fluctuations
According to the company's statistics, most of the car insurance policyholders that get into
road traffic accidents cause minor accidents, while only around 1% of accidents are classified
as large-loss cases with huge payouts.7 AXA’s R&D team decided to apply deep learning to
predict if a driver may cause a large-loss case during the insurance period.
The company identified 70 parameters and risk factors, including age range of the driver,
annual insurance premium range, previous accidents, etc. They entered these features into a
neural network and were able to predict large-loss traffic accidents with 78% accuracy.
This approach helped the company to build a strategy to optimize prices for car insurance
policies by offering the possibility of real-time analytics and dynamic pricing at a point of sale.
PPA Limit
Region
Previous accidents
Source: Schematic visualization of AXA's deep learning model for insurance price optimization
13
Use cases of machine learning in insurance
Use-case examples:
Swiss Life
In their research8, Swiss Life insurance company investigated opportunities to predict churn in
terms of a customer buying back their life insurance.
In order to accomplish this task, the research team used an anonymized excerpt of the
historical data about more than 150 000 customers and their past or ongoing contracts,
including life insurance, pension insurance, health insurance, etc.
Initial modeling has shown that just using the customer's description data does not entail the
relevant information for predicting early contract termination. So the research team has
incorporated additional time-series data into the training dataset, including data about
changes in the policy or its components related to buying a house, marriage, birth of a child,
etc. They also used the TF/IDF representation for compiling time-related features of the data
set, as it allowed them to gain significant improvement in terms of ML model's prediction
accuracy.
As a result, ML solution built on top of the SVM algorithm has shown striking prediction
accuracy of 99.7%. This allows the company to discover customers with a high risk of churn
and take marketing actions to improve customer relationships.
14
Use cases of machine learning in insurance
ML-powered data extraction allows insurers to forget about manual re-typing. When paired with
computer vision, it can also accurately render every single pixel and translate the info.
Use-case examples:
Faster customer onboarding and claims settlement since the information about the
customer is extracted automatically
Tokio Marine
Tokio Marine uses an ML-based OCR service to handle claims. The tool is cloud-based and
helps the company to process handwritten claims by “reading” the documents and submitting
them to the existing system. Its recognition rate is over 90%, which allowed the company to
reduce human error by 80% and processing time by 50%.1
15
Use cases of machine learning in insurance
ML-based intelligent underwriting can help insurers better analyze risks, provide more accurate
pricing, and process applications faster. With ML and deep learning, the underwriting process
time is reduced to a few minutes (instead of hours or days) because most activities are
automated and data-driven.
Use-case examples:
Improved coverage recommendations that align with customer needs and risk profiles
Progressive Insurance
Progressive Insurance is using telematics to collect data from drivers and improve the accuracy
of risk assessment. For example, the car insurer buys the driving data from the Snapshot
mobile app, which provides the insurer with billions of data monthly. Based on this, the insurer
can assess risks better and offer discounts to less risky drivers.
16
Use cases of machine learning in insurance
Since ML algorithms work great for anomaly detection and classification of large datasets,
machine learning is a good fit for fraud detection and prevention. An ML system seeks patterns
and analyzes consumers’ behaviors. As soon as it detects any abnormal activity, it warns the
insurer immediately.
Block
Review
Use-case examples:
Easy detection of inconsistencies in the submitted claims-related files to find fake and
duplicate claims
Use of alternative data sources, such as public data or IoT, to improve fraud detection
results
Image recognition as an additional filter for fraud detection at the submission stage
Anadolu Sigorta
The Turkish insurance company10 tells about their ROI improvement by 210% in one year after
they switched to an ML-based predictive fraud detection system. The ML model was
developed based on the 3-year historical claims data provided by the insurer plus enhanced
with 260 business rules for fraud scoring. ML introduction has saved the company $5.7
million.
ZhongAn Technology
ZhongAn Technology, a Chinese insurer, has employed image recognition to mitigate claims
fraud. The company asks users to upload a photo of their cracked phone screen. Then ML
algorithms help the insurer understand whether the actual damage took place.
17
Use cases of machine learning in insurance
The case study by Insurance Thought Leadership11 reveals that claims settlement usually takes
around 11.5 minutes, from submitting the receipt to updating or closing the record. With an ML
model in place, this time shortens to three minutes. This means a 74% reduction in time needed
for insurance claims management and an 80% increase in productivity. The same case study
confirms that ML can manage claims with 80% of accuracy. The only thing that affects accuracy
refers to database limitations and spelling errors. However, since an ML solution tends to “learn”
over time, insurers should expect fewer errors in the long run.
1 2 3 4
Use-case examples:
Grinnell Mutual
Grinnell Mutual uses computer vision to assess damage in property claims. The company
offers aerial intelligence for roof inspection instead of involving many parties (insurance
agents and roofing contractors). An insurance agent then uploads the pictures from drones to
an ML-based system and gets the claim processed in a few days rather than spending weeks
or months for damage estimation.
1. Goal definition
Before an insurer moves forward with building an ML solution, they should understand the
existing problem and define the value they’re expecting to get from the solution, ideally including
the measurement criteria. Data scientists that the company will work with will need this
information to understand what type of ML problem they are going to solve: classification,
regression, clustering, or the combination of any:
Aims to evaluate the relationship between the variable and the data
values that influence it. In simple words, the outcome of regression
Regression always includes a number. Taking the same example with a churn
model, it could be the period when the customer is projected to leave
the company.
Groups the unlabelled dataset. Let’s say the insurer has no idea about
the characteristics that define customers that are about to leave the
Clustering company. The algorithm can divide the customers into clusters
(churners and non-churners) on its own.
19
Path to ML implementation
2. Data collection
The next step will be to decide what sources to use for data collection and actually gather this
data. In ML, data matters a lot, and the quality and relevance of the insurer’s data will directly
affect the results that the ML models produce.
Most insurers would use historical datasets of their insurance claims as a backbone of data. But
you can also think about extra data sources, such as public records, demographic and geospatial
data, customer reviews, and so on. Although the general rule states “the more data, the better,” it
is also important to make sure the quality of data is good — the data science team will gladly
help you with this.
Even if the company doesn’t have enough historical data, they shouldn’t worry. Experienced data
scientists can help insurers with data collection, for example, by producing synthetic data or
advising on data collection, data storage, and labeling.
3. Data preparation
Data scientists will also need to prepare the raw data and convert it into a suitable format for an
ML system to digest. They should structure data points according to the same logic and get rid
of any inconsistencies. Pre-processing could include:
All Features
Feature Selection
Final Features
20
Path to ML implementation
Gradient
Boosting
Random
Forest
Decision
Trees
K-means
Neural
Networks
Reinforcement
Learning
Computer
Vision
Clustering
Analysis
Natural
Language
Processing
(NLP)
Logistic
Regression
Also, one should know that data scientists usually test several models before choosing ones that
are the best candidates to solve the business problem. Building an ML model engages multiple
iterations: specialists have to train the model, tune parameters, evaluate, and monitor
performance. They stop only when they notice that the model makes the most accurate
prediction based on the training data.
21
Path to ML implementation
After the model is trained, the team needs to evaluate its results. The primary goal of testing is to
check that the logic learned by the machine remains consistent. Insurers widely apply A/B
testing as an efficient approach to make sure the new solution performs well and brings the
desired outcomes.
Dataset
Pre-processing Data
Test set Trained Model
Output
6.Personnel education
Personnel education is also a critical part of a full-scale ML solution. At this stage, the data
science team, together with the insurer, teaches the personnel how to use the newly built
solution in the most efficient way. The HRM department should also make sure that employees
understand that the ML-powered solution is there to help them solve their day-to-day tasks
more productively and reduce the manual work, rather than replace the workers.
22
Create the most eicient ML-powered
solutions with Intelliarts
As evident from the six steps we’ve covered, the success of an ML-powered project largely
depends on the proficiency of your data science team. Intelliarts is a trusted partner for many
businesses across domains. We build end-to-end data-driven and ML-powered solutions and
help our partners with digital transformation, innovation, and expansion of engineering
capabilities.
The company has vast domain expertise in the insurance sector. This allows our engineers to
become involved in the project faster and naturally contribute to its outcomes. Having solved
different business challenges in insurance, we know the nuances of the industry, can provide
value from the business perspective, and help to evaluate risks better.
Strong
Focus on
AI/ML Domain Full-cycle CRISP-DM
business
engineering knowledge development methodology
outcomes
team
23
ML-powered solution to improve
Case Study
lead quality and agent
eiciency
Business challenge:
Our customer (under NDA) is one of the fastest-growing health tech insurers. The company
specializes in selling Medicare-eligible insurance coverage by working as an insurance broker.
They’re a big supporter of disruptive innovation and always seek ways to improve business
operations.
The insurer has contacted us to improve sales effectiveness, and after a series of the first
meetings with the customer, we came to an agreement that the best way for them to convert
leads into sales will be through:
Solution:
To meet these demands, the Intelliarts team has built the ML-powered solution composed of two
parts. The solution aimed at helping the customer detect regions with the most promising leads
(1) while also assigning them to high-performing insurance agents (2) to raise the chance of a
successful sale.
However, the preliminary research proved that the team didn’t have enough sales data for
quality analysis at the zip code level. So, we moved to the data analysis based on a destination
sectional center facility (SCF). As a result, we’ve gotten two key insights into the lead quality:
the conversion rate was a bit bigger in sub-urban areas (1); there was no linear dependency
between AGI (Adjusted Gross Income) and conversion rate (2).
0,16 0,25
Conversion rate
Conversion rate
0,14
0,20
0,12
0,10 0,15
0,08
0,06 0,10
0,04
0,05
0,02
0,00 0,00
Rural Suburbs City Very low Low High Very high Medium
Density_rank Agi_rank
24
ML-powered solution to improve lead quality and agent eiciency
To detect SCF codes with high conversion, our data engineers labeled SCF as high- and
low-performing based on conversion and built a PCA visualization. We then built the ML
classification model using decision tree and clustering analysis techniques, which should help
the customer choose the most profitable SCF regions for lead selection.
Principal Component 2
6
4
8
6 2
4 0
-4
2 -2 -2
0
PC 3
PC 2
0
2 -4
-2 4
-4
-6
15 0 -5 0 5 10 15
Principal Component 1
Color: 1 0 Color: 1 0
To increase the efficiency of insurance agents, we built an algorithm that could detect agent
efficiency and connect the top-performing agents with the most promising leads.
Based on the collected data about agent calls, we created a statistical model that predicted an
agent conversion rate per week and contrasted it with the actual conversion rate. According to
the results, the agent could be promoted to a higher tier and, thus, receive more quality leads
and earn more. This motivated agents to work better and increase the chance of a successful
sale.
Business value
The ML-powered solution we’ve built helped the customer establish a better lead selection
strategy. Now the insurtech company spends much less time on SCF with possibly low
conversion and focuses most sales efforts on those high-performing ones. Respectively, the lead
quality has already increased by 5%.
Also, agent efficiency has grown by 3% because agents are more motivated to work to be able to
earn more. Altogether, these changes have improved overall conversion rates and maximized
company profitability.
Read more: Case Study on Improving Insurance Lead Quality and Agent Efficiency
25
Looking ahead: Transforming your
insurance business with ML
No doubt data is the single most critical differentiator for insurance organizations. Today,
information is being used for much more than traditional historical analysis. Insurers are
leveraging data to attract and retain customers with more customized offers and personalized
messages; assess and mitigate risks; optimize premiums; process claims; and improve fraud
detection.
With competition heating up for insurance companies, becoming a data-driven business seems
the only reasonable solution. And one of the most reliable ways to do it is to transform your
insurance business with machine learning. Here are also a couple of best practices you can
consider when deciding to add value to your insurance company with the help of ML:
Intelliarts offers insurers a strategic partnership — with our vast expertise and domain knowledge,
we can help you build full-cycle ML-powered solutions and take your business to the next level.
Regardless of whether you’re just getting started with ML or want to innovate your business
operations, email us and let’s partner.
26
Endnotes
Ravi Malhotra and Swati Sharma, “Machine Learning in Insurance”, Accenture,
1 https://www.accenture.com/_acnmedia/pdf-84/accenture-machine-leaning-insurance.pdf
3 Forrester, “Unlock the Power of Data to Transform Your Business,” IBM, 2019.
“Hype or Reality: The State of Artificial Intelligence and Machine Learning in the
5 Insurance industry”, LexisNexis Risk Solutions, 2019.
Henry Burton, “AXA Predicts Large Loss Claims with 78% Accuracy”,
7 https://www.linkedin.com/pulse/momentum-gathers-axa-turns-machine-learning-optimis
e-henry-burton/
Katharina Morik and Hanna Köpcke, “Analysing Customer Churn in Insurance Data – A
8 Case Study”,
https://sfb876.tu-dortmund.de/PublicPublicationFiles/morik_koepcke_2004a.pdf
Tony Harrington, “Disrupting the insurance landscape through artificial intelligence and
deep learning”, ITIJ,
12 https://www.itij.com/latest/long-read/disrupting-insurance-landscape-through-artificial-in
telligence-and-deep-learning
27
About us
ai.intelliarts.com
If you’re ready to turn your
insurance company into a
data-driven business, let’s have
[email protected]
a chat to discuss
Book a call