Beyond The Hype: A Guide To Understanding and Successfully Implementing Artificial Intelligence Within Your Business
Beyond The Hype: A Guide To Understanding and Successfully Implementing Artificial Intelligence Within Your Business
White Paper
Contents                                                                               Introduction
                                                                                       To implement AI within your organization successfully you
	2	Introduction
                                                                                       need to understand what AI is, where it currently stands,
	3	What is AI?                                                                         what value can provide to businesses and how it can be
                                                                                       successfully adopted. This white paper is written for
	5	How does an AI system learn?
                                                                                       business leaders looking for practical advice on how to
	5	What has driven the development of AI?                                              leverage artificial intelligence (AI) for their organizations.
	7	Where are we today with AI?                                                         The fact that AI has been hyped doesn’t take away from its
	8	What are the areas in which AI provides the most                                    capabilities as a real value driver. Heavy investments have
   value today?                                                                        been made in AI across multiple industries; the Chinese
                                                                                       government even made it one of their core sectors. And
	8	What are some examples of successful implementations?                               companies like IBM, Microsoft, Google and Amazon are
	9	Best practices to successfully implement                                            leading the pack when it comes to utilizing data and AI.
   AI within an organization
                                                                                       AI provides enormous amounts of value in multiple
1
	 1 	 Pitfalls of AI implementation	                                                   industries. Because of its high value potential, many
14	Summary                                                                             companies have been scrambling to implement AI within
                                                                                       their organizations. And the projects, when implemented
15	 IBM Services                                                                       properly, have shown significant returns and improved
18	 About the authors                                                                  competitive edge. If your company hasn’t started
                                                                                       implementing AI, it may lag behind its competitors, so it’s
18	Acknowledgements                                                                    critical to evaluate what AI can do for your organization.
1
	 8	Appendix                                                                           But this doesn’t mean you should hire data scientists or
                                                                                       acquiring data science solutions without a clear strategy.
                                                                                       Implementing AI should be a carefully thought out process.
                                                                                       Otherwise it may turn out to be a costly failure.
What is AI?
When people hear AI they often think about sentient
robots and magic boxes. AI today is much more mundane                                      Artificial intelligence
and simple—but that doesn’t mean it’s not powerful.                                  A program that can sense, reason,
Another misconception is that high-profile research                                            act and adapt.
projects can be applied directly to any business situation.
AI done right can create extreme return on investments                                       Machine learning
                                                                                  Algorithms whose performance improve
(ROIs)—for instance through automation or precise
                                                                                as they are exposed to more data over time.
prediction. But it does take thought, time and proper
implementation. We have seen that success and value                                           Deep learning
generated by AI projects is increased when there is a                                  Subset of machine learning in
                                                                                        which multipayered neutral
grounded understanding and expectation of what the
                                                                                           networks learn from
technology can deliver from the C-suite down.                                              vast amount of data.
Machine learning
Machine learning is enabling a machine to learn from data
                                                                                                       Hidden              Hidden         Hidden
without explicitly programming it with rules, because it can                                           layer 1             layer 2        layer 3
learn from the data it’s given. In essence, you could build                             Input layer
an AI consisting of many different rules and it would also be                                                                                       Output layer
able to be AI. But instead of programming all the rules, you
feed the algorithm data and let the algorithm adjust itself to
improve the accuracy of the algorithm. Traditional science
algorithms mainly process, whereas machine learning is
about applying an algorithm to fit a model to the data.
Examples of machine-learning algorithms that are used a
lot and that you might be familiar with are decision trees,
random forest, Bayesian networks, K-mean clustering,
neural networks, regression, artificial neural networks,
deep learning and reinforcement learning. Artificial neural
networks and deep learning have recently become more
                                                                                       Figure 2: Deep neural network with five layers.5
common machine learning algorithms.
                                                                                        Currently, the larger the neural network and the more data
Implementation examples would be predicting stock market
                                                                                        that can be added to it, the better the performance a neural
prices or predicting whether a customer will churn from
                                                                                        network can provide. DL is very powerful, but it has a couple
your company.
                                                                                        of drawbacks. It’s almost impossible to determine why the
                                                                                        system came to a certain conclusion. This is called the
Deep learning
                                                                                        “black box” problem, though there are now many available
Deep learning (DL) is a relatively new set of methods that
                                                                                        techniques that can increase insights in the inner workings
is changing machine learning in fundamental ways. DL isn’t
                                                                                        of the DL model. Also, deep learning often requires extensive
an algorithm per se, but rather a family of algorithms that
                                                                                        training times, a lot of data and specific hardware
implements deep networks (many layers). These networks
                                                                                        requirements, and it’s not easy to acquire the specific
are so deep that new methods of computation, such as
                                                                                        skills needed to develop a new DL solution to a problem.
graphics processing units (GPUs), are required to train
them, in addition to clusters of compute nodes.
                                                                                        In conclusion, there is no one algorithm that can fit or solve
                                                                                        all problems. Success really depends on the problem you
DL works very well with large amounts data, and whenever
                                                                                        need to solve and the data you have available. Sometimes
a problem is too complex to understand and engineer
                                                                                        a problem will need a hybrid approach, where you use
features (due to unstructured data, for instance). DL almost
                                                                                        multiple algorithms to solve the problem. Each problem
always outperforms the other types of algorithms when it
                                                                                        requires extensive investigation of what constitutes a
comes to image classification, natural language processing
                                                                                        best-fit type of algorithm. You should take into account
and speech recognition. An example would be recognizing
                                                                                        transparency and how much data, capabilities and time
melanoma or conducting machine translation, which was
                                                                                        you have, because some algorithms take a long time to run.
not possible using previous techniques.
                                                                                                                   IBM Services   5
How does an AI system learn?                                    In the appendix section of this paper, we have a section
To illustrate how AI system learning works, we’ll next          that addresses the different types of algorithms’ learning.
describe what a data scientist does, and what a machine         There are four main types of learning; supervised,
does in the process of developing AI solutions. Later, when     unsupervised, transferred and reinforcement learning.
we discuss the pitfalls of implementing AI, we’ll explain       While unsupervised, reinforcement and transfer learning
what types of skills you’ll need to successfully build a data   show great potential, supervised learning is currently the
science team.                                                   type that can provide the highest economic value.
—— The evolution of data: A factor contributing to the 1958 ● IBM 305, the first hard drive, 5 MB
   massive adoption of AI is the exponential growth of                                         1969 ● Backpropagation, one of the most important areas of a neural
                                                                                                          network, is proposed
   available data. With the introduction of the Internet,
   social media, proliferation of sensors and smart devices,                                   1970 ● IBM 1330, 100MB
   and the fact that data storage became cheaper, it became
                                                                                               1974 ● Intel produces second generation general purpose chips
   more accessible than ever before.                                                                      First AI winter, the belief in machine learning and AI had dropped
—— The evolution of algorithms: Algorithms have been                                    1974-1980 ● after multiple unsuccessful experiments combined with insufficient
                                                                                                          computing power, network capabilities and database capacity
   around since we could write. Recently, the development
                                                                                               1985 ● IBM 0665 hard drive, 40 MB. But much smaller than the 1330
   of more advanced algorithms has helped AI become
   more powerful and efficient.                                                                1989 ● First convolutional neural network developed
—— The evolution of computing: Another major factor in                                                    (used a lot in image recognition)
   AI’s current success is its computing power. Back when                                      1991 ● The internet is open for the public
   AI was just beginning to be developed, the computing
                                                                                               1992 ● First versions of natural language solutions set up.
   power was minimal. Computers nowadays can take much
   more data and heavier algorithms than in the 1950s.                                         1997 ● IBM’s deep blue defeats Kasparov in Chess
2014 ● There are more mobile devices than humans in the world
                                                                                               2018 ● Project debater of IBM shows ability to process very large data
                                                                                                          sets, including millions of news articles across dozens of subjects,
                                                                                                          and then turn snippets of arguments into full flowing prose—a
                                                                                                          challenging task for a computer
                                                                                        Figure 3: AI timeline.7
                                                                                                                      IBM Services   7
                                                                   Broad AI
                                                                   What we see today in self driving cars is still defined as narrow
     Narrow                       Broad            General AI
                                                                   AI. You can see it as a collection of narrow AI systems that
                           (AI for Enterprise)
                                                                   make decisions. This is what we call broad AI. Another
                                                                   example of broad AI includes a system within a bank that
                                                                   analyzes the balance sheet of corporate customers to
                                                                   recommend the best currency hedging strategy. Another
                                                                   example would be a system that supports engineers who
                                                                   work on complex maintenance tasks on a platform in the
   2010-2015                   We are here       2050 and beyond
                                                                   middle of the Atlantic Ocean. Broad AI is about integrating
                                                                   AI within a specific business process of an enterprise where
Figure 4: The three categories of AI.
                                                                   you need business- and enterprise-specific knowledge and
                                                                   data to train this type of system. These tasks are very different
                                                                   from the narrow AI used in the consumer world because the
Where are we today with AI?                                        data and knowledge available in the enterprise are much more
We can split the term AI into three categories: general,
                                                                   limited in terms of volumes, very industry specific and in most
broad and narrow. General AI encompasses all the human-
                                                                   of the cases private (for example owned by an enterprise).
like capabilities, whereas narrow AI can only do a certain
                                                                   This is what we believe is currently the most valuable type of
task—and it can do it quite well—but narrow AI can’t
                                                                   AI currently for the enterprise.
transfer its knowledge to different sorts of problems.
                                                                   General AI
Narrow AI
                                                                   General AI is far from reaching its potential. The expectations
Narrow AI is focused on addressing very focused tasks
                                                                   are that it will take at least another couple of decades. General
(such as buying a book with a voice-based device) based
                                                                   AI refers to machines that can perform any intellectual task
on “common knowledge.” That’s the reason narrow AI is
                                                                   a human can. Currently AI does not have the ability to think
scaling very quickly in the consumer world where there are
                                                                   abstractly, strategize and use previous experiences to come
a lot of common tasks and data to train these systems.
                                                                   up with new creative ideas as humans do.
Narrow or weak AI is, contrary to the naming, very powerful
at routine jobs.
                                                                   Some people think we will have general AI in a couple of
                                                                   decades others like IBM’s Rob High and Google’s Peter Norvig
                                                                   believe we don’t need broad AI at all.8
8    Beyond the hype: A guide to understanding and successfully implementing artificial intelligence within your business
After having thought this through, you’ll probably have                                Start small and scale quickly
many different use cases. At this point, it’s critical to                              —— Start with minimal valuable products (MVPs)
prioritize these cases into a transformation roadmap that                                 In this phase you want to bring in experts to help quickly
covers both a long-term vision as well as concrete feasible                               develop solutions to your business problems. This can
quick wins. Next, you should think about what data you                                    only be done once the before-mentioned steps are
have available. To solve most of the problems with AI you                                 completed and the business is ready organizationally
need to have relevant data. Without data, AI will not provide                             and technologically. This also means that the experts
any value. For many companies it’s a task in and of itself                                you bring in should be both business and technologically
to keep track of the type of data—as well as where it’s                                   savvy. A good duration for a MVP is normally between two
stored and in what way. Often the first step will be to                                   and three months. Our experience shows that starting
understand the data you currently have and the type you                                   with large-scale, complex and very long AI implementation
need to implement your AI case.                                                           projects normally lead to failure.
                                                                                       —— Set understandable key performance indicators (KPIs)
Establish AI capabilities and skills                                                      To make sure that a project will succeed, you need to
AI requires a completely new set of capabilities and skills                               define KPIs that are understandable for your business—
which may be in short supply in your organization. To build                               including employees and other stakeholders. These KPIs
the required in-house AI skills, it’s important to plan,                                  will help you evaluate whether a project is successful. In
establish and grow a dedicated Center of Competence or                                    general, we suggest taking a second look at these KPIs
leverage the IBM Garage concept to perform in partnership.                                after an appropriate duration to decide whether the
Not only this dedicated team is important, but you also                                   project is successful or if you should discontinue it. If
need to assure the right mindset and way of working in the                                your business can’t pinpoint the right KPIs to measure
rest of the organization. It’s critical that these functions                              success, the project is too complex.
occur in conjunction with developing and integrating an AI                             —— Roll-out through company (culture)
platform within your current IT architecture to implement                                 Once agreement is reached about which projects would
and scale AI.                                                                             be worth working on, it’s time to implement the MVP
                                                                                          within your company. It’s important that the way you
                                                                                          implement it is looked at from both the business and
                                                                                          the technical side.
                                                                                                                   IBM Services   11
managers support the project. Implementing AI through the                                 can scale the algorithms
organization can be a long process, and without support of                             —— Data analysts who can process the outcome
the right people there’s a higher probability that the project —— Statisticians to help ensure quality results
will fail. What you might see when this problem occurs is —— Software engineers to turn all you’ve created into
that your employees might not put enough time into problem                                something that can be used by the masses—be it your
definition and subject knowledge sharing.                                                 customers or your employees
Recommendation: Align the right people before you start                                While these roles don’t all need to be filled by individual
in a “garage” concept. The key stakeholders should be                                  employees, it’s essential to have all these skills in house.
identified and should bring input and willingness to the
table. Create buy-in and support with employees, other                                 The second option is easier and quicker to implement. In
stakeholders, management and C-suite.                                                  this case you’d make use of the capabilities of an external
                                                                                       party such as through the IBM Garage offering, which can
                                                                                       give you the ability to use these capabilities without having
                                                                                       to set up a complete internal department.	
                                                                                                                IBM Services   13
Summary                                                                                There is no one magic algorithm that can solve all your
AI has the potential to bring a lot of value to your company                           problems: Before implementing AI, you should focus on
if thought through and implemented properly. The authors                               the problems you have and how AI can help you solve
of this paper hope we’ve made it clear how you can achieve                             them. Next, you need to check whether the data you need
this goal. We’ve discussed what AI is what it can do for your                          to solve your problems is available. Often a hybrid between
organization, how it should be implemented and what pitfalls                           algorithms can be the right fit, depending on the problems
you should avoid once you’ve decided to implement AI.                                  you want to solve.
To sum up this paper’s key points:                                                     Supervised learning provides the most economic value:
                                                                                       Supervised learning is currently the most applied form of
AI not something of the future, it is real today, and it fuels                         learning and provides the most value for a wide variety of
the fourth Industrial Revolution: As you will see later in this                        applications. When beginning to implement AI in your
paper, there are many cases where AI is being successfully                             enterprise, you’ll likely be working with supervised learning.
implemented and driving competitive advantage. Companies
such as IBM, Nvidia, Twitter, Delta Airlines, Walmart, Netflix,                        In the next section of this paper, we’ll talk about how IBM
Spotify and Kreditech show that their data-driven approach                             can play a role in implementing AI within your organization.
produces extremely valuable business models. But for many                              IBM not only has many years of experience with these types
companies, failing to implement a data-driven strategy can                             of projects but has also been a pioneer in the AI arena. Based
lead to lost market share.                                                             on its experience and knowledge, IBM can help companies of
                                                                                       all sizes implement AI solutions.
Properly implementing AI requires careful evaluation and
planning: You need to evaluate how AI can help solve your
problems, where your company is right now in terms of
capabilities and what needs to be done before you can
properly implement AI to address the problems you want
to solve or the opportunities you want to take.
                                                                                                                   IBM Services    15
IBM helps you get prepared first, in a fast, pressure-cooker                           Once you are prepared for your garage, IBM uses design
environment. This helps you to kickstart your AI journey.                              thinking to identify and define use cases. All use cases are
IBM works with you to deliver an assessment of your                                    conceived based on the notion of creating business value.
current AI capabilities.                                                               In IBM’s experience, design thinking is the most effective
                                                                                       way. This so-called “demand generation” for the garage
                                                                                       should be on-going, creating the backlog and setting the
                                                                                       capacity planning for the garage.
                                                                                                                  IBM Services   17
Once the first use cases have been agreed, the garage            In the “Transform” phase, IBM collaborates with experts,
provides the playground for deep technology, accommodating       data and emerging technologies, using accelerators to
enterprise-scale ways of working—such as design thinking,        build minimum, viable products into production and realize
agile, DevOps and lean IT. Innovation starts with a Proof        business outcomes and customer adoption within weeks.
of Value (POV): evaluating the business and technical
feasibility of the use case in a time-boxed agile manner.        The third and last phase is the factory “Thrive” phase, which is
Agile development is at the core of what IBM does in the         designed to rapidly scale solutions while establishing methods
garage. The goal is to succeed or fail quickly, with speed       and new ways of working across your enterprise that can last
to scale. Flexibility and speed are key! A successful POV will   a lifetime. The thrive phase works as a managed service to
move into a pilot implementation. The IBM Garage leverages       embed and maintain data and AI solutions at scale. Using a
IBM’s extensive asset library to more efficiently validate use   factory-like approach, you put into production something you
cases and accelerate transformative change.                      want to be consistent in design and quality, and that doesn’t
                                                                 require innovation in execution. IBM focuses on successful
Organizations should be prepared to step out of their comfort    delivery of projects aligned to the roadmap and AI platform
zones and think differently. Thinking in a data-driven and AI    technology and helps ensure a clearly defined governance.
approach requires a shift in the way we see. It takes some
effort to change our ways of thinking. This is where digital     Using this method, IBM has been successful many times
change management plays a role. You also need to consider        and wants to keep improving the method with each project
the buildup of AI capabilities in your organization. IBM         delivered. IBM is one of the frontrunners of AI and AI
offers various forms of training and enablement ranging          implementation, and thrives on applying its knowledge and
from Hackathons, planned learning universities for               experience to help improve the world.
upskilling and an AI learning academy.
18   Beyond the hype: A guide to understanding and successfully implementing artificial intelligence within your business
Supervised learning is a learning method that maps an input       Reinforcement learning is a class in and of itself; it is not
to an output using human data and feedback to improve. A          given a specific goal, but rather learns from trial and error.
data set is provided with associated correct labels to the        The main concept is that instead of a specific action being
data. An example would be pictures of animals in which all        labelled, there is a sequence of actions that is associated
pictures were correctly labelled as the animal in the pictures.   with a reward. If we take a maze as an example, the
Supervised learning trains based on historical data and           algorithm will be rewarded when it comes closer to its goal
builds rules that can be applied to predict future problems.      and be penalized every time it gets stuck or moves away
The better the data set, the better the output.                   from the completion. A recent example of reinforcement
                                                                  learning is AlphaGo, where Google trained a deep
You may use this type of learning when you want to classify       reinforcement learning network with many examples of
or predict outcomes. With regression, you are predicting a        the game Go, eventually making its performance superior
continuous value (“How much will the stock price be?”).           to that of even the best human. This trick is not new, since
With classifying, you are assigning a label to an input (“Is      it was used in TD-Gammon in 1992, created by Gerald
this picture a man or a woman?”). Other examples would            Tesauro at IBM. TD-Gammon was a backgammon-playing
be using speech recognition to examine the sentiments of          program that reached the performance of the best human
people calling your customer service center, or image             players at the time.
recognition to define products in a warehouse so they
could be properly sorted.                                         Reinforcement learning is not currently widely used, but it
                                                                  does have high potential when developed more extensively.
Unsupervised learning occurs when the algorithm is not            You would need a lot of data (which is not always the case
given a specific “wrong” or “right” outcome. Instead, the         and takes time to process) to be able to make reinforcement
algorithm is given unlabeled data. Unsupervised learning          learning work.
is often used when you want to classify data but don’t know
how to do so. For example, you’d likely use unsupervised          Transfer learning is when your algorithm learns to solve
learning if you had a set of customer data and you didn’t         one problem, takes information from this problem and then
know what kind of classes they would fit in. An unsupervised      solves a new problem with that information. This currently
learning algorithm can find natural groupings of similar          happens a lot with image recognition. Pre-trained neural
customers in a database and the user can then describe            networks are used to solve new problems.
and label them.
© Copyright IBM Corporation 2018                                          	1	www.weforum.org/agenda/2016/01/the-fourth-industrial-revolution-
                                                                             what-it-means-and-how-to-respond
IBM Corporation
New Orchard Road                                                          	2	www.quora.com/What-is-artificial-intelligence-What-are-task-
Armonk, NY 10504                                                             domains-in-AI
IBM, the IBM logo, ibm.com, and IBM Watson are trademarks or              	4	www.investopedia.com/terms/a/artificial-intelligence-ai.asp
registered trademarks of International Business Machines Corp.,
registered in many jurisdictions worldwide. Other product and             	5	neuralnetworksanddeeplearning.com/chap5.html
service names might be trademarks of IBM or other companies.
                                                                          	6	datasciencedegree.wisconsin.edu/data-science/what-do-data-
A current list of IBM trademarks is available on the web at
                                                                             scientists-do
www.ibm.com/legal/copytrade.shtml.
                                                                          	7	www.interaction-design.org/literature/book/the-social-design-of-
Microsoft, Windows, Windows NT and the Windows logo are
                                                                             technical-systems-building-technologies-for-communities/the-
trademarks of Microsoft Corporation in the United States, other
                                                                             evolution-of-computing
countries, or both.
                                                                          	8	www.forbes.com/sites/blakemorgan/2017/06/13/ethics-and-artificial-
This document is current as of the initial date of publication and may
                                                                             intelligence-with-ibm-watsons-rob-high/#72b4a0e3260e
be changed by IBM at any time. Not all offerings are available in every
country in which IBM operates.                                               www.forbes.com/sites/gilpress/2016/12/21/artificial-intelligence-
                                                                             pioneers-peter-norvig-google/#6ecd8a2d38c6	
The performance data and client examples cited are presented for
illustrative purposes only. Actual performance results may vary depending 	9	www-01.ibm.com/common/ssi/cgi-bin/
on specific configurations and operating conditions. THE INFORMATION           ssialias?htmlfid=KUM12390USEN&, page 67
IN THIS DOCUMENT IS PROVIDED “AS IS” WITHOUT ANY WARRANTY,
                                                                          1
                                                                          	 0	 homes.cs.washington.edu/~marcotcr/blog/lime
EXPRESS OR IMPLIED, INCLUDING WITHOUT ANY WARRANTIES OF
MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND ANY
WARRANTY OR CONDITION OF NON-INFRINGEMENT. IBM products are
warranted according to the terms and conditions of the agreements under
which they are provided.                                                                Please Recycle
39019539-USEN-00