ES Evolution PDF
ES Evolution PDF
© Ovidiu S. Noran
Table of Contents.
1 Introduction....................................................................................................1
1.1 Artificial Intelligence and Expert Systems. ........................................................1
1.1.1 Artificial Intelligence. .......................................................................................... 1
1.1.2 Expert System. .................................................................................................. 1
1.2 Motivation.........................................................................................................2
1.3 The objective of this paper. ..............................................................................2
5 Conclusions. ................................................................................................23
6 References. .................................................................................................23
The Evolution of Expert Systems Introduction
1 Introduction.
1
The Evolution of Expert Systems Introduction
Expert systems are suitable for solving the ill-structured (or not entirely defined) type
problems, which are usually difficult or impossible to solve by 'conventional' methods
(e.g. procedural programming).
1.2 Motivation.
In the development of any tool, method, etc, a question will inevitably be asked: Why
develop and use it ? What is it good for ?
In order to determine the advantages of a generic expert system, we will have to
evaluate it with the only comparison term available: the human expert.
availability: On any suitable computer hardware;
cost: The per-user cost is greatly lowered;
danger: May be used in environments hazardous for humans;
permanence: will last indefinitely (depending on the hardware);
multiplicity: more expert systems may co-operate to increase the level of
expertise;
reliability: provide a second opinion to human experts, mediate opinions;
explanation: the expert system may always explain how it reached the
conclusion;
response: may provide fast or real-time response for critical applications;
emotional: in real-time or emergency situations, the expert system will
always provide an unemotional and complete response;
database: may be used to intelligently access a database (data mining);
tutoring: may be used as an intelligent tutor: explain reasoning, etc.
The most obvious shortcomings are that an expert system has to be developed, trained,
updated and that it cannot capture nor explain 'deep' knowledge (understanding of
underlying causes). Also, unless carefully designed, an expert system may entirely
reflect a single human's expertise, which would defeat most of the advantages
enumerated above. The limitations of the expert systems will be discussed later on in
this paper.
2
The Evolution of Expert Systems Introduction
3
The Evolution of Expert Systems Expert Systems Primer
4
The Evolution of Expert Systems Expert Systems Primer
• The knowledge engineer alters the knowledge base in order to reflect the
human expert's comments.
This process will iterate until the human expert finds the expert system satisfactory for
the envisaged purpose.
Some rule-based systems may display the capability to learn rules by example (rule
induction), creating rules from tables of data. Learning is a complex process, hindered
by ambiguities, duplications and inconsistencies in the human expert knowledge.
5
The Evolution of Expert Systems Expert Systems Primer
unable to scope with. By fully understanding the capabilities and limitations of the
expert system technology, we can make appropriate use of it.
6
The Evolution of Expert Systems Expert Systems Primer
7
The Evolution of Expert Systems Expert Systems Primer
• agenda: the prioritised list of rules whose patterns are satisfied by facts (or
objects) in the working memory;
• (optional) knowledge acquisition facility: a way for the user to directly enter
knowledge in the system, without the need for explicit knowledge coding.
A rule whose patterns are satisfied by the facts are said to be activated, or instantiated.
Given more rules are instantiated at the same time, the inference engine must select
one rule for firing (termed by analogy with a nerve cell - neuron). Like a neuron, once
the rule has fired, it may not fire again for a certain period of time. This behaviour is
aimed at preventing trivial loops and is called refraction.
8
The Evolution of Expert Systems Expert Systems Primer
2.3 Features.
In order to perform within the intended scope, expert systems must contain a set of
desired features. These features must ensure that the expert system is appropriately
accepted and utilised by the intended end users. An expert system that is not applied
and / or acknowledged by the users is a useless exercise even if it is a very good and
comprehensive tool.
9
The Evolution of Expert Systems Expert Systems Primer
10
The Evolution of Expert Systems Expert Systems Evolution
3.1 Timeline.
The expert systems technology is based on a wide background. Fig. 2. 1 presents the
major milestones in the development of the expert systems.
11
The Evolution of Expert Systems Expert Systems Evolution
Fig. 2. 1 Milestones in the Expert Systems history. (Adapted from Ref.1, p.11)
As one can see, the Expert Systems history starts at around 1943 with the Post
production rules. The initial trend was towards intelligent systems that relied on
powerful methods of reasoning rather than making use of domain knowledge. They
were also designed to cover any domain rather than a specialised area.
The shortcoming of this approach was that the machine had to discover everything from
first principles in every new domain. In contrast, a human expert would rely on domain
knowledge for high performance.
3.2 Milestones.
12
The Evolution of Expert Systems Expert Systems Evolution
organised chunks loosely arranged, with links to other related chunks. Sensory input
would provide stimuli to the brain and trigger appropriate rules of long term memory,
which then produce the suitable response. The long term memory is where the
knowledge is stored; the short term memory however, is used for temporary
knowledge storing during problem solving.
Human problem solving must also include a so-called cognitive processor. This is the
equivalent of the inference engine in a modern expert system. The cognitive
processor will try to find the rules to be activated by the appropriate stimuli. If more
than one rule is to be activated, then a conflict resolution must be performed in order
to decide which rule will be activated first (priority assignment).
Newell and Simon have not only managed to create a credible model of human problem
solving, but have also set the basis of the modern rule-based expert systems. The
concept of knowledge chunks brings with it the problem of granularity in the design of
expert systems. More granularity means several chunks of knowledge are blended
together in one rule; less granularity makes an isolated rule difficult to understand.
A. A change of concept.
The General Problem Solver reflected the current trend of the sixties to attempt
producing general-purpose systems that heavily relied on reasoning and not on
domain knowledge. Both approaches have their advantages and disadvantages
though. Domain knowledge may be powerful, but it is limited to a particular domain.
Applying it to another domain involves skill rather than genuine expertise.
In the early seventies it was already obvious that the only way to achieve machines
comparable in performance with human counterparts was to use domain knowledge.
In many cases, human experts rely less on reasoning and a lot more on a vast
knowledge of heuristics (practical knowledge) and experience. When a human expert
is confronted with a substantially different problem, he/she must use reasoning from
the first principles and they are no better in this respect than any other person.
Therefore, expert systems relying solely on reasoning are doomed to failure. The
factors that contributed to the creation of the modern rule-based systems are
presented in Fig. 2. 2.
In 1965, work has begun on DENDRAL, the first expert system, aimed at identifying
chemical constituents. However, DENDRAL was an unique system in which the
knowledge base was mixed together with the inference engine. This made the task of
reusing it to build a new expert system almost impossible.
1972-73 brought the development of the MYCIN system, intended to diagnose
illnesses. MYCIN was a step forward because:
13
The Evolution of Expert Systems Expert Systems Evolution
14
The Evolution of Expert Systems Expert Systems Evolution
Fig. 2. 3 The foundation of the rule based expert systems (Ref.1, p.32)
Artificial Neural Systems (ANS) have tried to solve problems by modelling the human
brain information processing. They represent a different programming paradigm, also
known as connectionism. Connectionism models solutions to problems by training
simulated neurons connected in a network. ANS may well be the ideal front end to
expert systems that require massive amounts of sensor inputs and fast response.
15
The Evolution of Expert Systems Expert Systems Evolution
Since ANSes are modelled after current brain theories, information is represented by
weights. The distributed representation of information is similar to a hologram, where
the lines of the hologram diffract a laser beam passing through in order to reconstruct
the stored image.
Advantages of ANS over conventional computers:
• fault tolerant storage. Removing portions of the net will only determine a
degradation of the quality in the stored data;
• graceful degradation of the quality of the stored image, in proportion to the amount
of net removed;
• data is stored in the form of associative memory (i.e. where partial data is sufficient
to recall the complete stored information);
• nets can extrapolate and interpolate from their stored information. A net can be
trained to seek significant features / relationships in the data. Subsequently, the net
may suggest relationships with the new data.
• nets display plasticity. After the removal of a part of the neurons in the net, the net
can be retrained to its original skill level if sufficient neurons are left;
• ANSes are simple to build and cheap to maintain, mainly because of plasticity.
However, for number intensive tasks or algorithmic solutions, ANSes are not a good
choice.
The origin of artificial neural systems dates back to 1943 (refer paragraph A). However,
a milestone in the connectionist AI approach was 1961, with Rosenblatt's definition of
16
The Evolution of Expert Systems Expert Systems Evolution
perceptron, a new type of artificial neuron. The perceptron consisted of two layers of
neurons, and a basic learning algorithm. It displayed remarkable capabilities for
learning and pattern recognition. However in his model, the weights had to be set
manually.
In 1969, a new milestone occurs with the publication of the book Perceptrons by Minksy
and Papert. The book showed the limitations of the perceptrons as general computing
machines, proving that a perceptron could not recognise the exclusive OR logic
function. The book has had a negative impact on the ANS research. Eventually, new
methods of representing symbolic AI information by frames became popular in the
1970s, but research only continued on a small scale.
In 1982 though, Hopfield's work brought a revival of ANS by introducing the two-layer
Hopfield Net and demonstrating that ANS could solve a wide variety of problems. The
back-propagation net, commonly implemented as a three-layer net, is an example of
ANS that can solve the XOR problem (refer Fig. 2. 5).
Other types of ANS emerged, such as the counter-propagation net, invented by Hecht-
Nielsen in 1986. It was also proven that a three-layer network with n inputs and 2n+1
neurons in the hidden layer could map any continuous function.
The present trend in expert systems is towards the artificial neural nets. Also, there are
masses of commercially available expert systems and shells.
New companies and existing firms have been organised to develop ANS technology
and products. For example, NestorWrite is a program by Nestor Corp., which can
recognise handwritten input and convert it to text.
ANS simulators and hardware accelerator boards (to speed up the learning process)
are also marketed by a number of companies such as Texas Instruments, Synaptics,
Neural Tech, etc.
17
The Evolution of Expert Systems Expert Systems Evolution
Artificial Intelligence itself is advancing rapidly, and with the latest progress in genetics,
the first human druids may not be that far away in time. Such developments may
create totally new social and ethical problems, which may also need to be
investigated.
18
The Evolution of Expert Systems Knowledge Representation in Expert Systems
19
The Evolution of Expert Systems Knowledge Representation in Expert Systems
4.2.1 Productions.
Production rules are commonly used in the knowledge bases of expert systems. A
formal notation for defining productions is the Backus-Naur form (BNF). BNF
represents metalanguage for defining the syntax of a language. While syntax defines
the form, the semantics refers to the meaning of the language. A grammar is a
complete set of production rules that uniquely defines a language.
There is a wide variety of languages that BNF can define, such as natural, logic,
mathematical, computer.
4.2.2.1 Concepts.
The structure of a semantic net may be shown graphically in terms of nodes (objects)
and arcs (links or edges) connecting them. Nodes represent objects, concepts or
situations. The links show the relationships between nodes. If the links are directed
arrows, then the net becomes a directed graph (refer Fig. 3. 2). Relationships give the
knowledge contained in the nodes a cohesive structure about which other knowledge
may be inferred.
20
The Evolution of Expert Systems Knowledge Representation in Expert Systems
4.2.3 Schemata.
All the knowledge in a semantic net is contained in the links and nodes. Therefore, a
semantic net is an example of shallow knowledge structure. In contrast, a deep
knowledge structure may explain why something occurs via causal knowledge. In a
human expert's case, deep knowledge is usually called upon when causal knowledge
fails to solve the problem.
Schemas are used to represent more complex knowledge structures. Conceptual
schemas are abstractions in which specific objects are classified by their general
properties. Schemas are used to focus on the general properties of an object without
being distracted by irrelevant details.
Schemas have an internal structure to their nodes, in contrast with the semantic nets. A
semantic net is like a data structure in computer science. The schema is more like a
data structure in which nodes contain records that can further contain records, data or
pointers to other nodes.
4.2.4 Frames.
Two types of schemas have been used in many AI applications: the frame and the
script (a time-ordered sequence of frames)
Frames are very suitable for representing objects typical to a particular situation
(stereotypes). Commonsense knowledge is very difficult to master by the computers,
and frames can be of great help. While semantic nets are used to represent broad
knowledge, frames are efficient at representing a narrow subject containing much
default knowledge. Special purpose language have been designed for frames, such
as FRL, SRL, KRL, etc
An analogy can be made between a frame and a record in Pascal or an atom in LISP.
The correspondent frame elements to the fields and values of the record are the slots
and slot fillers of a frame. The fillers may be values (e.g. a property) or a range of
values (a type slot). Slots may contain attached procedures which can be if-needed
(when a filler value is needed but none present), default (expectations of a situation),
or if-added (when a value is added to a slot).
Sophisticated frame systems have been used in discovering mathematical concepts
and describing mathematical understanding in linear algebra.
Frame systems that allow unrestricted alteration or cancellation of slots may however
display major problems. Most frame systems do not provide a way to define
unchangeable slots. This leads to a system where nothing is really certain and no
universal statements can be made. Building composite elements from simpler frames
is also restricted.
21
The Evolution of Expert Systems Knowledge Representation in Expert Systems
• syllogisms. Invented by Aristotle in the fourth century B.C., they have two
premises and one conclusion, which is inferred from the premises. The
premises provide the evidence from which the conclusion must follow. The
classical example is (Ref.1, p.64):
o Premise: All men are mortal
o Premise: Socrates is a man
o Conclusion: Socrates is mortal
• Venn Diagrams. In this model, a circle represents a set, and the objects
contained in the set are the elements (refer Fig. 3. 3). Socrates is an element,
therefore is represented as a dot and not a circle.
Fig. 3. 3 A Venn diagram for the example (Adapted from Ref.1, p.79)
The detailed descriptions of the knowledge representation methods previously
presented are beyond the scope of this paper. The interested reader is encouraged to
refer to Ref.1, pp. 57-91.
22
The Evolution of Expert Systems Conclusions and References
5 Conclusions.
This paper aimed to achieve two objectives:
• give a basic coverage on some important areas of expert systems, and maybe
incite the reader into further individual expert systems technology reading and
research;
• allow the author to train its own natural neural net (pun intended) with knowledge
in the field of AI and expert systems.
The history of Artificial Intelligence and expert systems started in fact very early, with
the ancient philosophers and then Renaissance scientists. The foundations of modern
expert systems are quite recent though (i.e. 1943 onwards). In terms of computer
history however, this is equivalent to hundreds of years of human history.
While Artificial Intelligence is far from being perfect and expert systems still have a long
way to go to fully model a human expert, the progress achieved in such a short period
is astonishing. And better still, the development pace is accelerating constantly, in
step with computer hardware technology.
6 References.
1. Giarratano & Riley (1998) Expert Systems: Principles and Programming, 3rd Ed.,
PWS Publishing Company, Boston, MA, ISBN 0 534 95053 1
2. Allen Newell and Herbert A. Simon, (1972) Human Problem Solving, Prentice-Hall.
3. Emil L. Post (1943), Formal Reductions of the General Combinatorial Decision
Problem, American Journal of Mathematics
4. George F. Luger, William A. Stubblefield (1993) Artificial Intelligence - Structures and
Strategies for Complex Problem Solving, Benjamin-Cummings, Albuquerque, ISBN
0-8053-4780-1
5. Stephen L. Gallant, Connectionist Expert Systems, Comm of the ACM, Feb 1998
6. Buchanan, Feigenbaum, Dendral and META-Dendral: Their Applications Dimension,
Artificial Intelligence, 1978.
7. AI Magazine, IEEE Journal, AI Expert publications, AAAI web site papers.
23