(Studies in Fuzziness and Soft Computing 336) Susanne Saminger-Platz, Radko Mesiar (
(Studies in Fuzziness and Soft Computing 336) Susanne Saminger-Platz, Radko Mesiar (
Susanne Saminger-Platz
Radko Mesiar Editors
On Logical,
Algebraic, and
Probabilistic
Aspects of Fuzzy
Set Theory
Studies in Fuzziness and Soft Computing
Volume 336
Series editor
Janusz Kacprzyk, Polish Academy of Sciences, Warsaw, Poland
e-mail: [email protected]
About this Series
Editors
On Logical, Algebraic,
and Probabilistic Aspects
of Fuzzy Set Theory
123
Editors
Susanne Saminger-Platz Radko Mesiar
Department of Knowledge-Based Department of Mathematics and Descriptive
Mathematical Systems Geometry
Johannes Kepler University Linz Slovak University of Technology
Linz Bratislava
Austria Slovakia
Copyright OÖN/Weihbold
Preface
Erich Peter Klement got interested in fuzzy set theory already in the 1970s while
being a young Assistant Professor at Johannes Kepler University in Linz, Austria.
In 1979, he stayed with Lotfi A. Zadeh at Berkeley University as a Visiting
Research Associate, several other research visits to universities in Europe and the
United States followed. It was also in 1979 when, together with Ulrich Höhle and
Robert Lowen, he first organized the “International Seminar on Fuzzy Set Theory”
in Linz. At that time, he could not know that there would be more than 35 seminars
to follow, well-established and widely known to the scientific community as the
“Linz Seminars on Fuzzy Set Theory”.
Organizing and hosting the seminars for so many years has not only been a great
service to the community but also had a big impact on the evolvement of science
dedicated to this field. The philosophy of the seminar has always been to encourage
critical discussions on mathematical aspects of fuzzy set theory by bringing together
researchers from different fields. Those, sometimes even controversial, discussions
have helped to develop a common understanding of the treatment of fuzzy sets and
fuzzy logic. And it happened more than once, in particular after the fall of the Iron
Curtain, that researchers from different countries first met in person on the occasion
of one of the Linz seminars.
It has also been during the early 1990s that Peter Klement founded the Fuzzy
Logic Laboratorium Linz Hagenberg (FLLL). Through industrial, applied and basic
research projects, he provided a working place and research perspectives for
(young) colleagues from different countries and disciplines. As the head of the
FLLL, and also the Department of Knowledge-Based Mathematical Systems at
Johannes Kepler University, he has hosted numerous international researchers
within, but also independently of, many research actions such as CEEPUS and
COST encouraging again discussions on the theory and application of fuzzy set
theory and beyond.
Besides his activities for the scientific community, Peter Klement has always
been active as a researcher himself. His early research interests have mainly been
devoted to (fuzzy) measures and integrals as several of his articles from the 1980s
vii
viii Preface
prove. Fuzzy measures have also been the background of his close collaboration
with the Dan Butnariu leading to the publication of the joint monograph entitled
“Triangular Norm-Based Measures and Games with Fuzzy Coalitions” published in
1993.
In 1992, as a result of the first visit of Radko Mesiar and Endre Pap to Linz with
an original intention to work on fuzzy measures and integrals, another long time
research cooperation, namely on the triangular norms and triangular conorms was
established resulting in a lot of journal articles, but in particular in the publication
of the joint monograph entitled “Triangular Norms” in 2000. By the intensified
work on triangular norms, Peter Klement’s attention had also been drawn to
copulas so that since the early years of the new millennium also copulas and
quasi-copulas had appeared in the titles of his articles, as well as topics related
to aggregation functions leading back to his original research interests in
(generalized) integration.
It is therefore not by chance that the current edited volume reflects and covers
several aspects of Peter Klement’s research acitivities. Among the authors one can
find former Ph.D. students, former colleagues from the Department of
Knowledge-Based Mathematical Systems, as well as colleagues, co-authors, friends
of Peter Klement with more than 30 years of experience in fuzzy set theory. Some
of the chapters included reflect personal views on traditional topics of the Linz
seminar—some of which containing even controversial aspects and fostering a
discussion on the mathematics behind. Other chapters deal with deep mathematical
theory of the algebraic and logical foundations of fuzzy set theory and fuzzy logic.
Several chapters approach topics related to Peter Klement’s personal research
interests in copulas, measures and integrals, as well as aggregation problems.
Also Thomas Vetterlein and Milan Petrík focus on the semantics of fuzzy logics
by discussing two different ways of investigating totally ordered monoids as an
interpretation of the conjunction in fuzzy logics.
Andrea Mesiarová-Zemánková’s chapter provides a characterization of the
structure of uninorms with continuous diagonal functions. Uninorms may be seen
as generalizations of t-norms and t-conorms, as they are associative and commu-
tative increasing operations on the unit interval whose neutral element can be, in
contrast to t-norms, respectively t-conorms, any interior element of the unit interval
and allow to model also bipolar behaviour in aggregation problems.
Humberto Bustince, Edurne Barrenechea, Miguel Pagola and Javier Fernandez
provide an overview on concepts of overlap and grouping functions generalizing
ideas of connectives from fuzzy set theory for the aggregation of information in
fuzzy classification systems.
Fabrizio Durante and Elisa Perrone in their chapter focus on asymmetric copulas
and their application in the design of experiments. The importance of copulas stems
from Sklar’s theorem clarifying that the dependence of a multivariate distribution
function of its univariate marginal distributions is, in case of continuity, completely
captured by a unique copula. The asymmetry of a copula therefore reflects the
non-exchangeability of the underlying random variables.
Carlo Sempi elaborates in his chapter the relationship between copulas and
stochastic processes, in particular the Brownian motion.
Anna Kolesárová and Andrea Stupňanová discuss extensions of capacities to
n-ary aggregation functions with relationships to the discrete Choquet and Sugeno
integral stressing the role of n-ary copulas when generalizing Lovász and Owen
extensions.
Ronald R. Yager approaches a more recent problem in aggregation, namely the
problem of multi-source information fusion by using measure representations. The
concepts of assurance and opportunity in the measure framework are also discussed.
Michel Grabisch focusses on bases and transforms of set functions on a finite set.
The basic duality between bases and invertible linear transforms is established,
covering, among others, the case of the Moebius transform, the Fourier transform
and interaction transforms.
Siegfried Weber in his chapter deals with conditioning for Boolean subsets,
indicator functions and fuzzy subsets. It introduces and discusses two types of
iteration.
Endre Pap discusses the integration of multivalued functions from additive to
arbitrary non-negative set functions. In particular, a set-valued Gould-type integral
of multifunctions is introduced and discussed.
Our special thanks go to our authors for their willingness to contribute to this
comprehensive volume. And we hope that the readers will enjoy reading all or part
of the chapters.
We congratulate Peter Klement for his scientific achievements and we are
thankful for the support he has given to us and to the scientific community in fuzzy
set theory throughout so many years. We are happy to witness that, although being
retired from being a university professor, he still enjoys being an active researcher.
x Preface
We wish him all the best, in particular healthiness, for pursuing his goals in the
future.
We have been supported by our universities, the Johannes Kepler University in
Linz and the Slovak University of Technology in Bratislava. We also gratefully
acknowledge the support of the grants APVV-14-0013 and the support in the
framework of the Technologie-Transfer-Förderung Wi-2014-200710/3KX/Kai
of the Upper Austrian Government, as well as the encouragement and the help
of Prof. Janusz Kacprzyk for the preparation of this edited volume.
Fuzzy Logic and the Linz Seminar: Themes and Some Personal
Reminiscences . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
Siegfried Gottwald
How I Saw, and How I See Fuzzy Sets. . . . . . . . . . . . . . . . . . . . . . . . . 13
Enric Trillas
Modules in the Category Sup . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
Ulrich Höhle
A Geometric Approach to MV-Algebras . . . . . . . . . . . . . . . . . . . . . . . 57
Daniele Mundici
On the Equational Characterization of Continuous t-Norms . . . . . . . . . 71
Francesc Esteva and Lluís Godo
The Semantics of Fuzzy Logics: Two Approaches
to Finite Tomonoids . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83
Thomas Vetterlein and Milan Petrík
Structure of Uninorms with Continuous Diagonal Functions . . . . . . . . . 109
Andrea Mesiarová-Zemánková
The Notions of Overlap and Grouping Functions . . . . . . . . . . . . . . . . . 137
Humberto Bustince, Edurne Barrenechea, Miguel Pagola
and Javier Fernandez
Asymmetric Copulas and Their Application in Design
of Experiments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 157
Fabrizio Durante and Elisa Perrone
Copulæ of Processes Related to the Brownian Motion:
A Brief Survey . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 173
Carlo Sempi
xi
xii Contents
Siegfried Gottwald
Abstract The paper discusses concisely the main developments in the field of
mathematically oriented fuzzy logics and how they found their representation over
the years in the Linz Seminars on Fuzzy Set Theory.
1 Introduction
The Linz Seminar on Fuzzy Set Theory, first organized by Peter Klement in 1979,
soon became famous among mathematicians interested in fuzzy set topics, pure as
well as applied ones.
Personally, I first met Peter Klement in 1983 at the Polish Symposium on Interval
& Fuzzy Mathematics [1] in Poznań, and then again in 1985 at an International
Workshop on Fuzzy Sets Applications [2] in Eisenach. We soon had a quite friendly
relationship—and he told me that I’d have a standing invitation to the Linz Seminar as
soon as I was able to “tunnel” the iron curtain. This, however, proved to be impossible
till 1989. So I first attended the Linz Seminar in 1990—and ever since with only a
few exceptions. And already in 1990 Peter asked me kindly to join the Program
Committee, what I accepted with pleasure.
S. Gottwald (B)
Universität Leipzig, Abt. Logik Am Institut Für Philosophie, Beethovenstr. 15,
04107 Leipzig, Germany
e-mail: [email protected]
as) [3–5], using (only) Łukasiewicz logic—but with a strong feeling for the fact that
the whole approach would naturally work within a more general logical framework.
It was the contact and cooperation with E. Czogała from Gliwice in the beginning
1980s, and soon also with his graduate student W. Pedrycz, and particularly a stay
of W. Pedrycz at Delft University of Technology, that brought to me the information
of the t-norm framework. And it was this cooperation together with a 1983 Summer
School on fuzzy set matters in Bulgaria where I learned much about the importance
of fuzzy relation equations for fuzzy modeling, and of the solution approach by
E. Sanchez.
As a result I started to consider t-norm based logics [6] and fuzzy set theory in
this realm [7], and I generalized also solvability considerations for systems of fuzzy
relation equations to this framework [8, 9].
From the Linz circle then I learned that the Linz Seminar was at its early meetings
instrumental in establishing the fundamental role of the t-norms as suitable candidates
for generalized, interactive intersections of fuzzy sets, and hence of generalized,
non-idempotent conjunction connectives for systems of many-valued logics. Core
members of the Linz Seminar crew had contributed to the discussions around t-
norms, e.g. at the 2nd Linz Seminar of 1980 with t-norm related contributions by
D. Dubois and E.P. Klement.
Additionally, what I learned from the Linz Seminars in the early 1990s was the
central role that algebraic considerations, even category theoretic ones, should play
in the fuzzy sets context to structure the theory in a suitable way. And I realized
with pleasure the open-mindedness of most of the Linz participants regarding a large
diversity of mathematical topics.
Besides the t-norm topic, themes from formal logic have been considered only
occasionally in the early years of the Linz Seminar: U. Cerutti discussed in 1981
category theoretic aspects, L. Valverde problems of generalized connectives in 1981
and 1982. Logical operators had also been the topic of L. Kohout in 1984 and 1985.
A whole day was devoted to logic, for the first time, in 1988, with U. Höhle and
L. Kohout as speakers.
As times went on and the mathematical discussions in the fuzzy sets field became
more mature, the Linz Seminar was for the first time mainly devoted to a particular
topic in 1988: Measures and Integrals, followed by the topic of Applications of
Category Theory to Fuzzy Subsets in 1989.
With the breakdown of the iron curtain in 1989 immediately a much wider audience
was reached by and interested in the Linz Seminar, and new funding possibilities
became available which Peter Klement was able to use in a quite effective manner.
Devoted mainly to Applications of Logical and Algebraic Aspects of Fuzzy Rela-
tions the 1990 seminar considerably enlarged its number of participants and joined
researchers from the former East and West. With my personal focus on logic, let me
Fuzzy Logic and the Linz Seminar: Themes and Some Personal Reminiscences 3
mention F. Esteva, D. Mundici, and V. Novák, who for the first time attended the
Linz Seminar in 1990.
Since, logical aspects of fuzzy sets and mathematical aspects of suitable non-
classical logics have quite regularly been in the focus of the Linz Seminar. This
was e.g. the case of the Linz Seminars of 1992 on Non-Classical Logics and Their
Applications, of 1996 on Fuzzy Sets, Logics, and Artificial Intelligence, and again of
1997 on Enriched Lattice Structures for Many-Valued and Fuzzy Logics. These last
two seminars did have P. Hájek among the participants—who, in those years, was
in the main phase of his research for the logic BL of continuous t-norms presented
1998 in the seminal monograph [10].
This series of logic related Linz Seminars continued in 2000 with the topic of
Mathematical Aspects of Non-Classical Logics and Fuzzy Inference, in 2005 with
Fuzzy Logics and Related Structures, in 2010 with Lattice-Valued Logic and its
Applications, and in 2014 with Graded Logical Approaches and their Applications.
But also the Linz Seminars in the years 1993, 2004, and 2009 had logic as one of the
fields of particular interest.
Thus, from the mid-1990 the Linz Seminar well mirrored the fact of increasingly
strong relationships between fuzzy logics and mathematics.
What happened in fuzzy logic in that time, from a mathematical point of view?
As already mentioned, it was clear from the mid-1980s that t-norm based logics
offer a suitable framework for fuzzy set theory, and to identify the membership
degrees with the truth degrees of such logics. And the standard understanding of
the membership degrees of fuzzy sets also supported the choice of the truth degree
1 as the only designated one. There were, thus, two prominent particular cases for
such logics: the infinite-valued versions L∞ of the Łukasiewicz and G∞ of the Gödel
families of many-valued logics, cf. [11].
The context which was given by the t-norms as truth degree functions for non-
idempotent conjunction connectives and offered, however, some different pathways
for the introduction of further related connectives. Thus, t-conorms became natural
candidates to determine non-idempotent disjunction connectives. Furthermore the
fuzzy community agreed to have idempotent conjunction ∧ and disjunction ∨ con-
nectives with min, max, respectively, as truth degree functions, too.
There was, yet, no such standard choice neither for a negation nor for an impli-
cation connective. The matter is, however, not too difficult regarding candidates for
generalized, many-valued negation functions n. Standard properties should only be
4 S. Gottwald
a ∗ b ≤ c iff a ≤ b ⇒ c . (1)
The standard implication of the Gödel logic G∞ has such a characterization. In this
case one has ∗ = min and the R-implication coincides with the relative pseudo-
complement in the complete lattice ([0, 1], min, max, 0, 1).
Interestingly the standard implication of the Łukasiewicz logic L∞ is also an
R-implication. And because of the connections between R-implications and fuzzy
relation equations, those R-implications had also been used in the previously men-
tioned naive approaches toward t-norm based logics and related fuzzy set theoretic
developments [6, 9].
Nevertheless it seems to be a kind of competitive situation between t-norm based
logics with R-implications, and such ones with S-implications. And indeed, the latter
situation was studied in [12]. However, the mainstream development has focussed
on t-norm based logics with R-implications. It seems that the crucial point for this
decision was the fact that for R-implications the rule of detachment has a simple and
convincing argument for its correctness in the fact that the formula
1 This comes from the fact that in early phases of these considerations t-conorms also had been
discussed under the name “S-norms”.
2 The name derives from the algebraic operation of residuation.
Fuzzy Logic and the Linz Seminar: Themes and Some Personal Reminiscences 5
ϕ & (ϕ ⇒ ψ) ⇒ ψ (2)
a ∧ b = a ∗ (a ⇒ b) (3)
was known from [14]. And it was equally important that P. Hájek restricted the class
of integral commutative residuated lattice ordered monoids to those ones which
additionally satisfied the prelinearity condition
(a ⇒ b) ∨ (b ⇒ a) = 1 , (4)
called algebraic strong de Morgan law in [14]. Since then, these structures are known
as BL-algebras.
6 S. Gottwald
In the Linz Seminars of 1996 and 1997 P. Hájek presented core ideas and results
of his approach.
Another stream in the area of fuzzy sets related logics started in 1979 with J. Pavelka’s
discussion [19] of many-valued propositional logics with graded notions of conse-
quence, i.e. of logics which allowed to consider syntactic as well as semantic conse-
quence hulls of fuzzy sets of formulas. The general context was the one offered by
Goguen’s commutative lattice ordered semigroups with residuation added.
The semantic consequence hull Cn∗|= (Σ) of a fuzzy set Σ of formulas is defined
in a rather standard way with reference to models of Σ. In this context, a [0, 1]-
evaluation e is called a model of a fuzzy set Σ of formulas iff for each formula ϕ
one has
membership degree of ϕ in Σ ≤ e(ϕ) (5)
A corresponding syntax for such a graded notion of consequence has to cope with
membership degrees. Therefore the propositional language of [19] was enriched with
truth degree constants for each possible membership/truth degree, i.e. with constants
for each real out of [0, 1]. Furthermore, derivations with a fuzzy set Σ of premisses
have to treat formulas and degrees in parallel, and so have to act inference rules.
As a result, each derivation is a derivation of a formula ϕ to some degree a, the
proof degree of ϕ from Σ for that particular derivation. Because formulas may have
different derivations, they may have different proof degrees in this context. The
provability degree prΣ (ϕ) of ϕ from Σ then is the supremum of all possible proof
degrees of ϕ from Σ.
And the syntactic consequence hull Cn∗ (Σ) of a fuzzy set Σ of formulas is
defined by the condition
A general completeness theorem then is the statement Cn∗|= (Σ) = Cn∗ (Σ) and
could be proved by J. Pavelka [19] only for Łukasiewicz logic L∞ as basic system.
The reason is that the completeness proof needs the continuity of the residuation
Fuzzy Logic and the Linz Seminar: Themes and Some Personal Reminiscences 7
operation, and only the Łukasiewicz t-norm and its isomorphic versions have con-
tinuous R-implications [20].
This line of research was extended to first-order logic by V. Novák [21]. He
attended the Linz Seminar, beginning in 1990, quite often and presented various of
his theoretical achievements, e.g. about model theoretic results, but also applications
e.g. to problems in theoretical linguistics like the modeling of intermediate quantifiers
[22].
Also in this first-order case a general completeness theorem Cn∗|= (Σ) = Cn∗ (Σ)
can be proved only for Łukasiewicz logic L∞ as basic system–the proof again needs
the continuity of the R-implication.
The whole problem of graded notions of consequence can also be treated more
algebraically, having in mind that for classical logic there is a strong relationship
between consequence relations and closure operators in the class of all sets of for-
mulas.
Such a treatment needs the reference to closure operators in the class of all
fuzzy sets of formulas. They are defined in the standard way, i.e. via increasing-
ness, monotonicity, and idempotency and need only the reference to the (binary)
inclusion relation for fuzzy sets. To a large extent this approach was studied by G.
Gerla [23] who attended the Linz Seminar e.g. in 1996 and presented basic ideas of
this approach.
However, this more algebraic way toward graded notions of consequence does
not give more general results: a completeness theorem results again only for the case
of Łukasiewicz logic L∞ as basic system.
The focus of the considerations in the field of mathematical fuzzy logics toward
residuated lattice-ordered monoids forced also other investigations into the theory
of fuzzy sets with membership degrees in such structures or in similar ones.
The approaches toward lattice-valued mathematics brought together earlier inves-
tigations on fuzzy topologies and on category theoretic treatments of fuzzy set matters
which had been main topics e.g. of the 1989 Linz Seminar on Applications of Cat-
egory Theory to Fuzzy Subsets [24]. They have repeatedly been core topics for the
Linz Seminars, e.g. in 1993, 1997, 2004, and 2010, and they somehow culminated
in the 2012 Seminar on Enriched Category Theory and Related Topics documented
in [25]. Enriched categories have hom-sets which themselves are structured, e.g. as
residuated lattices. Enriched category theory can be used to understand the underly-
ing structure of fuzzy set theory as a particular monoidal closed category. From this
point of view, the grading membership functions become enriched presheaves.
The understanding of fuzzy sets as particular presheaves works, however, also
in the simpler context of usual categories. Also this point of view was quite often
discussed in the Linz Seminar and gave rise to M-valued sets as early as in [26],
and in this context to the understanding of the graded self-identity as measuring an
extent of existence [27, 28].
8 S. Gottwald
This notion of M-valued set touches a further important problem for many-valued
and fuzzy logics: graded identities. Such an M-valued set is an ordered pair A =
(|A|, δ A ) consisting of a crisp set |A| and a graded, i.e. M-valued (local) equality
relation δ A satisfying
(E1): δ A (x, y) ≤ δ A (x, x) ∧ δ A (y, y) , strictness
(E2): δ A (x, y) = δ A (y, x) , symmetry
(E3): δ A (x, y) ∗ (δ A (y, y) → δ A (y, z)) ≤ δ A (x, z) . transitivity
The degree of self-identity δ A (x, x) is understood as degree of extent for x in A, or
also as degree of existence, as was done for the Heyting algebra valued case in [29].
The problem of graded identities is a general problem in many-valued first-order
logics. It is not yet solved and has important philosophical aspects which, from this
author’s point of view, are still quite incompletely understood.
For the case of the finitely-valued Łukasiewicz logics it had already in 1958 by
H. Thiele [30] been shown that the usual Leibniz principle of substitutivity of equals
forces the equality relation to be crisp. A weakening of that principle, however, allows
for graded identities, as shown by this author in [31], cf. also [11].
Also for fuzzy sets a graded identity defined with respect to a graded membership
predicate ε by
From those identity relations, Leibniz equality is crisp and extensional equality really
graded.
To a large extent these results can also be generalized to the setting with the logic
MTL of all left-continuous t-norms as background logic, as discussed in [35].
This topic of graded identities occasionally occurred during the Linz Seminars,
but never was a core topic there. It is not only a mathematical problem.
Fuzzy Logic and the Linz Seminar: Themes and Some Personal Reminiscences 9
5 Concluding Remarks
Of course, it is nearly impossible to cover with such a series like that of the Linz
Seminars all the main developments. However, if one tries to find rather important
gaps, only proof theory comes into the focus. For long, proof theory–understood here
as the field of sequent and tableau calculi–for many-valued and fuzzy logics was not
a well developed and really active field. Besides some isolated research results often
restricted to finitely-valued logics, explained e.g. in [11], prior to the end of the 1990s
there had not been published important papers.
The situation changed as hypersequents, i.e. finite multisets of sequents as intro-
duced by A. Avron [36], became known as a useful tool for fuzzy logics. M. Baaz et
al. [37] indicate the first steps into this new field, and the monograph [38] documents
the amount of work done in the first decade of this century.
There are other areas, related to mathematical fuzzy logics but more application
oriented, which have not been discussed here, but have been present at the Linz
Seminars to at least some degree. Here I have in mind the field of approximate
reasoning, and particularly possibility theory, which was often discussed in Linz
Seminars devoted to more applied fields of mathematics.
Summing up and reconsidering all the 35 Linz Seminars on Fuzzy Set Theory from
1979 till 2014, all of them locally well organized by Peter Klement, it is remarkable
that most of the essential topics in the development of mathematically oriented fuzzy
logics have been presented and actively supported by those seminars.
References
1. Albrycht, J., Wiśniewski, H. (eds.): Proceedings of the Polish Symposium on Interval & Fuzzy
Mathematics, Poznań. Wydawn Politech Pozn (1983)
2. Bocklisch, S., Orlovski, S., Peschel, M., Nishiwaki, Y. (eds.): Fuzzy sets applications, method-
ological approaches, and results. In: Mathematische Forschung, vol. 30. Akademie-Verlag,
Berlin (1986)
3. Gottwald, S.: A cumulative system of fuzzy sets. In: Marek, W., Srebrny, M., Zarach, A. (eds.)
Set Theory Hierarchy Theory, Mem. Tribute A. Mostowski, Bierutowice (1975) Lecture Notes
In Mathematics, vol. 537, pp. 109–119. Springer, Berlin (1976)
4. Gottwald, S.: Set theory for fuzzy sets of higher level. Fuzzy Sets Syst. 2, 125–151 (1979)
5. Gottwald, S.: Fuzzy uniqueness of fuzzy mappings. Fuzzy Sets Syst. 3, 49–74 (1980)
6. Gottwald, S.: T -Normen und ϕ-Operatoren als Wahrheitswertfunktionen mehrwertiger Junk-
toren. In: Wechsung, G. (ed.) Frege Conference 1984 (Schwerin, 1984). Mathematical
Research, vol. 20, pp. 121–128. Akademie-Verlag, Berlin (1984)
7. Gottwald, S.: Fuzzy set theory with t-norms and ϕ-operators. In: Di Nola, A., Ventre, A.G.S.
(eds.) The Mathematics of Fuzzy Systems. Interdisciplinary Systems Research, vol. 88, pp.
143–195. TÜV Rheinland, Köln (Cologne) (1986)
8. Gottwald, S.: Generalized solvability criteria for fuzzy equations. Fuzzy Sets Syst. 17, 285–296
(1985)
9. Gottwald, S.: Characterizations of the solvability of fuzzy equations. Elektron. Informationsver-
arbeitung Kybernetik 22, 67–91 (1986)
10 S. Gottwald
10. Hájek, P.: Metamathematics of Fuzzy Logic. In: Trends in Logic, vol. 4. Kluwer Acad. Publ,
Dordrecht (1998)
11. Gottwald, S.: A Treatise on Many-Valued Logics. In: Studies in Logic and Computation, vol.
9. Research Studies Press, Baldock (2001)
12. Butnariu, D., Klement, E.P., Zafrany, S.: On triangular norm-based propositional fuzzy logics.
Fuzzy Sets Syst. 69, 241–255 (1995)
13. Höhle, U.: Monoidal logic. In: Kruse, R., Gebhard, J., Palm, R. (eds.) Fuzzy Systems in
Computer Science. Artificial Intelligence, pp. 233–243. Verlag Vieweg, Wiesbaden (1994)
14. Höhle, U.: Commutative, residuated l-monoids. In: Höhle, U., Klement, E.P. (eds.) Non-
Classical Logics and Their Applications to Fuzzy Subsets. Theory and Decision Library Series
B, vol. 32, pp. 53–106. Kluwer Acad. Publ., Dordrecht (1995)
15. Goguen, J.A.: The logic of inexact concepts. Synthese 19, 325–373 (1968–69)
16. Gottwald, S.: Mehrwertige Logik. Logica Nova. Akademie-Verlag, Berlin (1989)
17. Chang, C.C.: Algebraic analysis of many valued logics. Trans. Am. Math. Soc. 88, 476–490
(1958)
18. Hájek, P., Lluís, G., Francesc, E.: A complete many-valued logic with product-conjunction.
Arch. Math. Log. 35, 191–208 (1996)
19. Pavelka, J.: On fuzzy logic. I–III. Zeitschr. math. Logik Grundl. Math. 25, 45–52, 119–134,
447–464 (1979)
20. Jayaram, B.: On the continuity of residuals of triangular norms. Nonlinear Anal. 72, 1010–1018
(2010)
21. Novák, V.: On the syntactico-semantical completeness of first-order fuzzy logic. I: Syntax and
semantics. II: Main results. Kybernetika 26, 47–66, 134–154 (1990)
22. Novák, V.: A formal theory of intermediate quantifiers. Fuzzy Sets Syst. 159(10), 1229–1246
(2008)
23. Gerla, G.: Fuzzy logic. Mathematical Tools for Approximate Reasoning. In: Trends in Logic,
vol. 11. Kluwer Academic Publishers (2001)
24. Rodabaugh, S.E., Klement, E.P., Höhle, U. (eds.): Applications of Category Theory to Fuzzy
Subsets. Kluwer Acad. Publ, Dordrecht (1992)
25. Höhle, U., Klement, E.P. (eds.): Fuzzy Sets and Systems, vol. 256 (2014)
26. Höhle, U.: M-valued sets and sheaves over integral commutative C L-monoids. In: Rodabaugh,
S.E., et al. (eds.) Applications of Category Theory to Fuzzy Subsets, Theory and Decision
Library Series B, vol. 14, pp. 34–72. Kluwer Acad. Publ., Dordrecht (1992)
27. Höhle, U.: Many valued logic and sheaf theory. Sci. Math. Japon. 68(3), 417–433 (2008)
28. Höhle, U.: Many-valued equalities and their representations. In: Klement, E.P., Mesiar, R.
(eds.) Logical, Algebraic, Analytic, and Probabilistic Aspects of Triangular Norms, pp. 301–
319. Elsevier, Dordrecht (2005)
29. Scott, D.S.: Identity and existence in intuitionistic logic. In: Fourman, M.P., Mulvey, C.J.,
Scott, D.S. (eds.) Applications of Sheaves. Lecture Notes in Mathematics, vol. 753, pp. 660–
696. Springer, New York (1979)
30. Thiele, H.: Theorie der endlichwertigen Łukasiewiczschen Prädikatenkalküle der ersten Stufe.
Zeitschr. math. Logik Grundl. Math 4, 108–142 (1958)
31. Gottwald, S.: A generalized Łukasiewicz-style identity logic. In: de Alcantara, L.P. (ed.) Math-
ematical Logic and Formal Systems. Lecture Notes Pure Applied Mathematics, vol. 94, pp.
183–195. Marcel Dekker, New York (1985)
32. Gottwald, S.: Fuzzy Sets and Fuzzy Logic. Artificial Intelligence. Verlag Vieweg, Wiesbaden,
and Tecnea, Toulouse (1993)
33. Hájek, P.: On equality and natural numbers in Cantor-Łukasiewicz set theory. Log. J. IGPL
21(3), 91–100 (2013)
34. Skolem, Th.: Bemerkungen zum Komprehensionsaxiom. Zeitschr. math. Logik Grundl. Math.
3, 1–17 (1957)
35. Běhounek, L., Haniková, Z.: Set theory and arithmetic in fuzzy logic. In: Montagna, F. (ed.) Petr
Hájek on Mathematical Fuzzy Logic. Outstanding Contributions to Logic, vol. 6, pp. 63–89.
Springer, Switzerland (2015)
Fuzzy Logic and the Linz Seminar: Themes and Some Personal Reminiscences 11
36. Arnon, A.: Hypersequents, logical consequence and intermediate logics for concurrency. Ann.
Math. Log. AI 4, 225–248 (1991)
37. Baaz, M., Ciabattoni, A., Fermüller, C., Veith, H. (eds.): Proof theory of fuzzy logics: urquhart’s
C and related logics. In: Mathematical Foundations of Computer Science. Lecture Notes in
Computer Science, vol. 1450, pp. 203–212. Springer, Berlin (1998)
38. Metcalfe, G., Olivetti, N., Gabbay, D.: Proof theory for product logics. Neural Netw. World
13, 549–558 (2003)
How I Saw, and How I See Fuzzy Sets
Enric Trillas
Abstract This paper does not pretend a ‘technical’ presentation of a particular topic
with an exhausting list of references; it just would like to contain some reflections
of the author concerning how he sees, or better, he wishes, the future of current
fuzzy logic that, in his view and at the risk of stagnation, cannot lie on any kind of
‘logicism’ but on ‘scienticism’.
1 Introduction
It can be said that as it was originally introduced by Zadeh [16], fuzzy set theory
mainly deals with two important linguistic phenomena, imprecision and non-random
uncertainty, and that fuzzy sets can be applied, among others, to the study of dynam-
ical systems whose behavior can be described by sets of imprecise linguistic rules,
and to the random uncertainty associated to some linguistic statements [7, 13]. For
instance, the theory of possibility can deal with non-random uncertainty, fuzzy con-
trol with dynamical systems, and fuzzy probability with random fuzzy events.
The ground of fuzzy set theory lies in the, historically not surprising, fact that
predicates acting in a universe of discourse generate linguistic collectives in it; col-
lectives [14], except when they degenerate in just a single classical set, are cloudy
linguistic entities neither well known, nor easy to specify virtual or ‘thought’ entities
whose appearances, or states, are just membership functions, fuzzy sets allowing to
see their projections inside the fog of ordinary language. Hence, fuzzy sets can be
seen as a starting point for the currently non existing scientific study of linguistic
collectives. In sum and grossly speaking, fuzzy sets deal with ordinary language;
they are mathematical entities contextually reflecting collectives, and modeled by
their membership functions. They meant to pass from an old world of exact thinking
E. Trillas (B)
European Centre for Soft Computing, Mieres, Asturias, Spain
e-mail: [email protected]
I entered into fuzzy logic by chance. It was through an interview, in a French news-
paper, with the late professor Arnold Kaufmann in which he spoke on his then recently
appeared book ‘Ensembles flous’. The subject interested me since I was doing my
research work on the probabilistic metrics introduced by Karl Menger, and knew his
paper entitled ‘Ensembles flous’, a new concept that he translated into English by
hazy sets. I bought Kaufmann’s book, read a good part of it, and to some extent I
was actually disappointed; my first glance at fuzzy sets make me to believe that they
were just a simple generalization of sets. Nevertheless, since some of the examples
in the book called upon my attention and made me curious, I decided to read the
1965 paper where fuzzy sets were originally introduced, whose title is ‘Fuzzy Sets’
and was written by Zadeh [16]. Before reading this paper, and as a consequence of
both my mathematical formation, and the reading of Menger’s paper, I was unable
to see fuzzy sets unlinked with probability; indeed, hazy sets represent something
like the probability of an element belonging to a set.
How I Saw, and How I See Fuzzy Sets 15
But the reading of Zadeh’s paper suddenly changed my view. The subject was
towards representing the multitude of imprecise predicates of which language is full;
it was, for me, the first mathematical model for taking into account the imprecision
that, permeating language, affects ordinary reasoning with concepts that are not
definable like those managed in mathematics by ‘if and only if’ conditions, but only
describable from their use in several contexts as they appear in dictionaries. It was
for me a new land to be explored, and I was captivated as I see the possibility of
building up mathematical models of common reasoning. I decide to start with such
an exploration! At the end, in that time I was unhappy with the ‘bourbakism’ of
which mathematics was full in Spain, and I was also worried by the giving up that,
from time ago, logicians kept on ordinary reasoning.
Although the only references in the first Zadeh’s paper on fuzzy sets are the purely
mathematical books by Birkhoff, Halmos, and Kleene, and since from very young
I kept a deep interest in Bertrand Russell’s philosophical writings, I remembered
the Russell’s paper ‘On Vagueness’ and believed that there should be some links
between fuzzy sets and vagueness. This idea conducted me to the 1972 paper by
Aldo De Luca and Settimo Termini, where they established the then new concept of
a ‘fuzzy entropy’, and I thought it is nothing else than a measure of the vagueness or,
by duality of its classicality, booleanity, or crispness, the linguistic label of a fuzzy
set presents. This idea make me to think that ‘fuzziness’ is just a restriction of the
vagueness of a predicate whenever it can be represented by a fuzzy set, and this was
for me a challenging philosophical idea that, many years ahead, conducted me to see
that fuzzy sets are nothing else than measures of the meaning of predicates. My first
papers on fuzzy sets dealt, between 1976 and 1978, with trying to find functionally
expressible mathematical formulas able to represent fuzzy entropies, but different
and more general than the unique logarithmic fuzzy entropy shown by De Luca and
Termini in his paper. In addition, I also tried to relate them with the Sugeno’s fuzzy
integral since, in the meantime, I was acquainted with Michio Sugeno in Toulouse,
and with his 1974 Ph.D. Thesis.
Early after these worries, I began to be interested in the subject of fuzzy connec-
tives and fuzzy inference. For the first I was essentially motivated by the fact that
fuzzy connectives can’t show the same properties in all contexts, and that distrib-
utivity is, for instance, a very constraining and crisp property. What conducted me
in such direction were the papers by Bellman and Giertz [2], and that on negation
by Lowen [4]. My dedication to Probabilistic Metric Spaces, that bring me to know
Bert Schweizer and Able Sklar after meeting Karl Menger in Chicago, introduced me
to solve Functional Equations, and I got the idea of characterizing the (continuous)
strong negations by just solving an easy functional equation. Since I was familiar
with Schweizer and Sklar’s t-norms (a restriction by adding associativity to those
introduced by Menger [6]), I can introduce in fuzzy logic these ordered semi-groups
in the unit interval. Finally, the fact that as examples of his Compositional Rule
of Inference, Zadeh did show some that were non-preserving the classical Modus
Ponens when the input is just the antecedent of the rule, I tried to study this ‘Modus’
in fuzzy logic by formulating it as Hardegree did in Orthomodular lattices [3].
16 E. Trillas
To end this section, that corresponds with the time in which I met for the first
time professor Peter Klement and in Barcelona, let me remember again that, as
another consequence of Menger’s work, but due to the idea of mathematically
modeling the breaking of synonymous chains, I introduced in fuzzy logic the
T-indistinguishabilities, or T-equivalences, allowing to relating such problem with
that of Poincaré concerning the physical continuum. I would like to say that Menger’s
trace in fuzzy logic or, at least, in my contributions to it, is certainly of some relevance.
For a lot of time after 1965, the mathematical nature of fuzzy sets in relation with
the meaning of their linguistic label, was not clearly explained. They were simply
viewed as membership functions generalizing the characteristic function of crisp sets
and, supposedly, representing its meaning in the universe of discourse but without
counting with a meaning’s operational description [8]. If philosophers largely debated
on the meaning of ‘meaning’, they never attended the representation of meaning, and
it lacked a scientific study that today can be considered started with the work of Zadeh,
and in a form close to the Wittgenstein of the ‘Philosophical Investigations’, when
he states that almost always ‘the meaning of a word is its use in language’. How
can even if not defined, the use or management in language of a linguistic label be
mathematically described?
If P is a linguistic label, or predicate, acting in a universe of discourse X through
the elemental statements ‘x is P’, for a suitably management of P the two binary
relations in X , that empirically come from linguistic perception, from its use,
• x = P y ⇔ x shows the property named P equally than y shows it ⇔ x is equally
P than y,
• x ≤ P y ⇔ x is less P than y,
should be known [2, 14]. When both relations ≤ P and = P do coincide, it is said that
the use of P in X is precise, rigid, or crisp, = P is an equivalence, and X is partitioned
in the equivalence classes in the quotient set X/ =, [x] = {y ∈ X ; y = P x}. Instead
and when ≤ P == P that, provided it can be supposed = P =≤ P ∩ ≤−1 P , implies that it
is not ≤ P ⊆ ≤−1
P , it is said that the use of P in X is imprecise, flexible, or fuzzy. In any
case, the graph (X, ≤ P ) represents the qualitative, or primary, meaning of P in X .
In this way, the previously amorphous universe of discourse X , is softly structured
thanks to the use of P in it. The simple and usual act of ‘speaking’ on a property
recognizable in the elements of X , endows X with the arcs of this graph; an idea
corresponding with the intuitive one that rational speech tries to introduce some kind
of ‘ordering’ in the universe of discourse, also corresponding to the establishment of
some necessary link between ordering and understanding. Nevertheless, the graph
does not exhaust the ‘full meaning’ of P in X , and when it is ≤ P = ∅, it can be said
that P is metaphysically used in X , that P is metaphysical, or meaningless in X
[11]. Notice that it is thanks to the relation ≤ P that can be seen the variability of the
How I Saw, and How I See Fuzzy Sets 17
Once the graph (X, ≤ P ) is known it can be said that P is measurable in X , and once
a measure µ P is known that it is effectively measurable in X [14].
The three former properties are not sufficient, in general, to specify a measure,
but there is only a single one if the predicate is precise; to specify a measure either
more information on the use of P, or to establish a reasonable hypothesis on it, is
necessary.
In any case, each measure µ P is the membership function of a fuzzy set in X
labeled P. Fuzzy sets are defined by the measures of the extent up to which the
elements in X are P show the property named P; shortly speaking it can be said
that fuzzy sets are measures of meaning, like probabilities are measures of random
uncertainty, and fuzzy entropies are measures of fuzziness. It should be noticed that
each quantity (X, ≤ P , µ P ) represents a good enough knowledge on the meaning of
P in X for its scientific consideration; it can be said that such quantities are the
typically scientific domestication of meaning [10], and can offer a new perspective
for studying both fuzzy sets and fuzzy logic.
It should be noticed that if the use of P in X is rigid, it is x = P y ⇔ x ≤ P y &
y ≤ P x ⇒ µ P (x) = µ P (y), and hence, µ P is constant in the classes modulo P, the
only values µ P can take are 0 or 1, µ−1 P (1) is the crisp subset specified by P in X ,
µ−1P (0) its classical complement, and one of them can be empty.
In praxis, a fuzzy set is designed by means of the information on its linguistic
label that is available and that, most of the times, is not the full relation ≤ P , but a
part of it; there are cases in which obtaining ≤ P can be very difficult. Hence and very
often, neither it is always ≤ P completely known, nor it can be stated that the designed
membership function µ∗P is truly a measure, but some unknown approximation to it.
Consequently, the designer cannot work with ≤ P but only with the total order defined
by x ≤µ∗P y ⇔ µ∗P (x) ≤ µ∗P (y), called the working meaning of P in X . Provided
µ∗P were actually a measure or, at least, it can be supposed it verifies property (1),
and since then, x ≤ P y ⇒ µ∗P (x) ≤ µ∗P (y) ⇔ x ≤µ∗P y, implies ≤ P ⊆ ≤µ∗P , that is,
the working meaning extends the qualitative meaning of P. The act of measuring P,
modifies its qualitative meaning by adding more arcs to it [14].
Notice that since in most cases the relation ≤ P has not a total, or linear, character,
it cannot coincide with the linear orders ≤µ P . When there is coincidence, it is said
that the measure perfectly reflects the qualitative meaning of P. It is easy to proof
that, provided ≤ P is reflexive and transitive, then = P is an equivalence relation,
and that the mapping C : X → X/ = P , assigning to each x the equivalence class
[x] = C(x), verifies x ≤ P y ⇔ [x] ≤∗P [y] ⇔ C(x) ≤∗P C(y). Hence, not only C
18 E. Trillas
perfectly reflects the qualitative meaning of P in X , but this idea opens the door for
defining ‘qualitative measures’ of a predicate by taking, instead of the unit interval,
some non-numerical posets with which more possibilities of perfectly reflecting the
qualitative meaning could appear.
In those cases in which the measure does not perfectly reflect the qualitative meaning,
and since in science is not at all rare to manage measures with complex values, it could
be suitable to substitute the real interval [0, 1] by the complex one {a + bi; a, b ∈
[0, 1]}, the complex circle, endowed with the usual partial order a1 + b1 i ≤ a2 +
b2 i ⇔ a1 ≤ a2 & b1 ≤ b1 , and with analogous properties [14] to the former (1),
(2), and (3). This substitution cannot guarantee that a complex-valued measure will
perfectly reflect the qualitative meaning, but just that it can offer more possibilities for
it, since the working order will be not linear. This substitution that can be equivalently
seen by taking an interval-valued measure, just changing a + bi by the interval [a, b],
and corresponding to a particular type of the so-called type-2 fuzzy sets reflecting
that the value of the measure carries with the uncertainty coming from only being
sure that it is in the interval [a, b].
Analogously, and instead of the real or the complex unit intervals, it can be taken
the set [0, 1][0,1] , of the fuzzy sets in the unit interval (type-2 fuzzy sets) that contains
images isomorphic to both the unit interval and the complex unit interval, and for
those cases in which the only that can be asserted is that the value of the measure is,
for instance, either ‘around 0.7’, or ‘high’ [12]. In this form, all the types of fuzzy
sets currently considered, are integrated thanks to the quantities, either numerical or
functional, representing the meaning of its linguistic label.
The full meaning of a linguistic label P is not unique, but it is actually context-
dependent and purpose-driven. Each quantity (X, ≤ P , µ P ), real, complex or fuzzy
valued, is obtained through what the designer can know, in a given context, of the
use, action or behavior of P in X , or through some reasonable hypothesis he could
be able to make on such behavior. This last is the often considered case in most
applications, in which the real-valued measure, the membership function, is supposed
to be trapezoidal, or just triangular.
Once seen that the membership functions of fuzzy sets mean nothing else than a
‘measure of the meaning’ of its linguistic label, it can be remembered the famous
words of Lord Kelvin shortened to ‘If you cannot measure it, it is not science’.
There are, notwithstanding and at least, two aspects introducing important differences
between Lord Kelvin’s times and ours. In the first place, it is the fact that if, lets it
say, science is essentially concerned with matter and energy, fuzzy set theory is
concerned with knowledge and information, and directly related with the so called
Information Technologies. In a second place, in Lord Kelvin’s science there were
and are known systematic procedures and laboratory methods, to measure the basic
parameters of the studied things, but now and for what concerns, for instance, the
How I Saw, and How I See Fuzzy Sets 19
design of membership functions, the situation is different and more linked to some
analogy with virtual objects, than with physically real objects. It is not the same to
study the chemical composition of an organic product, or the movement of a star, than
to study the meaning of a written piece, or the control of a machine whose behavior
is known by the knowledge of the experts in their functioning and once linguistically
described. Nevertheless, this is the kind of problems currently worrying Artificial
Intelligence.
Anyway, the evolution of fuzzy logic towards Zadeh’s Computing with Words and
Perceptions, CwW for short [17], is conducting towards the mathematical represen-
tation of statements larger and more complex than the more or less simple rules
considered in fuzzy control [8]. This will mean to face with the necessity of con-
sidering different ways of expressing conditional statements, and the already known
linguistic connectives ‘and’, ‘or’, ‘not’, etc., since there is not a universal form of
expressing them in language, like it is in classical logic and set theory, but respectively
represented in fuzzy logic by residuated implications, S-implications, conjunctive
implications, t-norms, t-conorms, negation functions, etc. For all that there are a lot
of mathematical models facilitating to fuzzy logic a remarkable armamentarium for
the representation of statements, and for doing deductive inferences with them, but
what is not yet clear enough are the linguistic subjects to which such armamentarium
is applicable, and to which is not. For instance, fuzzy logic only considers function-
ally expressible connectives, but no suitable criteria are known for recognizing this
hypothesis in concrete cases, and, analogously, the use of non strong but continuous
negations is not yet spread into fuzzy logic applications to represent language. Even
more, almost always the used connectives are min, max, prod, and 1-id; there exists
a big separation between what is employed by practitioners of fuzzy logic, and what
is kept in the theoretical armamentarium generated by mathematicians.
In sum, it seems that fuzzy logic is approaching the time in which it should face a
turning point. The great subjects fuzzy logic deals with are linguistic imprecision and
non-random uncertainty, not to say anything on the very important but scientifically
almost pending subjects of ambiguity, the presence of multiple meanings, and com-
mon sense non-deductive reasoning [11] with imprecise, non-randomly uncertain,
and ambiguous words.
The only way to properly afford it is, in the author’s view, the transformation of
fuzzy logic in a kind of ‘physics’ of imprecision, non-random uncertainty and ambi-
guity. That is, in a new experimental science that, based in Natural Language, can
count with mathematical models able to give important parameters to be experimen-
tally computed at each case once their can be found in the same study of language, and
not by abstract mathematical thought considerations. What is needed is to transform
the study of language from a logic one in a scientific one.
20 E. Trillas
When fuzzy logic was initially developed in the past Century’s seventies and
eighties, almost the only back referents for its study were classical and multiple-
valued logics, but now it should be centered in Natural Language. If current fuzzy
logic already meant an important progress in the way asked by John von Neumann
of introducing mathematical analysis in the study of those subjects without a just
‘yes’ or ‘not’ hypothesis for its validity, it can be the right moment to go a step ahead
and turning towards the Artificial Intelligence’s ‘Gordian Knot’ of trying to reach
computers thinking like people usually do.
6 Conclusion
Up to some point, and although many papers of a mathematical character, even with
some of them of a true mathematical quality, are being continuously published in
the setting of ‘theoretic fuzzy logic’, its evolution seems to be actually stagnated
because of some moving away of what is the essence of fuzzy logic. By one side,
those papers remain practically unknown or, at least, not considered, for those who
devote their efforts to the applications of fuzzy logic, and by the other the motivation
of their authors is almost always purely abstract; in them, it rarely appears a ‘real
fuzzy problem’ to which either their results could be applied to, or just it can be
suggested by the paper’s content. It seems as if in current fuzzy logic it were two
streams, that of mathematicians and that of engineers, but fuzzy logic should be an
integrated study of what is ‘fuzzy’ and, in principle, that practitioners ignore the
obtained mathematical results, marks a limit in their capability of designing fuzzy
systems. There is, perhaps, some kind of isolation between both types of researchers,
the most relevant of the ones not mixed with the most relevant of the others. This and
to some extent, goes against the cross-fertilization of both groups and can contribute
to the closing of the first in their own mathematical interest. Anyway, and in the
last years, I hopefully heard on mixed groups working in some specific projects.
Notwithstanding, and as far as I know, such projects are on very specific topics not
directly related with CwW.
In the author’s view, and by looking at what is fuzzy and what is for its study, the
great challenge for the best continuation of theoretic fuzzy logic lies in the problems
that are in the back of the new Zadeh’s ‘Computing with Words and Perceptions’,
where the problems that were essential for the introduction of fuzzy sets could acquire
all their relevance as clearly dealing with Natural Language’s complex phrases, and
with the non-deductive varieties of Commonsense Reasoning. Nevertheless, the nat-
ural and dynamic characters of both language and reasoning, seems to suggest that
a new, and scientific, study of them cannot be completely afforded by only counting
with the abstract reasoning reached through mathematical theorems that only can be
successfully applied provided all what is being supposed for their proofs is actually
verified in a concrete and actual situation. Something that is, usually, very difficult
to check as, it happens, for instance, when trying to use an S-implication function
How I Saw, and How I See Fuzzy Sets 21
for linguistic rules in which the representation of the negation of their antecedents is
actually unknown.
It is in the thought of the author that the main subjects of ‘fuzzy logic’ or, by
extension CwW, are both the representation and technical management of the impre-
cision and the uncertainty pervading natural language and commonsense reasoning
in non-trivial statements. In some cases, for instance, the meaning of the components
of a large statement is only captured after having captured the full meaning of the
full statement, something different of what is done in logic where always it is done
by departing from the meaning of the components.
To afford those subjects it seems recommendable to face them as they are, natural
phenomena of which, and in addition, we have a scarce knowledge that, notwith-
standing, should be increased by the only way it can be followed for any natural
phenomena, namely, by experimenting in controlled forms as it is typical of science.
It is with the conjunction of experimentation and mathematical modeling how mea-
surable parameters can be obtained and deep conclusions attained. Science always
needs to count with suitable frames for representing what it deals with, thanks to
which some mathematical models could be established and that, at its turn, facili-
tates some numerical parameters necessary to going on with more experimentation.
À la Popper, research is always an un-ended quest.
A new experimental science dealing with linguistic imprecision and uncertainty,
both random and not random, seems to appear in the horizon and into the complex
knitting of language. It is an enterprise that jointly with, and close to, fully knowing
the brain’s functioning, could contribute to capture what is rationality by going
far from old metaphysical ideas, and by means of the single way mankind has for
acquiring safe knowledge, the scientific method.
Would young researchers in the XXI Century devote their efforts to such a chal-
lenging enterprise!
Acknowledgments This paper is partially funded by the ‘Foundation for the Advancement of Soft
Computing’, Mieres (Asturias), Spain.
References
1. Basáñez, L., Batle, N., Ferraté, G., Grané, J., Trillas, E.: A first approach to sigma-transform.
J. Math. Anal. Appl. 92(1), 224–233 (1983)
2. Bellman, R., Giertz, M.: On the analytic formalism of fuzzy sets. Inf. Sci. 5, 149–156 (1973)
3. Hardegree, G.M.: The conditional in quantum logic. Synthese 29, 63–80 (1974)
4. Lowen, R.: On fuzzy complements. Inf. Sci. 14, 107–113 (1978)
5. Mamdani, E.H., Assilian, S.: An experiment in linguistic synthesis with a fuzzy logic controller.
Int. J. Hum. Comput. Stud. 7(1), 1–13 (1975)
6. Menger, K.: Statistical metries. Proc. Nat. Acad. Sci. USA 28(12), 535–537 (1942)
7. Nguyen, H.T., Walker, E.A.: A First Course in Fuzzy Logic. Chapman & Hall, Boca Raton
(2000)
8. Trillas, E., Guadarrama, S.: Fuzzy representations need a careful design. Int. J. Gen. Sys. 39(3),
329–346 (2010)
22 E. Trillas
9. Trillas, E.: A model for ‘crisp reasoning’ with fuzzy sets. Int. J. Intell. Syst. 27, 859–872 (2012)
10. Trillas, E.: En defensa del razonamiento creativo, Universidad Pública de Navarra (2014)
11. Trillas, E.: Razonamiento; significado, incertidumbre y borrosidad; Ed. Upna, Pamplona (2015)
12. Trillas, E.: An algebraic model of reasoning to support Zadeh’s CwW (2015)
13. Trillas, E., Eciolaza, L.: Fuzzy logic: an introductory course for engineering students. Springer
(2015)
14. Trillas, E., Termini, S., Moraga, C.: A naïve way of looking at fuzzy sets. Fuzzy Sets Syst. In
press (2015)
15. Williams, C.P., Clearwater, S.H.: Explorations in Quantum Computing. Springer, NY (1988)
16. Zadeh, L.A.: Fuzzy sets. Inf. Control 8, 338–353 (1965)
17. Zadeh, L.A.: Computing with Words: Principal Concepts and Ideas. Springer, Berlin (2012)
Modules in the Category Sup
Ulrich Höhle
Abstract This chapter explains basic properties of left modules on unital quantales
with the perspective towards fuzzy set theory. Typical constructions such as the
fuzzy power set, Zadeh’s forward operator or binary operations defined according
to Zadeh’s extension principle are constructions in the symmetric monoidal closed
category of complete lattices and join preserving maps. Moreover, involutive left
modules play a significant role in the representation theory of C ∗ -algebras.
1 Introduction
L.A. Zadeh has tacitly used the unital quantale given by the canonical M V -algebra
as algebraic basis.
• Let Q be a unital quantale and X be a set. Then the Q-valued (i.e. Q-fuzzy) power
set of X is the free left Q-module generated by X . This result appears for the
first time in [12] and describes the fuzzy power set by a universal property in the
language of module theory in Sup.
U. Höhle (B)
Fachbereich C Mathematik und Naturwissenschaften, Bergische Universität,
Wuppertal, Germany
e-mail: [email protected]
• Left Q-modules and complete Q-valued (i.e. Q-fuzzy) ordered sets are equivalent
concepts—a result which goes back to I. Stubbe in a more general context given
by quantaloid enriched categories (cf. [29]).
• Binary operations constructed according to Zadeh’s extension principle are the
tensor product of the respective Minkowski multiplications with the multiplication
of the underlying unital quantale (cf. [8]).
Because of the previous observations we believe that it is meaningful to put together
various important aspects of module theory in Sup with the perspective towards
fuzzy set theory. In this sense this paper is a survey on some basic properties of left
modules on unital quantales.
We begin with the construction of the tensor product in Sup which goes back
to Z. Shmuely 1974 (cf. [27]). Since there exist various approaches to the tensor
product (cf. [3, 12]), we prefer here the understanding of the tensor product as
“function space”—i.e. as a complete lattice of join reversing maps (cf. Proposition
1 on p. 6 in [12]). Subsequently, we view unital quantales as monoids in Sup and
develop the usual module theory in Sup including complete many-valued ordered
sets which represent a typical phenomenon of left-modules in Sup. In this context
it is interesting to realize that involutive left modules play a significant role in the
representation theory of C ∗ -algebras (cf. [13, 14, 16, 24]).
Let Sup be the category of complete lattices and arbitrary join preserving maps. It
is well known that Sup is a complete and cocomplete category. In particular, regular
quotients of complete lattices L can be identified with closure operators on L—these
c
are isotone self-maps L − → L provided with the properties 1 L ≤ c and c ◦ c ≤ c.
With regard to algebraic considerations the most important property of Sup is the
existence of a tensor product transforming Sup into a symmetric monoidal closed
category (cf. [17]). The purpose of this section is to recall these fundamental proper-
ties and to explain the role of the tensor product for the construction of fuzzy power
sets.
f
Let L and M be complete lattices.
Amap L − → M is join reversing if for all
subsets A of L the relation f ( A) = f (A) holds. Obviously, join reversing
f
maps L − → M are always part of a Galois connection ( f, g) between L and M (cf.
g
[4, 7]) where the second polarity M −
→ L is determined by
g(m) = { ∈ L | m ≤ f ()} (1)
f
Therefore we denote the set of all join reserving maps L −
→ M by G(L , M) and
consider the following partial order on G(L , M):
Modules in the Category Sup 25
Obviously, (G(L , M), ≤) is a complete lattice in which meets (but in general not
joins) are computed pointwisely. In this context, the formula (1) determines an order
isomorphism between G(L , M) and G(M, L).
b
Let N be a further complete lattice. Referring to [3] a map L × M −→ N is a
bimorphism (in Sup) iff b is join preserving in each variable separately—i.e. for
all m ∈ M and ∈ L the correspondences → b(, m) and m → b(, m) are join
preserving.
It is evident that β is join preserving in its second argument. In order to verify that β is
also join preserving in its first argument, it is sufficient to consider a non empty subset
{i | i ∈ I } of L. Then f ∈ G(L , M) is an upper bound of A := {β(i , m) | i ∈ I }
iff m ≤ f (i ) holds
for
all i ∈ I .
Since f
is join reversing, f is an upper bound
of A iff m ≤ f i . Hence, β i , m is the smallest upper bound of A—i.e.
i∈I
i∈I
β(i , m) = β i , m .
i∈I i∈I
Definition 1 Let L and M be complete lattices. A pair (β, X ) is called the tensor
β
product of L and M, if X is a complete lattice and L × M −
→ X is a bimorphism
such that the following universal property holds:
b
For every bimorphism L × M −
→ N there exists a unique join preserving map
h
X−
→ N making the following diagram commutative:
26 U. Höhle
β
L×M / X
??
??
? (5)
b ??
h
N
In the special case of Sup the next theorem is a version of the Corollary of
Proposition 5 in [3].
Theorem 1 For every pair (L , M) of complete lattices L and M the tensor product
(β, X ) exists and is unique up to an isomorphism in the sense of Sup.
Proof The uniqueness of the tensor product is an immediate corollary from its uni-
versal property.
In order
to verify the existence of the tensor product, we consider
the pair β, G(L , M) constructed in Example 1.
b f
Let L × M − → N be a bimorphism. Since every join reversing map L −
→ M is the
join of {β(, f ()) | ∈ L} in G(L , M)—i.e.
f = β(, f ()), (6)
∈L
h
there exists at most a join preserving map G(L , M) −
→ N making the diagram
β
L×M / G(L , M)
JJJ
JJJ (7)
JJJ h
b
$
N
commutative.
h
In order to establish the existence of a join preserving map G(L , M) −
→ N s.t. the
h
diagram (7) commutes, we proceed as follows. We define a map G(L , M) −
→ N by
h( f ) = b(, m), A ⊆ L × M, β(, m) = f. (8)
(,m)∈A (,m)∈A
Provided that we can show that h is well defined, then formula (8) shows immediately
that h is join preserving and makes the diagram (7) commutative. Therefore we only
h is well defined. For this purpose we choose a further subset B of L × M
verify that
with β(, m) = f . Since N is complete, it is sufficient to verify that the
(,m)∈B
respective sets of upper bounds of {b(, m) | (, m) ∈ A} and {b(, m) | (, m) ∈ B}
coincide.
First we apply the property that b is join preserving in each variable separately
fz
and define a join reversing map L −→ M for all z ∈ N as follows
f z () = {m ∈ M | b(, m) ≤ z}, ∈ L .
Modules in the Category Sup 27
If z ∈ N be an upper bound of {b(, m) | (, m) ∈ A}, then m ≤ f z () holds for all
(, m) ∈ A. Since f is the join of {β(, m) | (, m) ∈ A}, we obtain f ≤ f z . Since
f is also the join of {β(, m) | (, m) ∈ B}, the relation
follows. Because of b(, f z ()) ≤ z we infer from (9) that z is also an upper
bound of {b(, m) | (, m) ∈ B}. Interchanging the role of A and B the assertion
follows.
Because of the previous theorem we fix the following notation and terminology.
The tensor product of two complete lattices L and M is denoted by L ⊗ M. By abuse
β
of notation the bimorphism L × M − → L ⊗ M is also denoted by ⊗, and instead of
β(, m) we also write ⊗ m. Elements of L ⊗ M are called tensors, and tensors of
the special type ⊗ m are called elementary tensors. It follows immediately from (6)
that every tensor is a join of an appropriate family of elementary tensors. Sometimes,
in the case of = ⊥ and m = ⊥, the following equivalence is useful:
⊗ m ≤ ⊗ m ⇐⇒ ≤ and m ≤ m .
Let L be a complete lattice. We show that the L-fuzzy power set is the tensor
product of L with the ordinary power set. For this purpose we choose a set X and
consider the partial ordering ≤ on L X defined by:
g1 ≤ g2 ⇔ ∀x ∈ X : g1 (x) ≤ g2 (x).
Then L X is a complete lattice. In the literature on fuzzy sets L X is called the L-fuzzy
or L-valued power set of X .
Theorem 2 ([8]) Let L be a complete lattice, X be a set, and let P(X ) be the ordinary
ΦL
power set of X . There exists an order isomorphism L X −−→ L ⊗ P(X ) defined by:
Φ L (g) = g(x) ⊗ {x}, g ∈ L X .
x∈X
Proof It is easily seen that every element g ∈ L X can be identified with a join revers-
Fg
ing map P(X ) −→ L
Fg (A) = g(x), A ∈ P(X ),
x∈A
ΦL
and vice versa. Hence the map L X −−→ G L , P(X ) = L ⊗ P(X ) defined by
is bijective and isotone. Since the inverse map of Φ L has the form
[Φ L−1 ( f )](x) = { ∈ L | x ∈ f ()}, f ∈ G(L , P(X )),
A := {g(x) ⊗ {x} | x ∈ X }.
An element k ∈ L ⊗ P(X ) is an upper bound of A iff {x} ⊆ k(g(x)) holds for all
x ∈ X iff {x ∈ X | ≤ g(x)} ⊆ k() holds for all ∈ L. Hence Φ L (g) (cf. (10)) is
the smallest upper bound of A.
It follows from the universal property of the tensor product that the tensor prod-
uct in Sup induces a bifunctor (also denoted by ⊗) from Sup × Sup to Sup. In
h1 h2
particular, the tensor product of join preserving maps L 1 −→ L 2 and M1 −→ M2 is
determined by the commutativity of the following diagram:
⊗
L 1 × M1 / L 1 ⊗ M1
h 1 ×h 2 h 1 ⊗h 2
L 2 × M2 / L 2 ⊗ M2
⊗
h ϕ
Further, let L −
→ M be a join preserving map. Then for any map X −
→ Y the variable-
→
(ϕ,h)
basis power set forward operator L X −−−−→ M Y has the form (cf. 3.9(6) in [25]):
[(ϕ, h)→ (g)](y) = {h ◦ g(x) | ϕ(x) = y}, y ∈ Y.
ϕ h
Corollary 1 Let X − → Y be a map and L −
→ M be a join preserving map. Then the
following diagram is commutative:
(ϕ,h)→
LX / MY
ΦL ΦM
L ⊗ P(X ) / M ⊗ P(Y )
h⊗P(ϕ)
Modules in the Category Sup 29
Because of the previous corollary the variable-basis power set forward operator
coincides with the tensor product of the traditional power set forward operator (i.e. the
operator taking direct images) with the join preserving map performing the change
of basis. In this sense the tensor product of Sup plays a fundamental role in the
mathematical foundations of fuzzy set theory.
The next important observation is the property that for every complete lattice M
the endofunctor F : Sup → Sup defined by
h
F(L) = L ⊗ M, L1 −
→ L 2 , F(h) = h ⊗ 1 M
has a right adjoint functor G. This result follows from the general constructions in
[3]. But we give here a direct proof.
h
For complete lattices M and N the set [M, N ] of all join preserving maps M −
→N
is a complete lattice w.r.t. the pointwisely defined order. Then G : Sup → Sup is
defined by:
k
G(N ) = [M, N ], N1 −
→ N2 , [G(k)](h) = k ◦ h, h ∈ [M, N1 ].
ev N
Further, the evaluation map [M, N ] × M −−→ N with ev N (h, m) = h(m) is a bimor-
phism. Because of the universal property of the tensor product there exists a unique
εN
join preserving map [M, N ] ⊗ M −→ N making the diagram
⊗
[M, N ] × M / [M, N ] ⊗ M
OOO
OOO
OOO εN
ev N OOO
'
N
h⊗1 M
L⊗M / [M, N ] ⊗ M
OOO
OOO
OOO εN
OOO
h OOO
'
N
bh h
Proof The bimorphism L × M −→ N corresponding to L ⊗ M −
→ N is given by
Since joins in [M, N ] are computed pointwisely, the map h is in fact join
preserving.
h
The previous theorem motivates the following terminology. Let L ⊗ M −
→ N be
h
a join preserving map. Then the join preserving map L −−→ [M, N ] determined by
(11) is called the monoidal adjoint map of h.
The correspondence h → h induces an order isomorphism from
[L ⊗ M, N ] onto [L , [M, N ]] (Proposition 5 in [3]). As a corollary of this fact
we obtain the associativity of the tensor product.
Proof By X op we denote the dual lattice of X . Hence the tensor product L ⊗ M has
the form L ⊗ M = [L , M op ]op .
If we choose ( ⊗ m) ⊗ n ∈ (L ⊗ M) ⊗ N = [L ⊗ M, N op ]op , then it is not
difficult to conclude from (11) that the relation ( ⊗ m) ⊗ n = ⊗ (m ⊗ n)
holds. Hence the formation of taking monoidal adjoint maps is the desired order
isomorphism from (L ⊗ M) ⊗ N = [L ⊗ M, N op ]op to [L , [M, N op ]]op =
L ⊗ (M ⊗ N ).
Comment The associativity of the tensor product appears already as Theorem 1.7
in [27], but without an explicit statement of property (12).
Lemma 1 Let L and M be complete lattices. Then there exist a unique order
cL M
isomorphism L ⊗ M −−→ M ⊗ L satisfying the following condition for all pairs
(, m) ∈ L × M:
c L M ( ⊗ m) = m ⊗ . (13)
The next lemma explains that the 2-chain is the unit object w.r.t. the tensor product
in Sup (cf. [12, p. 6, Proposition 2(ii)]).
Lemma 2 Let L and M be complete lattices and let 1l = {0, 1} be the lattice with two
lM rL
different elements. Then there exist order isomorphisms 1l ⊗ M −
→ M and L ⊗ 1l −
→
L such that the following diagrams are commutative:
(L ⊗ 1l) ⊗ M
a X 1lY
/ L ⊗ (1l ⊗ M) L ⊗ 1l
c L1l
/ 1l ⊗ L
OOO ??
OO' oo ?
r ⊗1
L M
wooo1 ⊗l
L M r L ? l L
(14)
L⊗M L
Proof Let 0 be the bottom element of two element lattice 1l = {0, 1}. For f ∈ L ⊗ 1l
and g ∈ 1l ⊗ M we define 0 ∈ L and m 0 ∈ M by
0 = { ∈ L | f () = 1} and y0 = g(1)
Since (Sup, ⊗, a, c, l, r, 1l) is bi-closed (cf. Fact and [15]) and Sup is cocomplete,
we conclude from the free algebra algorithm (cf. [1] and p. 186 in [2]) that the
algorithm converges for the endofunctor T : Sup → Sup determined by
h
T(X ) = X ⊗ X, X−
→ Y, T(h) = h ⊗ h.
Hence every complete lattice generates a free magma (i.e. free prequantale) (cf. [6]).
A prequantale (X, ∗) is a quantale (cf. [26]) if ∗ is associative—i.e. (x ∗ y) ∗ z =
x ∗ (y ∗ z). A quantale is unital, if ∗ has a unit e—i.e. x ∗ e = e ∗ x for all x ∈ X .
A homomorphism h is unital, if h preserves the respective units—i.e. h(e) = e.
Let (X, ∗, e) be a unital quantale. If we identify the multiplication with its corre-
sponding binary operation and the unit with the join preserving map
1l = {0, 1} −
→ X sending 1 to e, then (X, , e) is a monoid in the sense of the sym-
metric monoidal category (Sup, ⊗, a, c, l, r, 1l) (cf. Corollary 2, Lemmas 1 and 2).
Since (Sup, ⊗, a, c, l, r, 1l) is bi-closed and Sup is cocomplete, every complete lat-
tice generates a free monoid (i.e. a free unital quantale) (cf. Theorem 2 on p. 172
in [17]).
Example 2 Let IN0 be the set of natural numbers including 0, and let 1l be the unit
object of the tensor product in Sup. Then the free unital quantale generated by 1l
coincides with the power set P(IN0 ) of IN0 equipped with the Minkowski addition
—i.e.
A B = {a + b | a ∈ A, b ∈ B}, A, B ∈ P(IN0 ).
η1l
In fact, if 1l = {0, 1} −→ P(IN0 ) is the join preserving embedding with η1l (1) =
{1} (and η1l (0) = ∅), then for every further unital quantale (X, ∗, e) and for every
h
further join preserving map 1l −
→ X (e.g. h(1) = x) there exists a unique unital
h
homomorphism P(IN0 ) −→ X with the property h ◦ η1l = h. In particular, h is
given by:
h (A) = {x n | n ∈ A}, A ∈ P(IN0 )
Φ
(X 1 ⊗ X 2 ) ⊗ (X 1 ⊗ X 2 ) −
→ (X 1 ⊗ X 1 ) ⊗ (X 2 ⊗ X 2 )
satisfying the condition Φ (x1 ⊗ x2 ) ⊗ (y1 ⊗ y2 ) = (x1 ⊗ y1 ) ⊗ (x2 ⊗ y2 ) for all
1 2
x1 , y1 ∈ X 1 and x2 , y2 ∈ X 2 . Hence, if X 1 ⊗ X 1 −−→ X 1 and X 2 ⊗ X 2 −−→ X 2
denote the binary operations corresponding to ∗1 and ∗2 , then there exists a binary
34 U. Höhle
Example 3 Let (X, ·, e) be a monoid in Set and P(X ) be the power set of X provided
with the set inclusion as partial order and the Minkowski multiplication w.r.t. the
multiplication · on X . Then (P(X ), , {e}) is a unital quantale.
Let (Q, ∗, e) be a further unital quantale. The binary operation on the Q-valued
power set Q X defined according to Zadeh’s extension principle is given by:
(g1 g2 )(x) = g1 (x1 ) ∗ g2 (x2 ), g1 , g2 ∈ Q X , x ∈ X. (16)
x1 ·x2 =x
If we now identify the tensor product Q ⊗ P(X ) with Q X (cf. Theorem 2), then
coincides with the tensor product of ∗ with . In fact, because of (15) the relation
(g1 g2 )(x) ⊗ {x} = g1 (x1 ) ∗ g2 (x2 ) ⊗ {x1 · x2 }
x∈X x1 ,x2 ∈X
= g1 (x1 ) ∗ g(x2 ) ⊗ {x1 } {x2 }
x1 ,x2 ∈X
= g1 (x1 ) ⊗ {x1 } g(x2 ) ⊗ {x2 }
x1 ,x2 ∈X
= g1 (x) ⊗ {x} g2 (x) ⊗ {x} .
x∈X x∈X
For further details on the tensor product of unital quantales the reader is referred
to [8].
Modules in the Category Sup 35
Proof Since the transposition of the multiplication preserves the associativity law, the
triple (X , ∗op , e ) with x ∗op y = y ∗ x is again a unital quantale. Then ι is given
η X ◦ι ι
by the extension of X −−→ X to a unique unital homomorphism (X , ∗, e ) −
→
(X , ∗op , e ).
Because of the previous lemma there exist a plenty of involutive and unital quan-
tales. We finish this section with two prominent examples. (cf. [21]).
Example 4 Let A be a unital C ∗ -algebra with involution ∗ (cf. Sect. 6.3), and let
Max(A) be the set of all closed linear subspaces of A. The adjoint of a closed,
linear subspace M has the form M
= {a ∗ | a ∈ M}. If M and N are closed, linear
then the product M ∗ N is given by the closure of the set of all finite
subspaces,
sums ai · bi with ai ∈ M and bi ∈ N —i.e. M ∗ N = M N . Hence the restriction
of ∗ to closed ideals coincides with the usual ideal multiplication. In particular,
(M ∗ N )
= N
∗ M
holds for all M, N ∈ Max(A).
On Max(A) we consider the partial order determined by the set inclusion. Then
Max(A) is a complete lattice, and the multiplication ∗ of closed linear subspaces is
join preserving in each variable separately. Hence (Max(A), ∗,
) is an involutive and
unital quantale. It is well known that Max(A) characterizes A up to an ∗-isomorphism
(cf. Theorem 3.3 in [16]). Sometimes Max(A) is also called the spectrum of A.
λ1 ≤ λ2 =⇒ λ02 ≤ λ01 , λ1 , λ2 ∈ L .
Hence λ −→ λ0 is an anti-automorphism of L.
36 U. Höhle
Then ([L , L], ◦, 1 L ) is a unital quantale. Further, the order reversing involution 0 on
L induces an order preserving involution
on [L , L] by:
0
f
(λ) = f (λ0 ) , λ ∈ L
aQQM (17)
Q ⊗ (Q ⊗ M) / Q⊗M / M
1Q ⊗
1l ⊗ M / Q⊗ M
e⊗1 M
??
??
??
??
?? (18)
M ??
??
M
Modules in the Category Sup 37
Q⊗ M
h⊗1 M
/ [M, M] ⊗ M o ⊗ [M, M] × M
OOO
OOO ooo
OOO εM
ooooo
OOOO o ev M
O' woooo
M
Proof Since Sup is monoidal closed (cf. Theorem 3), the homomorphism h coincides
with the monoidal adjoint of the left-action . The fact that h preserves the algebraic
structure follows from a chase of diagrams or by a simple calculation using the
axioms (M1) and (M2) directly.
The previous proposition says that left Q-modules M can simply be characterized
by unital homomorphisms from Q to [M, M].
Before we continue, we first give a simple example of a left Q-module.
f
Example 6 Let X be a set and Q X be the set of all maps X −
→ Q provided with the
partial order defined pointwisely—i.e.
Then Q X is the Q-valued power set of X (cf. Sect. 2), and the left action on Q X
is determined by
h
Left Q-module homomorphisms are join preserving maps M − → N preserving the
respective left actions—i.e. the commutativity of the following diagram:
1Q ⊗h
Q⊗ M / Q⊗ N
M / N
h
⊗1 X (20)
$
Q⊗ X
Since in [17] the commutativity of the diagrams (17) and (18) has not been verified
in the general case of monoidal categories, we insert here this information in the
special case of Sup for the convenience of the reader.
Lemma 4 Let X be a complete lattice. Then the join preserving map
Q ⊗ (Q ⊗ X ) −
→ Q ⊗ X determined by (20) is a left action on Q ⊗ X .
Proof Since tensors in Q ⊗ X are joins of elementary tensors, we restrict our interest
to elementary tensors and choose x ∈ X and α, β, γ ∈ Q. Since is associative, we
obtain:
α (β (γ ⊗ x)) = α (β γ) ⊗ x
= (α (β γ)) ⊗ x
= ((α β) γ) ⊗ x
= (α β) (γ ⊗ x).
Corollary 3 ([12]) Let X be a set and P(X ) be the power set of X . Then the
ΦQ
order isomorphism Q X −−→ Q ⊗ P(X ) specified in Theorem 2 is a left Q-module
isomorphism.
Proof We maintain the notation from Example 6 and Lemma 4. In order to ver-
ify the relation α ΦQ (g) = ΦQ (α g) for all α ∈ Q and g ∈ Q X we refer to
Theorem 2 and obtain:
α ΦQ (g) = α g(x) ⊗ {x}
x∈X
= α (g(x) ⊗ {x})
x∈X
= (α g(x)) ⊗ {x}
x∈X
= (α ∗ g(x)) ⊗ {x}
x∈X
= ΦQ (α g).
η XM
Theorem 4 Let X be a complete lattice, (Q, ∗, e) be a unital quantale and X −−→
Q ⊗ X be the join preserving map determined by η XM (x) = e ⊗ x for all x ∈ X . Then
h
for every left Q-module N and for every join preserving map X −
→ N there exists a
h
unique left Q-module homomorphism Q ⊗ X −→ N making the following diagram
commutative:
ηM
X OO
X
/ Q⊗ X
OOO
OOO
OOO h (21)
h OOO
'
N
Proof (a) (Uniqueness). Let h be a left Q-module homomorphism making the dia-
gram (21) commutative. Then for all x ∈ X the relation h (e ⊗ x) = h(x) follows.
Because of (20) (cf. Lemma 4) we obtain for γ ∈ Q and x ∈ X :
(b) (Existence). Let be the left action on N . We define a join preserving map
h
Q ⊗ X −→ N by
h = ◦ (1Q ⊗ h). (22)
Then for every α ∈ Q and for every elementary tensor γ ⊗ x the relation
h (α (γ ⊗ x)) = h ((α γ) ⊗ x)
= [ ◦ (1Q ⊗ h)] (α γ) ⊗ x
= (α γ) h(x)
= α (γ h(x))
= α h (γ ⊗ x).
holds. Since h is join preserving and every tensor is a join of elementary tensors, h
is obviously a left Q-module homomorphism.
Finally, since e is the unit of Q, we conclude from (22) that the relation
h (e ⊗ x) = e h(x) = h(x)
F1 : Sup −
→ Mod(Q)
Corollary 5 The forgetful functor from U2 : Mod(Q) → Set has a left adjoint.
Proof Let X be a set, P(X ) be the ordinary power set of X , and let L be a complete
f
lattice. Since every map X −
→ L has a unique extension to a join preserving map
f
P(X ) −→ L with
f (A) = f (x), A ∈ P(X ),
x∈A
Modules in the Category Sup 41
the forgetful functor U0 : Sup → Set has a left adjoint functor F0 : Set → Sup
which sends a set X to its power set P(X ). Since adjoint situations compose, we
conclude from Corollary 4 that F2 = F1 ◦ F0 is left adjoint to U2 = U0 ◦ U1 .
It follows from Corollaries 3, 4 and 5 that the monad corresponding to the adjoint
situation F2 −−| U2 is the Q-valued power set monad (PQ , η Q , μQ ) (on Set) where
ϕ
PQ (X ) = Q X ,X− → Y, [PQ (ϕ)(g)](y) = {g(x) | ϕ(x) = y}, y ∈ Y,
−1 e, z = x
ηQ
X (x) = ΦQ (e ⊗ {x}) = 1{x} , 1{x} (z) = ⊥, z = x, z, x ∈ X,
[μQ G(g) ∗ g(x), x ∈ X, G ∈ QQ .
X
X (G)](x) =
g∈Q X
The Q-valued power set Q X is the free left Q-module generated by the set X .
This observation goes back to Joyal and Tierney (cf. p. 10 in [12]) and means
from a mathematical point of view that fuzzy set theory is module theory on unital
quantales. The next section is a confirmation of this insight.
At the end of this section we would like to draw the attention of the reader to the
interesting fact that submodules of Q X play a strategic role in the study of stratified
Q-valued topological spaces (cf. p. 180 in [9]).
First we recall the axioms of a many-valued preorder (cf. [5, 11, 23]). Let (Q, ∗, e)
p
be a unital quantale and X be a set. A map X × X − → Q is a Q-valued preorder
(Q-preorder for short) if p satisfies the following properties for all x, y, z ∈ X :
e ≤ p(x, x), (Reflexivity)
p(x, y) ∗ p(y, z) ≤ p(x, z). (Transitivity)
Every Q-preorder p has an underlying ordinary preorder ≤ defined by
for all x, z ∈ X . Obviously, Q-homomorphisms are always isotone w.r.t. the under-
lying preorders. The class of Q-preordered sets and the class of Q-homomorphisms
form a category denoted by Pre(Q).
Let (X, p) be a Q-preordered set. A covariant Q-presheaf on (X, p) is a Q-fuzzy
f
set X −
→ Q which is right-extensional—i.e.
Hence P(X, p), d is a Q-ordered set.
ξ
Lemma 5 Let (X, p) be a Q-preordered
set and P(X, p) −
→ X be a Q-homomor-
phism satisfying the condition ξ p(x, ) = x for all x ∈ X . Then p is antisym-
ξ
metric, and every further Q-homomorphism P(X, p) − → X with ξ p(x, ) = x
coincides with ξ—i.e. ξ = ξ.
Proof Let us choose x, y ∈ X with e ≤ p(x, y) and e ≤ p(y, x). Then we conclude
from the transitivity of p that p(x, z) = p(y, z) holds for all z ∈ X . Hence x =
ξ( p(x, )) = ξ( p(y, )) = y follows—i.e. p is antisymmetric.
Modules in the Category Sup 43
ξ
If P(X, p) −→ X is a further Q-homomorphism with ξ p(x, ) = x for all x ∈ X ,
then we infer from (26) that ξ( f ) ≤ ξ( f ) holds where ≤ is the underlying order in
(X, p). Interchanging now the role of ξ and ξ the relation ξ = ξ follows from the
antisymmetry of ≤.
Since ξ is unique, ξ is also called the formation of arbitrary meets in (X, p).
Theorem 5 Let M be a left Q-module with the left action . There exists a Q-
preorder p on M provided with the following properties:
(i) p(x, y) = {α ∈ Q | α y ≤ x}, x, y ∈ M.
ξ
(ii) The map P(X, p) −
→ X defined by
ξ( f ) = f (x) x, f ∈ P(M, p). (28)
x∈M
Proof We define p by (i) and show that p is a Q-preorder. Because of (M2) the
reflexivity of p is evident. With regard to the transitivity of p we use (M1) and
observe:
p(x, y) ∗ p(y, z) z = p(x, y) ( p(y, z) z) ≤ p(x, y) y ≤ x.
Hence p(x, y) ∗ p(y, z) ≤ p(x, z) follows. Since the underlying preorder of p coin-
cides with the dual order of M, p is even antisymmetric—i.e. p is Q-order on M.
In order to verify (ii) we proceed
Because of p(x, z) z ≤ x it follows
as follows.
immediately from (28) that ξ p(x, ) = p(x, z) z = x holds for all x ∈ X .
z∈M
44 U. Höhle
follows. Hence the assertion follows immediately from Theorem 5, (28) and (19).
h P(h)
(X, p) −
→ (Y, q), P(X, p) −−→ P(Y, q),
[P(h)(g)](y) = g(x) ∗ q(h(x), y), y ∈ Y.
x∈X
Lemma 6 Let (X, p, ξ) be a complete Q-ordered set. Then the following diagram
is commutative:
P(ξ)
P(P(X, p), d) / P(X, p)
μ(X, p) ξ (29)
P(X, p) / X
ξ
Proof Let ≤op be the dual partial order of the underlying partial order of p—i.e.
holds for all x ∈ X . Now we apply (27) and again the property that ξ is a
Q-homomorphism and obtain
On the other hand, for all f 0 ∈ P(X, p) we observe F( f 0 ) ≤ d μ(X, p) (F), f 0 .
Hence the relation
F( f 0 ) ∗ p ξ( f 0 ), z ≤ p ξ(μ(X, p) (F)), ξ( f 0 ) ∗ p ξ( f 0 ), z ≤ p ξ(μ(X, p) (F)), z
is valid—i.e. P(ξ)(F) ≤ p ξ(μ(X, p) (F)), . Now we apply (27) and (30) and
obtain:
ξ P(ξ)(F) ≤op ξ μ(X, p) (F) . (33)
46 U. Höhle
Finally, since ≤op is antisymmetric, ξ P(ξ)(F) = ξ μ(X, p) (F) follows from (32)
and (33). Thus the diagram (29) is commutative.
Theorem 6 Let (X, p, ξ) be a complete Q-ordered set and ≤op be the dual order
) is a complete (ordinary)
w.r.t. the underlying partial order of p. Then (X, ≤op
lattice, and the map (α, x) −→ α · x = ξ α ∗ p(x, ) is a bimorphism. Moreover,
the join preserving map Q ⊗ X −
→ X determined by
α x = ξ α ∗ p(x, ) , α ∈ Q, x ∈ X (34)
is a left action on X —i.e. (X, ) is a left Q-module, and the following relation holds:
ξ(g) = g(x) x, g ∈ P(X, p). (35)
x∈X
Proof Let ⊥ be the universal lower bound of P(X, p) in the sense of the dual order
of the underlying partial order of d—i.e.
where ≤ is the partial order on Q. Because of (30) the element ξ(⊥) is the universal
≤ ). If A
is a non empty subset of Xop, then it follows immediately
lower bound of (X, op
from (30) that ξ p(x, ) is the join of A w.r.t. ≤ —this means that (X, ≤op )
x∈A
is a complete lattice.
·
(a) We define a map Q × X −
→ X by
α · x = ξ α ∗ p(x, ) , α ∈ Q, x ∈ X (36)
and show that · is a bimorphism (in Sup). Because of the definition of · the relation
⊥ · x = ⊥ is evident for all x ∈ X . On the other hand, let us consider the univer-
sal lower bound ⊥ = ξ(⊥) in (X, ≤op ). Then we define a right-extensional map
F
P(X, p) −
→ Q by
Obviously, μ(X, p) (F) coincides with the universal lower bound in P(X, p)—i.e.
μ(X, p) (F) = ⊥. Further, we observe:
[P(ξ)(F)](z) = α ∗ d(⊥, f ) ∗ p(ξ( f ), z) = α ∗ p(ξ(⊥), z), z ∈ X.
f ∈P(X, p)
Obviously, μ(X, p) (F) = α ∗ g holds. Now we apply Lemma 6 and use again the fact
that ξ is join preserving:
α · xi = ξ α ∗ p(xi , )
i∈I i∈I
= ξ(α ∗ g)
= ξ(μ(X, p) (F))
= ξ(P(ξ)(F))
= ξ α ∗ p(ξ(g), )
= α · ( xi ).
i∈I
Hence we have verified that · is join preserving in each variable separately w.r.t. ≤op .
(b) Because of the universal property of the tensor product there exists a unique join
preserving map Q ⊗ X − → X making the diagram
⊗
Q⊗ X / Q⊗ X
OOO
OOO
O
· OOO
'
X
48 U. Höhle
(c) In order to verify (35) we fix g ∈ P(X, p) and use its right-extensionality—i.e.
g(z) = g(x) ∗ p(x, z), z ∈ X.
x∈X
Hence (35) follows form (34) and the property that ξ is join preserving.
From Theorems 5 and 6 it follows that left Q-modules and complete Q-ordered
sets are equivalent concepts—a result which goes back to Stubbe in the more general
context of quantaloid enriched categories (cf. [29], see also Remark 5.6 in [10]).
In this context, complete Q-ordered sets emphasizes the many-valued (i.e. enriched
categorical) aspect, while left Q-modules refer to the algebraic properties of this
theory. Since in the fuzzy community pre-singletons of the form α ∗ p(x, ) are
viewed as Q-fuzzy points, it is important to realize that the left action of α on x in
the sense of Sup means the join of α ∗ p(x, ) w.r.t. the dual order determined by
p (cf. Theorem 5(i)).
Finally, we mention the fact that complete Q-ordered sets coincide with algebras
of the monad of covariant Q-presheaves (cf. Remark 5.7 in [10]).
is a bimorphism. Referring to the universal property of the tensor product in Sup the
right implication and the order preserving involution
on Q induce a left action
on M op determined as follows
α m = α
m, α ∈ Q, m ∈ M (37)
Modules in the Category Sup 49
In fact, the following relations are an immediate corollary of (M1) and (M2):
e m = e
m = e m = m,
α (β m) = α
(β
m) = (β
∗ α
) m = (α ∗ β) m.
Let ⊗ be the tensor product and 1l be the unit object in Sup (cf. Sect. 2). A 2-form
ϕ
on a complete lattice M is a join preserving map M ⊗ M − → 1l. A 2-form on M is
symmetric if the diagram
cM M
M⊗M / M⊗M
OOO
OOO ϕ
O
ϕ OOO
'
1l
In order to give a characterization of 2-forms we recall the simple fact that for
h
every complete lattice X a join preserving map X −
→ 1l can be identified with a
unique element z ∈ X —i.e.
0, x ≤ z,
h(x) =
1, x ≤ z.
ϕ
Hence by definition of the tensor product (cf. Sect. 2) a 2-form M ⊗ M −
→ 1l can be
f
identified with a join reversing map M −
→ M. In particular, a 2-form ϕ is symmetric
iff the chain of equivalences
holds for all m, n ∈ M—this means that ϕ is symmetric iff the corresponding join
reversing map f satisfies the following condition
m ≤ f ( f (m)), m ∈ M. (38)
Further, the monoidal adjoint map ϕ of a 2-form ϕ can be characterized by its
corresponding join reversing map f as follows:
Proposition 3 Let M be a complete lattice. There exists a bijective map between the
set of all symmetric and faithful 2-forms ϕ on M and the set of all order reversing
involutions 0 on M such that the condition
ϕ(m ⊗ n) = 0 ⇔ n ≤ m0
m ⊥n ⇔ n ≤ m0.
Theorem 7 Let (M, 0 ) be a complete De Morgan algebra and [M, M] be the invo-
lutive and unital quantale of all join preserving self-maps of M (cf. Example 5).
Further, let ϕ be the faithful and symmetric 2-form on M corresponding to the order
reversing involution α → α0 . Then there exists a bijective map between the set of
all left actions on M with the property that (M, ϕ) is an involutive left Q-module
h
and the set of all involutive and unital homomorphisms Q −
→ [M, M] such that the
following diagram is commutative:
Q⊗M
h⊗1 M
/ [M, M] ⊗ M
OOO
OOO
OO εM
OOO
O'
M
(a) Let us assume that fulfills (40) and h is the monoidal adjoint of . We maintain
the notation of Example 5 and obtain the following chain of equivalences for all
α ∈ Q and m, n ∈ M:
α m = [h(α)](m), α ∈ Q, m ∈ M, (41)
we obtain:
ϕ (α m) ⊗ n = 0 ⇔ α m ≤ n 0
⇔ [h(α)](m) ≤ n 0
⇔ m ≤ [h(α) ](n 0 )
⇔ [h(α)
](n) ≤ m 0
⇔ [h(α
)](n) ≤ m 0
⇔ α
n ≤ m 0
⇔ ϕ m ⊗ (α
n) = 0.
Hence (M, ϕ) is an involutive left Q-module w.r.t. the left action defined in (41).
M⊗M
h⊗h
/ N⊗N
OOO
OOO
O
ϕ OOO
ψ
'
1l
Modules in the Category Sup 53
h
In the case of faithful and symmetric 2-forms a left Q-module homomorphism M − →
N is a morphism iff h is orthogonal—i.e. h preserves and reflects the respective
orthogonality relations—i.e. m 1 ⊥ m 2 ⇔ h(m 1 )⊥ h(m 2 ) for all m 1 , m 2 ∈ M.
π(a) ≤ a , a ∈ A.
Example 7 ([19]) Let H be a Hilbert space and L(H) be the Banach algebra of
T
all bounded and linear operators H − → H. Then L(H) is a C ∗ -algebra w.r.t. to the
involution given by the formation of adjoint operators—i.e.
In this context the faithful and symmetric 2-form ϕ corresponds to the orthogo-
nal complementation in P(H). The pair (P(H), π ) is also called the canonical
involutive left Max(A)-module associated with the representation (π, H) of A.
The next results are due to Kruml and Resende [16] showing that involutive left
Q-modules play a significant role in the theory of operator algebras.
Theorem 9 (a) Let (π, H) be a representation of a unital C ∗ -algebra A = (A, +, ·)
and (P(H), π ) be the canonical involutive left Max(A)-module associated with
(π, H).
(a) A vector x ∈ H with x = 0 is cyclic for π iff π x = where x is the
1-dimensional subspace of H generated by x.
(b) The representation (π, H) is irreducible iff every atom a of P(H) is a generator
of P(H)—i.e. P(H) = {I π a | I ∈ Max(A)}.
(c) Two representations (π1 , H1 ) and (π2 , H2 ) of A are equivalent (i.e. there exists
T
a unitary transformation H1 − → H2 s.t. π2 (a) = T ◦ π1 (a) ◦ T ∗ holds for all
a ∈ A) iff the respective canonical involutive left Max(A)-modules associated
with (π1 , H) and (π2 , H) are isomorphic.
It is worthwhile to note that the previous results make use of fundamental princi-
ples of the theory of C ∗ -algebras (see e.g. Proposition 4.5.3 in [13], Theorems 10.2.7
and 10.2.10 in [14]).
Modules in the Category Sup 55
References
1. Adámek, J.: Free algebras and automata realization in the language of categories. Comment
Math. Univ. Carol. 15, 589–602 (1974)
2. Adámek, J., Trnková, V.: Automata and Algebras in Categories. Kluwer Academic Publishers,
Dordrecht (1990)
3. Banaschewski, B., Nelson, E.: Tensor products and bimorphisms. Cand. Math. Bull. 19, 385–
402 (1976)
4. Birkhoff, G.: Lattice Theory, Colloquium Publications, vol. 25, 3rd edn., eighth printing. Amer-
ican Mathematical Society, Rhode Island (1995)
5. Denniston, J.T., Melton, A., Rodabaugh, S.E.: Enriched categories and many-valued preorders:
categorical, semantical and topological perspectives. Fuzzy Sets Syst. 256, 4–56 (2014)
6. Eklund, P., Höhle, U., Kortelainen, J.: A survey on the categorical term construction with
applications. Fuzzy Sets and Syst. doi:10.1016/j.fss.2015.07.003
7. Galatos, N., Jipsen, P., Kowalski, T., Ono, H.: Residuated Lattices: An Algebraic Glimpse at
Substructural Logics, Studies in Logic, vol. 151. Elsevier, Amsterdam (2007)
8. Gutiérrez García, J., Höhle, U., Kubiak, T.: Tensor products in Sup and their application in
constructing quantales (submitted)
9. Höhle, U.: Many Valued Topology and Its Applications. Kluwer Academic Publishers, Boston
(2001)
10. Höhle, U.: Categorical foundations of topology with applications to quantaloid enriched topo-
logical spaces. Fuzzy Sets Syst. 256, 166–210 (2014)
11. Höhle, U.: Many-valued preorders I: the basis of many-valued mathematics. In: Magdalena, L.,
et al. (eds.) Enric Trillas: A Passion for Fuzzy Sets, Studies in Fuzziness and Soft Computing,
vol. 322, pp. 125–150. Springer, Heidelberg (2015)
12. Joyal, A., Tierney, M.: An Extension of the Galois Theory of Grothendieck, Memoirs of the
American Mathematical Society, vol. 51, Number 309. American Mathematical Society (1984)
13. Kadison, R.V., Ringrose, J.R.: Fundamentals of the Theory of Operator Algebras, Volume I
Elementary Theory, Graduate Studies in Mathematics Volume 15. American Mathematical
Society (1997)
14. Kadison, R.V., Ringrose, J.R.: Fundamentals of the Theory of Operator Algebras, Volume
II Adavanced Theory, Graduate Studies in Mathematics Volume 16. American Mathematical
Society (1997)
15. Kelly, G.M.: Basic Concepts of Enriched Category Theory, London Mathematical Society
Lecture Notes Series 64. Cambridge University Press (1982)
16. Kruml, D., Resende, P.: On quantales that classify C ∗ -algebras. Cahiers Topol. Géom. Différ.
Catég. 45, 287–296 (2004)
17. Mac Lane, S.: Categories for the Working Mathematician, 2nd edn. Springer (1998)
18. Manes, E.G.: Algebraic Theories. Springer, New York (1976)
19. Meise, R., Vogt, D.: Introduction to Functional Analysis, Oxford Gruaduate Texts in Mathe-
matics. Oxford University Press (1997)
20. Mulvey, C.J., Pelletier, J.W.: A quantisation of the calculus of relations, CMS Proceedings,
vol. 13, pp. 345–360. American Mathematical Society, Providence (1992)
21. Mulvey, C.J., Pelletier, J.W.: On the quantisation of points. J. Pure Appl. Algebra 159, 231–295
(2001)
22. Pelletier, J.W., Rosický, J.: Simple involutive quantales. J. Algebra 195, 367–386 (1987)
23. Pu, Q., Zhang, D.: Preordered sets valued in a G L-monoid. Fuzzy Sets Syst. 187, 1–32 (2012)
24. Resende, P.: Sup-lattice 2-forms and quantales. J. Algebra 276, 143–167 (2004)
25. Rodabaugh, S.E.: Powerset operator foundations for poslat fuzzy theories and topologies. In:
Höhle, U., Rodabaugh, S.E. (eds.) Logic, Topology, Theory, Measure, Mathematics of Fuzzy
Sets, pp. 91–116. Kluwer Academic Publishers (1999)
26. Rosenthal, K.I.: Quantales and Their Applications, Pitman Research Notes in Mathematics,
vol. 234. Longman Scientific Technical, Longman House, Burnt Mill, Harlow (1990)
56 U. Höhle
27. Shmuely, Z.: The structure of Galois connections. Pac. J. Math. 54, 209–225 (1974)
28. Solovyov, S.A.: Powerset operator foundation for catalg fuzzy set theories. Iran. J. Fuzzy Syst.
8, 1–46 (2011)
29. Stubbe, I.: Categorical structures enriched in a quantaloid tensored and cotensored categories.
Theory Appl. Categ. 16, 283–306 (2006)
30. Zadeh, L.A.: Fuzzy sets. Inf. Control 8, 338–353 (1965)
31. Zadeh, L.A.: The concept of a linguistic variable and its application to approximate reasoning
I. Inf. Sci. 8, 119–249 (1975)
32. Zadeh, L.A.: The concept of a linguistic variable and its application to approximate reasoning
II. Inf. Sci. 8, 301–357 (1975)
33. Zadeh, L.A.: The concept of a linguistic variable and its application to approximate reasoning
III. Inf. Sci. 9, 43–80 (1975)
A Geometric Approach to MV-Algebras
Daniele Mundici
Boolean logic L2 deals with {0, 1}-observables/events. For instance, in the reduction
of the colorability problem to the boolean satisfiability problem, given a graph G
and a palette of k colors, the basic observable “the first vertex of G gets the third color”
is coded by a variable X13 and every composite observable (such as “each vertex of G
D. Mundici (B)
Department of Mathematics and Computer Science, University of Florence,
Florence, Italy
e-mail: [email protected]
gets precisely one among the k available colors”) is coded by a boolean combination
of the Xij in such a way that the k-colorability of G amounts to the satisfiability of
a suitable boolean formula φG in the basic observables Xij . The faithfulness of the
map G → φG is accounted for by saying that G is k-colorable iff φG is satisfiable.
The efficiency of this map follows from its being computable in polynomial time.
Now most observables in physics, as well as most random variables in real life,
are not {0, 1}-valued; measurements are not infinitely precise and their outcome can
only be given by specifying a real number together with an error interval. Since phys-
ical laws are formulated in terms of relations between real-valued quantities rather
than relations between intervals, errors are implicitly taken care of by assuming that
observables are continuous: continuity ensures that small errors in the measurement
of the basic observables have small effects on the evaluation of compound observ-
ables.
For any bounded observable O one may rescale the measurement unit in such
a way that the result of any measurement of O fits into the real interval [0, 1].
Once a [0, 1]-valued logic L is chosen to deal with [0, 1]-valued observables as
boolean logic L2 does for {0, 1}-observables, compound observables are formalized
in L by applying continuous connectives to the output (i.e., the truth-value) of basic
observables: the latter are the “variables” of L.
The functional completeness of boolean logic ensures that all n-variable boolean
functions are obtainable from the variables via the boolean connectives. By contrast, a
brute force counting argument shows that no [0, 1]-valued logic L can be functionally
complete, and hence one must judiciously select the most appropriate connectives
for L.
If, as is often the case, L is defined in terms of a consequence relation and the
L-consequence relation is formulated via Modus Ponens (MP), then inevitably L
must be equipped with an “implication” operation ⇒L : [0, 1]2 → [0, 1]. By the
above discussion, ⇒L must be continuous. If ⇒L is to be (minimally) reminiscent
of boolean implication then the order of premises is irrelevant, and for any two
truth-values x, y ∈ [0, 1], x ⇒L y equals 1 precisely when x ≤ y.
Elementary as they are, these three conditions characterize the implication x →Ł∞
y = min(1, 1 − x + y) of the Łukasiewicz infinite-valued calculus Ł∞ , [19]:
Lemma 1 For any map ⇒ : [0, 1]2 → [0, 1] the following conditions are equiva-
lent:
(i) ⇒ is continuous, x ⇒ (y ⇒ z) = y ⇒ (x ⇒ z), and (1 = x ⇒ y iff
x ≤ y).
(ii) There exists precisely one increasing bijection φ of [0, 1] onto [0, 1] such that
x ⇒ y = φ −1 (φ(x) →Ł∞ φ(y)) for all x, y ∈ [0, 1].
A Geometric Approach to MV-Algebras 59
Proof This is known as the Smets-Magrez Theorem [31]. Some conditions assumed
in [31] are redundant (see Fodor and Roubens [16, Theorem 1.15] and Baczyński
[1]. Also see [2]). Related results had been obtained earlier by Trillas and Valverde
in [33, Theorem 3.4].
Theorem 1 Let I = ([0, 1], 0, 1, ⇒, ¬) be the unit real interval equipped with a
binary operation ⇒ satisfying the three conditions in Lemma 1(i), and with the
derived operation ¬x = x ⇒ 0. Then the algebra I has no nontrivial congruences
and satisfies the following axioms:
(i) 1⇒x=x
(ii) (x ⇒ y) ⇒ ((y ⇒ z) ⇒ (x ⇒ z)) = 1
(iii) ((x ⇒ y) ⇒ y) = ((y ⇒ x) ⇒ x)
(iv) (¬x ⇒ ¬y) ⇒ (y ⇒ x) = 1.
Conversely, every algebra B = ([0, 1], 0, 1, ⇒, ¬) without nontrivial congruences
and satisfying axioms (i)–(iv) is isomorphic to an algebra I = ([0, 1], 0, 1, ⇒ , ¬ ),
where ⇒ satisfies the three conditions in Lemma 1(i), and ¬ x = x ⇒ 0.
Proof The map φ of Lemma 1(ii) is an isomorphism between I and the standard
Wajsberg algebra. The converse statement follows from [11, 3.5] together with the
well known fact that the standard MV-algebra [0, 1]MV = ([0, 1], 0, 1, ¬, ⊕) (where
¬x = 1 − x and x ⊕ y = min(1, x + y)) is not isomorphic to any of its proper sub-
algebras, [11, 7.2.6].
You’re browsing, let us imagine, in a music shop, and come across a box of faded pianola
rolls. One of them bears an illegible title, and you unroll the first foot or two, to see if you
can recognize the work from the pattern of holes in the paper. Are there four beats in the bar,
or only three? Does the piece begin on the tonic, or some other note? Eventually you decide
that the only way of finding out is to buy the roll, take it home, and play it on the pianola.
Within seconds your ears have told you what your eyes were quite unable to make out – that
you are now the proud possessor of a piano arrangement of “Colonel Bogey”.
This result puts an end to the time-honored program of equipping every combi-
natorial manifold with a computable set of invariants sufficient to recognize home-
omorphic objects. The program was successful for curves and surfaces but fails for
higher-dimensional manifolds. To investigate the computability of homeomorphism,
manifolds are replaced by rational polyhedra, and homeomorphisms are replaced by
rational PL-homeomorphisms. In this way—at the very least—the recognizability
A Geometric Approach to MV-Algebras 61
problem becomes recursively enumerable: some Turing machine can effectively enu-
merate all pairs of rationally PL-homeomorphic rational polyhedra.
Are rational PL-maps the only reasonable arrows for rational polyhedra?
simplexes in Δ.
The following result ensures that (i) every rational polyhedron has a regular tri-
angulation, (ii) the rational measure is independent of the chosen triangulation, and
(iii) is invariant under Z-homeomorphisms:
Theorem 3 ([26]) Let P be a rational polyhedron in Rn . We then have:
Proof (a) [28, Lemma 2.1]. The proof relies upon toric desingularization, [14, VI,
8.5], [29, pp. 23, 31].
A Geometric Approach to MV-Algebras 63
(b) [28, Theorem 2.3]. The proof follows from the Morelli-Włodarckzyk solution
of the weak Oda conjecture, [24, 36].
(c) [28, Theorem 1.1]. The proof follows from the De Concini-Procesi theorem
on elimination of points of indeterminacy in toric varieties, [29, p. 39].
Thus Z-homeomorphisms preserve not only the topological properties but also
the rational measure of rational polyhedra.
Theorem 4 (a) Let L (n) and H (n) respectively denote n-dimensional Lebesgue and
Hausdorff measure, [13, 15]. Let P (n) denote the set of all rational polyhedra in Rn .
Then for each n = 1, 2, . . . and d = 0, 1, . . . , the map λd : P (n) → R≥0 has the
following properties, for all P, Q ∈ P (n) :
(i) (Invariance) If P = γ (Q) for some map γ belonging to the n-dimensional affine
group over the integers, then λd (P) = λd (Q).
(ii) (Valuation) λd (∅) = 0, λd (P) = λd (P(d) ), and the restriction of λd to the set
of all rational polyhedra P, Q in Rn having dimension at most d is a valuation:
λd (P) + λd (Q) = λd (P ∪ Q) + λd (P ∩ Q).
(iii) (Conservativity) For any P ∈ P (n) let (P, 0) = {(x, 0) ∈ Rn+1 | x ∈ P}. Then
λd (P) = λd (P, 0).
(iv) (Pyramid) For k = 1, . . . , n, if conv(v0 , . . . , vk ) is a regular k-simplex in Rn
with v0 ∈ Zn then λk (conv(v0 , . . . , vk )) = λk−1 (conv(v1 , . . . , vk ))/k.
(v) (Normalization) Let j = 1, . . . , n. Suppose the set B = {w1 , . . . , wj } ⊆ Zn is
part of a basis of the free
abelian group Zn . Let the closed parallelepiped
PB ⊆
j
R be defined by PB = x ∈ R | x = i=1 γi wi , 0 ≤ γi ≤ 1 . Then λj (PB ) =
n n
1.
(vi) (Proportionality) Let A be an m-dimensional rational affine subspace of Rn for
some m = 0, . . . , n. Then there is a constant κA > 0, only depending on A, such
that λm (Q) = κA · H (m) (Q) for every rational m-simplex Q ⊆ A.
∗∗∗
(b) The six properties above uniquely characterize the rational measures λo , . . . , λn ,
among all maps from P (n) to R≥0 , for each n = 1, 2, . . ..
From the effectiveness of this duality we get the following equivalent reformula-
tion of Markov theorem:
Theorem 8 ([21, 27]) The category of rational polyhedra with Z-maps is dually
equivalent to finitely presented MV-algebras with their homomorphisms. The duality
sends each rational polyhedron P ⊆ [0, 1]n to the MV-algebra M (P) = {f |`P | f ∈
M ([0, 1]n )}, the symbol “ |` ” denoting restriction.
Summing up :
rational polyhedra with rational PL-maps rational polyhedra with integer PL-maps
=
finitely presented -groups finitely presented unital -groups
A Geometric Approach to MV-Algebras 65
Through a further excursion in algebraic topology [18, 34, 35], Cabrer [7] has
recently shown that conditions (i)–(iii) are also sufficient for M (P) to be isomorphic
to an n-generator projective MV-algebra.
Property (iii) is known as the “strong regularity” of P, equivalently, its “anchored-
ness”, [58]. It is equivalent to asking that the affine hull of T contains an integer point
of Rn .
A folklore general result in universal algebra is to the effect that an n-generator
MV-algebra A is projective iff it is isomorphic to a retract R of the free n-generator
MV-algebra M ([0, 1]n ) of McNaughton functions over the unit n-cube [0, 1]n . Stated
otherwise, there is a retraction (idempotent endomorphism) ρ of M ([0, 1]n ) onto
R∼= A. Let us consider the following innocent looking problem:
Note that this number is ≥1 precisely because A is projective. The answer is given
by Theorem 11 below, whose statement is surprisingly simple–although the proof
uses the rational measure of rational polyhedral in a fairly sophisticated way.
66 D. Mundici
For every finitely generated projective MV-algebra C we define the index ι(C) as
ι(C) = sup{number of retractions of M ([0, 1]n ) onto C }, where n is the smallest
number of generators of C, and C ranges over arbitrary retracts of M ([0, 1]n )
isomorphic to C.
An easy verification shows that ι(M ([0, 1]n )) = 1 for all n = 1, 2, . . . .
For every n-generator MV-algebra B, the construction introduced in [27, Corollary
4.18], yields a canonical (Yosida) homeomorphism of the maximal spectral space
μB onto a closed subset M of [0, 1]n . If M = cl(int(M)) then following Kuratowski,
[12, p. 20] we unambiguously say that μB is a closed domain in [0, 1]n .
The proof uses Theorem 10, along with the properties of the rational measure of
the maximal spectral space of A, (Theorem 3).
Last, but not least, another interesting application of the rational measure is in the
classification of orbits of affine subspaces of Rn under the action of the n-dimensional
affine group over the integers. This uses the orbit classification of [9] as a preliminary
step.
As we have seen, MV-algebra theory heavily draws from algebraic topology and
toric geometry. Conversely, the book [27] shows that MV-algebras have many appli-
cations to diverse areas of mathematics. Here is a selection of recent developments
subsequent to the publication of [27]:
Acknowledgments I am grateful to my friend Peter Klement, whose many papers and books
[20, and references therein] taught me the importance of t-norms, and whose kind hospitality at
Magdalena Bildungshaus allowed me to get in contact with a community of mathematicians—of
which he has been for decades one of the focal points—involved in all aspects of fuzzy logic.
References
1. Baczyński, M.: Residual implications revisited. Notes Smets-Magrez Theorem, Fuzzy Sets
Syst. 145, 267–277 (2004)
2. Baczyński, M., Jayaram, B.: (S, N)- and R-implications: a state-of-the-art survey. Fuzzy Sets
Syst. 159, 1836–1859 (2008)
3. Baker, K.A.: Free vector lattices. Can. J. Math. 20, 58–66 (1968)
4. Beynon, W.M.: On rational subdivisions of polyhedra with rational vertices. Can. J. Math. 29,
238–242 (1977)
5. Beynon, W.M.: Applications of duality in the theory of finitely generated lattice-ordered abelian
groups. Can. J. Math. 29, 243–254 (1977)
6. Cabrer, L.M.: Simplicial geometry of unital lattice-ordered abelian groups. Forum Math. 27,
1309–1344 (2015). doi:10.1515/forum-2011-0131
7. Cabrer, L.M.: Rational simplicial geometry and projective lattice-ordered abelian groups.
arXiv:1405.7118v1 [math.RA] 28 May 2014
8. Cabrer, L.M., Mundici, D.: Rational polyhedra and projective lattice-ordered abelian groups
with order unit. Commun. Contemp. Math. 14(3), 1250017 (20 pages) (2012). doi:10.1142/
S0219199712500174
9. Cabrer, L.M., Mundici, D.: Classifying orbits of the affine group over the integers, to appear
in Ergodic Theory Dyn. Syst. doi:10.1017/etds.2015.45
10. Caramello, O., Russo, A.C.: The Morita-equivalence between MV-algebras and lattice-ordered
abelian groups with strong unit. J. Algebra 422, 752–787 (2015)
11. Cignoli, R., D’Ottaviano, I.M.L., Mundici, D.: Algebraic Foundations of Many-Valued Rea-
soning, Trends in Logic, vol. 7. Kluwer, Dordrecht (2000)
12. Engelking, R.: General Topology, Revised and completed edition, Sigma Series in Pure Math-
ematics, vol. 6. Heldermann Verlag, Berlin (1989)
13. Evans, L.C., Gariepy, R.F.: Measure Theory and Fine Properties of Functions. CRC Press,
Boca Raton (1992)
14. Ewald, G.: Combinatorial Convexity and Algebraic Geometry. Springer, New York (1996)
15. Federer, H.: Geometric Measure Theory. Springer, New York (1969)
16. Fodor, J., Roubens, M.: Fuzzy Preference Modeling and Multicriteria Decision Support. Kluwer
Academic Publishers, Dordrecht (1994)
17. Glass, A.M.W., Madden, J.J.: The word problem versus the isomorphism problem. J. Lond.
Math. Soc. (2), 30, 53–61 (1984)
18. Hatcher, A., Algebraic Topology. Cambridge University Press (2001)
19. Łukasiewicz, J., Tarski, A.: Untersuchungen über den Aussagenkalkül, Comptes Rendus des
séances de la Société des Sciences et des Lettres de Varsovie, Classe III, 23, pp. 30–50 (1930).
English translation: Investigations into the Sentential Calculus, Chapter IV. In: A. Tarski, Logic,
Semantics, Metamathematics. Clarendon Press, Oxford (1956). Reprinted: Hackett, Indianapo-
lis (1983)
68 D. Mundici
20. Klement, E.P., Mesiar, R., Pap, E.: Triangular Norms. Kluwer, Dordrecht (2000)
21. Marra, V., Spada, L.: Duality, projectivity, and unification in Łukasiewicz logic and MV-
algebras. Ann. Pure Appl. Logic 164, 192–210 (2013)
22. Marra, V., Spada, L.: Two isomorphism criteria for directed colimits. arXiv:1312.0432v1, 2
Dec 2013
23. Menu, J., Pavelka, J.: A note on tensor products on the unit interval. Comment. Math. Univ.
Carol. 17, 71–83 (1976)
24. Morelli, R.: The birational geometry of toric varieties. J. Algebraic Geom. 5, 751–782 (1996)
25. Mundici, D.: Interpretation of AF C ∗ -algebras in Łukasiewicz sentential calculus. J. Funct.
Anal. 65, 15–63 (1986)
26. Mundici, D.: The Haar theorem for lattice-ordered abelian groups with order-unit. Discret.
Contin. Dyn. Syst. 21, 537–549 (2008)
27. Mundici, D.: Advanced Łukasiewicz calculus and MV-algebras. Trends in Logic, vol. 35.
Springer, Berlin (2011)
28. Mundici, D.: Invariant measure under the affine group over Z, combinatorics. Probab. Comput.
23, 248–268 (2014)
29. Oda, T.: Convex bodies and algebraic geometry. Convex Bodies and Algebraic Geometry.
Springer, New York (1988)
30. Shtan’ko, M.A.: Markov’s theorem and algorithmically non-recognizable combinatorial man-
ifolds, Izvestiya RAN. Ser. Math. 68, 207–224 (2004)
31. Smets, P., Magrez, P.: Implication in fuzzy logic. Int. J. Approx. Reason. 1, 327–347 (1987)
32. Stallings, J.R.: Lectures on Polyhedral Topology. Tata Institute of Fundamental Research,
Mumbay (1967)
33. Trillas, E., Valverde, L.: On implication and indistinguishability in the setting of fuzzy logic.
In: Kacprzyk, J., Yager, R.R. (eds.) Management Decision Support Systems using Fuzzy Sets
and Possibility Theory, pp. 198–212. Technical University Rhineland, Cologne (1985)
34. Whitehead, J.H.C.: On subdivisions of complexes. Math. Proc. Camb. Philos. Soc. 31, 69–75
(1935)
35. Whitehead, J.H.C.: Simplicial spaces, nuclei and m-groups. Proc. Lond. Math. Soc. 45, 243–
327 (1939)
36. Włodarczyk, J.: Decompositions of birational toric maps in blow-ups and blow-downs. Trans.
Am. Math. Soc. 349, 373–411 (1997)
37. Belluce, L.P., Di Nola, A., Ferraioli, A.R.: MV-semirings and their sheaf representations. Order
30, 165–179 (2013). doi:10.1007/s11083-011-9234-0
38. Belluce, L.P., Di Nola, A., Ferraioli, A.R.: Ideals of MV-semirings and MV-algebras. In: Litvi-
nov, G.L., Sergeev, S.N. (eds.) Tropical and Idempotent Mathematics and Applications. Con-
temporary Mathematics, vol. 616, pp. 59–76 (2014)
39. Belluce, L.P., Di Nola, A., Lenzi, G.: On generalizing the Nullstellensatz for MV-algebras. J.
Logic Comput. 25, 701–707 (2015). doi:10.1093/logcom/exu042
40. Belluce, L.P., Di Nola, A., Lenzi, G.: Algebraic geometry for MV-algebras. J. Symb. Logic
79(4), 1061–1091 (2014)
41. Busaniche, M., Mundici, D.: Bouligand-Severi tangents in MV-algebras. Revista Matemática
Iberoamericana 30(1), 191–201 (2014)
42. Cabrer, L.M.: Bouligand-Severi k-tangents and strongly semisimple MV-algebras. J. Algebra
404, 271–283 (2014)
43. Cabrer, L.M.: Exact Unification. arXiv:1410.5583v1 [math.LO] 21 Oct 2014
44. Cabrer, L.M., Mundici, D.: Interval MV-algebras and generalizations. Int. J. Approx. Reason.
55, 1623–1642 (2014)
A Geometric Approach to MV-Algebras 69
45. Cabrer, L.M., Mundici, D.M.: Severi-Bouligand tangents, Frenet frames and Riesz spaces.
Adv. Appl. Math. 64, 1–20 (2015)
46. Cignoli, R., Marra, V.: Stone duality for real-valued multisets. Forum Math. 24, 1317–1331
(2012)
47. Di Nola, A., Ferraioli, A.R., Lenzi, G.: Algebraically closed MV-algebras and their sheaf
representation. Ann. Pure Appl. Logic 164, 349–355 (2013)
48. Di Nola, A., Leustean, I.: Łukasiewicz logic and Riesz spaces. Soft Comput. 18, 2349–2363
(2014). doi:10.1007/s00500-014-1348-z
49. Di Nola, A., Russo, C.: Semiring and semimodule issues in MV-algebras. Comm. Algebra 41,
1017–1048 (2013)
50. Di Nola, A., Russo, C.: MV-semirings as a new perspective on mathematical fuzzy set theory:
a survey. arXiv:1102.1999v4, 14 Nov 2014
51. Dvurečenskij, A.: Quantum structures versus partially ordered groups. Int. J. Theor. Phys.
doi:10.1007/s10773-014-2479-9
52. Fedel, M., Keimel, K., Montagna, F., Roth, W.: Imprecise probabilities, bets and functional
analytic methods in Łukasiewicz logic. Forum Math. 25, 405–441 (2013). doi:10.1515/FORM.
2011.123
53. Flaminio, T., Godo, L., Kroupa, T.: Belief functions on MV-algebras of fuzzy sets: an overview.
In: Torra, V., Narukawa, Y., Sugeno, M. (eds.) Non-Additive Measures, Studies in Fuzziness
and Soft Computing, vol. 310, pp. 173–200. Springer (2014)
54. Flaminio, T., Godo, L., Hosni, H.: Coherence in the aggregate: a betting method for belief
functions on many-valued events. Int. J. Approx. Reason. 58, 71–86 (2015). doi:10.1016/j.ijar.
2015.01.001
55. Gavalec, M., Nemcová, Z., Sergeev, S.: Tropical linear algebra with the Łukasiewicz T-norm.
Fuzzy Sets Syst. 276, 131–148 (2015). doi:10.1016/j.fss.2014.11.008
56. Gehrke, M., van Gool, S.J., Marra, V.: Sheaf representations of MV-algebras and lattice-ordered
abelian groups via duality. J. Algebra 417, 290–332 (2014)
57. Hansoul, G., Teheux, B.: Extending Łukasiewicz logics with a modality: algebraic approach
to relational semantics. Stud. Logica 101, 505–545 (2013)
58. Jeřábek, E.E.: The complexity of admissible rules of Łukasiewicz logic. J. Logic Comput. 23,
693–705 (2013)
59. Kala, V.: Lattice-ordered abelian groups finitely generated as semirings, to appear in the J.
Commut. Algebra. arXiv:1502.01651
60. Kroupa, T.: Core of coalition games on MV-algebras. J. Logic Comput. 21, 479–492 (2011)
61. Kroupa, T.: A generalized Möbius transform of games on MV-algebras and its application to a
Cimmino-type algorithm for the core, optimization theory and related topics. Contemp. Math.
568, 139–158 (2012)
62. Kroupa, T.: States in Łukasiewicz logic correspond to probabilities of rational polyhedra. Int.
J. Approx. Reason. 53, 435–446 (2012)
63. Kroupa, T., Majer, O.: Optimal strategic reasoning with McNaughton functions. Int. J. Approx.
Reason. 55, 1458–1468 (2014)
64. Kroupa, T., Teheux, B.: Modal extension of Łukasiewicz logic for reasoning about coalitional
power. arXiv:1411.6452v1, 24 Nov 2014
65. Lawson, M.V., Scott, P.: AF inverse monoids and the structure of countable MV-algebras.
arXiv:1408.1231v2, 13 Oct 2014
66. Marchioni, E., Woolridge, M.: Łukasiewicz games, In: Huhns (eds.) Proceedings of the 13th
International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2014),
Paris, France, pp. 837–844 (2014)
67. Marra, V., Spada, L.: The dual adjunction between MV-algebras and Tychonoff spaces. Studia
Logica, special issue in memoriam Leo Esakia 100, 253–278 (2012)
68. Mundici, D.: The differential semantics of Łukasiewicz syntactic consequence, Chapter 7. In:
Montagna, F. (ed.) Petr Hájek on Mathematical Fuzzy Logic, Outstanding Contributions, vol. 6,
pp. 143–157. Springer International Publishing Switzerland (2015). doi:10.1007/978-3-319-
06233-4
70 D. Mundici
69. Mundici, D., Pedrini, A.: The Euler characteristic and valuations on MV-algebras. Math. Slo-
vaca 64, 563–570 (2014). doi:10.2478/s12175-014-0226-6
70. Mundici, D., Picardi, C.: Faulty sets of Boolean formulas and Łukasiewicz logic.J. Logic
Comput., Adv. Access published Dec 8 (2014). doi:10.1093/logcom/exu073
71. Nganou, J.B.: Profinite MV-algebras and multisets. Order, doi:10.1007/s11083-014-9345-5
72. Pedrini, A.: The Euler characteristic of a polyhedron as a valuation on its coordinate vector
lattice. arXiv:1209.3248v1, 14 Sep 2012
73. Pulmannová, S.: Representations of MV-algebras by Hilbert-space effects. Int. J. Theoret. Phys.
52, 2163–2170 (2013)
74. Pulmannová, S., Vinceková, E.: MV-pairs and state operators. Fuzzy Sets Syst. 260, 62–76
(2015)
75. Riečan, B.: Variation on a Poincaré theorem. Fuzzy Sets Syst. 232, 39–45 (2013)
76. Xie, Y., Li, Y., Yang, A.: The pasting construction for effect algebras. Math. Slovaca 64, 1051–
1074 (2014). doi:10.2478/s12175-014-0258-y
77. Shang, Y., Lu, X., Lu, R.: Computing power of turing machines in the framework of unsharp
quantum logic. Theoret. Comput. Sci. 598, 2–14 (2015). doi:10.1016/j.tcs.2014.12.015
78. Weber, H.: On topological MV-algebras and topological -groups. Topology Appl. 159, 3392–
3395 (2012)
On the Equational Characterization
of Continuous t-Norms
1 Introduction
A core constituent of fuzzy logic in narrow sense [15], from where the discipline
of Mathematical fuzzy logic has been intensively developed in the last two decades
[5, 10, 11, 14], is the family of residuated many-valued logical calculi with truth
values on the real unit interval [0, 1], and with min, max, a (left-continuous) t-norm
∗ and its residuum →∗ as basic truth functions, interpreting respectively the lattice
meet and joint connectives, a strong conjunction and its adjoint implication. These
logics are also known as t-norm based fuzzy logics.
In this framework, Hájek introduced in [11, 12] the so-called Basic Fuzzy logic,
BL for short, to capture the 1-tautologies common to all many-valued calculi in [0, 1]
defined by a continuous t-norm and its residuum, as proved in [4]. Thus, BL is in
fact a common sublogic of three well-known fuzzy logics: Łukasiewicz’s infinitely-
valued logic, Gödel’s infinitely-valued logic and Product logic, corresponding to the
three basic t-norms, i.e. Łukasiewicz, minimum and product t-norms.
2 Preliminaries
We start with some elementary and well-known definitions and results about t-norms,
just for the sake of the paper being self-contained. A t-norm is a binary operation
on [0, 1] that is commutative, associative, non-decreasing (monotone) in both vari-
ables and that have 0 as absorbent and 1 as unity. A t-norm is continuous if it is
continuous as real function of two variables. The three basic continuous t-norms are
minimum (min), product (the usual product of reals, ) and Łukasiewicz (denoted
∗ L and defined by x ∗ L y = max(0, x + y − 1)). The greatest and smallest continu-
ous t-norms are the minimum and the Łukasiewicz t-norms respectively, i.e., for all
continuous t-norm ∗ and for all x, y ∈ [0, 1], we have x ∗ L y ≤ x ∗ y ≤ min(x, y).
The following are some basic results on continuous t-norms, see e.g. [13] for
further details and results:
• Any continuous t-norm is an ordinal sum of (possibly infinitely-many) copies1 of
the minimum, product and Łukasiewicz t-norms.
• A t-norm ∗ is continuous if and only if it satisfies the divisibility condition: for all
x, y ∈ [0, 1] with x > y there exists z ∈ [0, 1] such that y = x ∗ z.
1 If
we allow for at most a countable number of degenerated components with a single idempotent
element.
On the Equational Characterization of Continuous t-Norms 73
From this definition, one can check the following facts and properties:
(i) ≤ as defined above is indeed an ordering and 1 is maximal
(ii) ∗ is associative
(iii) ∗ is monotonically increasing w.r.t. ≤: x ≤ y implies x ∗ z ≤ y ∗ z
(iv) (∗, →) is an adjoint pair: x → y ≤ z iff x ∗ y ≤ z
(v) x ∗ (x → y) ≤ y
(vi) 1→x =x
Furthermore, regarding the classes of basic, Wajsberg and cancellative hoops, the
following relationship among them hold: every Wajsberg hoop is basic and each
cancellative hoop is Wajsberg (hence basic as well). Note that hoops have an greatest
element, but they may lack a least element. A hoop A = (A, ∗, →, 1) is called
bounded if (A, ≤) has a least element. Then it turns out that cancellative hoops
coincide with unbounded Wajsberg hoops, while bounded Wajsberg hoops coincide
with MV-algebras.
Prominent examples of Wajsberg hoops are the following:
• 2, defined on a set of two elements {a, 1}, that is in fact a two-element Boolean
algebra.
• Ł = ([0, 1], ∗Ł , →Ł , 1), the (bounded) Wajsberg hoop defined over [0, 1] by the
Łukasiewicz t-norm and its residuum.
• C = ((0, 1], , → , 1), the (unbounded) cancellative hoop defined over (0, 1] by
the product t-norm and its residuum.
A similar construction to the ordinal sums for t-norms and BL-chains can be also
defined for hoops.
Definition 2 Let (I, ≤) be a totally ordered set, and for all i ∈ I let Ai = (Ai , ∗i ,
→i , 1) be a hoop such thatAi ∩ A j = {1}
for every j
= i. Then the ordinal sum of
this family is the structure i∈I Ai = ( i∈I Ai , ∗, →, 1), where the operations are
defined as follows:
On the Equational Characterization of Continuous t-Norms 75
⎧
⎪
⎨x ∗i y if x, y ∈ Ai ,
x ∗ y := x if x ∈ Ai \{1}, y ∈ A j , and i < j,
⎪
⎩
y if y ∈ Ai \{1}, x ∈ A j , and i < j.
⎧
⎪
⎨x →i y if x, y ∈ Ai ,
x → y := y if x ∈ Ai , y ∈ A j , and i > j,
⎪
⎩
1 otherwise.
Notice that in an ordinal sum of hoops, the greatest element is common to all
the hoops and to the ordinal sum as well. For instance, the product standard chain
[0, 1]Π = ([0, 1], , → , 0, 1), viewed as a bounded hoop, can be decomposed as
the ordinal sum of 2 and C, i.e. [0, 1]Π = 2 ⊕ C. Actually, in [1] the authors prove
that any BL-chain, viewed as a bounded basic hoop, can be decomposed as an ordinal
sum of linearly ordered Wajsberg hoops. Restricted to standard BL-chains, this result
amounts to say that any standard BL-chain, as a hoop, can be decomposed as an
ordinal sum of (suitably arranged) copies of the Wajsberg hoops 2, C and Ł. In this
way, besides viewing the standard product algebra as the ordinal sum of 2 plus C, we
can understand the standard Gödel chain as being isomorphic to the ordinal sum of
continuum many of copies of 2 (one for each element of a Gödel component), while
the standard Łukasiewicz chain [0, 1]Ł = ([0, 1], ∗Ł , →Ł , 0, 1) coincides with Ł as
hoop.
As already mentioned, regarding the ordinal sums of hoops just defined, one can
notice that the main difference with respect to the ordinal sum of BL-chains is that
the top elements of the components are identified with the top element of the ordinal
sum. Therefore, for instance, when considering the decomposition of a BL-chain as
an ordinal sum of Wajsberg hoops (2, C or Ł in the case of standard BL-chains),
the top of any component is the top of the ordinal sum, and given two consecutive
components, the bottom (if it exists) of the second component is not in the first
component. Notice also that the decomposition of any standard BL-chain as ordinal
sum of hoops has always a first component that is either 2 (if it is an SBL-chain2 ) or
Ł otherwise.
Finally recall that a set of equations determine a variety (or equational class) of
algebraic structures. By inspecting their definition, it is clear that the classes of hoops,
basic hoops and Wajsberg hoops are indeed varieties. The class of cancellative hoops
turns out to be a variety as well, since the condition used in Definition 2 can be shown
to be equivalent to the validity of the equation x = y → (y ∗ x).
Thus it is interesting to know how the varieties generated by the main three
prominent Wajsberg hoops, 2, C and Ł, are related to each other. To do so we consider
the following three terms:
• eŁ (x) = (x → x 2 ) ∨ ((x → x 3 ) → x 2 )
• eC (x) = (x → x 2 )
• e2 (x) = (x → x 3 ) → x 2
where x n stands for x∗ . n. . ∗x. An easy computation shows that the equation
eŁ (x) = 1 is valid in 2 and C and not in Ł, eC (x) = 1 is a valid equation in 2
and neither in C nor in Ł, and finally, the equation e2 (x) = 1 is valid in C and neither
in 2 nor in Ł.
Therefore, it is clear that C and Ł do not belong to variety of hoops V ar (2)
generated by 2, while 2 and Ł do not belong to the variety V ar (C) generated by C.
On the other hand, it is easy to check that both 2 and C belong to the variety of
hoops V ar (Ł) generated by Ł, since 2 is a subhoop of Ł and C is a subhoop of the
well-known Chang algebra, which is an MV-algebra, and thus belongs to V ar (Ł).
Summarising, we have
2, C ∈ V ar (Ł), C, Ł ∈
/ V ar (2), 2, Ł ∈
/ V ar (C),
Let us denote by [0, 1]∗ either the standard BL-chain, or its corresponding hoop when
no confusion exists, defined over [0, 1] by a continuous t-norm ∗ and its residuum →∗ .
The goal of this section is to characterize those continuous t-norms ∗ that admit an
equational characterization in the sense that the variety V ar ([0, 1]∗ ) is uniquely
generated by [0, 1]∗ , that is, for any other standard BL-chain [0, 1]◦ with ◦ being a
t-norm non isomorphic to ∗, V ar ([0, 1]∗ )
= V ar ([0, 1]◦ ). In such a case, we can
say that the set of equations defining V ar ([0, 1]∗ ) characterize ∗.
Actually, generalizing the well-known Mostert and Shields representation theo-
rem of continuous t-norms, Hájek showed in [12] that every standard BL-chain [0, 1]∗
can be isomorphically decomposed as an ordinal sum (over a bounded ordered index
set) of Gödel, Łukasiewicz and Product BL-chain components. However, as hoops,
each Gödel BL-chain is isomorphic to an ordinal sum of (possibly infinite) copies
of 2, while Łukasiewicz and Product components on a closed real interval are iso-
morphic to Ł and Π = 2 ⊕ C respectively. Then any standard BL-chain, as a hoop,
will be isomorphic to a (possibly infinite) ordinal sum of Wajsberg hoops Ł, C and 2.
The following definition and proposition are particular cases of more general
definitions and results given in [8], and therefore here we only state them without
proofs.
Definition 3 (i) We will denote by Fin the set of ordinal sums (as hoops) of
finitely-many copies of Ł, 2 and C, and whose first component is either Ł or 2.
On the Equational Characterization of Continuous t-Norms 77
As shown next, the set of Fin([0, 1]∗ ) of BL-chains univocally determines the
variety V ([0, 1]∗ ) induced by the t-norm ∗.
Proposition 1 (c.f. Theorem 3.9 of [8]) Let [0, 1]∗ , [0, 1]◦ be two standard
BL-chains. Then V ar ([0, 1]∗ ) ⊆ V ar ([0, 1]◦ ) if, and only if, Fin([0, 1]∗ ) ⊆
Fin([0, 1]◦ ). Hence, V ar ([0, 1]∗ ) = V ar ([0, 1]◦ ) if, and only if, Fin([0, 1]∗ ) =
Fin([0, 1]◦ ).
Lemma 1 If ∗ and ◦ are two continuous t-norms such that both [0, 1]∗ and [0, 1]◦
have a finite ordinal sum decomposition in terms of BL-components, then ∗ ≡ ◦ if,
and only if, they have the same decomposition,
From the above proposition and lemma, the characterization of the equationally
definable standard BL-chains follows.
Proof First we prove that for a continuous t-norm ∗ whose decomposition as ordinal
has a finite number of components, V ar ([0, 1]◦ ) = V ar ([0, 1]∗ ) if and only if ◦ ≡
∗ (the components of their decomposition as ordinal sums are the same). By the
previous proposition, this is equivalent to prove that if ◦ is a continuous t-norm such
that ◦
≡ ∗, then Fin(◦)
= Fin(∗). We prove this claim by cases, adapting a more
general proof in [8]:
78 F. Esteva and L. Godo
• If the decomposition of [0, 1]◦ has more components than the decomposition of
[0, 1]∗ then it is evident that there exist BL-chains in Fin(◦) that are not in Fin(∗).
For example let ◦ be a continuous t-norm obtained as Ł ⊕ G, and let ∗ be a contin-
uous t-norm obtained as Ł ⊕ Π ⊕ G. Then it is clear that 2 ⊕ C ∈ Fin([0, 1]∗ )
but 2 ⊕ C ∈ / Fin([0, 1]◦ ).
• An analogous reasoning proves the statement when the decomposition of [0, 1]◦
has more components than the decomposition of [0, 1]∗ .
• If the number of components of the decomposition [0, 1]∗ and [0, 1]◦ is the same,
then they need to differ in some component and thus we can find BL-chains that are
in Fin(∗) and not in Fin(◦) and viceversa. For example, let ◦ be the continuous
t-norm obtained as Ł ⊕ Ł ⊕ G and let ∗ be the continuous t-norm obtained as
Ł ⊕ Π ⊕ G. Then we have that 2 ⊕ C ∈ Fin([0, 1]∗ ) but 2 ⊕ C ∈ / Fin([0, 1]◦ ),
while Ł ⊕ Ł ∈ Fin([0, 1]◦ ) and Ł ⊕ Ł ∈ / Fin([0, 1]∗ ).
In the case the decomposition of [0, 1]∗ has infinitely many components, it is easy
to prove that there exist infinitely-many continuous t-norms ◦ such that ∗
≡ ◦ but
Fin([0, 1]∗ ) = Fin([0, 1]◦ ). We do not formally prove the statement but we give
some examples:
After identifying in the last section which t-norms are equationally definable, in this
section we show how to find an effective set of equations for each of them, again rely-
ing in results from [8]. It has to be remarked that the equations actually characterise
the variety generated by the standard algebra [0, 1]∗ for a given equationally definable
t-norm ∗, and hence the equations will involve not only the operation corresponding
to the t-norm but the operation corresponding to its residuum as well.
First we introduce an equation that will have a key role in axiomatizing the varieties
V ([0, 1]∗ ).
Definition 4 Let A be a BL-chain whose decomposition
as ordinal sum of Wajsberg
hoops has finitely many components, i.e., A = i=0,1,...,n Ai . Then we will denote
On the Equational Characterization of Continuous t-Norms 79
where eiA (x) = eŁ (x) if Ai = Ł, eiA (x) = eC (x) if Ai = C, and eiA (x) = e2 (x) if
Ai = 2.
Notation convention: for the sake of a simpler notation, from now on we will use
Fin(∗) and V ar (∗) to respectively denote Fin([0, 1]∗ ) and V ar ([0, 1]∗ ).
And from this result, we can prove the following equational characterization as a
particular case of a more general result in [8, Theorem 5.2].
Note that AX (∗) may contain an infinite number of equations. However we can
do it better. Actually, one can show that one needs only a finite subset of AX (∗) to
axiomatize V ar (∗). Indeed, it is only necessary to keep from Fin(∗⊥ ) only those BL-
chains that are minimal in the following sense. Define an ordering relation in the set
Fin as follows: for all A, B ∈ Fin, define A B if A ∈ V ar (B). And denote by
Min(∗⊥ ) the minimal elements of Fin(∗⊥ ) with respect to the order . It is then
clear that it is enough to consider the set of equations corresponding to the BL-chains
of Min(∗⊥ ), and moreover, it can be shown that Min(∗⊥ ) is always finite, and hence
that V ar (∗) can be axiomatized by a finite set of equations.
2
L
2 2 2 C 2 L
2 C 2 2 C C 2 C L
in Fin(∗) is to iteratively checking ordinal sums from Fin of increasing length (1,
2, 3, etc.). At a given step i, a given current ordinal sum B of length i is checked
whether there is another non-discarded ordinal sum B of length ≤i such that B B.
If so, the current ordinal sum is discarded for further analysis at step i + 1. Otherwise
B ∈ Min(∗⊥ ) only if B is checked to not belong to Fin(∗). At next step i + 1, only
those non-discarded ordinal sums at step i are expanded with a new component, and
the procedure starts over. This iterative procedure ends in a finite number of steps.
We exemplify this procedure with two examples.
2
L
2 2 2 C 2 L
2 C 2 2 C C 2 C L
2 C L 2 2 C L C 2 C L L
contributions to the field of fuzzy logic, but also for his incredible task of fostering
the exchange of ideas and the collaboration among researchers in our community,
mainly (but not only) through his Linz Seminars on Fuzzy Set Theory since 1979.
Congratulations Peter!
Acknowledgments The authors have been partially supported by the Spanish MINECO project
EdeTRI TIN2012-39348-C02-01.
References
1. Aglianó, P., Montagna, F.: Varieties of BL-algebras I: general properties. J. Pure Appl. Algebra
181, 105–129 (2003)
2. Aglianó, P., Ferreirim, I.M.A., Montagna, F.: Basic hoops: an algebraic study of continuous
t-norms. Stud. Logica 87(1), 73–98 (2007)
3. Blok, W.J., Ferreirim, I.M.A.: On the structure of hoops. Algebra Univers. 43, 233–257 (2000)
4. Cignoli, R., Esteva, F., Godo, L., Torrens, A.: Basic logic is the logic of continuous t-norms
and their residua. Soft Comput. 4, 106–112 (2000)
5. Cintula, P., Hájek, P., Noguera, C. (eds.): Handbook of Mathematical Fuzzy Logic (in 2 vol-
umes), Studies in Logic, Mathematical Logic and Foundations, vols. 37 and 38. College Pub-
lications, London (2011)
6. Di Nola, A., Esteva, F., Garcia, P., Godo, L., Sessa, S.: Subvarieties of BL-algebras generated
by single-component chains. Arch. Math. Logic 41, 673–685 (2002)
7. Di Nola, A., Lettieri, A.: Equational characterization of all varieties of MV-algebras. J. Algebra
221, 463–474 (1999)
8. Esteva, F., Godo, L., Montagna, F.: Equational characterization of the subvarieties of BL gen-
erated by t-norm algebras. Stud. Logica 76(2), 161–200 (2004)
82 F. Esteva and L. Godo
9. Ferreirim, I.M.A.: On varieties and quasivarieties of hoops and their reducts. Thesis, University
of Illinois at Chicago (1992)
10. Gottwald, S.: A Traitise on Multiple-valued Logics. Studies in Logic and Computation.
Research Studies Press, Baldock (2001)
11. Hájek, P.: Metamathematics of Fuzzy Logic. Trends in Logic, vol. 4. Studia Logica Library,
Kluwer, Dordercht (1998)
12. Hájek, P.: Basic logic and BL-algebras. Soft Comput. 2, 124–128 (1998)
13. Klement, P., Mesiar, R., Pap, E.: Triangular Norms. Trends in Logic, vol. 8, Studia Logica
Library. Kluwer, Dordrecht (2000)
14. Novák, V., Perfilieva, I., Močkoř, J.: Mathematical Principles of Fuzzy Logic. Kluwer, Boston
(1999)
15. Zadeh, L.A.: Preface. In: Marks-II, R.J. (ed.) Fuzzy Logic Technology and Applications. IEEE
Technical Activities Board (1994)
The Semantics of Fuzzy Logics:
Two Approaches to Finite Tomonoids
Abstract Fuzzy logic generalises classical logic; in addition to the latter’s truth
values “false” and “true”, the former allows also intermediary truth degrees. The
conjunction is, accordingly, interpreted by an operation acting on a chain, making
the set of truth degrees into a totally ordered monoid. We present in this chapter two
different ways of investigating this type of algebras. We restrict to the finite case.
1 Introduction
The idea on which fuzzy logic is built is best understood in relationship with the
canonical way in which reasoning is formalised: with classical propositional logic.
The latter is the logic of “false” and “true” and propositions are evaluated in this
two-element set. Among the connectives we find the logical “and”, “or”, and “not”,
interpreted in the well-known way. In addition to the two classical truth values,
fuzzy logic uses intermediary degrees of truth [14]. Usually, “false” and “true” are
identified with the real numbers 0 and 1, respectively; the remaining real numbers
serve as further truth degrees and may express relative tendencies.
The difficulty of this approach is that there is no straightforward way to tell how
the logical connectives should be interpreted. We rather have to make a decision, for
instance, about the interpretation of the conjunction. Different interpretations will
in general lead to different logics. As a consequence of this situation, fuzzy logic
has in fact emerged as a family of many-valued logics, each of which may bring its
own challenges. According to a common agreement, the binary operation on the real
T. Vetterlein (B)
Department of Knowledge-Based Mathematical Systems,
Johannes Kepler University, Linz, Austria
e-mail: [email protected]
M. Petrík
Department of Mathematics Faculty of Engineering, Czech University
of Life Sciences, Prague, Czech Republic
e-mail: [email protected]
© Springer International Publishing Switzerland 2016 83
S. Saminger-Platz and R. Mesiar (eds.), On Logical, Algebraic, and Probabilistic
Aspects of Fuzzy Set Theory, Studies in Fuzziness and Soft Computing 336,
DOI 10.1007/978-3-319-28808-6_6
84 T. Vetterlein and M. Petrík
unit interval taken for this purpose should be a t-norm: associative, commutative,
possessing 1 as an identity, and monotone in each argument. If the set of truth values
is not taken to be an uncountable set but for instance a finite chain, the operation
should still fulfil the same algebraic conditions. It is natural to assume that the chain
of truth degrees is a negative, commutative totally ordered monoid.
The present work is to be seen among the efforts of classifying these algebraic
structures. A considerable amount of work has been done on this topic during recent
years. In line with the given background, residuation has usually been additionally
assumed and MTL-algebras were considered [8, 22]. Our paper [29] is devoted to
MTL-algebras based on the real unit interval. For residuated lattices in general, see [3,
11]. MTL-chains fulfilling certain additional properties were considered in several
works as well. For instance, MTL-chains with the weak cancellation property are
the topic of [21] as well as [15]. Idempotent residuated chains are studied in [4].
The paper [16] deals with finite MTL-chains and their relationship to Abelian totally
ordered groups.
The present chapter is devoted to the finite case. The tomonoids considered are
assumed to be either finite, or at least to be finitely generated. We present two different
approaches based on our work [24, 28], respectively. We provide an introduction to
the main ideas; further details can be found in the indicated papers. We note that in
[27] two further approaches to the structures under consideration are offered.
We investigate in this chapter the following structures [5, 9, 12, 13, 23, 30].
Definition 1 An algebra (L; +, 0) is a monoid if (i) + is an associative binary
operation and (ii) 0 is an identity for +. A monoid (L; +, 0) is called commutative
if + is commutative.
A partial order ≤ on a monoid L is called compatible if, for any a, b, c, d ∈ L,
a ≤ b and c ≤ d imply a + c ≤ b + d. A structure (L; ≤, +, 0) such that (L; +, 0)
is a monoid and ≤ is a compatible total order on L is called a totally ordered monoid,
or tomonoid for short.
Moreover, a tomonoid (L; ≤, +, 0) is called commutative if so is its monoidal
reduct. L is called positive if 0 is the bottom element. L is called finitely generated
if L, as a monoid, is generated by finitely many elements.
For instance, let [0,1] be the real unit interval and let ⊕ : [0,1]2 → [0,1] be a
t-conorm, that is, associative, commutative, behaving neutrally w.r.t. 0, and monotone
in each argument [20]. Then ([0,1] ; ≤, ⊕, 0) is commutative, positive tomonoid.
Similarly, let L ⊂ [0,1] be a finite subset of [0,1] containing 0 and 1 and let ⊕ : L 2 →
L be a discrete t-conorm [7]. This is equivalent to say that (L; ≤, ⊕, 0) is a finite,
commutative, positive tomonoid.
We have written tomonoids in the additive way; alternatively, we may deal with
the dual structures. In this case, the order is reversed and the multiplicative notation is
The Semantics of Fuzzy Logics: Two Approaches to Finite Tomonoids 85
The first part of the present chapter is devoted to finitely generated, positive, com-
mutative tomonoids; we will write “fg.p.c. tomonoids” for short. In particular, the
finite, positive, commutative tomonoids, which correspond to the so-called discrete
t-norms [7], are included in the discussion.
We investigate a particular way of representing such tomonoids. We are guided
by the following ideas. First of all, any monoid can be identified with a congruence
on a free monoid. Similarly, we may describe tomonoids by what we call monomial
preorders. Second, the order of totally ordered Abelian groups is characterised by
their cone. We introduce for tomonoids an analogous object; the so-called direction
86 T. Vetterlein and M. Petrík
cones are certain subsets of Zn that describe tomonoids and each fg.p.c. tomonoid is
a quotient of a tomonoid arising in this way.
The results of this section originate from the paper [28], to which we refer for
further details. A continuation of this work, in which the finite case is especially
emphasised, can be found in [26].
Free commutative monoids play a central role in what follows. We identify the
free commutative monoid over n ≥ 1 elements with Nn . The addition is defined
pointwise and the identity is 0̄ = (0, . . . , 0), the n-tuple consisting of zeros only. We
also define u i = (0, . . . , 0, 1, 0, . . . , 0), “1” being at the i-th position. Clearly then,
U (Nn ) = {u 1 , . . . , u n } is a set of generators of Nn .
We endow Nn with the componentwise natural order. That is, for (a1 , . . . , an ),
(b1 , . . . , bn ) ∈ Nn , we put
Clearly, is a lattice order on (Nn ; +, 0̄) and is compatible with the addition.
Fg.p.c. tomonoids can be conveniently described on the basis of the free commu-
tative monoid Nn as follows.
We call a reflexive and transitive binary relation on a set A a preorder. We write
a ≺ b if a b but not b a. Any preorder gives rise to an equivalence relation ≈,
called its symmetrisation, where a ≈ b if a b and b a. We call the equivalence
class of some a w.r.t. ≈ a -class and we denote it by a . The preorder induces
on the quotient A a partial order, which we denote by again.
We call a preorder total if a b or b a for any pair a, b ∈ A. Moreover, we
call positive if 0 ≺ a for all a = 0. Finally, if is defined on a monoid (L; +, 0),
we call compatible if a b implies a + c b + c.
In computational mathematics, the notion “monomial ordering” refers to compat-
ible, positive, total orders on Nn ; see, e.g., [6]. Analogously, we call a preorder on
Nn monomial if is compatible, positive, and total. The significance of monomial
preorders becomes clear in the following proposition.
The positive cones of totally ordered Abelian groups give rise to typical examples
of fg.p.c. tomonoids. We will discuss these examples in some detail because they
motivate our way of representing fg.p.c. tomonoids in general.
Definition 3 Let (G; ≤, +, 0) be a totally ordered Abelian group and let G + = {g ∈
G : g ≥ 0} be its positive cone. Assume that G is generated by g1 , . . . , gn ∈ G + \{0},
where n ≥ 1. Let L be the submonoid of G generated by g1 , . . . , gn and let L be
endowed with the total order inherited from G, with the group addition, and with the
constant 0. Then we call (L; ≤, +, 0) a group cone tomonoid.
Clearly, a group cone tomonoid is a fg.p.c. tomonoid. Note that in general we do
not deal with the whole positive cone of a totally ordered Abelian group. In fact, the
latter is in general not finitely generated even if the group is.
Group cone tomonoids are characterised by the following condition. We say that a
fg.p.c. tomonoid L is cancellative if, for all a, b, c ∈ L, a + c = b + c implies a = b.
Note that in this case, for all a, b, c ∈ L, a ≤ b is equivalent to a + c ≤ b + c.
88 T. Vetterlein and M. Petrík
Proof The “only if” part follows from the construction of a group cone tomonoid.
To see the “if” part, let L be cancellative. Let G be the group consisting of the
differences of elements of L; see, e.g., [10, Chap. II.2]. Viewing L as a subset of G, we
introduce a total order on G as follows: for a, b, c, d ∈ L, we define a − b ≤ c − d
if a + d ≤ b + c in L. Then (G; ≤, +, 0) is a totally ordered Abelian group, and
(L; ≤, +, 0) is a subtomonoid of (G + ; ≤, +, 0). The assertion follows.
Recall next that the order of a partially ordered Abelian group (G; ≤, +, 0) is
uniquely determined by its positive cone G + . In fact, for any g, h ∈ G, g ≤ h if
and only if h − g ∈ G + . We may also view the positive cone of a partially ordered
group as the set of all differences of elements g and h such that g ≤ h; indeed,
G + = {h − g : g, h ∈ G such that g ≤ h}.
We may use the same object to describe group cone tomonoids. We denote by
(Zn ; +, 0̄) the free Abelian group generated by n ≥ 1 elements. Furthermore, will
be the partial order on Zn defined according to (1): for a, b ∈ Zn , we put a b if
a + c = b for some c ∈ Nn . Then (Zn ; , +, 0̄) is a lattice-ordered group.
P = {b − a ∈ Zn : a, b ∈ Nn such that a b}
A positive cone determines the preorder from which it is defined as in the case of
groups.
The positive cones of partially ordered Abelian groups possess an intrinsic charac-
terisation: they are exactly the cancellative commutative monoids such that a + b = 0
implies a = b = 0 [10]. The positive cones of cancellative monomial preorders can
be described in a similar way.
Positive cones describe cancellative fg.p.c. tomonoids. In this section we will gener-
alise this notion to cover a wider class of tomonoids. In this case we will not obtain
a strict correlation, but we will be led to a Galois correspondence.
Let be a monomial preorder on Nn . If is cancellative, then for any a, b ∈ Nn
the question of whether or not a b holds depends only on the difference z = b − a:
we have a b if and only if c d for any other pair c, d ∈ Nn such that z = d − c.
In fact, the positive cone P consists of these differences; a b if and only if
b − a ∈ P .
In general, the question of whether or not we have a b does not depend on the
difference b − a alone. For instance, it may be the case that a + c b + c holds for
some c ∈ Nn but not a b. However, let z ∈ Zn . Then the following lemma implies
that still at least one of following possibilities applies: a b for all a, b ∈ Nn such
that b − a = z, or b a for all a, b ∈ Nn such that b − a = z.
z + = z ∨ 0̄,
z − = −z ∨ 0̄.
Then we have
z = z+ − z−
and any other pair of elements of Nn whose difference is z arises from z + and z − by
adding a t ∈ Nn .
For a compatible preorder on Nn , the obvious consequence is the following.
Let z ∈ Zn . If z − z + , we conclude from Lemma 3 and the compatibility of that
a b actually holds for any pair a, b ∈ Nn such that b − a = z. Thus, intuitively,
we may view any z ∈ Zn such that z − z + as being “positively directed”; for, in
this case we have a a + z for any a ∈ Nn such that a + z ∈ Nn . Our viewpoint is
reflected in the following definition.
The Semantics of Fuzzy Logics: Two Approaches to Finite Tomonoids 91
C = {z ∈ Zn : z − z + }
(x1 + · · · + xk )− + x1 + · · · + xi 0̄ (5)
Our next aim is to show that conditions (C1)–(C3) characterise direction cones.
A preorder gives rise to a direction cone, which fulfils (C1)–(C3). Conversely, we
can assign a preorder to a set fulfilling (C1)–(C3).
We have seen that there is a mutual correspondence between monomial preorders and
direction cones. This correspondence is not one-to-one, some monomial preorders
are proper extensions of those that are induced by direction cones. However, we can
established a Galois correspondence between the two sets.
Let us fix an n ≥ 1. Let P be the set of all monomial preorders on Nn and let C
be the set of all direction cones in Zn . We partially order the two sets by means of
the set-theoretic inclusion. We then readily check that the two mappings
P → C , → C ,
C → P, C → C
are order-preserving. The mappings are not one-to-one; in fact, the former is surjec-
tive but not injective, and the latter is injective but not surjective. From Theorems
2 and 3 we conclude what happens when applying the mappings successively: any
∈ P is an extension of C ; and any C ∈ C is equal to CC . Hence there is the
following Galois connection between P and C : for any ∈ P and C ∈ C ,
C ⊆ if and only if C ⊆ C .
3.5 Example
and that the last indicated element is the top element. In accordance with Proposi-
tion 1, let ι : N2 → L be the surjective monoid homomorphism such that ι((1, 0)) = a
and ι((0, 1)) = b, and endow N2 with the preorder according to (2). Then we have
.. .. .. .. .. ..
. . . . . .
Fig. 1 The example tomonoid L. The simple arrows indicate the immediate-successor relation
w.r.t. ; the double arrows indicate -equivalence
C = {( p, q) ∈ Z2 : (− p ∨ 0, −q ∨ 0) ( p ∨ 0, q ∨ 0)}
= {( p, q) ∈ Z2 : p, q ≥ 0} ∪
{(−2, 2), (−1, 1), (−1, 2), (2, −1), (3, −2), (3, −1), (4, −2), (4, −1)} ∪
{( p, q) ∈ Z2 : p ≤ 0 and q ≥ 3} ∪
{( p, q) ∈ Z2 : p ≥ 5 and q ≤ 0}.
.. .. .. .. .. .. .. .. .. .. .. .. .. ..
. . . . . . . . . . . . . .
··· ···
··· ···
··· ···
··· ···
···
···
···
···
···
···
···
···
.. .. ..
. . .
Fig. 2 The direction cone C of the monomial preorder representing L. Each element of C is
depicted as a circle in the Z2 plane
.. .. .. .. .. ..
. . . . . .
In the second part of this chapter, we develop a much different point of view on
tomonoids. To begin with, we switch to the dual order and the multiplicative notation,
as is common in fuzzy logic.
We will again assume the property called “positive” in the previous part. In the
present context, however, positivity means that 1 is the top element; accordingly, we
will refer to this property as “negative”. Furthermore, we will restrict to the finite
case. Finally, our considerations do not rely on the commutativity of the monoidal
product and hence we will not assume this condition here.
We shall write “f.n.” for “finite, negative”. That is, a f.n. tomonoid is a structure
(L; ≤, , 1) such that (L; , 1) is a finite monoid and ≤ is a compatible total order
whose top element is 1.
Our aim is to describe the construction of f.n. tomonoids in a step-by-step manner.
The main idea is the following. Let (L; ≤, , 1) be a non-trivial f.n. tomonoid and let
0 and α be its smallest and second smallest element, respectively. Then the identifica-
tion of 0 and α is a tomonoid congruence and the quotient is by one element smaller
than L. Continuing in the same way, we get a sequence of tomonoids that ends with
the trivial one. It seems then natural to ask how to generate such a sequence in the
reversed order. That is, given an f.n. tomonoid L, how can we determine all those
f.n. tomonoids L̄ that are by one element larger and such that the identification of
their smallest two elements leads back to L? This is in fact the question that we will
answer. We will provide a practical method of determining from L systematically
all tomonoids L̄ of the indicated type.
The results of the present section are due to [24], where further details can be
found. For a more general approach to the extension of partially ordered monoids,
see, e.g., [18, 19].
Consider a negative tomonoid (L; ≤, , 1) and let q be one of its elements. Then
Iq = {a ∈ L : a ≤ q} is an ideal of L, seen as a monoid. Indeed, by the negativity of
L, a ≤ q implies a b ≤ q and b a ≤ q for any b ∈ L. Consequently, we may
form the Rees quotient of the monoid L by Iq ; see, e.g., [17]. Its elements may be
identified with the elements that are not in Iq as well as one further element, usually
denoted by 0. Obviously, this monoid congruence has only convex classes and hence
it is a tomonoid congruence; cf., e.g., [9].
For a finite chain L, we will denote by 0 the bottom element and we write
L ∗ = L \ {0}.
If L has at least two elements, we furthermore call the second smallest element of L
the atom of L. We will use in the sequel the symbol α to denote the atom.
Given a non-trivial f.n. tomonoid L, then its Rees quotient L/α by its atom α arises
from L by the identification of the smallest two elements. L is in this case a one-
element coextension of L/α. Our aim is to determine all one-element coextensions
of a f.n. tomonoid. We will then obviously be in the position to construct, starting
from the trivial tomonoid, successively all f.n. tomonoids.
Definition 9 Let (L; ≤, , 1) be a tomonoid. We define, for any (a, b), (c, d) ∈ L 2 ,
(a, b) ∼ (c, d) if a b = c d.
Based on the level equivalence of a tomonoid L, we will endow the set L 2 with a
first-order structure as follows.
Definition 10 Let ≤ be a total order on a set L and let 1 ∈ L. We denote the com-
ponentwise order on L 2 by , that is, we put
Then (L; ≤, , 1) is the unique tomonoid such that (L 2 ; , ∼, (1,1)) is its associated
tomonoid partition.
0 t u v w x y z 1
0 t u v w x y z 1 1
0 0 t u u v w x z z
0 0 0 t t u u v y y
0 0 0 t t u u v x x
0 0 0 0 0 t t u w w
0 0 0 0 0 t t u v v
0 0 0 0 0 0 0 t u u
0 0 0 0 0 0 0 0 t t
0 0 0 0 0 0 0 0 0 0
Fig. 4 A tomonoid partition associated with an eight-element negative tomonoid L. Rows and
columns of the array correspond to the elements of L, thus each square in the array corresponds
to a pair (a, b) ∈ L 2 , where a is the row index and b is the column index. In order to represent ∼,
we have indicated in each square (a, b) the product of a and b in L; two squares are ∼-equivalent
iff they contain the same symbol. For instance, the ∼-class of u comprises even elements and the
∼-class of 1 just one
e c b
0 t u v w x y z 1
0 t u v w x y z 1 1
0 0 t u u v w x z z b
0 0 0 t t u u v y y a
0 0 0 t t u u v x x
0 0 0 0 0 t t u w w
0 0 0 0 0 t t u v v d
0 0 0 0 0 0 0 t u u
0 0 0 0 0 0 0 0 t t
0 0 0 0 0 0 0 0 0 0
Fig. 5 The “Reidemeister” condition (P3). A (connected or broken) bold line between two elements
of the array indicates level equivalence. By (P3), the equivalences of the pairs connected by a solid
line imply the equivalence of the pair connected by a broken line
rectangles such that one hits the upper edge and the other one hits the right edge.
Assume that the upper left, upper right, and lower right vertices of these rectangles
are in the same blocks, respectively. By (P3), then also the remaining pair, consisting
of the lower left vertices, is in the same block. A related property is known from the
field of web geometry and called the “Reidemeister condition” [1, 2].
We conclude the subsection with a characterisation of those tomonoid partitions
in which we are actually interested: the finite, negative ones. The slightly optimised
characterisation will be useful in subsequent proofs.
Proposition 6 Let (L; ≤) be a finite and at least two-element chain with the top
element 1. Let 0 be the bottom element of L. Then (L 2 ; , ∼, (1,1)) is a tomonoid
partition if and only if (P1), (P2), and the following condition hold:
(P3’) For any a, b, c, d, e ∈ L \ {0, 1}, (a, b) ∼ d and (b, c) ∼ e imply (d, c) ∼
(a, e).
In this case, (L 2 ; , ∼, (1,1)) is finite and negative.
We have seen that tomonoids and tomonoid partitions are in a one-to-one corre-
spondence. Consequently, we can apply properties, constructions, etc. defined for
tomonoids to tomonoid partitions as well. We establish in this subsection a few of
such correspondences.
For convenience, we will apply to tomonoid partitions the same notions as to
tomonoids. For instance, a tomonoid partition will be called negative if the corre-
sponding tomonoid is negative.
Definition 11 We call a negative tomonoid Archimedean if, for any a ≤ b < 1, there
is an n ≥ 1 such that bn ≤ a.
Note that negative tomonoids with at most two elements are trivially Archimedean.
Archimedean f.n. tomonoid partitions are characterised as follows.
Proof Let (L; ≤, , 1) be the corresponding f.n. tomonoid and let 0 be the bottom
element of L. W.l.o.g., we can assume 0 = 1. We show that (i) and (ii) are equivalent.
The equivalence of (i) and (iii) is seen similarly.
Assume that (ii) holds. By the negativity of L, we have b a < a for all a = 0 and
b < 1. Let a < 1. Then, for any n ≥ 1, either a n+1 < a n or a n = 0. As L is finite, the
latter possibility applies for a sufficiently large n. It follows that L is Archimedean.
Assume that (ii) does not hold. Let a = 0 and b < 1 such that b a = a. As L is
negative, we then have a ≤ b and it follows bn ≥ bn−1 a = a > 0 for any n ≥ 2.
Hence L cannot be Archimedean.
Proof Let (L; ≤, , 1) be the corresponding negative tomonoid. Let q be the binary
operation on L q such that (L q ; ≤, q , 1) is (under the obvious identifications) the
Rees quotient of L by q. Let (L q 2 ; , ∼q , (1,1)) be the associated tomonoid partition.
Let a, b, c ∈ L such that c > q and (a, b) ∼ c. Then a, b ≥ c by Lemma 6(i) and
consequently a, b > q. We conclude that the ∼-class of each c ∈ L q is contained
in (L q )2 .
We have to show ∼q = ∼q . Let a, b, c ∈ L q such that c = 0. Then (a, b) ∼q c iff
a q b = c iff a b = c iff (a, b) ∼ c. Hence the ∼q -class of each c ∈ L q coincides
with the ∼-class of c. There is only one further ∼q -class, the ∼q -class of 0, which
consequently consists of all elements of L q 2 not belonging to the ∼-class of any
c ∈ L q .
0 u v w x y z 1 0 v w x y z 1 0 w x y z 1
0 u v w x y z 1 1 0 v w x y z 1 1 0 w x y z 1 1
0 0 u u v w x z z 0 0 0 v w x z z 0 0 0 w x z z
0 0 0 0 u u v y y 0 0 0 0 0 v y y 0 0 0 0 0 y y
0 0 0 0 u u v x x 0 0 0 0 0 v x x 0 0 0 0 0 x x
0 0 0 0 0 0 u w w 0 0 0 0 0 0 w w 0 0 0 0 0 w w
0 0 0 0 0 0 u v v 0 0 0 0 0 0 v v 0 0 0 0 0 0 0
0 0 0 0 0 0 0 u u 0 0 0 0 0 0 0 0
0 0 0 0 0 0 0 0 0
0 x y z 1 0 y z 1 0 z 1 0 1 1
0 x y z 1 1 0 y z 1 1 0 z 1 1 0 1 1 1 1
0 0 0 x z z 0 0 0 z z 0 0 z z 0 0 0
0 0 0 0 y y 0 0 0 y y 0 0 0 0
0 0 0 0 x x 0 0 0 0 0
0 0 0 0 0 0
Fig. 6 Starting from the eight-element tomonoid shown in Fig. 4, the successive formation of Rees
quotients by the atom leads eventually to the trivial tomonoid
Let ∼
˙ be the smallest equivalence relation on L̄ 2 such that the following conditions
hold:
(E1) For any (a, b), (c, d) ∈ P such that (a, b) ∼ (c, d), we have (a, b) ∼
˙ (c, d).
(E2) For any (a, b), (b, c) ∈ P and d, e ∈ L such that (d, c), (a, e) ∈ Q, (a, b) ∼
d, and (b, c) ∼ e, we have (d, c) ∼˙ (a, e).
(E3) For any a, b, c, e ∈ L such that (a, b) ∈ Q, (b, c) ∼ e, and c < 1, we have
(a, e) ∼
˙ 0.
Moreover, for any a, b, c, d ∈ L such that (b, c) ∈ Q, (a, b) ∼ d, and a < 1,
we have (d, c) ∼
˙ 0.
(E4) We have (0, 1) ∼
˙ (1, 0) ∼
˙ (α, b) ∼˙ (b, α) for any b < 1, and (α, 1) ∼
˙ (1, α).
Moreover, for any (a, b), (c, d) ∈ Q such that (a, b) (c, d) ∼˙ 0, we have
(a, b) ∼
˙ 0.
Then we call the structure ( L̄ 2 ; , ∼,
˙ (1,1)) the ramification of (L 2 ; , ∼, (1,1)).
A few remarks might help to clarify the meaning of Definition 12. Let the tomonoid
partition (L 2 ; , ∼, (1,1)) be given. The subset P of L̄ 2 consists of all pairs (a, b) ∈
L 2 whose product in L is not the bottom element. That is, P is the union of the
∼-classes of all c ∈ L and this union lies in L 2 . We note that P is an upwards
closed subset of L̄ 2 and, consequently, its complement Q is a downward closed subset
of L̄ 2 .
The intermediate equivalence relation ∼ ˙ is determined by successive application
of conditions (E1)–(E4). We observe that ∼-equivalences
˙ involving elements of P
are required by condition (E1) only. In fact, all the ∼-classes contained in P are
∼-classes
˙ as well.
The ∼-classes
˙ contained in Q are determined by conditions (E2)–(E4). In fact,
each prescription contained in (E2) and (E3) is of the form that certain ∼-equivalences
imply that a certain pair of elements of Q is ∼-equivalent.
˙ Finally, (E4) prescribes
that the ∼-class
˙ of 0 is downward closed. We remark that Q contains the ∼-class ˙
of the bottom element 0, the ∼-class
˙ of the atom α, and possibly further ∼-classes,
˙
which contain neither (1, c) nor (c, 1) for any c ∈ L̄.
In the sequel, for two equivalence relations ∼1 and ∼2 on a set A, we say that
∼1 is coarser than ∼2 if ∼2 ⊆ ∼1 . In other words, ∼1 coarser than ∼2 if and only if
each ∼1 -class is a union of ∼2 -classes.
let ( L̄ 2 ; , ∼,
˙ (1,1)) be the ramification of L 2 . Then ∼
¯ is coarser than ∼ ˙ and the
following holds: the ∼-class
¯ of each c ∈ L coincides with the ∼-class
˙ of c, the
∼-class
¯ of 0 is downward closed, and each ∼-class
¯ contains exactly one element of
the form (1, c) for some c ∈ L̄.
Proof Let (L; ≤, , 1) and ( L̄; ≤, , ¯ 1), where L̄ = L ∪ ˙ {0, α}, be the two
tomonoids in question.
As noted above, condition (E1) requires ∼-equivalences
˙ only between elements
of P and the remaining conditions require ∼-equivalences
˙ only between elements
of Q. Furthermore, P is the union of the ∼-classes of all c ∈ L . By (E1), these
∼-classes are also ∼-classes.
˙ Moreover, by Proposition 7, each ∼-class of a c ∈ L
is a ∼-class.
¯ We conclude that the ∼-class
¯ of each c ∈ L coincides with the ∼-class
˙
of c and P is the union of these subsets.
We next check that any two elements that are ∼-equivalent
˙ according to one of the
conditions (E2)–(E4) are also ∼-equivalent.
¯ Since ∼˙ is, by assumption, the smallest
equivalence relation with the indicated properties, it will then follow that ∼ ˙ ⊆ ∼. ¯
Ad (E2): Let (a, b), (b, c) ∈ P, d, e ∈ L , (a, b) ∼ d, and (b, c) ∼ e. Then
a, b, c ∈ L , hence a ¯ b = a b = d and b ¯ c = b c = e. Consequently, d ¯
c = (a ¯ b) ¯ c=a ¯ (b ¯ c) = a ¯ e, that is (d, c) ∼ ¯ (a, e).
Ad (E3): Let a, b, c, e ∈ L , (a, b) ∈ Q, (b, c) ∼ e, and c < 1. Then a ¯ b≤
α and hence a ¯ e=a ¯ (b ¯ c) = (a ¯ b) ¯ c≤α ¯ c. As L is assumed to be
Archimedean, α is the atom of L̄, and c < 1, we conclude α ¯ c = 0. Hence (a, e) ∼ ¯
0. Similarly, we argue for the second part of (E3).
Ad (E4): As L is Archimedean, we have, for any b < 1, 0 ¯ 1=1 ¯ 0=
α ¯ b=b ¯ α = 0 by Lemma 7 and hence (0, 1) ∼ ¯ (1, 0) ∼¯ (α, b) ∼¯ (b, α). Fur-
thermore, we have (α, 1) ∼ ¯ (1, α). Finally, let (a, b), (c, d) ∈ Q and assume (a, b)
(c, d) ∼ ¯ 0. Then a ¯ b≤c ¯ d = 0 and thus (a, b) ∼ ¯ 0 as well.
It is finally clear that the ∼-class
¯ of 0 is downward closed. The last statement
holds by condition (P2) of a tomonoid partition.
the elements (a, 0) and (0, a) for any a as well as (a, α) and (α, a) for any a = 1
belong to the ∼-class
˙ of (1, 0). Again, (1, α) and (α, 1) are not concerned. Finally,
the ∼-class
˙ of (1, 0) is a downward closed set. Also this prescription has no effect on
(1, α) or (α, 1) because there is no element in Q that is larger than (1, α) or (α, 1).
We conclude that {(1, α), (α, 1)} is an own ∼-class
˙ and our claim is shown.
Let now ∼ ¯ ⊇∼ ˙ be as indicated. Note that, by what we have seen so far, at least
one such equivalence relation exists. In accordance with Proposition 6, we will verify
(P1), (P2), and (P3’).
We have shown that (1, c) ∼ ¯ (c, 1) for all c ∈ L̄. By construction, ∼
¯ fulfils (P2).
Furthermore, the ∼-class
¯ of 0 is downward closed and Q, which is the union of the
∼-classes
¯ of 0 and α, is downward closed as well. We conclude that (P1) holds for
∼.
¯
It remains to show that ∼ ¯ fulfils (P3’). Let a, b, c, d, e ∈ L \ {0, 1} such that
(a, b) ∼¯ d and (b, c) ∼ ¯ e. We distinguish the following cases.
Case 1. Let d, e ∈ L . Then (a, b) ∼ d and (b, c) ∼ e. As ∼ fulfils (P3), we have
(d, c) ∼ (a, e). In particular, it follows that (d, c) ∈ P iff (a, e) ∈ P. If (d, c) and
(a, e) are both in P, we have (d, c) ∼ ¯ (a, e) because the ∼-classes
˙ contained in P
are ∼-classes
¯ as well. If (d, c) and (a, e) are both in Q, we have (d, c) ∼ ˙ (a, e) by
(E2) and consequently also (d, c) ∼ ¯ (a, e), because ∼ ¯ extends ∼.
˙
Case 2. Let d = α and e ∈ L . Then (d, c) ∼ ˙ 0 by (E4). Furthermore, we have
a ∈ L by (E4), b, c ∈ L because (b, c) ∈ P, (a, b) ∈ Q, and (b, c) ∼ e. It follows
(a, e) ∼
˙ 0 by (E3). Consequently, (d, c) ∼ ¯ 0∼ ¯ (a, e).
Case 3. Let d ∈ L and e = α. We argue similarly to Case 2.
Case 4. Let d = e = α. Then (d, c) ∼ ˙ (a, e) ∼˙ 0 by (E4) and consequently also
(d, c) ∼¯ (a, e).
By Proposition 6, ( L̄ 2 ; , ∼,
¯ (1,1)) is a f.n. tomonoid partition, which is moreover
Archimedean by (E4) and Lemma 7. It is finally clear from Proposition 7 that the
Rees quotient of L̄ 2 by the atom α is L 2 .
The final statement follows from Lemma 8.
Let us summarise our construction and add some remarks. In order to determine the
one-element coextensions of a f.n. tomonoid L, we start from its associated tomonoid
partition (L 2 ; , ∼, (1,1)). We first determine its ramification ( L̄ 2 ; , ∼,
˙ (1,1))
according to Definition 12. This is done by means of the conditions (E1)–(E4);
note that these prescriptions are largely independent, it is not necessary to apply
them in a recursive way. To obtain, second, a coextension of the desired type, the set
Z = (1, 0) ∼¯ , i.e. the ∼-class
¯ of the bottom element, is chosen according to Theo-
rem 5. This is done as simple as follows: Z is a union of ∼-classes
˙ contained in Q
including (1, 0) ∼˙ but excluding the ∼-class
˙ {(1, α), (α, 1)}, and Z is downward
closed. Thus, to determine a specific one-element coextension, all we have to do is
to select an arbitrary set of ∼-classes
˙ different from {(α, 1), (1, α)} and Z will then
be the smallest downward closed set containing them.
Note that one possible choice is Z = Q \ {(α, 1), (1, α)}. This means that the
explained procedure always leads to a result, that is, every Archimedean, finite,
negative tomonoid has at least one Archimedean one-element coextension.
The Semantics of Fuzzy Logics: Two Approaches to Finite Tomonoids 107
Also in the general case, it is interesting that the explained procedure never requires
revisions. At no place decisions are required that lead to an impossible situation, we
may always proceed to end up with a coextension as desired.
Acknowledgments The support of the first author by the Austrian Science Fund (FWF): project
I 1923-N25 (New perspectives on residuated posets) and the support of the second author by the
Czech Science Foundation under Project 15-07724Y are gratefully acknowledged.
References
1. Aczél, J.: Quasigroups, nets and nomograms. Adv. Math. 1, 383–450 (1965)
2. Blaschke, W., Bol, G.: Geometrie der Gewebe, topologische Fragen der Differentialgeometrie
(in German). Springer, Berlin (1939)
3. Blount, K., Tsinakis, C.: The structure of residuated lattices. Int. J. Algebra Comput. 13, 437–
461 (2003)
4. Chen, W., Zhao, X.: The structure of idempotent residuated chains. Czech. Math. J. 59, 453–479
(2009)
5. Clifford, A.H., Preston, G.B.: The Algebraic Theory of Semigroups, vol. 1. American Mathe-
matical Society, Providence (1961)
6. Cox, D., Little, J., O’Shea, D.: Ideals, varieties, and algorithms. An introduction to Computa-
tional Algebraic Geometry and Commutative Algebra, 3rd edn. Springer, New York (2007)
7. De Baets, B., Mesiar, R.: Discrete triangular norms. In: Rodabaugh, S.E., et al. (Eds.), Topo-
logical and Algebraic Structures in Fuzzy Sets. A Handbook of Recent Developments in the
Mathematics of Fuzzy Sets, pp. 389–400. Kluwer Academic Publishers, Dordrecht (2003)
8. Esteva, F., Godo, L.: Monoidal t-norm based logic: towards a logic for left-continuous t-norms.
Fuzzy Sets Syst. 124, 271–288 (2001)
9. Evans, K., Konikoff, M., Madden, J.J., Mathis, R., Whipple, G.: Totally ordered commutative
monoids. Semigroup Forum 62, 249–278 (2001)
10. Fuchs, L.: Partially Ordered Algebraic Systems. Pergamon Press, Oxford (1963)
11. Galatos, N., Jipsen, P., Kowalski, T., Ono, H.: Residuated lattices. An Algebraic Glimpse at
Substructural Logics. Elsevier, Amsterdam (2007)
12. Grillet, P.A.: Semigroups. An Introduction to the Structure Theory. Marcel Dekker, New York
(1995)
13. Grillet, P.A.: Commutative Semigroups. Kluwer Academic Publishers, Dordrecht (2001)
14. Hájek, P.: Metamathematics of Fuzzy Logic. Kluwer Academic Publisher, Dordrecht (1998)
15. Horčík, R.: Structure of commutative cancellative integral residuated lattices on (0, 1]. Algebra
Univers. 57, 303–332 (2007)
16. Horčík, R.: On the structure of finite integral commutative residuated chains. J. Log. Comput.
21, 717–728 (2011)
17. Howie, J.M.: An Introduction to Semigroup Theory. Academic Press, London (1976)
18. Hulin, A.J.: Extensions of ordered semigroups. Czech. Math. J. 26, 1–12 (1976)
19. Kehayopulu, N., Tsingelis, M.: Ideal extensions of ordered semigroups. Commun. Algebra 31,
4939–4969 (2003)
20. Klement, E.P., Mesiar, R., Pap, E.: Triangular Norms. Kluwer Academic Publishers, Dordrecht
(2000)
21. Montagna, F., Noguera, C., Horčík, R.: On weakly cancellative fuzzy logics. J. Log. Comput.
16, 423–450 (2006)
22. Noguera, C., Esteva, F., Godo, L.: Generalized continuous and left-continuous t-norms arising
from algebraic semantics for fuzzy logics. Inf. Sci. 180, 1354–1372 (2010)
23. Petrich, M.: Introduction to Semigroups. Charles E. Merrill Publishing Company, Columbus
(1973)
108 T. Vetterlein and M. Petrík
24. Petrík, M., Vetterlein, Th.: Rees coextensions of finite, negative tomonoids. J. Log. Comput.
(to appear)
25. Petrík, M., Sarkoci, P.: Associativity of triangular norms characterized by the geometry of their
level sets. Fuzzy Sets Syst. 202, 100–109 (2012)
26. Vetterlein, Th.: A Representation of Finite, Positive, Commutative Tomonoids. www.flll.jku.
at/sites/default/files/u24/Direction-f-cones.pdf
27. Vetterlein, Th.: Algebraic semantics: the structure of chains. In: Cintula, P., Fermüller, C.,
Noguera, C. (eds.) Handbook of Mathematical Fuzzy Logic, vol. 3 (to appear)
28. Vetterlein, Th.: On positive commutative tomonoids. Algebra Univ. (to appear)
29. Vetterlein, Th: Totally ordered monoids based on triangular norms. Commun. Algebra 43,
2643–2679 (2015)
30. Ya, E.: Gabovich, Fully ordered semigroups and their applications. Russ. Math. Surv. 31,
147–216 (1976)
Structure of Uninorms with Continuous
Diagonal Functions
Andrea Mesiarová-Zemánková
Abstract The structure of uninorms which are continuous on some special parts
of the unit square is discussed. After a summary of partial results achieved in the
characterization of uninorms with continuous underlying t-norm and t-conorm in
the past years, a full characterization of these uninorms is described. Representation
theorems based on the set of discontinuity points of such a uninorm and the ordinal
sum construction for semigroups are presented. Further generalizations yield uni-
norms with continuous diagonal functions. Several results related to uninorms with
continuous diagonals are investigated. Further generalizations are also discussed.
1 Introduction
Uninorms (originally called uni-norms, see [49]) are functions that on the one hand
generalize both t-norms and t-conorms, and on the other hand allow bipolar behaviour
[50]. Moreover, uninorms linearly transformed to the interval [−1, 1] are just bipolar
t-conorms (see [31]). A binary function is a uninorm if it is commutative, associative,
non-decreasing in each variable and has a neutral element e ∈ [0, 1]. Therefore the
class of uninorms covers also the class of t-norms (for which e = 1) and the class
of t-conorms (for which e = 0). To distinguish uninorms which are not t-norms
or t-conorms, in later works authors assume that a uninorm has a neutral element
e ∈ ]0, 1[. Such uninorms are also called proper.
Due to the associativity the n-ary form of any uninorm is uniquely given and thus
it can be extended to an aggregation function working on n∈N [0, 1]n .
In this chapter we focus on uninorms with continuous diagonals. At first we focus
on uninorms with continuous underlying functions (for the definition of underlying
functions see Sect. 2). Afterwards we will search for conditions under which uni-
norms with continuous diagonals have continuous underlying functions. The chapter
is organized as follows. In Sect. 2 we will give a historical overview of results related
A. Mesiarová-Zemánková (B)
Mathematical Institute, Slovak Academy of Sciences, Bratislava, Slovakia
e-mail: [email protected]
2 Historical Overview
First results on the structure of uninorms are due to Yager and Rybalov [49] and
Fodor et al. [12]. As first observation we can introduce the fact that for a uninorm
with the neutral element e ∈ [0, 1] there is U (a, 0) = 0 for all a ≤ e, and U (a, 1) = 1
for all a ≥ e. This observation was then extended in [12] where it was shown that
for each uninorm U the restriction of U to [0, e]2 is a t-norm on [0, e]2 , i.e., a linear
transformation of some t-norm TU on [0, 1]2 and the restriction of U to [e, 1]2 is a
t-conorm on [e, 1]2 , i.e., a linear transformation of some t-conorm SU on [0, 1]2 . We
will call TU (SU ) an underlying t-norm (t-conorm) of U , and together TU and SU
will be called the underlying functions of U . If e = 1 (e = 0) then the underlying
t-conorm (t-norm) is not defined and the underlying t-norm (t-conorm) is the uninorm
itself. Moreover,
for all (x, y) ∈ [0, e] × [e, 1] ∪ [e, 1] × [0, e]. Thus each uninorm has a conjunctive
behaviour if all inputs are below the neutral element, a disjunctive behaviour if all
inputs are above the neutral element, and an averaging behaviour on the reminder.
For every uninorm the value a = U (0, 1) is the annihilator of U and it holds a ∈
{0, 1} (see [12]). If U (0, 1) = 0 the uninorm U is called conjunctive (andlike) and if
U (0, 1) = 1 the uninorm U is called disjunctive (orlike). Moreover, if U : [0, 1]2 −→
[0, 1] is a continuous uninorm then e = 1 or e = 0, i.e., U is either a continuous t-
norm, or a continuous t-conorm (see [22]). This means that there is no uninorm
with a neutral element e ∈ ]0, 1[ continuous on the whole unit square. In fact for
every uninorm either the function u 1 : [0, 1] −→ [0, 1] or the function u 0 : [0, 1] −→
[0, 1] is not continuous, where u 1 (x) = U (1, x) and u 0 (x) = U (0, x) for x ∈ [0, 1].
This follows from the fact shown in [12], that if u 1 and u 0 are both continuous except
at x = e then U (0, 1) = 0 implies U (x, 1) = x for all x ∈ [0, e[, and U (0, 1) = 1
implies U (x, 0) = x for all x ∈ ]e, 1] This yields the following theorem.
Theorem 1 ([12]) Suppose that U : [0, 1]2 −→ [0, 1] is a uninorm with neutral
element e ∈ ]0, 1[ and both functions x → U (x, 1) and x → U (x, 0) (x ∈ [0, 1])
are continuous except at the point x = e. Then U is given by one of the following
forms. If U (0, 1) = 0 then
Structure of Uninorms with Continuous Diagonal Functions 111
⎧
⎪
⎨e · T ( e , e ),
x y
if (x, y) ∈ [0, e]2 ,
Umin (x, y) = e + (1 − e) · S( x−e , y−e
), if (x, y) ∈ [e, 1]2 ,
⎪
⎩
1−e 1−e
min(x, y), otherwise,
and
⎧
⎪
⎨e · T ( e , e )
x y
if (x, y) ∈ [0, e]2 ,
Umax (x, y) = e + (1 − e) · S( x−e , y−e
) if (x, y) ∈ [e, 1]2 ,
⎪
⎩
1−e 1−e
max(x, y) otherwise
are uninorms. We will denote the set of all uninorms of the first type by Umin and of
the second type by Umax .
In the literature we can find several aggregation functions that generalize uni-
norms. In [26] Mas et al. introduced left and right uninorms, where the commuta-
tivity of uninorms was relaxed and a left uninorm possesses a left neutral element
and a right uninorm possesses a right neutral element. A uninorm without commu-
tativity was called a pseudo-uninorm in [48]. By removing the associativity and the
commutativity from the axioms of uninorms Liu introduced in [23] the concept of
semi-uninorms (on a complete lattice) and Su et al. [46] introduced the concept of
left and right semi-uninorms (on a complete lattice). Uninorms on a finite totally
ordered set were studied in [25]. By replacing the neutral element of a uninorm by
an n-neutral element we obtain n-uninorms that were studied in [2].
It is evident that the more we generalize the uninorm functions (on [0, 1]2 ), the
more complicated their structure will be. Thus in the clarification of the structure of
uninorms we should take an opposite approach and start from some special subclasses
of uninorms with less complicated structure.
From the observations made above we see that the structure of t-norms and
t-conorms plays a major role in the investigation of the structure of uninorms. Since
112 A. Mesiarová-Zemánková
t-norms and t-conorms are dual to each other, also their structure is similar. Let
us focus for a while on the class of t-norms. Although the class of left-continuous
t-norms was not yet fully characterized, and several peculiar examples of t-norms
belong to this class, the class of continuous t-norms (t-conorms) has a simple char-
acterization. Each continuous t-norm is an ordinal sum of continuous Archimedean
t-norms, and each continuous Archimedean t-norm is generated by a continuous
additive generator. Note that a t-norm T (t-conorm S) is called Archimedean if for
all x, y ∈ ]0, 1[ there exists an n ∈ N such that x T(n) < y, (x S(n) > y) where
n-times n-times
⎧
⎪
⎨x ∗α y if (x, y) ∈ X α × X α ,
x∗y= x if (x, y) ∈ X α × X β and α < β,
⎪
⎩
y if (x, y) ∈ X α × X β and α > β.
Each continuous t-norm has a closed set of idempotent points I (see [19]) and
thus [0, 1] \ I is equal to a union of open disjoint subintervals ]ak , bk [ of [0, 1]
which define supports for summands in the ordinal sum representation, i.e., the
corresponding semigroup is defined on [ak , bk [. If a t-norm is isomorphic to a strict
t-norm on [ak , bk ]2 we can further divide this summand into an ordinal sum of
semigroups defined on {ak } and on ]ak , bk [, however, if T is on [ak , bk ]2 isomorphic
Structure of Uninorms with Continuous Diagonal Functions 113
a a
Fig. 1 The uninorm U1 (left) and the uninorm U2 (right) from Remark 1. The bold lines denote
the points of discontinuity of U1 and U2
to a nilpotent t-norm this is not possible. In the case of t-norms the distinction between
{ak } with ]ak , bk [ and [ak , bk [ plays no role, however, as we will see, in the case of
uninorms this will be a crucial observation.
Remark 1 Assume U1 ∈ Umin and U2 ∈ Umax with respective neutral elements
e1 , e2 ∈ ]0, 1[. Then U1 and U2 are ordinal sums of semigroups in the sense of Clif-
ford. In the first case we have three semigroups G a = ([0, e1 [, TU1 ),
G b = (]e1 , 1], SU1 ), G c = ({e1 }, min), A1 = {a, b, c} and the order on the set
A1 is given by a < b < c. In the second case we have again three semigroups
G a = ([0, e2 [, TU2 ), G b = (]e2 , 1], SU2 ), G c = ({e2 }, max), A2 = {a, b, c} and the
order on the set A2 is given by b < a < c. We can see the uninorms U1 and U2 in
Fig. 1.
In [12] we can find another example of uninorms that are related to ordinal sum
of semigroups, namely pseudo-continuous uninorms. A uninorm U with neutral ele-
ment e is called pseudo-continuous on [0, 1]2 if and only if U is continuous on the
set [0, 1]2 \ {(x, y) | x = e or y = e}, i.e., U is continuous on the unit square except
the segments [(e, 0), (e, 1)] and [(0, e), (1, e)]. For a pseudo-continuous conjunc-
tive uninorm U with neutral element e ∈ ]0, 1[ we have U ∈ Umin , where TU and
SU are continuous in the open unit square and thus they are either continuous, or
obtained from an ordinal sum of a continuous t-norms (t-conorms) and a t-subnorm
(t-superconorm). More details on uninorms such that TU and SU are continuous in
the open unit square can be found in Sect. 4.2. Similarly, for a pseudo-continuous
disjunctive uninorm U with neutral element e ∈ ]0, 1[ we have U ∈ Umax and TU
and SU are continuous in the open unit square.
So far we know that each continuous t-norm is equal to an ordinal sum of
Archimedean t-norms. Furthermore, each Archimedean t-norm possesses a contin-
uous additive generator.
Proposition 1 Let t : [0, 1] −→ [0, ∞] be a continuous strictly decreasing function
such that t (1) = 0. Then the binary operation T : [0, 1]2 −→ [0, 1] given by
Extending the concept of additive generators also to the class of uninorms Fodor et
al. defined representable uninorms in [12] (which were before independently studied
as associative compensatory operators in [18]). A uninorm U : [0, 1]2 −→ [0, 1]
is called representable if there exists a continuous, strictly increasing function
h : [0, 1] −→ [−∞, ∞], h(0) = −∞, h(1) = ∞ such that
The function h is called an additive generator of the uninorm U . Note that if we relax
the strict monotonicity of h then the neutral element of the generated function will be
lost. If we relax conditions h(0) = −∞, h(1) = ∞ then if h(0) = 0, h(1) > 0 we
obtain a t-conorm, if h(1) = 0, h(0) < 0 we obtain a t-norm, if h(0) > 0, h(1) > 0
we obtain a t-subnorm (see Sect. 4.2) and if h(0) < 0, h(1) < 0 we obtain a t-
superconorm. In the case that h(0) < 0, h(1) > 0 the associativity will be lost.
Further, a uninorm U : [0, 1]2 −→ [0, 1] is representable if and only if U is con-
tinuous everywhere on the unit square except for the two points (0, 1) and (1, 0)
([1, 6, 11, 31, 42]), i.e., U is almost continuous. This implies that if U is almost
continuous then it is strictly increasing on the open unit square, i.e., TU and SU are
a strict t-norm and a strict t-conorm. On the other hand, from [14] it follows that if
T and S are a strict t-norm and a strict t-conorm then there exists a representable
uninorm U such that TU = T and SU = S.
In our summary we have already mentioned almost continuous uninorms, i.e.,
uninorms continuous on [0, 1] \ {(0, 1), (1, 0)}, pseudo-continuous uninorms, i.e.,
uninorm continuous on [0, 1]2 \ {(x, y) | x = e or y = e}, and now let us focus
on uninorms continuous on ]0, 1[2 . In [14] we can find several interesting results about
this class of uninorms. First, if U is a uninorm with the neutral element e ∈ ]0, 1[ and
there exists a u ∈ [0, e[ such that U (x, y) = x for all x ∈ ]u, e[ and y ∈ ]e, 1[, then
U is not continuous in ]0, 1[2 . Further, these uninorms have a very clear structure.
Theorem 3 ([14]) Assume that U : [0, 1]2 −→ [0, 1] is a uninorm with the neutral
element e ∈ ]0, 1[ and U is continuous on ]0, 1[2 . Then U can be represented as
(i) or (ii) :
(i) ⎧
⎪
⎪ e · TU ( xe , ey ), if x, y ∈ [0, u],
⎪
⎪
⎪
⎪ h −1 (h(x) + h(y)), if x, y ∈ ]u, 1[,
⎪
⎪
⎨x, if x ∈ [0, u], y ∈ ]u, 1[,
U (x, y) =
⎪
⎪ x, if x ∈ [0, λ[, y = 1,
⎪
⎪
⎪
⎪ if x ∈ ]λ, 1], y = 1,
⎪
⎪
1,
⎩
x or 1, if x = λ, y = 1,
(ii) ⎧
⎪
⎪e + (1 − e) · SU ( x−e , y−e
), if x, y ∈ [v, 1],
⎪
⎪
1−e 1−e
⎪
⎪ −1
r (r (x) + r (y)), if x, y ∈ ]0, v[,
⎪
⎪
⎨x, if x ∈ [v, 1], y ∈ ]0, v[,
U (x, y) =
⎪
⎪ x, if x ∈ ]ω, 1], y = 0,
⎪
⎪
⎪
⎪0, if x ∈ [0, ω[, y = 0,
⎪
⎪
⎩
x or 0, if x = ω, y = 0,
where, v ∈ ]e, 1], ω ∈ [v, 1], U (ω, ω) = ω, function r : [0, v] −→ [−∞, ∞] is con-
tinuous, strictly increasing, and r (0) = −∞, r (e) = 0, r (v) = ∞.
Remark 2 Assume a uninorm with the neutral element e ∈ ]0, 1[, continuous on
]0, 1[2 , which has the form (i) with λ > 0. This uninorm is an ordinal sum of semi-
groups in the sense of Clifford. Here we have five semigroups G a = ([0, λ[, U ),
G b = ({λ}, U ), G c = (]λ, u[, U ), G d = ([u, 1[, U ), and G f = ({1}, U ), with
A1 = {a, b, c, d, f } and the order on the set A1 is given by a < b < f < c < d
if U (1, λ) = λ and by a < f < b < c < d if U (1, λ) = 1. Similarly, if U has the
form (ii) with ω < 1 then it is an ordinal sum of semigroups in the sense of Clifford.
Here we have five semigroups G a = ({0}, U ), G b = (]0, v], U ), G c = (]v, ω[, U ),
G d = ({ω}, U ), and G f = (]ω, 1], U ), with A2 = {a, b, c, d, f } and the order on the
set A2 is given by f < d < a < c < b if U (0, ω) = ω and by f < a < d < c < b
if U (0, ω) = 0.
for any point of discontinuity s of g, let s − and s + be the corresponding lateral limits.
Then, we define the completed graph of g, denoted by Fg , as the set obtained from
G by adding the vertical segments in any discontinuity point s, from s − to s + .
116 A. Mesiarová-Zemánková
being commutative on the set of points (x, g(x)) such that x = g(g(x)).
For more details we recommend [41]. Idempotent uninorms on finite ordinal scales
were studied in [4].
A uninorm U which is internal on A(e) = [0, e] × [e, 1] ∪ [e, 1] × [0, e], i.e.,
U (x, y) ∈ {x, y} for all (x, y) ∈ A(e) is called locally internal on A(e). For uni-
norms locally internal on A(e) we have a result from [8](see also [7]) which shows
that if U : [0, 1]2 −→ [0, 1] is a uninorm locally internal on A(e) with neutral ele-
ment e ∈ ]0, 1[ then there exists a non-increasing function g : [0, 1] −→ [0, 1] with
fixed point e, such that inf{y ∈ [0, 1] | g(y) = g(x)} ≤ g(g(x)) ≤ sup{y ∈ [0, 1] |
g(y) = g(x)} for all x ∈ [0, 1], g(x) = 0 for all x > g(0), g(x) = 1 for all x < g(1),
and for all (x, y) ∈ A(e) there is
⎧
⎪
⎨min(x, y) if y < g(x), or y = g(x), x < g(g(x)),
U (x, y) = max(x, y) if y > g(x), or x = g(x), x > g(g(x)),
⎪
⎩
x or y if y = g(x) and x = g(g(x)).
Theorem 5 ([21]) Let U : [0, 1]2 −→ [0, 1] be a uninorm with the neutral element
e ∈ ]0, 1[ such that both TU and SU are strict then one of the following seven state-
ments holds:
(i) U ∈ Umin ,
(ii) ⎧
⎪
⎪ e · TU ( xe , ey ) if (x, y) ∈ [0, e]2 ,
⎪
⎨ e + (1 − e) · SU ( x−e , y−e
) if (x, y) ∈ [e, 1]2 ,
U (x, y) = 1−e 1−e
⎪1
⎪ if x = 1 or y = 1,
⎪
⎩
min(x, y) otherwise,
(iii)
⎧
⎪
⎪ e · TU ( xe , ey ) if (x, y) ∈ [0, e]2 ,
⎪
⎨e + (1 − e) · S ( x−e , y−e
U 1−e ) if (x, y) ∈ [e, 1]2 ,
U (x, y) = 1−e
⎪1
⎪ if x = 1, y > 0 or y = 1, x > 0,
⎪
⎩
min(x, y) otherwise,
(iv) U ∈ Umax ,
118 A. Mesiarová-Zemánková
(v) ⎧
⎪
⎪e · TU ( xe , ey ) if (x, y) ∈ [0, e]2 ,
⎪
⎨e + (1 − e) · S ( x−e , y−e
U 1−e ) if (x, y) ∈ [e, 1]2 ,
U (x, y) = 1−e
⎪
⎪0 if x = 0 or y = 0,
⎪
⎩
max(x, y) otherwise,
(vi)
⎧
⎪
⎪ e · TU ( xe , ey ) if (x, y) ∈ [0, e]2 ,
⎪
⎨e + (1 − e) · S ( x−e , y−e
U 1−e ) if (x,y) ∈ [e, 1]2 ,
U (x, y) = 1−e
⎪
⎪ 0 if x = 0, y < 1 or y = 0, x < 1,
⎪
⎩
max(x, y) otherwise,
(vii) U is representable.
Theorem 6 ([21]) Let U : [0, 1] −→ [0, 1]2 be a uninorm with the neutral element
e ∈ ]0, 1[ such that both TU and SU are nilpotent. Then either one of the following
two statements holds:
(i) U ∈ Umin ,
(ii) U ∈ Umax .
f (x) = v if x = e, (1)
⎪
⎩ (1−x)(d−c)
d − (1−e) otherwise.
Then f is linear on [0, e[ and on ]e, 1] and thus it is a piece-wise linear isomorphism
of [0, 1] to ([a, b[ ∪ {v} ∪ ]c, d]) and if U : [0, 1] −→ [0, 1]2 is a uninorm then a
binary function Uva,b,c,d : ([a, b[ ∪ {v} ∪ ]c, d])2 −→ ([a, b[ ∪ {v} ∪ ]c, d]) given by
is a uninorm on ([a, b[ ∪ {v} ∪ ]c, d])2 . Note that we assume that a = b (c = d) if and
only if e = 0 (e = 1). The function f is piece-wise linear, however, more generally
we can use any increasing isomorphic transformation.
If U1 and U2 are uninorms, with respective neutral elements e1 , e2 , then for 0 ≤
a < b < c < d ≤ 1 we have
(U1 )a,b,c,d
v (x, y) = (U2 )a,b,c,d
v (x, y)
if and only if U1 (x, y) = φ−1 (U2 (φ(x), φ(y))), where φ : [0, 1] −→ [0, 1] is a
strictly increasing isomorphism with φ(e1 ) = e2 which is linear on [0, e1 ] and on
[e1 , 1]. Similar result can be obtain for the case when a = b (c = d), however, then
only the corresponding parts of uninorms are isomorphic.
Now we have the following result
Proposition 2 ([32]) Assume e ∈ [0, 1]. Let K be an index set which is finite or
countably infinite and let (]ak , bk [)k∈K be a disjoint
system of open subintervals
(which can be also empty) of [0, e], such that k∈K [ak , bk ] = [0, e]. Similarly, let
(]ck , dk [)k∈K be a disjoint
system of open subintervals (which can be also empty)
of [e, 1], such that k∈K [ck , dk ] = [e, 1]. Let further these two systems be anti-
comonotone, i.e., bk ≤ ai if and only if ck ≥ di for all i, k ∈ K . Assume a family of
uninorms (Uk )k∈K on [0, 1]2 such that if both ]ak , bk [ and ]ck , dk [ are non-empty then
Uk is a proper uninorm, otherwise if ]ak , bk [ is non-empty then Uk is a t-norm and if
]ck , dk [ is non-empty then Uk is a t-conorm. If both ]ak , bk [ and ]ck , dk [ are empty then
ak = bk = ak1 = bk1 and ck = dk = ck1 = dk1 does not hold for any k1 ∈ K , k = k1 ,
and here only the value Uk (0, 1) is interesting. Denote K ∗ = {k ∈ K |]ak , bk [= ∅}
120 A. Mesiarová-Zemánková
underlying t-norm and t-conorm. From [33] we know that a uninorm U on [0, 1]2
is a complete ordinal sum of representable uninorms if and only if there exists a
continuous strictly decreasing function r : [0, 1] −→ [0, 1] with r (0) = 1, r (e) = e
and r (1) = 0 such that U is continuous on [0, 1] \ {(x, r (x)) | x ∈ [0, 1]} and U has
countably many idempotent points.
On the other hand, not every uninorm with continuous underlying t-norm and
t-conorm can be obtained as an ordinal sum of uninorms with continuous underlying
t-norm and t-conorm. Recall Theorem 5: here all cases except when U is representable
are ordinal sums in the sense of Clifford, however, only cases (i) and (iv) are ordinal
sums of uninorms. Let us see where this difference is hidden. For a detailed proof
we recommend [35].
We see that the ordinal sum of uninorms is based on the intervals of the form
[a, b[ ∪ ]c, d]. However, if the corresponding t-norm (t-conorm) that acts on [a, b]2
([c, d]2 ) is strict we can divide the semigroup acting on [a, b[ (]c, d]) into semigroups
acting on {a} and ]a, b[ (]c, d[ and {d}). By changing the order of these semigroups
in the ordinal sum construction (in the sense of Clifford) we can then obtain uni-
norms that cannot be obtained as an ordinal sum of uninorms, although they are
Archimedean. Thus in the case of Archimedean uninorms the problem is always on
the border of the unit square. In the case of t-norms (t-conorms) on the border of the
unit square we always have T = min (S = max), however, in the case of uninorms
with continuous underlying functions it is a mixture of min and max.
Recall that for a uninorm with continuous underlying t-norm and t-conorm U is
internal on {q} × [0, 1], where q ∈ [0, 1] is an idempotent point of U . After some
computations we can see that each uninorm with continuous underlying t-norm and
t-conorm is equal to an ordinal sum of uninorms with continuous underlying t-norm
and t-conorm on the set ([0, 1] \ IU )2 , where IU is the set of idempotent points of U .
Thus the difference can appear only in points (x, y) ∈ [0, 1]2 such that at least one
of x and y is idempotent.
Summarizing, we need a more general construction than the ordinal sum con-
struction that would allow semigroups defined on singletons. However, in order to
keep monotonicity, the order of these singletons cannot be changed arbitrarily, but it
depends on the other summands in the ordinal sum construction. As a first condition,
in [35] it was shown that for every idempotent point q of U the point of change
where U (x, q) = min(x, q) changes to U (x, q) = max(x, q), i.e., such a p ∈ [0, 1]
that U (x, q) = min(x, q) for all x < p and U (x, q) = max(x, q) for all x > p, is
unique and it is an idempotent point of U . Assume now that U with continuous
underlying t-norm and t-conorm is not equal to an ordinal sum of uninorms. This
means that for an ordinal sum of uninorms V such that U = V on ([0, 1] \ IU )2 there
exists an idempotent point q ∈ [0, 1] of U such that U (q, x) = V (q, x) for some
x ∈ [0, 1]. Assume that q > e (the other case is analogical). Then both restrictions
of U and V on [0, q]2 are uninorms on [0, q]2 . Let us transform these restrictions
to [0, 1]2 linearly. We obtain two uninorms U ∗ and V ∗ that are not equal on the
boundary of the unit square and V ∗ is an ordinal sum of uninorms. Now we will
recall two examples of such ordinal sums of uninorms.
122 A. Mesiarová-Zemánková
for c ∈ G ∗x denote
{[0, c]} if c = inf{ck | k ∈ G x },
Fc∗ =
{[0, c[, [0, c]} else,
and let
h : H −→ ( ([0, ak [ ∪ Jk ) ∪ Jb∗ )
x∈H k∈Hx b∈Hx∗
Example 1 Assume an ordinal sum uninorm U e : [0, 1]2 −→ [0, 1] such that U e =
(0, e, e, e, T , 0, 0, e, b, C1 , 0, 0, b, 1, C2 )e , for some b, e ∈ [0, 1], 0 < e <
b < 1 (see Fig. 2) and a t-norm T and t-conorms C1 , C2 . Assume that C1 is
c-strict and C2 is c-nilpotent. Since T is a t-norm we have U e (0, e) = 0 and thus
we have G = {0}, H = ∅ for sets G, H from the previous proposition. If we denote
the summands respectively as 1, 2, 3 for K = {1, 2, 3} we get G 0 = {2, 3}. Further,
G ∗∗ ∗
x = {e, b}, G x = ∅,
124 A. Mesiarová-Zemánková
T∗ max max
The function g is defined only in one point 0 and its range is the set
max
1
m+1
m+1
min
the previous section) we have now a detailed knowledge about structure of uninorms
with continuous underlying functions.
The major role in the decomposition of a uninorm with continuous underlying
functions to an extended ordinal sum of uninorms plays its characterizing multi-
function, which is yet another possibility how to characterize uninorms with con-
tinuous underlying t-norm and t-conorm. This characterizing multi-function in fact
covers the set of points of discontinuity of such a uninorm U . We will now show
results from [34] which characterize the set of points of discontinuity of a uninorm
with continuous underlying functions.
The first interesting result shows that a uninorm with continuous underlying func-
tions is either left-continuous or right continuous (or continuous) in each point from
[0, 1]2 . Next we need the definition of a multi-function.
Definition 1 A mapping p : X −→ P(Y ) is called a multi-function if for every
x ∈ X it assigns a subset of Y , i.e., p(x) ⊆ Y . A multi-function p is called
(i) non-increasing if for all x1 , x2 ∈ X, x1 < x2 there is p(x1 ) ≥ p(x2 ), i.e., for all
y1 ∈ p(x1 ) and all y2 ∈ p(x2 ) we have y1 ≥ y2 and thus Card ( p(x1 ) ∩ p(x2 )) ≤
1,
(ii) symmetric if y ∈ p(x) if and only if x ∈ p(y).
The graph of a multi-function p will be denoted by G( p), i.e., (x, y) ∈ G( p) if and
only if y ∈ p(x).
A symmetric multi-function p : [0, 1] −→ P([0, 1]) is surjective, i.e., for all y ∈ Y
there exists an x ∈ X such that y ∈ p(x), if and only if we have p(x) = ∅ for
all x ∈ X . The graph of a symmetric, surjective, non-increasing multi-function
126 A. Mesiarová-Zemánková
p : [0, 1] −→ P([0, 1]) is a connected line. For any uninorm with continuous under-
lying functions we denote A = inf{x | U (x, 0) > 0}, B = sup{x | U (x, 1) < 1}
and let a, d ∈ [0, 1] be such that U (x, y) = e for some y ∈ [0, 1] if and only if
x ∈ ]a, d[. Then either A = 1, B = 0, or A = 1, B = 0, or A = 1, B = 0. Note
that if A = 1, B = 0, then U is non-continuous in (B, 1), if A = 1, B = 0, then U
is non-continuous in (0, A), and if A = 1, B = 0 then U is non-continuous in (0, 1).
Further, we have 0 ≤ B ≤ a ≤ e ≤ d ≤ A ≤ 1. Now we can introduce the result
describing the set of points of discontinuity of a uninorm with continuous underly-
ing functions via a multi-function. Note that in the results on idempotent uninorms
and uninorms locally internal on A(e) introduced in the previous section we can see
examples of such a multi-function (see also Fig. 1).
0 max 0 max
Fig. 4 The uninorm U from Example 2. The bold lines denote the points of discontinuity of U
(left) and the characterizing multi-function r of U (right)
Then U ∈ Umax is a uninorm, where the underlying t-norm is the drastic product and
the underlying t-conorm is the maximum. This uninorm is non-continuous in points
from {e} × [0, e] ∪ [0, e] × {e}. Thus the corresponding multi-function is given by
(see Fig. 4) ⎧
⎪
⎪ [e, 1] if x = 0,
⎪
⎨e if x ∈ ]0, e[,
r (x) =
⎪
⎪ [0, e] if x = e,
⎪
⎩
0 otherwise.
and border points of these segments are always idempotent elements of U . Then
each horizonal and vertical segment corresponds to a non-complete summand, i.e.,
a continuous t-norm or t-conorm, in the decomposition by means of the extended
ordinal sum construction. These t-norms (t-conorms) need not be Archimedean, but
could be decomposable to further non-complete summands. Moreover, each strictly
decreasing segment corresponds to an ordinal sum of complete summands, which
are representable or idempotent uninorms (in the case of idempotent uninorms their
characterizing multi-function should be strictly decreasing in this situation). Thus
by combination of decomposition into components based on idempotent points of
U and on maximal segments of its characterizing multi-function we can couple
corresponding idempotent points and obtain a full decomposition via the extended
ordinal sum construction (see [35]).
Recall that for a binary operation O : [0, 1]2 −→ [0, 1] its corresponding diagonal
function d O : [0, 1] −→ [0, 1] is given by d O (x) = O(x, x) for all x ∈ [0, 1]. As
we have seen above, in the structure of uninorms a major role is played by t-norms
and t-conorms, and this holds especially when we discuss areas around the diagonal.
Therefore in the investigation of uninorms with continuous diagonals we have to first
focus on t-norms (t-conorms) with continuous diagonals. As t-norms and t-conorms
are dual all results on t-norms with continuous diagonals can be immediately obtained
also for t-conorms.
The first step in the study of t-norms with continuous diagonals was an open
problem from [43]: whether a t-norm with continuous diagonal function have to be
continuous. It is obvious that each continuous t-norm has a continuous diagonal and
the construction of all continuous t-norms with the given continuous diagonal was
described in [47] (see also [17, 27]). For the opposite problem a negative answer
Structure of Uninorms with Continuous Diagonal Functions 129
was given by Krause [20] (see also [19, 29, 44]). The Krause t-norm has a fractal-
like structure, it is Archimedean and continuous on the diagonal, however, it is not
right-continuous and it is not left-continuous on the border of the unit square.
Thus in general a t-norm with a continuous diagonal need not be continuous.
However, there are several results showing under which conditions such a t-norm is
continuous. We will start with Archimedean t-norms. In [37] continuous extensions
of t-norms known on [a, b]2 , for 0 ≤ a ≤ b ≤ 1, to the whole unit square were
investigated. From this paper we can obtain the following results:
(i) Let T be a t-norm continuous on [a, 1]2 , such that T (x, x) < x for all x ∈
[a, 1[. Then T is on [a, 1]2 conditionally cancellative, i.e., T (x, y) = T (x, z) >
T (a, a) implies y = z for all x, y, z ∈ [a, 1].
(ii) Let T be an Archimedean t-norm continuous and cancellative on [a, 1]2 , such
that T has a continuous diagonal. Then T is continuous.
(iii) Let T be an Archimedean t-norm continuous and conditionally cancellative,
but not cancellative, on [a, 1]2 . Then T is continuous.
Thus we see that for an Archimedean t-norm the continuity on [a, 1]2 for arbitrary
0 < a < 1 and continuity of the diagonal implies continuity on the whole unit square.
Moreover, such a t-norm is uniquely determined by its values on [a, 1]2 and its
diagonal function. Note that if we relax the Archimedean property then an ordinal
sum of the Krause t-norm on [0, a] for any 0 < a < 1 will yield a non-continuous
t-norm with continuous diagonal which is continuous on [a, 1]2 . Further, in the
following example we will see that the continuity of the diagonal is not implied by
the Archimedean property and continuity on [a, 1]2 .
From the above result we see that, for example, if for a uninorm U the TU is the
Krause t-norm and SU is the t-conorm dual to the Krause t-norm then there are no
such a, b ∈ [0, 1], 0 ≤ a < e < b ≤ 1, that U it is continuous on [a, e]2 and [e, b]2 .
130 A. Mesiarová-Zemánková
In general, we have the following result for both t-norms and t-conorms.
Corollary 2 ([36]) A t-norm T : [0, 1]2 −→ [0, 1] (t-conorm S : [0, 1]2 −→ [0, 1])
is continuous if and only if it is continuous on the diagonal and for each idempotent
point x ∈ ]0, 1] (x ∈ [0, 1[) there exists a δx > 0 such that T (S) is continuous in
(x, y) for all y ∈ [x − δx , x] (y ∈ [x, x + δx ]).
From this corollary we can conclude that a uninorm U : [0, 1]2 −→ [0, 1] has
continuous underlying functions if and only if it is continuous on the diagonal and
for each idempotent point x ∈ ]0, 1[ there exists a δx > 0 such that if x ≤ e then U
is continuous in (x, y) for all y ∈ [x − δx , x], and if y ≥ e then U is continuous in
(x, y) for all y ∈ [x, x + δx ]. For more details we recommend [36]. If we again take a
uninorm U such that TU is the Krause t-norm and SU is the t-conorm dual to the Krause
t-norm then there is no such δ > 0 that U is continuous on {e} × [e − δ, e + δ].
In this subsection we will first focus on the case when we have no information about
the continuity of the diagonal of the given uninorm. In this case let us recall a result
from [13] which shows that if a cancellative Archimedean t-norm is (left-)continuous
in the point (1, 1) then it is isomorphic with the product t-norm, i.e., it is continuous
and strict. This means that if a uninorm U is cancellative on ]0, e]2 ∪ [e, 1[2 and
continuous in (e, e) then U has continuous underlying functions and thus it has a
structure described in the previous section.
Moreover, in [36] it was shown that an Archimedean t-norm which is border-
continuous (i.e., each point from the boundary of the unit square is the point of
continuity of T ) is also continuous. This means that an Archimedean uninorm which
is left-continuous in all points from [0, e] × {e} and right-continuous in all points
from [e, 1] × {e} has continuous underlying functions.
From these two results we see that the Archimedean property can ‘spread’ the
continuity from some part of the unit square to the whole unit square. For both
results, the Archimedean property of the corresponding uninorm is crucial. In the first
case recall an example of a non-Archimedean,
left-continuous, cancellative t-norm
([28, 45]) which is given by T (x, y) = z = 1
2z i
for all (x, y) ∈ ]0, 1]2 , where
1
1 i∈N
x= 2 xi
, y= 2 yi
, and {xi }i∈N , {yi }i∈N , {z i }i∈N are increasing sequences of
i∈N i∈N
natural numbers, and z i = xi + yi − i. For the second result it is enough to take an
ordinal sum of the Krause t-norm on [0, a] for some a ∈ ]0, 1[. Then this ordinal
sum is border continuous, non-Archimedean and non-continuous.
In the final part of this chapter we will focus on uninorms that are continuous on
[0, e[2 ∪ ]e, 1]2 . Assume such a uninorm U and focus on TU (similar observations
can be obtained for SU by duality). We will now show that TU can be obtained
Structure of Uninorms with Continuous Diagonal Functions 131
from a continuous t-subnorm ([16]) by redefining its values on the border of the
unit square. First, let us recall that a binary operation M : [0, 1]2 −→ [0, 1] is a
t-subnorm if it is commutative, associative, non-decreasing in both variables and
M(x, y) ≤ min(x, y) for all (x, y) ∈ [0, 1]2 . From each t-subnorm we can obtain a
t-norm T given by T (x, y) = M(x, y) for all (x, y) ∈ [0, 1[2 , T (x, y) = min(x, y)
otherwise. This process is called lifting of a t-subnorm to a t-norm. Then T = M
if and only if M is a t-norm. Vice versa in [15] a border-continuous projection
MT : [0, 1]2 −→ [0, 1] of a t-norm was defined by
T (x, y) if (x, y) ∈ [0, 1[2 ,
MT (x, y) = − −
T (x , y ) if max(x, y) = 1.
The idea of this border-continuous projection was to obtain a reverse process to the
lifting of a t-subnorm to a t-norm. However, such a border-continuous projection
need not to be monotone.
Example 4 Let T : [0, 1]2 −→ [0, 1] be given by
⎧
⎪
⎨min(x, y) if max(x, y) = 1,
T (x, y) = 23 (x + y) − 5
if (x, y) ∈ [ 21 , 1]2 , max(x, y) < 1,
⎪
⎩
6
0 otherwise.
Then MT ( 21 , 1) = 0 and MT ( 21 , 78 ) = 1
12
.
Thus the proper definition of a border-continuous projection is rather MT (x, y) =
T (x, y) if (x, y) ∈ [0, 1[2 , MT (x, y) = T (x − , y) if x = 1, y < 1, MT (x, y) =
T (x, y − ) if y = 1, x < 1, and MT (x, y) = T (x − , y − ) if x = y = 1. In this case
a border-continuous projection of a t-norm is commutative, bounded by minimum
and non-decreasing in both variables. Note that a border-continuous projection is
not border-continuous in the sense that each point from the border of the unit square
is a point of continuity of MT , but in the sense that MT (x, 1− ) = MT (x, 1), while
the function MT (·, 1) can be non-continuous. However, if we take the t-norm from
Example 4 then MT is not associative, i.e., not a t-subnorm. Indeed, in this case
2
(x + y) − 5
if (x, y) ∈ [ 21 , 1]2 ,
MT (x, y) = 3 6
0 otherwise,
(i) for all x, y ∈ [0, 1[ either T (u 0 , x) = lim − T (u, x) for some u 0 ∈ [0, 1[, or
u−→1
T (a, y) = lim − T (v, y), where a = lim − T (u, x),
v−→a u−→1
(ii) either lim − T (u, u) = 1, or T (u 0 , v0 ) = lim − T (u, u) for some u 0 , v0 ∈ [0, 1[,
u−→1 u−→1
or for all x ∈ [0, 1[ there is T (b, x) = lim − T (v, x), where b = lim − T (u, u).
v−→b u−→1
Proposition 5 ([39]) Let U : [0, 1]2 −→ [0, 1] be a uninorm with neutral ele-
ment e ∈ ]0, 1[, such that MTU (MCU ) is a continuous cancellative t-subnorm
(t-superconorm). Then U has one of the seven forms from Theorem 5.
Acknowledgments This work was supported by grants VEGA 2/0049/14, APVV-0178-11 and
Program Fellowship of SAS.
References
1. Aczél, J.: Lectures on Functional Equations and their Applications. Academic Press, New York
(1966)
2. Akella, P.: Structure of n-uninorms. Fuzzy Sets Syst. 158, 1631–1651 (2007)
3. De Baets, B.: Idempotent uninorms. Eur. J. Oper. Res. 118, 631–642 (1998)
4. De Baets, B., Fodor, J., Ruiz, D., Torrens, J.: Idempotent uninorms on finite ordinal scales. Int.
J. Uncertain. Fuzziness, Knowl.-Based Syst. 107, 1–14 (2009)
5. Clifford, A.H.: Naturally totally ordered commutative semigroups. Am. J. Math. 76, 631–646
(1954)
6. Dombi, J.: A general class of fuzzy operators, the De Morgan class of fuzzy operators and
fuzziness measures induced by fuzzy operator. Fuzzy Sets Syst. 8, 149–163 (1982)
7. Drewniak, J., Drygaś, P.: On a class of uninorms. Int. J. Uncertain. Fuzziness, Knowl.-Based
Syst. 10, 5–10 (2002)
8. Drygaś, P.: Discussion of the structure of uninorms. Kybernetika 41, 213–226 (2005)
9. Drygaś, P.: On monotonic operations which are locally internal on some subset of their domain.
Proc. EUSFLAT Conf. 2, 185–191 (2007)
134 A. Mesiarová-Zemánková
10. Drygaś, P.: On properties of uninorms with underlying t-norm and t-conorm given as ordinal
sums. Fuzzy Sets Syst. 161, 149–157 (2010)
11. Fodor, J., De Baets, B.: A single-point characterization of representable uninorms. Fuzzy Sets
Syst. 202, 89–99 (2012)
12. Fodor, J., Yager, R.R., Rybalov, A.: Structure of uninorms. Int. J. Uncertain. Fuzziness, Knowl.-
Based Syst. 5, 411–427 (1997)
13. Hájek, P.: Observations on the monoidal t-norm logic. Fuzzy Sets Syst. 132, 107–112 (2002)
14. Hu, S., Li, Z.: The structure of continuous uninorms. Fuzzy Sets Syst. 124, 43–52 (2001)
15. Jayaram, B., Baczyński, M., Mesiar, R.: R-implications and the exchange principle: a complete
characterization. In: Galichet, S., Montero, J., Mauris, G. (eds.) Proceedings of EUSFLAT-2011
and LFA-2011, pp. 223–229. Aix-les-Bains, France (2011)
16. Jenei, S.: A note on the ordinal sum theorem and its consequence for the construction of
triangular norms. Fuzzy Sets Syst. 126, 199–205 (2002)
17. Kimberling, C.: On a class of associative functions. Publ. Math. Debrecen 20, 21–39 (1973)
18. Klement, E.P., Mesiar, R., Pap, E.: On the relationship of associative compensatory operators to
triangular norms and conorms. Int. J. Uncertainty, Fuzziness, Knowl.-Based Syst. 4, 129–144
(1996)
19. Klement, E.P., Mesiar, R., Pap, E.: Triangular Norms. Kluwer Academic Publishers, Dordrecht
(2000)
20. Krause, G.M.: The Devil’s Terraces: a Discontinuous Associative Function, personal commu-
nication (2015)
21. Li, G., Liu, H.W., Fodor, J.: Single-point characterization of uninorms with nilpotent underlying
t-norm and t-conorm. Int. J. Unc. Fuzz. Knowl. Based Syst. 22, 591–604 (2014)
22. Li, Y.M., Shi, Z.K.: Remarks on uninorm aggregation operators. Fuzzy Sets Syst. 114, 377–380
(2000)
23. Liu, H.W.: Semi-uninorms and implications on a complete lattice. Fuzzy Sets Syst. 191, 72–82
(2012)
24. Martín, J., Mayor, G., Torrens, J.: On locally internal monotonic operations. Fuzzy Sets Syst.
137, 27–42 (2003)
25. Mas, M., Mayor, G., Torrens, J.: t-operators and uninorms on a finite totally ordered set. Int. J.
Intell. Syst. 14, 909–922 (1999)
26. Mas, M., Monserrat, M., Torrens, J.: On left and right uninorms. Int. J. Uncertainty, Fuzziness
Knowl.-Based Syst. 9(4), 491–507 (2001)
27. Mesiar, R., Navara, M.: Diagonals of continuous triangular norms. Fuzzy Sets Syst. 104, 35–41
(1999)
28. Mesiar, R.: Triangular norms—an overview. In: Reusch, B., Temme, K.H. (eds.) Computational
Intelligence in Theory and Practice, pp. 35–54. Physica-Verlag, Heidelberg (2001)
29. Mesiarová, A.: Wild T-norms. J. Electr. Eng. 12/s, 36–40 (2000)
30. Mesiarová, A.: Continuous triangular subnorms. Fuzzy Sets Syst. 142, 75–83 (2004)
31. Mesiarová-Zemánková, A.: Multi-polar t-conorms and uninorms. Inf. Sci. 301, 227–240 (2015)
32. Mesiarová-Zemánková, A.: Ordinal sum of uninorms and generalized uninorms, Int. J. Approx-
imate Reasoning, under Rev. (2015)
33. Mesiarová-Zemánková, A.: Ordinal sums of representable uninorms, Fuzzy Sets Syst., under
Rev. (2015)
34. Mesiarová-Zemánková., A.: Characterization of uninorms with continuous underlying t-norm
and t-conorm by their set of discontinuity points. IEEE Trans. Fuzzy Syst., under Rev. (2015)
35. Mesiarová-Zemánková., A.: Characterization of uninorms with continuous underlying t-norm
and t-conorm by means of the ordinal sum construction. Int. J. Approximate Reasoning, under
Rev. (2015)
36. Mesiarová-Zemánková., A.: T-norms and t-conorms continuous around diagonals. Fuzzy Sets
Syst., under Rev. (2015)
37. Mesiarová-Zemánková A.: Continuous completions of triangular norms known on a subregion
of the unit interval. Fuzzy Sets Syst., under Rev. (2015)
Structure of Uninorms with Continuous Diagonal Functions 135
Abstract In this work, we make a review of the concepts of overlap and grouping
functions, mainly from a theoretical point of view. In particular, we summarize some
of the most relevant works that have been published in recent years about this topic.
1 Introduction
Rule R j : If x1 is
A j1 and . . . and xn is
A jn
(1)
then Class = C j with RW j ,
An (μ
A j1 (x p1 ), . . . , μ
A jn (x pn ))
2 Preliminaries
We start recalling some concepts and results which are well-known, in order to fix
notations.We start with the concept of automorphism of the unit interval.
For the particular case of bivariate aggregation functions we remind the following
definitions.
A particular type of continuous t-norms are Archimedean t-norms (see, e.g., [23]).
It is possible to classify t-norms using the notion of ordinal sum, that we recall now.
Theorem 2 Assume that T is a continuous t-norm. Then, one of the following three
cases is valid for T :
1. T (x, y) = min(x, y);
2. T is Archimedean;
3. there exists a family {([am , bm ], Tm )} such that T is the ordinal sum of this family
and each Tm is a continuous Archimedean t-norm.
A general study on t-norms and their properties may be found, for instance, in
[1, 15, 16, 23, 29].
3 Overlap Functions
In this section, we mainly follows the developments in [9]. As we have already said,
overlap functions are a particular instance of bivariate aggregation functions. The
formal definition reads as follows.
There are many possible examples of overlap functions. For instance, G O (x, y) =
min(x, y) or G O (x, y) = x p y p for p > 0. Note that, in the latter example the result-
ing overlap function is not associative unless p = 1, whereas, if p = 1, we recover
the product, which is also a t-norm (as it is also the case of the minimum).
Let’s denote by O the set of all overlap functions. Then:
The lattice (O, ≤O ) is not complete (no top neither bottom elements, for example).
On the other hand, it is closed with respect to appropriate aggregation functions. In
particular, it holds that:
Theorem 5 The mapping G O : [0, 1]2 → [0, 1] is an overlap function if and only if
f (x, y)
G O (x, y) =
f (x, y) + h(x, y)
The rational expressions which are provided in Theorem 5 need not be unique.
However, the lack of uniqueness allows us to recover a whole family of overlap
functions as follows.
Corollary 3 Let f and h be two functions in the setting of the previous theorem.
Then, for k1 , k2 ∈]0, ∞[, the mappings
f k1 (x, y)
G kO1 ,k2 (x, y) =
f k1 (x,y) + h k2 (x, y)
f 1 (x, y) f 2 (x, y)
G O (x, y) = =
f 1 (x, y) + h 1 (x, y) f 2 (x, y) + h 2 (x, y)
The Notions of Overlap and Grouping Functions 143
for any x, y ∈ [0, 1] and let M be a bivariate continuous aggregation function that
is homogeneous of order one. Then, if we define f (x, y) = M( f 1 (x, y), f 2 (x, y))
and h(x, y) = M(h 1 (x, y), h 2 (x, y)) it also holds that
f (x, y)
G O (x, y) = .
f (x, y) + h(x, y)
2. In the construction of the following overlap function we take the product and the
Lukasiewicz t-norms (see p. 84 of [23]). Nevertheless, in this overlap function
we do not consider any interval of the type [0, bm ].
⎧
⎨ 0.1 + 2.5(x − 0.1)(y − 0.1) if (x, y) ∈ [0.1, 0.5]2
G O (x, y) = 0.7 + max(x + y − 1.6, 0) if (x, y) ∈ [0.7, 0.9]2
⎩
min(x, y) otherwise
3. The following t-norm satisfies all the properties required to overlap func-
tions, except (G O 2). This is due to the fact that in [0, 0.25]2 we consider the
Lukasiewicz t-norm which is continuous and Archimedean but not strict.
max(x + y − 0.25, 0) if (x, y) ∈ [0, 0.25]2
T (x, y) =
min(x, y) otherwise
144 H. Bustince et al.
As we have already discussed, there exists a close relation between overlap functions
and other relevant classes of operators such as t-norms. Moreover, in the same way
as it is done for the latter, it is possible to study the use additive generators for overlap
functions [13].
We start recalling the notion of pseudo-inverse of a given function.
Definition 11 [31] Let f : [a, b] → [c, d] be an increasing or decreasing function.1
function f (−1) : [c, d] → [a, b] defined by
⎧
⎨ sup{x ∈ [a, b] | f (x) < y} if f (a) < f (b),
f (−1) (y) = sup{x ∈ [a, b] | f (x) > y} if f (a) > f (b), (2)
⎩
a if f (a) = f (b)
1 Inthis paper, an increasing (decreasing) function does not need to be strictly increasing (decreas-
ing).
The Notions of Overlap and Grouping Functions 145
is an overlap function.
is an overlap function.
(θ, ϑ) is called an additive generator pair of the overlap function Oθ,ϑ , and Oθ,ϑ
is said to be additively generated by the pair (θ, ϑ).
Example 3 Consider the functions θ : [0, 1] → [0, ∞] and ϑ : [0, ∞] → [0, 1],
defined, respectively by:
−2 ln x if x = 0
θ (x) =
∞ if x = 0
and
e−x if x = ∞
ϑ(x) =
0 if x = ∞,
which are continuous and decreasing functions, satisfying the conditions 1–4 of
Corollary 6. Then, whenever x = 0 and y = 0, one has that:
Oθ,ϑ (x, y) = x 2 y 2 ,
and so we recover ae non associative overlap function for which 1 is not a neutral
element.
Notice that whenever T is a positive continuous t-norm (that is, an overlap func-
tion) that is additively generated by a function t : [0, 1] → [0, ∞], then it is also
additively generated by a pair (θ, ϑ) in the sense of Theorem 9, where θ = t and
ϑ = t (−1) , and vice-versa.
A more detailed study on additive generators for overlap functions can be
found in [13].
Now let’s recall the definition of pseudo-automorphism.
Oϕ,T is called the overlap obtained by the distortion of the t-norm T by the
automorphism ϕ, or the overlap function obtained by a (ϕ, T )-distortion.
However, it is important to remark that not every overlap is obtained as a distortion
of a t-norm.
Proposition 3 There exists an overlap function G O : [0, 1]2 → [0, 1] which is not
a (F , T )-distortion of a positive and continuous t-norm T : [0, 1]2 → [0, 1] for
any automorphism F .
In the same way, it is also true that not every overlap function G O : [0, 1]2 →
[0, 1] can be obtained by means of a (F , T )-distortion, for some pseudo-automor-
phism F : [0, 1] → [0, 1] and some positive and continuous t-norm T : [0, 1]2 →
[0, 1].
148 H. Bustince et al.
The notion of grouping function [10] arises in the applied field when it is necessary
to measure up to what extent a given element belongs to at least one of two given
classes. The formal definition reads as follows.
There exists a close link between overlap and grouping functions. In fact, both
concepts are dual to each other.
Definition 14 [22] Let G be an overlap function (resp. a grouping function) and let
n 1 and n 2 be two continuous negation operators such that:
1. n 1 (x) = 0 (resp. n 2 (x) = 0) if and only if x = 1, and
2. n 1 (x) = 1 (resp. n 2 (x) = 1) if and only if x = 0
Then, the operator
n 1 ,n 2
G (x, y) = n 1 (G(n 2 (x), n 2 (y)))
In fact, every grouping function can be obtained as the dual through a strong
negation of an overlap function. Besides, note that this duality implies that many of
the results we have discussed for overlap functions can be translated straightforwardly
for grouping functions just making use of duality. So, in some sense, it seems that
overlap and grouping functions are related in the same way as t-norms are t-conorms
are. However, the drop of associativity in the definition of both overlap and grouping
functions gives raise to dramatic differences from the case of t-norms and t-conorms.
In particular, it does not hold that overlap functions are always smaller than or equal
to grouping functions.
√ √
Example 4 Consider the overlap function given by G O (x, y) = min( x, y) and
the grouping function given by G G (x, y) = max(x 2 , y 2 ). Then:
√ √
G O (0.5, 0.25) = min( 0.5, 0.25) > max((0.5)2 , (0.25)2 ) = G G (0.5, 0.25)
So in this case G O G G .
The Notions of Overlap and Grouping Functions 149
Let Ω denote the set of mappings satisfying the four properties given in
Proposition 4. Then
Besides, the choice of a bounding function imposes some restrictions to the size
of the corresponding overlap and grouping functions.
Proposition 8 Let f ∈ Ω, then
1. the operator G O (x, y) = f (min(x, y)) is the greatest f -bound overlap and
2. the operator G G (x, y) = f (max(x, y)) is the least f -bound grouping.
It is relevant to note that there is neither least element in O f nor greatest ele-
ment in G f .
Proposition 9 Let f ∈ Ω, then the infimum of O f is:
⎧
⎨ f (x) if y = 1
G O (x, y) = f (y) if x = 1
⎩
G O ∈O f 0 Otherwise
Corollary 10 Let f ∈ Ω. Then the set O f (resp. G f ) does not have a least element
(resp. a greatest element).
Another interesting property of O f and G f is that they are dense as posets.
Proposition 10 Let G 1 and G 2 be two overlaps in O f (resp. two groupings in
G f ) such that G 1 < G 2 . Then there exists G ∈ O f (resp. G ∈ G f ) such that G 1 <
G < G2.
The following definition establishes a restriction on negations used in the dual
construction in order to maintain the f -bound condition.
Let’s recall now the notion of f -duality.
Definition 16 Let G be an overlap (resp. grouping), f ∈ Ω an automorphism on
the unit interval (i.e., a bijective mapping), n a bijective negation2 and let us consider
the negations:
• n 1 (x) = f (n(x)) and
• n 2 (x) = f −1 (n −1 (x)),
Then the operator
n, f
G (x, y) = n 1 G(n 2 (x), n 2 (y))
From Proposition 11, we can ensure that any f-bound overlap functions is less
than or equal to any of its f-dual grouping functions.
n, f
Corollary 11 Let f ∈ Ω, G O ∈ O f and let G O be an f -dual grouping function
n, f
of G O . Then the inequality G O ≤ G O holds.
And reciprocally:
n, f
Corollary 12 Let f ∈ Ω, G G ∈ G f and let G G be an f -dual overlap function
n, f
of G G . Then the inequality G G ≤ G G holds.
In [25] this study is made deeper by considering also the so-called f -diagonal
overlap and grouping functions, that is, overlap and grouping functions whose values
at the diagonal are given by a fixed function f .
Since overlap functions are not assumed to be associative, its extensions to the n-
dimensional case with n > 2 requires of a specific definition. From now on, we
follow the developments in [18].
Note that, taking into account this definition, an object c that belongs to three
classes C1 , C2 and C3 with degrees x1 = 1, x2 = 1 and x3 = 0.3 will not have the
maximum degree of overlap since condition (3) of the previous definition is not
satisfied. Even more, if the degrees are x1 = 1, x2 = 1 and x3 = 0, from the second
condition we will conclude that the n-dimensional degree of overlapping of this
object into the classification system given by the classes C1 , C2 and C3 will be zero.
This is the reason why this first extension of the original idea of overlap proposed
in [9] has been called n-dimensional overlap. Let us observe that this definition is
closely related with the idea of intersection of n classes.
152 H. Bustince et al.
Example 5 It is easy to see that the following aggregation functions are n-dimensio-
nal overlap functions:
p
p
1. The minimum powered by p. G O (x1 , . . . , xn ) = min {xi } = min {xi } with
1≤i≤n 1≤i≤n
p > 0.
n
1
2. The geometric mean. G O (x1 , . . . , xn ) = ( xi ) n .
i=1 n
xi
3. The Einstein product aggregation operator. E P(x1 , . . . , xn ) = ni=1
1 + i=1 (1 − xi )
π
n
4. The sinus induced overlap G O (x1 , . . . , xn ) = sin ( xi ) p with p > 0.
2 i=1
Remark 1 Notice that, since any averaging aggregation function M being symmetric
and continuous satisfies the conditions of the previous Theorem, it is possible to
conclude that any continuous symmetric averaging aggregation of n-dimensional
overlap functions is also an n-dimensional overlap function.
Again, some particular continuous t-conorms (their n-ary forms) and their convex
combinations are prototypical examples of n-ary grouping functions.
7 Conclusions
In this chapter we have made a review of the main definitions and results related
to overlap and grouping functions. These functions have shown themselves very
useful in applications where associativity is not a natural requirement. Although a
detailed explanation of such applications would require a too large amount of space,
it is worth to mention the use of overlap functions in decision making to define
preference structures [10], in classification to get generalizations of fuzzy rule based
algorithms which improve the results of those based on the use t-norms [28], or
in image processing [22]. In all these cases it is worth to mention that classical
algorithms may be improved with the use of these overlap and grouping functions.
References
1. Alsina, C., Frank, M.J., Schweizer, B.: Associative Functions. Triangular Norms and Copulas.
World Scientific, Hackensack (2006)
2. Amo, A., Montero, J., Molina, E.: Representation of consistent recursive rules. Eur. J. Oper.
Res. 130, 29–53 (2001)
3. Amo, A., Montero, J., Biging, G., Cutello, V.: Fuzzy classification systems. Eur. J. Oper. Res.
156, 459–507 (2004)
4. Bedregal, B., Dimuro, G.P., Bustince, H., Barrenechea, E.: New results on overlap and grouping
functions. Inf. Sci. 249, 148–170 (2013)
5. Bustince, H., Barrenechea, E., Pagola, M., et al.: Weak fuzzy S-subsethood measures. Overlap
index. Int. J. Uncertainty Fuzziness Knowl.-Based Syst. 14(5), 537–560 (2006)
6. Bustince, H., Mohedano, V., Barrenechea, E., et al.: Definition and construction of fuzzy DI-
subsethood measures. Inf. Sci. 176(21), 3190–3231 (2006)
7. Bustince, H., Pagola, M., Barrenechea, E.: Construction of fuzzy indices from fuzzy DI-
subsethood measures: application to the global comparison of images. Inf. Sci. 177(3), 906–929
(2007)
8. Bustince, H., Montero, J., Barrenechea, E., Pagola, M.: Semiautoduality in a restricted family
of aggregation operators. Fuzzy Sets Syst. 158(12), 1360–1377 (2007)
9. Bustince, H., Fernandez, J., Mesiar, R., Montero, J., Orduna, R.: Overlap functions. Nonlin.
Anal.-Theory Meth. Appl. 72, 1488–1499 (2010)
10. Bustince, H., Pagola, M., Mesiar, R., Hullermeier, E., Herrera, F., Montero, J., Orduna, R.:
Grouping, overlap, and generalized bientropic functions for fuzzy modeling of pairwise com-
parisons. IEEE Trans. Fuzzy Syst. 20, 405–415 (2012)
11. Calvo, T., Kolesárová, A., Komorníkova, M., Mesiar, R.: Aggregation operators: properties,
classes and construction methods. In: Aggregation Operators New Trends and Applications.
Physica-Verlag, Heidelberg (2002)
12. Cutello, V., Montero, J.: Recursive connective rules. Int. J. Intell. Syst. 14, 3–20 (1999)
13. Dimuro, G.P., Bedregal, B., Bustince, H., Asiaín, M.J., Mesiar, R.: On additive generators of
overlap functions. Fuzzy Sets Syst. (in press) doi:10.1016/j.fss.2015.02.008
14. Dubois, D., Koning, J.L.: Social choice axioms for fuzzy set aggregation. Fuzzy Sets Syst. 58,
339–342 (1991)
15. Dubois, D., Ostasiewicz, W., Prade, H.: Fuzzy Sets: History and Basic Notions. In: Fundamen-
tals of Fuzzy Sets. Kluwer, Boston (2000)
16. Fodor J., Roubens M., Fuzzy preference modelling and multicriteria decision support. In:
Theory and Decision Library, Kluwer Academic Publishers (1994)
17. Gómez, D., Montero, J.: A discussion on aggregation operators. Kybernetika 40, 107–120
(2004)
18. Gómez, D., Rodríguez, J.T., Montero, J., Bustince, H., Barrenechea, E.: n-dimensional overlap
functions. Fuzzy Sets Syst. (in press). doi:10.1016/j.fss.2014.11.023
19. Hosseini, M.S., Eftekhari-Moghadam, A.M.: Fuzzy rule-based reasoning approach for event
detection and annotation of broadcast soccer video. Appl. Soft Comput. 13(2), 846–866 (2013)
20. Ishibuchi, H., Nakashima, T., Nii, M.: Classification and Modeling with Linguistic Information
Granules: Advanced Approaches to Linguistic Data Mining. Springer, Berlin (2004)
21. Ishibuchi, H., Yamamoto, T.: Rule weight specification in fuzzy rule-based classification sys-
tems. IEEE Trans. Fuzzy Syst. 13, 428–435 (2005)
22. Jurio, A., Bustince, H., Pagola, M., Pradera, A., Yager, R.: Some properties of overlap and
grouping functions and their application to image thresholding. Fuzzy Sets Syst. 229, 69–90
(2013)
23. Klement, E.P., Mesiar, R., Pap, E.: Triangular Norms, Trends in Logic, Studia Logica Library,
vol. 8. Kluwer Academic Publishers, Dordrecht (2000)
24. Klir, G.J., Folger, T.A.: Fuzzy Sets. Uncertainty and Information, Prentice Hall, Englewood
Cliffs (1988)
156 H. Bustince et al.
25. Madrid, N., Burusco, A., Bustince, H., Fernandez, J., Perfilieva, I.: Upper bounding overlaps
by groupings. Fuzzy Sets Syst. 264, 76–99 (2015)
26. Montero, J., Gomez, D., Bustince, H.: On the relevance of some families of fuzzy sets. Fuzzy
Sets Syst. 158, 2429–2442 (2007)
27. Nelsen, R.B.: An introduction to Copulas. Lecture Notes in Statistics, vol. 139. Springer, New
York (1999)
28. Sanz, J., Galar, M., Jurio, A., Brugos, A., Pagola, M., Bustince, H.: Medical diagnosis of
cardiovascular diseases using an interval-valued fuzzy rule-based classification system. Appl.
Soft Comput. J. 20, 103–111 (2014)
29. Schweizer, B., Sklar, A.: Probabilistic Metric Spaces. North-Holland, Amsterdam (1983)
30. Viceník, P.: Additive generators of non-continuous triangular norms, Topological an Algebraic
Structures in Fuzzy Sets. Kluwer (2003)
31. Viceník, P.: Additive generators of associative functions. Fuzzy Sets Syst. 153, 137–160 (2005)
32. Zadeh, L.A.: Fuzzy sets as a basis for a theory of possibility. Fuzzy Sets Syst. 1, 3–28 (1978)
Asymmetric Copulas and Their Application
in Design of Experiments
1 Introduction
The problem of describing relationships among random variables has attracted a lot
of attention during the years, especially for inference and prediction. Nowadays, it
can benefit of the use of copulas, which provide a convenient tool to construct and
estimate a multivariate stochastic model. In fact, following a copula approach, we
may determine the joint probability law of a random vector X = (X 1 , . . . , X d ) in two
steps: first, we fix the marginal behavior of each component X i , then we describe the
relationships among the X i ’s by means of a suitable copula, which is a multivariate
distribution function whose univariate marginals are uniformly distributed on [0, 1].
For more details about copulas, see, for instance, [28, 41, 55, 62].
In order to determine possible models for a random vector X, it is then important
to have at disposal a variety of copulas that may cover different situations that arise
in practice. To this end, several investigations in the literature considered the pro-
blem of determining families of copulas and studying their properties with particular
emphasis on their range of association (as measured by Spearman’s correlation coef-
F. Durante
Faculty of Economics and Management, Free University of Bozen–Bolzano, Bolzano, Italy
e-mail: [email protected]
E. Perrone (B)
Institute for Applied Statistics, Johannes Kepler University Linz, Linz, Austria
e-mail: [email protected]
ficient and Kendall’s tau, for instance), tail dependence and so on. For an overview
on different constructions, we refer the reader to [41, 55].
Interestingly, many of these constructions overlap with families of triangular
norms (see, for instance, [46, 62]). In fact, in dimension d = 2, copulas can be seen
as binary operations on [0, 1] that are supermodular and have neutral element 1. In
particular, when they are associative and commutative, they form an interesting class
of triangular norms, known as Archimedean copulas (see, for instance, [33, 53]).
Archimedean copulas have become quite popular in applications because of sev-
eral interesting properties that make them tractable especially for inferential pur-
poses (see, for instance, [35]). However, their main practical limitation is that they
are exchangeable, i.e. the values assumed by the copula does not change under any
permutation of its arguments.
To get more flexibility, a popular approach is to consider (multivariate) models that
exploit bivariate Archimedean (or, generally, exchangeable) copulas for determining
the marginal or conditional dependence and, hence, combining such pieces into
an high–dimension framework. Approaches of such type include vine–pair copula
constructions [7], factor copula models [20, 49, 52] or nested Archimedean copulas.
Another possibility is to find methods for providing constructions that, starting
with a given (exchangeable) copula C, produce another copula C (usually, with an
additional number of parameters) that need not be exchangeable. See, for instance,
Khoudraji’s device [31, 43] and related extensions [11, 50, 51].
It should also be stressed that some classes of (semi–)parametric bivariate copulas
can explicitly handle the non–exchangeable case. Consider, for instance, extreme–
value and Archimax copulas [5, 6, 47], Liouville copulas [54], copulas that are
invariant under (univariate) truncation [17, 18], asymmetric semilinear copulas [8]
among others.
Methods and tools for coping with non–exchangeable (asymmetric) copulas have
become quite popular in the recent years; see, for instance, the excellent overview
provided in [36]. In this contribution, we shortly review some of the recent litera-
ture on this topic by emphasizing earlier contributions provided in [45, 56], which
have suggested possible ways to quantify non–exchangeability in copula models.
Then, we consider a novel application of (asymmetric) copulas in optimal designs of
experiments and show how their use may provide a convenient tool in this framework
too.
Remark 1 In fuzzy logic (in a broad sense), instead, the adoption of a symmetric
function (e.g., triangular norm) as conjunctive operation is sometimes considered
quite strong. Therefore, several investigations have stressed the importance of intro-
ducing also asymmetric (i.e. non-commutative) conjunctions [3, 19, 30, 37, 42].
Graphically, exchangeability can be interpreted in the sense that the level sets
associated with an exchangeable copula C are symmetric with respect to the main
diagonal of the unit square (see Fig. 1). Analogously, the 3D plot of an exchangeable
copula (which is a surface) is symmetric with respect to the vertical plane passing
through the line {x = y}.
These graphical interpretations also suggest possible ways to transform an
exchangeable copula into another copula that does not share this property. Examples
are illustrated below.
1.0
0.9
0.8
0.8
0.7
0.6
0.6
0.5
v
0.4
0.4
0.3
0.2
0.2
0.1
0.0
0.0 0.2 0.4 0.6 0.8 1.0
u
measure μC coincides with μC on the Borel sets of [0, 1]2 \R, while the probability
mass is spread in a different way on R (see Fig. 2). The copula C obtained in this
way is, in general, non–exchangeable.
For instance, consider the independence copula Π2 (u, v) = uv and let a ∈ [0, 21 ].
Consider the rectangular patchwork
where a = 1 − a.
1 1
R TU
TL
0 1 0 1
Fig. 2 Graphical illustration of a rectangular (left) and diagonal (right) patchwork construction
Asymmetric Copulas and Their Application in Design of Experiments 161
where TL = {(u, v) ∈ [0, 1]2 : u ≥ v}, TU = [0, 1]2 \TL . This type of construction is
known as diagonal patchwork of two copulas (see also [25, 57]).
Remark 2 In previous examples, we start with a copula C = M2 , However, if C =
M2 it can be easily transformed into a non–exchangeable copula by means of a push–
forward of its induced measure under a suitable measure–preserving transformation.
For more details, see [27, 67].
Example 1 can be also used to check that, for every exchangeable copula C = M2 ,
a non–exchangeable copula C exists that is sufficiently close to C (in the L ∞ norm).
Instead, given a non–exchangeable copula, there is no general way to approximate
it via an exchangeable one.
At a more abstract level, the sets of exchangeable and non–exchangeable copulas
are quite different, as the following general results hold:
• Exchangeable copulas form a closed set in the class C of all copulas endowed
with the L ∞ norm, i.e. a sequence of exchangeable copulas uniformly converges
to an exchangeable copula. In particular, exchangeable copulas are not dense in
C , while non-exchangeable copulas are so.
• In some sense (namely, in the sense of Baire category), the class of exchangeable
copulas is a small set (i.e. a set of first category) in C endowed with the L ∞ norm,
while a typical copula is non-exchangeable (see [14] for more details).
These facts also provide additional motivations to consider in depth non–exchange-
able copulas.
3 Measures of Non–exchangeability
As noted in [56], “in a sense, the relationship between exchangeability and non–
exchangeability is analogous to the relationship between independence and depen-
dence” for random variables. In particular, this analogy could be exploited to
construct a measure of non–exchangeability, mimicking the axiomatic approach by
Rényi [61]. Following this viewpoint, Klement et al. [21] have introduced a set of
axioms for a measure of non–exchangeability for identically distributed and contin-
uous random variables, which turned out to depend only on the associated copula.
This definition is reported below.
162 F. Durante and E. Perrone
μ p (C p ) = K p and ∀C ∈ C μ p (C p ) ≥ μ p (C).
= 3a 2
μ+∞ (C) max |C1 (x, y) − x y| .
(x,y)∈[0,1]2
Maximum asymmetry for such a C is, hence, obtained when C1 belongs to {W, M}.
3a 2
For such a case, μ+∞ (C) = 4 .
Asymmetric Copulas and Their Application in Design of Experiments 163
1 1
2/3
1/3 1/3
Remark 3 In [4], the authors determined best possible lower and upper copulas C
under the constraint that, for a fixed t, μ∞ (C) = t. Maximally non–exchangeable
copulas in d dimensions (d ≥ 2) have been provided, instead, in [38]. An alternative
measure of non–exchangeability for copulas has been proposed in [64].
For more details, see [63]. It is easily shown that, for every C ∈ C ,
3
μ∞ (C) ≤ κ(C).
2
Moreover, if we consider other (qualitative) dependence notions like positive quad-
rant dependence, stochastic increasingness, etc., we may notice that:
• positive (resp. negative) dependence plays in favor of exchangeability;
• stronger positive (resp. negative) dependence implies smaller non–exchangeability.
Results of this type can be found, for instance, in [23, 24].
Various possible techniques for measuring asymmetry in copula models have been
also employed in developing statistical tests, as done for instance in [34, 36] (see
also [48, 60]), or to test special copula-based stochastic processes [2].
164 F. Durante and E. Perrone
N
N
x1 . . . x N
M(ξ, γ ) = wi m(xi , γ ), wi = 1 and ξ = .
w1 . . . w N
i=1 i=1
The approximate design theory is concerned with finding ξ ∗ (γ ) such that it maxi-
mizes some scalar function φ(M(ξ, γ )), i.e., the so-called design criterion. Here-
inafter, we consider only D-optimality, i.e., φ(M) = log det M, provided M is non-
singular.
The formulation of a Kiefer-Wolfowitz type equivalence relation (see [44]) is the
cornerstone of the theoretical foundation of optimal design. The following theorem
of such type is a generalized version of a result formulated in [39] and based on the
findings in [65]. A detailed proof of this result can be found in [58].
Theorem 2 For a local parameter vector (γ̄ ), the following properties are equiva-
lent:
• ξ ∗ is D-optimal;
• tr [M(ξ ∗ , γ̄ )−1 m(x, γ̄ )] ≤ (k + l), ∀x ∈ X ;
• ξ ∗ minimize max tr [M(ξ ∗ , γ̄ )−1 m(x, γ̄ )], over all ξ ∈ Ξ , where Ξ is the design
x∈X
space.
As it is now evident from previous definitions and results, the optimal designs
depend upon the trend model structure and the chosen copula. Furthermore, such
designs might also be influenced by the unknown parameter values for γ through
the induced nonlinearities (see [59]). Thence, we are resorting to localized designs
around the values γ̄ .
where π1 and π2 are the marginal probabilities of success and Cα is a given copula.
Following [39], we assume that
πi
log = βi1 + βi2 x, i = 1, 2 (4)
1 − πi
with x ∈ [0, 10]. As shown in [10], in such a model the Fisher information matrix
for a single observation can be written as:
∂p T 1 ∂p
m(x, γ ) = P −1 + eeT , (5)
∂γ 1 − p11 − p10 − p01 ∂γ
parameters related to the margins to β1 = (β11 , β12 ) with ‘localized’ initial values
β̄ 1 = [−1, 1].
We start with an exchangeable model, where the probability of success p11 is
given by
p11 = FY1 ,Y2 (π1 , π1 ; α1 ) = C(π1 , π1 ; α1 ),
with C being an exchangeable copula. The idea is to take into account a transforma-
tion of the copula C in order to obtain an asymmetric dependence for the random
vector (Y1 , Y2 ). In practice, this is obtained by bringing two additional parameters
into the model that may induce the asymmetry. Then a natural way how to inquire the
effect of the applied transformation is simply to compare the designs obtained for C
and those obtained for the asymmetric version of C, by calculating the corresponding
D-efficiencies.
The copula transformation we take into account consists of modifying a given
exchangeable copula C = Cα1 , with parameter α1 , into the copula C̃ = C̃α1 ,α2 ,α3
defined, for every (u, v) ∈ [0, 1]2 , by
(1 − α2 )(1 − α3 )
τ (C̃) ≤ =: τmax (C̃).
(1 − α2 ) + (1 − α3 ) − (1 − α2 )(1 − α3 )
It is worth to stress that, while in the symmetric model the estimated parameter vector
is (β11 , β12 , α1 ), in the asymmetric case we bring the focus on the asymmetrization
by considering as vector of the estimated parameters (β11 , β12 , α1 , α2 , α3 ).
In our example, the symmetric copula Cα1 is assumed belonging to the Clay-
ton family and the following scheme is carried on. For some values of Kendall’s τ
for the copula Cα1 of (3), we find the D-Efficiency of the optimal design ξ , which
corresponds to the exchangeable Clayton copula with the given τ , with respect to
the optimal design ξ̄ ∗ , which corresponds to the asymmetric model of Eq. (6) with
(suitable) parameters (α˜1 , α˜2 , α˜3 ) in order to ensure that τ (C) = τ (C̃). The losses
in D-efficiency (in percentage) are hence used to quantify the impact of the asym-
metrization on the design.
In this small numerical illustration we assume that τ ∈ {0.10, 0.25, 0.50}, while
(α˜2 , α˜3 ) may vary in {0, 0.2, 0.4, 0.6, 0.8} (without loss of generality, we consider
α2 > α3 ). However, it should be noted that some combinations of (α˜2 , α˜3 ) may not
provide the given values of Kendall’s measure of association.
Table 1 shows the losses in D-Efficiency for various localized parameter vectors.
Analyzing the results, one may notice that the losses are generally quite substantial.
Such high losses prove that the symmetric model and the asymmetric one provide
reasonably different optimal designs. An evidence of the difference between the two
168 F. Durante and E. Perrone
Table 1 Losses in D-efficiency (in percentage) between Clayton and asymmetric Clayton models
τ α1 (α˜2 , α˜3 ) α˜1 τmax (C̃(α˜1 ,α˜2 ,α˜3 ) ) Loss in D–efficiency (in %)
0.50 2.00 (0.2, 0.0) 3.5 0.80 43.99
0.25 0.66 (0.2, 0.0) 0.9 0.80 49.49
0.10 0.22 (0.2, 0.0) 0.3 0.80 55.70
0.50 2.00 (0.4, 0.0) 11 0.60 75.70
0.25 0.66 (0.4, 0.0) 1.5 0.60 51.74
0.10 0.22 (0.4, 0.0) 0.4 0.60 56.27
0.25 0.66 (0.6, 0.0) 3.6 0.40 63.80
0.10 0.22 (0.6, 0.0) 0.7 0.40 57.90
0.10 0.22 (0.8, 0.0) 2.2 0.20 67.46
0.25 0.66 (0.4, 0.2) 2.0 0.52 51.98
0.10 0.22 (0.4, 0.2) 0.5 0.52 56.52
0.25 0.66 (0.6, 0.2) 5.3 0.36 64.86
0.10 0.22 (0.6, 0.2) 0.9 0.36 58.07
0.10 0.22 (0.8, 0.2) 3.5 0.19 70.37
0.25 0.66 (0.6, 0.4) 9.5 0.31 65.83
0.10 0.22 (0.6, 0.4) 1.4 0.31 57.42
obtained designs can also be seen by simply comparing Fig. 4a or b with Fig. 5. In fact,
the optimal design found for the asymmetric model has three support points, while
the one related to the symmetric model just has two support points. Moreover, Table 1
also indicates that the losses depend upon the chosen transformation. As a matter
of fact, by looking at the values corresponding to the same fixed τ , a considerable
increase of loss relative to stronger asymmetries can be observed. This aspect is
also observable from Fig. 4, where the different geometry of two optimal designs
corresponding to two distinct asymmetrizations is shown.
(a) (b)
4 4
3 3
2 2
1 1
x x
Fig. 4 Sensitivity functions (continuous lines) and design weights (bars) for asymmetric Clayton
with (α˜1 , α˜2 , α˜3 ) = (11, 0.4, 0) (Fig. a) and (α˜1 , α˜2 , α˜3 ) = (3.5, 0.2, 0) (Fig. b), respectively
Asymmetric Copulas and Their Application in Design of Experiments 169
5 Conclusions
Acknowledgments This article is devoted to Prof. Erich Peter Klement on the occasion of his
retirement.
The first author has been supported by the Faculty of Economics and Management of Free University
of Bozen-Bolzano, Italy, via the project “Model Uncertainty and Dependence”.
The second author would like to thank Prof. Werner Müller, for his support and useful suggestions.
She was supported by the project ANR-2011-IS01-001-01 “DESIRE” and Austrian Science Fund
(FWF) I 883-N18.
References
1. Bacigal, T., Jágr, V., Mesiar, R.: Non-exchangeable random variables, Archimax copulas and
their fitting to real data. Kybernetika (Prague) 47(4), 519–531 (2011)
2. Beare, B.K., Seo, J.: Time irreversible copula-based Markov models. Econom. Theory 30(5),
923–960 (2014)
3. Běhounek, L., Bodenhofer, U., Cintula, P., Saminger-Platz, S., Sarkoci, P.: Graded dominance
and related graded properties of fuzzy connectives. Fuzzy Sets Syst. 262, 78–101 (2015)
4. Beliakov, G., De Baets, B., De Meyer, H., Nelsen, R.B., Úbeda-Flores, M.: Best-possible
bounds on the set of copulas with given degree of non-exchangeability. J. Math. Anal. Appl.
417(1), 451–468 (2014)
170 F. Durante and E. Perrone
5. Capéraà, P., Fougères, A.L., Genest, C.: Bivariate distributions with given extreme value attrac-
tor. J. Multivar. Anal. 72(1), 30–49 (2000)
6. Charpentier, A., Fougères, A.L., Genest, C., Nešlehová, J.G.: Multivariate Archimax copulas.
J. Multivar. Anal. 126, 118–136 (2014)
7. Czado, C.: Pair-copula constructions of multivariate copulas. In: Jaworski, P., Durante, F.,
Härdle, W., Rychlik, T. (eds.) Copula Theory and Its Applications. Lecture Notes in Statistics—
Proceedings, vol. 198, pp. 93–109. Springer, Berlin (2010)
8. De Baets, B., De Meyer, H., Mesiar, R.: Asymmetric semilinear copulas. Kybernetika (Prague)
43(2), 221–233 (2007)
9. Denman, N.G., McGree, J.M., Eccleston, J.A., Duffull, S.B.: Design of experiments for bivari-
ate binary responses modelled by Copula functions. Comput. Stat. Data Anal. 55(4), 1509–1520
(2011)
10. Dragalin, V., Fedorov, V.: Adaptive designs for dose-finding based on efficacytoxicity response.
J. Stat. Plan. Infer. 136(6), 1800–1823 (2006)
11. Durante, F.: Construction of non-exchangeable bivariate distribution functions. Stat. Pap. 50(2),
383–391 (2009)
12. Durante, F., Fernández-Sánchez, J.: On the classes of copulas and quasi-copulas with a given
diagonal section. Int. J. Uncertain. Fuzziness Knowl.-Based Syst. 19(1), 1–10 (2011)
13. Durante, F., Fernández-Sánchez, J., Sempi, C.: Multivariate patchwork copulas: a unified
approach with applications to partial comonotonicity. Insur. Math. Econ. 53, 897–905 (2013)
14. Durante, F., Fernández-Sánchez, J., Trutschnig, W.: Baire category results for exchangeable
copulas. Fuzzy Sets Syst. 284, 146–151 (2016)
15. Durante, F., Foscolo, E., Jaworski, P., Wang, H.: A spatial contagion measure for financial time
series. Expert Syst. Appl. 41(8), 4023–4034 (2014)
16. Durante, F., Jaworski, P.: Spatial contagion between financial markets: a copula-based approach.
Appl. Stoch. Models Bus. Ind. 26(5), 551–564 (2010)
17. Durante, F., Jaworski, P.: Invariant dependence structure under univariate truncation. Statistics
46(2), 263–277 (2012)
18. Durante, F., Jaworski, P., Mesiar, R.: Invariant dependence structures and Archimedean copulas.
Stat. Probab. Lett. 81(12), 1995–2003 (2011)
19. Durante, F., Klement, E., Mesiar, R., Sempi, C.: Conjunctors and their residual implicators:
characterizations and construction methods. Mediterr. J. Math. 4(3), 343–356 (2007)
20. Durante, F., Klement, E., Quesada-Molina, J., Sarkoci, P.: Remarks on two product-like con-
structions for copulas. Kybernetika (Prague) 43(2), 235–244 (2007)
21. Durante, F., Klement, E., Sempi, C., Úbeda-Flores, M.: Measures of non-exchangeability for
bivariate random vectors. Stat. Pap. 51(3), 687–699 (2010)
22. Durante, F., Kolesárová, A., Mesiar, R., Sempi, C.: Copulas with given diagonal sections: novel
constructions and applications. Int. J. Uncertain. Fuzziness Knowl.-Based Syst. 15(4), 397–410
(2007)
23. Durante, F., Papini, P.L.: Componentwise concave copulas and their asymmetry. Kybernetika
(Prague) 45(6), 1003–1011 (2009)
24. Durante, F., Papini, P.L.: Non-exchangeability of negatively dependent random variables.
Metrika 71(2), 139–149 (2010)
25. Durante, F., Rodríguez-Lallena, J.A., Úbeda-Flores, M.: New constructions of diagonal patch-
work copulas. Inf. Sci. 179(19), 3383–3391 (2009)
26. Durante, F., Saminger-Platz, S., Sarkoci, P.: Rectangular patchwork for bivariate copulas and
tail dependence. Comm. Stat. Theory Methods 38(15), 2515–2527 (2009)
27. Durante, F., Sarkoci, P., Sempi, C.: Shuffles of copulas. J. Math. Anal. Appl. 352(2), 914–921
(2009)
28. Durante, F., Sempi, C.: Principles of Copula Theory. CRC/Chapman and Hall, Boca Raton
(2015)
29. Fedorov, V.V.: The design of experiments in the multiresponse case. Theory Probab. Appl.
16(2), 323–332 (1971)
Asymmetric Copulas and Their Application in Design of Experiments 171
30. Fodor, J.C., Keresztfalvi, T.: Nonstandard conjunctions and implications in fuzzy logic. Int. J.
Approx. Reason. 12(2), 69–84 (1995)
31. Genest, C., Ghoudi, K., Rivest, L.P.: Understanding relationships using copulas, by E. Frees
and E. Valdez, January 1998. N. Am. Actuar. J. 2(3), 143–149 (1998)
32. Genest, C., Kojadinovic, I., Nešlehová, J., Yan, J.: A goodness-of-fit test for bivariate extreme-
value copulas. Bernoulli 17(1), 253–275 (2011)
33. Genest, C., MacKay, R.J.: Copules archimédiennes et familles de lois bidimensionnelles dont
les marges sont données. Can. J. Stat. 14(2), 145–159 (1986)
34. Genest, C., Nešlehová, J., Quessy, J.F.: Tests of symmetry for bivariate copulas. Ann. Inst. Stat.
Math. 64(4), 811–834 (2012)
35. Genest, C., Nešlehová, J., Ziegel, J.: Inference in multivariate Archimedean copula models.
TEST 20(2), 223–256 (2011)
36. Genest, C., Nešlehová, J.: Assessing and modeling asymmetry in bivariate continuous data. In:
Jaworski, P., Durante, F., Härdle, W. (eds.) Copulae in Mathematical and Quantitative Finance.
Lecture Notes in Statistics, pp. 91–114. Springer, Berlin (2013)
37. Hájek, P., Mesiar, R.: On copulas, quasicopulas and fuzzy logic. Soft Comput. 12(12), 123–
1243 (2008)
38. Harder, M., Stadtmüller, U.: Maximal non-exchangeability in dimension d. J. Multivar. Anal.
124, 31–41 (2014)
39. Heise, M.A., Myers, R.H.: Optimal designs for bivariate logistic regression. Biometrics 52(2),
613–624 (1996)
40. Jaworski, P., Pitera, M.: On spatial contagion and multivariate GARCH models. Appl. Stoch.
Models Bus. Ind. 30, 303–327 (2014)
41. Joe, H.: Dependence Modeling with Copulas. Chapman and Hall/CRC, London (2014)
42. Kawaguchi, M.F., Miyakoshi, M.: Composite fuzzy relational equations with noncommutative
conjunctions. Inf. Sci. 110(1–2), 113–125 (1998)
43. Khoudraji, A.: Contributions à l’étude des copules et à la modélisation des valeurs extrêmes
bivariées. Ph.D. thesis, Université de Laval, Québec (Canada) (1995)
44. Kiefer, J., Wolfowitz, J.: The equivalence of two extremum problems. Can. J. Math. 12, 363–366
(1960)
45. Klement, E.P., Mesiar, R.: How non-symmetric can a copula be? Comment. Math. Univ. Car-
olin. 47(1), 141–148 (2006)
46. Klement, E.P., Mesiar, R., Pap, E.: Triangular norms. Trends in Logic-Studia Logica Library,
vol. 8. Kluwer Academic Publishers, Dordrecht (2000)
47. Klement, E.P., Mesiar, R., Pap, E.: Archimax copulas and invariance under transformations.
C. R. Math. Acad. Sci. Paris 340(10), 755–758 (2005)
48. Kojadinovic, I., Yan, J.: A non-parametric test of exchangeability for extreme-value and left-tail
decreasing bivariate copulas. Scand. J. Stat. 39(3), 480–496 (2012)
49. Krupskii, P., Joe, H.: Factor copula models for multivariate data. J. Multivar. Anal. 120, 85–101
(2013)
50. Liebscher, E.: Construction of asymmetric multivariate copulas. J. Multivar. Anal. 99(10),
2234–2250 (2008)
51. Liebscher, E.: Erratum to Construction of asymmetric multivariate copulas. J. Multivar. Anal.
102(4), 869–870 (2011)
52. Mazo, G., Girard, S., Forbes, F.: A flexible and tractable class of one-factor copulas. Stat.
Comput., in press (2015)
53. McNeil, A.J., Nešlehová, J.: Multivariate Archimedean copulas, d-monotone functions and
1 -norm symmetric distributions. Ann. Stat. 37(5B), 3059–3097 (2009)
54. McNeil, A.J., Nešlehová, J.: From Archimedean to Liouville copulas. J. Multivar. Anal. 101(8),
1772–1790 (2010)
55. Nelsen, R.B.: An Introduction to Copulas. Springer Series in Statistics, 2nd edn. Springer, New
York (2006)
56. Nelsen, R.B.: Extremes of nonexchangeability. Stat. Pap. 48(2), 329–336 (2007)
172 F. Durante and E. Perrone
57. Nelsen, R.B., Quesada-Molina, J.J., Rodríguez-Lallena, J.A., Úbeda-Flores, M.: On the con-
struction of copulas and quasi-copulas with given diagonal sections. Insur. Math. Econom.
42(2), 473–483 (2008)
58. Perrone, E., Müller, W.G.: Optimal design for copula models. Statistics, in press (2015).
doi:10.1080/02331888.2015.1111892
59. Pronzato, L., Pázman, A.: Design of Eperiments in Nonlinear Models. Springer Lecture Notes
in Statistics, vol. 212 (2013)
60. Quessy, J.F., Bahraoui, T.: Graphical and formal statistical tools for assessing the symmetry of
bivariate copulas. Can. J. Stat. 41(4), 637–656 (2013)
61. Rényi, A.: On measures of dependence. Acta Math. Acad. Sci. Hungar. 10, 441–451 (1959)
62. Schweizer, B., Sklar, A.: North-Holland Series in Probability and Applied Mathematics. Prob-
abilistic metric spaces. North-Holland Publishing Co., New York (1983)
63. Schweizer, B., Wolff, E.F.: On nonparametric measures of dependence for random variables.
Ann. Stat. 9(4), 879–885 (1981)
64. Siburg, K.F., Stoimenov, P.A.: Symmetry of functions and exchangeability of random variables.
Stat. Pap. 52(1), 1–15 (2011)
65. Silvey, S.D.: Optimal Design (Science Paperbacks). Chapman and Hall (1980)
66. Sklar, A.: Fonctions de répartition à n dimensions et leurs marges. Publications de l’Institut de
Statistique de Paris 8, 229–231 (1959)
67. Trutschnig, W., Fernández Sánchez, J.: Some results on shuffles of two-dimensional copulas.
J. Stat. Plan. Infer. 143(2), 251–260 (2013)
68. Wu, S.: Construction of asymmetric copulas and its application in two-dimensional reliability
modelling. Eur. J. Oper. Res. 238, 476–485 (2014)
69. Wynn, H.P.: The sequential generation of D-optimum experimental designs. Ann. Math. Stat.
41(5), 1655–1664 (1970)
Copulæ of Processes Related to the Brownian
Motion: A Brief Survey
Carlo Sempi
Abstract The copulas of a few stochastic processes related to the Brownian motion
are derived; specifically, if (Xt ) is one such process, the copula of the pair (Xs , Xt ) is
determined for s < t.
1 Introduction
C. Sempi (B)
Dipartimento di Matematica e Fisica “Ennio De Giorgi”,
Università del Salento, Lecce 73100, Italy
e-mail: [email protected]
In other words, a copula C is (the restriction to the unit square I2 of) a d.f. that
concentrates all the probability mass on I2 and that has uniform margins. The main
result is provided by Sklar’s theorem [7].
If the marginals F and G are continuous, then the copula C is uniquely defined.
Under the assumptions of Theorem 1, if F and G are continuous, then there exists
a unique copula C associated with X that is determined, for all (u, v) ∈ [0,1[2 , via
the formula
C(u, v) = H F (−1) (u), G(−1) (v) , (2)
Theorem 2 ([1]) In the probability space (Ω, F , P) let the continuous random
variables X and Y have d.f.’s F and G and copula C. Then, one has a.e.
P(X ≤ x | Y )(ω) = E 1{X≤x} | Y (ω) = ∂2 C (FX (x), FY (Y (ω))) , (3)
P(Y ≤ y | X)(ω) = E 1{Y ≤y} | X (ω) = ∂1 C(FX (X(ω)), FY (y)) . (4)
(a) If both f and g are strictly increasing, then, for all (u, v) ∈ I2 ,
(b) if both f and g are strictly decreasing, then, for all (u, v) ∈ I2 ,
We refer to [4] and to the forthcoming monograph [2] for the properties of copulas
and to [3, 5, 8] for those of stochastic processes.
With the exception of the last remark, the results of this section were established
in [1].
Given a standard Brownian motion (=BM, for short) (Bt )t≥0 on the probabilitiy
space (Ω, F , P), we wish to find the copula Cs,tB
of the pair (Bs , Bt ) with s < t.
The transition probabilities for the BM (Bt ) are given, for s ∈ ]0, t[ and x and y
in R, by
y−x
P(x, s; y, t) := P(Bt ≤ y | Bs = x) = √ ,
t−s
where denotes the distribution function of the standard normal law N(0, 1). By
Theorem 2 one has
P(x, s; y, t) = ∂1 Cs,t
B
(Fs (x), Ft (y)) ,
As is well known (see, e.g., [3] or [5]) the d.f. Ft of Bt is given, for t > 0, by
x
Ft (x) = √ .
t
B
Replacing this result into (5) yields the copula Cs,t
√ −1 √
u
t (v) − s −1 (w)
B
Cs,t (u, v) = √ dw . (6)
0 t−s
This copula has partial derivatives given, for (u, v) ∈ ]0, 1[2 by
√ −1 √
t (v) − s −1 (u)
B
=
∂1 Cs,t (u, v) √ , (7)
t−s
u √ √
1 t t −1 (v) − s −1 (w)
∂2 Cs,t
B
(u, v) = −1 ϕ √ dw ,
ϕ (v) t−s 0 t−s
where ϕ is the density of the standard normal law N(0, 1). A further derivation of
B
(7) provides the density of Cs,t
√ √
t 1 t −1 (v) − s −1 (u)
B
cs,t (u, v) = ϕ √ . (8)
t − s ϕ (v)
−1 t−s
Notice that, because of Theorem 3(a), the copula of the Brownian bridge Bt∗ =
Bt − t B1 coincides with the copula (6) of the Brownian motion (Bt ).
A geometric Brownian motion (Xt ) satisfies the stochastic integral Eq. (see, e.g., [3])
t t
Xt = X0 + μ Xs ds + ν Xs dBs , (9)
0 0
density (8).
Copulæ of Processes Related to the Brownian Motion: A Brief Survey 177
which differs from the copula of Eq. (6). When X0 < 0, the density of Cs,t
GB−
(u, v) is
given by
√ √
t 1 t −1 (1 − v) − s −1 (1 − u)
GB−
cs,t (u, v) = ϕ √ .
t − s ϕ −1 (1 − v) t−s
Thus
OU −1
Ft (u) = −1 (u) .
The joint distribution function of (Us , Ut ) is given, for s ∈ ]0, t[ and for x and y in
R by
OU
OU
Fs,t (x, y) = P(Us ≤ x, Ut ≤ y) = Cs,t
OU
Fs (x), FtOU (y)
= Cs,t
OU
((x), (y)) , (13)
178 C. Sempi
so that
−1
OU
Cs,t (u, v) = Fs,t
OU
(u), −1 (v) .
St := sup{Bs : s ≤ t} .
By recourse to the reflection principle, see, e.g., [5, Sect. 1.3], one has
y
FSt (y) = 2 √ − 1 ;
t
Let Ht denote the joint distribution function of (Bt , St ); for x ≤ y, one has, because
of the continuity of both Bt and St and as a consequence of Eq. (13.4) in [5],
Copulæ of Processes Related to the Brownian Motion: A Brief Survey 179
Then the unique copula CtBS of the random vector (Bt , St ) can be obtained by
CtBS (u, v) = Ht FB−1
t
(u), FS−1
t
(v) .
Notice that in either case the result does not depend on time: the copula of Bt and
St is independent of t and is given by
u − −1 (u) − 2 −1 v+1 , u ≤ (v + 1)/2 ,
C (u, v) =
BS 2
v, u ≥ (v + 1)/2 .
180 C. Sempi
References
1. Darsow, W.F., Nguyen, B.E., Olsen, T.: Copulas and Markov processes. Illinois J. Math. 36,
600–642 (1992)
2. Durante, F., Sempi, C.: Principles of Copula Theory. Chapman & Hall/CRC, Boca Raton (2015)
3. Karatzas, I., Shreve, S.E.: Brownian Motion and Stochastic Calculus, 2nd edn. Springer, Berlin
(1991)
4. Nelsen, R.B.: An Introduction to Copulas, 2nd edn. Springer, New York (2006)
5. Rogers, L.C.G., Williams, D.: Diffusions, Markov processes and martingales. In: Foundations,
2nd Edn. Cambridge University Press (2000)
6. Schmitz, V.: Copulas and stochastic processes. Ph.D. Dissertation, Rheinische-Westfälische
Technische Hochschule, Aachen, Germany (2003)
7. Sklar, A.: Fonctions de répartition à n dimensions et leurs marges. Publ. Inst. Statist. Univ. Paris
8, 229–231 (1959)
8. Stirzaker, D.: Stochastic processes and models. Oxford University Press (2005)
Extensions of Capacities
1 Introduction
A. Kolesárová (B)
Faculty of Chemical and Food Technology, Slovak University of Technology in Bratislava,
Radlinského 9, 812 37 Bratislava 1, Slovakia
e-mail: [email protected]
A. Stupňanová
Faculty of Civil Engineering, Slovak University of Technology in Bratislava,
Radlinského 11, 810 05 Bratislava 1, Slovakia
e-mail: [email protected]
© Springer International Publishing Switzerland 2016 181
S. Saminger-Platz and R. Mesiar (eds.), On Logical, Algebraic, and Probabilistic
Aspects of Fuzzy Set Theory, Studies in Fuzziness and Soft Computing 336,
DOI 10.1007/978-3-319-28808-6_11
182 A. Kolesárová and A. Stupňanová
In what follows, we recall the concept of a universal integral on [0, 1], which was
introduced by Klement et al. in [12], as a common framework covering the Choquet,
Shilkret and Sugeno integrals.
Definition 1 A mapping I : (Mn × [0, 1]n ) → [0, 1] is called a universal inte-
n∈N
gral on [0, 1] whenever it satisfies the following axioms:
(UI1) For each fixed n ∈ N, I |Mn × [0, 1]n is nondecreasing in both components.
(UI2) There is a semicopula ⊗ : [0, 1]2 → [0, 1] such that for any n ∈ N, m ∈ Mn ,
c ∈ [0, 1] and K ⊆ N ,
I (m, c · 1 K ) = c ⊗ m(K ).
(UI3) For any (m 1 , x1 ), (m 2 , x2 ) ∈ (Mn × [0, 1]n ) such that m 1 ({i ∈ N1 | x1,i
n∈N
≥ t}) = m 2 ({ j ∈ N2 | x2, j ≥ t}) it holds that
I (m 1 , x1 ) = I (m 2 , x2 ).
i.e., universal integrals always extend the considered capacities. The monotonicity
of I (m, ·) is guaranteed due to (UI1), i.e., I (m, ·) is an aggregation function.
Now, we recall three basic universal integrals.
• The Choquet integral was introduced in 1953 in [5] as follows
1
Ch m (x) = m({i ∈ N | xi ≥ t})dt. (1)
0
There are three other equivalent formulas for the discrete Choquet integral. The
first one is based on the Möbius transform Mm : 2 N → R of a capacity m, given by
Mm (K ) = (−1)card(K \L) m(L), (2)
L⊆K
Note that the right-hand side of this formula is also known as the Lovász extension
of m [17].
184 A. Kolesárová and A. Stupňanová
Taking into account a geometrical meaning of (1), we can write the following
equivalent formulas for the discrete Choquet integral:
n
Ch m (x) = x(i) − x(i−1) m K (i) (4)
i=1
and
n
Ch m (x) = x(i) m K (i) − m K (i+1) , (5)
i=1
where (·) : N → N is a permutation such that x(1) ≤ · · · ≤ x(n) and K (i) = {(i), . . . ,
(n)}, i = 1, . . . , n. By convention, in (4), x(0) = 0 and in (5), K (n+1) = ∅. Note that if
there are some ties between the input values, there exist more permutations satisfying
the above constraints, but both formulas (4) and (5) always give the same value of
Ch m (x).
• The Shilkret integral was introduced in 1971 in [27], originally for maxitive mea-
sures only, as follows
• The Sugeno integral was introduced in 1974 in [29] (in Japanese published in
1972) as follows
Formally, both the Shilkret and Sugeno integrals can be seen as particular exam-
ples of the smallest universal integral on [0, 1] related to a semicopula ⊗, which was
introduced in [12] by the formula
Namely, the Shilkret integral can be obtained for the semicopula ⊗ = Π , Π (x, y) =
x y, and the Sugeno integral for ⊗ = Min, Min(x, y) = min{x, y}.
Now, let us consider copulas [25] as special types of semicopulas. Recall that
C : [0, 1]2 → [0, 1] is a copula if it is a supermodular semicopula, i.e., if for all
x, y ∈ [0, 1]2 it holds that
Here PC is a probability measure on Borel subsets of the unite square [0, 1]2
generated by the probabilities of the rectangles [0, x] × [0, y], PC ([0, x] × [0, y]) =
C(x, y). Consequently, using the notation as in (4) and (5), we can write the formula
(13) in the form
n
I[C] (m, x) = C x(i) , m(K (i) ) − C x(i−1) , m(K (i) ) (14)
i=1
and also as
n
I[C] (m, x) = C x(i) , m(K (i) ) − C x(i) , m(K (i+1) ) . (15)
i=1
Definition 2 Let n ∈ N and let C : [0, 1]2 → [0, 1] be a fixed copula. The (n, C)-
universal integral on [0, 1],
IC(n) : Mk × [0, 1]k → [0, 1],
k∈N
is given by
⎧ ⎛ ⎛ ⎛⎧ ⎫⎞ ⎞
⎨ n i ⎨ i ⎬
IC(n) (m, x) = sup ⎝C ⎝ a j , m ⎝ p ∈ {1, . . . , k} | x p ≥ a j ⎠⎠
⎩ ⎩ ⎭
i=1 j=0 j=0
⎛ ⎛⎧ ⎫⎞⎞⎞⎫
i−1 ⎨ i ⎬ ⎬
− C⎝ a j , m ⎝ p ∈ {1, . . . , k} | x p ≥ a j ⎠⎠⎠ ,
⎩ ⎭ ⎭
j=0 j=0
(16)
n
where a0 = 0, a1 , . . . , an ≥ 0 and j=1 a j ≤ 1.
Remark 1 Note that if n = 1, for an arbitrary semicopula S : [0, 1]2 → [0, 1], the
functional I S(1) = I S given by (16) is the (weakest) universal integral linked to S,
compare (10) and (11). However, I S(2) does not satisfy the axiom of the monotonicity
of universal integrals, in general. To ensure this, S should be supermodular, i.e., a
copula.
where W : [0, 1]2 → [0, 1], W (x, y) = max{0, x + y − 1}, is the Fréchet-Hoeffding
lower bound (the weakest copula), satisfies the given conditions. Then
2 2 2 2
I S(2) m, , , = ,
3 3 3 3
1 2 2 1 2 2 1 2 1 1 1 1
I S(2) m, , , = S ,1 + S , −S , = + − = ,
3 3 3 3 3 3 3 3 3 3 3 3
1 1 2 1 2 1 1 1 1 1 2
I S(2) m, , , = S ,1 + S , −S , = + −0= ,
3 3 3 3 3 3 3 3 3 3 3
Extensions of Capacities 187
Definition 3 Let n ∈ N and let C : [0, 1]2 → [0, 1] be a fixed copula. The (C, n)-
universal integral on [0, 1],
C
I(n) : Mk × [0, 1]k → [0, 1],
k∈N
is given by
⎧ ⎛ ⎛ ⎛⎧ ⎫⎞⎞
⎨ n i−1 ⎨ i−1 ⎬
C
I(n) (m, x) = inf ⎝C ⎝ a j , m ⎝ p ∈ {1, . . . , k} | x p > a j ⎠⎠
⎩ ⎩ ⎭
i=1 j=0 j=0
⎛ ⎛⎧ ⎫⎞⎞⎞⎫
i−1 ⎨
i−1 ⎬ ⎬
− C⎝ a j , m ⎝ p ∈ {1, . . . , k} | x p > a j ⎠⎠⎠ ,
⎩ ⎭ ⎭
j=0 j=0
(17)
n
where a0 = 0, a1 , . . . , an ≥ 0 and max{x1 , . . . , xn } ≤ j=1 a j ≤ 1.
C
Observe that for any copula C it holds that I(1) (m, x) = (max xi , m({i ∈ N | xi >
0})).
4
IΠ (m, x) = Sh m (x) = IΠ(1) (m, x) = ,
9
5
IΠ(2) (m, x) = ,
9
6
IΠ(3) (m, x) = I(3)
Π
(m, x) = Ch m (x) = ,
9
Π 7
I(2) (m, x) = ,
9
Π 9
I(1) (m, x) = = 1.
9
For the Fréchet-Hoeffding lower bound W we get:
1
IW (m, x) = IW(1) (m, x) = ,
3
2
IW(2) (m, x) = ,
3
3
IW(3) (m, x) = I[W ] (m, x) = I(3)
W
(m, x) = I(2)
W
(m, x) = I(1)
W
(m, x) = = 1,
3
and for the Fréchet-Hoeffding upper bound Min:
(1) 2
I Min (m, x) = Su m (x) = I Min (m, x) = · · · = I(2)
Min
(m, x) = ,
3
Min
I(1) (m, x) = 1.
The formula (3) can be seen as an extension of the capacity m constructed by means
of the Möbius transform of m and the aggregation function Min, which inspired us
to propose a generalization of this approach. We briefly recall the main idea and the
results from [14].
Let x = (x1 , . . . , xn ) ∈ [0, 1]n be any input n-tuple. To each subset K ⊆ N we
assign an n-tuple x K = (u 1 , . . . , u n ), where
xi if i ∈ K ,
ui =
1 otherwise.
Extensions of Capacities 189
Note that, in general, there is a difference between formulas (3) and (18). While in (3)
the aggregation function Min, Min(x1 , . . . , xk ) = min{x1 , . . . , xk }, is considered as
a k-ary aggregation function for k = 1, . . . , n, in (18) only a fixed n-ary aggregation
function A is applied. The k-tuples corresponding to the index sets K ⊆ N with
|K | = k are always completed into n-tuples by setting to 1 the components with the
indices in N \K . Moreover, formula (3) defines an extended aggregation function
[8], i.e., it can be applied for any arity n. However, for the n-ary aggregation function
A = Min for which e = 1 is a neutral element, both formulas coincide, i.e., Fm,Min
given by (18) is the Lovász extension of m [8, 17, 20]. Similarly, for the product
aggregationfunction A = Π , Π (x1 , . . . , xn ) = x1 · x2 · . . . · xn , (18) gives the same
values as Mm (K ) xi , i.e., Fm,Π is the so-called Owen extension of m [26].
K ⊆N i∈K
In general, the function Fm,A defined by (18) is neither an extension of m nor an
aggregation function.
Let us first characterize all aggregation functions A ∈ An such that for each m ∈
Mn , Fm,A is an extension of m.
Note that each copula C is also a quasi-copula, but in general, the opposite claim is
not true.
190 A. Kolesárová and A. Stupňanová
2x y
A(x, y) = C H0 ( f (x), g(y)) = ,
x+y
2ax 2by 2x y
Fm,A (x, y) = + + (1 − a − b) ,
x +1 y+1 x+y
where the sum is taken over all vertices c = (c1 , . . . , cn ) of the n-box [a, b] (i.e.,
each ck is equal to either ak or bk ), and α(c) is the number of indices k s such that
ck = ak .
For n ≥ 2, we have the following general result.
Extensions of Capacities 191
For example, all n-copulas [25, 28] are suitable aggregation functions for our
construction. Recall that n-copulas are defined as functions C : [0, 1]n → [0, 1]
satisfying
(C1) the boundary conditions:
if 0 ∈ {x1 , . . . , xn } then C(x1 , . . . , xn ) = 0,
C(1, . . . , 1, x j , 1, . . . , 1) = x j for each j = 1, . . . , n and each x j ∈ [0, 1],
(C2) the n-increasing property:
VC ([a, b]) ≥ 0 for each n-box [a, b] in [0, 1]n .
It is easy to see that the aggregation functions described in the following proposition
also have zero annihilator and that the volumes of all n-boxes in [0, 1]n are non-
negative.
is an n-ary aggregation function such that, for all m ∈ Mn , the function Fm,A is an
aggregation function extending m.
However, there are aggregation functions A which are neither copulas nor obtained
by a distortion of some copula as in Proposition 3, and in spite of that, for all m ∈ Mn ,
Fm,A is an aggregation function extending m, see the following example.
The function A is a ternary aggregation function with zero annihilator. After a quite
tedious computations one obtains that, for each m ∈ M3 , Fm,A is an aggregation func-
tion extending m. However, A is not a copula, because, e.g., for a = (0.3, 0.3, 0.3)
and b = (0.35, 0.35, 0.35) the A-volume of the corresponding 3-box is V A ([a, b]) =
−0.0019 < 0, i.e., the 3-increasing property of A fails. This example can be gener-
alized to any n > 3.
192 A. Kolesárová and A. Stupňanová
Finally, let us mention that in [15] we also studied extension methods based on
the possibilistic Möbius transform Mm∨ of a capacity m, where Mm∨ : 2 N → [0, 1] is
given by ⎧
⎨0 if m(K ) = m(L)
Mm∨ (K ) = for some L K , (21)
⎩
m(K ) otherwise.
More details on the possibilistic Möbius transform can be found, e.g., in [19, 21].
For a semicopula ⊗ : [0, 1]2 → [0, 1], a capacity m ∈ Mn and an aggregation
∨,⊗
function A ∈ An , let Fm,A : [0, 1]n → [0, 1] be a function given by
∨,⊗
Fm,A (x) = Mm∨ (K ) ⊗ A(x K ) , (22)
K ⊆N
∨,⊗
Fm,A (x) = (m(K ) ⊗ A(x K )). (23)
K ⊆N
∨,⊗
Theorem 4 Let A ∈ An , n ≥ 2. Then the function Fm,A is for each m ∈ Mn and
each semicopula ⊗ an aggregation function extending m if and only if A is an
aggregation function with zero annihilator.
∨,∧
i.e., Fm,Min is the Sugeno integral with respect to the capacity m (compare with
the relation of Fm,Min and the Choquet integral Ch m ).
∨,Π
• In general, for any n ≥ 2, Fm,Min leads to the Shilkret integral Sh m .
∨,⊗
• Similarly, for any semicopula ⊗, the function Fm,Min coincides with the weakest
universal integral I⊗ based on ⊗ with respect to the capacity m, see (10).
Inspired by the formula (4) for the discrete Choquet integral, we proposed its fusion
function-based modification [24], substituting the product in (4) by a binary fusion
Extensions of Capacities 193
function. Note that a binary fusion function is any function F : [0, 1]2 → [0, 1],
see [2].
Definition 4 ([24]) Let F : [0, 1]2 → [0, 1] be a fusion function satisfying for
each y ∈ [0, 1] the property F(0, y) = 0. Then the functional CmF : [0, 1]n → [0, n]
given by
n
CmF (x) = F x(i) − x(i−1) , m(K (i) ) . (24)
i=1
It is clear that for F = Π , for any capacity m ∈ Mn , CmΠ is equal to the Choquet
integral, CmΠ = Ch m .
However, in general, functions CmF need not be monotone. Moreover, their range
need not be contained in [0, 1], see the following example.
Example 5 Consider the function
⎧
⎨ 0 if x = 0,
F(x, y) = y if x = 1,
⎩
1 otherwise.
Then, for a uniform capacity m ∈ Mn , m(K ) = cardn (K ) , and any x ∈]0, 1]n , such that
card({x1 , . . . , xn }) = n, we have CmF (x) = n and CmF (1 K ) = m(K ) for each K ⊆ N .
Considering, for example, n = 3 and the points (0, 0, 0), (0.2, 0.3, 0.7) and (1, 1, 1),
we get CmF (0, 0, 0) = 0, CmF (0.2, 0.3, 0.7) = 3 and CmF (1, 1, 1) = 1, which shows
that the monotonicity of CmF is violated and Ran(CmF ) [0, 1]. Note that CmF is an
extension of m for any m ∈ Mn .
A deep study of properties of functionals CmF can be found in [24]. In this paper we
only focus on the properties of fusion functions F which are necessary and sufficient
for CmF to be an aggregation function and an extension of m for any m ∈ Mn .
Proposition 4 Let F : [0, 1]2 → [0, 1] be a fusion function. Then, for any capac-
ity m ∈ Mn , CmF extends m, i.e., CmF (1 K ) = m(K ) for each K ⊆ N , if and only if
F(1, y) = y for each y ∈ [0, 1].
Now, we fix n = 2. Then any capacity m ∈ M2 is completely determined by values
a, b ∈ [0, 1], where a = m({1}) and b = m({2}). For any F ∈ F0 and m ∈ M2 it
holds that
194 A. Kolesárová and A. Stupňanová
F(x, 1) + F(y − x, b) if x ≤ y,
CmF (x, y) = (25)
F(y, 1) + F(x − y, a) otherwise.
Note that always CmF (0, 0) = 0 and CmF (1, 1) = 1 whenever F(1, 1) = 1.
Let us denote by F(2) the class of all binary fusion functions F ∈ F0 such that for
each m ∈ M2 , CmF is a binary aggregation function. The following theorem provides
a characterization of all F ∈ F(2) .
Theorem 5 Let F ∈ F0 . Then F ∈ F(2) if and only if F(x, 1) = x for all x ∈ [0, 1]
and the function F(·, y) : [0, 1] → [0, 1] is increasing and 1-Lipschitz for each y ∈
[0, 1], i.e.,
0 ≤ F(x2 , y) − F(x1 , y) ≤ x2 − x1
whenever 0 ≤ x1 ≤ x2 ≤ 1.
Note that if CmF is an aggregation function for any m ∈ M2 , then CmF is idempotent
and translation invariant [24].
Before giving an example, we recall that each quasi-copula Q satisfies the con-
straints of Theorem 5, hence Q ∈ F(2) .
Example 6 (i) Consider the greatest quasi-copula Min and a capacity m ∈ M2 ,
with m({1}) = a, m({2}) = b. Then
min{y, x + b} if x ≤ y,
CmMin (x, y) =
min{x, y + a} otherwise
= max {min{y, x + b}, min{x, y + a}} ,
see Fig. 1.
(ii) Consider the smallest quasi-copula W and m ∈ M2 , determined by a, b ∈ [0, 1]
as in (i). Then
max{x, y + b − 1} if x ≤ y,
CmW (x, y) =
max{y, x + a − 1} otherwise
= min {max{x, y + b − 1}, max{y, x + a − 1}} ,
see Fig. 2.
In Theorem 5, we have characterized the class F(2) of all binary fusion functions
F ∈ F0 such that the F-based discrete Choquet-like integral CmF defined on [0, 1]2
is a binary aggregation for any m ∈ M2 . Now, we will characterize the class F(n) ,
n > 2, of all binary fusion functions F ∈ F0 for which CmF defined on [0, 1]n is an
n-ary aggregation function for any m ∈ Mn .
Theorem 6 Let F ∈ F0 . For any n > 2, F ∈ F(n) if and only if F ∈ G , where G
is the set of all F ∈ F0 which are given by F(x, y) = x f (y), for some increasing
function f : [0, 1] → [0, 1] satisfying f (1) = 1.
Extensions of Capacities 195
Fig. 1 2D illustration of 1
CmMin from Example 6(i) for 0.9
a = 1/2, b = 1/3 0.8 x+b
0.7 y
0.6
b
0.5
0.4
0.3 x
0.2
y+a
0.1
0
0 0.2 0.4 0.6 0.8 1
a
Remark 2 In the case of Theorem 6, if F ∈ F(n) , then CmF = Ch f (m) , where Ch f (m)
is the Choquet integral with respect to the capacity f (m) ∈ Mn given by
0 if K = ∅,
f (m)(K ) =
f (m(K )) otherwise.
n
n
CmF (x) = F x(i) − x(i−1) , m K (i) = x(i) − x(i−1) f m K (i)
i=1 i=1
= Ch f (m) (x).
Summarizing, we have proved that for any n > 2, an F-based discrete Choquet-
like integral CmF is an n-ary aggregation function for any m ∈ Mn if and only if
it is the Choquet integral with respect to the capacity m distorted by a function f
generating F, i.e., CmF = Ch f (m) .
In our approach, we have generalized the formula (4) of the discrete Choquet
integral. Note that the formula (5) can be generalized in a similar way. For more
details we refer to [10].
196 A. Kolesárová and A. Stupňanová
The formula (9) for the discrete Sugeno integral Su m : [0, 1]n → [0, 1] can be writ-
ten as
n
Su m (x) = min x(i) , m K (i) .
i=1
Inspired by this formula, for any fusion function F : [0, 1]2 → [0, 1], we can define
the function Su mF : [0, 1]n → [0, 1] by the formula
n
Su mF (x) = F x(i) , m K (i) , (26)
i=1
Example 7 (i) For fusion functions F given by F(x, y) = x λ y, with λ ∈]0, ∞[,
the formula (26) gives
Su mF (x) = Sh m x1λ , . . . , xnλ .
(ii) If F(x, y) = min x λ , y , λ ∈]0, ∞[, then
Su mF (x) = Su m x1λ , . . . , xnλ .
for each (x1 , . . . , xn ) ∈ [0, 1]n and c > 0 such that (x1 + cr1 , . . . , xn + crn ) ∈ [0, 1]n .
Note that aggregation functions are pre-aggregation functions which are nondecreas-
ing in the direction of any vector r ∈ [0, 1]n \ {0}.
Proposition 5 Let F : [0, 1]2 → [0, 1] be a function increasing in the first variable
and let for each y ∈ [0, 1], F(0, y) = 0 and F(1, 1) = 1. Then SmF defined in (26) is
a pre-aggregation function for any capacity m ∈ Mn .
Example 8 Consider the function F : [0, 1]2 → [0, 1], F(x, y) = x|2y − 1|. Note
that F is a proper pre-aggregation function (not an aggregation function) which
Extensions of Capacities 197
satisfies the constraints of Proposition 5, and thus, for any m ∈ Mn , the function
n
SmF : [0, 1]n → [0, 1], SmF (x) = F x(i) , m A(i) is a pre-aggregation function
i=1
(even an aggregation function). For example, for n = 2, m({1}) = 1/3, m({2}) =
3/4, we get
x ∨ 2y i f x ≤ y,
Sm (x, y) =
F
y ∨ x3 i f x > y.
Note that any function F satisfying the constraints of Proposition 5 is, in fact,
a binary (1, 0)-nondecreasing pre-aggregation function which satisfies F(0, y) = 0
for each y ∈ [0, 1].
It is not difficult to check that Su mF , given by (26), extends an arbitrary capacity m
whenever F(1, y) = y for each y ∈ [0, 1]. Moreover, the monotonicity of F ensures
the monotonicity of Su mF , independently of m. If F is an aggregation function, so is
CmF . Summarizing these facts, we obtain the following result.
Theorem 7 Let F : [0, 1]2 → [0, 1] be an aggregation function such that F(0, y) =
0 and F(1, y) = y for each y ∈ [0, 1]. Then, for any n ≥ 2 and m ∈ Mn , the func-
tion Su mF : [0, 1]n → [0, 1] defined by (26), is an aggregation function extending the
capacity m.
References
1. Beliakov, G., Pradera, A., Calvo, T.: Aggregation Functions: A Guide for Practitioners.
Springer, Heidelberg (2007)
2. Bustince, H., Fernandez, J., Kolesárová, A., Mesiar, R.: Directional monotonicity of fusion
functions. Eur. J. Oper. Res. 244, 300–308 (2015)
3. Calvo, T., Kolesárová, A., Komorníková, M., Mesiar, R.: Aggregation operators: properties,
classes and construction methods. In: Calvo, T., Mayor, G., Mesiar, R. (eds.) Aggregation
Operators. New Trends and Applications, pp. 3–107. Physica-Verlag, Heidelberg (2002)
4. Chateauneuf, A., Jaffray, J.Y.: Some characterizations of lower probabilities and other
monotone capacities through the use of Möbius inversion. Math. Soc. Sci. 17, 263–283 (1989)
5. Choquet, G.: Theory of capacities. Ann. Inst. Fourier 5, 131–295 (1953)
6. Durante, F., Sempi, C.: Semicopulae. Kybernetika 41, 315–328 (2005)
7. Grabisch, M.: Fuzzy integral in multicriteria decision making. Fuzzy Sets Syst. 69, 279–298
(1995)
8. Grabisch, M., Marichal, J.-L., Mesiar, R., Pap, E.: Aggregation Functions. Cambridge Univer-
sity Press, Cambridge (2009)
9. Grabisch, M., Murofushi, T., Sugeno, M.: Fuzzy measures and integrals. Theory and Applica-
tions. Physica Verlag, Heidelberg (2000)
10. Horanská, Ľ., Šipošová, A.: A note on a generalization of the Choquet integral. In: Proceedings
of Uncertainty Modelling 2015, STU, Bratislava (2015)
11. Klement, E.P., Mesiar, R., Pap, E.: Measure-based aggregation operators. Fuzzy Sets Syst.
142(1), 3–14 (2004)
198 A. Kolesárová and A. Stupňanová
12. Klement, E.P., Mesiar, R., Pap, E.: A universal integral as common frame for Choquet and
Sugeno integral. IEEE Trans. Fuzzy Syst. 18, 178–187 (2010)
13. Klement, E.P., Mesiar, R., Spizzichino, F., Stupňanová, A.: Universal integrals based on cop-
ulas. Fuzzy Optim. Decis. Making 13, 273–289 (2014)
14. Kolesárová, A., Stupňanová, A., Beganová, J.: Aggregation-based extensions of fuzzy mea-
sures. Fuzzy Sets Syst. 194, 1–14 (2012)
15. Kolesárová, A., Stupňanová, A.: On some extensions methods for normed utility functions. In:
Proceedings AGOP’2011, pp. 169–174. Benevento, Italy (2011)
16. Lehrer, E.: A new integral for capacities. Econ. Theor. 39, 157–176 (2009)
17. Lovász, L.: Submodular function and convexity. In: Mathematical Programming: The State of
the Art, pp. 235–257. Springer, Berlin (1983)
18. Lucca, G., Sanz, J.A., Pereira Dimuro, G., Bedregal, B., Mesiar, R., Kolesárová, A., Bustince,
H.: Pre-aggregation functions: construction and an application. IEEE Trans. Fuzzy Syst. (2015),
accepted for publication
19. Marichal, J.-L., Mathonet, P., Tousset, E.: Mesures floues définies sur une échelle ordinale.
Working paper (1996)
20. Marichal, J.-L.: Aggregation of interacting criteria by means of the discrete Choquet integral. In:
Calvo, T., Mayor, G., Mesiar, R. (eds.) Aggregation Operators. New Trends and Applications,
pp. 224–244. Physica-Verlag, Heidelberg (2002)
21. Mesiar, R.: k-Order pan-additive discrete fuzzy measures. In: 7th IFSA World Congress, pp.
488–490. Prague (1997)
22. Mesiar, R., Li, J., Pap, E.: Superdecomposition integral. Fuzzy Sets Syst. 259, 3–11 (2015)
23. Mesiar, R., Stupňanová, A.: Decomposition integrals. Int. J. Approximate Reasoning 54, 1252–
1259 (2013)
24. Mesiar, R., Kolesárová, A., Bustince, H., Dimuro, G.P., Bedregal, B.: Fusion functions based
Choquet-like integrals. Submitted (2015)
25. Nelsen, R.B.: An Introduction to Copulas, 2nd edn. Springer, New York (2006)
26. Owen, G.: Multilinear extensions of games. In: The Shapley value. In: Roth, A.E. (ed.) Essays
in Honour of Lloyd S. Shapley, pp. 139–151. Cambridge University Press (1988)
27. Shilkret, N.: Maxitive measure and integration. Indag. Math. 33, 109–116 (1971)
28. Sklar, A.: Fonctions de répartition à n dimensions et leurs marges. Publ. Inst. Stat. Univ. Paris
8, 229–231 (1959)
29. Sugeno, M.: Theory of fuzzy integrals and its applications. PhD thesis, Tokyo Institute of
Technology (1974)
30. Wang, Z., Klir, G.J.: Fuzzy Measure Theory. Plenum Press, New York (1992)
31. Yang, Q.: The pan integral on the fuzzy measure space. Fuzzy Math. 3, 107–114 (1985). (in
Chinese)
Multi-source Information Fusion Using
Measure Representations
Ronald R. Yager
1 Introduction
R. Yager (B)
Machine Intelligence Institute, Iona College, New Rochelle, NY 10801, USA
e-mail: [email protected]
A set function we can associate with any measure is its dual. We define the dual of
μ as the set function μ̂ : X → [0, 1] defined such that μ̂(A) = 1 − μ(A) where A is
the complement of A. We can easily show that if μ is a measure on X then μ̂ is also
ˆ
a measure on X . We note that the dual of the dual is the original measure, μ̂(A) =
1 − μ̂(A) = 1 − (1 − μ(A)) = μ(A). Thus a measure and its dual are unique pairs.
We say an element x ∈ X is irrelevant with respect to μ if for all A we have
μ(A ∪ {x}) = μ(A). We observe that if x is irrelevant then μ({x}) = 0. If we define
E −x = X \ {x} and if x is irrelevant then μ(E −x ) = 1. We see this follows since
X = E −x ∪ {x} and hence μ(X ) = μ(E −x ∪ {x}) = μ(E −x ). We must emphasize
that μ({x}) = 0 does not necessarily make x irrelevant. We note that if x is irrelevant
to μ it is also irrelevant to its dual μ̂.
A fuzzy measure provides a very general structure for the representation of our
knowledge about an uncertain variable. Let V be a variable taking its value in the
space X . In using μ to express our knowledge about V we provide the following
Multi-source Information Fusion Using Measure Representations 201
interpretation. For any subset B of X we have that μ(B) indicates our anticipation
that the value of V lies in B. We see that μ(∅) = 0 reflects the fact that the value of
V will not be in the null set. The property μ(X ) = 1, called the normality condition,
indicates the fact that we are completely confident that the value of V lies in X .
Finally the monotonicity of μ reflects the fact that you cannot be more confident of
finding the value V in the set B than in a set that contains B. Here we say that the set
or event A happens if the value of V lies in A, thus μ(A) is seen as our anticipation
that the event A happens.
We shall use the expression V is μ to denote the situation where knowledge
about V is carried by the set measure μ. In the following we shall restrict our-
selves to the case where X is finite. Let us look at the representation of some types
of knowledge using this representation. Consider first the case of certainty where
we know that the value V is exactly x1 . We can express this using a fuzzy mea-
sure μ such that μ(B) = 1 if x1 ∈ B and μ(B) = 0 if x1 ∈ / B. We refer to this as
a Dirac measure focused at x1 . Consider now the case of probabilistic uncertainty
where we have Pr ob(xi ) = pi . In this case we define μ such that μ(B) = xi ∈B pi .
Here we see that μ({xi }) = pi . Thus we see that the probability measure has the
property of additivity. Thus the probability measure is a special fuzzy measure.
In this case we refer to μ(B) by the more specific name of probability n of B,
μ(B) = Pr ob(B). Since μ(X ) = Pr ob(X ) = 1 here we require i=1 pi = 1. For
this measure μ(A ∪ B) = μ(A) + μ(B) − μ(A ∩ B). We see here if A ∩ B = ∅
then μ(A ∪ B) = μ(A) + μ(B). We note that this measure requires only knowl-
edge of the pi to completely determine μ(A) for all A. A special case of this
is where all pi are the same, here pi = 1/n where n is the cardinality of X .
Another important type of uncertainty representable in this framework is possibilis-
tic uncertainty [4, 21]. We recall that usually this type of uncertainty is obtained
from a linguistic description of our knowledge. Here we associate with each xi a
value πi ∈ [0, 1] called the possibility of xi . In this case the associated measure
μ is called a possibility measure and is defined as μ(A) = maxxi ∈A πi . The con-
dition that μ(X ) = 1 requires that at least one πi = 1. We see for this measure
that μ(A ∪ B) = max(μ(A), μ(B)) for any A and B. We note that the Dirac mea-
sure is the only measure that is both a probability and possibility measure. Another
class of measures are decomposable measures. These generalize the possibility mea-
sure. Here if S is any t-conorm [7] then if μ({xi }) = ai we have μ(A) = Sxi ∈A (ai ).
A requirement here is that μ(X ) = Sxi ∈X (ai ) = 1. We note that for these mea-
sure if A ∩ B = ∅, then μ(A ∪ B) = S(μ(A), μ(B)). Since max is a t-conorm we
see that the possibility measure is an example of these decomposable measures.
Another notable type of measure is one in which μ(A ∩ B) = min(μ(A), μ(B))
these are referred to as certainty or necessity measures [2]. In this case we refer
to μ(A) as the necessity of A and use the notation N ec(A) instead of μ(A). We
note that if we let Fi = X \ {xi } then we can completely define this measure using
μ(Fi ). Here for any A, μ(A) = min xi ∈A (μ(Fi )). We shall refer to μ(Fi ) = βi . Since
μ(∅) = min x j ∈X (β j ) then we see that fact that μ(∅) = 0 requires at least one β j = 0.
The case where all β j are equal implies all β j = 0. We note that a necessity mea-
sure is the dual of a possibility measure. We also note that these necessity measures
202 R.R. Yager
are notable in that while for all measures μ(A ∩ B) ≤ min(μ(A), μ(B)), the neces-
sity measure attains this minimum, μ(A ∩ B) = min(μ(A), μ(B)). A generalization
of the necessity measure can be obtained by defining μ(A ∩ B) = T (μ(A), μ(B))
where T is a t-norm [7]. Here again all we need are n values for μ(Fi ) = βi . Another
important example of fuzzy measures are the cardinality based uncertainty measures.
Here our anticipation that the value of the variable lies in a particular subset just
depends on the number of elements in the subset. No distinction is made between
the individual outcomes elements. Let ai be a collection of parameters such that
0 = a0 ≤ a1 ≤ · · · ≤ an = 1. Let us denote |B| as the cardinality of the set B, the
number of elements in B. We now define μ(B) = a|B| . Here then for all subsets
with the same number of elements we have the same of anticipation of finding the
value in it. We can show for this measure that no elements are impossible. A useful
parameter associated with these measures is wi = ai − ai−1 which indicates the ben-
efit of adding the ith element to a set. Two important examples of cardinality based
uncertainty measures are the optimistic measure μ∗ and the pessimistic measure μ∗ .
For μ∗ we have μ∗ (A) = 1 for A = ∅. and μ∗ (∅) = 0 and for μ∗ (A) = 0 for A = X
and μ∗ (X ) = 1. For μ∗ we have a j = 1 for all j ≥ 1 and a0 = 0 and for μ∗ we have
a j = 0 for all j < n and an = 1. Another example of cardinality-based measure is
one where ai = i/n. We can use the wi to provide a measure of optimism associated
with a cardinality based measure μ on X [15]. If |X | = n then
n
(n − j)w j
O p(μ) = .
j=1
n−1
Assume μ1 is some fuzzy measure on X and F : [0, 1] → [0, 1] is such that F(0) =
0, F(1) = 1, and F(a) ≥ F(b) if a > b then it can be shown that μ(A) = F(μ1 (A))
is a fuzzy measure [19].
Another class of derived measures are those composed from other measures.
Let μ1 and μ2 be two fuzzy measures on X . We can show that the set function μ
defined such that μ(A) = μ1 (A)μ2 (A) for all A ⊆ X is also a measure. We see that
μ(∅) = μ1 (∅)μ2 (∅), μ(X ) = μ1 (X )μ2 (X ) and if A ⊇ B then μ1 (A) ≥ μ1 (B) and
μ2 (A) ≥ μ2 (B) and hence μ(A) = μ1 (A)μ2 (A) ≥ μ1 (B)μ2 (B) = μ(B). Thus μ
as defined above is a fuzzy measure.
q This result can easily be extended to the fusion
of q fuzzy measure μ(A) = i=1 μi (A).
In the following we shall provide a generalization of this result, which shall form
the basis of our approach to the fusion of multi-source information.
Multi-source Information Fusion Using Measure Representations 203
is a fuzzy measure.
Proof 1. μ(∅) = G(μ1 (∅), . . . , μq (∅)) = G(0, 0, . . . , 0) = 0,
2. μ(X ) = G(μ1 (X ), . . . , μq (X )) = G(1, . . . , 1) = 1,
3. We have μ(A) = G(μ1 (A), . . . , μq (A)) and μ(B) = G(μ1 (B), . . . , μq (B)).
With B ⊆ A, since μ j (A) ≥ μ j (B) for all j, then μ(A) ≥ μ(B).
In situations where we have a finite space and the μ j are simple measures the
calculation of the aggregate measure is relatively easily. We shall refer to these mea-
sures as quasi-simple. Let X = {x1 , . . . , xq }. Assume we have q simple measures,
μ j for j = 1 to q. For j = 1 to q and i = l to n let αi j be the parameters associated
with these measures. Thus here μk (A) just depends on αik for xi ∈ A. Let G be
some aggregation function then with μ(A) = G(μ1 (A), . . . , μq (A)) the calculation
of μ(A) just depends on the values of μ j (A) which in turn just depends on the para-
meter αi j . The key point here is that for a measure defined using the aggregation of
simple measures, the calculation of the measure of a set is not complex.
of computing with words [22, 23] can become very useful for translating these
instructions into formal operations.
The type of aggregation operators previously introduced provides a very useful
tool for implementing a wide body of expert provided instructions for fusing mul-
tiple pieces of information. One of our interests here is to look at the use of these
aggregation operators for the fusion of information expressed via fuzzy measures.
We shall be particularly concerned with probability and possibility type information
as they represent two very important classes of provided information. We note that
possibilistic information often arises from a linguistic description of the value of
some variable. An example of this is information such as the house is close. Here
close is a linguistic term that can be expressed using fuzzy sets which in turn induces
a possibility distribution on the variable “the distance of the house to the river.”
Probabilistic information often appears because it provides an effective model to
represent the accuracy of physical sensing devices. It should be emphasized that the
use of probability is not a reflection of randomness in the variable of interest but a
reflection of the lack of accuracy of the measuring device.
where T is a t-norm.
Let us look at this for some notable cases of μ1 and μ2 . Consider first the case
where V is μ1 , indicates the value of V is exactly x1 and V is μ2 , indicates the value
of V is exactly x2 . Here μ1 has μ1 (A) = 1 for all A s.t. x1 ∈ A and μ1 (A) = 0 for
all A s.t. x1 ∈
/ A. μ2 has similar information, μ2 (A) = 1 if x2 ∈ A and μ2 (A) = 0 if
x2 ∈/ A. In this case we get as our fused value μ(A) = 1 if {x1 , x2 } ⊆ A and μ(A) = 0
otherwise. Here we have an anticipation of one of finding the value of V in any set
containing both x1 and x2 . It is interesting to note that in the case where we have
two sources giving very different values for the variable we provide a fused solution
that tried to unify these two conflicting values rather then reporting a conflict. This
is useful, because for some decisions the action taken when V is either x1 or x2 is
the same so a solution need not differentiate. In a more general situation, if we have
Multi-source Information Fusion Using Measure Representations 205
q sources each saying that V has the value x j then our fused function μ would be
such that μ(A) = 1 if {x1 , . . . , x j } ⊆ A and μ(A) = 0 otherwise. Again here we see
a kind of unification of the different pieces of information.
We can now consider the case where μ1 is as above but where μ2 is a possibility
distribution with μ2 ({x j }) = π j and μ2 (A) = maxx j ∈A π j . In this case we have for
any subset A with x1 ∈ A that μ(A) = maxx j ∈A π j and μ(A) = 0 if x1 ∈ / A. In
particular we note that μ({x1 }) = π1 and μ({x j }) = 0 for j = 1. We see that μ is
not a pure possibility distribution. We note that all the preceding results hold for any
t-norm since at least one of the arguments was always with 1 or 0.
We again consider the case of two pieces of information, V is μ1 and V is μ2
where μ1 indicates that V is exactly x1 while μ2 is any arbitrary measure. Here
we get for the fusion of the conjunction of these that V is μ where μ is such that
μ(A) = 0 if x1 ∈ / A and μ(A) = μ2 (A) if x1 ∈ A.
An interesting special case of this occurs when μ2 is a probability distribution
with pi the probability of xi . In this case we see that while μ({x1 }) = p1 we have
μ({x j }) = 0 for j = 1. Furthermore for any A such that x1 ∈ / A we get μ(A) = 0
while for any A such that x1 ∈ A we have μ(A) = x j ∈A p j . We see μ is not
a probability measure if p1 < 1. As indicated earlier μ(A) is our anticipation of
finding a value V in the subset A. We observe that μ(A) = 0 or μ(A) ≥ p1 .
We now consider the conjunction in the case where μ1 is a probability distribution
and μ2 is a possibility distribution. In this case μ(A) = T (μ1 (A), μ2 (A)) where
μ1 (A) = x j ∈A p j and μ2 (A) = maxxi ∈A πi . We observe here that for any t-norm
T we get A = {x j } that μ({x j }) = T ( p j , π j ).
Let us look at this fusion of probabilistic and possibilistic information for some
notable examples of t-norms. We first consider the product t-norm. In this case
μ({x j }) = p j · π j and more generally
⎛ ⎞
μ(A) = μ1 (A) · μ2 (A) = ⎝ p j ⎠ · maxx j ∈A π j = Prob(A) · Poss(A).
x j ∈A
We further observe that the product t-norm leads to a nice formulation where
⎛ ⎞
We further observe that if A has one x j such that π j = 1 then μ( Ã) = μ(A) + p j+1 .
We see here that we have introduced a new class of measures, which we shall refer
to as P2 measures. Let us understand this measure. This measure is characterized
by two sets of parameters
n P = [ p1 , . . . , pn ] and = [π1 , . .
. , πn ] where all pi and
πi ∈ [0, 1] and i=1 pi = 1 and max j π j = 1. Here μ(A) = x j ∈A pi · maxx j ∈A π j
and in particular μ({x j }) = π j p j .
We note that this approach can be easily generalized to the case where we have
more than two sources of information some of which are possibilistic and some
probabilistic. Thus here if V is μi are a collection of q pieces of information q about V
then their fusion under the preceding “anding” imperative is μ(A) = i=1 μi (A).
Here we now describe some tools useful in the characterization of fuzzy measures
that are used to model uncertain variables. These are particularly useful in compar-
ing fuzzy measures with respect to properties such as their overall uncertainty. We
refer the reader to [16–18] more details on these. In probability theory an impor-
tant tool is the Shannon entropy. With this tool we are able to indicate the overall
uncertainty of a probability distribution. Assume we have a probability distribu-
tion on the space X = {x1 , . . . , xn } where p j is the probability of x j . Here p j is
Multi-source Information Fusion Using Measure Representations 207
capturing the distinction in our belief about each of the elements, p j is the support
for x j as the outcome.
We recall the Shannon entropy associated with this distribution
is H (P) = − j p j ln( p j ). The Shannon entropy quantifies the overall uncertainty
associated with the probability distribution. In [16] we extended this idea to fuzzy
measures. In order to extend this to a general measure μ we first introduce the idea
of the Shapley index [11]. For any x j ∈ X we define its Shapley index S j as
n−1
Sj = (rk (μ(K ∪ {x j }) − μ(K ))).
k=0 K ⊂F j
|K |=k
(n−k−1)!k!
where λk = n!
. We call Ck the kth cardinality index.
We see that Ck measures the average gain in anticipation we get in going from a
subset
of cardinality k to one of cardinality k + 1. In [18] it was shown that Ck ∈ [0, 1]
n−1
and k=0 Ck = 1.
Yager [18] used the cardinality index to define a characteristic of a fuzzy measure
called its attitudinal character. The attitudinal character of a measure m is defined as
1
n−1
AC(μ) = (n + k + 1)Ck .
n − 1 k=0
n−1
We note that since k=0 Ck = 1 then we can express the attitudinal character as
n−1
kCk
AC(μ) = 1 − k=0
.
n−1
In [18] it was shown that AC(μ) ∈ [0, 1]. Furthermore the large values of AC
indicate a more optimistic type of measure while the small values indicate a more
Multi-source Information Fusion Using Measure Representations 209
pessimistic type of measure, AC(μ) = 1 being the most optimistic and AC(μ) =
0 being the most pessimistic. Thus the attitudinal character is providing a scale
characterizing a measures degree of optimism/pessimism about its anticipation of
finding the variables value in a given subset.
In [18] we obtained the cardinality index for cardinality-based measures and
probabilistic type measures. We recall for a cardinality based measure μ we have a
set 0 ≤ a0 ≤ ai ≤ an = 1 such that μ(E) = a|E| , where |E| is the cardinality of E.
We showed that for this type of measure the cardinality index is
Ck = ak+1 − ak for k = 0 to n − 1.
A few notable examples of this are worth pointing out here. In the case of μ∗ where
μ∗ (E) = 1 for all E = ∅ and μ∗ (∅) = 0 it was shown that C0 = 1 and Ck = 0 for
all k = 0. In the case μ∗ where μ∗ (E) = 0 for E = X and μ(X ) = 1 then Cn−1 = 1
and Ck = 0 for other k. In the case where a j = nj then Ck = n1 for all k = 0 to n − 1.
Another situation studied in [18] was the additive or probabilistic uncertainty.
Here we associate with each xi a value pi ∈ [0, 1] so that they sum to one. In [18] it
was shown that independent of the value of the pi the cardinality index for all k is
always Ck = n1 . Thus the cardinality index doesn’t distinguish between probability
distributions, all probability distributions have the same cardinalityindex.
n−1
kC
Let us now consider the attitudinal character, AC(μ) = 1 − k=0 n−1
k
, of these
∗
measures. First we consider μ , here C0 = 1 and all other Ck = 0 and hence
AC(μ∗ ) = 1. This is as expected since μ∗ is the most optimistic measure. On the
other hand for μ∗ since Cn−1 = 1 and all other Ck = 0 we have AC(μ∗ ) = 0. Again
this is as expected since μ∗ is very pessimistic. Consider now the probabilistic case
where Ck = n1 for all k. Here AC(μ) = 0.5, this has a neutral value of 0.5. This is
also the case of the cardinality based measure with a j = nj .
We shall investigate the relationship between the cardinality index of a measure
μ and the cardinality index of its dual μ̂. We recall for μ,
C k = λk ( (μ(K ∪ {x}) − μ(K ))),
all K x ∈K
/
|K |=k
Since μ̂ is the dual of μ then μ̂(E) = 1 − μ(E). In this case we see that μ̂(K ) =
1 − μ(K ) and μ̂(K ∪ {x}) = 1 − μ(K ∪ {x}) and there then
(μ̂(K ∪ {x}) − μ̂(K )) = (μ(K ) − μ(K ∪ {x})).
x ∈K
/ x ∈K
/
210 R.R. Yager
Here we shall discuss the concepts of assurance and opportunity which play an
important role in the process of answering questions using information obtained
from the fusion of measures. Consider now we have a variable V with domain X .
Assume our knowledge about the value of V is expressed using a measure μ on X .
Thus for any subset A of X the value μ(A) indicates our anticipation of A occurring,
that is of finding the value of V in the set A. Does a value μ(A) = 1 assure us that
A will occur? Let us look at this in more detail. Consider the measure μ∗ (A) = 1
for all A = ∅. In the case we have μ∗ (A) = 1. However in this case we also have
μ∗ (A) = 1 so here we have just as strong an anticipation that A will not occur. In
order for us to be assured that A will occur we have to anticipate A will occur and
also anticipate that A will not occur. The degree to which our anticipation that A will
not occur can be measured by 1 − μ(A). However we note that this is the dual of
μ, μ̂(A). Using this we introduce a set function called the assurance of A which we
define as λ(A) = μ(A) ∧ μ̂(A) [20]. We easily see that λ is a measure. One thing
we note about this measure of assurance is that λ(A) ≤ μ(A). Thus the measure of
assurance of a set is never larger than its measure of anticipation. We also observe
that λ(A) ≤ μ̂(A).
Consider the two measures μ and μ̂ its dual. We have λμ (A) = μ(A) ∧ μ̂(A) and
ˆ
λμ̂ (A) = μ̂(A) ∧ μ̂(A). ˆ
However since μ̂(A) = μ(A) we have for measures that are
duals we have λμ (A) = λμ̂ (A). However it is not the case that μ̂(A) = μ(A).
Closely related to the concept of assurance is the concept of opportunity. We define
this as the set function Ψ where Ψ (A) = μ(A) ∨ μ̂(A) and we refer to Ψ (A) as the
opportunity associated with A. It essentially measures the opportunity of V lying in
A. It is clear this Ψ (A) ≥ λ(A). We can easily show that Ψ is itself a measure. We note
that Ψ (A) ≥ μ(A) and Ψ (A) ≥ μ̂(A). As is the case for the measure of assurance
we see that if μ and μ̂ are duals then Ψμ (A) = Ψμ̂ (A). We indicate that Ψ (A) is seen
as the opportunity that the value of V will lie in A. We note that Ψ (A) can be seen
as an optimistic view of the occurrence of A and λ(A) as a pessimistic view.
Multi-source Information Fusion Using Measure Representations 211
Thus in this case while μ∗ (A) = 1 for all A = ∅ we have λ∗ (A) = 0 for all A = X .
Thus this case of complete lack of information about the value of V allows for no
assurance about anything except that V lies in its domain X .
Consider now μ∗ , we see μ∗ (A) = 0 for all A = X and hence λ∗ (A) = 0 for
A = X . Thus here again in this case of complete lack of knowledge about a value
of V we get λ(A) = 0. Thus here while μ∗ (A) and μ∗ (A) have different values for
most A we have λ∗ (A) = λ∗ (A) for all A.
We further observe since Ψ is the dual of λ then in the cases of μ∗ and μ∗ we have
that Ψ ∗ (A) = Ψ∗ (A) and furthermore there are such that Ψ ∗ (A) = Ψ∗ (A) = 1 for
all A = ∅. Thus here while we have no assurance that V lies in A we have complete
opportunity for it to lie in A.
Thus these measures μ∗ and μ∗ have the same values for λ and Ψ . Since these
both correspond to lack of any information we can refer to these as λ? and Ψ? . In the
preceding we have shown Ψ? (A) = 1 for all A = ∅ and ψ? (∅) = 0 and λ? (A) = 0
for all A = X . Here we see except for the extremes of X and ∅ the values of Ψ? (A)
and λ? (A) are as far apart as they can possibly be. Here we see for each A we have
an opportunity but no assurance for finding V .
Let us now consider nμ is a probability measure. We recall her
the case where
μ({xi }) = pi , μ(A) = xi ∈A pi and i=1 pi = 1. Consider now the dual of this
measure. In this case
μ̂(A) = 1 − μ(A) = 1 − pi = pi = μ(A).
xi ∈A xi ∈A
212 R.R. Yager
Thus in the case of a probability measure the measure and its dual are the same.
We furthermore see it is also equal to the assurance and opportunity measures
Then here we Ψ (A) = λ(A) = μ(A). We note that this fact holds for any self-dual
fuzzy measure, some times called participation measures.
This is an important and very special property of the probability distribution.
There is no difference between any of the measures. The measures of opportunity,
assurance and anticipation are all the same. In the situation μ(A) is often referred to
as the probability of A.
An important special case of probability measure is one in which all p j = n1 .
In this case μ(A) = card(A)
n
. In some situations this special case has been used to
represent lack of knowledge. However we emphasize this formulation implies some
knowledge about V . In particular, it at least assumes that the measure of all the
elements μ({x j }), are the same.
Consider now the measure μ in the case where we know that V = x ∗ . Here
we recall μ(B) = 1 if x ∗ ∈ B and μ(B) = 0 for x ∗ ∈ / B. Consider the dual of this
μ̂(B) = 1 − μ(B). We see μ(B) = 1 if x ∗ ∈ / B that is if x ∗ ∈
/ B. Also we see that
μ(B) = 0 if x ∗ ∈/ B that is if x ∗ ∈ B. Thus μ̂(B) = 1 if x ∗ ∈ B. We see that in this
case μ̂ and μare the same and hence
Measures having this property are referred to as certainty measures [2]. Since μ
and μ̂ are unique pairs we refer to μ̂ as the associated certainty measure. Certainty
measures are also referred to as necessity measures
A very important and special relationship exists between a possibility measure
and its dual certainty measure. If μ is a possibility measure then for any set A we
Multi-source Information Fusion Using Measure Representations 213
have μ(A) ≥ μ̂(A). Let us see that this is true. Here we have μ(A) = maxxi ∈A (πi )
and μ̂(A) = 1 − maxxi ∈A (πi ). We note that at least one of the πi = 1. If A has one
of the xi with πi = 1 then μ(A) = 1 and μ(A) ≥ μ̂(A). If there exists no element
with πi = 1 contained in A then at least one such element must be in A and hence
maxxi ∈A (πi ) = 1 and therefore μ̂(A) = 0 and again we have μ(A) ≥ μ̂(A). An
important implication of this is that for possibility measures we always have
Thus here the measure of assurance of A is equal to the dual measure of A, the
certainty of A. Furthermore in this case of possibility measures we always have
8 Conclusion
References
1. Beliakov, G., Pradera, A., Calvo, T.: Aggregation Functions: A Guide for Practitioners.
Springer, Heidelberg (2007)
2. Dubois, D., Prade, H.: Necessity measures and the resolution principle. IEEE Trans. Syst. Man
Cybern. 17, 474–478 (1987)
3. Dubois, D., Prade, H.: Possibility Theory: An Approach to Computerized Processing of Uncer-
tainty. Plenum Press, New York (1988)
214 R.R. Yager
4. Dubois, D., Prade, H.: Formal representations of uncertainty. In: Bouyssou, D., Dubois, D.,
Pirlot, M., Prade, H. (eds.) Decision Making Process: Concepts and Methods. Wiley, Hoboken
(2010)
5. Grabisch, M., Marichal, J.-L., Mesiar, R., Pap, E.: Aggregation Functions. Cambridge Univer-
sity Press, Cambridge (2009)
6. Klement, E.P.: A theory of fuzzy measures: a survey. In: Gupta, M.M., Sanchez, E. (eds.) Fuzzy
Information and Decision Processes, pp. 59–66. North-Holland, Amsterdam (1982)
7. Klement, E.P., Mesiar, R., Pap, E.: Triangular Norms. Kluwer Academic Publishers, Dordrecht
(2000)
8. Klir, G.J.: Uncertainty and Information. Wiley, New York (2006)
9. Liu, L., Yager, R.R.: Classic works of the Dempster-Shafer theory of belief functions: an
introduction. In: Yager, R.R., Liu, L. (eds.) Classic Works of the Dempster-Shafer Theory of
Belief Functions, pp. 1–34. Springer, Heidelberg (2008)
10. Llinas, J., Nagi, R., Hall, D.L., Lavery, J.: A multi-disciplinary university research initiative in
hard and soft information fusion: Overview, research strategies and initial results. In: Proceed-
ings of the 13th International Conference on Information Fusion (Fusion 2010). Edinburgh,
UK, Unpaginated (2010)
11. Shapley, L.S.: A value for n-person games. In: Kuhn, H.W., Tucker, A.W. (eds.) Contributions
to Game Theory, pp. 307–317. Princeton University Press, Princeton (1953)
12. Sugeno, M.: Fuzzy measures and fuzzy integrals: a survey. In: Gupta, M.M., Saridis, G.N.,
Gaines, B.R. (eds.) Fuzzy Automata and Decision Process, pp. 89–102. North-Holland, Ams-
terdam (1997)
13. Walley, P., Fine, T.: Toward a frequentist theory of upper and lower probability. Ann. Stat. 10,
741–761 (1982)
14. Wang, Z., Yang, R., Leung, K.S.: Nonlinear Integrals and Their Applications in Data Mining.
World Scientific, Singapore (2010)
15. Yager, R.R.: On ordered weighted averaging aggregation operators in multi-criteria decision
making. IEEE Trans. Syst. Man Cybern. 18, 183–190 (1988)
16. Yager, R.R.: On the entropy of fuzzy measures. IEEE Trans. Fuzzy Sets Syst. 8, 453–461
(2000)
17. Yager, R.R.: Measuring the information and character of a fuzzy measure. In: Proceedings
of the Joint 9th IFSA World Congress and the 20th NAFIPS International Conference, pp.
1718–1722. Vancouver (2001)
18. Yager, R.R.: On the cardinality index and attitudinal character of fuzzy measures. Int. J. Gen.
Syst. 31, 303–329 (2002)
19. Yager, R.R.: A measure based approach to the fusion of possibilistic and probabilistic uncer-
tainty. Fuzzy Optim. Decis. Making 10, 91–113 (2011)
20. Yager, R.R.: Measures of assurance and opportunity in modeling uncertain information. Int. J.
Intell. Syst. 27, 776–796 (2012)
21. Zadeh, L.A.: Fuzzy sets as a basis for a theory of possibility. Fuzzy Sets Syst. 1, 3–28 (1978)
22. Zadeh, L.A.: From computing with numbers to computing with words–From manipulation of
measurements to manipulations of perceptions. IEEE Trans. Circuits Syst. 45, 105–119 (1999)
23. Zadeh, L.A.: Generalized theory of uncertainty (GTU)-principal concepts and ideas. Comput.
Stat. Data Anal. 51, 15–46 (2006)
Bases and Transforms of Set Functions
Michel Grabisch
Abstract The chapter studies the vector space of set functions on a finite set X , which
can be alternatively seen as pseudo-Boolean functions, and including as a special
cases games. We present several bases (unanimity games, Walsh and parity functions)
and make an emphasis on the Fourier transform. Then we establish the basic duality
between bases and invertible linear transform (e.g., the Möbius transform, the Fourier
transform and interaction transforms). We apply it to solve the well-known inverse
problem in cooperative game theory (find all games with same Shapley value), and
to find various equivalent expressions of the Choquet integral.
1 Introduction
Set functions on a finite set X are of fundamental usage in many areas of discrete
mathematics, e.g., cooperative game theory [20], combinatorial optimization [11],
decision making [14], computer sciences [6], and more generally operations research
[16], where in the latter domain, they are more often encountered under the form
of pseudo-Boolean functions. Specific domains focus on specific subclasses of set
functions, e.g., game theory uses set functions vanishing on the empty set (these
are characteristic functions of transferable utility games, which are simply called
“games”), decision theory needs games which are monotone with respect to inclusion
(called capacities), while combinatorial optimization often deals with submodular
games.
An interesting feature of set functions and games is that they form a vector space
of dimension 2|X | (2|X |−1 for games). Most often, this feature is ignored, although
clearly one can take advantage of the concepts and techniques of linear algebra
when dealing with set functions and games. In particular the notion of basis is of
importance. The best-known basis is perhaps the basis of unanimity games (this is the
M. Grabisch (B)
Paris School of Economics, University of Paris I, 106-112, Bd de L’Hôpital,
75013 Paris, France
e-mail: [email protected]
usual name given in game theory; they are closely related to the incidence functions in
combinatorics, see [1]), although each domain has its prefered bases. For example, in
computer sciences mostly the basis of parity functions is used, essentially because the
encoding of sets is done by −1, +1, rather than by 0, 1. The usage of a particular basis
induces a particular representation of set functions, viewed as a transform, which is
by definition linear and invertible. For example, the representation through the basis
of unanimity games is the Möbius transform (or Möbius inverse), widely used in
combinatorics and well known in decision making and game theory (known under
the name of Harsanyi dividends [17] in the latter domain), while the representation
through parity functions is the Fourier transform, which has many applications in
computer sciences.
As far as the author can see from the literature, the duality between bases and
transforms (i.e., the representation of set functions into a given basis) has never been
exploited nor even remarked. A systematic exploitation of this fact can lead to the
discovery of new bases and transforms, as well as an easy solution to the so called
inverse problem in game theory: find all games having the same Shapley value (and
similar ones). Also, it permits to get several different expressions of linear operators
on games or set functions, like the Choquet integral.
The aim of this chapter is to bring a survey on the above mentioned issues.
Section 2 gives a brief account on set functions and pseudo-Boolean functions.
Section 3 describes the best-known bases (unanimity games, Walsh functions and
parity functions). Section 4 is about the Fourier transform and its properties, while
Sect. 5 explains the fundamental duality between bases and transforms, and gives
many examples. Sections 6 and 7 are applications of this duality principle to the
solution of the above mentioned inverse problem and to the finding of equivalent
expressions of the Choquet integral.
1
ξ, ξ = ξ(S)ξ (S).
2n S⊆X
between pseudo-Boolean functions and set functions can be seen through the coding
function 1 : 2 X → {0, 1}n defined by A → 1 A , with 1 A (i) = 1 if and only if i ∈ A.
Then
ξ f = f ◦ 1, f ξ = ξ ◦ 1−1
1
f, f = f (x) f (x).
2n x∈{0,1}n
Defining the set function u ∅ (S) = 1 for every S ⊆ X , it is well known that {u S } S∈2 X
X
is a basis for R2 . It is also well known that the coordinates of ξ in this basis are the
Möbius transform coefficients:
ξ= m ξ (S)u S
S∈2 X
with
m ξ (S) = (−1)|S\T | ξ(T ).
T ⊆S
A drawback of the basis of unanimity games is that it is not orthogonal w.r.t. the
above scalar product, as it is easy to see even with n = 2:
1
u {1} , u {2} = (u {1} ({1})u {2} ({1}) + u {1} ({2})u {2} ({2})
4
1
+ u {1} ({1, 2})u {2} ({1, 2}) = = 0.
4
In the formalism
of pseudo-Boolean functions, unanimity games u S correspond
to monomials i∈S xi . Hence we have
f = m f (S) xi
S⊆[n] i∈S
where m f is the Möbius transform of ξ f (this slight abuse of notation should not be
confusing).
It can be shown that the Walsh functions {wT }T ⊆[n] form an orthornomal basis of the
pseudo-Boolean functions:
m f (S) 1 f
f (x) = w T (x) = I (T )wT (x) (2)
T ⊆[n] S⊇T
2|S| T ⊆[n]
2|T | B
Bases and Transforms of Set Functions 219
f
where IB is the Banzhaf interaction transform defined in terms of the Möbius trans-
form by
f
1 |S\T |
IB (T ) = m f (S) (T ∈ 2 X )
S⊇T
2
Another family of functions related to the Walsh functions are the parity functions.
The parity function associated to S ⊆ [n] is the function
χ S (x) = (−1)1S ·x = (−1) i∈S xi
(x ∈ {0, 1}n ). (4)
Its name comes from the fact that it takes only values −1 and +1, depending on
whether there is an odd or even number of elements of coordinates of x equal to 1
in S. It expression as a set function is
Up to a recoding by ε(1) = 0 and ε(−1) = 1, the parity functions are the Walsh
functions:
w S (z) = z i = (−1) i∈S ε(zi ) = χ S (ε(z)) (z ∈ {−1, 1}n ).
i∈S
220 M. Grabisch
In the basis of parity functions, it can be shown that any pseudo-Boolean function f
is expressed by
f =
f (S)χ S
S⊆[n]
1
f (S) = f, χ S = n (−1)1S ·x f (x) (S ⊆ [n])
2 x∈{0,1}n
1
ξ(S) = n (−1)|S∩T | ξ(T ) (S ⊆ [n]). (6)
2 T ⊆[n]
1
E[ f ] = f (x)
2n x∈{0,1}n
Var[ f ] = E ( f − E[ f ])2 = E[ f 2 ] − E2 [ f ].
1
( f ∗ g)(x) = f (x ⊕ y)g(y) (x ∈ {0, 1}n )
2n y∈{0,1}n
1 ⊕ 1 = 0 = 0 ⊕ 0, 1 ⊕ 0 = 0 ⊕ 1 = 1.
Bases and Transforms of Set Functions 221
1
( f ∗ g)(S) = f (SΔT )g(T ) (S ∈ 2 X ).
2n T ⊆[n]
The properties of the Fourier transform are gathered in the next theorem.
Theorem 1 Let f, g be two pseudo-Boolean functions. The following holds.
(i) f (0) = f (S);
S⊆[n]
(ii)
f (∅) = E[ f ];
(iii) (Parseval’s identity) f 2 = S⊆[n]
f 2 (S);
(iv)
f 2 (S) = Var[ f ];
S∈2[n] \{∅}
(vi) (
f ∗ g)(S) = g (S) for all S ∈ 2[n] .
f (S)
The name “Fourier transform” comes from the work of Fourier on the repre-
sentation of integrable functions. The Fourier transform of a function, viewed as
a function of time, gives its frequency representation. Exactly the same results as
those in Theorem 1 hold for the original Fourier transform, which explains its name
in computer sciences. However, it must be noted that all the properties, except the last
one on convolution, are direct consequences of the orthonormality of the basis. To
the opinion of the author, this transform should be rather called the Walsh transform,
since the definition of the Walsh function is an infinite version of the parity function
used here, as a comparison of (3) and (4) reveals.
The previous sections have introduced various bases on set functions and pseudo-
Boolean functions: the unanimity games, the Walsh functions and the parity func-
tions. As a by-product, two fundamental notions in combinatorics and computer
sciences have appeared, namely the Möbius transform and the Fourier transform.
The name “transform” intuitively means (in particular by reference to well-known
transforms used in mathematics for the analysis of real-valued functions: the (origi-
nal) Fourier transform and the Laplace transform, mainly) a representation in another
domain, but equivalent to the original one, i.e., the main desirable characteristic of
the transform is that it should be invertible.
222 M. Grabisch
if one consider set functions ξ as row vectors and use matrix notation. To a basis
S ) S∈2 X , we make correspond the matrix B = [b S ] of row vectors b S . Hence ξ =
(b
S∈2 X w S b S = w B is the expression of ξ in this basis. The following lemma gives
the exact equivalence between bases and transforms.
Lemma 1 [10] For every basis B, there is a (unique) transform Ψ such that for any
X
ξ ∈ R2 ,
ξ= Ψ ξ (S)b S , (7)
S∈2 X
whose inverse Ψ −1 is given by ξ → (Ψ −1 )ξ = S∈2 X ξ(S)b S = ξ B.
Conversely, to any transform Ψ corresponds a unique basis B such that (7) holds,
given by b S = (Ψ −1 )δS .
In the above lemma, δ S denotes the Dirac set function defined by
1, if S = T
δ S (T ) = (S ∈ 2 X ).
0, otherwise
with
m ξ (S) = (−1)|S\T | ξ(T ).
T ⊆S
(ii) The co-Möbius transform ([15], a.k.a. commonality function [23]) is defined
by:
m̌ ξ (S) = (−1)n−|T | ξ(T ) = (−1)|T | ξ(X \ T ) (S ∈ 2 X ).
T ⊇X \S T ⊆S
Bases and Transforms of Set Functions 223
(−1)|T | if S ∩ T = ∅
ǔ T (S) = (−1)|B| δT (B) =
0 otherwise.
B⊆X \S
where
k
k
βkl = Bl− j (k ≤ l),
j=0
j
and B0 , B1 , . . . are the Bernoulli numbers. The first values of βkl are given in
Table 1.
The associated basis {bTI }T ∈2 X is
|T |
bTI (S) = β|T ∩S| (S ∈ 2 X ).
1 1
2 −3
1 1
6 − 30
1
2 1
6 −6
1 2
15
3 0 − 30
1
4 − 30
1
224 M. Grabisch
(v) The Fourier interaction transform: as it was already explained in Sect. 4, the
Fourier transform, as given by (6), corresponds to the basis of parity functions,
given by (5).
The relations between the Fourier transform and the Möbius and Banzhaf
transforms are given as follows:
1
ξ(S) = (−1)|S| m ξ (K ). (9)
2 k
K ⊇S
−1 s
ξ(S) =
ξ
IB (S). (10)
2
(vi) The Walsh basis: This basis is defined by (1). Let us recover the correspond-
ing transform ξ → W ξ by using Lemma 1, which we already gave in (2). By
Lemma 1, the inverse transform is immediate:
(W −1 )ξ (S) = ξ(T )(−1)|T \S| .
T ⊆X
(vii) The Yokote basis: (see [27, 28]) it is a basis of the set of games, which is
defined by
1, if |S ∩ T | = 1
bTY (S) = (S ∈ 2 X \ ∅). (12)
0, otherwise
where the coordinates Y v (S) define the Yokote transform Y . We give now Y v
in terms of m v and v, as well as the inverse relations:
m v (S) = |S|(−1)|S|+1 Y v (K ) (∅ = S ⊆ X ). (14)
K ⊇S
1
Y v (S) = (−1)|S|+1 m v (K ) (∅ = S ⊆ X ). (15)
K ⊇S
|K |
(n − s − l)!(s + l − 1)!
Y v (S) = (−1)|S∩L|+1 v(L). (16)
L⊆X
n!
Hence, the interaction and Banzhaf interaction transforms can be seen as extensions
of these values.
The duality between transforms and bases permits to easily solve the so-called
“inverse problem”, well-known in game theory (see, e.g., [8, 9, 18, 28]): given a
game v on X , find all games v having the same Shapley value (or any other linear
value: Banzhaf, egalitarian, etc.), i.e., Φ Sh (v) = Φ Sh (v ).
Considering a linear value Φ and a game v, finding all games v such that
Φ(v) = Φ(v ) amounts by linearity to solving Φ(v − v ) = 0, i.e., v − v ∈ ker(Φ).
Hence the solution of the inverse problem reduces to finding the kernel of the linear
operator Φ.
The kernel is easily found if there exists a transform Ψ extending the linear value
Φ, exactly as the interaction transform extends the Shapley value (see (17)). Indeed,
the kernel is just the space spanned by the vectors f S of the corresponding basis
with |S| > 1. We illustrate this method with the Shapley value. For any game v, its
epxression in the basis induced by the interaction transform is:
v= I v (S)b SI = ΦiSh (v)b{i}
I
+ I v (S)b SI ,
S∈2 X i∈X |S|>1
which implies
v ∈ ker(Φ Sh ) ⇐⇒ v= I v (S)b SI
|S|>1
i.e.,
ker(Φ Sh ) = λ S b SI | λ S ∈ R .
|S|>1
1
v(∅) = λ∅ + (λ12 + λ13 + λ23 )
6
1 1 1 1
v(1) = λ∅ − λ12 − λ13 + λ23 + λ123
3 3 6 6
228 M. Grabisch
1 1 1 1
v(2) = λ∅ − λ12 + λ13 − λ23 + λ123
3 6 3 6
1 1 1 1
v(3) = λ∅ + λ12 − λ13 − λ23 + λ123
6 3 3 6
1 1 1 1
v(12) = λ∅ + λ12 − λ13 − λ23 − λ123
6 3 3 6
1 1 1 1
v(13) = λ∅ − λ12 + λ13 − λ23 − λ123
3 6 3 6
1 1 1 1
v(23) = λ∅ − λ12 − λ13 + λ23 − λ123
3 3 6 6
1
v(123) = λ∅ + (λ12 + λ13 + λ23 ),
6
where λ S ∈ R for every S ⊆ {1, 2, 3}.
Let us give a second illustrative example with the Banzhaf value. Consider the
following problem: Given a n-dim vector y, find all games v s.t. Φ B (v) = y. The set
of solutions is simply the set of games of the form
v = v y + w, with w ∈ ker(Φ B ),
and v y is any game s.t. Φ B (v y ) = y. Since the Banzhaf interaction transform gener-
alizes the Banzhaf index, we have
with bTIB (S) = (1/2)|T | (−1)|T \S| , and Sp denotes the space spanned by the vectors.
Now, v y can be obtained as the inverse transform of the game w defined by w({i}) =
yi for all i ∈ X , and w(S) = 0 otherwise. This yields by (8):
1
v y (S) = yi − yi .
2 i∈S i ∈S
/
In the case where there is no known transform which extends the linear value
under consideration, a general method is given by [10], consisting in the following
steps:
X
(i) Select a basis E = {e1 , . . . , ek } of the range Φ(R2 ).
X
(ii) Find set functions b1 , . . . , bk ∈ R2 such that Φ(bi ) = ei , i = 1, . . . , k.
(iii) Complete the independent set {b1 , . . . , bk } to form a basis B = {b1 , . . . , b2n }
X
of R2 .
( j) ( j)
(iv) Compute the coordinates 1 , . . . , k of Φ(b j ) in E for j = k + 1, . . . , 2n .
( j)
(v) Compute bΦj = b j − i=1 k
i bi for j = k + 1, . . . , 2n .
Φ
Finally, the basis of the kernel is {bk+1 , . . . , b2Φn }.
Bases and Transforms of Set Functions 229
∞
f dv = v({x ∈ X | f (x) ≥ α}) dα (18)
0
where σ is a permutation on [n] such that f σ(1) ≤ f σ(2) ≤ · · · ≤ f σ(n) , and f σ(0) = 0.
Remark 1 Usually, the Choquet integral is defined w.r.t. a capacity, that is, a game
which is monotone with respect to set inclusion. However, it is not possible to extend
the definition to arbitrary set functions ξ, i.e., such that ξ(∅) = 0. Indeed, if ξ(∅) =
0, it is easily seen from (18) that the integral becomes unbounded.
Expressing v in some basis, it is then possible to get the expression of the Choquet
integral w.r.t. the corresponding transform.
Let Ψ be a linear invertible transform, and {bΨA } A∈2 X be the corresponding basis of
set functions. Due to Remark 1, one has to be careful because many bases are com-
posed of set functions which are not games. Therefore, some adaptation is necessary.
From a basis {bΨA } A∈2 X , we build a basis of games {bΨ
A } A∈2 X \{∅} as follows:
bΨS (T ), if T = ∅
bΨ
S (T ) = (S ∈ 2 X \ {∅}).
0, otherwise
Then, from the linearity of the integral, for every f ∈ R X and every game v,
v
f dv = fd Ψ (A)bΨ
A = Ψ v (A) f dbΨ
A .
∅= A⊆X ∅= A⊆X
230 M. Grabisch
It is therefore sufficient to compute f dbΨ
A for every A ⊆ X , A = ∅. One obtains
the following expressions, for the main bases:
• For the Möbius transform:
f du A = fi (19)
i∈A
where
v+ I v (A), if I v (A) > 0
I (A) = ,
0, otherwise
v− I v (A), if I v (A) < 0
I (A) = (A ∈ 2 X ).
0, otherwise
References
4. Chateauneuf, A., Jaffray, J.-Y.: Some characterizations of lower probabilities and other
monotone capacities through the use of Möbius inversion. Math. Soc. Sci. 17, 263–283 (1989)
5. Choquet, G.: Theory of capacities. Annales de l’Institut Fourier 5, 131–295 (1953)
6. de Wolf, R.: A brief introduction to Fourier analysis on the Boolean cube. Theory Comput.
Lib. Graduate Surv. 1, 1–20 (2008)
7. Dempster, A.P.: Upper and lower probabilities induced by a multivalued mapping. Ann. Math.
Stat. 38, 325–339 (1967)
8. Dragan, I.: The potential basis and the weighted Shapley value. Libertas Mathematica 11,
139–150 (1991)
9. Dragan, I.: The least square values and the Shapley value for cooperative TU games. TOP 14,
61–73 (2006)
10. Faigle, U., Grabisch, M.: Bases and linear transforms of TU-games and cooperation systems.
Int. J. Game Theory, to appear
11. Fujishige, S.: Submodular functions and optimization. In: Annals of Discrete Mathematics,
vol. 58, 2nd edn. Elsevier, Amsterdam (2005)
12. Grabisch, M.: k-Order additive discrete fuzzy measures and their representation. Fuzzy Sets
Syst. 92, 167–189 (1997)
13. Grabisch, M., Labreuche, Ch.: The symmetric and asymmetric Choquet integrals on finite
spaces for decision making. Stat. Pap. 43, 37–52 (2002)
14. Grabisch, M., Labreuche, Ch.: A decade of application of the Choquet and Sugeno integrals
in multi-criteria decision aid. Ann. Oper. Res. 175, 247–286 (2010). doi:10.1007/s10479-009-
0655-8
15. Grabisch, M., Marichal, J.-L., Roubens, M.: Equivalent representations of set functions. Math.
Oper. Res. 25(2), 157–178 (2000)
16. Hammer, P.L., Rudeanu, S.: Boolean Methods in Operations Research and Related Areas.
Springer (1968)
17. Harsanyi, J.C.: A simplified bargaining model for the n-person cooperative game. Int. Econ.
Rev. 4, 194–220 (1963)
18. Kleinberg, N.L., Weiss, J.H.: Equivalent n-person games and the null space of the Shapley
value. Math. Oper. Res. 10(2), 233–243 (1985)
19. O’Donnell, R.: Analysis of Boolean functions, draft 2.0, ch. 1–3 (2007). http://www.cs.cmu.
edu/~odonnell11/boolean-analysis
20. Peleg, B., Sudhölter, P.: Introduction to the Theory of Cooperative Games. Kluwer Academic
Publisher (2003)
21. Rota, G.C.: On the foundations of combinatorial theory I. Theory of Möbius functions.
Zeitschrift für Wahrscheinlichkeitstheorie und Verwandte Gebiete. 2, 340–368 (1964)
22. Roubens, M.: Interaction between criteria and definition of weights in MCDA problems. In:
44th Meeting of the European Working Group “Multicriteria Aid for Decisions”. Brussels,
Belgium (1996)
23. Shafer, G.: A Mathematical Theory of Evidence. Princeton University Press (1976)
24. Shapley, L.S.: A value for n-person games. In: Kuhn, H.W., Tucker, A.W. (eds.) Contributions
to the Theory of Games, Vol. II, number 28 in Annals of Mathematics Studies, pp. 307–317.
Princeton University Press (1953)
25. Walley, P.: Coherent lower (and upper) probabilities. Technical Report 22, University of War-
vick, Coventry (1981)
26. Walsh, J.: A closed set of normal orthogonal functions. Am. J. Math. 45, 5–24 (1923)
27. Yokote, K.: Weak addition invariance and axiomatization of the weighted Shapley value. Int.
J. Game Theory 44, 275–293 (2015)
28. Yokote, K., Funaki, Y., Kamijo, Y.: Linear basis to the Shapley value. Technical report, Waseda
Economic Working Paper Series (2013)
Conditioning for Boolean Subsets, Indicator
Functions and Fuzzy Subsets
Siegfried Weber
Abstract This chapter deals with measure-free conditioning. It starts with the mean
value based definition of conditional fuzzy subsets which again gives a fuzzy subset.
Applying this general construction to indicator functions, it is proved that these
conditionals form an MV-algebra and that this is isomorphic to the already known
MV-algebra of the interval based conditional Boolean subsets. In the following,
the problem of iteration is completely solved with the result that there are exactly
two types of iteration, called the blurred resp. the sharper one, which remain in the
corresponding MV-algebras. Moreover, the general concept of conditional operators
plays a significant role. Finally, the problem of extending an uncertainty measure is
discussed.
1 Introduction
S. Weber (B)
Institut Für Mathematik, Universität Mainz, FB 08, Mainz, Germany
e-mail: [email protected]
Another motivation for the present paper came from several talks at the Linz Sem-
inar about “a new axiomatic approach”, where the conditional events are introduced
by three-valued “generalized indicator functions”
(CB) (A B) = [A ∩ B , B → A] for A, B ∈ B,
denoting this set by B̃. This lattice-interval approach has been treated in detail in a lot
of publications, see e.g. [9, 11, 17] and the references therein. In this approach the
“conditional Boolean event” (A B) is defined as the set of all possible conditional
candidates between the Boolean events “A and B” resp. “if B then A” as the extreme
candidates. In contrast to this approach, it is well-known that there is no reasonable
way to define a conditional as some Boolean subset (A | B) between these two extreme
Boolean subsets. On the other hand, this alternative approach really is applicable to
indicator functions. It seems that this approach has not yet treated systematically.
This will be done in the present paper, including the problem of iteration.
More concretely, we start with the general construction, introduced in [18], of
conditional f uzzy sets defined pointwisely by
based on a mean value function C on the unit interval [0, 1] which is compatible
with the complement in [0, 1]. The properties of a mean value function C are very
natural in order to choose some fuzzy subset between the extreme fuzzy subsets.
The additional property of compatibility guarantees that C generates a conditioning
operator | on F. Now, applying this construction to indicator functions, we prove that
the subset F1 of the conditional indicator functions
1
(CI) (1A | 1B ) = 1 · 1A∩B + · 1Bc + 0 · 1Ac ∩B
2
“blurred iteration” and the second type “sharper iteration”. Furthermore, we prove
that any conditioning operator | on F1 is generated by exactly one of these two
compatible mean value functions.
Moreover, it follows that i : F1 → B̃ is an MV-algebra isomorphism between
the two types (CI) resp. (CB) of conditionals. We use this result to obtain two
corresponding conditioning operators ˜| on B̃ by
The main results of the present paper were presented in the author’s talk at the
34th Linz Seminar on Fuzzy Set Theory, 2013.
The paper is organized as follows. In Sect. 2 we put together the basic prerequisites,
referring to MV-algebras, mean value functions, conditioning operators and the con-
ditional fuzzy subsets (CF) in F. Section 3 treats the conditional indicator functions
(CI) in F1 which form an MV-algebra (Theorem 1). Section 4 treats the conditional
Boolean subsets (CB) in B̃ (Theorem 2) and the MV-algebra isomorphism between
F1 and B̃ (Theorem 3). Section 5 is dedicated to the iteration processes in both F1
(Theorem 4) and B̃ (Theorem 5). Finally, in Sect. 6 we present the basic topics from
the second step of measure-free conditioning, i.e. referring to uncertainty measures:
the general Definition 5, the Theorem 6 for F1 and B̃, some examples and remarks,
including the Remark 9 for the alternative approach (∗).
The basic structure needed in the paper is that of an MV-algebra which we introduce
as follows.
Definition 1 (MV-algebras)
(i) A set L is called a residuated lattice if it is equipped with the two structures of a
bounded lattice (L, ≤, ∧, ∨) with universal upper (resp. lower) bound 1 (resp.
0) and a commutative monoid (semigroup with 1 as unit) (L, ), such that there
exist all residuals b → a given by the residuation property
cb ≤ a ⇐⇒ c ≤ b → a.
236 S. Weber
b = b → 0.
(MV) b ∨ a = (b → a) → a.
a b = (a b ) .
In the present paper we will often tacitly use the following well-known properties:
b = b.
b → a = b a = b ˙ (a ∧ b).
Conditioning for Boolean Subsets, Indicator Functions and Fuzzy Subsets 237
(iii) The lattice-meet can be expressed by the semigroup operations and the residual
by means of the divisibility property
b ∧ a = b (b → a) = b (b a).
The following well-known result is the starting point of the present paper.
a b = (a + b − 1) ∨ 0 , ¬a = 1 − a , a ⊕ b = (a + b) ∧ 1 ,
In the following we deal with the basics about mean value functions which will
play a crucial role in our presentation of conditioning.
Remark 2 (Compatible mean value functions) The additional property that a mean
value function C on an MV-algebra L is compatible implies that (C(0, 1)) = C(0, 1),
i.e. in L has to exist a self-complemented element which can serve as value C(0, 1).
if (a, b) = (0, 1)
(ii) C2 (a, b) = 1+a−b
with C2 (0, 21 ) = 0 ,
1
2
if (a, b) = (0, 1)
238 S. Weber
(iv) C4 (a, b) = b
1+b−a
with C4 (0, 21 ) = 1
3
.
In contrast to these four, the following mean value function, needed in Sect. 6, is not
compatible:
(v) C5 (a, b) = (1 − t) · a + t · b for some t ∈ [0, 1], t = 1
2
.
In the last part of this section, we present one concept of conditioning, first the
general definition introduced in [18], then the connection with the axiomatic approach
via conditioning operators introduced in [10] and, finally, the application to fuzzy
subsets. For more details see also [11], where this concept is extended to the more
general structure of Girard algebras.
C(a, b) = (a | b → a) for a ≤ b.
(iii) Vice versa, given any conditioning operator | on L, i.e. fulfilling the axioms
(C1) − (C5) from (i), then there exists a compatible mean value function C,
given by (ii) if and only if | satisfies the additional condition
Corollary 1 (Conditional fuzzy subsets) Let F = [0, 1]Ω be the MV-algebra of fuzzy
subsets of a universe Ω. Then each compatible mean value function C on [0, 1] leads
to a conditioning operator | on F, where its values are the “conditional fuzzy subsets”
given pointwise by
the subsets of F containing all indicator functions resp. all conditional indicator
functions.
Proof (i) is obtained by applying the conditionals (· | ·) in the form (CI) and the
values a b for a, b ∈ {0, 21 , 1} :
From this we obtain D directly and E = ((A1 ∩ B1 ∩ B2c ) ∪ (B1c ∩ A2 ∩ B2 ))c can
be rewritten into the given form, where D ⊆ E.
Formula (ii) follows directly:
From this we obtain F directly and G = ((Ac1 ∩ B1 ∩ B2c ) ∪ (B1c ∩ Ac2 ∩ B2 ))c can
be rewritten into the given form, where F ⊆ G.
Remark 4 (Measure-free conditioning) The form (CI) has a long history, for which
we refer to the book [9] and the references therein, where instead of 21 often the
value u but also other symbols as e.g. ? in [7] are used for this third “undefined”
or “undetermined” value. Some proposals for connecting these, sometimes named
“generalized indicator functions”, were given, very different among themselves and
from those in the preceding theorem. But all proposals correspond to “measure-free
conditioning”.
A completely different concept was proposed in [5], where the undetermined value
u is not a constant, i.e. is no longer the same number for all “conditional events”.
This concept will briefly be discussed in Remark 9.
Conditioning for Boolean Subsets, Indicator Functions and Fuzzy Subsets 241
(a b) = [ a ∧ b , b → a ] = [ a ∧ b , b ∨ a ].
The author get to know this construction during the talk [12], for details see the
book [9] and the references therein. An extension to MV-algebras was presented in
the author’s talk [16], for details see [17, 18]. In [11] this concept has been extended
to Girard algebras.
Therefore, in the following we will alternatively take the conditional events or the
intervals as elements of L̃.
Lemma 2 (The MV-algebra of conditional Boolean events) Let L̃ be the set of con-
ditional Boolean events from the preceding definition. Then the following assertions
hold:
(I1) (a 1) = [a, a] = {a}, i.e. L̃ extends L,
(I2) (a b) = (a ∧ b b).
Furthermore, L̃ is a lattice with respect to the partial ordering for intervals
where, therefore, (1 1) is the upper resp. (0 1) the lower universal bound. The
monotonicity properties follow from the lattice structure:
242 S. Weber
Proof The result was proved in [11] more generally for any MV-algebra L, but only
in terms of intervals. Rewriting these intervals into conditional events lead to the
given formulae for the MV-algebra operations as follows.
In order to obtain the first result apply, in this order, the definition of (· ·), the
definition of from Theorem 2.3 in [11], the preceding remark and known properties
in a Boolean algebra:
(a b) (c d) = [ a ∧ b , b ∨ a ] [ c ∧ d , d ∨ c ]
= [ (a ∧ b) ∧ (c ∧ d) , ((a ∧ b) ∧ (d ∨ c)) ∨ ((b ∨ a) ∧ (c ∧ d)) ]
= ( a ∧ b ∧ c ∧ d ( (a ∨ b ∨ (d ∧ c )) ∧ ((b ∧ a ) ∨ c ∨ d ) ) ∨ (a ∧ b ∧ c ∧ d) )
= ( a ∧ b ∧ c ∧ d ( (a ∧ b) ∨ (c ∧ d) ∨ (b ∧ d ) ∨ (a ∧ b ∧ c ∧ d) ).
Analogously and using also the former formulae and in the final step the property
(I2) the third formula follows:
The properties (I2) − (I5) for the interval based conditionals (a b) are com-
pletely analogous to (C2) − (C5) for the mean value based conditionals (a | b),
whereas the difference between (I1) and (C1) means simply that the original ele-
ments are embedded into the conditionals in the former construction and are special
conditionals in the latter one.
The preceding result was already proved in [9], mainly in Theorem 3 of Sect. 3,
which can be seen after rewriting the operations given there.
Conditioning for Boolean Subsets, Indicator Functions and Fuzzy Subsets 243
As already mentioned within the proof, in [11] the result was generalized to each
MV-algebra L where the set L̃ results to be a Girard algebra.
Structures L which are more general than an MV-algebra can also be extended to
the set L̃ of intervals. On the one hand, the Main Theorem 3.1 from [11] extends a
Girard algebra L to a Girard algebra L̃. On the other hand, Theorem 15 from [15]
deals with an “interval-valued residuated lattice” (IVRL) L̃, where the underlying
“base lattice” L results to be a residuated lattice. The particular case for α = 0 in
Theorem 15 from [15] corresponds to the Main Theorem 3.1 from [11]. But in both
situations, intervals cannot be rewritten into conditional events.
(CB) (A B) = [A ∩ B , B → A] = [A ∩ B , Bc ∪ A] for A, B ∈ B.
Then the result of the preceding lemma can be rewritten into the following
Proof Only the assertion referring to the orderings has not yet proved explicitely. In
order to establish the first equivalence, it follows from the left side that A1 ∩ B1 ⊆
A2 ∩ B2 (from “1 ≤ 1”) and Ac2 ∩ B2 ⊆ Ac1 ∩ B1 (from “0 ≤ 0”) and, therefore, also
B1 → A1 ⊆ B2 → A2 , i.e. the direction “⇒” is proved. The other direction in the
first equivalence can easily be obtained by checking the possible cases. The second
equivalence is precisely the definition of the ordering for intervals.
For the author it was a little surprising when he realized that the (very natural)
ordering in F1 is equivalent to the interval ordering in B̃ in such a direct way, that
this can be seen as a motivation for considering the interval ordering as the adequate
ordering.
5 Iteration of Conditioning
The conditional indicator functions are fuzzy subsets and, therefore, the general
construction (CF) from Sect. 2 can be applied and leads to the following
Lemma 3 (Iteration of conditional indicator functions) The iterated conditional
indicator functions have the form
((1A1 | 1B1 ) | (1A2 | 1B2 )) = 1 · 1A1 ∩B1 ∩A2 ∩B2 + (1 − c) · 1(A1 ∪B1c )∩B2c
1
+ · 1(B1c ∪Ac2 )∩B2 + c · 1Ac1 ∩B1 ∩B2c + 0 · 1Ac1 ∩B1 ∩A2 ∩B2
2
(1 | 1) = C(1, 1) = 1 , (1 | 21 ) = ( 21 | 21 ) = C( 21 , 1) = 1 − c ,
(1 | 0) = ( 21 | 0) = (0 | 0) = C(0, 1) = 21 , ( 21 | 1) = C( 21 , 21 ) = 21 ,
(0 | 21 ) = C(0, 21 ) = c , (0 | 1) = C(0, 0) = 0 .
these two mean value functions lead to the following two types of iteration, namely
the
“blurred iteration” for C1 : ((1A1 | 1B1 ) | (1A2 | 1B2 )) = (1A1 | 1B1 ∩A2 ∩B2 ) ,
“sharper iteration” for C2 : ((1A1 | 1B1 ) | (1A2 | 1B2 )) = (1A1 ∪B1c | 1(B1 ∩A2 )∪B2c ) .
Moreover, these two types of iterations can be obtained from any conditioning oper-
ator | on F1 , i.e. fulfilling the conditions (C1) − (C5), with the additional property
(0 | 21 ) = 21 for the blurred resp. (0 | 21 ) = 0 for the sharper iteration.
Proof The formulae follow from the preceding lemma and the property (C2).
On the one hand, the value c = 21 leads to the blurred iteration
(1A1 ∩B1 ∩A2 ∩B2 | 1B1 ∩A2 ∩B2 ) = (1A1 | 1B1 ∩A2 ∩B2 ).
(1(A1 ∩B1 ∩A2 )∪(A1 ∩B2c )∪(B1c ∩B2c ) | 1(B1 ∩A2 )∪B2c ) = (1A1 ∪B1c | 1(B1 ∩A2 )∪B2c ).
Finally, we have to prove the additional condition (cmvf ) for any conditioning oper-
ator | on F1 . For that purpose, let ϕ1 = (1A1 | 1B1 ), ϕ2 = (1A2 | 1B2 ), ψ = (1A | 1B )
with ϕ1 ≤ ϕ2 . Then
(ϕ1 ∧ ψ | ψ → ϕ1 )
= (1 · 1A1 ∩B1 ∩A∩B + 1
2
· 1(A1 ∩Bc )∪(B1c ∩A)∪(B1c ∩Bc ) + 0 · 1(Ac1 ∩B1 )∪(Ac ∩B) |
1 · 1(Ac ∩B)∪(A1 ∩B1 )∪(B1c ∩Bc ) + 1
2
· 1(Ac1 ∩B1 ∩Bc )∪(B1c ∩A∩B) + 0 · 1Ac1 ∩B1 ∩A∩B ).
For (0 | 21 ) = 1
2
this is equal to
(1A1 ∩B1 ∩A∩B | 1(A1 ∩B1 ∩B)∪(Ac ∩B) ) ≤ (1A2 ∩B2 ∩A∩B | 1(A2 ∩B2 ∩B)∪(Ac ∩B) )
= (ϕ2 ∧ ψ | ψ → ϕ2 ),
because of
(A1 ∪ B1c ) ∩ A ∩ B = (B1 → A1 ) ∩ A ∩ B ⊆ . . . and
Let us observe that for the blurred iteration only the part A1 ∩ B1 ⊆ A2 ∩ B2 of
the inequality ϕ1 ≤ ϕ2 was needed, whereas for the sharper iteration only the other
part B1 → A1 ⊆ B2 → A2 was needed.
In the preceding theorem and in the following, the second type we call “sharper
iteration” because the conditionals with the undetermined value as condition have
the “sharp values” (0 | 21 ) = 0 and (1 | 21 ) = ( 21 | 21 ) = 1, whereas the first type we
call “blurred iteration” because all these conditionals have the “blurred value” 21 .
In the following two corollaries we will specify the result of the preceding theorem
for two special situations, which will be taken up and briefly discussed at the end of
this section.
Now we will prove the result of an iteration process for conditional Boolean
subsets using essentially the isomorphism between these and the conditional indicator
functions. Although the result can also be obtained as a particular case of the more
general situation of Sect. 5 in [11], the formulation of the following theorem and
its proof are very different from [11] and show cleary the analogy of both iteration
processes and that, furthermore, also in the following setting there are exactly two
types of iteration.
Theorem 5 (Iteration within B̃ ) Let ((1A1 | 1B1 ) | (1A2 | 1B2 )) ∈ F1 be any of the
two types of iteration of conditional indicator functions from the preceding theo-
rem and let i : F1 −→ B̃ be the isomorphism from Sect. 4. Then there exist two
conditioning operators ˜| on B̃ given by
1 1 1
((A1 B1 ) ˜| (A2 B2 )) = (A1 B1 ∩ A2 ∩ B2 ) for (0 | ) = C1 (0, ) = ,
2 2 2
1 1
((A1 B1 ) ˜| (A2 B2 )) = (A1 ∪ B1c (B1 ∩ A2 ) ∪ B2c ) for (0 | ) = C2 (0, ) = 0 .
2 2
Moreover, the two conditioning operators ˜| are generated by two compatible mean
value functions C̃k on B̃ given by
which result to be
C̃1 ( [α, β] , [γ , δ] ) = [α, δ] for C1 ,
Proof The assertion that both types of ˜| are conditioning operators on B̃ generated by
some compatible mean value function follows immediately from the facts that both
types of | are conditioning operators on F1 fulfilling (cmvf ) and the isomorphism i
between these two MV-algebras. Naturally, it is needed that i is an isomorphism with
respect to the partial orderings, the lattice operations and the MV-algebra operations.
This will be illustrated in the following only for two properties, namely for (C1):
and apply the formulae for the blurred and sharper iteration from the preceding
theorem.
In [10] the formula for the blurred iteration has been proposed as one concrete
example, in [11] the formulae for both types of iteration appear as concrete examples.
But here we have shown that there do not exist more. We will compare these two
types of iteration in the following
Remark 6 (Comparison between blurred and sharper iteration in B̃) Let us write
the results of the two types of iteration from the preceding theorem as intervals:
( (A1 B1 ) ˜| (A2 B2 ) ) = [ αk , βk ] ,
where k = 1 resp. k = 2 correspond to the blurred resp. sharper iteration. Then the
following assertions can be established as an exercise. It follows that
(i) in general, α1 ⊆ α2 ⊆ β2 ⊆ β1 , i.e. [α2 , β2 ] ⊆ [α1 , β1 ],
(ii) particularly, [α2 , β2 ] = [α1 , β1 ] if and only if B2 = Ω, where [αk , βk ] = (A1
B1 ∩ A2 ),
(iii) α2 = β2 if and only if B2 ⊆ B1 ∩ A2 , where {α2 } = (B1 → A1 Ω) ⊆ (A1
B2 ) = [α1 , β1 ].
Part (i) of the preceding remark shows that the two types of iteration lead to
conditional Boolean subsets which are not comparable with respect to the partial
ordering ≤ but with respect to the set-inclusion ⊆ in B̃. Moreover, this relation ⊆
can be interpreted as “sharper than” resp. ⊇ as “more blurred than”, because a sharper
interval has less Boolean subsets than a more blurred interval. In this sense, [α2 , β2 ]
as result of the sharper iteration is “sharper than” [α1 , β1 ] as result of the blurred
iteration.
In the former papers [10, 11, 20, 21] we used the identification of an interval [α, β]
with the ordered pair (α, β) of its endpoints α, β in an MV-algebra L. Therefore, the
singletons {α} = [α, α] are identified with the pairs (α, α) and we will refer to as
elements of the “diagonal” of L̃ in both cases. This will be used in the following
of the diagonal of B̃ permits to recover the conditional Boolean subsets via the
property
(D) ({A} ˜| {B}) = (A B).
(({A1 }˜|{B1 }) ˜| ({A2 }˜|{B2 })) using the conditioning operator ˜| on B̃,
Particularly, ({A}˜|{Ω}) = {A} and (1A | 1Ω ) = 1A both are written for short as
(A | Ω) = A.
Now, using the preceding notation, we can rewrite the results of the two corollaries
from the beginning of this section not only as iterates in F1 but at the same time as
iterates in B̃.
Remark 7 (Blurred vs. sharper iterations) For the blurred iterations the following
three special iterations lead to the same result.
(i) ((A | B) | D) = (A | B ∩ D),
(ii) (A | (B | D)) = (A | B ∩ D),
(iii) ((A | D) | (B | D)) = (A | B ∩ D).
For the sharper iterations property (i) remains valid, but the other two lead to the
following different results.
(ii)* (A | (B | D)) = (A | D → B),
(iii)* ((A | D) | (B | D)) = (D → A | D → B).
These tree special iterations were treated and discussed by many authors but mostly
in an ad hoc manner.
One of the references in which the first two are systematically discussed is [7]
at the end of Sect. 2.4. There the authors justified property (i) by the “convention
? | 0 = ? = ? | 1”. This corresponds precisely to the two properties C(0, 1) = 21 =
250 S. Weber
C( 21 , 21 ) valid for any compatible mean value function C on [0, 1]. Calabrese ([2])
took (i) as definition for ((A | B) | D).
In the following the authors made out clearly the two possible meanings for
(A | (B | D)), the first justified by “1 | ? = ? = 0 | ?” which corresponds to C1 and
leads to property (ii) and the second by “1 | ? = 1 and 0 | ? = 0” which corresponds
to C2 and leads to property (ii)* taken by Calabrese as definition for (A | (B | D)).
Based on these arguments the authors of [7] proposed (but without additional jus-
tification) two possible meanings for the general iteration ((A | B) | (D | E)). As
first definition for this they proposed (A | B ∩ D ∩ E) and called it “associative
definition” due to the equality ((A | B) | D) = (A | (B | D)). This is indeed our
blurred iteration corresponding to the choice C1 . But as second definition they
proposed (A | B ∩ (E → D)) which is quite different from our sharper iteration
(B → A | E → (B ∩ D)) we derived from the choice C2 .
For the third special iteration, Copeland (see e.g. the book [13]) took property (iii)
as definition for ((A | D) | (B | D)) which corresponds to our blurred iteration. It
seems that property (iii)*, which corresponds to the sharper iteration, has not been
considered in the literature.
Finally, let us observe that the iteration process proposed in [9], Sect. 8.1, is
completely different because its result does not remain in B̃ but is an interval of
elements of B̃.
in the process of extending the additivity on a finite MV-algebra L to L̃, there appears
the compatible mean value function M from the example (iv) in Sect. 2, see also one
of the following examples.
Returning now to the MV-algebras of conditionals, we will present the corre-
sponding results in the following
which are “generated by” mean value functions M on [0, 1], resp.
1
m(1A | 1B ) = μ̃(A B) =for μ(B) = 0,
2
1
particularly m(1∅ | 1∅ ) = μ̃(∅ ∅) = .
2
(iii) An additive (probability) measure μ has a unique extension to an additive
measure μ̃ resp. m given by
1
m(1A | 1B ) = μ̃(A B) = μ(A ∩ B) + · μ(Bc ) = (1A | 1B ) dμ ,
2 Ω
where μ̃ is generated by
x+y
M(x, y) = .
2
from the example (iii) in Sect. 2.
252 S. Weber
The isotonicity condition (M2) follows from the isotonicity of μ and M and the
isomorphism i.
(ii) Also the compatibility condition is transfered from M and μ to μ̃ and m:
x 1
M(x, y) = for (x, y) = (0, 1), M(0, 1) = ,
1+x−y 2
μ(A ∩ B)
m(1A | 1B ) = μ̃(A B) = μB (A) for μ(B) > 0, where μB (A) = .
μ(B)
Particularly, it follows that μ̃(B B) = 1 for μ(B) > 0 and μ̃(B B) = 21 for
μ(B) = 0.
By part (ii) of the preceding theorem this extension is a compatible measure. By part
(iii) it is not additive, but only “levelwise additive”:
˙ 1A2 | 1B ) = μ̃(A1 ∪˙ A2 B)
m(1A1 ⊕
= μ̃(A1 B) + μ̃(A2 B) (but only) for μ(B) > 0.
μ(A ∩ B) + μ(Bc )
m(1A | 1B ) = μ̃(A B) = .
1 + μ(Bc )
But the extensions μ̃ resp. m are neither additive nor levelwise additive.
1
M(x, y) = (1 − t) · x + t · y with some t ∈ [0, 1], t = .
2
These have the following form
(∗) 1A|B = 1 · 1A∩B + t(A | B) · 1Bc + 0 · 1Ac ∩B with some t(A | B) ∈ [0, 1] ,
where we used the notation 1A|B from [5] instead of our notation (1A | 1B ). Really,
in spite of the very similar expressions (CI) and (∗), the difference between them
is essential. While (CI) is a “measure-free” definition of conditionals, the approach
based on (∗) requires some knowledge or assumptions for the undetermined values
t(A | B). For instance, if we assume that the (reasonable) property 1 − 1A|B = 1Ac |B
holds then, applying (∗) to both 1A|B and 1Ac |B , it follows that t(Ac | B) = 1 − t
(A | B).
Indeed, several authors take (∗) as definition for a “truth-value of the conditional event
A | B” and show that t(A | B) fulfills the axioms of a general conditional probability,
254 S. Weber
see the above mentioned paper [5] and the references therein. In [6], models of
“Partial Algebraic Conditional Spaces” over a Boolean algebra B of subsets of the
universe Ω are axiomatically introduced and it is shown that they are in a one-to-one
correspondence to “De Finetti-Popper conditional probabilities”.
Now, if μ is a probability (measure) on the Boolean algebra B then it follows from
(∗) that
(∗∗) m(1A|B ) = 1A|B dμ = μ(A ∩ B) + t(A, B) · μ(Bc ) ,
Ω
compare with the result of the preceding example. On the other hand, it follows from
(∗∗) for 0 < μ(B) < 1 that
μ(A ∩ B) μ(A ∩ B)
m(1A|B ) = if and only if t(A | B) = ,
μ(B) μ(B)
i.e. in this setting we have the classical conditional probability t(A | B) = μB (A) =
m(1A|B ).
7 Conclusions
All authors treating measure-free conditioning agree that a conditional element inter-
preted as “a given b” for the (unconditional) elements a, b from some structured set
L should be defined using the elements interpreted as “a and b” resp. “b implies a”.
For an MV-algebra L there are two reasonable ways:
In the interval approach, a conditional element is defined as
(a b) = [a ∧ b , b → a] ∈ L̃ for a, b ∈ L,
i.e. as the set of all elements between a ∧ b and b → a. Particularly, it follows that
(0 0) = L.
In the mean value approach, a conditional element is defined as
(a | b) = C(a ∧ b , b → a) ∈ L for a, b ∈ L,
i.e. as some element between a ∧ b and b → a. It is crucial that this approach requires
that L should have a self-complemented element u which results to be (0 | 0) = u.
Therefore, for a Boolean algebra L only the first approach is applicable. But for
a Boolean algebra L = B of subsets of a universe Ω, we can use the one-to-one
correspondence between Boolean subsets A ∈ B and its indicator functions 1A as
special fuzzy subsets and then apply the second approach. The present paper contains
three main results:
Conditioning for Boolean Subsets, Indicator Functions and Fuzzy Subsets 255
Acknowledgments I am very grateful to Peter Klement, as organizator and motor of his Linz
Seminars where I received a lot of stimulations since my first participation in 1983, and as friend
from our common time working together and in private occations.
References
19. Weber, S.: Uncertainty measures—problems concerning additivity. Fuzzy Sets Syst. 160, 371–
383 (2009)
20. Weber, S.: A complete characterization of all weakly additive measures and of all valuations
on the canonical extension of any finite MV-chain. Fuzzy Sets Syst. 161, 1350–1367 (2010)
21. Weber, S.: Measure-free conditioning and extensions of additive measures on finite MV-
algebras. Fuzzy Sets Syst. 161, 2479–2504 (2010)
Multivalued Functions Integration:
from Additive to Arbitrary Non-negative
Set Function
Endre Pap
1 Introduction
E. Pap (B)
Singidunum University, Danijelova 32, 11000 Belgrade, Serbia
e-mail: [email protected]
E. Pap
Óbuda University, Becsi út 92, Budapest 1034, Hungary
be set-valued i.e., for each quality factor there exists a multiple score or a set of
estimations.
A generalization of the classical integrals is related to their extension on non-
additive measures. For non-additive integrals see [8, 37, 39, 46, 47, 50]. Inte-
grals of multifunctions have been defined in different ways: Aumann method
[4, 7, 10], by the Rådström-Hörmander [53] embedding theorem [17], via Pettis
method [11, 19, 58, 59], using (as in Dunford [20]) sequences of multifunctions
[13, 16, 42], using finite or countable sums that generalize the Riemann sums
[5, 6, 10, 33], via Choquet or Sugeno integrals [14, 15, 41, 45, 49, 58–63], as
extension of pseudo-integral [28]. For survey on multivalued integrals of Aumann
and Debreu types see [55]. The Gould integral was defined in [27] via finite sums for
real functions relative to a finitely additive vector measure μ. Different generaliza-
tions of the Gould integral were introduced and studied in [22–25, 51, 56] for real
functions and in [52] for multifunctions relative to a finitely additive vector measure
(via Aumann method). In this paper we define and study another set-valued Gould
type integral of multifunctions (taking values in the space of all nonempty bounded
subsets of a real Banach space) based on an arbitrary non-negative set function.
We define the integral as a limit of a net of finite integral sums. Many important
properties of the integral work with an arbitrary non-negative set function μ without
supplementary conditions on μ.
This paper is organized as follows. In Sect. 2 we collect some facts about families
of subsets of Banach space, set functions and general multimeasures. Section 3 deals
with the integration of strongly measurable multifunctions and, more particularly,
with those that can be approximated by simple measurable multifunctions in the
sense of the Hausdorff distance, the Aumann integral, whose construction is based on
integrable selections and which has become the most popular set-valued integral, and
Aumann-Gould integral. In Sect. 4 is presented a multi-valued Choquet integral based
on multisubmeasure. In Sect. 5 we define a Gould type integral for multifunctions
relative to an arbitrary non-negative set function and point out some relationships
between integrability and total measurability. There are given some basic properties
of the integrable multifunctions, and some continuity properties of the multimeasure
induced by set-valued integral. Section 6 is for the conclusion.
2 Preliminaries
Let T be an abstract nonempty set, P(T ) the family of all subsets of T , A an algebra
of subsets of T, (X, · ) a real Banach space with the metric d induced by its norm,
P0 (X ) the family of all nonempty subsets of X , Pb (X ) the family of all nonempty
bounded subsets of X , Pbc (X ) the family of all nonempty bounded convex subsets
of X , Cb (X ) the family of all nonempty bounded closed subsets of X , Cbc (X ) the
Multivalued Functions Integration: from Additive … 259
family of all nonempty bounded closed convex subsets of X and Kc (X ) the family
of all nonempty compact convex subsets of X .
For every A, B ∈ P0 (X ) and every α ∈ R, let
A + B = {x + y | x ∈ A, y ∈ B},
α A = {αx | x ∈ A}.
We denote by A the closure of A with respect to the topology induced by the norm
of X . Let + be the Minkowski addition on P0 (X ), i.e.,
A + B = A + B (A, B ∈ P0 (X )).
and d(x, B) = inf d(x, y). (Cb (X ), h) and (Kc (X ), h) are complete metric spaces,
y∈B
see [32]. We denote |A| = h(A, {0}), for every A ∈ P0 (X ), where 0 is the origin of
X.
Relations of the Hausdorff metric h with respect to the operations + and +
are given by the following inequalities. If A, B, C, D, Ai , Bi ∈ P0 (X ), for every
i ∈ {1, . . . , n} and n ∈ N, then
n
n
n
h Ai , Bi h(Ai , Bi ),
i=1 i=1 i=1
π ∧ π = {Ai ∩ B j | i = 1, . . . , n; j = 1, . . . , m}.
We have by [46].
Definition 2 Let μ : A → [0, ∞] be a non-negative set function with μ(∅) = 0.
We say that μ is:
(i) monotone if μ(A) μ(B), for every A, B ∈ A , with A ⊆ B;
(ii) subadditive if μ(A ∪ B) μ(A) + μ(B), for every A, B ∈ A , with A ∩
B = ∅;
(iii) submeasure if μ is monotone and subadditive;
∞
(iv) σ -subadditive if μ(A) μ(An ), for every sequence of pairwise disjoint
n=1
∞
sets (An )n∈N ⊂ A , with A = An ∈ A ;
n=1
(v) finitely additive if μ(A ∪ B) = μ(A) + μ(B) for every disjoint A, B ∈ A ;
n
(vi) measure if lim μ(Ak ) = μ(A), for every sequence of pairwise disjoint
n→∞ k=1
∞
sets (An )n∈N ⊂ A , with A = An ∈ A ;
n=1
(vii) increasing convergent if lim μ(An ) = μ(A), for every increasing sequence
n→∞
∞
of sets (An )n∈N ⊂ A (i.e. An ⊂ An+1 , for every n ∈ N), with ∪ An = A ∈ A
n=1
(denoted by An A);
(viii) decreasing convergent if lim μ(An ) = μ(A), for every decreasing sequence
n→∞
∞
of sets (An )n∈N ⊂ A , i.e. An+1 ⊂ An , for every n ∈ N, with ∩ An = A ∈ A
n=1
(denoted by An A), and μ(A1 ) < ∞;
(ix) order-continuous (shortly, o-continuous) if lim μ(An ) = 0, for every decreas-
n→∞
ing sequence of sets (An )n∈N ⊂ A , with An ∅.;
(x) exhaustive if lim μ(An ) = 0, for every sequence of pairwise disjoint sets
n→∞
(An )n∈N ⊂ A .
We introduce two types of variations of set function based on [46].
Definition 3 We consider the following set functions associated to an arbitrary set
function μ : A → [0, ∞] with μ(∅) = 0:
(i) μ (the disjoint variation of μ) defined, for every A ∈ A , by
n
μ(A) = sup μ(Bi ) ,
i=1
where the supremum is considered over all finite partitions {Bi }i=1 n
of A,
with Bi ∈ A , for every i ∈ {1, . . . , n}. μ is said to be of finite variation if
μ(T ) < ∞.
Multivalued Functions Integration: from Additive … 261
(ii)
μ defined, for every A ⊆ T , by
μ(A) = inf{μ(B) | A ⊆ B, B ∈ A }.
The theory of set-valued measures (multimeasures) whose values are subsets of some
Banach space, under additivity condition, was investigated in [2, 12, 18, 26]. Some
of the motivations were the applications to mathematical economics and to statistics.
To extend the notion of σ -additivity, there are at least three possible definitions
depending on the series summability concept considered in the space of closed sets,
see [30].
Here we shall consider general multimeasures with some additional properties.
We have by [22].
Definition 7 Let M : A → P0 (X ) be a set multifunction, with M(∅) = {0}. M is
said to be:
(i) monotone if M(A) ⊆ M(B), for every A, B ∈ A , with A ⊆ B;
(ii) additive multimeasure if M(A ∪ B) = M(A) + M(B), ∀A, B ∈ A , A ∩ B =
∅;
(iii) absolutely continuous with respect to μ if for every ε > 0, there is δ > 0
such that for every A ∈ A with μ(A) < δ, we have |M(A)| < ε (denoted by
M μ);
262 E. Pap
where the supremum is considered over all finite partitions {Bi }i=1
n
of A, with
Bi ∈ A , ∀i ∈ {1, . . . , n};
(x) null-additive if for every A, B ∈ A with M(A) = {0} we have M(A ∪ B) =
M(B);
(xi) weakly null-additive if for every A, B ∈ A with M(A) = M(B) = {0} we
have M(A ∪ B) = {0}.
n
E(F) = p(Ai )K i (1)
i=1
When X is finite dimensional, the closure operation is not necessary. Given a sub-
space C of Cb , we denote by L 1 (C , A ) the class of strongly A -measurable
multifunctions with values in C such that E||F|| <, where E is the integral given
by (1), and by S (C , A ) the subclass of L 1 (C , A ) whose members are strongly
A -measurable simple multifunctions.
Now, we construct the integral of a strongly measurable multifunction F : T →
C , where C is a τ H -separable subspace of Ccb . Without restriction, we assume
that C is τ H -closed, and stable under the Minkowski addition and multiplication by
positive scalar.
Definition 9 For a strongly A -measurable simple multifunction F : T → C , we
define the map Φ : S (C , A ) → C by
We have by [29].
Theorem 1 The map Φ : S (C , A ) → C given by (2) is extended to a map Φ̃
from L 1 (C , A ) into C (called strong integral) with the following properties
Remark 2 The preceding approach for defining the set-valued integral is explicit, in
that it starts with simple multifunctions and uses only elementary operations such as
the Minkowski addition and the scalar multiplication. An alternative approach for
constructing the set-valued conditional expectation for a compact convex multifunc-
tion is given in [17].
The following notion of integral for multifunctions was introduced by Aumann [4].
The relation between strong and Aumann integral is given in the following theorem.
Theorem 2 Let F ∈ M (C (X )). If we assume in addition that F is integrable, then
the following property hold. If F is integrably bounded and takes on its values in a
τ H -separable subspace of Cbc (X ), then one has
Φ̃(F) = I (F)
where Φ̃(F) denotes the integral as defined in Sect. 3.1 (Theorem 1).
Multivalued Functions Integration: from Additive … 265
The Gould integral was introduced in [27] using finite sums for real functions with
respect to a finitely additive vector measure μ. If μ is a countably additive vector
measure of finite variation, then the Gould integral coincides with the Dunford inte-
gral and the Gelfand-Pettis integral. Different generalizations of the Gould integral
were introduced and studied in [22–25, 51, 56]. We consider in this section a Gould
integral of multifunction with respect to a finitely additive multimeasure.
We have by [52].
The set T f d M is called the M-Gould integral of f.
is called
the Aumann-Gould integral of the function f : T → R on A, where
(G) f dμ denotes of f with respect to the vector measure μ. We
the Gould integral
denote it by (AG) A f d M. If (AG) A f d M = ∅, we say that f is Aumann-Gould
integrable on A.
Jang et al. [34] have introduced a multi-valued Choquet integral for multifunction
taking values in the class of all closed, nonempty sets of the interval [0, ∞[ based
on a nonnegative monotone set function, for which the selectors are real Choquet
integrals.
We present in this section another multivalued Choquet integral, this time for non-
negative functions with respect to multisubmeasures taking values in Kc ([0, ∞[).
We can associate to M : A → Kc ([0, ∞[) two nonnegative set functions: μ1 (A) =
inf M(A), and μ2 (A) = sup M(A), for every A ∈ A , such that M(A) = [μ1 (A),
μ2 (A)], for every A ∈ A , and set of selectors for M is nonempty since contains the
submeasures μ1 and μ2 . Let be f : T → [0, ∞[ a measurable function. We denote
C
by S M ( f ) the set of all monotone set functions μ : A → [0, ∞[ which are selectors
of the multisubmeasure M with respect to which the function f is Choquet integrable.
Using a similar procedure as in Sect. 3.3 for Aumann-Gould integral we introduce
the following notion by [57].
We present here some recent results without proofs, which can be found in [48].
n
σ F,μ (π ) = F(ti )μ(Ai ) = μ(A1 )F(t1 ) + · · · + μ(An )F(tn )
i=1
n
(ii) Generally, if F = E i · 1 Ai , E i ∈ Cbc (X ), i ∈ {1, . . . , n}, {Ai }i=1
n
⊂ A is
i=1
a partition of T and 1 Ai is the characteristic function of Ai , for every i ∈
{1, . . . , n}, then F is μ-integrable and
Multivalued Functions Integration: from Additive … 269
n
(G) F dμ = E i μ(Ai ).
T i=1
Particularly, if f 1 = f 2 = f, then
(G) F dμ = f dμ .
T T
We have by [46].
Definition 17 If μ : A → [0, ∞[ is a non-negative set function, with μ(∅) = 0,
then a set A ∈ A is said to be an atom of μ if μ(A) > 0 and for every B ∈ A , with
B ⊆ A, we have μ(B) = 0 or μ(A\B) = 0.
In this section we list some general properties of the integral introduced in Definition
16, based on [48]. Suppose μ : A → [0, ∞[ is a non-negative set function, with
μ(∅) = 0.
For special multifunctions (interval valued) we can prove some additional properties.
6 Conclusion
Acknowledgments This research was supported by the grant MNPRS 174009 and by the project
“Mathematical models of intelligent systems and their applications” which was supported by the
Provincial Secretariat for Science and Technological Development of Vojvodina.
References
1. Apreutesei, G.: Cauchy nets and convergent nets on semilinear topological spaces. Topology
Appl. 159, 2922–2931 (2012)
2. Artstein, Z.: Set-valued measures. Trans. Amer. Math. Soc. 165 (1972)
3. Aubin, J.P., Frankowska, H.: Set-Valued Analysis. Birkhäuser, Boston (1990)
4. Aumann, R.J.: Integrals of set-valued maps. J. Math. Anal. Appl. 12, 1–12 (1965)
5. Boccuto, A., Sambucini, A.R.: A McShane integral for multifunctions. J. Concr. Appl. Math.
2(4), 307–325 (2004)
6. Bongiorno, B., Pfeffer, W.F., Thomson, B.S.: A full descriptive definition of the gage integral.
Canad. Math. Bull. 39(4), 390–401 (1996)
7. Brink, H.E., Maritz, P.: Integration of multifunctions with respect to a multimeasure. Glasnik
Math. 35, 313–334 (2000)
8. Cao, Y.: Aggregating multiple classification results using Choquet integral for financial distress
early warning. Expert Syst. Appl. 38(7), 8285–8292 (2011)
9. Caponetti, D., Di Piazza, L., Kadets, V.: Description of the limit set of Henstock-Kurzweil
integral sums of vector-valued functions. J. Math. Anal. Appl. 421, 1151–1162 (2015)
10. Cascales, B., Rodriguez, J.: Birkhoff integral for multi-valued functions. J. Math. Anal. Appl.
297, 540–560 (2004)
11. Cascales, B., Kadets, V., Rodriguez, J.: The Pettis integral for multivalued functions via single-
valued functions. J. Math. Anal. Appl. 332, 1–10 (2007)
12. Costé, A.: Sur les multimesures valeurs fermes bornes d’un espace de Banach. C.R. Acad. Sci.
Paris 280, 567–570 (1975)
13. Croitoru, A.: An integral for multifunctions with respect to a multimeasure. An. Şt. Univ. “Al.
I. Cuza” Iaşi 49, 95–106 (2003)
14. Croitoru, A.: Fuzzy integral of measurable multifunctions. Iran. J. Fuzzy Syst. 9(4), 133–140
(2012)
15. Croitoru, A.: Strong integral of multifunctions relative to a monotone measure. Fuzzy Sets
Syst. 244, 20–33 (2014)
16. Croitoru, A., Godet-Thobie, C.: Set-valued integration in seminorm. I. Annals of University of
Craiova, Mathematics and Computer Science Series, vol. 33, pp. 16–25 (2006)
17. Debreu, G.: Integration of correspondences. In: Proceedings 5th Berkely Symposium on Math-
ematical Statistics and Probability II, Part. I, pp. 351–372 (1967)
Multivalued Functions Integration: from Additive … 273
18. Debreu, G., Schmeidler, D.: The Radon-Nikodym derivative of a correspondence. In: Proceed-
ings of the Sixth Berkeley Symposium of Mathematical Statistics and Probability (1971)
19. Di Piazza, L., Musial, K.: Set-valued Kurzweil-Henstock-Pettis integral. Set-Valued Anal. 13,
167–179 (2005)
20. Dunford, N., Schwartz, J.: Linear Operators I. General Theory. Interscience, New York (1958)
21. Frankowska, H.: An open mapping principle for set-valued map. J. Math. Anal. Appl. 127,
172–180 (1987)
22. Gavriluţ, A.: On some properties of the Gould type integral with respect to a multisubmeasure.
An. Şt. Univ. “Al. I. Cuza” Iaşi 52(1), 177–194 (2006)
23. Gavriluţ, A.: Fuzzy Gould integrability on atoms. Iran. J. Fuzy Syst. 8(3), 113–124 (2011)
24. Gavriluţ, A.: The general Gould type integral with respect to a multisubmeasure. Math. Slovaca
60(3), 289–318 (2010)
25. Gavriluţ, A., Iosif, A.E., Croitoru, A.: The Gould integral in Banach lattices. Positivity 19,
65–82 (2015)
26. Godet-Thobie, C.: Some results about multimeasures and their selections. In: Measure Theory
at Oberwolfach 1979, Lecture Notes in Mathematics, vol. 794, pp. 112–116. Springer (1980)
27. Gould, G.G.: Integration over vector-valued measures. Proc. London Math. Soc. 15, 193–205
(1965)
28. Grbić, T., S̆tajner-Papuga, I., S̆trboja, M.: An approach to pseudo-integration of set-valued
functions. Inf. Sci. 181(11), 2278–2292 (2011)
29. Hess, C.: Conditional expectation and martingales of random sets. J. Pattern Recognit. 32,
1543–1567 (1999)
30. Hess, C.: Set-valued integration and set-valued probability theory: an overview. in [47],
617–673
31. Hildebrand, W.: Core and Equilibria of a Large Economy. Princeton University Press (1974)
32. Hu, S., Papageorgiou, N.S.: Handbook of Multivalued Analysis, vol. I. Kluwer Academic
Publishers, Dordrecht (1997)
33. Hukuhara, M.: Integration des applications mesurables dont la valuer est un compact convexe.
Funkcialaj Ekvacioj 10, 205–223 (1967)
34. Jang, L.C., Kwon, J.S.: On the representation of Choquet integrals of set-valued functions, and
null sets. Fuzzy Sets Syst. 112, 233–239 (2000)
35. Kendall, M.G., Moran, P.A.P.: Geometrical Probability. Charles Griffin, London (1963)
36. Klement, E.P., Mesiar, R., Pap, E.: A universal integral as common frame for Choquet and
Sugeno integral. IEEE Trans. Fuzzy Syst. 18(1), 178–187 (2000)
37. Klement, E.P., Mesiar, R., Li, J., Pap, E.: Integrals based on monotone set functions. Fuzzy
Sets Syst. 281, 88–102 (2015)
38. Kudo, H.: Dependent experiments and sufficient statistics. Natur. Sci. Rep. Ochanomizu Univ.
4, 151–163 (1954)
39. Kuncová, K., Malý, J.: Non-absolutely convergent integrals in metric spaces. J. Math. Anal.
Appl. 401, 578–600 (2013)
40. Li, J., Mesiar, R., Pap, E., Klement, E.P.: Convergence theorems for monotone measures. Fuzzy
Sets Syst. 281, 103–127 (2015)
41. Liu, W.L., Song, X.Q., Zhang, Q.Z., Zhang, S.B.: (T) Fuzzy integral of multi-dimensional
function with respect to multi-valued measure. Iran. J. Fuzzy Syst. 9(3), 111–126 (2012)
42. Martellotti, A., Sambucini, A.R.: A Radon-Nikodym theorem for multimeasures. Atti Sem.
Mat. Fis. Univ. Modena 42, 579–599 (1994)
43. Matheron, G.: Random Sets and Integral Geometry. Wiley (1975)
44. Molchanov, I.S.: Limit Theorems for Unions of Random Closed Sets. Lecture Notes in Math-
ematics, vol. 1561. Springer (1993)
45. Narukawa, Y., Torra, V.: Multidimensional generalized fuzzy integral. Fuzzy Sets Syst. 160,
802–815 (2009)
46. Pap, E.: Null-Additive Set Functions. Kluwer Academic Publishers, Dordrecht (1995)
47. Pap, E.: Handbook of Measure Theory. Elsevier (2002)
274 E. Pap
48. Pap, E., Gavriluţ, A., Croitoru, A.: Gould type integral of multifunctions relative to non-negative
set functions (submitted)
49. Park, C.K.: Set-valued Choquet-Pettis integrals. Korean J. Math. 20(4), 381–393 (2012)
50. Pham, T.D., Brandl, M., Nguyen, N.D., Nguyen, T.V.: Fuzzy measure of multiple risk factors
in the prediction of osteoporotic fractures. In: Proceedings of the 9-th WSEAS International
Conference on Fuzzy Systems (FS’08), pp. 171–177 (2008)
51. Precupanu, A., Croitoru, A.: A Gould type integral with respect to a multimeasure, I/I. An. Şt.
Univ. “Al. I. Cuza” Iaşi 48, 165–200/ 49 (2003); 183–207 (2002)
52. Precupanu, A.M., Satco, B.: The Aumann-Gould integral. Medit. J. Math. 5, 429–441 (2008)
53. Rådström, H.: An embedding theorem for spaces of convex sets. Proc. Amer. Math. Soc. 3,
151–158 (1952)
54. Robbins, H.E.: On the measure of random set I. Ann. Math. Stat. 15, 70–74 (1944)
55. Sambucini, A.R.: A survey on multivalued integration. Atti Sem. Mat. Fis. Univ. Modena 50,
53–63 (2002)
56. Satco, B.: A Vitali type theorem for the set-valued Gould integral. An. Şt. Univ. “Al. I. Cuza”
Iaşi 51, 191–200 (2005)
57. Sofian-Boca, F-N.: A multi-valued Choquet integral with respect to a multisubmeasure. An.
Şt. Univ. “Al. I. Cuza” Iaşi 61, 1, 129–152 (2015)
58. Stamate, C.: Vector fuzzy integral. Recent Advances in Neural Network. Fuzzy Systems and
Evolutionary Computing, pp. 221–224 (2010)
59. Stamate, C., Croitoru, A.: Non-linear integrals, properties and relationships. Recent Advances
in Telecommunications, Signals and Systems (Proceedings of NOLASC’13), WSEAS Press,
118–123 (2013)
60. Tversky, A., Kahneman, D.: Advances in prospect theory: commulative representation of uncer-
tainty. J. Risk Uncertain. 5, 297–323 (1992)
61. Zhang, D., Wang, Z.: On set-valued fuzzy integrals. Fuzzy Sets Syst. 56, 237–241 (1993)
62. Zhang, D., Guo, C.: Fuzzy integrals of set-valued mappings and fuzzy mappings. Fuzzy Sets
Syst. 75, 103–109 (1995)
63. Zhang, D., Guo, C., Liu, D.: Set-valued Choquet integrals revisited. Fuzzy Sets Syst. 147,
475–485 (2004)
Author Index
B M
Barrenechea, Edurne, 137 Mesiarová-Zemánková, Andrea, 109
Bustince, Humberto, 137 Mundici, Daniele, 57
D P
Durante, Fabrizio, 157 Pagola, Miguel, 137
Pap, Endre, 257
Perrone, Elisa, 157
Petrík, Milan, 83
E
Esteva, Francesc, 71
S
Sempi, Carlo, 173
F Stupňanová, Andrea, 181
Fernandez, Javier, 137
T
G Trillas, Enric, 13
Godo, Lluís, 71
Gottwald, Siegfried, 1
Grabisch, Michel, 215 V
Vetterlein, Thomas, 83
H W
Höhle, Ulrich, 23 Weber, Siegfried, 233
K Y
Kolesárová, Anna, 181 Yager, Ronald R., 199