Introduction To Discrete Bayesian Methods: Petri Nokelainen
Introduction To Discrete Bayesian Methods: Petri Nokelainen
Bayesian Methods
Petri Nokelainen
[email protected]
School of Education
University of Tampere, Finland
Outline
Overview
Introduction to Bayesian Modeling
Bayesian Classification Modeling
Bayesian Dependency Modeling
Bayesian Unsupervised Model-based Visualization
Overview
S
P
S
S
S
P
S
S
SPSS Extension
AMOS
MPlus
(Nokelainen, 2008.)
Overview
BDM = Bayesian
Dependency Modeling
BCM = Bayesian
Classification Modeling
BUMV = Bayesian
Unsupervised Modelbased Visualization
B-Course
BayMiner
(Nokelainen, Silander,
Ruohotie & Tirri, 2007.)
(Nokelainen &
Ruohotie, 2009.)
http://b-course.cs.helsinki.fi
http://b-course.cs.helsinki.fi
http://www.bayminer.com
7
Bayesian Unsupervised Model-based Visualization
Outline
Overview
Introduction to Bayesian Modeling
Bayesian Classification modeling
Bayesian Dependency modeling
Bayesian Unsupervised Model-based Visualization
Discrete
0 1 2, ..
10
11
12
13
Otherwise non-parametric
techniques should be applied.
14
15
16
17
P(E|H) P(H)
P(H|E)=
P(E|H)P(H) + P(E|~H) P(~H)
18
19
20
21
22
23
24
P(E|H) P(H)
A priori probability
Conditional probability
Posterior probability
P(H|E)=
P(E|H) P(H) + P(E|~H) P(~H)
(.9) (.01)
= .08
P(H|E)=
(.9) (.01) + (.1) (.99)
25
26
27
28
29
30
31
32
33
34
35
36
12 r
12 r
P ( r | , n)
(1 )
r
37
38
39
12!
.053 (1 .05)123
3!(12 3)!
479001600
3
12 3
.05 (1 .05)
2177280
.017
40
41
42
43
n!
12!
r (1 ) n r
.05 3 (1 .05)123 .017
r!(n r )!
3!(12 3)!
44
45
46
47
48
P ( H 0 | r 3)
P (r 3 | H 0 ) P( H 0 )
P (r 3 | H 0 ) P ( H 0 ) P (r 3 | H 1 ) P( H 1 ) P(r 3 | H 2 ) P( H 2 ) P(r 3 | H 3 ) P ( H 3 )
1
P (r 3 | .017) P( )
2
1
1
1
1
P (r 3 | .017) P( ) P (r 3 | .027) P( ) P(r 3 | .039) P ( ) P (r 3 | .053) P ( )
2
6
6
6
0.30
49
50
51
52
Outline
Overview
Introduction to Bayesian Modeling
Bayesian Classification Modeling
Bayesian Dependency Modeling
Bayesian Unsupervised Model-based Visualization
53
BCM = Bayesian
Classification Modeling
BDM = Bayesian
Dependency Modeling
BUMV = Bayesian
Unsupervised Modelbased Visualization
B-Course
54
55
56
Population size.
Confidence interval
Confidence level
57
58
59
60
61
62
For an example of
practical use of BCM, see
Nokelainen, Tirri,
Campbell and Walberg
(2007).
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
Outline
Research Overview
Introduction to Bayesian Modeling
Investigating Non-linearities with Bayesian Networks
Bayesian Classification Modeling
Bayesian Dependency Modeling
Bayesian Unsupervised Model-based Visualization
83
BCM = Bayesian
Classification Modeling
BDM = Bayesian
Dependency Modeling
BUMV = Bayesian
Unsupervised Modelbased Visualization
B-Course
84
Var 1
Var 2
Var 3
85
86
87
qi
P ( D | M )
i 1
j 1
( N )
'
ij
( N N ij
'
ij
ri
)
k 1
( N
'
ijk
N ijk )
( N ijk' )
88
qi
P ( D | M )
i 1
j 1
( N )
'
ij
( N N ij
'
ij
ri
)
k 1
( N
'
ijk
N ijk )
( N )
'
ijk
89
'
N'
N
(
N ijk' 1 ) (
N ijk' 2 )
r q
r q
P( D x1 | M 1 )
N'
N'
N'
(
N ij )
(
)
(
)
qi
r q
r q
(1.00 5) (0.50)
(0.50)
0.008 6.563 0.500
0.027
x1
1
1
2
1
1
x2
1
1
2
2
1
90
N'
( )
qi
'
N'
N
(
N ijk' 1 ) (
N ijk' 2 )
r q
r q
| M1)
N'
N'
N'
(
N ij )
(
)
(
)
qi
r q
r q
(1.00 5) (0.50)
(0.50)
x1
1
1
2
1
1
x2
1
1
2
2
1
91
92
'
N'
N
(
N ijk' 1 ) (
N ijk' 2 )
r q
r q
P ( D x1 | M 2 )
N'
N'
N'
(
N ij )
(
)
(
)
qi
r q
r q
(1.00 5) (0.50)
(0.50)
93
(0.50 4) (0.25)
(0.25)
94
(0.50 1) (0.25)
(0.25)
95
96
P( D)
P( M 1 | D)
0.000324
0.88
P( M 2 | D) P( D | M 2 ) P( M 2 )
0.000365
P
(
D
)
97
x1
1
1
2
1
1
x2
1
1
2
2
1
98
n*( n 1) / 2
99
For an example of
practical use of BDM,
see Nokelainen and Tirri
(2010).
100
Our hypothesis regarding the first research question was that intrinsic goal
orientation (INT) is positively related to moral judgment (Batson & Thompson,
2001; Kunda & Schwartz, 1983).
It was also hypothesized, based on Blasis (1999) argumentation that emotions
cannot be predictors of moral action, that fear of failure (affective motivational
section) is not related to moral judgment.
Research evidence showed support for both hypotheses: firstly, only intrinsic
motivation was directly (positively) related to moral judgment, and secondly,
affective motivational section was not present in the predictive model.
101
Conditioning the three levels of moral judgment showed that there is a positive
statistical relationship between moral judgment and intrinsic goal orientation. The
probability of belonging to the highest intrinsically motivated group three (M = 3.7
5.0) increases from 15 per cent to 90 per cent alongside with the moral judgment
abilities. There is also similar but less steep increase in extrinsic goal orientation
(from 5% to 12%), but we believe that it is mostly tied to increase in extrinsic goal
orientation.
102
For an example of
practical use of BDM see
Nokelainen and Tirri
(2007).
103
104
In conflict situations, my superior is able
to draw out all parties and understand
the differing perspectives.
My superior sees other people in positive
rather than in negative light.
My superior has an optimistic "glass half
full" outlook.
105
2% vs. 90%
EL_iv_17_49 In conflict situations, my superior is able to draw out all parties and
understand the differing perspectives.
EL_ii_09_26 My superior sees other people in positive rather than in negative light.
EL_ii_09_25 My superior has an optimistic "glass half full" outlook.
(Nokelainen & Tirri, 2007.)
106
69%
66%
107
95%
85%
108
Outline
Overview
Introduction to Bayesian Modeling
Bayesian Classification Modeling
Bayesian Dependency Modeling
Bayesian Unsupervised Model-based Visualization
109
BCM = Bayesian
Classification Modeling
BDM = Bayesian
Dependency Modeling
BUMV = Bayesian
Unsupervised Modelbased Visualization
BayMiner
110
VISUALIZATION TECH.
NON-REDUC.
UNSUPERVISED
CLUSTER ANALYSIS
EFA
REDUCING
PROJECTION TECH.
LINEAR
PCA
PROJ.PUR.
NON-LINEAR
MDS
NEUR.N.
SOM
PRIN.C.
ICA
BUMV
111
112
VISUALIZATION TECH.
UNSUPERVISED
CLUSTER ANALYSIS
EFA
113
114
115
116
117
VISUALIZATION TECH.
NON-REDUC.
UNSUPERVISED
CLUSTER ANALYSIS
REDUCING
PROJECTION TECH.
EFA
118
119
VISUALIZATION TECH.
NON-REDUC.
CLUSTER ANALYSIS
REDUCING
PROJECTION TECH.
LINEAR
UNSUPERVISED
NON-LINEAR
EFA
120
121
VISUALIZATION TECH.
NON-REDUC.
CLUSTER ANALYSIS
REDUCING
PROJECTION TECH.
LINEAR
PCA
PROJ.PUR.
UNSUPERVISED
NON-LINEAR
EFA
122
123
VISUALIZATION TECH.
NON-REDUC.
UNSUPERVISED
CLUSTER ANALYSIS
EFA
REDUCING
PROJECTION TECH.
LINEAR
PCA
PROJ.PUR.
NON-LINEAR
MDS
NEUR.N.
SOM
PRIN.C.
ICA
BUMV
124
125
126
127
128
129
130
131
132
133
(Jensen, 1995.)
134
A weakness of neural networks tis hat you are unable to utilize the
knowledge you might have in advance.
Probabilities, on the other hand, can be assessed using a
combination of theoretical insight, empiric studies independent of
the constructed system, training, and various more or less
subjective estimates.
(Jensen, 1995.)
135
(Jensen, 1995.)
136
For an example of
practical use of BUMV,
see Nokelainen and
Ruohotie (2009).
137
Results showed that managers and teachers had higher growth motivation and
level of commitment to work than other personnel, including job titles such as
cleaner, caretaker, accountant and computer support.
Employees across all job titles in the organization, who have temporary or parttime contracts, had higher self-reported growth motivation and commitment to
work and organization than their established colleagues.
138
139
Links
B-Course
BayMiner
http://b-course.cs.helsinki.fi
http://www.bayminer.com
140
References
141
References
142
References
Kaski, S. (1997). Data exploration using self-organizing maps. Doctoral
dissertation. Acta Polytechnica Scandinavica, Mathematics, Computing and
Management in Engineering Series No. 82. Espoo: Finnish Academy of Technology.
Kim, S., Kwon, S., & Cook, D. (2000). Interactive Visualization of Hierarchical
Clusters Using MDS and MST. Metrika, 51(1), 3951.
Kohonen, T. (1995). Self-Organizing Maps. Berlin: Springer.
Kontkanen, P., Lahtinen, J., Myllymki, P., Silander, T., & Tirri, H. (2000).
Supervised Model-based Visualization of High-dimensional Data. Intelligent Data
Analysis, 4, 213-227.
Kontkanen, P., Lahtinen, J., Myllymki, P., & Tirri, H. (2000). Unsupervised
Bayesian Visualization of High-Dimensional Data. In R. Ramakrishnan, S. Stolfo,
R. Bayardo, & I. Parsa (Eds.), Proceedings of the Sixth International Conference on
Knowledge Discovery and Data Mining (pp. 325-329). New York, NY: The
Association for Computing Machinery.
143
References
144
References
145
References
Nokelainen, P., Ruohotie, P., & Tirri, H. (1999). Professional Growth
Determinants-Comparing Bayesian and Linear Approaches to Classification. In P.
Ruohotie, H. Tirri, P. Nokelainen, & T. Silander (Eds.), Modern Modeling of
Professional Growth, vol. 1 (pp. 85-120). Hmeenlinna: RCVE.
Nokelainen, P., & Tirri, K. (2010). Role of Motivation in the Moral and Religious
Judgment of Mathematically Gifted Adolescents. High Ability Studies, 21(2), 101116.
Nokelainen, P., Tirri, K., Campbell, J. R., & Walberg, H. (2004). Cross-cultural
Factors that Account for Adult Productivity. In J. R. Campbell, K. Tirri, P.
Ruohotie, & H. Walberg (Eds.), Cross-cultural Research: Basic Issues, Dilemmas,
and Strategies (pp. 119-139). Hmeenlinna: RCVE.
Nokelainen, P., Tirri, K., & Merenti-Vlimki, H.-L. (2007). Investigating the
Influence of Attribution Styles on the Development of Mathematical Talent. Gifted
Child Quarterly, 51(1), 64-81.
Pylvs, L., Nokelainen, P., & Roisko, H. (in press). Modeling of Vocational
Excellence in Air Traffic Control. Submitted for review.
146
References