A Hybrid Approach to Mining Conditions
Fernando O. Gallego and Rafael Corchuelo
Opinion mining

I think that the lens is beyond excellent for amateurs. The resolution of this camera is 13Mp. Flash is tacky when using outdoors.

Attribute      Polarity
“lens”         Positive
“resolution”   Neutral
“Flash”        Negative
But wait!
The opinion is only true in a certain situation.
Opinion mining (with conditions)

I think that the lens is beyond excellent for amateurs. The resolution of this camera is 13Mp. Flash is tacky when using outdoors.

Attribute      Polarity
“lens”         Positive (for amateurs)
“resolution”   Neutral
“Flash”        Negative (when using outdoors)
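In data terms, a condition simply extends each opinion tuple. A minimal sketch of what the conditioned output could look like (the representation is illustrative, not the paper's):

```python
# Opinions extended with an optional condition; None marks an
# unconditional opinion (illustrative representation only).
opinions = [
    {"attribute": "lens", "polarity": "positive",
     "condition": "for amateurs"},
    {"attribute": "resolution", "polarity": "neutral",
     "condition": None},
    {"attribute": "Flash", "polarity": "negative",
     "condition": "when using outdoors"},
]
```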
Roadmap
Introduction
Our proposal
Experimental results
Conclusions
Condition mining

I think that the lens is beyond excellent for amateurs. The resolution of this camera is 13Mp. Flash is tacky when using outdoors.

Conditions found: “for amateurs”, “when using outdoors”.
Current approaches
Handcrafted patterns · Machine learning
Handcrafted patterns
• Mausam et al. (2012)
– OpenIE extraction
– Dependency tree
– Adverbial clauses
• Chikersal et al. (2015)
– Opinion mining
– Basic connectives
– “then”/comma
And the problem is…
Variability of conditions
Conditions take many forms (0/1st/2nd/3rd conditionals among them):
• If you do sth
• Even if sby fell down
• If sth had passed
• Should you help me
• When sth happens
• May it be accepted
• For sby
• To sby
• During my event
• While doing sth
• After/before sth
• If it occurs
Machine learning
• Nakayama et al. (2015):
– SVM/CRF model
– 3k Japanese sentences
– Several lexicons used
And the problem is…
Roadmap
Introduction
Our proposal
Experimental results
Conclusions
Our solution
Computational linguistics + deep learning
Inputs

Sentence                                                  Conditions
I think that the lens is beyond excellent for amateurs.   [“for amateurs”]
The resolution of this camera is 13Mp.                    []
Flash is tacky when using outdoors.                       [“when using outdoors”]
…                                                         …
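The inputs map one-to-one onto sentence/condition pairs; a minimal sketch of the structure (names are hypothetical):

```python
# Training inputs: each sentence paired with the list of gold
# conditions labelled in it (empty when there is no condition).
training_inputs = [
    ("I think that the lens is beyond excellent for amateurs.",
     ["for amateurs"]),
    ("The resolution of this camera is 13Mp.", []),
    ("Flash is tacky when using outdoors.", ["when using outdoors"]),
]
```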
Main methods
[Figure: overview of the proposal’s two main methods, “train” and “apply”]
Train (1/4)
• Create a subset of training examples for each sentence
[Figure: each input sentence S1 is mapped to its subset of training examples ts]
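A sketch of this step as a loop over the inputs, assuming the helpers condition_candidates and score_candidate sketched under Train (2/4) and Train (3/4) below:

```python
def build_training_set(training_inputs):
    # One training example per (sentence, candidate) pair: the
    # candidate text and its score against the gold conditions.
    # condition_candidates and score_candidate are sketched below.
    training_set = []
    for sentence, gold_conditions in training_inputs:
        for cand in condition_candidates(sentence):
            training_set.append((cand, score_candidate(cand, gold_conditions)))
    return training_set
```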
Train (2/4)
• Generate condition candidates for a given sentence
[Figure: dependency tree of “Flash is tacky when using outdoors”, with nsubj, cop, advcl, and advmod edges]
c1: Flash is tacky when using outdoors
c2: when using outdoors
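Per the speaker notes, a candidate is generated for every non-leaf node of the dependency tree: the sequence of tokens that originates from that node. A minimal sketch with spaCy (the slides use Stanford CoreNLP; spaCy is swapped in only to keep the example self-contained):

```python
import spacy

nlp = spacy.load("en_core_web_sm")

def condition_candidates(sentence):
    # One candidate per non-leaf node: the token sequence spanned
    # by that node's subtree, in sentence order.
    doc = nlp(sentence)
    candidates = []
    for token in doc:
        if next(token.children, None) is not None:  # non-leaf nodes only
            span = sorted(token.subtree, key=lambda t: t.i)
            candidates.append(" ".join(t.text for t in span))
    return candidates

# Should yield, among others, the full sentence (root node) and
# "when using outdoors" (the advcl subtree).
print(condition_candidates("Flash is tacky when using outdoors"))
```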
Train (3/4)
• Score each candidate
c1: Flash is tacky when using outdoors → 0.8560
c2: when using outdoors → 1.0000
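The slides give the scores but not the scoring function. As a labelled assumption, token-level Jaccard overlap against the gold conditions reproduces the intent (1.0 for an exact match, lower for partial overlap):

```python
def score_candidate(candidate, gold_conditions):
    # Best token-level Jaccard overlap between the candidate and
    # any gold condition of its sentence; 0.0 when there is none.
    cand = set(candidate.lower().split())
    best = 0.0
    for gold in gold_conditions:
        g = set(gold.lower().split())
        best = max(best, len(cand & g) / len(cand | g))
    return best

print(score_candidate("when using outdoors", ["when using outdoors"]))  # 1.0
```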
Train (4/4)
• Train a deep regressor from the training set
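A minimal training sketch with the Keras/Theano stack the deck lists, shaped like the MLP alternative (two Dense layers); the featurisation and sizes are placeholders, not the paper's configuration:

```python
import numpy as np
from keras.models import Sequential
from keras.layers import Dense, Dropout

# Placeholder features: one embedded candidate per row, plus the
# score computed for it in the previous step.
X = np.random.rand(1000, 300)
y = np.random.rand(1000)

model = Sequential()
model.add(Dense(150, activation='tanh', input_shape=(300,)))
model.add(Dropout(0.2))
model.add(Dense(1, activation='linear'))
model.compile(loss='mse', optimizer='adam')
model.fit(X, y, epochs=10, batch_size=32, verbose=0)
```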
Regressor’s alternatives
MLP · GRU · BiGRU · CNN · CNN-BiGRU
Main methods
[Figure: overview of the main methods, now focusing on “apply”]
Apply (1/5)
• Generate condition candidates (the same procedure used in “train”; see the sketch under Train (2/4))
[Figure: dependency tree of “Flash is tacky when using outdoors”]
c1: Flash is tacky when using outdoors
c2: when using outdoors
Apply (2/5)
• For each condition candidate, check whether it must be considered or not
Apply (3/5)
• The regressor scores the candidate
c1: Flash is tacky when using outdoors → 0.8560
c2: when using outdoors → 1.0000
Apply (4/5)
• If the score is equal to or greater than a given threshold, the candidate is considered
Apply (5/5)
• Keep the best non-overlapping candidates
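Steps (4/5) and (5/5) amount to thresholding followed by greedy overlap removal; a minimal sketch (the threshold value and the span-overlap test are assumptions):

```python
def select_conditions(scored_candidates, threshold=0.5):
    # scored_candidates: (start, end, text, score) token spans.
    # Keep spans scoring at or above the threshold, then greedily
    # drop any span overlapping an already-kept, better-scoring one.
    kept = []
    passing = [c for c in scored_candidates if c[3] >= threshold]
    for cand in sorted(passing, key=lambda c: c[3], reverse=True):
        if all(cand[1] <= k[0] or cand[0] >= k[1] for k in kept):
            kept.append(cand)
    return [c[2] for c in sorted(kept, key=lambda c: c[0])]

# The full-sentence candidate overlaps the better-scoring
# "when using outdoors", so only the latter survives.
print(select_conditions([
    (0, 6, "Flash is tacky when using outdoors", 0.8560),
    (3, 6, "when using outdoors", 1.0000),
]))
```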
Roadmap
Introduction
Our proposal
Experimental results
Conclusions
Hardware & software configuration
• Intel Xeon E5-2690
• 4 threads at 2.60 GHz
• 2 GiB of RAM
• Nvidia Tesla K10 GPU
• CentOS Linux 7.3
• Snowball 1.2.1
• Stanford CoreNLP 3.8.0
• Python 3.5.4
• Gensim 2.3.0
• Keras 2.0.8 & Theano 1.0
Dataset
https://www.kaggle.com/fogallego/reviews-with-conditions
Baselines
Handcrafted patterns (the machine-learning proposal was not considered; see the notes)
Results
[Figure: F1 scores of all the alternatives and the baselines; CNN and CNN-BiGRU perform best]
Roadmap
Introduction
Our proposal
Experimental results
Conclusions
Well done!
• It overcomes the problems found in the literature
• Comprehensive experimental analysis
• It achieves good results
Thanks
Fernando O. Gallego
fogallego@us.es
Condition mining’s main applications
Information extraction · Opinion mining · Recommenders
Detailed example (1/3)
If you are someone who likes cakes then try John’s
[Figure: dependency tree of the sentence, with mark, nsubj, cop, advcl, acl:relcl, advmod, xcomp, dobj, and case edges]
Detailed example (2/3)
[Figure: the condition candidates read off the dependency tree, one subtree per non-leaf node]
c1: If you are someone who likes cakes then try John’s (the whole tree)
c2: If you are someone who likes cakes (the advcl subtree)
c3: who likes cakes (the acl:relcl subtree)
c4: John’s (the xcomp subtree)
Detailed example (3/3)
Our Neural Networks
(Shapes are length × dimension; l and d are the length and embedding dimension of the input sentence, and 1×1 is the final scalar score.)

CNN:
• Convolution: input l×d, output 0.9l×1.2d, activation relu, kernel 3, drop-out 0.2000
• Convolution: input 0.9l×1.2d, output 0.6l×0.3d, activation relu, kernel 17, drop-out 0.2000
• Pooling: input 0.6l×0.3d, output 0.6l×1, functor max, pool global
• Dense: input 0.6l×1, output 0.3l×1, activation linear, drop-out 0.2000
• Dense: input 0.3l×1, output 1×1, activation tanh, drop-out 0.0000

MLP:
• Dense: input l×d, output l×0.5d, activation tanh, drop-out 0.2000
• Dense: input l×0.5d, output 1×1, activation linear, drop-out 0.0000

GRU:
• GRU: input l×d, output l×1, activation tanh, drop-out 0.1500
• Dense: input l×1, output 0.3l×1, activation linear, drop-out 0.2000
• Dense: input 0.3l×1, output 1×1, activation tanh, drop-out 0.0000

BiGRU:
• BiGRU: input l×d, output 2l×1, activation tanh, drop-out 0.1500
• Dense: input 2l×1, output 0.3l×1, activation linear, drop-out 0.2000
• Dense: input 0.3l×1, output 1×1, activation tanh, drop-out 0.0000

CNN-BiGRU:
• Convolution: input l×d, output 0.9l×0.3d, activation relu, kernel 3, drop-out 0.0000
• Pooling: input 0.9l×0.3d, output 0.9l×0.3d, functor max, pool 2
• BiGRU: input 0.9l×0.3d, output 0.4l×1, activation tanh, drop-out 0.1500
• Dense: input 0.4l×1, output 0.3l×1, activation linear, drop-out 0.2000
• Dense: input 0.3l×1, output 1×1, activation tanh, drop-out 0.0000
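For illustration, a loose Keras rendering of the CNN column above for a padded length l = 100 and d = 300-dimensional embeddings; the fractional sizes are rounded and the pooling is Keras' global max over time, so this is a reading of the table, not the authors' released code:

```python
from keras.models import Sequential
from keras.layers import Conv1D, GlobalMaxPooling1D, Dense, Dropout

l, d = 100, 300  # assumed padded sentence length and embedding size

model = Sequential()
model.add(Conv1D(int(1.2 * d), kernel_size=3, activation='relu',
                 padding='same', input_shape=(l, d)))
model.add(Dropout(0.2))
model.add(Conv1D(int(0.3 * d), kernel_size=17, activation='relu',
                 padding='same'))
model.add(Dropout(0.2))
model.add(GlobalMaxPooling1D())               # "functor max, pool global"
model.add(Dense(int(0.3 * l), activation='linear'))
model.add(Dropout(0.2))
model.add(Dense(1, activation='tanh'))        # scalar candidate score
model.compile(loss='mse', optimizer='adam')
```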
Detailed results

                    q = 0.2500              q = 0.5000              q = 0.7500
Lang  Proposal      P      R      F1        P      R      F1        P      R      F1
en    MB            0.6270 0.6144 0.6206    0.6270 0.6144 0.6206    0.6270 0.6144 0.6206
      CB            0.7979 0.4642 0.5870    0.7979 0.4642 0.5870    0.7979 0.4642 0.5870
      Averages      0.7125 0.5393 0.6038    0.7125 0.5393 0.6038    0.7125 0.5393 0.6038
      MLP           0.4741 0.7799 0.5897    0.5612 0.5271 0.5436    0.5739 0.4582 0.5096
      GRU           0.9999 0.4421 0.6131    0.9999 0.4421 0.6131    0.9999 0.4421 0.6131
      BiGRU         0.5448 0.5262 0.5353    0.8999 0.4421 0.5929    0.9999 0.4421 0.6131
      CNN           0.5908 0.7546 0.6628    0.6211 0.6278 0.6244    0.6571 0.5432 0.5948
      CNN-BiGRU     0.5586 0.8052 0.6596    0.6318 0.6529 0.6422    0.7327 0.4914 0.5883
      Averages      0.6336 0.6616 0.6121    0.7428 0.5384 0.6033    0.7927 0.4754 0.5838
es    MB            0.6699 0.5285 0.5909    0.6699 0.5285 0.5909    0.6699 0.5285 0.5909
      CB            0.7953 0.4399 0.5665    0.7953 0.4399 0.5665    0.7953 0.4399 0.5665
      Averages      0.7326 0.4842 0.5787    0.7326 0.4842 0.5787    0.7326 0.4842 0.5787
      MLP           0.4232 0.8295 0.5604    0.5382 0.5678 0.5526    0.5771 0.4465 0.5034
      GRU           0.5246 0.7483 0.6168    0.7089 0.4304 0.5356    0.9999 0.4153 0.5869
      BiGRU         0.5321 0.7451 0.6209    0.6335 0.4692 0.5391    0.9999 0.4153 0.5869
      CNN           0.5997 0.7519 0.6672    0.6606 0.6521 0.6563    0.7065 0.5467 0.6164
      CNN-BiGRU     0.5227 0.8221 0.6390    0.6195 0.6968 0.6559    0.6843 0.5369 0.6017
      Averages      0.5205 0.7794 0.6209    0.6321 0.5633 0.5879    0.7935 0.4721 0.5790
Statistical analysis

(a) q = 0.2500
Proposal    Ranking   Comparison        z        p-value
CNN         1.0000    CNN x CNN         -        -
CNN-BiGRU   2.0000    CNN x CNN-BiGRU   1.4142   0.1573
BiGRU       3.5000    CNN x BiGRU       3.5355   0.0008
MLP         4.1000    CNN x MLP         4.3841   0.0000
GRU         4.4000    CNN x GRU         4.8083   0.0000

(b) q = 0.5000
Proposal    Ranking   Comparison              z        p-value
CNN-BiGRU   1.4000    CNN-BiGRU x CNN-BiGRU   -        -
CNN         1.6000    CNN-BiGRU x CNN         0.2828   0.7773
MLP         3.1000    CNN-BiGRU x MLP         2.4042   0.0324
BiGRU       4.2000    CNN-BiGRU x BiGRU       3.9598   0.0002
GRU         4.7000    CNN-BiGRU x GRU         4.6669   0.0000

(c) q = 0.7500
Proposal    Ranking   Comparison        z        p-value
CNN         1.3000    CNN x CNN         -        -
CNN-BiGRU   1.7000    CNN x CNN-BiGRU   0.5657   0.5716
MLP         3.0000    CNN x MLP         2.4042   0.0324
GRU         4.5000    CNN x GRU         4.5255   0.0000
BiGRU       4.5000    CNN x BiGRU       4.5255   0.0000

(d)
Proposal        Ranking   Comparison                z        p-value
CNN0.25         1.4000    CNN0.25 x CNN0.25         -        -
CNN-BiGRU0.50   1.8000    CNN0.25 x CNN-BiGRU0.50   0.5657   0.5716
MB              3.4000    CNN0.25 x MB              2.8284   0.0094
CNN0.75         3.7000    CNN0.25 x CNN0.75         3.2527   0.0034
CB              4.7000    CNN0.25 x CB              4.6669   0.0000
Editor's Notes
  • #2: Thanks for attending my presentation. My name is Fernando O. Gallego and I co-authored this paper with Rafael Corchuelo, both from the University of Seville. -- Copyright (C) 2018 The Distributed Group The use of these slides is hereby constrained to the conditions of the TDG Licence, a copy of which you may download from http://www.tdg-seville.info/License.html
  • #3: First of all, let’s introduce an example to understand the problem. Opinion mining is a set of natural language processing tasks whose main goal is to determine whether the opinion expressed on a document or one of its aspects is positive, negative, or neutral.
  • #4: But wait! There is a problem that you likely didn’t notice.
  • #5: There are some clauses in the sentence, known as conditions, that change the sense of the opinion. For instance, the positive opinion about “lens” is only true if you consider amateur photographers. Likewise, the negative opinion regarding “Flash” is only true if the user uses the camera outdoors.
  • #6: This is the roadmap of my presentation: I’ll start with a broad introduction to the problem, then I’ll report on our proposal, then on some experimental results, and, finally, I’ll present some conclusions.
  • #7: Let’s start with the introduction
  • #8: Simply put, condition mining is a task whose goal is to identify conditions from a piece of text.
  • #9: Currently, there are two approaches in the literature, namely: handcrafted patterns and machine learning.
  • #10: Handcrafted or user-defined patterns clearly describe how to identify a condition in a text by means of connectives, pos-tags, dependency tags, or other clue words. There are two proposals along these lines: Mausam, who studied the problem in the field of entity-relation extraction and uses adverbial clauses from the dependency tree; and Chikersal, who studied the problem in the field of opinion mining and uses basic connectives and the tokens “then”/comma.
  • #11: Unfortunately, the previous proposals are not appealing because of the human effort involved in handcrafting such patterns.
  • #12: Furthermore, the results typically fall short regarding recall because of the variability of the conditions.
  • #13: The only existing machine-learning proposal was introduced by Nakayama et al., who worked in the field of opinion mining in Japanese. They devised a model that is based on several features from opinion expressions, which requires providing some specific-purpose dictionaries, taxonomies, and heuristics. They used Conditional Random Fields and Support Vector Machines to learn classifiers of syntactic units of the sentences.
  • #14: Unfortunately, their proposal was only evaluated on a small dataset with 3,155 sentences regarding hotels, and the best F1 score attained was 0.58. In conclusion, this proposal is not generally applicable and its effectiveness is poor.
  • #15: Then, we’ll describe our proposal.
  • #16: Our solution is a hybrid approach that combines computational linguistics and deep learning. It does not have any of the problems found in the related work.
  • #17: Our inputs are a set of sentences with their corresponding sets of labels. Those sets identify the conditions for each sentence.
  • #18: These are our proposal’s main methods.
  • #19: Method “train” returns a regressor that computes a score that assesses how likely a candidate condition is an actual condition.
  • #20: The procedure is repeated for every input sentence to compute a subset of training examples.
  • #21: The procedure starts by generating a set of condition candidates from the sentence’s dependency tree. The heuristic used is quite simple: we consider every non-leaf node in the dependency tree and compute all of the sequences of tokens that originate from that node.
  • #22: For each candidate, we compute a score that represents how likely it is a condition.
  • #23: And finally, we train a deep regressor using well-known Deep learning networks.
  • #24: We have experimented with dozens of neural-network alternatives, but the best ones are those that we present in our paper, namely: Multilayer Perceptron, Gated Recurrent Unit Network, Bidirectional Gated Recurrent Unit Network, Convolutional Neural Network, and a hybrid neural network composed of both Convolutional layers and Bidirectional Gated Recurrent Unit layers.
  • #25: Method “apply” returns the conditions found in a sentence by means of that regressor.
  • #26: We first need to compute the set of candidate conditions of the sentence. This method is the same as the one used in main method “train”.
  • #27: The procedure is repeated for every candidate to check whether it must be considered or not
  • #28: Given a candidate, we need to score it. In this case, we use the regressor that we trained before.
  • #29: If the score is equal to or greater than a given threshold, it is added to the result set.
  • #30: Finally, we remove the conditions that overlap others with a higher score.
  • #31: Now, let me show you our experimental results.
  • #32: This is our hardware and software configuration. As you can see, it’s a pretty regular configuration with recent versions of software components.
  • #33: We used a dataset with almost 4 million sentences in English and Spanish. In addition, we have since increased the number of sentences by adding new languages like French and Italian, and we uploaded the dataset to the Kaggle platform.
  • #34: We used the handcrafted-pattern proposals as baselines. The machine-learning proposal wasn’t considered because it is not clear whether it can be customised to deal with languages other than Japanese, its best F1 was 0.58, and we could find neither an implementation nor the dataset.
  • #35: In this slide we present our results in terms of F1 score. Our best alternatives are CNN and CNN-BiGRU, which beat the related-work proposals. We performed a statistical analysis to determine which alternative is the winner.
  • #36: It’s time for conclusions
  • #37: Our conclusions are that we’ve presented a proposal that overcomes the problems found in the literature. Our experimental analysis covers a variety of alternatives and achieves promising results.
  • #38: Thanks for attending this presentation
  • #44: Fix the threshold symbol.
  • #45: Fix the threshold symbol.