Incremental Association Rule Mining Using Promising Frequent Itemset Algorithm
Incremental Association Rule Mining Using Promising Frequent Itemset Algorithm
Abstract Association rule discovery is an important area of rules and some existing association rules would become
data mining. In dynamic databases, new transactions are invalid. As a brute force approach, apriori may be reapplied to
mining the whole dynamic database when the database has
appended as time advances. This may introduce new association been changed. However, this approach is very costly even if
rules and some existing association rules would become invalid. small amount of new transactions is inserted into a database.
Thus, the association rule mining for a dynamic database is an
Thus, the maintenance of association rules for dynamic databases important problem. Several research works [7, 8, 9, 10, 11]
is an important problem. In this paper, promising frequent have proposed several incremental algorithms to deal with this
problem. Review of related works will be introduced in section
itemset algorithm, which is an incremental algorithm, is proposed
2.
to deal with this problem. The proposed algorithm uses
In this paper, a new incremental algorithm, called
maximum support count of 1-itemsets obtained from previous promising frequent itemset algorithm, is introduced. The goal
mining to estimate infrequent itemsets, called promising itemsets, of this work is to solve the efficient updating problem of
association rules after a nontrivial number of new records have
of an original database that will capable of being frequent been added to a database. Our approach introduces a promising
itemsets when new transactions are inserted into the original frequent itemset for an infrequent itemset that has capable of
being a frequent itemset after a number of new records have
database. Thus, the algorithm can reduce a number of times to
been added to a database. This can reduce a number of times to
scan the original database. As a result, the algorithm has scan an original database. As a result, the algorithm has
execution time faster than that of previous methods. This paper execution time faster than that of previous methods.
also conducts simulation experiments to show the performance of The remaining of this paper is organized as follows. We
brief review of related works in Section 2. The Promising large
the proposed algorithm. The simulation results show that the itemset algorithm is described in Section 3. We evaluate the
proposed algorithm has a good performance. performance in Section 4. Finally, we conclude the work of this
paper in section 5.
Keywordsassociation rule, maintain association rule, II. RELATED WORK
incremental associatin rule
I. INTRODUCTION
An influential algorithm for association rule mining is
Data mining is one of the processes of Knowledge Apriori [2]. Apriori computes frequent itemsets in the large
Discovery in Database (KDD) that is used for extracting database through several iterations based on a prior knowledge.
information or pattern from large database. One major Each iteration has 2 steps. For each iteration with 2 steps,
application area of data mining is association rule mining [1] processes are join and prune step. For an frequent itemset, its
that discovers hidden knowledge in database. The association support must be higher than a user-specified minimum support
rule mining problem is to find out all the rules in the form of X threshold. The association rule can be discoverd based on
=> Y, where X and Y I are sets of items, called itemsets. The frequent itemsets that must be higher than user-specified
association rule discovery algorithm is usually decomposed minimum confidence.
into 2 major steps. The first step is find out all large itemsets
that have support value exceed a minimum support threshold For dynamic databases, several incremental updating
and the second steps is find out all the association rules that techniques have been developed for mining association rules.
have value exceed a minimum confidence threshold. One of the previous work for incremental association rule
mining is FUP algorithm that was presented by Cheung et al
However, a database is dynamic when new transactions are [3]. FUP algorithm is the first incremental updating technique
inserted into the database. This may introduce new association
Execution time
10000
Output :
(1) L (DBdb) : frequent k-itemset in updated database, 8000 Promising 6%