
APEX INSTITUTE OF TECHNOLOGY

DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING

Data Mining and Warehousing (22CSH-380)


Faculty: Dr. Preeti Khera (E16576)

Lecture – 2.3.1 & 2.3.2


Mining Single-Dimensional Boolean Association Rules
from Transactional Databases – Apriori Algorithm

DISCOVER . LEARN . EMPOWER

June 4, 2025 1
Data Mining and Warehousing: Course Objectives

COURSE OBJECTIVES
The Course aims to:

1. Develop an understanding of the key concepts of data mining and learn how to
extract useful characteristics from data using data pre-processing techniques.
2. Demonstrate methods to select and analyze relevant attributes, apply statistical
measures to look for meaningful variation in data, and mine association rules from
transactional datasets.
3. Teach the use and application of data mining techniques such as classification,
decision trees, neural networks, back-propagation and many more, in various applications.

COURSE OUTCOMES
On completion of this course, the students shall be able to:

CO1  Understand the concept of data mining and the usage of various tools for data warehousing and data mining.
CO2  Demonstrate the strengths and weaknesses of different methods of meaningful data mining.
CO3  Apply association rule, classification, and clustering algorithms to large data sets.
CO4  Evaluate and employ the correct data mining techniques depending on the characteristics of the dataset.
CO5  Verify and formulate the performance of various data mining techniques according to the dataset.

Unit-2 Syllabus

Unit-2
Concept Description: Definition, Data Generalization, Analytical Characterization,
Analysis of Attribute Relevance, Mining Class Comparisons, Statistical Measures in Large
Databases, Measuring Central Tendency, Measuring Dispersion of Data, Graph Displays
of Basic Statistical Class Description; Mining Association Rules in Large Databases,
Association Rule Mining, Mining Single-Dimensional Boolean Association Rules from
Transactional Databases – Apriori Algorithm, Mining Multilevel Association Rules from
Transaction Databases, and Mining Multi-Dimensional Association Rules from Relational
Databases.

Table of Content
• Mining Association Rules in Large Databases
• Association rule mining
• Mining Single-Dimensional Boolean Association rules from Transactional
Databases
• Apriori Algorithm

Association Mining
• Association rule mining:
• Finding frequent patterns, associations, correlations, or
causal structures among sets of items or objects in
transaction databases, relational databases, and other
information repositories.
• Applications:
• Basket data analysis, cross-marketing, catalog design, loss-
leader analysis, clustering, classification, etc.
• Examples:
• Rule form: “Body → Head [support, confidence]”
• buys(x, “diapers”) → buys(x, “beers”) [0.5%, 60%]
• major(x, “CS”) ∧ takes(x, “DB”) → grade(x, “A”) [1%, 75%]
Association Rules: Basic Concepts
• Given: (1) a database of transactions, (2) each transaction
is a list of items (purchased by a customer in a visit)
• Find: all rules that correlate the presence of one set of
items with that of another set of items
• E.g., 98% of people who purchase tires and auto accessories
also get automotive services done
• Applications
• * → Maintenance Agreement (what the store should do to
boost Maintenance Agreement sales)
• Home Electronics → * (what other products should the store
stock up on?)
• Attached mailing in direct marketing
• Detecting “ping-pong”ing of patients, faulty “collisions”
Interestingness Measures: Support and Confidence

[Venn diagram: customers who buy diapers, customers who buy beer, and customers who buy both]

• Find all the rules X ∧ Y → Z with minimum confidence and support
• support, s: probability that a transaction contains {X ∪ Y ∪ Z}
• confidence, c: conditional probability that a transaction having
{X ∪ Y} also contains Z

Transaction ID | Items Bought
2000           | A, B, C
1000           | A, C
4000           | A, D
5000           | B, E, F

With minimum support 50% and minimum confidence 50%, we have:
– A → C (50%, 66.6%)
– C → A (50%, 100%)
Association Rule Mining: A Road Map
• Boolean vs. quantitative associations (based on the types of values
handled)
• buys(x, “SQLServer”) ∧ buys(x, “DMBook”) → buys(x, “DBMiner”) [0.2%, 60%]
• age(x, “30..39”) ∧ income(x, “42..48K”) → buys(x, “PC”) [1%, 75%]
• Single-dimensional vs. multi-dimensional associations (each
distinct predicate of a rule is a dimension)
• Single-level vs. multiple-level analysis (consider multiple levels of
abstraction)
• What brands of beers are associated with what brands of diapers?
• Extensions
• Correlation, causality analysis
(association does not necessarily imply correlation or causality)
• Max-patterns (a frequent pattern such that no proper super-pattern is frequent) and
closed itemsets (an itemset c is closed if there exists no proper superset c′ of c such
that every transaction containing c also contains c′)
Mining Association Rules – An Example

Transaction ID | Items Bought
2000           | A, B, C
1000           | A, C
4000           | A, D
5000           | B, E, F

Min. support 50%, min. confidence 50%

Frequent Itemset | Support
{A}              | 75%
{B}              | 50%
{C}              | 50%
{A,C}            | 50%

For rule A → C:
support = support({A, C}) = 50%
confidence = support({A, C}) / support({A}) = 66.6%

The Apriori principle:
Any subset of a frequent itemset must be frequent
Mining Frequent Itemsets

• Find the frequent itemsets: the sets of items that
have at least minimum support
• Any subset of a frequent itemset must also be a frequent
itemset
• i.e., if {A, B} is a frequent itemset, both {A} and {B} must be
frequent itemsets
• Iteratively find frequent itemsets with cardinality from 1 to
k (k-itemsets)
• Use the frequent itemsets to generate association
rules.
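The final step, turning frequent itemsets into rules, can be sketched as follows (a simplified enumeration over body/head splits; the function name and the `support_of` table are illustrative, with supports taken from the earlier four-transaction example):

```python
from itertools import combinations

def rules_from_itemset(itemset, support_of, min_conf):
    """Generate rules body -> head from one frequent itemset,
    keeping those whose confidence meets min_conf."""
    items = tuple(sorted(itemset))
    rules = []
    for r in range(1, len(items)):
        for body in combinations(items, r):
            head = tuple(sorted(set(items) - set(body)))
            # confidence = support(whole itemset) / support(body)
            conf = support_of[items] / support_of[body]
            if conf >= min_conf:
                rules.append((body, head, conf))
    return rules

# Supports from the earlier example, as fractions of the 4 transactions.
support_of = {("A",): 0.75, ("C",): 0.5, ("A", "C"): 0.5}
print(rules_from_itemset({"A", "C"}, support_of, min_conf=0.5))
```

For {A, C} this yields both A → C (confidence 66.6%) and C → A (confidence 100%), matching the example.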
The Apriori Algorithm: Basic Idea
• Join Step: Ck is generated by joining Lk−1 with itself
• Prune Step: Any (k−1)-itemset that is not frequent cannot
be a subset of a frequent k-itemset
• Pseudo-code:
Ck: candidate itemset of size k
Lk: frequent itemset of size k

L1 = {frequent items};
for (k = 1; Lk != ∅; k++) do begin
    Ck+1 = candidates generated from Lk;
    for each transaction t in database do
        increment the count of all candidates in Ck+1 that
        are contained in t
    Lk+1 = candidates in Ck+1 with min_support
end
return ∪k Lk;
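The pseudo-code above can be written as a runnable sketch (a plain level-wise implementation assuming set-valued transactions; the helper names are my own, not from the slides):

```python
from itertools import combinations

def apriori(transactions, min_support):
    """Level-wise frequent-itemset mining following the pseudo-code above.
    min_support is a fraction of the number of transactions."""
    min_count = min_support * len(transactions)
    # L1: frequent 1-itemsets, counted in one scan.
    counts = {}
    for t in transactions:
        for item in t:
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    Lk = {s for s, c in counts.items() if c >= min_count}
    frequent = {s: counts[s] for s in Lk}
    while Lk:
        k = len(next(iter(Lk)))
        # Join step: unions of two k-itemsets that form a (k+1)-itemset.
        candidates = {a | b for a in Lk for b in Lk if len(a | b) == k + 1}
        # Prune step: drop candidates with an infrequent k-subset.
        candidates = {c for c in candidates
                      if all(frozenset(s) in Lk for s in combinations(c, k))}
        # Count surviving candidates in one database scan.
        counts = {c: sum(1 for t in transactions if c <= t) for c in candidates}
        Lk = {c for c, cnt in counts.items() if cnt >= min_count}
        frequent.update((c, counts[c]) for c in Lk)
    return frequent

# The example database used on the next slide.
D = [{1, 3, 4}, {2, 3, 5}, {1, 2, 3, 5}, {2, 5}]
print(apriori(D, min_support=0.5))
```

On this database the sketch recovers L1 = {1}, {2}, {3}, {5}, L2 = {1,3}, {2,3}, {2,5}, {3,5}, and L3 = {2,3,5}, as traced in the example slide.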
The Apriori Algorithm — Example

Database D:
TID | Items
100 | 1, 3, 4
200 | 2, 3, 5
300 | 1, 2, 3, 5
400 | 2, 5

Scan D to count candidate 1-itemsets:
C1: {1}: 2, {2}: 3, {3}: 3, {4}: 1, {5}: 3
L1: {1}: 2, {2}: 3, {3}: 3, {5}: 3

Generate C2 from L1 and scan D:
C2: {1,2}: 1, {1,3}: 2, {1,5}: 1, {2,3}: 2, {2,5}: 3, {3,5}: 2
L2: {1,3}: 2, {2,3}: 2, {2,5}: 3, {3,5}: 2

Generate C3 from L2 and scan D:
C3: {2,3,5}
L3: {2,3,5}: 2
Candidate Generation
• Suppose the items in Lk−1 are listed in an order
• Step 1: self-joining Lk−1
insert into Ck
select p.item1, p.item2, …, p.itemk−1, q.itemk−1
from Lk−1 p, Lk−1 q
where p.item1 = q.item1, …, p.itemk−2 = q.itemk−2, p.itemk−1 < q.itemk−1

• Step 2: pruning
forall itemsets c in Ck do
    forall (k−1)-subsets s of c do
        if (s is not in Lk−1) then delete c from Ck
To Count Supports of Candidates

• Why counting supports of candidates is a problem:
• The total number of candidates can be huge
• Each transaction may contain many candidates
• Method:
• Candidate itemsets are stored in a hash tree
• A leaf node of the hash tree contains a list of itemsets and
counts
• An interior node contains a hash table
• Subset function: finds all the candidates contained in a
transaction
Example of Generating Candidates

• L3={abc, abd, acd, ace, bcd}

• Self-joining: L3*L3
• abcd from abc and abd
• acde from acd and ace

• Pruning:
• acde is removed because ade is not in L3

• C4={abcd}
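This join-then-prune example can be checked with a short sketch (itemsets represented as sorted tuples, joined on their shared prefix; the function name is illustrative):

```python
from itertools import combinations

def gen_candidates(L_prev):
    """Self-join L(k-1) on the first k-2 items, then prune any
    candidate with an infrequent (k-1)-subset (the prune step)."""
    prev = set(L_prev)
    k = len(next(iter(prev))) + 1
    joined = set()
    for p in prev:
        for q in prev:
            # Join condition: same prefix, last items in order.
            if p[:-1] == q[:-1] and p[-1] < q[-1]:
                joined.add(p + (q[-1],))
    return {c for c in joined
            if all(s in prev for s in combinations(c, k - 1))}

L3 = {("a", "b", "c"), ("a", "b", "d"), ("a", "c", "d"),
      ("a", "c", "e"), ("b", "c", "d")}
print(gen_candidates(L3))  # {('a', 'b', 'c', 'd')}: acde pruned since ade is not in L3
```

The join produces abcd and acde; pruning removes acde, leaving C4 = {abcd} as on the slide.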
Improving Apriori’s Efficiency
• Hash-based itemset counting: A k-itemset whose corresponding hashing
bucket count is below the threshold cannot be frequent
• Transaction reduction: A transaction that does not contain any frequent k-
itemset is useless in subsequent scans
• Partitioning: Any itemset that is potentially frequent in DB must be frequent
in at least one of the partitions of DB
• Sampling: mining on a subset of the given data; needs a lower support threshold
plus a method to determine completeness
• Dynamic itemset counting: add new candidate itemsets immediately
(unlike Apriori) when all of their subsets are estimated to be frequent
Is Apriori Fast Enough? - Performance
Bottlenecks
• The core of the Apriori algorithm:
• Use frequent (k − 1)-itemsets to generate candidate frequent k-itemsets
• Use database scans and pattern matching to collect counts for the
candidate itemsets
• The bottleneck of Apriori: candidate generation
• Huge candidate sets:
• 10^4 frequent 1-itemsets will generate more than 10^7 candidate 2-itemsets
• To discover a frequent pattern of size 100, e.g., {a1, a2, …, a100}, one
needs to generate 2^100 ≈ 10^30 candidates.
• Multiple scans of the database:
• Needs (n + 1) scans, where n is the length of the longest pattern
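The candidate blow-up quoted above is plain combinatorics: 10^4 frequent 1-itemsets yield C(10^4, 2), roughly 5 × 10^7, candidate pairs, and a size-100 pattern has on the order of 2^100 candidate subpatterns (a quick check, assuming nothing beyond standard counting):

```python
import math

# 10^4 frequent 1-itemsets give C(10^4, 2) candidate 2-itemsets.
pair_candidates = math.comb(10**4, 2)
print(pair_candidates)  # 49995000, on the order of 10^7

# A size-100 pattern forces ~2^100 candidate subpatterns overall.
print(f"{2**100:.3e}")  # about 1.268e+30
```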
Summary
• Association mining
• Association rules
• Mining frequent itemsets
• Apriori Algorithm
• Improving Apriori’s Efficiency

Assignment
• Discuss the concept of frequent itemsets.
• Discuss the methods for improving Apriori’s efficiency.
• Illustrate the steps of the Apriori algorithm with an example.

References
TEXT BOOKS
T1: Tan, Steinbach and Vipin Kumar. Introduction to Data Mining, Pearson Education, 2016.
T2: Zaki MJ, Meira Jr W, Meira W. Data mining and machine learning: Fundamental concepts and algorithms.
Cambridge University Press; 2020 Jan 30.
T3: King RS. Cluster analysis and data mining: An introduction. Mercury Learning and Information; 2015 May
12.

REFERENCE BOOKS
R1: Pei, Han and Kamber. Data Mining: Concepts and Techniques, Elsevier, 2011.
R2: Halgamuge SK, Wang L, editors. Classification and clustering for knowledge discovery. Springer Science
& Business Media; 2005 Sep 2.
R3: Bhatia P. Data mining and data warehousing: principles and practical techniques. Cambridge University
Press; 2019 Jun 27.

JOURNALS
• https://www.igi-global.com/journal/international-journal-data-warehousing-mining/1085
• https://www.springer.com/journal/41060
• https://link.springer.com/journal/10618
References
RESEARCH PAPER
• Alasadi SA, Bhaya WS. Review of data preprocessing techniques in data mining. Journal of Engineering and Applied
Sciences. 2017 Sep;12(16):4102-7.
• Freitas AA. A survey of evolutionary algorithms for data mining and knowledge discovery. In: Advances in Evolutionary
Computing: Theory and Applications. 2003 Jan 1 (pp. 819-845). Berlin, Heidelberg: Springer Berlin Heidelberg.
• Kumbhare TA, Chobe SV. An overview of association rule mining algorithms. International Journal of Computer
Science and Information Technologies. 2014 Feb;5(1):927-30.
• Srivastava S. Weka: a tool for data preprocessing, classification, ensemble, clustering and association rule mining.
International Journal of Computer Applications. 2014 Jan 1;88(10).
• Dol SM, Jawandhiya PM. Classification technique and its combination with clustering and association rule mining in
educational data mining – A survey. Engineering Applications of Artificial Intelligence. 2023 Jun 1;122:106071.

• WEB LINK
http://www.dataminingzone.weebly.com/uploads/6/5/9/4/6594749/ch14_min_assoc_rules.pdf

• VIDEO LINK
https://youtu.be/m5c27rQtD2E
THANK YOU

For queries
Email: [email protected]
