0% found this document useful (0 votes)
127 views

Data Mining

This document provides an overview of data mining concepts across 5 units. It introduces key data mining topics like association rule mining, classification, clustering, outlier detection, and preprocessing. It also discusses related areas such as machine learning algorithms, data warehousing, online analytical processing, and web mining. The document aims to equip readers with fundamental knowledge of data mining techniques and applications.

Uploaded by

Anthony Luna
Copyright
© © All Rights Reserved
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
127 views

Data Mining

This document provides an overview of data mining concepts across 5 units. It introduces key data mining topics like association rule mining, classification, clustering, outlier detection, and preprocessing. It also discusses related areas such as machine learning algorithms, data warehousing, online analytical processing, and web mining. The document aims to equip readers with fundamental knowledge of data mining techniques and applications.

Uploaded by

Anthony Luna
Copyright
© © All Rights Reserved
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
You are on page 1/ 3

Data Mining

Unit 1:
Introduction: Why Data Mining? What Is Data Mining? - What Kinds Of Data Can Be
Mined?-What Kinds Of Patterns Can Be Mined?-Which Technologies Are Used?-Which Kinds
Of Applications Are Targeted? - Major Issues in Data Mining. Data Mining: KDD Vs Data
Mining-DBMS Vs DM. Data Warehousing: Introduction-What Is Data Warehousing? Mining
Frequent Patterns Associations And Correlations: Basic Concepts And Methods-Basic
Concepts-Frequent Itemset Mining Methods-Which Patterns Are Interesting?-Patterns
Evaluation Methods. Other Techniques: Introduction- What Is Neural Network? - Learning in
NN-Unsupervised Learning-Data Mining Using NN: A Case Study- Genetic Algorithm.
Algorithms For Classification An Regression: Group Method of Data Handling
(GMDH).Classification: Alternative Techniques- Rule Base Classifier-Nearest-Neighbor
Classifiers-Bayesian Classifiers.

Unit 2:
Association Rules: Introduction- What Is An Association Rule?- Methods To Discover
Association Rules- Apriori Algorithm- Partition Algorithm- Pincer Search Algorithm- Dynamic
Itemset Counting Algorithm- FP-Tree Growth Algorithm -clat And Declat -Rapid Association
Rule Mining- Discussion On Different Algorithms- Incremental Algorithm- Border AlgorithmGeneralized Association Rule-Association Rules With Item Constraints. Data Preprocessing:
An Overview-Data Cleaning-Data Integration-Data Reduction-Data Transformation and Data
Discretization. Preprocessing And Postprocessing In Data Mining: Introduction- Steps In
Preprocessing- Discretization Feature Extraction ,Selection And Construction- Missing Data And
Methodological Techniques For Dealing It- Example Of Dealing Missing Data In Decision Tree
Induction- Postprocessing.Classification:Advanced Methods- Bayesian Belief NetworksClassification By Back propagation- Support Vector Machines- Classification Using Frequent
Patterns- Lazy Learners- Other Classification Methods-Additional Topics Regarding
Classification.

Unit 3:
Cluster Analysis: What Is Cluster Analysis? -Desired Features Of Cluster Analysis- Types Of
Data- Computing Distance- Types Of Cluster Analysis Methods- Partitional MethodsHierarchical Methods Density Based Methods- Dealing With Large Databases -Quality And
Validity Of Cluster Analysis Methods- Cluster Analysis Software. Advanced Cluster Analysis:
Probabilistic Model Based Clustering Clustering High Dimensional Data- Clustering Graph
And Network Data- Clustering With Constraints. Outlier Detection: Outliers And Outlier
Analysis-Outlier Detection Methods- Statistical Approaches- Proximity-Based Approaches-

Clustering-Based Approaches- Classification-Based Approaches- Mining Contextual And


Collective Outliers-Outlier Detection In High-Dimensional Data. Cluster Analysis:
Introduction- Partitional Clustering -K-Medoids- Moderns Clustering Methods-Birch-DBSCANOptics-Clustering Based On Graph Partitioning-CHAMELEON: A Two Phase Clustering
Algorithm-The COBWEB Conceptual Clustering Algorithm-GCLUTO. Cluster Analysis:
Based Concepts and Algorithms: DBSCAN- Cluster Evaluation. Cluster Analysis: Additional
Issues And Algorithms: Characteristics Of Data, Clusters, and Clustering AlgorithmsPrototypes Based Clustering- Density Based Clustering- Graph Based Clustering- Scalable
Clustering Algorithms.

Unit 4:
Datasets: Introduction- Contact Lenses- Iris Plants Database- Breast Cancer Database- Wage
Data- Credit Database- Housing Database- 1985 Auto Imports Database- Badge Problem.
Machine Learning With Open Resource and Commercial Software: Machine Learning With
Weka XLminerTM.
Data Warehousing: Introduction- Operational Data Stores- ETL-Data
Warehouses- Data Warehouses Design- Guidelines For Data Warehouse Implementation- Data
Warehouse Metadata Software For ODS, ZLE, ETL And Data Warehousing. Online
Analytical Processing(OLAP): Introduction- OLAP-Characteristics Of OLAP SystemsMotivation For Using OLAP-Multidimensional View And Data Cube- Data Cube
Implementations- Data Cube Operations- Guidelines For OLAP Implementation- OLAP
Software. Data Transformation: Attribute Selection- Discretizing Numeric AttributesProjection- Sampling Cleansing- Transforming Multiple Classes To Binary Ones- Calibrating
Class Probabilities. Ensemble Learning: Combining Multiple Models BuggingRandomization- Boosting- Additive Regression- Interpretable Ensembles -Stacking. Moving
On: Applications And Beyond: Applying Data mining- Learning From Massive DatasetsDataStream Learning -Incorporating Domain Knowledge- Text Mining -Web Mining
-Adversarial Situations- Ubiquitous Data mining. Data Warehousing And Online Analytical
Processing: Data Warehouse: Basic Concepts-Data Warehouse Modeling: Data Cube And
OLAP- Data Warehouse Design And Usage- Data Warehouse Implementation-Data
Generalization By Attribute-Oriented Inuction.OLAP: Introduction-OLAP-Characteristic Of
OLAP System Motivation For Using OLAP-Multidimensional View And Data Cube-Data Cube
Implementation-Data Cube Operation-Guidelines For OLAP-Implementation .Visualizing An
Exploring Data: Introduction-Summarizing Data: Some Simple Examples-Tools For Displaying
More Than Two Variables- Principal Components: And Lyses Multidimensional Scaling.
Linear Algebra: Matrices. Probability Statistics: Probability- Statistics- Hypothesis Testing.
Optimization: Unconstrained Optimization- Constrained Optimization. Embedded Machine
Learning: A Simple Data Mining Application.

Unit 5:
Data Mining Trends And Research Frontiers: Mining Complex Data Types- Other
Methodologies Of Data Mining-Data Mining Applications- Data Mining And Society. Rough
Set Theory: Introduction-Definition- Example- Reduct- Propositional Reasoning And Piap To
Compute Reducts- Types Of Reducts- Rule Extraction- Decision Tree- Rough Sets And Fuzzy
Sets- Granular Computing. Other Techniques: Introduction- What Is Neural Network?Learning In NN- Unsupervised Learning- Data Mining Using NN:A Case Study- Genetic
Algorithm-Support Vector Machine. Web Mining: Introduction- Web Mining- Web Content
Mining- Web structure Mining- Web Usage Mining- Text Mining-Unstructured Text- Episode
Rule Discovery Of Texts- Hierarchy Of Categories- Text Clustering. Temporal And Spatial
Data Mining: Introduction- What Is Temporal Data Mining?- Temporal Association RulesSequences Mining- The GSP Algorithm- SPADE SPIRIT WUM- Episode Discovery- Event
Prediction Problem- Time Series Analysis- Spatial Mining- Spatial Mining Task- Spatial
Clustering Spatial Trends. Search Engines: Introduction- Characteristics Of Search EngineSearch Engine Functionality- Search Engine Architecture- Ranking Of Web Pages- The Search
Engine Industry- Enterprise Search- Enterprise Search Engine Software. Web Data Mining:
Introduction Web Terminology and Characteristics-Locality and Hierarchy in the Web- Web
Content Mining- Web Usage Mining- Web Mining Software- Web Structure Mining. Search
And Optimization Methods: Introduction- Searching For Models And Patterns: Background On
Search- The State-Space Formulation For Search In Data Mining- A Simple Greedy Search
Algorithm- System Search And Search Heuristics- Branch And Bound. Parameter
Optimization: Parameter Optimization: Background Closed Form And Linear Algebra Method
Gradient- Based Methods For Optimizing Smooth Functions- Univariate Parameter
Optimization- Multivariate Parameter Optimization-Constrained Optimization- Optimization
With Missing Data: The EM Algorithm- Online And Single Scan Algorithm Stochastic Search
And Optimization Techniques.

You might also like