Web Mining, by Pawan Singh, Piyush Arora, Pooja Mansharamani, Pramod Singh, and Praveen Kumar
Outline
Introduction
Web Mining
Web Content Mining
Web Structure Mining
Web Usage Mining
Conclusion & Exam Questions
Four Problems
Personalizing the information
Catering to personal preference in content and presentation (associated with the type and presentation of the information)
Other Approaches
Direct vs. Indirect Web Mining
The Research
Web Mining: Definition
Web Mining: Subtasks
Resource finding: retrieving the intended documents
Information selection/pre-processing: selecting and pre-processing specific information from retrieved web resources
Generalization: discovering general patterns within and across web sites
Analysis: validation and/or interpretation of the mined patterns
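The four subtasks can be read as stages of a pipeline. The following is only a minimal sketch under stated assumptions: the "web" is a hypothetical in-memory dictionary (PAGES), resource finding is reduced to a keyword match, and the "pattern" mined is simply which terms occur in several documents. None of these choices come from the slides; they only illustrate how the stages hand data to each other.

```python
import re
from collections import Counter

# Hypothetical in-memory "web" (URL -> raw HTML); invented for illustration.
PAGES = {
    "http://a.example/1": "<html><body><h1>Data mining</h1><p>Mining web data.</p></body></html>",
    "http://b.example/2": "<html><body><p>Web usage mining uses logs.</p></body></html>",
    "http://c.example/3": "<html><body><p>Cooking recipes.</p></body></html>",
}

def find_resources(pages, keyword):
    """Resource finding: keep documents that mention the keyword."""
    return {url: html for url, html in pages.items()
            if keyword.lower() in html.lower()}

def preprocess(html):
    """Selection/pre-processing: strip tags, lowercase, split into words."""
    text = re.sub(r"<[^>]+>", " ", html)
    return re.findall(r"[a-z]+", text.lower())

def generalize(token_lists):
    """Generalization: count in how many documents each term appears."""
    doc_freq = Counter()
    for tokens in token_lists:
        doc_freq.update(set(tokens))
    return doc_freq

def analyze(doc_freq, min_docs):
    """Analysis: keep only terms supported by at least min_docs documents."""
    return sorted(term for term, n in doc_freq.items() if n >= min_docs)

def run_pipeline(pages, keyword, min_docs=2):
    found = find_resources(pages, keyword)
    token_lists = [preprocess(html) for html in found.values()]
    return analyze(generalize(token_lists), min_docs)

if __name__ == "__main__":
    print(run_pipeline(PAGES, "mining"))
```

A real system would replace each stage with something much richer (a crawler or search engine, an HTML parser, a learning or mining algorithm, human validation), but the flow of data between stages stays the same.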
Web Mining: Not IR
Web Mining: Not IE
IE - IR
Types of IE
Web Mining and Machine Learning
Machine learning is concerned with the development of algorithms and techniques that allow computers to "learn".
Web mining is NOT the same as learning from the Web.
Some applications of machine learning on the Web are NOT web mining.
The methods used for web mining are NOT limited to machine learning.
Even so, there is a close relationship between web mining and machine learning.
Web Mining and Machine Learning
Web Mining Categories
Web Structure Mining
Web Usage Mining
Tries to predict user behavior from interaction with the Web
Wide range of data (logs):
  Web client data
  Proxy server data
  Web server data
Two common approaches:
  Map usage data into relational tables before using adapted data mining techniques
  Use log data directly by utilizing special pre-processing techniques
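The first approach, mapping usage data into relational tables, can be sketched roughly as follows. This is an illustration under assumptions, not the method from the slides: it assumes web server logs in the Common Log Format, uses made-up sample lines (SAMPLE_LOG), and loads them into a hypothetical `hits` table in an in-memory SQLite database so that ordinary SQL can stand in for the "adapted data mining techniques".

```python
import re
import sqlite3

# Common Log Format: host ident authuser [date] "request" status bytes
CLF_PATTERN = re.compile(
    r'(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) [^"]*" (?P<status>\d{3}) (?P<size>\S+)'
)

# Made-up sample lines for illustration only.
SAMPLE_LOG = [
    '10.0.0.1 - - [10/Oct/2000:13:55:36 -0700] "GET /index.html HTTP/1.0" 200 2326',
    '10.0.0.1 - - [10/Oct/2000:13:56:01 -0700] "GET /products.html HTTP/1.0" 200 5120',
    '10.0.0.2 - - [10/Oct/2000:13:57:12 -0700] "GET /index.html HTTP/1.0" 200 2326',
    'not a valid log line',
]

def load_log(lines):
    """Parse log lines and store them in a relational table (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE hits (host TEXT, time TEXT, method TEXT, "
        "path TEXT, status INTEGER, size INTEGER)"
    )
    for line in lines:
        m = CLF_PATTERN.match(line)
        if m is None:
            continue  # skip lines that do not follow the expected format
        size = 0 if m["size"] == "-" else int(m["size"])
        conn.execute(
            "INSERT INTO hits VALUES (?, ?, ?, ?, ?, ?)",
            (m["host"], m["time"], m["method"], m["path"], int(m["status"]), size),
        )
    conn.commit()
    return conn

def page_counts(conn):
    """Number of requests per page, most requested first."""
    return conn.execute(
        "SELECT path, COUNT(*) AS n FROM hits GROUP BY path ORDER BY n DESC, path"
    ).fetchall()

if __name__ == "__main__":
    print(page_counts(load_log(SAMPLE_LOG)))
```

Once the log is in tabular form, questions such as "which pages does one host visit in sequence" become queries or inputs to standard mining algorithms. The second approach skips the table and works on the pre-processed log records directly.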
Thank you!