addition of unit 1 3 4 5
This commit is contained in:
20
unit 5/02_Web_Mining.md
Normal file
20
unit 5/02_Web_Mining.md
Normal file
@@ -0,0 +1,20 @@
|
||||
# Web Mining
|
||||
|
||||
**Web Mining** is using data mining techniques to discover useful information from the World Wide Web.
|
||||
|
||||
## Types of Web Mining
|
||||
|
||||
### 1. Web Content Mining
|
||||
- **What**: Mining the **actual content** of web pages.
|
||||
- **Data**: Text, images, audio, video.
|
||||
- **Example**: Analyzing reviews on Amazon to see if people like a product (Sentiment Analysis).
|
||||
|
||||
### 2. Web Structure Mining
|
||||
- **What**: Mining the **links** (hyperlinks) between pages.
|
||||
- **Goal**: To find important pages (Authorities) and pages that link to many others (Hubs).
|
||||
- **Example**: Google's **PageRank** algorithm uses this to rank search results.
|
||||
|
||||
### 3. Web Usage Mining
|
||||
- **What**: Mining **user activity** logs.
|
||||
- **Data**: Server logs, browser history, clicks.
|
||||
- **Example**: Analyzing which pages users visit most often and where they leave the site.
|
||||
Reference in New Issue
Block a user