DATABASE 4
Data warehouses consolidate data from several sources and ensure data
accuracy, quality, and consistency. System performance improves because
analytical processing is separated from the transactional databases. In a data
warehouse, data is organized into a structured format by type and as needed.
Query tools then examine the data using several patterns.
Data warehouses store historical data and answer analytical requests faster,
which supports online analytical processing (OLAP). An operational database,
by contrast, stores the current transactions of a business process, which is
called online transaction processing (OLTP).
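The contrast between the two workloads can be illustrated with a minimal sketch. It uses Python's built-in `sqlite3` module purely for illustration (a real warehouse would be a separate system), and the `sales` table and its columns are hypothetical names not taken from the text.

```python
import sqlite3

# In-memory database used only for this illustration.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE sales (id INTEGER PRIMARY KEY, region TEXT, year INTEGER, amount REAL)"
)

# OLTP-style work: small writes that record individual transactions.
rows = [
    ("North", 2022, 120.0),
    ("North", 2023, 150.0),
    ("South", 2022, 80.0),
    ("South", 2023, 95.0),
]
conn.executemany("INSERT INTO sales (region, year, amount) VALUES (?, ?, ?)", rows)

# OLAP-style work: one query that aggregates over the accumulated history.
olap_result = conn.execute(
    "SELECT region, SUM(amount) FROM sales GROUP BY region ORDER BY region"
).fetchall()
print(olap_result)
```

The inserts touch one row at a time, while the final query scans and summarizes many rows, which is the kind of request a warehouse is organized to answer.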
• Integrated:
A data warehouse is built by combining data from different heterogeneous
sources, such as flat files or relational databases.
• Time-Variant:
The data collected in a data warehouse is identified with a specific period.
• Nonvolatile:
This means earlier data is not deleted when new data is added to the data
warehouse. The operational database and the data warehouse are kept separate,
so continuous changes in the operational database are not reflected in
the data warehouse.
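The time-variant and nonvolatile properties can be sketched as an append-only load. This is an illustrative assumption about how such a load might look, not a description of any particular product; `load_snapshot` and the record fields are hypothetical.

```python
from datetime import date

warehouse = []  # append-only: rows are added, never updated or deleted


def load_snapshot(warehouse, operational_rows, snapshot_date):
    """Copy the current operational rows into the warehouse, tagged with a period."""
    for row in operational_rows:
        warehouse.append({**row, "snapshot_date": snapshot_date})


operational = [{"customer": "C1", "balance": 100}]
load_snapshot(warehouse, operational, date(2024, 1, 31))

# The operational database changes; the warehouse copy is unaffected.
operational[0]["balance"] = 250
load_snapshot(warehouse, operational, date(2024, 2, 29))

# Both periods are kept, so the history can be compared later.
history = [(r["snapshot_date"].isoformat(), r["balance"]) for r in warehouse]
print(history)
```

Each load adds rows stamped with their period instead of overwriting earlier ones, so the January value survives the February load.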
Data warehousing is used in sectors such as:
• Consumer goods
• Banking services
• Financial services
• Manufacturing
• Retail sectors
Advantages of Data Warehousing
• The data warehouse’s job is to make any form of corporate data easier
to understand. Most of the user’s work consists of supplying the raw
data.
• The capacity to update continuously and frequently is the key benefit
of this technology. As a result, data warehouses are perfect for
organizations and entrepreneurs who want to stay current with their
target audience and customers.
• It makes data more accessible to businesses and organizations.
• A data warehouse holds a large volume of historical data that users
can use to compare different periods and trends in order to make
predictions for the future.
Disadvantages of Data Warehousing
• There is a great risk of accumulating irrelevant and useless data. Data
loss and erasure are other potential issues.
• Data is gathered from various sources in a data warehouse. Cleansing
and transformation of the data are required. This could be a difficult
task.
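A minimal sketch of the cleansing and transformation step mentioned above, assuming two hypothetical sources that format names and amounts inconsistently. The function `clean_record` and the field names are illustrative only.

```python
def clean_record(raw):
    """Return a normalized record, or None if the record cannot be used."""
    name = raw.get("name", "").strip().title()
    amount_text = str(raw.get("amount", "")).replace(",", "").strip()
    try:
        amount = float(amount_text)
    except ValueError:
        return None  # amount is not a number
    if not name:
        return None  # a record without a name is treated as unusable
    return {"name": name, "amount": amount}


# Two hypothetical sources with different conventions.
source_a = [{"name": "  alice ", "amount": "1,200.50"}]
source_b = [{"name": "BOB", "amount": 300}, {"name": "", "amount": "n/a"}]

cleaned = [
    rec for raw in source_a + source_b if (rec := clean_record(raw)) is not None
]
print(cleaned)
```

Records that cannot be repaired are dropped here for simplicity; in practice they might instead be logged or sent for manual review.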
What is Data Mining?
In data mining, data is extracted and analyzed to obtain useful information.
Hidden patterns in the dataset are studied to predict future behavior.
Data mining is the process of finding patterns and correlations within large
data sets to identify relationships between data. Data mining tools allow a
business organization to predict customer behavior, build risk models, and
detect fraud. Data mining is used in market analysis and management, fraud
detection, corporate analysis, and risk management.
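One simple form of pattern discovery is counting which items occur together. The sketch below is only an illustration of that idea on made-up shopping baskets; the data and the `min_support` threshold are assumptions, and real data mining tools use far more elaborate methods.

```python
from collections import Counter
from itertools import combinations

# Hypothetical purchase records, one set of items per customer visit.
transactions = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"butter", "milk"},
    {"bread", "milk", "eggs"},
]

# Count how often each pair of items appears in the same basket.
pair_counts = Counter()
for basket in transactions:
    for pair in combinations(sorted(basket), 2):
        pair_counts[pair] += 1

# Keep only pairs seen at least min_support times.
min_support = 3
frequent_pairs = {pair: n for pair, n in pair_counts.items() if n >= min_support}
print(frequent_pairs)
```

A pair that clears the threshold suggests a relationship worth investigating, such as placing the two products together.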
| | Data warehouse | Data mining |
|---|---|---|
| Definition | A data warehouse is a database system that is designed for analytical analysis instead of transactional work. | Data mining is the process of analyzing data patterns. |
| Process | Data is stored periodically. | Data is analyzed regularly. |
| Functionality | Subject-oriented, integrated, time-varying and non-volatile constitute data warehouses. | AI, statistics, databases, and machine learning systems are all used in data mining technologies. |
### 5. **NewSQL**
Internet database
2. **Access**: Users or applications connect to the database through the internet using
websites, apps, or APIs (Application Programming Interfaces).
- **Google Search**: The data about billions of websites is stored in an internet database.
- **Online Shopping**: Platforms like Amazon use databases to manage product catalogs,
user accounts, and orders.
- **Social Media**: Facebook and Instagram store data like posts, messages, and user
profiles in internet databases.
- **Streaming Services**: Netflix and Spotify use databases to manage their vast libraries
of movies, TV shows, and music.
### **Key Features**
1. **Remote Access**: Data can be accessed from anywhere in the world using the
internet.
2. **Real-Time Updates**: Changes to the database are instantly reflected for all users.
3. **Multi-User Access**: Many users can access and use the database at the same time.
An internet database is essential for running modern online services and applications.
In simple terms, an internet database is the backbone of most online services, ensuring
data is always available and up-to-date for users around the globe.
1. **Storing Data**:
2. **Organizing Data**:
- The database organizes resources into
categories or collections for easy navigation
(e.g., by topic, author, or publication date).
3. **Searching**:
4. **Retrieving Data**:
1. **Metadata**:
- Descriptive information about resources,
such as title, author, publication year, file
type, and subject.
2. **Indexing**:
3. **Search Engine**:
4. **Storage System**:
1. **Storage**:
2. **Organization**:
4. **Access**:
- **Advantages**
**Challenges**
1. **Large Files**: Multimedia files take up a
lot of storage space.
MULTIMEDIA DATABASE
2. **Key Features**
- **Lightweight:** Mobile databases are designed to use
minimal resources, making them suitable for devices with
constrained hardware.
- **Synchronization:** They support synchronization with a
central database to ensure data consistency across devices.
- **Offline Access:** They allow users to access and update
data without requiring an active internet connection.
- **Cross-platform Support:** Many mobile databases are
designed to work across various operating systems like
Android, iOS, and Windows.
2. **Realm**
- Designed specifically for mobile devices.
- High performance and offline-first approach.
- Supports complex queries and real-time updates.
4. **Couchbase Lite**
- Embedded NoSQL database with synchronization
capabilities.
- Ideal for apps needing strong offline functionality.
7. **Challenges**
- **Resource Limitations:** Mobile devices have limited
storage and processing capabilities.
- **Security:** Ensuring data is encrypted and secure during
storage and transmission.
- **Synchronization Issues:** Managing conflicts when
multiple devices update the same data.
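The synchronization conflict described above can be sketched with a "last write wins" rule. This is one simple policy chosen for illustration, not the method used by Realm, Couchbase Lite, or any other product named here; the function name and the integer timestamps are hypothetical.

```python
def merge_last_write_wins(local, remote):
    """Merge two {key: (value, updated_at)} maps, keeping the newer entry per key."""
    merged = dict(remote)
    for key, (value, updated_at) in local.items():
        if key not in merged or updated_at > merged[key][1]:
            merged[key] = (value, updated_at)
    return merged


# The phone edited note1 while offline; the server holds a later edit of note1.
phone = {"note1": ("buy milk", 10), "note2": ("call Sam", 12)}
server = {"note1": ("buy milk and eggs", 15), "note3": ("pay rent", 8)}

merged = merge_last_write_wins(phone, server)
print(merged)
```

Last write wins is easy to implement but silently discards the older edit, which is why conflict handling is listed as a challenge.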
Spatial database
A **spatial database** is a database designed to store,
query, and manage spatial data, which is information related
to the position, shape, and size of objects in space. Spatial
data is used to represent geographic objects, such as roads,
rivers, buildings, or even entire regions.
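
A typical spatial query asks which objects lie inside a given area. The sketch below shows the idea with a plain rectangular search over points; the `Place` type, coordinates, and `within_box` function are hypothetical, and real spatial databases use spatial indexes (for example R-trees) rather than scanning every object.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Place:
    name: str
    x: float  # horizontal position
    y: float  # vertical position


def within_box(places, min_x, min_y, max_x, max_y):
    """Return the names of places whose position falls inside the rectangle."""
    return [
        p.name
        for p in places
        if min_x <= p.x <= max_x and min_y <= p.y <= max_y
    ]


places = [
    Place("school", 2.0, 3.0),
    Place("bridge", 8.0, 1.0),
    Place("park", 4.5, 4.0),
]

nearby = within_box(places, 0.0, 0.0, 5.0, 5.0)
print(nearby)
```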