Module-1
Module-1
✓ What is DBMS
✓ Course Outcomes
10-09-2020
Department of Information Science and Engg
1
Transform Here
Module-1
Introduction to Databases
10-09-2020
Department of Information Science and Engg
2
Transform Here
Topics of Module-1
Introduction to Databases: Introduction, Characteristics of database approach,
Advantages of using the DBMS approach, History of database applications.
Overview of Database Languages and Architectures: Data Models, Schemas, and
Instances. Three schema architecture and data independence, database languages,
and interfaces, The Database System environment. Conceptual Data Modelling
using Entities and Relationships: Entity types, Entity sets, attributes, roles, and
structural constraints, Weak entity types, ER diagrams, examples, Specialization
and Generalization. 10 Hours
Textbook 1:Ch 1.1 to 1.8, 2.1 to 2.6, 3.1 to 3.10
Text Books:
1. Fundamentals of Database Systems, Ramez Elmasri and Shamkant B. Navathe,
7th Edition, 2017, Pearson.
2. Database management systems, Ramakrishnan, and Gehrke, 3rd Edition, 2014,
McGraw Hill
10-09-2020
Department of Information Science and Engg
3
Transform Here
Learning Objectives
10-09-2020
Department of Information Science and Engg
4
Transform Here
Quote for the Day
With data collection, ‘the sooner the better’ is always the best answer.”
- Marissa Mayer
American businesswoman and investor
“You can have data without information, but you cannot have information
without data.”
- Daniel Keys Moran
American computer programmer and science fiction writer
10-09-2020
Department of Information Science and Engg
5
Transform Here
Basic Definitions
10-09-2020
Department of Information Science and Engg
6
Transform Here
Basic Definitions (Cont…)
Database System
DBMS Software
3. Data Abstraction:
• A data model is used to hide storage details and present the users with a
conceptual view of the database.
• Programs refer to the data model constructs rather than data storage details
10-09-2020
Department of Information Science and Engg
9
Transform Here
Main Characteristics of the Database Approach ( Cont…)
10-09-2020
Department of Information Science and Engg
10
Transform Here
Advantages of using the DBMS Approach (Cont…)
• Controlling redundancy in data storage and in development and
maintenance efforts.
• Sharing of data among multiple users.
• Restricting unauthorized access to data.
• Providing persistent storage for program Objects
• Providing Storage Structures (e.g. indexes) for efficient Query Processing.
• Providing backup and recovery services.
• Providing multiple interfaces to different classes of users.
• Representing complex relationships among data.
• Enforcing integrity constraints on the database.
• Drawing inferences and actions from the stored data using deductive
and active rules
10-09-2020
Department of Information Science and Engg
11
Transform Here
Historical Development of Database Technology
10-09-2020
Department of Information Science and Engg
12
Transform Here
Historical Development of Database Technology (Cont…)
Object-oriented and emerging applications:
• Object-Oriented Database Management Systems (OODBMSs) were introduced
in late 1980s and early 1990s to cater to the need of complex data
processing in CAD and other applications.
• Many relational DBMSs have incorporated object database concepts, leading
to a new category called object-relational DBMSs (ORDBMSs)
• Extended relational systems add further capabilities (e.g. for multimedia data,
XML, and other data types)
10-09-2020
Department of Information Science and Engg
13
Transform Here
Extending Database Capabilities
• Scientific Applications
• XML (eXtensible Markup Language)
• Image Storage and Management
• Audio and Video Data Management
• Data Warehousing and Data Mining
• Spatial Data Management
• Time Series and Historical Data Management
The above gives rise to new research and development in incorporating new data
types, complex data structures, new operations and storage and indexing schemes
in database systems.
10-09-2020
Department of Information Science and Engg
14
Transform Here
Overview of Database Languages and Architectures ( Cont…)
10-09-2020
Department of Information Science and Engg
15
Transform Here
Overview of Database Languages and Architectures ( Cont…)
10-09-2020
Department of Information Science and Engg
16
Transform Here
Categories of Data Models
Conceptual (high-level, semantic) data models:
Provide concepts that are close to the way many users perceive
data. Also called entity-based or object-based data models.
Physical (low-level, internal) data models:
10-09-2020
Department of Information Science and Engg
17
Transform Here
Schemas versus Instances
Database Schema:
Schema Diagram:
Schema Construct:
10-09-2020
Department of Information Science and Engg
18
Transform Here
Schemas versus Instances (Cont…)
Database State:
This includes the collection of all the data in the database.
Also called database instance (or occurrence or snapshot).
The term instance is also applied to individual database components,
Ex: record instance, table instance, entity instance
The actual data stored in a database at a particular moment in time.
10-09-2020
Department of Information Science and Engg
19
Transform Here
Database Schema vs. Database State
Database State: Refers to the content of a database at a moment in time.
Initial Database State: Refers to the database state when it is initially loaded
into the system.
Valid State: A state that satisfies the structure and constraints of the
database.
Distinction:
• The database schema changes very infrequently.
10-09-2020
Department of Information Science and Engg
20
Transform Here
Schema Diagram for the database
• Student and Course Information Database
10-09-2020
Department of Information Science and Engg
21
Transform Here
Three-Schema Architecture
10-09-2020
Department of Information Science and Engg
22
Transform Here
The Three-schema architecture
10-09-2020
Department of Information Science and Engg
23
Transform Here
The Three-schema Architecture (Cont..)
Mappings among schema levels are needed to transform requests and data.
Programs refer to an external schema, and are mapped by the DBMS to the
internal schema for execution.
Data extracted from the internal DBMS level is reformatted to match the
user’s external view (e.g. formatting the results of an SQL query for display
in a Web page)
10-09-2020
Department of Information Science and Engg
24
Transform Here
Data Independence
• Logical Data Independence:
• The capacity to change the conceptual schema
without having to change the external schemas
and their associated application programs.
• Physical Data Independence:
• The capacity to change the internal schema
without having to change the conceptual schema.
• For example, the internal schema may be
changed when certain file structures are
reorganized or new indexes are created to
improve database performance
10-09-2020
Department of Information Science and Engg
25
Transform Here
Data Independence
(continued)
• When a schema at a lower level is changed, only the mappings
between this schema and higher-level schemas need to be changed
in a DBMS that fully supports data independence.
• The higher-level schemas themselves are unchanged.
• Hence, the application programs need not be
changed since they refer to the external schemas.
10-09-2020
Department of Information Science and Engg
26
Transform Here
DBMS Languages
10-09-2020
Department of Information Science and Engg
27
Transform Here
• Data Definition Language (DDL):
• Used by the DBA and database designers to
specify the conceptual schema of a database.
• In many DBMSs, the DDL is also used to define
internal and external schemas (views).
• In some DBMSs, separate storage definition
language (SDL) and view definition language
(VDL) are used to define internal and external
schemas.
• SDL is typically realized via DBMS commands provided to the DBA and database
designers
10-09-2020
Department of Information Science and Engg
28
Transform Here
• Data Manipulation Language (DML):
• Used to specify database retrievals and updates
• DML commands (data sublanguage) can be
embedded in a general-purpose programming
language (host language), such as COBOL, C,
C++, or Java.
• A library of functions can also be provided to access the DBMS from a programming
language
10-09-2020
Department of Information Science and Engg
29
Transform Here
Types of DML
• High Level or Non-procedural Language:
• For example, the SQL relational language
• Are “set”-oriented and specify what data to
retrieve rather than how to retrieve it.
• Also called declarative languages.
• Low Level or Procedural Language:
• Retrieve data one record-at-a-time;
• Constructs such as looping are needed to retrieve
multiple records, along with positioning pointers.
10-09-2020
Department of Information Science and Engg
30
Transform Here
DBMS Interfaces
10-09-2020
Department of Information Science and Engg
31
Transform Here
DBMS Programming Language Interfaces
• Programmer interfaces for embedding DML in a programming
languages:
• Embedded Approach: e.g embedded SQL (for C,
C++, etc.), SQLJ (for Java)
• Procedure Call Approach: e.g. JDBC for Java,
ODBC for other programming languages
• Database Programming Language Approach: e.g.
ORACLE has PL/SQL, a programming language
based on SQL; language incorporates SQL and its
data types as integral components
10-09-2020
Department of Information Science and Engg
32
Transform Here
User-Friendly DBMS Interfaces
10-09-2020
Department of Information Science and Engg
33
Transform Here
Other DBMS Interfaces
10-09-2020
Department of Information Science and Engg
34
Transform Here
The Database System environment.
10-09-2020
Department of Information Science and Engg
35
Transform Here
Overview of Database Design Process
10-09-2020
Department of Information Science and Engg
36
Transform Here
Example COMPANY Database
10-09-2020
Department of Information Science and Engg
37
Transform Here
• We store each EMPLOYEE’s social security number,
address, salary, sex, and birthdate.
• Each employee works for one department but may work on several projects.
• We keep track of the number of hours per week that an employee currently works on each
project.
• We also keep track of the direct supervisor of each employee.
10-09-2020
Department of Information Science and Engg
38
Transform Here
ER Model Concepts
10-09-2020
Department of Information Science and Engg
39
Transform Here
Types of Attributes (1)
• Simple
• Each entity has a single atomic value for the attribute. For example,
SSN or Sex.
• Composite
• The attribute may be composed of several components. For example:
• Address(Apt#, House#, Street, City, State, ZipCode, Country), or
• Name(FirstName, MiddleName, LastName).
• Composition may form a hierarchy where some components are
themselves composite.
• Multi-valued
• An entity may have multiple values for that attribute. For example,
Color of a CAR or PreviousDegrees of a STUDENT.
• Denoted as {Color} or {PreviousDegrees}.
10-09-2020
Department of Information Science and Engg
40
Transform Here
Types of Attributes (2)
• In general, composite and multi-valued attributes may be nested
arbitrarily to any number of levels, although this is rare.
• For example, PreviousDegrees of a STUDENT is a
composite multi-valued attribute denoted by
{PreviousDegrees (College, Year, Degree, Field)}
• Multiple PreviousDegrees values can exist
• Each has four subcomponent attributes:
• College, Year, Degree, Field
10-09-2020
Department of Information Science and Engg
41
Transform Here
Example of a composite attribute
10-09-2020
Department of Information Science and Engg
42
Transform Here
Entity Types and Key Attributes
10-09-2020
Department of Information Science and Engg
43
Transform Here
Entity Types and Key Attributes (Cont..)
10-09-2020
Department of Information Science and Engg
44
Transform Here
Displaying an Entity type
10-09-2020
Department of Information Science and Engg
45
Transform Here
Entity Type CAR with two keys and a corresponding Entity
Set
10-09-2020
Department of Information Science and Engg
46
Transform Here
Entity Set
10-09-2020
Department of Information Science and Engg
47
Transform Here
Initial Design of Entity Types for the COMPANY
Database Schema
10-09-2020
Department of Information Science and Engg
48
Transform Here
Initial Design of Entity Types:
EMPLOYEE, DEPARTMENT, PROJECT, DEPENDENT
10-09-2020
Department of Information Science and Engg
49
Transform Here
Refining the initial design by introducing relationships
10-09-2020
Department of Information Science and Engg
50
Transform Here
Relationships and Relationship Types
10-09-2020
Department of Information Science and Engg
51
Transform Here
Relationship instances of the WORKS_FOR N:1 relationship between
EMPLOYEE and DEPARTMENT
10-09-2020
Department of Information Science and Engg
52
Transform Here
Relationship instances of the M:N WORKS_ON relationship between
EMPLOYEE and PROJECT
10-09-2020
Department of Information Science and Engg
53
Transform Here
Relationship type vs. relationship set
• Relationship Type:
• Is the schema description of a relationship
• Identifies the relationship name and the
participating entity types
• Also identifies certain relationship constraints
• Relationship Set:
• The current set of relationship instances
represented in the database
• The current state of a relationship type
10-09-2020
Department of Information Science and Engg
54
Transform Here
Relationship type vs. relationship set
10-09-2020
Department of Information Science and Engg
55
Transform Here
Refining the COMPANY database schema
by introducing relationships
10-09-2020
Department of Information Science and Engg
56
Transform Here
ER DIAGRAM – Relationship Types are:
WORKS_FOR, MANAGES, WORKS_ON, CONTROLS, SUPERVISION, DEPENDENTS_OF
10-09-2020
Department of Information Science and Engg
57
Transform Here
Recursive Relationship Type
10-09-2020
Department of Information Science and Engg
58
Transform Here
Weak Entity Types
10-09-2020
Department of Information Science and Engg
59
Transform Here
Constraints on Relationships
10-09-2020
Department of Information Science and Engg
60
Transform Here
Many-to-one (N:1) Relationship
10-09-2020
Department of Information Science and Engg
61
Transform Here
Many-to-many (M:N) Relationship
10-09-2020
Department of Information Science and Engg
62
Transform Here
A Recursive Relationship Supervision`
10-09-2020
Department of Information Science and Engg
63
Transform Here
Recursive Relationship Type is: SUPERVISION
(participation role names are shown)
10-09-2020
Department of Information Science and Engg
64
Transform Here
Notation for Constraints on Relationships
10-09-2020
Department of Information Science and Engg
65
Transform Here
Alternative (min, max) notation for relationship structural
constraints:
10-09-2020
Department of Information Science and Engg
66
Transform Here
The (min,max) notation for relationship constraints
10-09-2020
Department of Information Science and Engg
67
Transform Here
COMPANY ER Schema Diagram using (min, max) notation
10-09-2020
Department of Information Science and Engg
68
Transform Here
Summary of notation for ER diagrams
10-09-2020
Department of Information Science and Engg
69
Transform Here
Conclusions
We have understood the below concepts:
1. Why Data Analytics, What Data Analytics
2. Who is Data Analyst, Data Analyst skill set
3. Statistics and Types
4. Data Cleaning and Data Manipulation
5. Data Visualization
6. Bonus: Machine Learning
7. Analytics Life Cycle
8. Roles of Data Analyst and Salary Ranges
10-09-2020
Department of Information Science and Engg
70
Transform Here
Important Questions asked in VTU Semester End Exam ( SEE)
10-09-2020
Department of Information Science and Engg
71
Transform Here
Application Oriented Questions
10-09-2020
Department of Information Science and Engg
72
Transform Here
GATE Questions
10-09-2020
Department of Information Science and Engg
73
Transform Here
Market Share of different Databases
10-09-2020
Department of Information Science and Engg
74
Transform Here
Contact Details:
Dr.Manjunath T N
Professor and Dean – ER
Department of Information Science and Engg
BMS Institute of Technology and Management
Mobile: +91-9900130748
E-Mail: [email protected] / [email protected]
10-09-2020
Department of Information Science and Engg
75
Transform Here