TM351 Module Specifications
TM351 Module Specifications
1. 1. Factual information
2. Rationale for the module and its links with other modules
Data management and analysis (TM351) – an overview of the concepts, techniques, and tools
of modern data management and analysis. The requirements of data management continually
evolve. Recently those requirements have surpassed the capabilities of traditional data
management. So, in order to better prepare our graduates for the new demands of the job
market, it is necessary to address both the traditional concepts of data management as well as
the increasingly important area of data analytics.
Page 1 of 6
3. Aims of the module
The module will be driven by Richard Hamming's famous quote: The purpose of computing is
insight, not numbers.
Some of the key ideas are:
Introducing data analysis. Starting with a text based data file such as comma separated
variable (CSV) document, this unit will provide a brief introduction to some basic
operations on simple data files. This will give an opportunity to provide an outline of the
key ideas in the module, to ensure that the students have installed the module software
correctly, and to begin to familiarise themselves with that software.
Concepts in data management. The module will look at three key areas in data
management: data architectures and data access (CRUD), data integrity, and
transaction management (ACID). Each of these will be illustrated using a relational
database, and one non-relational alternative, and the advantages and limitations of each
model discussed.
Legal and ethical issues. The module will consider the legal and ethical issues involved
in managing data collections. Students will be required to obtain and read (parts of) the
Data Protection Act and the Freedom of Information Act, and demonstrate how these
apply to issues in data management. They will also consider privacy, ownership,
intellectual property and licensing issues in data collection, management, retrieval and
reuse.
Concepts in data analytics. These sections will focus on using data to answer a real
question; the focus will be on exploratory techniques (such as visualisation) and
formulating a question into a form which can realistically be answered using the data
that is available. Issues in processing techniques for large and real-time streamed data
collections will also be addressed along with techniques and technologies (such as
mapreduce) for handling them. This part will use a statistical package such as the
python scientific libraries and/or ggplot to visualise the data and carry out appropriate
analyses. It is not anticipated that students will need to understand statistical methods in
depth.
Page 2 of 6
5. Intended learning outcomes
A. Knowledge and understanding Learning and teaching strategy
Upon completing this course, students will be able to: Lectures
A1.Discuss and describe the similarities and differences between at face-to-face teaching
least two different database models, and how they are used to Take-home assignments
manage data collections. Final project
Page 3 of 6
C. Practical and professional skills Learning and teaching strategy
C2.Use a statistical package to explore a data set Final project
6. Indicative content.
Introducing data management and analysis
Acquiring and representing data
Data preparation
Data analysis
Presentation: telling the story
With data comes responsibility
When spreadsheets fail
Databases
Relational databases I
Normalisation
Relational databases II
Data integrity
Page 4 of 6
6. Indicative content.
Alternatives to relational databases
Document databases
Complex queries and analysis
Scaling out: replication and sharding
Distributed transactions
Sample data investigation and report
Data warehousing
Data mining I: classification tasks
Data mining II: clustering tasks
Data mining III: semi-supervised techniques for investigating a data warehouse
With data comes responsibility II: keeping data safe
Linked Data on the Semantic Web
Querying and manipulating linked data
Applications of Semantic Web data
End-of-module project: data investigation and report
Finish and submit project
Page 5 of 6
8. Mapping of assessment tasks to learning outcomes
Learning Outcomes
Assessment tasks A1 A2 A3 B1 B2 B3 C1 C2 C3 D1 D2 D3
Name and contact details
TBA
Page 6 of 6