Course Handout MBBA 6004 - 2018-19 - Sem IV
Course Handout MBBA 6004 - 2018-19 - Sem IV
Galgotias University
Gautam Buddha Nagar, Greater Noida, Uttar Pradesh, India
Title Page
LTPC: 3003
Semester: IV
Program: MBA
Classroom: A-015
Designation: Professor
Open Hours
Wednesday: 15:00-17:00 (2:00 hrs)
8:30- 9:20- 10:10- 11:00- 11:50- Lunch(12:40- 1:30- 2:20- 3:10- 04:00- 4:00-
9:20 10:10 11:00 11:50 12:40 1:30) 2:20 3:10 4:00 04:50 4:50
Sunday I1 E1 F1 TU1 1E ES1 E2 F2 TU4 2E TU16
Monday TU13 F1 E1 TU2 TU3 ES2 F2 E2 TU5 TU6 I2
Tuesday I1 A1 B1 C1 D11 ES3 A2 B2 C2 D21 I2
Wednesday TU14 D12 A1 B1 C1 ES4 D22 A2 B2 C2 TU17
Thursday J1 A1 B1 C1 D13 ES5 A2 B2 C2 D23 J2
Course Content
Version 1.01 3 0 0 3
Co-requisites None
Use of information has become central for the survival and development of the human race. Today we
experience a true deluge of data which record and shape our lives, ranging from large global issues
such as climate change to the smallest local problem such as controlling a thermostat. The critical
screening and processing of Big Data has become a world-wide effort, requiring academic attention
from diverse disciplines. The challenge is to develop theoretical and innovative scientific and
technological solutions to cater to the needs of the industry, the society and the environment. Given
the wide gap between demand and supply of scientists, technologists and key experts in the domain of
Data Analytics today, the course has been initiated to prepare the interested young minds for the
academic analysis of such Big Data and its applications in the society today, from business concerns
to social practices and cultural change.
The course has been designed to impart an in-depth knowledge of Big Data processing using Hadoop
and Spark. The course provides with an in-depth understanding of the Hadoop framework including
HDFS, YARN, and MapReduce. Students will learn to use Pig, Hive to process and analyze large
datasets stored in the HDFS. This course provides an overview of the field of big data analytics so that
you can make informed business decisions in distributed environment.
For you to get the most out of this subject, and for it to be a rewarding and fun learning experience for
all, I expect you to:
i. Attend the class sessions and come prepared – that is, having read the assigned readings.
ii. Have a positive attitude and be willing to engage in non-traditional learning formats.
iv. Challenge the ideas presented in your readings as well as those of the instructor and other
students – demonstrate your ability to think critically and to offer constructive alternatives.
Fulfil the requirements of this subject to the best of your ability. The more time and effort you put
into this subject, the more you’ll get out of it.
Step 0: At the end of the course, the student will be able to:
2 K4 50 20 15 10
3 K4 50 20 20 20
4 K4 50 20 20 30
5 K4 20 30 30
It is to see that efforts are to be taken to achieve the following level of knowledge i.e. K3,K4 through
this course. (K1-Remembering, K2-Understanding, K3-Applying, K4-Analyzing, K5-Evaluating,
K6-Creating)
Course Handout MBBA 6004 Big Data Analytics
CO/PO Mapping
(S/M/W indicates strength of correlation) S-Strong, M-Medium, L-Low
Cos Programme Outcomes(POs)
PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8
CO1 M
CO2 M
CO3 M
CO4 S
CO5 S
Big Data, Big Analytics: Emerging Business Michael Minelli, Michele Chambers, and Ambiga Dhiraj
Intelligence and Analytics
Seema Acharya, Subhashini Chhellappan, Willey
Big Data and Analytics
SUPPLEMENTARY READINGS
Jay Liebowitz, CRC Press
Big Data and Business Analytics
Anil Maheshwari, McGH
Data Analytics
Holden Karau
Online Resources
www.solver.com/xlminer-data-mining
https://rapidminer.com/
https://sourceforge.net/projects/weka
Step 4: Quizzes/Assignment/Project:
Components of evaluation are very crucial pertaining to assessing the learning goals and objectives of
the course. The following components of evaluation have been designed to assess the learning goals
and objectives. CAT-I, CAT-II and Semester End Examination will assess the learning goals 1-5 as
follows
Assignments ( 20 marks) √ √ √ √ √
Assignments
This component of evaluation is to assess the performance of students after the completion of 15/30
lectures. This is to monitor students’ performance continuously and make them aware about their
mistakes and wrong understanding of the concepts.
End Term Examination is to assess students individually by keeping the overall learning goals and
objectives in mind. The questions are mostly contextual, numerical, analytical and situational.
Course Handout MBBA 6004 Big Data Analytics
This module introduces the concept of big data and Big Data Analytics and emphasizing on applications
of big data in industry.
7 Concept of Virtualization
16-17 Installing Hadoop, making Single node/multimode Clusters- Ref Book Ch 1(Vignesh),
Handouts
Fast data analysis is essential while looking at the enormous data. The module provides explains the data
analysis with Spark
25-28 Introduction to Data Analysis with Spark Ref Book Karau Ch1-4
Module V : NoSQL
2. Course Handout
Course Description:
This course provides a comprehensive introduction to the concepts, techniques and applications of
business intelligence (BI). The class will equip students with a managerial overview of business
intelligence, a basic understanding of statistics and economics foundations in BI, a general exposure
to real world BI applications and trends, and hands-on practices of BI software..
Text Books:
Big Data, Big Analytics: Emerging Business Michael Minelli, Michele Chambers, and Ambiga Dhiraj
Intelligence and Analytics
Course Handout MBBA 6004 Big Data Analytics
Online Resources
www.solver.com/xlminer-data-mining
https://rapidminer.com/
https://sourceforge.net/projects/weka
Evaluation Scheme:
Teaching Pedagogy:
Course Handout MBBA 6004 Big Data Analytics
The pedagogy will be a combination of classroom discussions and lab sessions ( Free hours of
students) on Big Data (concepts and solving problems).
Notices: All notices concerning this course will be communicated through email / whats app
Understanding
HiveQL
Understanding
HBase
Understanding Ref Book Karau CO4 K4
Data analytics Ch1-4
project Life Cycle
Introduction to
Feb 14, Feb 21, 10
4 Data Analysis
2019 2019
with Spark
Downloading
Spark and Getting
Started
Understanding Handouts, Ref CO5 K4
NoSQL- Boo Joe Celko
advantages of Ch 1, Text Book
NoSQL Ch 4
SQL vs NoSQL
Feb 28, Feb 28, 04 Use of NoSQL in
5
2019 2019 Industry
Revision and
Project
Presentations
Semester End
term Examination
4. Lecture Material
..\..\..\BIG DATA\bigdata-challenges-opportunities.pdf
..\..\Big Data\(Wiley CIO) Michael Minelli, Michele Chambers, Ambiga Dhiraj-Big Data, Big
Analytics_ Emerging Business Intelligence and Analytic Trends for Today's Businesses-Wiley
(2013).pdf
Suggested Projects
Tweeter Data Management
Anomaly Detection
Stream Mining for Tweets
Course Handout MBBA 6004 Big Data Analytics
Text Mining
Sentiment Analysis