Data Science Masters Program Brochure
Data Science Masters Program Brochure
edureka!
Discover Learning
About Edureka
Edureka is one of the world’s largest and most effective online education platform for
technology professionals. In a span of 10 years, 100,000+ students from over 176 countries
have upskilled themselves with the help of our online courses. Since our inception, we have
been dedicated to helping technology professionals from all corners of the world learn
Programming, Data Science, Big Data, Cloud Computing, DevOps, Business Analytic, Java &
Mobile Technologies, Software Testing, Web Development, System Engineering, Project
Management, Digital Marketing, Business Intelligence, Cybersecurity, RPA and more.
We have an easy and affordable learning solution that is accessible to millions of learners. With
our learners spread across countries like the US, India, UK, Canada, Singapore, Australia, Middle
East, Brazil, and many others, we have built a community of over 1 million learners across the
globe.
Index
1 Python Statistics for Data Science Course
2 Data Science with Python Certification Course
3 PySpark Certification Training Course
4 Advanced Artificial Intelligence Course
5 Tableau Certification Training Course
*Depending on industry requirements, Edureka may make changes to the course curriculum
edureka!
Discover Learning
Python
Index Statistics for Data Science
Course (Self-paced)
Course Curriculum
Course Outline
Topics:
Topics:
• Uses of probability
• Need of probability
• Bayesian Inference
• Density Concepts
• Normal Distribution Curve
Topics:
• Point Estimation
• Confidence Margin
• Hypothesis Testing
• Levels of Hypothesis Testing
Topics:
Topics:
• Parametric Test
• Parametric Test Types
• Non- Parametric Test
• Experimental Designing
• A/B testing
Topics:
edureka!
Discover Learning
Course Outline
Topics:
• Overview of Python
• The Companies using Python.
• Different Applications where Python is Used
• Discuss Python Scripts on UNIX/Windows
• Values, Types, Variables
• Operands and Expressions
• Conditional Statements
• Loops
• Command Line Arguments
• Writing to the Screen
Topics:
Topics:
• Functions
• Function Parameters
• Global Variables
• Variable Scope and Returning Values
• Lambda Functions
• Object Oriented Concepts
• Standard Libraries
• Modules Used in Python
• The Import Statements
• Module Search Path
• Package Installation Ways
• Errors and Exception Handling
• Handling Multiple Exceptions
Topics:
• Data Analysis
• NumPy - arrays
• Operations on arrays
• Indexing slicing and iterating
Topics:
Topics:
Topics:
Topics:
• Introduction to Dimensionality
• Why Dimensionality Reduction
• PCA
• Factor Analysis
• Scaling dimensional model
• LDA
Topics:
Topics:
Topics:
Topics:
Topics:
Topics:
• Adaptive Boosting
Topics:
Topics:
• Data Visualization
• Business Intelligence tools
• VizQL Technology
• Connect to data from File
• Connect to data from Database
• Basic Charts
• Chart Operations
• Combining Data
• Calculations
Topics:
• Trend lines
• Reference lines
• Forecasting
• Clustering
• Geographic Maps
• Using charts effectively
• Dashboards
• Story Points
• Visual best practices
• Publish to Tableau Online
edureka!
Discover Learning
Course Curriculum
Course Outline
Topics:
• Spark at eBay
• Spark’s Place in Hadoop Ecosystem
Topics:
• Overview of Python
• Different Applications where Python is Used
• Values, Types, Variables
• Operands and Expressions
• Conditional Statements
• Loops
• Command Line Arguments
• Writing to the Screen
• Python files I/O Functions
• Numbers
• Strings and related operations
• Tuples and related operations
• Lists and related operations
• Dictionaries and related operations
• Sets and related operations
Topics:
• Spark Web UI
Topics:
• RDD Lineage
• RDD Persistence
Topics:
• Schema RDDs
• Spark-Hive Integration
Topics:
• Introduction to MLlib
Topics:
• Supervised Learning: Linear Regression, Logistic Regression, Decision Tree, Random
Forest
• Unsupervised Learning: K-Means Clustering & How It Works with MLlib
• Analysis of US Election Data using MLlib (K-Means)
Topics:
• Need for Kafka
• What is Kafka
• Core Concepts of Kafka
• Kafka Architecture
• Where is Kafka Used
• Understanding the Components of Kafka Cluster
• Configuring Kafka Cluster
• Kafka Producer and Consumer Java API
• Need of Apache Flume
• What is Apache Flume
• Basic Flume Architecture
• Flume Sources
• Flume Sinks
• Flume Channels
• Flume Configuration
• Integrating Apache Flume and Apache Kafka
Topics:
• Drawbacks in Existing Computing Methods
• Why Streaming is Necessary
• What is Spark Streaming
• Spark Streaming Features
• Spark Streaming Workflow
• How Uber Uses Streaming Data
• Streaming Context & DStreams
• Transformations on DStreams
• Describe Windowed Operators and Why it is Useful
• Important Windowed Operators
• Slice, Window and ReduceByWindow Operators
• Stateful Operators
Topics:
• Apache Spark Streaming: Data Sources
• Streaming Data Source Overview
• Apache Flume and Apache Kafka Data Sources
• Example: Using a Kafka Direct Data Source
Topics:
• Introduction to Spark GraphX
• Information about a Graph
• GraphX Basic APIs and Operations
edureka!
Discover Learning
Course Curriculum
Course Outline
Topics:
Topics:
• Tokenization
• Frequency Distribution
• Different Types of Tokenizers
• Bigrams, Trigrams & Ngrams
• Stemming
• Lemmatization
• Stopwords
• POS Tagging
• Named Entity Recognition
Topics:
• Syntax Trees
• Chunking
• Chinking
• Context Free Grammars (CFG)
• Automating Text Paraphrasing
• Bag of Words
• Count Vectorizer
Topics:
Topics:
Topics:
• Regional-CNN
• Selective Search Algorithm
• Bounding Box Regression
• SVM in RCNN
• Pre-trained Model
• Model Accuracy
• Model Inference Time
• Model Size Comparison
• Transfer Learning
• Object Detection – Evaluation
• mAP
• IoU
• RCNN – Speed Bottleneck
• Fast R-CNN
• RoI Pooling
• Fast R-CNN – Speed Bottleneck
• Faster R-CNN
• Feature Pyramid Network (FPN)
• Regional Proposal Network (RPN)
• Mask R-CNN
Topics:
Topics:
Topics:
Topics:
• Components of GRU
• Update gate
• Reset gate
• Current memory content
• Final memory at current time step
Topics:
• What is LSTM?
• Structure of LSTM
• Forget Gate
• Input Gate
• Output Gate
• LSTM architecture
• Types of Sequence-Based Model
• Sequence Prediction
• Sequence Classification
• Sequence Generation
• Types of LSTM
• Vanilla LSTM
• Stacked LSTM
• CNN LSTM
• Bidirectional LSTM
• How to increase the efficiency of the model?
• Backpropagation through time
• Workflow of BPTT
Topics:
Module 15: Developing a Criminal Identification and Detection Application Using OpenCV
(Self-paced)
Topics:
Topics:
Topics:
Topics:
edureka !
Discover Learning
Course Outline
• Data Visualization
• Introduction to Tableau
• Tableau Architecture
• VizQL
• Types of Connections
• Data Blending
• Visual Analytics
• Hierarchies
• Data Granularity
• Highlighting
• Sorting
• Filtering
• Grouping
• Sets
• Types of Calculations
• Table Calculations
• Parameters
• Tool tips
• Trend lines
• Reference lines
• Forecasting
• Clustering
• Types of Maps
• Spatial Files
• Custom Geocoding
• Polygon Maps
• Background Images
• Bullet Chart
• Gantt Chart
• Waterfall Chart
• Pareto Chart
• Control Chart
• Funnel Chart
• Bump Chart
• Word Cloud
• Donut Chart
• Introduction to Dashboards
• Dashboard Objects
• Building a Dashboard
• Story Points
• Format Style
• Understand Scheduling