0% found this document useful (0 votes)
14 views

Silabus CDSS

Uploaded by

akhyarul.rijal.b
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
14 views

Silabus CDSS

Uploaded by

akhyarul.rijal.b
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

Certified Data Science Specialist

Duration: 5 Days Instructor-led Course


ITCDSS

Course Overview Prerequisites


Our lives are flooded by large amounts of information, but not all of them are
All participants should have basic understanding of data, relations, and basic
useful data. Therefore it is essential for us to learn how to apply data science to
knowledge ofmathematics.
every aspect of our daily life from personal finances, reading and lifestyle habits,
to making informed business decisions. In this course you will learn how to
leverage on data to ease life, or unlock new economic value for a business. Who Should Attend
This workshop is intended for individuals who are interested in learning data
This course is a hands-on guided course for you to learn the concepts, tools,
science, or who want to begin their career as a data scientist.
and techniques that you need to begin learning data science. We will cover
the key topics from data science to big data, and the processes of gathering,
cleaning and handling data. This course has a good balance of theory and
practical applications, and key concepts are taught using case study references. Exam Format
The CDSS Certification Exam duration is 2 hours, consisting of 50 Multiple
Upon completion, participants will be able to perform basic data handling tasks, Choice Questions, with a Passing Score of 70%. You will receive a professional
collect and analyze data, and present them using industrystandard tools. CDSS Certification upon passing the exam.

Learning Outcomes
Upon completion of this course, you will be able to:
Identify the appropriate model for different data types Differentiate key data ETL process, from cleaning, processing to visualization.
Create your own data process and analysis workflow Implement algorithms to extract information from dataset.
Define and explain the key concepts and models relevant to data science. Apply best practices in data science, and become familiar with standard tools.
Course Outline
Day 1 Day 2
Introduction to Data Science Data Science Workflow Data Science Prerequisites Beginning Databases

• What is Data? • Data Gathering • Probability and Statistics • Types of Databases


• Types of Data • Data Preparation & Cleansing • Linear Algebra • Relational Databases
• What is Data Science? • Data Analysis - Descriptive, • Calculus • NoSQL
• Knowledge Check Predictive, and Prescriptive • Combinatorics • Hybrid database
• Lab Activity • Data Visualization and Model • Knowledge Check Lab activity
Deployment
• Knowledge Check

Life of a data scientist Data Gathering Structured Query Language (SQL) Introduction to Python

• What is a Data Scientist? • Obtain data from online • Performing CRUD • Basics of Python language
• Data Scientist Roles repositories (Create, Retrieve,Update, Delete) • Functions and packages
• What does a Data Scientist Look Like? • Import data from local • Designing a Real world database • Python lists
• T-Shaped Skillset file formats (json, xml) • Normalizing a table • Functional programming
• Data Scientist Roadmap • Import data using Web API • Knowledge Check Lab Activity in Python
• Data Scientist Education Framework • Scrape website for data • Numpy and Scipy
• Thinking like a Data Scientist • Knowledge check • iPython
• Knowns and Unknowns • Knowledge check
• Demand and Opportunity • Lab Activity
• Labor Market • Lab: Exploring data using
• Applications of Data Science Python
• Data Science Principles
• Data-Driven Organization
• Developing Data Products
• Knowledge Check
Day 3 Day 5

Data Preparation and Cleansing Introduction to R Data Visualization Big Data Landscape

• Extract, Transform and Load (ETL) • Packages for data import, • Choosing the right visualization • What is small data?
- Pentaho, Talend, etc wrangling, and visualization • Plotting data using Python libraries • What is big data?
• Data Cleansing with OpenRefine • Conditionals and Control Flow • Plotting data using R • Big data analytics vs Data Science
• Aggregation, Filtering, Sorting, Joining • Loops and Functions • Using Jupyter Notebook • Key elements in Big Data (3Vs)
• Knowledge Check Lab Activity • Knowledge check to validate scripts • Extracting values from big data
• Lab activity • Knowledge check • Challenges in Big data
• Lab: Exploring data using R • Lab activity

Exploratory Data Analysis (Descriptive) Data Quality Data Analysis Presentation Big data Tools and Applications
• What is EDA? • Raw vs Tidy Data • Using Markdown language • Introducing Hadoop Ecosystem
• Goals of EDA • Key Features of Data Quality • Convert your data into slides • Cloudera vs Hortonworks
• The role of graphics • Maintenance of Data Quality • Data presentation techniques • Real world big data applications
• Handling outliers • Data Profiling • The pitfall of data analysis • Knowledge check
• Dimension reduction • Data Completeness and • Knowledge check • Group discussion
Consistency • Lab activity
• Group presentation Lab:
Day 4 Mini Project

Machine Learning (Predictive) Introduction to Text Mining What’s Next?

• Bayes Theorem • What is Text Mining? • Preview of Data Science Specialist


• Information Theory • Natural Language Processing • Showing advanced data analysis techniques
• NLP • Pre-processing text data • Demo: Interactive visualizations
• Statistical Algorithms • Extracting features from
• Stochastic Algorithms documents
• Using BeautifulSoup
• Measuring document similarity
• Knowledge check Lab activity

Supervised, Unsupervised, and Semi-supervised Learning

• What is prediction? • Constructing a decision tree


• Sampling, training set, testing set. • Knowledge check Lab Activity

You might also like