Introduction to Data Science
Week 1
www.swaraadyasolutions.co.in
Agenda
• Defining Data Science
• What Does a Data Science Professional Do?
• Data Science in Business
• Use Cases for Data Science
• Installation of R and R studio
www.swaraadyasolutions.co.in
www.swaraadyasolutions.co.in
Defining Data Science
• Data Science deals with the science and algorithms
related to data.
• Data generated from various sort of sources.
• Report says, “Every day, approximately 2 quintillion bytes
of data is generated. If it grows at this pace, then by the
next 3 years, it is expected that 2MB of data will be
created every second for every individual on this planet.”
• Last 2 years witnessing the creation of 90% of data over
the globe.
www.swaraadyasolutions.co.in
• Data has two sources:
• Structured
• Unstructured
• Structured sources include information that is compatible
with the relational database.
• E.g. ATM transactions, Flight Tickets which enable SQL to
make changes in them.
• Unstructured data is generated from tweets and comments
on social media, audio and video files which the SQL cannot
process.
www.swaraadyasolutions.co.in
Definition
“ Data Science is a broad field which is an assembly of scientific techniques,
methods, processes used to clean the data and then extract some useful
patterns and insights in form of visualizations.”
• Visualizations are crucial to make important business decisions and come up
with strategies that are instrumental for organization’s well-being.
www.swaraadyasolutions.co.in
History
In 1997, when C. F. Jeff at University of Michigan, stated that below concepts
should be studied under phrase Data Science.
• Data Collection
• Data Modeling
• DataAnalysis
www.swaraadyasolutions.co.in
Role of Data Science on Statistics
• Statistics
• Mathematics
• Computer Science
• DataAnalysis
• CriticalThinking
• Problem Solving
• Machine Learning
• DataVisualization
www.swaraadyasolutions.co.in
Data Science??
In 2012, it was titled as the “The sexiest job of the
21st Century” by Harvard Business School.
www.swaraadyasolutions.co.in
www.swaraadyasolutions.co.in
Statistics
• Statistics is the branch of mathematics that deals with data collection,
categorization, interpretation and presentation.
• These techniques helped with the processing and analyzing of the data at a
large scale.
www.swaraadyasolutions.co.in
StatisticsTechniquesTo Deal with Data
• Data Collection
– Collecting relevant data/information
– Primary data includes surveys, observations and experiments.
– Secondary data has internal records and government published data.
• Data Categorization and Classification
– Organized to get some insights
For example, we have data of heights of 10 people
160cm, 165cm, 155cm, 190cm, 177cm, 181cm, 179cm, 185cm, 159cm, 173cm
This data in an ordered array will look like
155cm, 159cm, 160cm ,165cm, 173cm, 177cm, 179cm, 181cm, 185cm, 190cm
The above data tells us that 155cm is the shortest height while 190cm is the tallest.
www.swaraadyasolutions.co.in
StatisticsTechniquesTo Deal with Data
• Data Classification
– Assembly of relevant facts/data into different categories/groups as per features.
– Factors are:
• Geographical
• Chronological (basis of time)
• Qualitative
• Quantitative
• Data Presentation
– Includes frequency distribution using histograms.
– For example, assume you are looking for prospective clients for your new
product which is an electric bike.
www.swaraadyasolutions.co.in
Applications
• Data Science has tons of applications in real-world implementation.
• Recommender Systems
– Content based – keeps track of users watching habits.
– Collaborative based – recognizes users with similar tastes.
• Voice and Image Recognition
• Spam and Fraud Detection
• Many more…….
www.swaraadyasolutions.co.in
Data Scientists andTheir Role
• Data Scientist is a Rockstar!!!
• A Data Scientist is an individual who has the power and freedom to
experiment with tons of different kinds of data.
• Based on knowledge of:
– Mathematics
– Problem solving
– Critical thinking
– Careful analysis
www.swaraadyasolutions.co.in
• For anyone who is willing to carry this “tag” along should be well-versed with a lot
of concepts.
Some of them are
• Mathematics
• Statistics
• Problem-solving
• Data wrangling or data munging
• Coding prowess in both R and Python
• SQL
• Hadoop
• Machine learning and AI
• Data visualization
• Communication skills
www.swaraadyasolutions.co.in
Data Analyst v/s Data Scientist
• Data Analyst has a lot to do with converting the data into a structured
format in order to process it further.
• Focus more on Data Mining and Data Auditing
• Data mining involves retrieving information from large databases with the help of SQL to
extract new data/information.
• Data auditing involves checking the essence of data and trying to figure out if the data is
capable enough for gaining useful insights or not.
www.swaraadyasolutions.co.in
Data Analyst v/s Data Scientist
• Data Scientist take the clean data and trying to gain some meaningful
insights.
• An algorithm either from classification or regression is implemented in
order to create a model and make it sustainable enough to gain some
business insights with the help of visualization tools.
www.swaraadyasolutions.co.in
www.swaraadyasolutions.co.in
Are There Enough Skilled Data Scientists In The Industry?
• According to a survey conducted by IBM, the demand for data
scientists will soar by 28% by 2020.
• That includes all jobs which require machine learning, big data,
visualization likeTableau and PowerBI expertise and knowledge of
data analysis.
• This is divided among the industries looking for such professionals in
finance, insurance, professional services, and IT sectors.
www.swaraadyasolutions.co.in
A candidate who is always thirsty for new challenges and loves problem-solving
of any kind is capable to become a skilled data scientist.
He likes observing and defining a problem from different angles and
perspectives.
Coding is his daily hustle and loves doing it, not because the problem demands
him to do, but he knows how interesting it becomes to come up with new findings
and insights and then make a cute little story out of it!
www.swaraadyasolutions.co.in
Data Science Effects
How Can Data Science Help A Business/CompanyGrow?
• Data Science was breathing in the IT industry for a long time.
• The sudden increase in the amount of data hinted the companies to make it a norm slowly and steadily.
• There are numerous ways in which this emerging discipline can help an organization grow and achieve
new heights
• Business logistics, including supply chain optimization
• Finance
• Health and wellness
• Education and electronic teaching
• Climate and energy
www.swaraadyasolutions.co.in
Popular Data ProcessingTOOLS in Data Science
• Jupyter – open source tool to create and distribute documents
• R Studio – open source tool for R programming.
• SAS – analytics tool.
• Apache Spark – open source shared software specializes in cluster computing.
• Microsoft Excel – spreadsheet.
• SQL – programming language.
• Tableau – data visualization tool used for representing data in terms of charts.
• PowerBI – business intelligence tool developed by Microsoft.
www.swaraadyasolutions.co.in
What does Data Science Professional Do?
www.swaraadyasolutions.co.in
www.swaraadyasolutions.co.in
www.swaraadyasolutions.co.in
Installation of R and R Studio
www.swaraadyasolutions.co.in
Conclusion/Endnotes
• Data Science is turning out to be one of the fastest growing fields in the US and India.
• Today, it has its foot in weather forecasting, sales prediction, fraud and spam detection, pattern recognition, taxi fare
prediction, sentiment analysis, and neural networks.
• The future of data science is going to be dominated byArtificial Intelligence and Automation.
• These two big-heads have the capability of changing the current market scenario into something that data scientists describe
as the “age of revolution”.
• Machines are enriching themselves with new concepts and technology every counting second which is making them smarter
and sharper than humans.
• Looking at the current scenario of the market, data science is slowly and gradually making its
way into businesses and enterprises.
www.swaraadyasolutions.co.in
www.swaraadyasolutions.co.in

More Related Content

PPTX
What Is Data Science? | Introduction to Data Science | Data Science For Begin...
PPTX
Data science
PPTX
Data science
PDF
Data science presentation
PPTX
Introduction to data science
PPTX
Data Science
PPTX
Introduction to Data Science
PDF
Introduction to data science
What Is Data Science? | Introduction to Data Science | Data Science For Begin...
Data science
Data science
Data science presentation
Introduction to data science
Data Science
Introduction to Data Science
Introduction to data science

What's hot (20)

PPTX
Introduction to data science.pptx
PPTX
Introduction of Data Science
PDF
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
PDF
Introduction on Data Science
PDF
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
PPTX
Data science & data scientist
PPTX
Introduction to Data Science
PDF
Data science
PDF
Introduction To Data Science
PDF
Introduction to Data Science
PDF
Introduction to Data Science
PDF
Data Science Tutorial | What is Data Science? | Data Science For Beginners | ...
PDF
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...
PDF
Introduction to Data Science
PDF
Data Science Introduction
PPTX
Introduction to data science club
PPTX
Data science
PPTX
Ppt on data science
PPTX
Data mining , Knowledge Discovery Process, Classification
PPTX
Data Science
Introduction to data science.pptx
Introduction of Data Science
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Introduction on Data Science
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
Data science & data scientist
Introduction to Data Science
Data science
Introduction To Data Science
Introduction to Data Science
Introduction to Data Science
Data Science Tutorial | What is Data Science? | Data Science For Beginners | ...
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...
Introduction to Data Science
Data Science Introduction
Introduction to data science club
Data science
Ppt on data science
Data mining , Knowledge Discovery Process, Classification
Data Science
Ad

Similar to introduction to data science (20)

PPTX
Data science in business Administration Nagarajan.pptx
PPTX
Big Data Courses In Mumbai
PDF
Introduction to Data Science.pdf
PPTX
intro to data science Clustering and visualization of data science subfields ...
PPTX
Ch1IntroductiontoDataScience.pptx
PPTX
BADS-MBA-Unit 1 that what data science and Interpretation
PPTX
The Power of Data Science by DICS INNOVATIVE.pptx
PPTX
Data science Nagarajan and madhav.pptx
PDF
Untitled document.pdf
PPTX
Unit 1-FDS. .pptx
PPTX
Data scientist What is inside it?
PDF
data scientists and their role
PDF
Data Science Job ready #DataScienceInterview Question and Answers 2022 | #Dat...
PPTX
Data science | demand of data science with AI
PPTX
Careers in Data Science _ Navigating the Digital Frontier (1).pptx
PDF
Introduction to Data Science and Data Analysis
PPTX
introductiontodatascience-230122140841-b90a0856 (1).pptx
PDF
Data Science Training and Placement
PPTX
Data Science Roadmap
PPTX
Careers in Data Science and Analytics
Data science in business Administration Nagarajan.pptx
Big Data Courses In Mumbai
Introduction to Data Science.pdf
intro to data science Clustering and visualization of data science subfields ...
Ch1IntroductiontoDataScience.pptx
BADS-MBA-Unit 1 that what data science and Interpretation
The Power of Data Science by DICS INNOVATIVE.pptx
Data science Nagarajan and madhav.pptx
Untitled document.pdf
Unit 1-FDS. .pptx
Data scientist What is inside it?
data scientists and their role
Data Science Job ready #DataScienceInterview Question and Answers 2022 | #Dat...
Data science | demand of data science with AI
Careers in Data Science _ Navigating the Digital Frontier (1).pptx
Introduction to Data Science and Data Analysis
introductiontodatascience-230122140841-b90a0856 (1).pptx
Data Science Training and Placement
Data Science Roadmap
Careers in Data Science and Analytics
Ad

More from bhavesh lande (20)

PDF
The Annual G20 Scorecard – Research Performance 2019
PDF
information control and Security system
PDF
information technology and infrastructures choices
PDF
ethical issues,social issues
PDF
managing inforamation system
PDF
• E-commerce, e-business ,e-governance
PDF
IT and innovations
PDF
organisations and information systems
PDF
IT stratergy and digital goods
PDF
Implement Mapreduce with suitable example using MongoDB.
PDF
aggregation and indexing with suitable example using MongoDB.
PDF
Unnamed PL/SQL code block: Use of Control structure and Exception handling i...
PDF
database application using SQL DML statements: all types of Join, Sub-Query ...
PDF
database application using SQL DML statements: Insert, Select, Update, Delet...
PDF
Design and Develop SQL DDL statements which demonstrate the use of SQL objec...
PDF
working with python
PDF
applications and advantages of python
PDF
introduction of python in data science
PDF
PDF
applications
The Annual G20 Scorecard – Research Performance 2019
information control and Security system
information technology and infrastructures choices
ethical issues,social issues
managing inforamation system
• E-commerce, e-business ,e-governance
IT and innovations
organisations and information systems
IT stratergy and digital goods
Implement Mapreduce with suitable example using MongoDB.
aggregation and indexing with suitable example using MongoDB.
Unnamed PL/SQL code block: Use of Control structure and Exception handling i...
database application using SQL DML statements: all types of Join, Sub-Query ...
database application using SQL DML statements: Insert, Select, Update, Delet...
Design and Develop SQL DDL statements which demonstrate the use of SQL objec...
working with python
applications and advantages of python
introduction of python in data science
applications

Recently uploaded (20)

PDF
book-34714 (2).pdfhjkkljgfdssawtjiiiiiujj
PDF
REPORT CARD OF GRADE 2 2025-2026 MATATAG
PDF
technical specifications solar ear 2025.
PPTX
Introduction to Fundamentals of Data Security
PPTX
Hushh Hackathon for IIT Bombay: Create your very own Agents
PPTX
DATA ANALYTICS COURSE IN PITAMPURA.pptx
PPTX
langchainpptforbeginners_easy_explanation.pptx
PPTX
865628565-Pertemuan-2-chapter-03-NUMERICAL-MEASURES.pptx
PDF
Hikvision-IR-PPT---EN.pdfSADASDASSAAAAAAAAAAAAAAA
PPTX
9 Bioterrorism.pptxnsbhsjdgdhdvkdbebrkndbd
PPTX
ifsm.pptx, institutional food service management
PDF
The Role of Pathology AI in Translational Cancer Research and Education
PPTX
Stats annual compiled ipd opd ot br 2024
PDF
Grey Minimalist Professional Project Presentation (1).pdf
PPT
Classification methods in data analytics.ppt
PPTX
PPT for Diseases (1)-2, types of diseases.pptx
PPTX
Sheep Seg. Marketing Plan_C2 2025 (1).pptx
PPT
dsa Lec-1 Introduction FOR THE STUDENTS OF bscs
PPTX
machinelearningoverview-250809184828-927201d2.pptx
PDF
©️ 02_SKU Automatic SW Robotics for Microsoft PC.pdf
book-34714 (2).pdfhjkkljgfdssawtjiiiiiujj
REPORT CARD OF GRADE 2 2025-2026 MATATAG
technical specifications solar ear 2025.
Introduction to Fundamentals of Data Security
Hushh Hackathon for IIT Bombay: Create your very own Agents
DATA ANALYTICS COURSE IN PITAMPURA.pptx
langchainpptforbeginners_easy_explanation.pptx
865628565-Pertemuan-2-chapter-03-NUMERICAL-MEASURES.pptx
Hikvision-IR-PPT---EN.pdfSADASDASSAAAAAAAAAAAAAAA
9 Bioterrorism.pptxnsbhsjdgdhdvkdbebrkndbd
ifsm.pptx, institutional food service management
The Role of Pathology AI in Translational Cancer Research and Education
Stats annual compiled ipd opd ot br 2024
Grey Minimalist Professional Project Presentation (1).pdf
Classification methods in data analytics.ppt
PPT for Diseases (1)-2, types of diseases.pptx
Sheep Seg. Marketing Plan_C2 2025 (1).pptx
dsa Lec-1 Introduction FOR THE STUDENTS OF bscs
machinelearningoverview-250809184828-927201d2.pptx
©️ 02_SKU Automatic SW Robotics for Microsoft PC.pdf

introduction to data science

  • 1. Introduction to Data Science Week 1 www.swaraadyasolutions.co.in
  • 2. Agenda • Defining Data Science • What Does a Data Science Professional Do? • Data Science in Business • Use Cases for Data Science • Installation of R and R studio www.swaraadyasolutions.co.in
  • 4. Defining Data Science • Data Science deals with the science and algorithms related to data. • Data generated from various sort of sources. • Report says, “Every day, approximately 2 quintillion bytes of data is generated. If it grows at this pace, then by the next 3 years, it is expected that 2MB of data will be created every second for every individual on this planet.” • Last 2 years witnessing the creation of 90% of data over the globe. www.swaraadyasolutions.co.in
  • 5. • Data has two sources: • Structured • Unstructured • Structured sources include information that is compatible with the relational database. • E.g. ATM transactions, Flight Tickets which enable SQL to make changes in them. • Unstructured data is generated from tweets and comments on social media, audio and video files which the SQL cannot process. www.swaraadyasolutions.co.in
  • 6. Definition “ Data Science is a broad field which is an assembly of scientific techniques, methods, processes used to clean the data and then extract some useful patterns and insights in form of visualizations.” • Visualizations are crucial to make important business decisions and come up with strategies that are instrumental for organization’s well-being. www.swaraadyasolutions.co.in
  • 7. History In 1997, when C. F. Jeff at University of Michigan, stated that below concepts should be studied under phrase Data Science. • Data Collection • Data Modeling • DataAnalysis www.swaraadyasolutions.co.in
  • 8. Role of Data Science on Statistics • Statistics • Mathematics • Computer Science • DataAnalysis • CriticalThinking • Problem Solving • Machine Learning • DataVisualization www.swaraadyasolutions.co.in
  • 9. Data Science?? In 2012, it was titled as the “The sexiest job of the 21st Century” by Harvard Business School. www.swaraadyasolutions.co.in
  • 11. Statistics • Statistics is the branch of mathematics that deals with data collection, categorization, interpretation and presentation. • These techniques helped with the processing and analyzing of the data at a large scale. www.swaraadyasolutions.co.in
  • 12. StatisticsTechniquesTo Deal with Data • Data Collection – Collecting relevant data/information – Primary data includes surveys, observations and experiments. – Secondary data has internal records and government published data. • Data Categorization and Classification – Organized to get some insights For example, we have data of heights of 10 people 160cm, 165cm, 155cm, 190cm, 177cm, 181cm, 179cm, 185cm, 159cm, 173cm This data in an ordered array will look like 155cm, 159cm, 160cm ,165cm, 173cm, 177cm, 179cm, 181cm, 185cm, 190cm The above data tells us that 155cm is the shortest height while 190cm is the tallest. www.swaraadyasolutions.co.in
  • 13. StatisticsTechniquesTo Deal with Data • Data Classification – Assembly of relevant facts/data into different categories/groups as per features. – Factors are: • Geographical • Chronological (basis of time) • Qualitative • Quantitative • Data Presentation – Includes frequency distribution using histograms. – For example, assume you are looking for prospective clients for your new product which is an electric bike. www.swaraadyasolutions.co.in
  • 14. Applications • Data Science has tons of applications in real-world implementation. • Recommender Systems – Content based – keeps track of users watching habits. – Collaborative based – recognizes users with similar tastes. • Voice and Image Recognition • Spam and Fraud Detection • Many more……. www.swaraadyasolutions.co.in
  • 15. Data Scientists andTheir Role • Data Scientist is a Rockstar!!! • A Data Scientist is an individual who has the power and freedom to experiment with tons of different kinds of data. • Based on knowledge of: – Mathematics – Problem solving – Critical thinking – Careful analysis www.swaraadyasolutions.co.in
  • 16. • For anyone who is willing to carry this “tag” along should be well-versed with a lot of concepts. Some of them are • Mathematics • Statistics • Problem-solving • Data wrangling or data munging • Coding prowess in both R and Python • SQL • Hadoop • Machine learning and AI • Data visualization • Communication skills www.swaraadyasolutions.co.in
  • 17. Data Analyst v/s Data Scientist • Data Analyst has a lot to do with converting the data into a structured format in order to process it further. • Focus more on Data Mining and Data Auditing • Data mining involves retrieving information from large databases with the help of SQL to extract new data/information. • Data auditing involves checking the essence of data and trying to figure out if the data is capable enough for gaining useful insights or not. www.swaraadyasolutions.co.in
  • 18. Data Analyst v/s Data Scientist • Data Scientist take the clean data and trying to gain some meaningful insights. • An algorithm either from classification or regression is implemented in order to create a model and make it sustainable enough to gain some business insights with the help of visualization tools. www.swaraadyasolutions.co.in
  • 20. Are There Enough Skilled Data Scientists In The Industry? • According to a survey conducted by IBM, the demand for data scientists will soar by 28% by 2020. • That includes all jobs which require machine learning, big data, visualization likeTableau and PowerBI expertise and knowledge of data analysis. • This is divided among the industries looking for such professionals in finance, insurance, professional services, and IT sectors. www.swaraadyasolutions.co.in
  • 21. A candidate who is always thirsty for new challenges and loves problem-solving of any kind is capable to become a skilled data scientist. He likes observing and defining a problem from different angles and perspectives. Coding is his daily hustle and loves doing it, not because the problem demands him to do, but he knows how interesting it becomes to come up with new findings and insights and then make a cute little story out of it! www.swaraadyasolutions.co.in
  • 22. Data Science Effects How Can Data Science Help A Business/CompanyGrow? • Data Science was breathing in the IT industry for a long time. • The sudden increase in the amount of data hinted the companies to make it a norm slowly and steadily. • There are numerous ways in which this emerging discipline can help an organization grow and achieve new heights • Business logistics, including supply chain optimization • Finance • Health and wellness • Education and electronic teaching • Climate and energy www.swaraadyasolutions.co.in
  • 23. Popular Data ProcessingTOOLS in Data Science • Jupyter – open source tool to create and distribute documents • R Studio – open source tool for R programming. • SAS – analytics tool. • Apache Spark – open source shared software specializes in cluster computing. • Microsoft Excel – spreadsheet. • SQL – programming language. • Tableau – data visualization tool used for representing data in terms of charts. • PowerBI – business intelligence tool developed by Microsoft. www.swaraadyasolutions.co.in
  • 24. What does Data Science Professional Do? www.swaraadyasolutions.co.in
  • 27. Installation of R and R Studio www.swaraadyasolutions.co.in
  • 28. Conclusion/Endnotes • Data Science is turning out to be one of the fastest growing fields in the US and India. • Today, it has its foot in weather forecasting, sales prediction, fraud and spam detection, pattern recognition, taxi fare prediction, sentiment analysis, and neural networks. • The future of data science is going to be dominated byArtificial Intelligence and Automation. • These two big-heads have the capability of changing the current market scenario into something that data scientists describe as the “age of revolution”. • Machines are enriching themselves with new concepts and technology every counting second which is making them smarter and sharper than humans. • Looking at the current scenario of the market, data science is slowly and gradually making its way into businesses and enterprises. www.swaraadyasolutions.co.in