Fundamentals of Data Engineering by Joe Reis and Matt Housley 84

Uploaded by

alt.b7-az1xd48

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views6 pages

Fundamentals of Data Engineering by Joe Reis and Matt Housley 84

Uploaded by

alt.b7-az1xd48

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Chapter 10.

Security and
Privacy

Security is vital to the practice of data engineering. This should be

blindingly obvious, but we’re constantly amazed at how often data
engineers view security as an afterthought. We believe that security is the
first thing a data engineer needs to think about in every aspect of their job
and every stage of the data engineering lifecycle. You deal with sensitive
data, information, and access daily. Your organization, customers, and
business partners expect these valuable assets to be handled with the utmost
care and concern. One security breach or a data leak can leave your
business dead in the water; your career and reputation are ruined if it’s your
fault.
Security is a key ingredient for privacy. Privacy has long been critical to
trust in the corporate information technology space; engineers directly or
indirectly handle data related to people’s private lives. This includes
financial information, data on private communications (emails, texts, phone
calls), medical history, educational records, and job history. A company that
leaked this information or misused it could find itself a pariah when the
breach came to light.
Increasingly, privacy is a matter of significant legal importance. For
example, the Family Educational Rights and Privacy Act (FERPA) went
into effect in the US in the 1970s; the Health Insurance Portability and
Accountability Act (HIPAA) followed in the 1990s; GDPR was passed in
Europe in the mid-2010s. Several US-based privacy bills have passed or
will soon. This is just a tiny sampling of privacy-related statutes (and we
believe just the beginning). Still, the penalties for violation of any of these
laws can be significant, even devastating, to a business. And because data
systems are woven into the fabric of education, health care, and business,
data engineers handle sensitive data related to each of these laws.
A data engineer’s exact security and privacy responsibilities will vary
significantly between organizations. At a small startup, a data engineer may
do double duty as a data security engineer. A large tech company will have
armies of security engineers and security researchers. Even in this situation,
data engineers will often be able to identify security practices and
technology vulnerabilities within their own teams and systems that they can
report and mitigate in collaboration with dedicated security personnel.
Because security and privacy are critical to data engineering (security being
an undercurrent), we want to spend some more time covering security and
privacy. In this chapter, we lay out some things data engineers should
consider around security, particularly in people, processes, and technology
(in that order). This isn’t a complete list, but lays out the major things we’d
wish would improve based on our experience.

People
The weakest link in security and privacy is you. Security is often
compromised at the human level, so conduct yourself as if you’re always a
target. A bot or human actor is trying to infiltrate your sensitive credentials
and information at any given time. This is our reality, and it’s not going
away. Take a defensive posture with everything you do online and offline.
Exercise the power of negative thinking and always be paranoid.

The Power of Negative Thinking

In a world obsessed with positive thinking, negative thinking is distasteful.
However, American surgeon Atul Gawande wrote a 2007 op-ed in the New
York Times on precisely this subject. His central thesis is that positive
thinking can blind us to the possibility of terrorist attacks or medical
emergencies and deter preparation. Negative thinking allows us to consider
disastrous scenarios and act to prevent them.
Data engineers should actively think through the scenarios for data
utilization and collect sensitive data only if there is an actual need
downstream. The best way to protect private and sensitive data is to avoid
ingesting this data in the first place.
Data engineers should think about the attack and leak scenarios with any
data pipeline or storage system they utilize. When deciding on security
strategies, ensure that your approach delivers proper security and not just
the illusion of safety.

Always Be Paranoid
Always exercise caution when someone asks you for your credentials.
When in doubt—and you should always be in extreme doubt when asked
for credentials—hold off and get second opinions from your coworkers and
friends. Confirm with other people that the request is indeed legitimate. A
quick chat or phone call is cheaper than a ransomware attack triggered
through an email click. Trust nobody at face value when asked for
credentials, sensitive data, or confidential information, including from your
coworkers.
You are also the first line of defense in respecting privacy and ethics. Are
you uncomfortable with sensitive data you’ve been tasked to collect? Do
you have ethical questions about the way data is being handled in a project?
Raise your concerns with colleagues and leadership. Ensure that your work
is both legally compliant and ethical.

Processes
When people follow regular security processes, security becomes part of the
job. Make security a habit, regularly practice real security, exercise the
principle of least privilege, and understand the shared responsibility model
in the cloud.

Security Theater Versus Security Habit

With our corporate clients, we see a pervasive focus on compliance (with
internal rules, laws, recommendations from standards bodies), but not
enough attention to potentially bad scenarios. Unfortunately, this creates an
illusion of security but often leaves gaping holes that would be evident with
a few minutes of reflection.
Security needs to be simple and effective enough to become habitual
throughout an organization. We’re amazed at the number of companies with
security policies in the hundreds of pages that nobody reads, the annual
security policy review that people immediately forget, all in checking a box
for a security audit. This is security theater, where security is done in the
letter of compliance (SOC-2, ISO 27001, and related) without real
commitment.
Instead, pursue the spirit of genuine and habitual security; bake a security
mindset into your culture. Security doesn’t need to be complicated. For
example, at our company, we run security training and policy review at least
once a month to ingrain this into our team’s DNA and update each other on
security practices we can improve. Security must not be an afterthought for
your data team. Everyone is responsible and has a role to play. It must be
the priority for you and everyone else you work with.

Active Security
Returning to the idea of negative thinking, active security entails thinking
about and researching security threats in a dynamic and changing world.
Rather than simply deploying scheduled simulated phishing attacks, you
can take an active security posture by researching successful phishing
attacks and thinking through your organizational security vulnerabilities.
Rather than simply adopting a standard compliance checklist, you can think
about internal vulnerabilities specific to your organization and incentives
employees might have to leak or misuse private information.
We have more to say about active security in “Technology”.

The Principle of Least Privilege

The principle of least privilege means that a person or system should be
given only the privileges and data they need to complete the task at hand
and nothing more. Often, we see an antipattern in the cloud: a regular user
is given administrative access to everything, when that person may need
just a handful of IAM roles to do their work. Giving someone carte blanche
administrative access is a huge mistake and should never happen under the
principle of least privilege.
Instead, provide the user (or group they belong to) the IAM roles they need
when they need them. When these roles are no longer needed, take them
away. The same rule applies to service accounts. Treat humans and
machines the same way: give them only the privileges and data they need to
do their jobs, and only for the timespan when needed.
Of course, the principle of least privilege is also critical to privacy. Your
users and customers expect that people will look at their sensitive data only
when necessary. Make sure that this is the case. Implement column, row,
and cell-level access controls around sensitive data; consider masking PII
and other sensitive data and create views that contain only the information
the viewer needs to access. Some data must be retained, but should be
accessed only in an emergency. Put this data behind a broken glass process:
users can access it only after going through an emergency approval process
to fix a problem, query critical historical information, etc. Access is
revoked immediately once the work is done.

Shared Responsibility in the Cloud

Security is a shared responsibility in the cloud. The cloud vendor is
responsible for ensuring the physical security of its data center and
hardware. At the same time, you are responsible for the security of the
applications and systems you build and maintain in the cloud. Most cloud
security breaches continue to be caused by end users, not the cloud.
Breaches occur because of unintended misconfigurations, mistakes,
oversights, and sloppiness.

Always Back Up Your Data

Data disappears. Sometimes it’s a dead hard drive or server; in other cases,
someone might accidentally delete a database or an object storage bucket. A
bad actor can also lock away data. Ransomware attacks are widespread
these days. Some insurance companies are reducing payouts in the event of
an attack, leaving you on the hook both to recover your data and pay the
bad actor who’s holding it hostage. You need to back up your data regularly,
both for disaster recovery and continuity of business operations, if a version
of your data is compromised in a ransomware attack. Additionally, test the
restoration of your data backups on a regular basis.
Data backup doesn’t strictly fit under security and privacy practices; it goes
under the larger heading of disaster prevention, but it’s adjacent to security,
especially in the era of ransomware attacks.

An Example Security Policy

This section presents a sample security policy regarding credentials,
devices, and sensitive information. Notice that we don’t overcomplicate
things; instead, we give people a short list of practical actions they can take
immediately.

Fundamentals of Data Engineering by Joe Reis and Matt Housley 85
No ratings yet
Fundamentals of Data Engineering by Joe Reis and Matt Housley 85
6 pages
User Domain Policies
No ratings yet
User Domain Policies
9 pages
Need For Information Security
No ratings yet
Need For Information Security
44 pages
Cyber - Security - Practices
No ratings yet
Cyber - Security - Practices
27 pages
Data Security PPT - Version 1
No ratings yet
Data Security PPT - Version 1
22 pages
Securing Our Cyber Realm Deck
No ratings yet
Securing Our Cyber Realm Deck
32 pages
Remote Work Cybersecurity
No ratings yet
Remote Work Cybersecurity
27 pages
Ethical Hacking Assignment
No ratings yet
Ethical Hacking Assignment
9 pages
Organizational Need For Information Security
No ratings yet
Organizational Need For Information Security
3 pages
Lecture Note On Data Security Threats
No ratings yet
Lecture Note On Data Security Threats
19 pages
The Little Book of Network Security and
No ratings yet
The Little Book of Network Security and
52 pages
Cyber Security and Privacy: Balancing Convenience and Security in The Era of Big Data
No ratings yet
Cyber Security and Privacy: Balancing Convenience and Security in The Era of Big Data
10 pages
What Is Information Security
No ratings yet
What Is Information Security
12 pages
Application Security and Data Protection
No ratings yet
Application Security and Data Protection
5 pages
Cyber Security and Data Privacy Awarness Training
No ratings yet
Cyber Security and Data Privacy Awarness Training
14 pages
DBM Tutorial 5
No ratings yet
DBM Tutorial 5
6 pages
Unit 1a
No ratings yet
Unit 1a
34 pages
Unit3 Cloud Computing
No ratings yet
Unit3 Cloud Computing
17 pages
Week-09-10-11-12 Fundamentals of Cybersecurity
No ratings yet
Week-09-10-11-12 Fundamentals of Cybersecurity
67 pages
Unit 7
No ratings yet
Unit 7
11 pages
What Is Information Security
No ratings yet
What Is Information Security
36 pages
Chapter 4 Security Privacy
No ratings yet
Chapter 4 Security Privacy
15 pages
Securing Windows & Linux Systems
No ratings yet
Securing Windows & Linux Systems
21 pages
Sec
No ratings yet
Sec
27 pages
It Security Practical Guide
No ratings yet
It Security Practical Guide
20 pages
Subtitle
No ratings yet
Subtitle
3 pages
Saved You A Seat Ep 28 - Business Continuity
No ratings yet
Saved You A Seat Ep 28 - Business Continuity
18 pages
Chapter 2: The Cybersecurity Cube
No ratings yet
Chapter 2: The Cybersecurity Cube
25 pages
Cybersecurity Strategies for IT Pros
No ratings yet
Cybersecurity Strategies for IT Pros
20 pages
Cyber Security Notres Module 1
No ratings yet
Cyber Security Notres Module 1
24 pages
Secure Your IT Environment Guide
No ratings yet
Secure Your IT Environment Guide
30 pages
The Need For Information Security
No ratings yet
The Need For Information Security
34 pages
Weakness To Infosec
No ratings yet
Weakness To Infosec
32 pages
1.how Do You See Information Security Within A Business or Organization? Explain by Citing Examples
No ratings yet
1.how Do You See Information Security Within A Business or Organization? Explain by Citing Examples
77 pages
Safety & Security in Tech Age
No ratings yet
Safety & Security in Tech Age
8 pages
NPDIVGoc 9 B
No ratings yet
NPDIVGoc 9 B
8 pages
Lecture 1 - Security Concepts
100% (3)
Lecture 1 - Security Concepts
55 pages
Core Principles
No ratings yet
Core Principles
17 pages
DBST Unit I
No ratings yet
DBST Unit I
8 pages
(Document Title) Notice: Version No. Date Type of Changes Owner/Author Date of Review / Expiry
No ratings yet
(Document Title) Notice: Version No. Date Type of Changes Owner/Author Date of Review / Expiry
18 pages
CH 13 - InfoSys Security
No ratings yet
CH 13 - InfoSys Security
30 pages
Data Security Issues, Threats, CIA Triad
No ratings yet
Data Security Issues, Threats, CIA Triad
10 pages
Secure Architecture Design Methodologies
No ratings yet
Secure Architecture Design Methodologies
22 pages
Data Security and Contro1
No ratings yet
Data Security and Contro1
4 pages
Ebook Cybersecurity Tips For Employees
No ratings yet
Ebook Cybersecurity Tips For Employees
12 pages
The-Gorilla-Guide-To-Enterprise-Security-Fundamentals 2
No ratings yet
The-Gorilla-Guide-To-Enterprise-Security-Fundamentals 2
54 pages
Security Policy Unit 4
No ratings yet
Security Policy Unit 4
12 pages
CH 11
No ratings yet
CH 11
36 pages
Vhuiplnnv 88
No ratings yet
Vhuiplnnv 88
3 pages
Ids
No ratings yet
Ids
14 pages
060 Security Management
No ratings yet
060 Security Management
259 pages
Slidesgo Fortifying The Future Strategies For Enhanced Security in A Digital World 20241106042556u6uw
No ratings yet
Slidesgo Fortifying The Future Strategies For Enhanced Security in A Digital World 20241106042556u6uw
18 pages
Ebook - CISSP - Domain - 03 - Security Architecture and Engineering
100% (1)
Ebook - CISSP - Domain - 03 - Security Architecture and Engineering
279 pages
Introduction To Enterprise Data Management
No ratings yet
Introduction To Enterprise Data Management
5 pages
CYB238 - Lecture 2
No ratings yet
CYB238 - Lecture 2
35 pages
Cyber Security and The Strategic Business Lead
No ratings yet
Cyber Security and The Strategic Business Lead
6 pages
Chapter 7 Data Security - DONE DONE DONE
No ratings yet
Chapter 7 Data Security - DONE DONE DONE
41 pages
اسئلة التنافسي قسم الاجهزة الطبية 2018 2019
No ratings yet
اسئلة التنافسي قسم الاجهزة الطبية 2018 2019
5 pages
다음 글의 내용과 일치하지 않는 것은? (수능특강 Light 1강 4번) 다음 글의 내용과 일치하는 것은? (수특 라이트 1 강 gateway)
No ratings yet
다음 글의 내용과 일치하지 않는 것은? (수능특강 Light 1강 4번) 다음 글의 내용과 일치하는 것은? (수특 라이트 1 강 gateway)
36 pages
Design & Modification On Automatic and Pneumatic Jack System
No ratings yet
Design & Modification On Automatic and Pneumatic Jack System
4 pages
All in One Science Class 10
No ratings yet
All in One Science Class 10
25 pages
Carrier Central User Manual Guide
No ratings yet
Carrier Central User Manual Guide
20 pages
The Effectiveness of Isometric Contractions Compared With Isotonic Contractions in Reducing Pain For In-Season Athletes With Patellar Tendinopathy
No ratings yet
The Effectiveness of Isometric Contractions Compared With Isotonic Contractions in Reducing Pain For In-Season Athletes With Patellar Tendinopathy
4 pages
State Bank of India
No ratings yet
State Bank of India
13 pages
Blockchain Tech Seminar Report
No ratings yet
Blockchain Tech Seminar Report
27 pages
Encyclopedia of Philosophy 2nd Ed Volume 10 Appendix Additional Articles Thematic Outline Bibliographies Index 2nd Ed Edition Donald M. Borchert Instant Download
100% (5)
Encyclopedia of Philosophy 2nd Ed Volume 10 Appendix Additional Articles Thematic Outline Bibliographies Index 2nd Ed Edition Donald M. Borchert Instant Download
76 pages
Science Lesson Plan 5
No ratings yet
Science Lesson Plan 5
2 pages
Tank Flush Simulation Tutorial
No ratings yet
Tank Flush Simulation Tutorial
23 pages
Microspectrofluorimetry of Fluorescent Dyes and Brighteners On Single Textile Fibres
No ratings yet
Microspectrofluorimetry of Fluorescent Dyes and Brighteners On Single Textile Fibres
18 pages
Class 12 Geography: Planning & Sustainable Development
No ratings yet
Class 12 Geography: Planning & Sustainable Development
40 pages
2024 Assessment Handbook
No ratings yet
2024 Assessment Handbook
20 pages
Arabic Greetings for Beginners
No ratings yet
Arabic Greetings for Beginners
4 pages
TSS HD Suspension
No ratings yet
TSS HD Suspension
2 pages
Oil & Gas Construction Services
No ratings yet
Oil & Gas Construction Services
22 pages
Aclara kV2c Data Sheet
No ratings yet
Aclara kV2c Data Sheet
2 pages
CLASS 8th Soc SC BRIDGE COURSE Bridge Course Primary 2024 25
No ratings yet
CLASS 8th Soc SC BRIDGE COURSE Bridge Course Primary 2024 25
42 pages
GU Student Manual 2 Schemas
No ratings yet
GU Student Manual 2 Schemas
11 pages
BA Assignment Front Page
No ratings yet
BA Assignment Front Page
6 pages
Excel Tutorial PDF
No ratings yet
Excel Tutorial PDF
13 pages
LG Oem Lgit Plde-P017a SCH
No ratings yet
LG Oem Lgit Plde-P017a SCH
2 pages
Error Peskin
No ratings yet
Error Peskin
4 pages
Packing Machine Operation Instruction
No ratings yet
Packing Machine Operation Instruction
18 pages
Vitocal Heat Pumps Brochure
No ratings yet
Vitocal Heat Pumps Brochure
52 pages
KFR 2
No ratings yet
KFR 2
126 pages
Wynns 2
No ratings yet
Wynns 2
8 pages
P35 Portable Dewpoint Meter Datasheet 1898 Iss7
No ratings yet
P35 Portable Dewpoint Meter Datasheet 1898 Iss7
3 pages
4pm1 02r Rms 20230302
No ratings yet
4pm1 02r Rms 20230302
29 pages