Lesson 2 Quiz - Coursera

This document contains the results of a Spark lesson quiz. The quiz contained 12 multiple choice questions about Spark concepts like jobs, tasks, stages, executors, broadcast variables, and accumulator variables. The student scored 100% by correctly answering all 12 questions.

Uploaded by

Rupesh Kumar Sah

Lesson 2 Quiz

LATEST SUBMISSION GRADE

100%

1. What is a job? 1 / 1 point

An activity you get paid for.

A pipelineable part of the computation.

A unit of work performed by the executor.

That is how Spark calls my application.

An activity spawned in response to a Spark action.

A dependency graph for the RDDs.

Correct

Exactly!

2. What is a task? 1 / 1 point

A pipelineable part of the computation.

That is how Spark calls my application.

An activity spawned in response to a Spark action.

An activity you get paid for.

A unit of work performed by the executor.

A dependency graph for the RDDs.

Correct

Exactly!

3. What is a job stage? 1 / 1 point

A place where a job is performed.


A pipelineable part of the computation.

A subset of the dependency graph.

A particular shuffle operation within the job.

An activity spawned in response to a Spark action.

A single step of the job.

Correct

Correct.

4. How does your application find the executors to work with? 1 / 1 point

The SparkContext object queries a discovery service to find them.

You statically define them in the configuration file.

The SparkContext object allocates the executors by communicating with the cluster manager.

Correct

Exactly!

5. Mark all the statements that are true. 1 / 1 point

You can ask Spark to make several copies of your persistent dataset.

Correct

Yes, you can tune the replication factor.

Data can be cached both on the disk and in the memory.

Correct

Yes, you can tune persistence level to use both the disk & the memory.

Spark keeps all the intermediate data in the memory until the end of the computation; that is why it is 'lightning-fast computing'!

Spark can be hinted to keep particular datasets in the memory.


Correct

Yes!

It is advisable to cache every RDD in your computation for optimal performance.

Every partition is stored in Spark in 3 replicas to achieve fault-tolerance.

While executing a job, Spark loads data from HDFS only once.

6. Imagine that you need to deliver three floating-point parameters for a machine learning algorithm used in your tasks. What is the best way to do it? 1 / 1 point

Make a broadcast variable and put these parameters there.

Capture them into the closure to be sent during the task scheduling.

Hardcode them into the algorithm and redeploy the application.

Correct

Yes, that is correct. Three floating-point numbers add a negligible overhead.

7. Imagine that you need to somehow print corrupted records from the log file to the screen. How can you do that? 1 / 1 point

Use an accumulator variable to collect all the records and pass them back to the driver.

Use a broadcast variable to broadcast the corrupted records and listen for these events in the driver.

Use an action to collect filtered records in the driver.

Correct

There is no way to trick you!

8. How are broadcast variables distributed among the executors? 1 / 1 point

The executors distribute the content with a peer-to-peer, torrent-like protocol, and the driver seeds the content.

The driver sends the content one-by-one to every executor.

The executors are organized in a tree-like hierarchy, and the distribution follows the tree structure.

The driver sends the content in parallel to every executor.

Correct

Correct.

9. What will happen if you use a non-associative, non-commutative operator in the accumulator variables? 1 / 1 point

Operation semantics are ill-defined in this case.

The cluster will crash.

I have tried that -- everything works just fine.

Spark will not allow me to do that.

Correct

Yes. As the order of the updates is unknown in advance, we must be able to apply them in any order. Thus,
commutativity and associativity.
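The order-dependence can be demonstrated without a cluster. The sketch below (plain Python, with hypothetical update values) folds the same set of accumulator updates in every possible arrival order, once with an averaging operator and once with plain addition:

```python
from functools import reduce
from itertools import permutations

# Hypothetical accumulator updates produced by three tasks; Spark applies
# them in whatever order the tasks happen to finish.
updates = [1.0, 2.0, 4.0]

def avg(acc, x):   # neither associative nor commutative
    return (acc + x) / 2

def add(acc, x):   # associative and commutative
    return acc + x

# Fold the same updates in every possible arrival order.
avg_finals = {reduce(avg, order, 0.0) for order in permutations(updates)}
add_finals = {reduce(add, order, 0.0) for order in permutations(updates)}

print(len(avg_finals))  # several distinct final values: semantics ill-defined
print(len(add_finals))  # exactly one final value: order does not matter
```

With averaging, different arrival orders produce different final values, so the accumulator's result is not well-defined; with addition, every order yields the same total.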

10. Mark all the operators that are both associative and commutative. 1 / 1 point

first(x, y) = x

prod(x, y) = x * y

Correct

Correct.

avg(x, y) = (x + y) / 2

min(x, y) = if x > y then y else x end

Correct

Correct.

max(x, y) = if x > y then x else y end

Correct

Correct.

concat(x, y) = str(x) + str(y)

last(x, y) = y

sum(x, y) = x + y

Correct

Correct.
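These properties can also be checked mechanically. The sketch below (plain Python, with a small hand-picked sample set whose values are exact in binary floating point) brute-force tests each of the quiz's operators for commutativity and associativity:

```python
from itertools import product

# The quiz's candidate operators, written as plain Python functions.
ops = {
    "first":  lambda x, y: x,
    "prod":   lambda x, y: x * y,
    "avg":    lambda x, y: (x + y) / 2,
    "min":    lambda x, y: y if x > y else x,
    "max":    lambda x, y: x if x > y else y,
    "concat": lambda x, y: str(x) + str(y),
    "last":   lambda x, y: y,
    "sum":    lambda x, y: x + y,
}

samples = [0.0, 1.0, 2.0, 5.0]  # exact in binary floating point

def commutative(op):
    return all(op(x, y) == op(y, x) for x, y in product(samples, repeat=2))

def associative(op):
    return all(op(op(x, y), z) == op(x, op(y, z))
               for x, y, z in product(samples, repeat=3))

for name, op in ops.items():
    print(f"{name}: {commutative(op) and associative(op)}")
```

On these samples, only prod, min, max, and sum pass both checks, matching the marked answers; first, last, and concat are associative but not commutative, while avg is commutative but not associative.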

11. Does Spark guarantee that accumulator updates originating from actions are applied only once? 1 / 1 point

Yes.

No.

Correct

Correct.

12. Does Spark guarantee that accumulator updates originating from transformations are applied at least once? 1 / 1 point

No.

Yes.

Correct

Correct.
