MCQ Big

MCQ:

1. ………………… data files are a compact, efficient binary format that provides interoperability with
applications written in other programming languages
a. Avro b. sequence files c. dictionaries d. JSON
2. …………………are a binary format that store individual records in custom record-specific data types
a. Avro b. sequence files c. dictionaries d. JSON
3. ………………… is an incredibly rich and flexible data representation format
a. XML b. sequence files c. dictionaries d. JSON
4. ……………….. is a columnar storage format available to any project in the Hadoop ecosystem, built from
the ground up with complex nested data structures in mind, and supports compression and encoding schemes.
a. Parquet b. sequence files c. dictionaries d. Dictionary
5. ………………….. is a plain-text object serialization format that can represent quite complex data in a way
that can be transferred between a user and a program, or from one program to another.
a. Parquet b. JSON c. dictionaries d. Dictionary
6. …………………. was specifically introduced to handle the rise in data types, data access, and data
availability needs brought on by the dot-com boom.
a. NoSQL b. JSON c. dictionaries d. Dictionary
7. ……………….: all reading and writing of data in one region is done by the assigned Region Server
a. Atomicity b. Durability c. Scalability d. Consistency
8. ................. mode: execution on a single machine, without requiring HDFS [Local]
9. …………………….mode: execution on an HDFS cluster, with the Pig script converted to a MapReduce job
[MapReduce].
10. …………………. A system for managing and querying structured data built on top of Hadoop [Hive].
11. ………….. is a component of Hive. It is a table and storage management layer for Hadoop that enables
users with different data processing tools – including Pig and MapReduce. [HCatalog]
12. …………….. provides a service that you can use to run Hadoop MapReduce (or YARN), Pig, and Hive jobs, or
perform Hive metadata operations, using an HTTP interface [WebHCat]
13. …………………database is a columnar storage database [HBase].
14. …………………….. is designed to work with Spark via SQL and HiveQL (a Hive variant of SQL). [Spark SQL]
15. …………… provides processing of live streams of data.[Spark Streaming]
16. …………………. is the machine learning library that provides multiple types of machine learning
algorithms. [MLlib]
17. …………………… is a graph processing library with APIs to manipulate graphs and perform graph-
parallel computations. [GraphX]
18. ………………… Fault-tolerant collection of elements that can be operated on in parallel[RDD]
19. map(func) Return a new dataset formed by passing each element of the source through a function func.
20. filter(func) Return a new dataset formed by selecting those elements of the source on which func returns
true.
21. flatMap(func) Similar to map, but each input item can be mapped to 0 or more output items. So func should
return a Seq rather than a single item.
22. The ……………. function combines two sets of key/value pairs and returns a set of keys, each mapped to a
pair of values taken from the two initial sets. [join]
23. The ……………….. function aggregates on each key by using the given reduce function. This is something
you would use in a WordCount to sum up the values for each word to count its occurrences.[ reduceByKey]
24. The …………….is the main entry point for Spark functionality; it represents the connection to a Spark
cluster [SparkContext]
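The RDD operations in items 18–24 can be imitated in plain Python. The sketch below is only an analogue with hypothetical sample data: real Spark applies the same flatMap/map/reduceByKey/join semantics in parallel across the partitions of an RDD, via a SparkContext, rather than locally in one process.

```python
from itertools import chain
from collections import defaultdict

# Hypothetical input lines (stand-in for an RDD of strings).
data = ["hello world", "hello spark"]

# flatMap: each input item may yield 0..n output items.
words = list(chain.from_iterable(line.split() for line in data))

# map: exactly one output per input -> (key, value) pairs.
pairs = [(w, 1) for w in words]

# reduceByKey: aggregate values per key with a reduce function,
# as in WordCount (item 23).
counts = defaultdict(int)
for word, n in pairs:
    counts[word] += n
# counts -> {'hello': 2, 'world': 1, 'spark': 1}

# join: combine two key/value datasets on matching keys (item 22).
ages = [("alice", 30), ("bob", 25)]
cities = [("alice", "Cairo"), ("bob", "Giza")]
joined = [(k, (v1, v2)) for k, v1 in ages for k2, v2 in cities if k == k2]
# joined -> [('alice', (30, 'Cairo')), ('bob', (25, 'Giza'))]
```

In real Spark these would be written against an RDD, e.g. `rdd.flatMap(...).map(...).reduceByKey(...)`, and evaluated lazily across the cluster.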
1. The MapReduce algorithm contains two important tasks, namely __________.
A. mapped, reduce
B. mapping, Reduction
C. Map, Reduction
D. Map, Reduce
2. _____ takes a set of data and converts it into another set of data, where individual elements are broken down
into tuples (key/value pairs).
A. Map
B. Reduce
C. Both A and B
D. Node
Explanation: Map takes a set of data and converts it into another set of data, where individual elements are
broken down into tuples (key/value pairs).
3. ______ task, which takes the output from a map as an input and combines those data tuples into a smaller set
of tuples.
A. Map
B. Reduce
C. Node
D. Both A and B
Explanation: Reduce task, which takes the output from a map as an input and combines those data tuples into a
smaller set of tuples.
4. In how many stages the MapReduce program executes?
A. 2
B. 3
C. 4
D. 5
Explanation: MapReduce program executes in three stages, namely map stage, shuffle stage, and reduce stage.
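The three stages named in the explanation above can be sketched as a toy WordCount (the input records here are hypothetical). It runs map, shuffle, and reduce locally in one process; a real MapReduce job distributes the same steps across the cluster.

```python
from collections import defaultdict

# Hypothetical input records (stand-in for HDFS input splits).
records = ["big data", "big clusters"]

# Map stage: break each record into (key, value) tuples.
mapped = [(word, 1) for line in records for word in line.split()]

# Shuffle stage: group all values by key.
groups = defaultdict(list)
for key, value in mapped:
    groups[key].append(value)

# Reduce stage: combine each key's values into a smaller set of tuples.
reduced = {key: sum(values) for key, values in groups.items()}
# reduced -> {'big': 2, 'data': 1, 'clusters': 1}
```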
5. Which of the following is used to schedule jobs and track the assigned jobs to the Task Tracker?
A. SlaveNode
B. MasterNode
C. JobTracker
D. Task Tracker
Explanation: JobTracker: schedules jobs and tracks the assigned jobs to the Task Tracker.
6. Which of the following is used for an execution of a Mapper or a Reducer on a slice of data?
A. Task
B. Job
C. Mapper
D. PayLoad
Explanation: Task : An execution of a Mapper or a Reducer on a slice of data.
8. Point out the correct statement.

A. MapReduce tries to place the data and the compute as close as possible
B. Map Task in MapReduce is performed using the Mapper() function
C. Reduce Task in MapReduce is performed using the Map() function
D. None of the above
9. Although the Hadoop framework is implemented in Java, MapReduce applications need not be written in
____________

A. C
B. C#
C. Java
D. None of the above
10. The number of maps is usually driven by the total size of ____________
A. Inputs
B. Output
C. Task
D. None of the above
1. Data in ___________ bytes size is called Big Data.

A. Tera
B. Giga
C. Peta
D. Meta
2. How many V's of Big Data are there?
A. 2
B. 3
C. 4
D. 5
3. Transaction data of a bank is?
A. structured data
B. unstructured data
C. Both A and B
D. None of the above

4. In how many forms BigData could be found?

A. 2
B. 3
C. 4
D. 5
Explanation: BigData could be found in three forms: Structured, Unstructured and Semi-structured.

10. What are the main components of Big Data?


A. MapReduce
B. HDFS
C. YARN
D. All of the above
Explanation: All of the above are the main components of Big Data.

1. A ________ serves as the master and there is only one NameNode per cluster.
a) Data Node
b) NameNode
c) Data block
d) Replication

Explanation: Metadata related to HDFS, such as data blocks and replication details, is stored and maintained on the NameNode.
2. Point out the correct statement.
a) DataNode is the slave/worker node and holds the user data in the form of Data Blocks
b) Each incoming file is broken into 32 MB by default
c) Data blocks are replicated across different nodes in the cluster to ensure a low degree of fault tolerance
d) None of the mentioned

3. HDFS works in a __________ fashion.


a) master-worker
b) master-slave
c) worker/slave
d) all of the mentioned

4. ________ NameNode is used when the Primary NameNode goes down.


a) Rack
b) Data
c) Secondary
d) None of the mentioned

6. Which of the following scenarios may not be a good fit for HDFS?
a) HDFS is not suitable for scenarios requiring multiple/simultaneous writes to the same file
b) HDFS is suitable for storing data related to applications requiring low latency data access
c) None of the mentioned

8. ________ is the slave/worker node and holds the user data in the form of Data Blocks.
a) DataNode
b) NameNode
c) Data block
d) Replication

10. HDFS is implemented in _____________ programming language.


a) C++
b) Java
c) Scala
d) None of the mentioned

11. For YARN, the ___________ Manager UI provides host and port information.
a) Data Node
b) NameNode
c) Resource
d) Replication

13. For ________ the HBase Master UI provides information about the HBase Master uptime.
a) HBase
b) Oozie
c) Kafka
d) All of the mentioned

14. During start up, the ___________ loads the file system state from the fsimage and the edits log file.
a) DataNode
b) NameNode
c) ActionNode
d) None of the mentioned

1. A ________ node acts as the Slave and is responsible for executing a Task assigned to it by the JobTracker.
a) MapReduce
b) Mapper
c) TaskTracker
d) JobTracker

3. ___________ part of the MapReduce is responsible for processing one or more chunks of data and producing
the output results.
a) Maptask
b) Mapper
c) Task execution
d) All of the mentioned

4. _________ function is responsible for consolidating the results produced by each of the Map()
functions/tasks.
a) Reduce
b) Map
c) Reducer
d) All of the mentioned

7. ________ is a utility which allows users to create and run jobs with any executables as the mapper and/or the
reducer.
a) Hadoop Strdata
b) Hadoop Streaming
c) Hadoop Stream
d) None of the mentioned
Answer: b

8. __________ maps input key/value pairs to a set of intermediate key/value pairs.


a) Mapper
b) Reducer
c) Both Mapper and Reducer
d) None of the mentioned

11. Running a ___________ program involves running mapping tasks on many or all of the nodes in our cluster.
a) MapReduce
b) Map
c) Reducer
d) All of the mentioned

1. ________ is the architectural center of Hadoop that allows multiple data processing engines.
a) YARN
b) Hive
c) Incubator
d) Chuckwa

3. YARN’s dynamic allocation of cluster resources improves utilization over more static _______ rules used in
early versions of Hadoop.
a) Hive
b) MapReduce
c) Impala
d) All of the mentioned

………………… has the responsibility of negotiating appropriate resource containers from the Scheduler,
tracking their status, and monitoring progress
a) NodeManager
b) ResourceManager
c) ApplicationMaster
d) All of the mentioned

4. The __________ is a framework-specific entity that negotiates resources from the ResourceManager.
a) NodeManager
b) ResourceManager
c) ApplicationMaster
d) All of the mentioned

6. Apache Hadoop YARN stands for _________


a) Yet Another Reserve Negotiator
b) Yet Another Resource Network
c) Yet Another Resource Negotiator
d) All of the mentioned

8. The ____________ is the ultimate authority that arbitrates resources among all the applications in the system.
a) NodeManager
b) ResourceManager
c) ApplicationMaster
d) All of the mentioned

9. The __________ is responsible for allocating resources to the various running applications subject to familiar
constraints of capacities, queues etc.
a) Manager
b) Master
c) Scheduler
d) None of the mentioned

2. YARN helps to manage the resources across the ________.
A. clusters
B. Negotiator
C. Jobs
D. Hadoop System
3. How many major components does YARN have?
A. 2
B. 3
C. 4
D. 5
4. Which of the following is the component of YARN?
A. Resource Manager
B. Node Manager
C. Application Manager
D. All of the above
Explanation: YARN consists of three major components, i.e. Resource Manager, Node Manager, and Application
Manager.
5. Which managers work on the allocation of resources?
A. Nodes Manager
B. Resource Manager
C. Application Manager
D. All of the above
Explanation: Node Managers work on the allocation of resources such as CPU, memory, and bandwidth per
machine, and later acknowledge the Resource Manager.
7. …………………. is responsible for accepting job submissions, negotiating the first container for executing
the application-specific ApplicationMaster, and providing the service for restarting the ApplicationMaster
container on failure.
A. NodeManager
B. ApplicationManager
C. ApplicationMaster
D. All of the above.
