Big Data

The document discusses big data technologies and concepts. It contains questions for an exam on topics like distributed computing, MapReduce, Hadoop, HDFS, Google File System, NoSQL databases, and data analytics. The questions test understanding of architectures, algorithms, use cases and challenges related to big data systems.

Uploaded by

Ease Gunn

25 F TRIBHUVAN UNIVERSITY
INSTITUTE OF ENGINEERING
Examination Control Division
2075 Bhadra, IV/II

Subject: - Big Data Technologies (Elective III) (CT76507)
✓ Candidates are required to give their answers in their own words as far as practicable.
✓ Attempt All questions.
✓ The figures in the margin indicate Full Marks.
✓ Assume suitable data if necessary.

1. Why is distributed computing necessary for big data? [5]
2. Define DFS. How does a client write data in HDFS? Explain with the help of a suitable block diagram. [10]
3. The data in a big data warehouse is called hybrid data. Explain with suitable examples. [10]
4. How does GFS differ from other file systems? List five distinct differences. [5]
5. What is the main role of the GFS Master during read and write processes? How do data and control messages flow in the GFS architecture? Explain with a suitable flow diagram. [10]
6. MapReduce is the heart of the Hadoop ecosystem. Define the workflow of MapReduce with suitable examples. [10]
7. Clock synchronization in DFS may be a big challenge. How can this clock synchronization problem be solved? [10]
8. HBase, Cassandra and MongoDB are called column-oriented NoSQL databases. How does a row-oriented database differ from a column-oriented database? Explain with suitable examples. [10]
9. Write short notes on: [5x2]
a) Sqoop and Flume
b) Zookeeper
c) Oozie
d) Pig and Hive
e) Client-Server and Master-Slave architecture
***
35 F TRIBHUVAN UNIVERSITY
INSTITUTE OF ENGINEERING
Examination Control Division
2074 Bhadra

Subject: - Big Data Technologies
✓ Candidates are required to give their answers in their own words as far as practicable.
✓ Attempt All questions.
✓ The figures in the margin indicate Full Marks.
✓ Assume suitable data if necessary.

1. a) Explain, with an example, the distributed system in Big Data. [8]
b) What is the role of a Data Scientist? [4]
2. a) Explain the architecture of the Google File System (GFS). [8]
b) What are availability and fault tolerance in the Google File System? [5]
3. a) Explain in brief the data flow technique of the MapReduce framework. [8]
b) What are optimization and data locality in MapReduce? [4]
4. Differentiate between structured and unstructured data and discuss the taxonomy of NoSQL. [8]
5. Explain the components of indexing and searching. [8]
6. a) Explain in brief the five daemons of Hadoop. [8]
b) What is the role of the Hadoop Distributed File System in Hadoop? [4]
7. Write short notes on: [5x3]

i) Elastic Search
ii) HBase Architecture
iii) Functional Programming
***

35 F TRIBHUVAN UNIVERSITY
INSTITUTE OF ENGINEERING
Examination Control Division
2073 Magh, IV/II

Subject: - Big Data Technologies (Elective III) (CT76507)

✓ Candidates are required to give their answers in their own words as far as practicable.
✓ Attempt All questions.
✓ The figures in the margin indicate Full Marks.
✓ Assume suitable data if necessary.

1. Why do we need a data analytics process? Explain the role of distributed computing in Big Data. [5+5]
2. Why do we have large and fixed-size chunks in GFS? What can be the demerits of that design? [10]
3. How is the MapReduce library designed to tolerate failures of different machines (map/reduce nodes) while executing a MapReduce job? [10]
4. For the following data, list the input to/output from both the map and reduce functions for getting the maximum marks of each college. [10]

Student Name    College Name    Final Marks in %
Ram             ABC             70
Sita            ABC             80
Hari            ABC             60
Gita            XYZ             90
Rita            XYZ             80
Shyam           PQR             90
Laxmi           PQR             70
Gopal           PQR             60

OR
What is the combiner function in MapReduce? Explain its purpose with a suitable example. [10]
5. Explain the term NoSQL. Explain the CAP theorem with a suitable block diagram. [3+7]
6. Describe the typical components involved in a search application. [10]
7. What are the different daemons in a Hadoop cluster? Explain each in detail. [3+7]
8. Write short notes on any two of the following: [2x5]
a) Shadow Master and Chunk services
b) Analyzers available in Lucene
c) Vertical and Horizontal Scalability
***
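
As an illustration of question 4 in the 2073 Magh paper above, the map and reduce inputs and outputs can be sketched in plain Python. This is a minimal single-process sketch, not Hadoop code: the function names (map_fn, shuffle, reduce_fn, max_marks_per_college) are hypothetical, the shuffle step only imitates the grouping a MapReduce framework performs between the two phases, and Shyam's college is assumed to read PQR.

```python
from collections import defaultdict

# Records from the question's table: (student, college, final marks in %).
# Assumption: Shyam's college is read as PQR.
RECORDS = [
    ("Ram", "ABC", 70),
    ("Sita", "ABC", 80),
    ("Hari", "ABC", 60),
    ("Gita", "XYZ", 90),
    ("Rita", "XYZ", 80),
    ("Shyam", "PQR", 90),
    ("Laxmi", "PQR", 70),
    ("Gopal", "PQR", 60),
]


def map_fn(record):
    """Map input: one table row. Map output: a (college, marks) pair."""
    _student, college, marks = record
    return (college, marks)


def shuffle(pairs):
    """Group map outputs by key, imitating the framework's shuffle phase."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return dict(groups)


def reduce_fn(college, marks_list):
    """Reduce input: (college, [marks, ...]). Reduce output: (college, max marks)."""
    return (college, max(marks_list))


def max_marks_per_college(records):
    """Run map, shuffle and reduce in sequence over the records."""
    mapped = [map_fn(record) for record in records]
    grouped = shuffle(mapped)
    return dict(reduce_fn(college, marks) for college, marks in grouped.items())


if __name__ == "__main__":
    print(max_marks_per_college(RECORDS))
```

Because max is associative and commutative, the same reduce_fn could also serve as the combiner on each map node, which is the idea behind the alternative (OR) part of the question.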
35 F TRIBHUVAN UNIVERSITY
INSTITUTE OF ENGINEERING
Examination Control Division
2073 Bhadra

Subject: - Big Data Technologies (Elective III) (CT76507)

✓ Candidates are required to give their answers in their own words as far as practicable.
✓ Attempt All questions.
✓ The figures in the margin indicate Full Marks.
✓ Assume suitable data if necessary.

1. What are the current trends in big data analytics? What are the technical challenges and characteristics of big data? [10]
2. Explain the GFS architecture. Why is a single master not a bottleneck in a GFS cluster? [5+5]
3. How does MapReduce work? Explain each step with a suitable example. [5+5]
4. Discuss the architecture of HBase in short. Explain eventual consistency and tunable consistency in the context of Cassandra. [10]
5. Explain the Lucene architecture and its data indexing approach. [10]
6. What are the components of Hadoop? Explain each in brief. [10]
7. How do you find the maximum and minimum occurrence of the words in a given text document? Explain. [10]
8. Write short notes on: (any two) [2x5]
a) CAP theorem
b) Role of Data Scientist in Big Data
c) Amazon cloud
***
