UNIT 3
Distributed File System
Introduction
• A distributed file system (DFS) is a file system that is distributed on
various file servers and locations.
• It permits programs to access and store remote data in the same way as
local files.
• It also permits users to access files from any system. It allows network
users to share information and files in a controlled, authorized manner.
• The servers, however, retain complete control over the data and manage
access control for users.
• A DFS's primary goal is to enable users of physically distributed systems to
share resources and information through a common file system.
A DFS runs as a part of the operating system. Its typical configuration is a
set of workstations and mainframes connected by a LAN.
A DFS offers two components in its services, and these are as follows:
1. Location Transparency
2. Redundancy
1. Location Transparency
• It is achieved via the namespace component, i.e., users can access files
without knowing where they are physically stored on the network.
2. Redundancy
• It is achieved via the file replication component.
• In the case of failure or heavy load, these components work together to
increase data availability by allowing data from multiple places to be
logically combined under a single folder known as the "DFS root".
It is not required to use both DFS components simultaneously; the namespace
component can be used without the file replication component, and the file
replication component can be used between servers without the namespace
component.
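A minimal sketch can make the two components concrete. The Python snippet below is purely illustrative, and every name in it (the class, the folder names, the server mount paths) is an assumption made up for the example: a namespace table maps logical folders under a DFS root to one or more physical targets, and a replication step copies a file to every target so that any of them can later serve the data.

```python
import shutil
from pathlib import Path

# Hypothetical sketch: a DFS root whose namespace component maps logical
# folders to physical targets, and whose replication component copies
# files to every target of a folder.
class ToyDfsRoot:
    def __init__(self):
        # Namespace component: logical folder -> list of physical locations.
        self.namespace = {
            "reports": [Path("/mnt/server1/reports"), Path("/mnt/server2/reports")],
            "media":   [Path("/mnt/server3/media")],   # a folder may have a single target
        }

    def replicate(self, folder: str, local_file: Path) -> None:
        """Replication component: copy the file to every target of the folder."""
        for target in self.namespace[folder]:
            target.mkdir(parents=True, exist_ok=True)
            shutil.copy2(local_file, target / local_file.name)

# Usage (assumes the mount points above actually exist):
# root = ToyDfsRoot()
# root.replicate("reports", Path("q3_summary.pdf"))
```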
Features of DFS
1. Transparency
There are mainly four types of transparency. These are as follows:
• 1. Structure Transparency
The client does not need to be aware of the number or location of file servers
and storage devices. For structure transparency, multiple file servers should be
provided for adaptability, dependability, and performance.
• 2. Naming Transparency
There should be no hint of the file's location in the file's name. When a file is
transferred from one node to another, its name should not change.
• 3. Access Transparency
Local and remote files must be accessible in the same way. The file system must
automatically locate the accessed file and deliver it to the client.
• 4. Replication Transparency
When a file is replicated across multiple nodes, the copies and their locations
must be hidden from the users.
2. Scalability
A distributed system inevitably grows over time as more machines are added
to the network or two networks are linked together. A good DFS must be
designed to scale rapidly as the number of nodes and users in the system
increases.
3. Data Integrity
A file system is usually shared by many users, so it must secure the integrity
of data saved in a shared file. A concurrency control method must correctly
synchronize concurrent access requests from several users who are competing
for access to the same file (a minimal locking sketch appears after this list).
4. High Reliability
• The risk of data loss must be minimized as far as possible in an
effective DFS. Users must not feel compelled to make backups of their
files because of the system's unreliability. Instead, the file system should
back up key files so that they can be restored if the originals are lost.
5. High Availability
• A DFS should be able to keep functioning in the case of a partial failure,
such as a node failure, a storage device crash, or a link failure.
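The concurrency control mentioned under Data Integrity can be pictured with a minimal sketch. The snippet below serializes writers to one shared file with an advisory lock from Python's standard fcntl module (Unix-only); the file path is an assumption, and a real DFS would enforce this through its own locking or lease protocol rather than client-side flock calls.

```python
import fcntl

SHARED_FILE = "/mnt/dfs/reports/counter.txt"  # hypothetical path on a DFS mount

def append_line(line: str) -> None:
    """Serialize concurrent writers with an advisory lock so updates
    to the shared file do not interleave."""
    with open(SHARED_FILE, "a") as f:
        fcntl.flock(f, fcntl.LOCK_EX)      # block until we hold the exclusive lock
        try:
            f.write(line + "\n")
            f.flush()
        finally:
            fcntl.flock(f, fcntl.LOCK_UN)  # release so other clients can proceed
```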
DFS Architecture
• A DFS clusters together multiple storage
nodes and logically distributes data sets
across multiple nodes that each have their
own computing power and storage. The data
on a DFS can reside on various types of
storage devices, such as solid-state drives
and hard disk drives.
• Data sets are replicated onto multiple
servers, which enables redundancy to keep
data highly available.
• The DFS is located on a collection of servers,
mainframes or a cloud environment over a
local area network (LAN) so multiple users
can access and store unstructured data.
• If organizations need to scale up their
infrastructure, they can add more storage
nodes to the DFS.
• Clients access data on a DFS using namespaces.
Organizations can group shared folders into logical
namespaces.
• A namespace is the shared group of networked storage on a
DFS root. It presents files to users as one shared folder with
multiple subfolders. When a user requests a file, the DFS brings
up the first available copy of the file (a small sketch of this appears
after the namespace types below).
• There are two types of namespaces:
1. Standalone DFS namespaces.
2. Domain-based DFS namespaces.
1. Standalone DFS namespaces. A standalone or independent
DFS namespace has just one host server.
Standalone namespaces do not use Active Directory (AD). In a
standalone namespace, the configuration data for the DFS is
stored in the host server's registry. A standalone namespace is
often used in environments that only need one server.
2. Domain-based DFS namespaces. Domain-based DFS
namespaces have multiple host servers and store the DFS
configuration and topology data in AD. Domain-based namespaces
are commonly used in environments that require higher
availability.
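As noted above, when a user requests a file the DFS returns the first available copy. A minimal sketch of that referral walk, using only the Python standard library and hypothetical folder targets, might look like the following:

```python
from pathlib import Path

# Hypothetical referral list for one namespace folder: several servers
# hold replicas of the same shared folder.
REFERRALS = [
    Path("/mnt/server1/reports"),
    Path("/mnt/server2/reports"),
]

def read_first_available(filename: str) -> bytes:
    """Return the contents of the first reachable replica of the file."""
    for target in REFERRALS:
        candidate = target / filename
        try:
            return candidate.read_bytes()
        except OSError:          # server down, share unmounted, file missing...
            continue
    raise FileNotFoundError(f"no available copy of {filename}")
```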
Implementations of a DFS
• A DFS uses file sharing protocols, which enable users to access
file servers over the DFS as if they were local storage.
The protocols that a DFS uses include the following:
• Server Message Block (SMB). SMB is a file sharing protocol
designed to allow read and write operations on files over a LAN. It is
used primarily in Windows environments.
• Network File System (NFS). NFS is a client-server protocol for
distributed file sharing commonly used for network-attached storage
systems. It is also more commonly used with Linux and Unix
operating systems.
• Hadoop Distributed File System (HDFS). HDFS is the DFS
designed for Hadoop applications; it stores large data sets across
clusters of commodity servers.
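Whichever protocol is used, once the share is mounted the application typically does not see it at all; it performs ordinary file I/O against the mount point, which is the access transparency described earlier. A small illustrative sketch (the mount points below are assumptions, not real paths):

```python
# Assuming an SMB share mounted at /mnt/smb_share and an NFS export
# mounted at /mnt/nfs_share, the same call works for both and for a
# purely local file: the protocol sits below the file-system interface.
for path in ("/mnt/smb_share/notes.txt",
             "/mnt/nfs_share/notes.txt",
             "/tmp/notes.txt"):
    try:
        with open(path, "r", encoding="utf-8") as f:
            print(path, "->", len(f.read()), "characters")
    except OSError as err:
        print(path, "->", err)   # share not mounted or file absent
```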
Advantages and disadvantages of a DFS
• A DFS provides organizations with a scalable system to
manage unstructured data remotely. It can enable organizations
to use legacy storage to save costs of storage devices and
hardware. A DFS also improves availability of data through
replication.
• However, security measures need to be in place to protect the
storage nodes. In addition, there is a risk of data loss when
data is replicated across storage nodes. It can also be
complicated to reconfigure a DFS should an organization
replace the storage hardware on any of the DFS nodes.
THANK YOU